Distance Data Codebase

Welcome to the Distance Data codebase! This codebase provides functionalities for analyzing and visualizing distance data between zip codes, as well as performing data preprocessing and manipulation tasks. It is designed to work with CSV files containing distance data and a zip code database.

Functionalities

The main functionalities of the Distance Data codebase include:

Reading and writing distance data to/from CSV files.
Plotting histograms of distance data.
Filtering distance data based on desired state.
Computing patient/dispensary ratios and distance*patient values.
Adding latitude and longitude information to zip codes in dispensary and patient data.

Installation

To use the Distance Data codebase, follow these steps:

Clone the repository:

git clone https://github.com/DoctorGoose/PAMJ.git

Navigate to the codebase directory:
```
cd PAMJ
```

Install the required dependencies:

pip install pandas numpy matplotlib geopandas zipfile

Usage

Reading and Writing Distance Data

To read distance data from a CSV file, use the following code:

import pandas as pd

df = pd.read_csv('Distance Data.csv')

To write distance data to a CSV file, use the following code:

df.to_csv('Distance Data.csv', index=False)

Plotting Histograms

To plot a histogram of the "Nearest Disp Distance" column in the distance data, use the following code:

import matplotlib.pyplot as plt

plt.hist(df['Nearest Disp Distance'])
plt.show()

Filtering Distance Data

To filter the distance data to include only zip codes from a desired state (e.g., Pennsylvania), use the following code:

desired_state = 'PA'
filtered_df = df[df['state'] == desired_state]

Computing Ratios and Values

To compute the patient/dispensary ratio and distance*patient values, use the following code:

df['Patient/Dispensary Ratio'] = df.apply(lambda row: row['Patient Count'] if row['Dispensary Count'] == 0 else row['Patient Count']/row['Dispensary Count'], axis=1)
df['Distance*Patient'] = df['Nearest Disp Distance'] * df['Patient Count']

Adding Latitude and Longitude Information

To add latitude and longitude information to zip codes in the dispensary and patient data, use the following code:

df_dispo = pd.read_csv('DispoZipLatLong.csv')
df_pat = pd.read_csv('PatientZipLatLong.csv')

zipcodes_dispo = df_dispo['Zipcode'].unique()
zipcodes_pat = df_pat['Zipcode'].unique()

zipcode_counts_dispo = df_dispo['Zipcode'].value_counts()
zipcode_counts_pat = df_pat['Zipcode'].value_counts()

zipcode_counts_combined = pd.concat([zipcode_counts_dispo, zipcode_counts_pat], axis=1)
zipcode_counts_combined.columns = ['Dispensary Count', 'Patient Count']

Contributors

The Distance Data codebase is maintained by DoctorGoose.

Contributing

Contributions to the Distance Data codebase are welcome! If you encounter any issues or have suggestions for improvements, please open an issue on GitHub.

License

The Distance Data codebase is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data		data
figs		figs
.gitattributes		.gitattributes
.gitignore		.gitignore
Analysis.ipynb		Analysis.ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distance Data Codebase

Functionalities

Installation

Usage

Reading and Writing Distance Data

Plotting Histograms

Filtering Distance Data

Computing Ratios and Values

Adding Latitude and Longitude Information

Contributors

Contributing

License

About

Releases

Packages

Languages

License

DoctorGoose/PAMJ

Folders and files

Latest commit

History

Repository files navigation

Distance Data Codebase

Functionalities

Installation

Usage

Reading and Writing Distance Data

Plotting Histograms

Filtering Distance Data

Computing Ratios and Values

Adding Latitude and Longitude Information

Contributors

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages