Information on the datasets used in this analysis can be found at the following link: EDA for the Subway delay dataset
- Link to Google Drive folder: subway_eda
Folder structure: 2 Colab (IPYNB) notebooks
- One for data_prep
- Second for EDA
- Other supporting data in CSV and JSON files, used in the code for cleaning and analysis.
How to run the code:
- Make a shortcut to the shared folder in your Google Drive.
- Mount the drive and check that the folder you made a shortcut to in the previous step is visible.
- Run the data_prep notebook first. It cleans the data and saves a CSV on which the EDA is performed. (This CSV is already present in the folder for reference.)
- Run the EDA notebook to see all the data analysis done on the cleaned data.
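The mounting step can be sketched roughly as follows. This is a minimal sketch, not the code from the notebooks: the mount point `/content/drive`, the `MyDrive` location of the shortcut, and the helper names `mount_drive` and `find_shared_folder` are assumptions for illustration. Only the folder name `subway_eda` comes from this README.

```python
import os

FOLDER_NAME = "subway_eda"       # shared folder name from this README
MOUNT_POINT = "/content/drive"   # assumed Colab mount point


def mount_drive(mount_point=MOUNT_POINT):
    """Mount Google Drive when running inside Colab; return False elsewhere."""
    try:
        from google.colab import drive  # only importable inside Colab
    except ImportError:
        return False
    drive.mount(mount_point)
    return True


def find_shared_folder(mount_point=MOUNT_POINT, folder_name=FOLDER_NAME):
    """Return the path of the shortcut folder under MyDrive, or None if absent.

    Assumes the shortcut was created at the top level of My Drive.
    """
    candidate = os.path.join(mount_point, "MyDrive", folder_name)
    return candidate if os.path.isdir(candidate) else None


if __name__ == "__main__":
    if mount_drive():
        folder = find_shared_folder()
        if folder is None:
            print("Shortcut not found; check that it was created in My Drive.")
        else:
            print("Found shared folder at", folder)
    else:
        print("Not running in Colab; skipping the Drive mount.")
```

If the shortcut was placed somewhere other than the top level of My Drive, the path built in `find_shared_folder` would need to be adjusted accordingly.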