Quick hack to load files directly to Google Cloud Storage (GCS), without Airflow. It downloads CSV files from https://nyc-tlc.s3.amazonaws.com/trip+data/ and uploads them to your Cloud Storage bucket as Parquet files.

  1. Install the prerequisites (more info in the web_to_gcs.py script)
  2. Run: python web_to_gcs.py
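
For orientation, here is a minimal sketch of what a script like this might do. It is not the contents of web_to_gcs.py. The function names, the `{service}_tripdata_{year}-{month}` file naming pattern, and the object layout in the bucket are all assumptions made for illustration. It also assumes that pandas, pyarrow and google-cloud-storage are installed and that Google credentials are available to the client (for example through GOOGLE_APPLICATION_CREDENTIALS).

```python
import os

# Base URL taken from the description above.
BASE_URL = "https://nyc-tlc.s3.amazonaws.com/trip+data/"


def csv_file_name(service: str, year: int, month: int) -> str:
    """Build a CSV file name (assumed naming pattern, not confirmed by the README)."""
    return f"{service}_tripdata_{year}-{month:02d}.csv"


def parquet_object_name(service: str, csv_name: str) -> str:
    """Map a CSV file name to a hypothetical object path in the bucket."""
    stem = os.path.splitext(csv_name)[0]
    return f"{service}/{stem}.parquet"


def web_to_gcs(service: str, year: int, bucket_name: str) -> None:
    """Download 12 monthly CSV files, convert each to Parquet, upload to GCS."""
    # Third-party imports are local so the helpers above work without them.
    import pandas as pd
    from google.cloud import storage

    bucket = storage.Client().bucket(bucket_name)
    for month in range(1, 13):
        csv_name = csv_file_name(service, year, month)
        local_parquet = os.path.splitext(csv_name)[0] + ".parquet"

        # pandas can read a CSV straight from a URL.
        df = pd.read_csv(BASE_URL + csv_name)
        df.to_parquet(local_parquet, engine="pyarrow")

        # Upload the local Parquet file to the bucket.
        blob = bucket.blob(parquet_object_name(service, csv_name))
        blob.upload_from_filename(local_parquet)
```

Under these assumptions, calling `web_to_gcs("yellow", 2019, "your-bucket-name")` would process one year of files for one service. Reading a whole month into memory at once keeps the sketch short, but large files may need chunked reads.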