Using Google Dataproc to create Delta Tables

This repo presents PySpark code that reads datasets from Google BigQuery, transforms them and generates reports on Google Dataproc, and then saves the final reports as Delta tables.
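As a rough illustration of that read, transform, and save flow, here is a minimal sketch. It is not the repo's actual script: the function names, the `category` and `amount` columns, and the aggregation itself are hypothetical, and it assumes the spark-bigquery connector and Delta Lake are available on the Dataproc cluster.

```python
def bigquery_table_id(project, dataset, table):
    """Build the fully qualified table name passed to the BigQuery connector."""
    return f"{project}.{dataset}.{table}"


def build_report(spark, source_table, delta_path):
    """Read a BigQuery table, aggregate it, and save the result as a Delta table.

    `spark` is an existing SparkSession. The column names below are
    hypothetical placeholders for whatever the real dataset contains.
    """
    # Read the source table through the spark-bigquery connector.
    df = spark.read.format("bigquery").option("table", source_table).load()

    # Example transformation (an assumption): total amount per category.
    report = (
        df.groupBy("category")
        .sum("amount")
        .withColumnRenamed("sum(amount)", "total_amount")
    )

    # Save the report in Delta format, replacing any previous version.
    report.write.format("delta").mode("overwrite").save(delta_path)
    return report
```

On a cluster this could be called as `build_report(spark, bigquery_table_id("my-project", "my_dataset", "sales"), "gs://my-bucket/reports/sales")`, where the project, dataset, table, and bucket names are placeholders.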

The script "update_bq_save_deltatable.py" also performs a merge operation to add new data to an existing report.
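A merge of this kind can be sketched with the Delta Lake Python API as follows. This is an assumption about how such an upsert might look, not a copy of "update_bq_save_deltatable.py": the helper names and key columns are hypothetical, and whether the real script updates matched rows or only inserts new ones is not stated here.

```python
def merge_condition(key_columns, target_alias="target", source_alias="source"):
    """Build the join condition that pairs existing rows with incoming rows."""
    if not key_columns:
        raise ValueError("at least one key column is required")
    return " AND ".join(
        f"{target_alias}.{col} = {source_alias}.{col}" for col in key_columns
    )


def merge_new_rows(spark, new_rows, delta_path, key_columns):
    """Upsert `new_rows` (a DataFrame) into the Delta table stored at `delta_path`."""
    # Imported here so the helper above works without the delta-spark package.
    from delta.tables import DeltaTable

    target = DeltaTable.forPath(spark, delta_path)
    (
        target.alias("target")
        .merge(new_rows.alias("source"), merge_condition(key_columns))
        .whenMatchedUpdateAll()      # refresh rows that already exist in the report
        .whenNotMatchedInsertAll()   # append rows that are new
        .execute()
    )
```

For a report keyed by, say, `report_date` and `category`, the call would be `merge_new_rows(spark, new_rows, delta_path, ["report_date", "category"])`. Dropping `whenMatchedUpdateAll()` would turn the upsert into an insert-only merge.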