Skip to content

jefersonmsantos/deltatable-dataproc-bq

Repository files navigation

Using Google Dataproc to create Delta Tables

This repo presents code in Pyspark to read datasets from Google Bigquery, transform and generate reports with Pyspark on Google Dataproc and then save the final reports as Delta Tables.

The code named "update_bq_save_deltatable.py" also perform a merge operation to add new data to a existing report.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published