Skip to content

Latest commit

 

History

History
21 lines (14 loc) · 592 Bytes

README.md

File metadata and controls

21 lines (14 loc) · 592 Bytes

spark-python-talk

Content for a talk on Spark using Python (UDF, Pandas_UDF, Koalas)

Running the Notebooks

With Docker installed, run the following command to run the Jupyter notebooks:

docker run --rm -p 8888:8888 -v "$PWD":/home/jovyan/work jupyter/pyspark-notebook

This command will display the Jupyter notebook URL.

Generating slides

Run this command to generate the html slides for the presentation from a Jupyter terminal. This will output an html file that can be opened on the host machine:

jupyter nbconvert work/pyspark_talk.ipynb --to slides