This repository contains notes for the Summer edition of INFO-UB23: Introduction to Programming and Data Science, taught at NYU Stern. It is a fork of the course materials repository designed by Panos Ipeirotis, who deserves full credit for the content included here. The materials are focused on teaching programming for data science to non-CS majors. The emphasis is on offering live examples that students can use directly to complete their goals.
We setup and deploy our data science environment (effectively, Jupyter with Python and R support, plus MySQL) using docker. As our default option, we allow students to connect to a JupyterHub server that runs on Kubernetes. We also give the option to students to run the same environment locally on their laptops, or deploy the Docker image on AWS or Google Cloud.