forked from cis-ds/course-site
-
Notifications
You must be signed in to change notification settings - Fork 0
/
cm017.Rmd
34 lines (24 loc) · 1.02 KB
/
cm017.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
title: "Distributed learning: relational data"
output: html_document
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(cache=TRUE)
```
# cm017 - November 21, 2016
## Overview
* Introduce relational data
* Demonstrate how tables are linked to one another
* Demonstrate methods in `dplyr` for linking and merging related tables
* Discuss the need for distributed computing
* Define SQL
* Demonstrate how to access local and remote SQL databases
## Slides and links
* [Slides](extras/cm017_slides.html)
* ["Relational data" in *R for Data Science*](http://r4ds.had.co.nz/relational-data.html)
* [Accessing databases using `dplyr`](https://cran.r-project.org/web/packages/dplyr/vignettes/databases.html)
* [Taxi dataset](https://cloud.google.com/bigquery/public-data/nyc-tlc-trips)
* [`bigrquery`](https://github.com/rstats-db/bigrquery) - instructions for setting up an account to access Google Bigquery databases
## To do for Wednesday
* [Finish homework 8](hw08-statistical_learning.html)
* Final projects - need I say any more?