*Class schedule is subject to revision throughout the semester.
W | Date | Due (before class @ 12:30pm): #To-do/Homework | Due: Project | Topics | Tools |
1 | 1/7 | [slides] Course introduction, setup | ||
1/9 | #1 | [slides] Data in linguistics | ||
2 | 1/14 | Homework 1 | [slides] Processing linguistic data | |
1/16 | #2 | Data processing fundamentals, statistics | [slides] Python's numpy library | |
3 | 1/21 | #3 | [slides] Data frames with pandas | |
1/23 | #4 | [slides] Text processing, stats intro | ||
4 | 1/28 | Stats crash course | ||
1/30 | #5 | Data visualization | ||
5 | 2/4 | Homework 2 | HW2 review | |
2/6 | Corpus linguistics, annotation | [slides] Corpus concepts, building & processing | ||
6 | 2/11 | #6 | [slides] Annotation, data standards & exchange formats | |
2/13 | #7 | Open access & data publishing | [slides] Guest speakers Lauren Collister and Dominic Bordelon | |
7 | 2/18 | #8 | Data mining and machine learning | [slides] Data-mining web & social media |
2/20 | #9 | Regression modeling | ||
8 | 2/25 | NB classifier, count vectors, TF-IDF | ||
2/27 | #10 | Classifiers continued, categorical data | ||
9 | 3/3 | #11 | Dimensionality reduction, cross-validation | |
3/5 | Homework 3 | Homework 3 review | ||
No class: Spring break | ||||
10 | 3/17 | 2nd progress report | Big data | Bash and command line | Command line, BASH, Unix tools |
3/19 | #12 | Command line, grep | ||
11 | 3/24 | #13 | Supercomputing at CRC, SSH, command line. Guest speaker Barry Moore III | |
3/26 | #14 | Computational efficiency, machine learning big data, word embeddings | ||
12 | 3/31 | Homework 4 | Homework 4 review | |
4/2 | #15 | Speech & multimedia | Speech data, ASR theory | Praat |
13 | 4/7 | 3rd progress report | More speech data, multimodal data | Elan |
4/9 | #16 | Project presentations | ||
14 | 4/14 | |||
4/16 | ||||
15 | 4/24 | Final project report | No class: finals week |