.ipynb_checkpoints
md-versions
metastore_db
ML-Lecture2.ipynb
ML-Lecture3.1.ipynb
ML-Lecture3.2-public.ipynb
ML-Lecture4.1.ipynb
ML-Lecture4.2.ipynb
ML-Lecture5.1.ipynb
ML-Lecture5.2.ipynb
ML-Lecture6.1.ipynb
ML-Lecture7.1.ipynb
ML-Lecture7.2.ipynb
ML-Lecture7.3.ipynb
ML-PracticeFinal.ipynb
MakingYourOwnTree.ipynb
PandasSklearn.ipynb
PracticeMLFinalP1.ipynb
PracticeMLFinalP2.ipynb
RandomForestWalkthrough.ipynb
derby.log
readme.md

readme.md

Machine Learning

Unofficial Lecture Note Website

Welcome to the MSAN 2017 Machine Learning 1 unofficial lecture notes website. I've collected all my in-class notes from our 8-week course in the fall. Notes for each 2-hour lecture are provided below.

Lecture 1

Introductions and class basics

Lecture 2 Notes

Python basics
Git, Symlink, AWS
Python notebook basics
Crash course on pandas
FastAI introduction
add_datepart
train_cats
Feather Format
Run your first Random Forest

Lecture 3 Notes

R^2 accuracy
How to make validation sets
Test vs. Validation Set
Diving into RandomForests
Examination of One tree
What is 'bagging'
What is the OOB (out-of-bag) score
RF Hyperparameter 1: number of trees
RF Hyperparameter 2: min samples per leaf
RF Hyperparameter 3: max features
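
The R^2 score and the bagging / out-of-bag idea from this lecture can be sketched in a few lines of plain Python. This is a minimal illustration, not the code from the notebooks: the helper names `r2_score` and `bootstrap_split` are hypothetical, and the sketch only shows how rows are divided for a single tree.

```python
import random

def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

def bootstrap_split(n_rows, rng):
    """Draw n_rows indices with replacement (the 'bag' for one tree).

    Rows that were never drawn are that tree's out-of-bag rows and can
    serve as a free validation set for it.
    """
    in_bag = [rng.randrange(n_rows) for _ in range(n_rows)]
    out_of_bag = sorted(set(range(n_rows)) - set(in_bag))
    return in_bag, out_of_bag

rng = random.Random(0)  # fixed seed only to make the sketch repeatable
in_bag, out_of_bag = bootstrap_split(10, rng)
print("in bag:", in_bag)
print("out of bag:", out_of_bag)
```

An OOB score for the whole forest would average, for each row, the predictions of only those trees whose bag did not contain that row, and then score the result with something like `r2_score`.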

Lecture 4 Notes

Forecasting: Grocery Kaggle discussion, parallel to Rossmann stores
Random Forests: Confidence based on tree variance
Random Forests: Feature Importance Intro
Random Forests: Decoupled Shuffling
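
Shuffle-based feature importance can be sketched as follows, reading "decoupled shuffling" as shuffling one column so that it is decoupled from the target while every other column stays intact. The names `mse`, `permutation_importance`, and the toy `predict` function are assumptions for illustration; any fitted model's prediction function could stand in for `predict`.

```python
import random

def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def permutation_importance(predict, X, y, col, rng):
    """Increase in MSE after shuffling column `col` across the rows."""
    baseline = mse(y, [predict(row) for row in X])
    shuffled_vals = [row[col] for row in X]
    rng.shuffle(shuffled_vals)
    X_shuffled = [row[:col] + [v] + row[col + 1:]
                  for row, v in zip(X, shuffled_vals)]
    return mse(y, [predict(row) for row in X_shuffled]) - baseline

# Toy setup: the target depends only on column 0; column 1 is noise.
X = [[i, (i * 7) % 5] for i in range(20)]
y = [3 * row[0] for row in X]
predict = lambda row: 3 * row[0]  # hypothetical stand-in for a trained model

rng = random.Random(0)
imp_col0 = permutation_importance(predict, X, y, 0, rng)
imp_col1 = permutation_importance(predict, X, y, 1, rng)
print(imp_col0, imp_col1)
```

A column the model relies on shows a large increase in error when shuffled, while a column the model ignores shows none.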

Lecture 5 Notes

Summary of Random Forests
- Data needs to be numeric
- Categories go to numbers
Subsampling in different trees
- Tree size
- Records per node
- Information Gain (improvement)
- Repeat process for different subsets
- Each tree should be better
- Trees should not be correlated
Min Leaf Samples
Max Features
n_jobs
oob
Interpreting OOB vs. training vs. test score
Feature Importance Deep dive
One hot encoding
Redundant features
Partial Dependence
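
Partial dependence, the last topic above, can be sketched like this: hold every other column at its observed values, overwrite one column with each value on a grid, and average the model's predictions. The function name `partial_dependence` and the toy `predict` model are hypothetical and used only to make the sketch runnable.

```python
def partial_dependence(predict, X, col, grid):
    """Average prediction when column `col` is forced to each grid value."""
    curve = []
    for value in grid:
        preds = [predict(row[:col] + [value] + row[col + 1:]) for row in X]
        curve.append(sum(preds) / len(preds))
    return curve

# Hypothetical stand-in for a trained model: y = 2 * x0 + x1.
predict = lambda row: 2 * row[0] + row[1]
X = [[0, 1], [1, 3], [2, 5]]

curve = partial_dependence(predict, X, col=0, grid=[0, 10])
print(curve)
```

Plotting `curve` against the grid gives the partial dependence plot for that column.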

Lecture 6 Notes

What makes a good validation set?
What makes a good test set?
Random Forest from scratch : setup framework

Lecture 7 Notes

Motivations for data science
Thinking about the business implications
Tell the story
Review of Confidence in Tree Prediction Variance, Feature importance, Partial Dependence

Lecture 8 Notes

Building a Decision Tree from scratch
Optimizing and comparing to scikit-learn
How to do 2 levels of decision trees
Fleshing out the RF predict function
Assembling our own decision tree
Cython
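
The core step of building a decision tree from scratch is finding the best split for one feature. The sketch below assumes a regression tree that scores a split by the summed squared error of the two sides; the lecture's own scoring rule may differ, and the names `sse` and `best_split` are hypothetical.

```python
def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def best_split(xs, ys):
    """Return (threshold, score) minimizing SSE(left) + SSE(right).

    Rows with x <= threshold go left, the rest go right. The largest x
    is skipped as a threshold so the right side is never empty.
    """
    best_threshold, best_score = None, float("inf")
    for threshold in sorted(set(xs))[:-1]:
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        score = sse(left) + sse(right)
        if score < best_score:
            best_threshold, best_score = threshold, score
    return best_threshold, best_score

xs = [1, 2, 3, 10, 11, 12]
ys = [5, 5, 5, 20, 20, 20]
threshold, score = best_split(xs, ys)
print(threshold, score)
```

This brute-force version rescans the data for every candidate threshold; sorting once and updating running sums is one way to speed it up before reaching for Cython.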

Lecture 9 Notes

Deep Learning
Using PyTorch and a 1-layer NN
Walkthrough of the MNIST digit dataset
Binary loss function
Making a logistic-regression-equivalent NN in PyTorch

Lecture 10 Notes

Rewriting the 1-layer NN from scratch
Rewrite LinearLayer
Rewrite Softmax
Understanding numpy and torch matrix operations
Understanding Broadcasting rules
Rewriting matrix mult from scratch
Start looking at the fit function
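
Two of the rewrites listed above, matrix multiplication and softmax, can be sketched in plain Python without numpy or torch. This is an illustrative version under that assumption, not the notebook's code; the function names are chosen for the example.

```python
import math

def matmul(a, b):
    """Multiply an n x k matrix by a k x m matrix given as lists of lists."""
    n, k, m = len(a), len(b), len(b[0])
    assert all(len(row) == k for row in a), "inner dimensions must match"
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(m)]
            for i in range(n)]

def softmax(row):
    """Exponentiate and normalize so the outputs sum to 1.

    Subtracting the max first does not change the result but avoids
    overflow for large inputs.
    """
    shift = max(row)
    exps = [math.exp(v - shift) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

product = matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]])
probs = softmax([1.0, 2.0, 3.0])
print(product)
print(probs)
```

In numpy or torch the triple loop collapses into a single vectorized expression, which is where the broadcasting rules come in.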

Lecture 11

Rewriting fit from scratch
Digression on momentum
Rewriting gradient and step within fit function
NLP
Bag of words / CountVectorizer
LogisticRegression w. Sentiment
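
The bag-of-words representation can be sketched with the standard library alone. The functions `fit_vocabulary` and `transform` are hypothetical names that loosely mirror the fit / transform split of a count vectorizer, and tokenization here is a plain lowercase whitespace split, which is simpler than what a real vectorizer does.

```python
from collections import Counter

def fit_vocabulary(docs):
    """Map each distinct token to a column index (alphabetical order)."""
    tokens = sorted({tok for doc in docs for tok in doc.lower().split()})
    return {tok: i for i, tok in enumerate(tokens)}

def transform(docs, vocab):
    """Turn each document into a row of token counts; unknown tokens are dropped."""
    rows = []
    for doc in docs:
        counts = Counter(doc.lower().split())
        row = [0] * len(vocab)
        for tok, count in counts.items():
            if tok in vocab:
                row[vocab[tok]] = count
        rows.append(row)
    return rows

docs = ["good movie good plot", "bad movie"]
vocab = fit_vocabulary(docs)
matrix = transform(docs, vocab)
print(vocab)
print(matrix)
```

Word order is discarded entirely; each document becomes a count vector that a logistic regression can take as input for sentiment classification.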

Lecture 12

NLP : trigrams
Naive Bayes Classifier
Binarized version of NB
NBSVM - combination of probs
Storage efficiency of one-hot encoding
Rossmann store examination
Introduction to embeddings
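
One common formulation of the Naive Bayes feature used in this kind of text classifier is the smoothed log-count ratio: how much more often a token appears in positive documents than in negative ones. The sketch below assumes add-one smoothing and binary labels (1 = positive, 0 = negative); `log_count_ratio` is a hypothetical helper, not the notebook's function.

```python
import math

def log_count_ratio(X, y):
    """Per-token log ratio of smoothed, normalized counts: positive vs. negative.

    X is a list of count rows (one per document), y the matching 0/1 labels.
    """
    n_tokens = len(X[0])
    p = [1.0] * n_tokens  # smoothed counts in positive documents
    q = [1.0] * n_tokens  # smoothed counts in negative documents
    for row, label in zip(X, y):
        target = p if label == 1 else q
        for j, count in enumerate(row):
            target[j] += count
    p_total, q_total = sum(p), sum(q)
    return [math.log((p[j] / p_total) / (q[j] / q_total))
            for j in range(n_tokens)]

# Token 0 appears only in the positive document, token 1 only in the negative one.
X = [[2, 0], [0, 2]]
y = [1, 0]
r = log_count_ratio(X, y)
print(r)
```

A positive entry in `r` marks a token associated with the positive class and a negative entry the opposite. The binarized variant replaces each count with a 0/1 presence flag before computing the ratio.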
