Submission:
- Please submit your project via GitHub and send a private message on Slack to both Dan and Ivan with a link to it.
In this project, you will perform a logistic regression on the admissions data we've been working with in projects 1 and 2.
Objective: Completed Jupyter notebook that includes basic modeling using logistic regression.
- Requirements:
- Create one-hot encoding using binary variables.
- Calculate odds ratios by hand.
- Complete a logistic regression using
statsmodels
and interpret your findings. - Calculate predicted probabilities.
- Do a similar analysis using
sklearn
.
The dataset is available here.
Review the questions in the starter code notebook provided.
- Review logistic regression, odds ratios and probabilities from prior lessons.
- Read the documenation for
statsmodels
andsklearn
. Most of the time, there is a tutorial that you can follow; learning to read documentation is crucial to your success as a data scientist.
The rubric is available here.