Capstone Team Optimization

Optimization method to select Capstone teams from the preference survey.

Georgetown students fill out a project interest survey at the start of Foundations, which we then use to try to optimize how project teams are put together. This serves as an interesting icebreaker to get people talking about potential projects, and also as a way to show optimization techniques in real life. Though this method is obviously more for demonstration purposes, I think it highlights a few key techniques.

The optimization works as follows:

1. Assign students to random teams.
2. Compute the cost of those team assignments across the entire cohort (cost function to follow).
3. Select a random number of swaps between 10 and 100.
4. For each swap, switch two members between teams; if the resulting cost is lower, keep the swap, otherwise revert to the original assignment.
5. Repeat steps 2-4 until the minimum error or the maximum number of searches is reached.

This is (or is intended to be) a random hill-climbing search. There are a number of ways to improve it, of course, but it's for demonstration only.
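As a rough illustration of the steps listed above, here is a minimal sketch of the search loop. It is not the notebook's code: the function names (`random_teams`, `hill_climb`), the `max_searches` and `min_cost` parameters, and the toy `os_cost` function are all assumptions made for this example.

```python
import random


def random_teams(students, team_size, rng):
    """Shuffle the students and cut the list into teams (the last may be smaller)."""
    pool = list(students)
    rng.shuffle(pool)
    return [pool[i:i + team_size] for i in range(0, len(pool), team_size)]


def hill_climb(students, team_size, cost, max_searches=200, min_cost=0, seed=None):
    """Random hill climbing over team assignments (needs at least two teams)."""
    rng = random.Random(seed)
    teams = random_teams(students, team_size, rng)    # step 1: random teams
    best = cost(teams)                                # step 2: cohort-wide cost
    for _ in range(max_searches):                     # step 5: repeat
        if best <= min_cost:
            break
        for _ in range(rng.randint(10, 100)):         # step 3: 10 to 100 swaps
            a, b = rng.sample(range(len(teams)), 2)   # two distinct teams
            i = rng.randrange(len(teams[a]))
            j = rng.randrange(len(teams[b]))
            # step 4: swap one member from each team
            teams[a][i], teams[b][j] = teams[b][j], teams[a][i]
            new_cost = cost(teams)
            if new_cost < best:
                best = new_cost                       # keep the improvement
            else:
                # revert the swap if it did not lower the cost
                teams[a][i], teams[b][j] = teams[b][j], teams[a][i]
    return teams, best


# Toy cost (an assumption for this demo): unique OS per team minus 1.
def os_cost(teams):
    return sum(len({os_name for _, os_name in team}) - 1 for team in teams)


students = [("s%d" % n, os_name)
            for n, os_name in enumerate(["mac", "win", "linux", "mac"] * 3)]
teams, best = hill_climb(students, 4, os_cost, seed=1)
print(best, teams)
```

Because a swap exchanges one member from each of two teams, team sizes never change after the initial assignment in this sketch; a variant that also moves single members would be needed for the size term of the cost to matter.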

The cost function is as follows:

Start with cost = 0 (perfect teams have no cost)
Add the squared difference between each team's size and the optimal team size
Add the number of unique operating systems per team minus 1 (e.g. everyone on the same OS is zero cost)
Add the cost of missing roles (e.g. no programmer on the team)
Add domain alignment cost (similar domains selected is better)
Add dataset alignment cost (similar datasets selected is better)

Note that this cost function can change and be extended, so make sure you review the notebook for the latest version of the costs.
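To make the terms concrete, here is one way the cost described above could be written. This is a sketch under assumptions, not the notebook's implementation: the member fields (`os`, `role`, `domain`, `dataset`), the `REQUIRED_ROLES` set, the equal weighting of terms, and the `misalignment` measure (members who differ from the team's most common choice) are all invented for illustration.

```python
from collections import Counter

# Hypothetical role list; the survey's actual roles may differ.
REQUIRED_ROLES = {"programmer", "analyst", "domain expert"}


def misalignment(choices):
    """Count members whose choice differs from the team's most common choice."""
    if not choices:
        return 0
    return len(choices) - Counter(choices).most_common(1)[0][1]


def team_cost(team, team_size=4):
    """Cost of one team; each member is a dict with os, role, domain, dataset."""
    cost = 0                                                   # perfect teams cost nothing
    cost += (len(team) - team_size) ** 2                       # squared size difference
    cost += max(len({m["os"] for m in team}) - 1, 0)           # unique OS minus 1
    cost += len(REQUIRED_ROLES - {m["role"] for m in team})    # missing roles
    cost += misalignment([m["domain"] for m in team])          # domain alignment
    cost += misalignment([m["dataset"] for m in team])         # dataset alignment
    return cost


def cohort_cost(teams, team_size=4):
    """Total cost across every team in the cohort."""
    return sum(team_cost(team, team_size) for team in teams)
```

Each term here contributes with weight 1, which is a simplification; in practice the terms would likely need to be scaled so that no single one dominates the search.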

Getting Started

The first step is to download a CSV copy of the survey responses and save them to:

fixtures/cohort#-preferences.csv

where # is the cohort number. Then run the Jupyter notebook:

$ jupyter notebook optimize.ipynb

This should open a browser window with the selection tools.

In the first cell modify the following constants as needed:

COHORT = # 
TEAM_SIZE = 4

Set COHORT to the cohort number used in the preferences file name. At this point you should be able to hit "Run All" and have the assigned teams print out at the bottom of the notebook.
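For reference, the relationship between the COHORT constant and the fixtures file can be sketched as follows. The helper names `preferences_path` and `load_preferences` are hypothetical, and the sketch assumes the cohort number appears in the file name without zero padding; the notebook may load the file differently.

```python
import csv
from pathlib import Path


def preferences_path(cohort, fixtures_dir="fixtures"):
    """Build the expected CSV path, e.g. fixtures/cohort7-preferences.csv."""
    return Path(fixtures_dir) / f"cohort{cohort}-preferences.csv"


def load_preferences(path):
    """Read the survey responses into a list of dicts keyed by column header."""
    with open(path, newline="", encoding="utf-8") as handle:
        return [dict(row) for row in csv.DictReader(handle)]
```

With these helpers, `load_preferences(preferences_path(COHORT))` would return one dict per survey response, keyed by whatever column headers the exported CSV contains.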

NOTE: Occasionally I have to munge the CSV file, but it's usually minor and I can never remember what I have to do.
