Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

topic proposal: machine learning from first principles #24

Closed
hristog opened this issue Mar 11, 2024 · 2 comments
Closed

topic proposal: machine learning from first principles #24

hristog opened this issue Mar 11, 2024 · 2 comments
Labels
discuss An issue or PR currently being discussed in-content Is the issue in lesson content? propose-addition A suggestion for an addition to content or infrastructure
Milestone

Comments

@hristog
Copy link
Contributor

hristog commented Mar 11, 2024

This is a bit more general than #4.

The proposal involves introducing fundamental machine learning concepts such as data preprocessing, feature engineering, model training, validation and evaluation (essentially, steps which make up a machine learning pipeline).

I've been working on a library called RocLearn, with API that is close to Python's sklearn library. The idea is to implement fundamental machine learning algorithms (k-nearest neighbours, Principal Component Analysis, Support Vector Machines, Linear Regression etc) from first principles and at the same time abstract out the details of each one, in order to enable them to be viewed as modules that could be plugged in and out, as part of the more general steps, associated with a machine learning pipeline.

@gvwilson
Copy link
Collaborator

As with neural networks, blockchain, and the proposal to do turtle graphics, my concern is that these aren't tools that most programmers use while programming: they're very cool (I work for a company using ML for drug discovery), but the previous two books used things like package managers and linting as examples so that there was a discernible theme. Thoughts?

@gvwilson gvwilson added discuss An issue or PR currently being discussed in-content Is the issue in lesson content? propose-addition A suggestion for an addition to content or infrastructure labels Mar 12, 2024
@gvwilson gvwilson added this to the topic-outline milestone Mar 12, 2024
@hristog
Copy link
Contributor Author

hristog commented Mar 23, 2024

@gvwilson, understood, yes! I completely agree that these are topics that aren't sufficiently aligned with the intended overall theme.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discuss An issue or PR currently being discussed in-content Is the issue in lesson content? propose-addition A suggestion for an addition to content or infrastructure
Projects
None yet
Development

No branches or pull requests

2 participants