The 100 Days of Code is a well-known coding challenge that anyone, regardless of their background, can undertake to significantly enhance their coding skills. Rules are straightforward: maintain a continuous streak of 100 days where you learn a new skill and put it into practice. On top of that, I have made the decision that by the end of the 100 Days of Machine Learning challenge, I will create ONE project based on solving a real world problem.
Topics I have covered :
- Descriptive Statistics
- Probability Distribution Functions
- Gaussian Distribution
- Non Gaussian Distributions
- What is ML?
- Types of ML
- Machine Learning Development Life Cycle
- Tensors
- Working with CSV files and JSON files
- Fetching data from API
- Fetchind data from a DB (MySQL)
- Web Scraping using BeautifulSoup
- Questions to ask
- Univariate, Bivariate , Multivariate Analysis
- Pandas Profiling
- Introduction
- Scaling - Standarsization and Normalisation
- Encoding Categorical Data - Ordinal Encoding and One Hot Encoding
- Column Transformations (Power,Function,Log,Reciprocal,Square Root , Box-Cox and YeoJohnson)
- Binning and Binarization
- Discretization
- Quantile Binning
- KMeans Binning
- Principal Component Analysis
- Imputation Techniques
- Simple Linear Regression
- Regression Metrics
- Multiple Linear Regression
- Gradient Descent - Batch , Stochastic , Mini Batch
- Polynomial Regression
- Bias Variance Tradeoff
- Regularisation
- Perceptron Trick
- Logistic Regression
- Softmax Regression
- Decision Trees and Regression Trees
- Support Vector Machines
- Voting Ensemble
- Bagging Ensemble and Random Forest Ensemble
- AdaBoosting Ensemble
- Gradient Boosting Ensemble
- Stacking Ensemble
- K means Clustering
- Agglomerative Clustering (Hierarchical)
- DBSCAN Clustering
- Hyperparameter Optimisation Techniques:
- Fundamentals of Tensorflow
- Types of Neural Networks
- MLP and MLP notation
- Loss Functions in model.compile()
- Forward Propagation
- Backpropagation Algorithm (and how to improve it using meoisation)
- Vanishing Gradeint Descent Problem
- Fine tuning hyper parameters for improving neural networks
- Regularisation Techniques : Early Stopping , Dropout Layers
- Activation Functions : Tanh(x) function , sigmoid function
- Rectified Linear Unit Function and its variants
- Weight Initialisation Techniques : with zero , with random , Xavier Glorat , He method
- Batch Normalisation
- Optimisers : SDG with momentum , Nesterov , Ada Grad , RMSprop , Adam
- Keras Tuner for Hyper parameter tuning
- Convolutional Neural Network
- CNN architecture , pooling functions
- Strides and Padding
- Recurrent Neural Network
- Text Preprocessing
- End to End Pipeline Understandin(Theory)
- Text Preprocessing (Removing , Tokenisation , Stemming and Lemitisation)
- Parts of Speech Tagging using Spacy
- Text Representation and Word Embeddings (Word2Vec)
- Text Classification
- Streamlit for Web applications
- Restful APIs
- Flask
- Django
Numpy,Pandas,Matplotlib,Seaborn BeautifulSoup Tensorflow PandasProfile Streamlit NLTK Sci-Kit Learn KerasTuner makepipeline spacy
During my #100DaysofMachineLearning journey, I'll be using various resources to learn and explore different topics. Here are some of the key resources that I'll be referring to:
-
Books:
- Storytelling with Data - Willey
- The Hundred-Page Machine Learning Book - Andriy Burkov
- Deep Learning - MIT , Ian Goodfellow and Yoshua Bengio and Aaron Courville
- Deep Learning Cookbook - Douwe Osinga
- Real world machine learning - Henrik Brink
-
YouTube Resources
- Krish Naik
- CodeBasics
- Danieal Bourke
- Gourav Gupta
- CampusX
- Serrano Academy
- Jovian
- Harvard CS50 course on FreeCodeCamp
- Made with ML
-
Websites and Blogs:
- Towards Data Science
- Kaggle Datasets and Notebooks
- DataVoyage
- weight initialisation techniques
- computational graphs
-
Documentation:
- Scikit-learn Documentation
- TensorFlow Documentation
- Keras Documentation
- Streamlit Documentation
- Book Recommender System
- Spam Classifier System
- Plagiarism Detector
- Skincare Recommender App (in progress)
I'm excited to embark on this #100DaysofMachineLearning challenge and explore the fascinating world of machine learning. I'll be sharing my progress, code, and insights in this repository, so feel free to follow along and provide feedback.
Let's dive into the world of machine learning together!