AutoML pipeline for imabalanced classification

A project for the AutoML course WS22/23 by Stefan Solarski at LMU.

Setup

Clone the repository and use pip, or another package manager, to install the requirements.

git clone https://github.com/SSolarski/automl_imbalanced.git
cd automl_imbalanced
pip install -r requirements.txt

Necessary packages are given in requirements.txt, we used Python v3.8.16.

We used the following packages:

pandas -> manipulating and displaying the datasets and results
jinja2 -> improves the style of pandas dataframes
numpy -> numerical calculations
openml -> importing datasets from openml
scikit-learn -> basic classifiers, pipelines and preprocessing
imbalanced-learn -> classifiers, pipelines and preprocessing for imbalanced datasets
xgboost -> XGBoost classifier
scikit-optimize -> hyperparameter tuning using Bayesian optimization
matplotlib -> visualizing performance of the automl system
jupyter -> running Jupyter notebooks

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
backup		backup
saved		saved
ImbalancedAutoML.py		ImbalancedAutoML.py
README.md		README.md
Stefan Solarski Report AutoML.pdf		Stefan Solarski Report AutoML.pdf
benchmark.py		benchmark.py
configuration.py		configuration.py
data.py		data.py
models_cofig.py		models_cofig.py
notebook.ipynb		notebook.ipynb
plotting.py		plotting.py
requirements.txt		requirements.txt