From 10611eab156d903d68195a5d0baf77ef5ec60eb9 Mon Sep 17 00:00:00 2001 From: kevin1kevin1k Date: Fri, 18 Oct 2024 01:55:48 +0800 Subject: [PATCH] Add ML packages and misc tools --- README.md | 22 +++++++++++++++++++++- 1 file changed, 21 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 011de50..87b47e3 100644 --- a/README.md +++ b/README.md @@ -358,6 +358,10 @@ These are some Machine Learning and Data Mining algorithms and models help you t * [Deepchecks](https://github.com/deepchecks/deepchecks) * [scikit-survival](https://scikit-survival.readthedocs.io/en/stable) * [interpretable](https://pypi.org/project/interpretable) +* [XGBoost](https://github.com/dmlc/xgboost) +* [LightGBM](https://github.com/microsoft/LightGBM) +* [CatBoost](https://github.com/catboost/catboost) +* [JAX](https://github.com/google/jax) ### Deep Learning Packages @@ -455,6 +459,7 @@ These are some Machine Learning and Data Mining algorithms and models help you t - [C3](https://c3js.org/) - [TensorWatch](https://github.com/microsoft/tensorwatch) - [geomap](https://pypi.org/project/geomap/) +- [Dash](https://plotly.com/dash/) ### Miscellaneous Tools **[`^ back to top ^`](#awesome-data-science)** @@ -561,6 +566,21 @@ These are some Machine Learning and Data Mining algorithms and models help you t | [Opik](https://github.com/comet-ml/opik) | Evaluate, test, and ship LLM applications across your dev and production lifecycles. | | [Synthical](https://synthical.com) | AI-powered collaborative environment for research. Find relevant papers, create collections to manage bibliography, and summarize content — all in one place | | [teeplot](https://github.com/mmore500/teeplot) | Workflow tool to automatically organize data visualization output | +| [Streamlit](https://github.com/streamlit/streamlit) | App framework for Machine Learning and Data Science projects | +| [Gradio](https://github.com/gradio-app/gradio) | Create customizable UI components around machine learning models | +| [Weights & Biases](https://github.com/wandb/wandb) | Experiment tracking, dataset versioning, and model management | +| [DVC](https://github.com/iterative/dvc) | Open-source version control system for machine learning projects | +| [Optuna](https://github.com/optuna/optuna) | Automatic hyperparameter optimization software framework | +| [Ray Tune](https://github.com/ray-project/ray) | Scalable hyperparameter tuning library | +| [Apache Airflow](https://github.com/apache/airflow) | Platform to programmatically author, schedule, and monitor workflows | +| [Prefect](https://github.com/PrefectHQ/prefect) | Workflow management system for modern data stacks | +| [Kedro](https://github.com/kedro-org/kedro) | Open-source Python framework for creating reproducible, maintainable data science code | +| [SHAP](https://github.com/slundberg/shap) | Game theoretic approach to explain the output of any machine learning model | +| [LIME](https://github.com/marcotcr/lime) | Explaining the predictions of any machine learning classifier | +| [flyte](https://github.com/flyteorg/flyte) | Workflow automation platform for machine learning | +| [dbt](https://github.com/dbt-labs/dbt-core) | Data build tool | +| [SHAP](https://github.com/slundberg/shap) | Game theoretic approach to explain the output of any machine learning model | +| [LIME](https://github.com/marcotcr/lime) | Explaining the predictions of any machine learning classifier | ## Literature and Media @@ -801,7 +821,7 @@ This section includes some additional reading material, channels to watch, and t - [Datacast](https://jameskle.com/writes/category/Datacast) - [DataFramed](https://www.datacamp.com/community/podcast) - [DataTalks.Club](https://anchor.fm/datatalksclub) -- [Gradient Dissent](https://wandb.ai/fully-connected/gradient-dissent) +- [Gradient Descent](https://wandb.ai/fully-connected/gradient-descent) - [Learning Machines 101](https://www.learningmachines101.com/) - [Let's Data (Brazil)](https://www.youtube.com/playlist?list=PLn_z5E4dh_Lj5eogejMxfOiNX3nOhmhmM) - [Linear Digressions](https://lineardigressions.com/)