This project was built with numerous tools and technologies, this is a summary document. Therefore, if you want to obtain more statistical and computational information see Google Colaboratory, to read about the conclusions found about the project, analyze the Report. You can also access application created in Web App and see more information of datasets at Kaggle.
The objective of this project is to segment customers using a clustering prediction algorithm, in addition we answer some businesses questions that are grounded on the jupyter notebook. Also, this is a complete project as we go through several steps of a usual data science project such as data collection, feature engineering, data cleansing, data transformation, data visualization, data analysis, modeling and cloud deployment.
The current project can be used to help companies and retailers find out which customers buy, what they buy, how they spend and the like. Through this project, companies can create advertisements or discounts for specific customers. Although this program is part of my personal portfolio, please feel free to use it for studies, repairs and improvements. 🤙
This project was developed to be part of my personal portfolio and served both to test my skills and for my learning, since countless technologies could be used in it. Despite being an end-to-end project, it still needs some future improvements, such as having a larger and more diverse dataset. 😃
- Enter customer data;
- know which client group belongs to;
- Menu;
- About the author;
- Rerun;
- Automatically update the app when the underlying code is updated;
- Enable wide mode so the app takes up the entire width of the screen;
- Choose by dark or light theme;
- Record a video or audio of the screen;
- Report a bug;
- Get Help;
- About.
The application is already running and it is not necessary to install anything on your machine, however, if you want to run the application locally, you must install the Python language on your machine. In addition, you must have the libraries listed below on your machine.
- Pandas 1.4.1;
- Imblearn 0.0;
- seaborn 0.11.2;
- pickle-mixin 1.0.2;
- numpy 1.22.2;
- Streamlit 1.6.0;
- scikit-learn 1.0.2;
- Matplotlib 3.2.2;
- Requests 2.27.1.
Hosted In:
- Streamlit
The installations of the libraries are already explained in the links above, but if you want to be in the same versions I do:
pip install scikit-learn==1.0.2
pip install streamlit==1.6.0
pip install numpy==1.22.2
pip install pickle-mixin==1.0.2
pip install pandas==1.4.1
pip install imblearn==0.0
pip install matplotlib==3.2.2
done, go to the Deploy folder and type:
streamlit run main.py
and see the application run on your machine. 😮
Criticism, doubts and suggestions feel free to send me:
e-mail: [email protected]
LinkedIn: https://www.linkedin.com/in/marcos-matheus-silva-089699b3/ 🤗
Marcos Matheus de Paiva Silva
The code written in Google Colaboratory was based on the steps of the book Aurelien Geron - hands on machine learning-2019. In addition, this code was developed based on everything I learned from: Jesse E.Agbe, Siddhardhan, Lucas Grassano Lattari, Shashank Kalanithi, Walisson Silva, Israel Dryer, Fernando Nakamuta, Alex Freberg.
This project is licensed under the MIT License - see the file LICENSE for more details.