Skip to content
View keitazoumana's full-sized avatar
🎯
Learning
🎯
Learning

Block or report keitazoumana

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
keitazoumana/README.md

Hi there πŸ‘‹

My name is

Zoumana Keita

  • ⚑ Previously I worked as Machine Learning Engineer at Lincoln for a couple of weeks before moving to the US for my Master in Business & Data Science at Texas Tech University, Rawls College of Business. Before that, I was Data Scientist for 2 years at Axionable, first Sustainable AI startup in France and Canada. Also I spent 2 years and 6 months at IBM as Machine Learning Consultant.
  • ❀️ I love Data Science, Natural Language Processing, Cloud Computing & MLOps
  • 🩺 What keeps me in shape
    • When I was in France, I had Taekwondo classes πŸ₯‹ on Tuesday, Thursday, Friday & Saturday at Mudo Club Argenteuil
    • Daily morning runner πŸƒπŸΎ
    • Occasional football player ⚽️ with friends
    • AttiΓ©kΓ©, Yassa, MafΓ©, Thieb, etc. πŸ˜‹
  • 🌱 I’m addicted to continuous learning, which makes me grow on a regular basis
  • 🌏 I'm sharing my knowledge through my blog in order to make good impact on others life
  • πŸ“« How to find me

πŸ† My Github Stats:

Zoumana's GitHub stats GitHub Views

πŸ… My Most Used Languages:

Zoumana's Top Languages


Data Science, Machine Learning & MLOPs Resources.

This is the collection of all the resources I have created, organized by topics.

Subscribe to:

Content

  1. Data Science
  2. Machine Learning
  3. MLOps
  4. Natural Language Processing
  5. Large Language Models
  6. Retrieval Augmented Generation
  7. Python
  8. Pandas & Python Tricks
  9. Computer Vision

Data Science

Title Article Link Video
A simple way to understand Association Rule from the Customer Basket Analysis Use Case πŸ”—
Different Metrics to Evaluate Binary Classification Models and Some Strategies to Choose the Right One πŸ”—
Introduction to Mito: Spreadsheet for Data Scientists That Also Generates Python Codes πŸ”—
When R Meets SQL to Query Dataframes πŸ”—
5 Essential Tools to Start a Career in Data Science and Data Analytics πŸ”—
4 Types of SQL JOIN Every Data Scientist Should Know: Visual Representation πŸ”—
Data Preprocessing Using Pipeline in Pandas πŸ”— πŸ”—
The guide to choosing the right database for my project: MongoDB vs. MySQL πŸ”—
How to Run SQL Queries On Your Pandas DataFrames With Python πŸ”— πŸ”—
Algorithmic Bias in Healthcare and Some Strategies for Mitigating It πŸ”—
Which One of These 2 Open-Source Libraries Is Better for Processing Gigabytes of Data? πŸ”— πŸ”—
ChatGPT for Data Scientists, Data Analysts, and Programmers πŸ”— πŸ”—
Tableau Data Blending Tutorialβ€Šβ€”β€ŠA Step-By-Step Guide For Beginners πŸ”—
Fundamentals of Statistics All Data Scientists & Analysts Should Knowβ€Šβ€”β€ŠWith Codeβ€Šβ€”β€ŠPart 1 πŸ”— πŸ”—
Everything You Need to Know About Heatmap β€” Tutorial With PowerBI πŸ”—
Top Techniques to Handle Missing Values Every Data Scientist Should Know πŸ”—
An Introduction to Hierarchical Clustering in Python πŸ”—
Multiple Linear Regression in R: Tutorial With Examples πŸ”—
NoSQL Databases: What Every Data Scientist Needs to Know πŸ”—

Machine Learning

Title Article Link Video
Transfer Learning: Understand the Big Picture & Make the Right Choices for Your Use Case πŸ”—
Overview Of 4 Model Validation Approaches to Mitigate Overfitting Problem πŸ”—
eXplainable AI (XAI): LIME & SHAP, Two Great Candidates to Help You Explain Your Machine Learning Models πŸ”—
Using Gradio To Create Apps For Your Machine Learning Models πŸ”— πŸ”—
How to Perform KMeans Clustering Using Python πŸ”— πŸ”—
Classification in Machine Learning: An Introduction πŸ”—

MLOps

Title Article Link Video
Create An Awesome Streamlit App & Deploy it With Docker πŸ”—
Machine Learning models monitoring made easy with Mlfow, a concrete use case with Python API πŸ”—
When Your Machine Learning model teams up with Django REST API, A successful deployment into production πŸ”—
NLP MLops Project With DagsHub β€” Multi-Language Sentiment Classification Using Transformers β€” Part 1 πŸ”—
NLP MLops Project With DagsHub β€” Deploy Your Streamlit App On AWS EC2 Instance β€” Part 2 πŸ”—
Step-by-step Approach to Build Your Machine Learning API Using Fast API πŸ”—
Data And Model Versioning With DVC And Azure Blob Storage πŸ”—
GitHub Actions for Machine Learning: Train, Test and Deploy Your ML Model on AWS EC2. πŸ”—
CI/CD for Machine Learning Model Training with GitHub Actions πŸ”—
Speed Up Your Model Training with DagsHub Direct Data Access on AWS πŸ”—
Git Reset and Revert Tutorial for Beginners πŸ”—

Natural Language Processing

Title Article Link Video
Do You Want To Cluster Unlabeled Text Data? Try Out Topic Modeling πŸ”—
Financial Text Classification With Deep Learning Using FinBERT πŸ”—
Named Entity Recognition with Spacy and the Mighty roBERTa πŸ”— πŸ”—
Scientific Documents Similarity Search With Deep Learning Using Transformers (SciBERT) πŸ”—
Meet BERTopicβ€” BERT’s Cousin For Advanced Topic Modeling πŸ”— πŸ”—
Unsupervised Multilingual Text Classification With Zero-Shot Approach πŸ”—
Semantic Keywords And Keyphrases Extraction With KeyBERT πŸ”—
4 NLP Libraries for Automatic Language Identification of Text Data In Python πŸ”—
Data Augmentation in NLP Using Back Translation With MarianMT πŸ”— πŸ”—
Social Media Sentiment Analysis In Python With VADER β€” No Training Required! πŸ”— πŸ”—
Stemming, Lemmatizationβ€” Which One is Worth Going For? πŸ”—
VADER Vs. TextBlob β€” Which One Is Better For Social Media Sentiment Analysis? πŸ”—
Most Common Text Processing Tasks In Natural Language Processing πŸ”— πŸ”—
How to Perform Speech-to-Text and Translate Any Speech to English With OpenAI’s Whisper πŸ”— πŸ”—
Plagiarism Detection Using Transformers πŸ”— πŸ”—
Text-to-Image and Image-to-image search Using CLIP πŸ”—
A Step-by-step Guide to Solving 4 Real-life Problems With Transformers and Hugging Face πŸ”— πŸ”—
Text data representation with one-hot encoding, Tf-Idf, Count Vectors, Co-occurrence Vectors and Word2Vec πŸ”—
Fine-Tuning GPT-3 Using the OpenAI API and Python πŸ”—

Large Language Models

Title Article Link Video
Multimodal Retrieval Augmented Generation Applied To Real World Case β€” With Code πŸ”— πŸ”—
A Framework For Efficiently Serving Your Large Language Models πŸ”— πŸ”—
How To Scrape a Web Page With ChatGPT β€” No Coding Required! πŸ”— πŸ”—
How to Chat With Any PDFs and Image Files Using Large Language Models β€” With Code πŸ”— πŸ”—
Multimodal Retrieval Augmented Generation Applied To Real World Case β€” With Code πŸ”— πŸ”—
Document Parsing Using Large Language Models β€” With Code πŸ”— πŸ”—
How to Build Anything With AI Agents - With Code πŸ”—

RAG

Title Article Link Video
How I Built A Video Recommendation System Using Large Language Models and Vector Database πŸ”—
How to Build RAG based Chatbot: 5 Steps with Amazon Bedrock πŸ”—

Python

Title Article Link Video
5 Python open-source tools to extract text and tabular data from PDF Files πŸ”—
When Should You Consider Using Datatable Instead of Pandas to Process Large Data? πŸ”—
Convert Any Type of Document to Text With Apache Tika Using Python API πŸ”—
Collect Data From Reddit and Twitterβ€” 600+ Million Monthly Active Users Platforms πŸ”—
Knockknock β€” Probably The Best Python Library For Notifications πŸ”—
Extract Text Written in Different Languages from Images with Python πŸ”—
Introduction to Twint: Say Goodbye to Twitter Rate Limitations β€” Also No Need for A Twitter API! πŸ”—
Avoid Using β€œpip freeze” β€” Use β€œpipreqs” instead πŸ”—
Extract Tweets Without Limitations in a Few Lines of Code Using Python πŸ”— πŸ”—
Collect Data from Twitter: A Step-by-Step Implementation Using Tweepy πŸ”—
How to Create a Virtual Environment and Use it on Jupyter Notebook πŸ”— πŸ”—

Pandas & Python Tricks

Title Article Link Video
Pandas and Python Tips and Tricks for Data Science and Data Analysis πŸ”— πŸ”—
Pandas & Python Tricks for Data Science & Data Analysis β€” Part 2 πŸ”— πŸ”—

Computer Vision

Title Article Link Video
Five Simple Image Data Augmentation Techniques to Mitigate Overfitting In Computer Vision πŸ”—
YOLO Object Detection Explained πŸ”—
How to Measure Model Performance in Computer Vision: A Comprehensive Guide πŸ”—

Popular repositories Loading

  1. Medium-Articles-Notebooks Medium-Articles-Notebooks Public

    Jupyter Notebook 124 80

  2. LLMs LLMs Public

    Repository for my LLM notebooks

    Jupyter Notebook 21 8

  3. Fastapi-tutorial Fastapi-tutorial Public

    Python 20 9

  4. keitazoumana keitazoumana Public

    17 6

  5. multimodal-rag-esg multimodal-rag-esg Public

    The application of multimodal RAG for Sustainable finance

    Jupyter Notebook 16 7

  6. streamlit-spam-detector streamlit-spam-detector Public

    Jupyter Notebook 9 7