kennethleungty/README.md

👋 Hello, I'm Kenneth Leung

  • Thanks for popping by! As an avid learner, bold builder, curious explorer, and driven doer with a bias towards action, I enjoy seeking and solving meaningful problems with data and technology while having fun at the same time.
  • I welcome you to join me on a journey of data science discovery! Follow me on GitHub, Medium, and LinkedIn to stay updated with more engaging and practical content.
  • You can find my data science portfolio here, where every project and article was born out of inspiration, curiosity, and motivation. Feel free to connect for a chat (coffee or virtual) to discuss shared interests and topics!

How to reach me

    Buy Me A Coffee 

Portfolio Contents

  1. Computer Vision
  2. Database Management
  3. Data Extraction and Web Scraping
  4. Data Science Certification Guides
  5. Data Science Toolkit
  6. Data Science in the Real World
  7. Generative AI
  8. Insights from Data Science Talks
  9. Machine Learning
  10. MLOps
  11. Natural Language Processing
  12. Networks and Graphs
  13. Responsible AI
  14. Sports Analytics
  15. Visualization
  16. Web Development
  17. Web3 and Metaverse
  18. Writing for DataCamp
  19. Writing Tips

Projects with ⭐ are my personal favourites, so do check them out!


Computer Vision 👁️

Title Article Repo
Classifying Images of Alcoholic Beverages with fast.ai v2 🔗 🔗
Russian Car Plate Detection with OpenCV and TesseractOCR 🔗 🔗
Evaluate OCR Output Quality with Character Error Rate (CER) and Word Error Rate (WER) 🔗 🔗
Top Python libraries for Image Augmentation in Computer Vision 🔗 🔗
⭐ PyTorch Ignite Tutorial - Classifying Tiny ImageNet with EfficientNet 🔗 🔗
Practical Guide to Transfer Learning in TensorFlow for Multiclass Image Classification 🔗 🔗

Database Management 🗄️

Title Article Repo
⭐ Definitive Guide to Creating a SQL Database on Cloud with AWS and Python 🔗 🔗
PyMySQL - Connecting Python and SQL for Data Science 🔗 🔗

Data Extraction and Web Scraping 🧰

Title Article Repo
Using OneMap API to extract Singapore postal codes, coordinates and travel distance - 🔗
A Detailed Web Scraping Walkthrough Using Python and Selenium 🔗 🔗
⭐ How to Web Scrape Wikipedia using LangChain Agents and Tools with OpenAI's LLMs and Function Calling 🔗 🔗

Data Science Certification Guides 👨‍🎓

Title Article Repo
3 Steps to Get AWS Cloud Practitioner Certified in 2 Weeks 🔗 🔗
3 Steps to Get Tableau Desktop Certified in 2 Weeks 🔗 -
⭐ No-Frills Guide to Passing the AWS Certified Machine Learning Specialty Exam 🔗 -

Data Science Toolkit 🛠️

Title Article Repo
Common Python codes for Data Wrangling - 🔗
Enhance your Python code’s readability with pycodestyle 🔗 -
Free Resources for Generating Realistic Fake Data 🔗 -
Most Starred and Forked GitHub Repos for Data Science and Python 🔗 -
Most Starred and Forked GitHub Repos for Data Science and R 🔗 -
Automatically Generate Machine Learning Code with Just a Few Clicks 🔗 -
Read and Modify Image Metadata with Python 🔗 🔗
Top Tips to Google Search Like a Seasoned Data Scientist 🔗 -
How to Swap Day and Month of Incorrectly Formatted Excel Dates 🔗 -

Data Science in the Real World 🌏

Title Article Repo
Exploring Illegal Drugs in Singapore — A Data Perspective 🔗 🔗
Pharmacokinetic Modeling of Drug Concentration Trajectories using Ordinary Differential Equations (ODE) and Global Optimization with Differential Evolution - 🔗
Healthcare’s AI Future — In Conversation with Andrew Ng and Fei-Fei Li 🔗 -
Real-World Data Science Use Cases in the Insurance Industry 🔗 -
⭐ Failed-ML: Compilation of high-profile real-world examples of failed machine learning projects 🔗 🔗

Generative AI 🤖

Title Article Repo
Generative AI Pharmacist - Macy 🔗 🔗
⭐ ChatPod - Q&A over your Podcasts with Whisper, FAISS, and LangChain 🔗 🔗
⭐ Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A 🔗 🔗
Domain LLMs - Compilation of Customized LLMs for Specific Domains and Industries - 🔗
⭐ Text-to-Audio Generation with Bark, Clearly Explained 🔗 🔗
Guide to ChatGPT's Advanced Settings — Top P, Frequency Penalties, Temperature, and More 🔗 -
Inside the Leaked System Prompts of GPT-4, Gemini 1.5, Claude 3, and More 🔗 -
⭐ Exposing Jailbreak Vulnerabilities in LLM Applications with ARTKIT 🔗 🔗

Insights from Data Science Talks 👨‍🏫

Title Article Repo
Bridging AI’s Proof-of-Concept to Production Gap — Insights from Andrew Ng 🔗 -

Machine Learning 🎰

Title Article Repo
Exploring Condominium Rental Prices with Web Scraping and Exploratory Data Analysis 🔗 🔗
Using Ensemble Regressors to Predict Condominium Rental Prices 🔗 🔗
The Dying ReLU Problem, Clearly Explained 🔗 -
Why Bootstrapping Actually Works 🔗 -
⭐ Assumptions of Logistic Regression, Clearly Explained 🔗 🔗
Data-Centric AI Competition - Tips and Tricks of a Top 5% Finish 🔗 🔗
Credit Card Fraud Detection with AutoXGB 🔗 🔗
⭐ Micro, Macro & Weighted Averages of F1 Score, Clearly Explained 🔗 -
Principal Component Regression - Clearly Explained and Implemented 🔗 🔗
⭐ Feature Selection with Simulated Annealing in Python, Clearly Explained 🔗 🔗
Quick Primer on Types of Missing Data and Imputation Techniques 🔗 -
Imputation of Missing Data in Tables with DataWig 🔗 🔗

MLOps - Machine Learning Operations 👨‍🔧

Title Article Repo
Key Learning Points from MLOps Specialization — Course 1/4 🔗 🔗
Key Learning Points from MLOps Specialization — Course 2/4 🔗 🔗
Key Learning Points from MLOps Specialization — Course 3/4 🔗 🔗
Key Learning Points from MLOps Specialization — Course 4/4 🔗 🔗
⭐ End-to-End AutoML Pipeline with H2O AutoML, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell 🔗 🔗
⭐ How to Dockerize Machine Learning Applications Built with H2O, MLflow, FastAPI, and Streamlit 🔗 🔗
⭐ Building and Managing an Isolation Forest Anomaly Detection Pipeline with Kedro 🔗 🔗

Natural Language Processing 📑

Title Article Repo
COVID-19 Vaccine — What’s the Public Sentiment? 🔗 🔗
Keyword Extraction and Analysis Pipeline with KeyBERT and Taipy 🔗 🔗

Networks and Graphs 🌐

Title Article Repo
⭐ Network Analysis and Visualization of Drug-Drug Interactions 🔗 🔗
How to Deploy Interactive Pyvis Network Graphs on Streamlit 🔗 🔗
A No-Code Approach to Building Knowledge Graphs 🔗 🔗

Responsible AI 👮

Title Article Repo
Responsible AI Masterclass (for Institute of Banking and Finance Singapore) 🔗 🔗

Sports Analytics ⚽

Title Article Repo
⭐ Analyzing English Premier League VAR Football Decisions 🔗 🔗
Combining Python and R for FIFA Football World Ranking Analysis 🔗 🔗

Visualization 📈

Title Article Repo
Uniform Singapore Energy Price and Demand Forecast Dashboard (with Plotly Dash) - 🔗
Visualizing Fortune 500 Companies in a Bar Chart Race 🔗 🔗
How to Easily Draw Neural Network Architecture Diagrams 🔗 🔗

Web Development 🖥️

Title Article Repo
⭐ Post COVID-19 Vaccination Wait-Time Tracker (with Python Flask) 🔗 🔗
From HTTP to HTTPS — Easily Secure Flask Web Apps With Talisman 🔗 -
⭐ Food King Directory (in collaboration with Night Owl Cinematics) 🔗 🔗

Web3 and Metaverse 👨‍💻

Title Article Repo
The Web3 / Metaverse Glossary — A Keyword Guide to the Tech Future 🔗 -

Writing for DataCamp ✍️

Title Article Repo
⭐ What Mature Data Infrastructure Looks Like 🔗 -
Democratizing Data in Government Agencies 🔗 -
A Survey Into Data Governance Tools 🔗 -
Scaling Data Science With Data Governance 🔗 -
3 Reasons Why All Teams Should Learn SQL 🔗 -
3 Reasons Why All Teams Should Learn R 🔗 -
How Tableau Helps Your Organization Achieve Greater Data Insights 🔗 -
How PowerBI Helps Your Organization Achieve Greater Data Insights 🔗 -

Writing Tips 📜

Title Article Repo
Create a Clickable Table of Contents for Your Medium Posts 🔗 -

Popular repositories

  1. Llama-2-Open-Source-LLM-CPU-Inference

    Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A

    Python, 952 stars, 211 forks

  2. AWS-Certified-Cloud-Practitioner-Notes

    Notes compiled based on AWS E-Learning lessons and transcripts

    889 stars, 357 forks

  3. Neural-Network-Architecture-Diagrams

    Diagrams for visualizing neural network architecture (Created with diagrams.net)

    763 stars, 468 forks

  4. Failed-ML

    Compilation of high-profile real-world examples of failed machine learning projects

    719 stars, 47 forks

  5. MLOps-Specialization-Notes

    Notes for Machine Learning Engineering for Production (MLOps) Specialization course by DeepLearning.AI & Andrew Ng

    351 stars, 150 forks

  6. Generative-AI-Pharmacist

    Generative AI Pharmacist (For Demo Purposes Only)

    77 stars, 24 forks