Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Credit Card Fraud Detection Model #116

Merged
merged 2 commits into from
Jun 2, 2024
Merged

Credit Card Fraud Detection Model #116

merged 2 commits into from
Jun 2, 2024

Conversation

PadmalathaKasireddy
Copy link
Contributor

This PR introduces a Credit Card Fraud Detection model using machine learning techniques. The dataset used for this project is highly imbalanced, with legitimate transactions vastly outnumbering fraudulent ones. The primary objective is to accurately detect fraudulent transactions while addressing the challenges posed by the imbalanced dataset.

Key Features

  1. Exploratory Data Analysis (EDA):
    • Analysed the dataset to understand the distribution of features and the target variable.
    • Visualized the distribution of transaction amounts and transaction times.

image

image

  • Created a correlation heatmap to identify relationships between features.
  1. Data Preprocessing:

  2. Handling Imbalanced Data:

    • The dataset is highly imbalanced with only 0.2% fraudulent transactions.
      image

    • Applied Synthetic Minority Over-Sampling Technique (SMOTE) to create a balanced training dataset by generating synthetic samples for the minority class.

  3. Model Training and Evaluation:

    • Trained and evaluated multiple machine learning models:
      • Logistic Regression
      • Decision Tree
      • Random Forest
      • Gradient Boosting Classifier
    • Compared models using key metrics such as accuracy, log loss, AUC-ROC, F1-score, and recall.
  4. Chosen Model:

    • Selected the Gradient Boosting Classifier as the final model based on its performance metrics.
    • Gradient Boosting showed a good balance between precision and recall, and provided a high AUC-ROC score, indicating its effectiveness in distinguishing between legitimate and fraudulent transactions.
  5. Performance Metrics:

    • AUC-ROC Score: 0.9670
    • F1-score: 0.2884
    • Recall: 0.8444
  6. ROC Curve:

    • Plotted the Receiver Operating Characteristic (ROC) curve to visualize the model's performance in distinguishing between legitimate and fraudulent transactions.
      image

Copy link

github-actions bot commented Jun 1, 2024

Thank you for submitting your pull request! 🙌 We'll review it as soon as possible. In the meantime, please ensure that your changes align with our CONTRIBUTING.md. If there are any specific instructions or feedback regarding your PR, we'll provide them here. Thanks again for your contribution! 😊

@PadmalathaKasireddy
Copy link
Contributor Author

@sanjay-kv
Please review this PR and provide me with any suggestions.
Thank you.

@Jatin9826
Copy link

please assign me this project i want to contribute this and i want to do some changes in this

@Jatin9826
Copy link

assign me this project

@sanjay-kv sanjay-kv merged commit 051744d into recodehive:main Jun 2, 2024
2 checks passed
@Jatin9826
Copy link

Jatin9826 commented Jun 5, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants