Skip to content

This work proposes a deep learning based approach to deepfake detection for both single and multi face videos. VStream is a video streaming application designed to provide users with an authentic and secure experience by ensuring that only real videos are uploaded and displayed on the platform.

Notifications You must be signed in to change notification settings

shreyavaidya2311/mimvit-deepfake-detection

Repository files navigation

MIM-ViT: Deepfake Detection using Masked Image Modelling and Vision Transformer

Publication Link

Introduction

The term “deepfake” is derived from the words “deep learning” and “fake”. Deepfakes are synthetic media that use deep learning to manipulate or replace the original content of an image, audio, or video file to create a new version that appears to be authentic. The number of deepfake videos has been multiplying rapidly in the past 5 years. The alarming rate of growth of deepfakes is a major cause of concern. Deepfake detection is important for several reasons:
  • Preventing Spread of Misinformation

  • Protecting Personal Privacy

  • Ensuring Digital Trust

This work proposes a deep learning based approach to the above-stated problem statement which addresses three major research gaps, Multi-face Deepfake Detection, Face Quality Detection and Convergence with less data. The solution consists of two sub-models working in parallel, namely the Multiscale Vision Transformer and the Masked Autoencoder, ConvNeXt. A novel facial quality detection algorithm is developed, which helps improve the data quality by overcoming the challenge of misrepresented facial data.

Proposed Model


Multi-face Deepfake Detection


Applications - VStream

VStream is a video streaming application designed to provide users with an authentic and secure experience by ensuring that only real videos are uploaded and displayed on the platform. The application allows users to upload and watch videos, with a focus on ensuring the authenticity of the content. VStream utilizes the proposed model MIM-ViT to detect deepfake videos and prevents them from being shared.

Product Functions

  • Video Upload

  • Video Viewing

  • Deepfake Detection

  • User Profiles

  • Search Functionality

Tech Stack

Working Application


Applications - Deepfake Detector Discord Bot

The Deepfake Detector bot is designed to operate within a Discord server and monitor messages containing videos to identify and flag any videos that may contain deepfake content.

Functional Requirements

  • User Registration and Authentication

  • Monitoring Messages

  • Deepfake Detection

  • Flagging Deepfakes

Non-Functional Requirements

  • Performance

  • Security

  • Compatibility

Tech Stack

Working Application


Installation

  1. Clone the repository

    git clone https://github.com/shreyavaidya2311/mimvit-deepfake-detection.git
    
  2. API Keys

    Command Link
    Youtube API Link
    Discord Bot Token Link
    Dropbox API Link

    Replace DISCORD_BOT_TOKEN, DROPBOX_TOKEN and YOUTUBE_API_KEY in the code with your keys

  3. Frontend Dependencies

    cd frontend && npm i
    
  4. Backend Dependencies

    pip install -r requirements.txt
    pip install discord.py
    
  5. Run the frontend (http://localhost:3000/)

    cd frontend && npm start
    
  6. Run the backend (http://localhost:8000/)

    uvicorn app:app --host=0.0.0.0 --port=8000
    
  7. Run the Discord Bot

    Before running the bot, create a bot on the Discord Developer Console and add it to your server. Paste the bot token in bot.py

    python3 bot.py
    

Contributors

About

This work proposes a deep learning based approach to deepfake detection for both single and multi face videos. VStream is a video streaming application designed to provide users with an authentic and secure experience by ensuring that only real videos are uploaded and displayed on the platform.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •