Skip to content

Latest commit

 

History

History
48 lines (26 loc) · 4 KB

README.md

File metadata and controls

48 lines (26 loc) · 4 KB

AI Resume Screening

Introduction

AI Resume Screening is a tool that uses artificial intelligence to automate the process of resume screening and shortlisting. The tool uses natural language processing and machine learning algorithms to analyze resumes and classify them to the job roles based on the words in their resume.

Our tool will be designed to make the hiring process easier for both HR teams and job seekers. Our idea is to develop a user-friendly web page that enables HR teams to select the specific job role they are recruiting for. Once the job role is chosen, a link is shared with potential candidates who can then upload their resumes.

Using pre-trained machine learning models, which are trained on thousands of resumes , our tool automatically filters out resumes that do not match the job requirements. Resumes are categorized based on specific keywords and phrases related to the job role ( for example , python developer – main words can be python , developed , project , computer science etc). If a candidate's resume matches the job description, they are immediately notified and their resume is uploaded to the company's database. Even if a candidate’s resume doesn’t match the description , it shows the potential role which is suitable for the uploaded resume.

Our tool saves HR teams valuable time and energy as they no longer need to manually sift through hundreds of resumes. Job seekers also benefit from receiving immediate feedback on their eligibility, streamlining their job search process.

Features

  1. Automated resume screening: AI Resume Screening saves time and effort by automatically screening resumes based on job requirements and pre-defined criteria.

  2. Improved accuracy: The tool uses advanced algorithms to analyze resumes, reducing the likelihood of human bias and improving the accuracy of the shortlisting process. We have achieved 91 percent accuracy.

  3. Efficient process: Each resume is categorized with the model in less than 5 seconds.

  4. Detailed output : The HR gets a detailed output of the Name, Email, Location, and the candidate's resumé, based on the scores.

Usage

The code consists of the following parts:

  1. Upload_Resume.py : This is the code which integrates the UI with the Trained model. Use the command “streamlit run Upload_Resume.py” which opens the web browser where the candidate can upload the resume. Alternatively the code can be hosted online using streamlit and the link can directly be sent to the candidate,to upload the resume.

  2. model_training.ipynb : It is the jupyter notebook which we used to train the final model on all algorithms. It also contains the accuracies of each algorithm we have used.TF-IDF was used to extract the features from the pre-processed resume.

  3. cv.pickle : It is the pickled file which contains the features of the model trained on the resumes by using TF-IDF. This pickle file is used to compare the features of the Uploaded resume with the model.

  4. RF.joblib.zip: It contains the compressed machine learning model (Random Forest) which had the highest accuracy. This is the model used to predict the category in which the resume fits. Decompress this file to get the pretrained model 'RF.joblib'

  5. SQL.txt : Contains the MySQL queries to set up a database to store the details of the short listed candidates.

  6. HR.py: The page where HR enters the job role which is open for hiring, based on which shortlisting of candidates is done. Use the command “streamlit run HR.py” to run it in the local server.

  7. pages/Show_Resumes.py: It displays the resumes of the shortlisted candidates by sorting them in descending order of the scores.

Other things to know:

  1. The data of shortlisted candidates will be stored in a MySQL database, making it feasible to view their profiles.

  2. The 'pages' folder should be placed in the same parent folder which contains 'HR.py'

Datasets used for the model:

  1. https://www.kaggle.com/datasets/gauravduttakiit/resume-dataset

  2. https://www.kaggle.com/datasets/snehaanbhawal/resume-dataset