Deep Transformer-Based Passage Ranking System

Lingling Zhang, Toronto Metropolitan University

This project aims to build an information retrieval (IR) passage ranking system using deep transformer-based pre-trained language models. It leverages the MS MARCO passage ranking dataset and compares different modelling strategies.

It implements both ranking systems built from single models and a retrieve-rerank system that combines Bi-Encoder and Cross-Encoder models. The project also implements MRR@10 and MAP@10 metrics to evaluate the IR systems. Among the single models, it found that fine-tuning a pre-trained Cross-Encoder model achieves better results than the other models, which echoes what is reported in previous literature.
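
As a rough illustration of how these metrics work, here is a minimal sketch of MRR@10 and MAP@10 in plain Python, assuming each query's results arrive as a ranked list of 0/1 relevance labels; the function names and the @k normalization variant below are illustrative, not necessarily what the notebooks implement.

```python
# Minimal sketch of MRR@k and MAP@k over binary relevance labels.
# Input: one ranked list of 0/1 labels per query (1 = relevant).

def mrr_at_k(ranked_labels_per_query, k=10):
    """Mean Reciprocal Rank: average of 1/rank of the first relevant hit in the top k (0 if none)."""
    total = 0.0
    for labels in ranked_labels_per_query:
        for rank, label in enumerate(labels[:k], start=1):
            if label:
                total += 1.0 / rank
                break
    return total / len(ranked_labels_per_query)

def map_at_k(ranked_labels_per_query, k=10):
    """Mean Average Precision: average of per-query AP computed over the top k results."""
    total = 0.0
    for labels in ranked_labels_per_query:
        hits, precision_sum = 0, 0.0
        for rank, label in enumerate(labels[:k], start=1):
            if label:
                hits += 1
                precision_sum += hits / rank
        if hits:
            total += precision_sum / hits  # one common @k normalization
    return total / len(ranked_labels_per_query)

# Two toy queries: relevance labels of their ranked passages.
runs = [[0, 1, 0, 1], [1, 0, 0, 0]]
print(mrr_at_k(runs))  # (1/2 + 1/1) / 2 = 0.75
print(map_at_k(runs))  # ((1/2 + 2/4)/2 + 1/1) / 2 = 0.75
```

MRR rewards placing the first relevant passage as high as possible, while MAP also rewards the density of relevant passages near the top of the ranking.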

The project also produces Train, Validation, and Test subsets from the MS MARCO data offered by the TREC 2023 competition. Each subset has 1,000 queries with 20 passages per query (the 10 most relevant and the 10 least relevant of the 100 ranked passages in the original data source). These subsets can be found in the "output" folder and can be used for study purposes.
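
For example, one of the subsets can be loaded directly with pandas (a minimal sketch; the file name and the expected columns are guesses for illustration and should be checked against the actual files in the "output" folder):

```python
# Hypothetical example of loading one subset; adjust the file name to match
# the actual CSVs in the "output" folder.
import pandas as pd

df = pd.read_csv("output/train.csv")  # placeholder file name
print(df.shape)             # expect 1,000 queries x 20 passages = 20,000 rows
print(df.columns.tolist())  # e.g. query id/text, passage text, relevance label
```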

Model Performance:

[Image: model performance comparison]

Notes:

  1. Please note this project is for study purposes only. I used it to understand how to build and evaluate an IR system. The datasets used are small, and the strategies implemented are drawn from previous literature. The code is not optimal (I am still learning). Also, I purposely did not use an MS MARCO pre-trained model, to avoid possibly leaking information into the Test set.

  2. The data pipeline takes about 3 hours to run and requires a large amount of memory, so I have uploaded the output CSV files to the "output" folder. Users can skip the data pipeline and use the CSV files directly for modelling.

  3. Explanation of the modelling notebooks:

  • M001 serves as the baseline;

  • M002 and M004 are models without fine-tuning; they serve as comparison points;

  • M003 and M005 are the fine-tuned Bi-Encoder and Cross-Encoder models;

  • M006 loads the models trained in M003 and M005 to build a retrieve-rerank system (a rough sketch of this pattern follows these notes).

  4. Notebook M003 took over 900 minutes to run for me (on CPU); it exists purely for my own study, and users can choose to skip running it.
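
For readers unfamiliar with the retrieve-rerank pattern used in M006, here is a minimal sketch assuming the sentence-transformers library; the model checkpoints named below are generic placeholders, not the models trained in M003 and M005.

```python
# Minimal retrieve-rerank sketch with sentence-transformers.
# Checkpoints are placeholders, not the fine-tuned models from M003/M005.
from sentence_transformers import SentenceTransformer, CrossEncoder, util

bi_encoder = SentenceTransformer("all-MiniLM-L6-v2")             # retriever
cross_encoder = CrossEncoder("cross-encoder/stsb-roberta-base")  # reranker

passages = [
    "MS MARCO is a large-scale passage ranking dataset.",
    "Cross-Encoders score a query and a passage jointly.",
    "Bi-Encoders embed queries and passages independently.",
]
query = "How do Cross-Encoders rank passages?"

# Stage 1 (retrieve): the Bi-Encoder embeds everything independently and
# finds the top-k candidates by embedding similarity.
passage_embs = bi_encoder.encode(passages, convert_to_tensor=True)
query_emb = bi_encoder.encode(query, convert_to_tensor=True)
hits = util.semantic_search(query_emb, passage_embs, top_k=2)[0]

# Stage 2 (rerank): the Cross-Encoder rescores each (query, passage) pair jointly.
pairs = [(query, passages[hit["corpus_id"]]) for hit in hits]
scores = cross_encoder.predict(pairs)

# Final ranking: retrieved candidates sorted by Cross-Encoder score, best first.
for (_, passage), score in sorted(zip(pairs, scores), key=lambda x: x[1], reverse=True):
    print(f"{score:.3f}  {passage}")
```

The Bi-Encoder makes first-stage retrieval cheap because passage embeddings can be precomputed, while the slower but more accurate Cross-Encoder only has to rescore the small retrieved candidate set.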
