Competition Link: https://www.kaggle.com/competitions/learning-equality-curriculum-recommendations
Competition Goal: Recommend content items to curators for potential inclusion in a topic, to reduce the time they spend searching for and discovering relevant content to consider including in each topic.
Solution:
- Stage 1: Use sentence transformers to extract embeddings for topics and contents. Fine-tuning with MultipleNegativesRanking loss boosted the score.
- Train a K-Nearest Neighbors (KNN) or Approximate Nearest Neighbors (ANN) algorithm to recommend related contents to the topic.
- Stage 2: Re-rank (Binary Classification) using fine-tuned model from Stage 1. The metric is F2 so it is important to select proper threshold.