Recommender System 2023 Challenge - Polimi

This repository contains the project developed for the Recommender System 2023 Challenge, a competition hosted on Kaggle and open only to students of the Recommender Systems course at Politecnico di Milano.

Competition Overview

The application domain is book recommendation. The provided datasets contain user-book interactions, where an interaction is recorded when the user gave the book a rating of at least 4. The main goal of the competition is to predict which items (books) a user will interact with.

The datasets include around 600k interactions, 13k users, and 22k items (books). The training-test split is a random holdout: 80% training, 20% test. For each user, the goal is to recommend a list of 10 potentially relevant items, and submissions are evaluated with MAP@10. Any kind of recommender algorithm written in Python can be used.
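As a reference for the metric, here is a minimal sketch of MAP@10 in plain Python. It assumes one common definition, in which each user's average precision is normalized by min(k, number of relevant items); the exact normalization used by the competition is not stated here, so treat that choice as an assumption. The function names are illustrative and do not come from the repository.

```python
def average_precision_at_k(recommended, relevant, k=10):
    """AP@k for one user, normalized by min(k, number of relevant items).

    Assumes `recommended` is a ranked list without duplicate items.
    """
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(recommended[:k], start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank, counted only where a hit occurs.
            precision_sum += hits / rank
    return precision_sum / min(k, len(relevant))


def map_at_k(recommendations, ground_truth, k=10):
    """Mean AP@k over users that have at least one held-out item."""
    users = [user for user, items in ground_truth.items() if items]
    if not users:
        return 0.0
    total = sum(
        average_precision_at_k(recommendations.get(user, []), ground_truth[user], k)
        for user in users
    )
    return total / len(users)
```

With this definition, a user whose two held-out books appear at ranks 1 and 3 gets AP = (1/1 + 2/3) / 2 = 5/6.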

Project Overview

The project aims to build a recommender system capable of suggesting items to users based on their past interactions.

The best performance was achieved through a combination of different strategies:

Linear Combination Ensembles

The system uses an ensemble of different recommenders. The ensemble is built sequentially: candidates are considered from the best performing to the worst, and each one is added only if it improves the performance. For more details, please refer to the notebook Sequential Ensamble.ipynb.
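The sequential procedure can be sketched as a greedy forward selection. This is an illustration under stated assumptions, not the notebook's code: `evaluate` is a hypothetical callable that maps a `{name: weight}` dictionary to a validation score such as MAP@10, and the small weight grid is also an assumption, since the way weights are chosen is not described here.

```python
def greedy_sequential_ensemble(names, evaluate, weight_grid=(0.25, 0.5, 1.0)):
    """Add recommenders one at a time, keeping each only if the score improves.

    names: recommender names (hypothetical identifiers).
    evaluate: callable {name: weight} -> validation score (assumed interface).
    """
    # Rank candidates by their stand-alone score, best first.
    ranked = sorted(names, key=lambda name: evaluate({name: 1.0}), reverse=True)
    ensemble = {ranked[0]: 1.0}
    best_score = evaluate(ensemble)

    for name in ranked[1:]:
        chosen_weight = None
        trial_best = best_score
        for weight in weight_grid:
            score = evaluate({**ensemble, name: weight})
            if score > trial_best:
                trial_best, chosen_weight = score, weight
        # Keep the candidate only when some weight beats the current ensemble.
        if chosen_weight is not None:
            ensemble[name] = chosen_weight
            best_score = trial_best
    return ensemble, best_score


# Toy scoring function standing in for a real validation run.
toy_gain = {"ItemKNN": 0.5, "SLIM": 0.2, "TopPop": -0.1}


def toy_evaluate(weights):
    return sum(weight * toy_gain[name] for name, weight in weights.items())


ensemble, score = greedy_sequential_ensemble(list(toy_gain), toy_evaluate)
```

In the toy run, SLIM is added because it raises the score, while TopPop is skipped because every tested weight lowers it. A real `evaluate` would combine the recommenders' item scores linearly with the given weights and measure the result on a validation split.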

User-Specific Recommender

The recommender system splits users into groups based on their number of interactions and uses a different ensemble for each group. For example, for users with few interactions, TopPop has a higher weight than the other recommenders. For more details, please refer to the notebook UserSpecific.ipynb.
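A minimal sketch of the routing idea follows, assuming users are bucketed by profile length with fixed cutoffs. The class name, the cutoffs, and the callable interface are hypothetical and are not taken from UserSpecific.ipynb.

```python
from bisect import bisect_right


class UserGroupRouter:
    """Route each user to the recommender assigned to their activity group."""

    def __init__(self, boundaries, recommenders):
        # boundaries: sorted profile-length cutoffs, e.g. [5, 50] gives the
        # groups [0, 5), [5, 50) and [50, inf).
        # recommenders: one callable (user_id, k) -> list of item ids per group.
        if len(recommenders) != len(boundaries) + 1:
            raise ValueError("need exactly one recommender per group")
        self.boundaries = list(boundaries)
        self.recommenders = list(recommenders)

    def group_of(self, profile_length):
        # Index of the group whose range contains this profile length.
        return bisect_right(self.boundaries, profile_length)

    def recommend(self, user_id, profile_length, k=10):
        return self.recommenders[self.group_of(profile_length)](user_id, k)


# Toy recommenders returning fixed lists, standing in for trained ensembles.
router = UserGroupRouter(
    boundaries=[5, 50],
    recommenders=[
        lambda user_id, k: ["popular_1", "popular_2"][:k],  # few interactions
        lambda user_id, k: ["mid_1", "mid_2"][:k],          # medium activity
        lambda user_id, k: ["heavy_1", "heavy_2"][:k],      # many interactions
    ],
)
```

Here `router.recommend(user_id=7, profile_length=3, k=1)` goes to the first group, which in a real setup would be an ensemble that gives TopPop a higher weight.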

XGBoost

The system leverages XGBoost, using features such as user activity, popularity score, latent features from a Deep Encoder, and the scores predicted by the best-performing ensemble. For more details, please refer to the notebook XGBoost.ipynb.

Data Preprocessing

The data preprocessing phase involved several steps to enhance the performance of the recommenders. Cold users and items were removed from the User-Item Interaction Matrix (URM) to reduce noise and improve the accuracy of the recommendations. A mapping back to the original user and item IDs was implemented for submission. The existing class was modified to automatically handle cold users and items as well as preprocessed URMs. The implementation can be found in Data Preprocessing.ipynb.
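The cold-entity removal and ID mapping can be illustrated with a small sketch in plain Python. It assumes interactions are available as `(user_id, item_id)` pairs and that a cold user or item is one with no interactions, so re-indexing only the IDs that appear drops the empty rows and columns of the URM. The function name is hypothetical and the notebook may implement this differently.

```python
def compact_interactions(interactions):
    """Re-index users and items so that only warm ones get a row or column.

    Returns the remapped pairs plus the lists that map each new index
    back to the original ID (needed when writing the submission).
    """
    interactions = list(interactions)
    # Only IDs that actually occur are kept; cold users and items vanish.
    index_to_user = sorted({user for user, _ in interactions})
    index_to_item = sorted({item for _, item in interactions})
    user_to_index = {user: idx for idx, user in enumerate(index_to_user)}
    item_to_index = {item: idx for idx, item in enumerate(index_to_item)}
    remapped = [(user_to_index[u], item_to_index[i]) for u, i in interactions]
    return remapped, index_to_user, index_to_item


# Toy data: original IDs are sparse, so a dense URM would hold empty rows.
pairs = [(10, 7), (10, 3), (42, 7)]
remapped, index_to_user, index_to_item = compact_interactions(pairs)

# Recommendations come back as compact item indices; map them to original IDs.
recommended_indices = [1, 0]
original_item_ids = [index_to_item[idx] for idx in recommended_indices]
```

The remapped pairs can then be used to build the sparse URM, while `index_to_user` and `index_to_item` restore the original IDs for the submission file.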

Wrappers

A wrapper for LinearCombinationEnsemble was created from scratch to automate training and inference for recommenders composed of several base recommenders. The ensemble combines the strengths of multiple recommenders and mitigates their weaknesses, leading to more accurate and diverse recommendations.

Analogously, the UserSpecific class was implemented. It makes it more intuitive to use a combination of recommenders chosen for each user based on profile length.

See the Recommenders folder for more detail.

Results

The combination of these strategies resulted in a robust and performant recommender system, achieving high scores in the competition.

