Onboarding guide to Jimmy Lin's research group at the University of Waterloo

castorini/onboarding

Onboarding Guide

Undergraduates at the University of Waterloo: if you want to work with me, read this guide first.

🧱 Foundations of Retrieval

This onboarding path is the starting point for working in our group. It comprises the following lessons:

  1. Begin your journey here.
  2. BM25 Baselines for MS MARCO Passage Ranking in Anserini
  3. BM25 Baseline for MS MARCO Passage Ranking in Pyserini
  4. A Conceptual Framework for Retrieval
  5. Contriever Baseline for NFCorpus
  6. A Deeper Dive into Dense and Sparse Representations

Resources

This repository introduces several methods for users without local GPU resources.

Training monoBERT from Scratch

This is a guide to fine-tuning monoBERT on the MS MARCO passage dataset, based on the Capreolus toolkit. Compute Canada users may need to set up their environment by following this guide.
