Add MiniLM cross-encoder reranker #200
Conversation
Thanks a lot for this! A few comments below. Yes, it would be great if you could add AMP support in a separate PR after we get this merged.
pygaggle/rerank/transformer.py
Outdated
@@ -247,3 +249,29 @@ def rescore(self, query: Query, texts: List[Text]) -> List[Text]:
            text.score = max(smax_val.item(), emax_val.item())
        return texts


class CrossEncoderReranker(Reranker):
I think calling only this one CrossEncoderReranker is not a good naming convention, since the other rerankers are cross-encoders too. How about changing it to SentenceTransformersReranker?
Sure, that'd be better!
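For context, the new class presumably wraps the CrossEncoder class from the sentence_transformers package, which scores (query, passage) pairs jointly. A minimal sketch of that underlying call (the checkpoint name here is just an example MS MARCO MiniLM cross-encoder, not necessarily the PR's default):

```python
# Minimal sketch of the sentence_transformers API the reranker wraps.
from sentence_transformers import CrossEncoder

# Example MS MARCO MiniLM cross-encoder checkpoint (illustrative choice).
model = CrossEncoder('cross-encoder/ms-marco-MiniLM-L-6-v2')

# predict() scores each (query, passage) pair; higher score = more relevant.
scores = model.predict([
    ('who proposed the theory of relativity?',
     'Albert Einstein published the special theory of relativity in 1905.'),
    ('who proposed the theory of relativity?',
     'The capital of France is Paris.'),
])
print(scores)
```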
pygaggle/rerank/transformer.py
Outdated
                 device=None,
                 use_amp=None):
        device = device or ('cuda' if torch.cuda.is_available() else 'cpu')
        self.use_amp = use_amp or (device == 'cuda')
Why would we want to default to AMP whenever there is a GPU? It could cause a performance drop; an explicit flag would be better.
You're right, I'm changing it to be disabled by default.
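A hedged sketch of what an explicit, off-by-default AMP flag could look like in the constructor and scoring path. The class and method names follow the review discussion above; the exact signatures and defaults in the merged code may differ:

```python
import torch
from sentence_transformers import CrossEncoder


class SentenceTransformersReranker:
    def __init__(self,
                 model_name='cross-encoder/ms-marco-MiniLM-L-6-v2',
                 device=None,
                 use_amp=False):  # explicit opt-in instead of defaulting on when a GPU is present
        device = device or ('cuda' if torch.cuda.is_available() else 'cpu')
        self.device = device
        self.use_amp = use_amp
        self.model = CrossEncoder(model_name, device=device)

    def rescore(self, query, texts):
        pairs = [(query.text, text.text) for text in texts]
        # autocast(enabled=False) is a no-op, so CPU runs are unaffected.
        with torch.cuda.amp.autocast(enabled=self.use_amp):
            scores = self.model.predict(pairs)
        for text, score in zip(texts, scores):
            text.score = float(score)
        return texts
```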
LGTM! Thanks a lot for doing this
@ronakice, I will let you merge when you think it is ready.
Thanks a lot @hugoabonizio, I'm merging!
This PR adds a new reranker based on a MiniLM cross-encoder pretrained on MS MARCO and provided by the SentenceTransformers package. MiniLM-based models are much faster than MonoT5/MonoBERT while achieving results similar to MonoT5.
Although it adds a new dependency, SentenceTransformers handles smart batching and other optimizations that would otherwise need to be implemented in PyGaggle. I also added AMP integration for FP16 inference, which should make inference even faster (I can add this to other models if that's desirable).
Besides MiniLM-based models, SentenceTransformers provides a list of other supported cross-encoder models that can be used with this reranker.
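To illustrate, here is roughly how the new reranker could be used together with pygaggle's Query and Text classes. The class name follows the review discussion above, and the constructor defaults are assumptions; the merged code may differ slightly:

```python
from pygaggle.rerank.base import Query, Text
from pygaggle.rerank.transformer import SentenceTransformersReranker  # name per the review discussion

# Assumed defaults: MiniLM MS MARCO cross-encoder, AMP disabled.
reranker = SentenceTransformersReranker()

query = Query('who proposed the theory of relativity?')
texts = [
    Text('Albert Einstein published the special theory of relativity in 1905.'),
    Text('The capital of France is Paris.'),
]

# rescore() assigns a relevance score to each candidate text.
reranked = reranker.rescore(query, texts)
for text in sorted(reranked, key=lambda t: t.score, reverse=True):
    print(round(text.score, 4), text.text)
```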