castorini/ragamuffin

RAGamuffin: The Redis of IR

If you have $O(100K)$ documents, own a beefy GPU, and want a fluent and efficient information retrieval toolkit that "just works," then RAGamuffin is for you!

Code Sample

import ragamuffin as rg

documents = [...]
encode_pipeline = rg.Chunkify() | rg.Tokenize() | (rg.DenseDocumentEncoder() & rg.BM25())
db = encode_pipeline(documents)

query_pipeline = rg.Chunkify() | rg.Tokenize() | (rg.DenseQueryEncoder() & rg.BM25()) | db | rg.TopK(10)
relevant_documents = query_pipeline("...")
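
The `|` and `&` operators in the sample read as "then" and "alongside." The sketch below shows one way such operators could be built in plain Python with `__or__` and `__and__`. It is an illustration only, not RAGamuffin's implementation: the `Stage` class and the toy stages (`chunkify`, `tokenize`, `count_tokens`, `first_tokens`) are hypothetical names, and the choice to return parallel results as a tuple is an assumption.

```python
from typing import Any, Callable


class Stage:
    """Hypothetical pipeline stage wrapping a plain function (not RAGamuffin's class)."""

    def __init__(self, fn: Callable[[Any], Any]):
        self.fn = fn

    def __call__(self, x: Any) -> Any:
        return self.fn(x)

    def __or__(self, other: "Stage") -> "Stage":
        # a | b: feed the output of a into b (sequential composition).
        return Stage(lambda x: other(self(x)))

    def __and__(self, other: "Stage") -> "Stage":
        # a & b: run both stages on the same input and return both outputs as a tuple.
        return Stage(lambda x: (self(x), other(x)))


# Toy stand-ins for the stages in the sample above (assumed behavior, for illustration).
chunkify = Stage(lambda text: text.split(". "))
tokenize = Stage(lambda chunks: [c.lower().split() for c in chunks])
count_tokens = Stage(lambda chunks: [len(c) for c in chunks])
first_tokens = Stage(lambda chunks: [c[0] for c in chunks])

pipeline = chunkify | tokenize | (count_tokens & first_tokens)
result = pipeline("Dense retrieval is fast. BM25 is a strong baseline")
print(result)
```

In Python `&` binds more tightly than `|`, so the parentheses around the parallel branch are not strictly required, but they keep the intent obvious, as in the sample above.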

About

An in-memory, everything-on-GPU retrieval system.
