Add support for open source models based on text-embeddings-inference #66

bluechanel · 2025-01-23T06:28:49Z

Add support for broader integration of embedding models. This update leverages the open-source embedding inference project text-embeddings-inference by Hugging Face.

Signed-off-by: wileyzhang bluechanel612@gmail.com

…ce. Signed-off-by: wileyzhang <bluechanel612@gmail.com>

Signed-off-by: wileyzhang <bluechanel612@gmail.com>

codingjaguar · 2025-01-23T07:02:34Z

Thanks for the contribution! Curious, what is the extra value of OpenSourceRerankFunction compared to CrossEncoderRerankFunction?

bluechanel · 2025-01-23T08:08:45Z

Thanks for the contribution! Curious, what is the extra value of OpenSourceRerankFunction compared to CrossEncoderRerankFunction?

OpenSourceRerankFunction benefits from leveraging text-embeddings-inference, which provides significantly higher throughput and lower latency compared to CrossEncoderRerankFunction. For detailed performance metrics, you can refer to the official benchmark results here: https://huggingface.co/docs/text-embeddings-inference/index#text-embeddings-inference.

codingjaguar

Could you please help sketch a documentation similar to https://milvus.io/docs/embed-with-sentence-transform.md with simple examples? That would be really helpful to other users of milvus model lib.

src/pymilvus/model/dense/__init__.py

README.md

codingjaguar · 2025-01-23T08:11:03Z

src/pymilvus/model/dense/opensource.py

+    @property
+    def dim(self):
+        if self._dim is None:
+            self._dim = self._call_api(["get dim"])[0].shape[0]


does this really work? i.e. self._call_api(["get dim"]) aka self._session.post(self.api_url,
json= {"input": ["get dim"]},) will return the vector shape? That sounds magical

This works by sending a dummy message to the API to retrieve the vector dimension, as the original API does not directly provide this information. I'll add a comment here for clarification.

codingjaguar · 2025-01-23T08:12:16Z

src/pymilvus/model/reranker/opensource.py

+            self.api_url,
+            json={
+                "query": query,
+                "raw_scores": False,


shall these params be configurable?

and what does raw_scores mean? say will it not return scores?

When raw_scores is set to false, the returned scores are normalized to a range of 0-1. When set to true, the scores are the raw, unnormalized values. I believe it should default to false to align with mdoel like JinaAI Rerank. Perhaps I should consider removing this configuration entirely.

Co-authored-by: codingjaguar <codingjaguar@gmail.com>

Signed-off-by: wileyzhang <bluechanel612@gmail.com>

…e' into support_text-embeddings-inference

Signed-off-by: wileyzhang <bluechanel612@gmail.com>

bluechanel · 2025-01-23T10:11:08Z

Thanks for the contribution! Curious, what is the extra value of OpenSourceRerankFunction compared to CrossEncoderRerankFunction?

The example documentation has been submitted at milvus-io/milvus-docs#2998.

zc277584121 · 2025-01-23T10:54:41Z

/lgtm

wiley added 5 commits January 23, 2025 14:24

✨ Add support for open source models based on text-embeddings-inferen…

8bc54c1

…ce. Signed-off-by: wileyzhang <bluechanel612@gmail.com>

✨ Add support for open source models based on text-embeddings-inferen…

68ca669

…ce. Signed-off-by: wileyzhang <bluechanel612@gmail.com>

✨ Add support for open source models based on text-embeddings-inferen…

7530119

…ce. Signed-off-by: wileyzhang <bluechanel612@gmail.com>

✨ Add support for open source models based on text-embeddings-inference

1238eb7

Signed-off-by: wileyzhang <bluechanel612@gmail.com>

✨ Add support for open source models based on text-embeddings-inference

2fb7f46

Signed-off-by: wileyzhang <bluechanel612@gmail.com>

codingjaguar reviewed Jan 23, 2025

View reviewed changes

bluechanel and others added 4 commits January 23, 2025 16:21

Update README.md

30d0d96

Co-authored-by: codingjaguar <codingjaguar@gmail.com>

🎨 Modify the name and delete redundant configurations

6bd6a78

Signed-off-by: wileyzhang <bluechanel612@gmail.com>

Merge remote-tracking branch 'origin/support_text-embeddings-inferenc…

8cfca21

…e' into support_text-embeddings-inference

🐛 top_k limit

1d42ad0

Signed-off-by: wileyzhang <bluechanel612@gmail.com>

bluechanel mentioned this pull request Jan 23, 2025

Add TEIEmbeddingFunction and TEIRerankFunction with usage documentation milvus-io/milvus-docs#2998

Open

codingjaguar approved these changes Jan 23, 2025

View reviewed changes

junjiejiangjjj merged commit 4974e2d into milvus-io:main Jan 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for open source models based on text-embeddings-inference #66

Add support for open source models based on text-embeddings-inference #66

bluechanel commented Jan 23, 2025

codingjaguar commented Jan 23, 2025

bluechanel commented Jan 23, 2025

codingjaguar left a comment

codingjaguar Jan 23, 2025

bluechanel Jan 23, 2025

codingjaguar Jan 23, 2025

codingjaguar Jan 23, 2025

codingjaguar Jan 23, 2025

bluechanel Jan 23, 2025

codingjaguar Jan 23, 2025

bluechanel commented Jan 23, 2025

zc277584121 commented Jan 23, 2025

Add support for open source models based on text-embeddings-inference #66

Add support for open source models based on text-embeddings-inference #66

Conversation

bluechanel commented Jan 23, 2025

codingjaguar commented Jan 23, 2025

bluechanel commented Jan 23, 2025

codingjaguar left a comment

Choose a reason for hiding this comment

codingjaguar Jan 23, 2025

Choose a reason for hiding this comment

bluechanel Jan 23, 2025

Choose a reason for hiding this comment

codingjaguar Jan 23, 2025

Choose a reason for hiding this comment

codingjaguar Jan 23, 2025

Choose a reason for hiding this comment

codingjaguar Jan 23, 2025

Choose a reason for hiding this comment

bluechanel Jan 23, 2025

Choose a reason for hiding this comment

codingjaguar Jan 23, 2025

Choose a reason for hiding this comment

bluechanel commented Jan 23, 2025

zc277584121 commented Jan 23, 2025