🗃️ feat: General File Support for OpenAI, Azure, Custom, Anthropic and Google (RAG) #2143
Hello, I tried to set it up with Hugging Face, but when I try to send a PDF file I get this issue: I did the rag.yml setup with the .env and db setup, and I restarted everything to apply the changes. This error happened on two of my endpoints:
In addition, these are my logs:
Try without huggingface first, because I haven't fully tested it aside from the default model listed. Also it would be more helpful to see the RAG API logs in this case. Embedding dimensions might be an issue but you can try
@delta-whiplash Note, you may need to use Also it may be related to your device and I can look into a more flexible setup. This setup works for me:
EMBEDDINGS_PROVIDER=huggingface
EMBEDDINGS_MODEL=sentence-transformers/all-MiniLM-L6-v2
OK, I reviewed my .env (the rag-api one), changed it, and restarted everything, but the issue is still there. These are the logs from rag-api:
The logs from librechat:
This is my rag-api .env file:
This is my librechat yaml:
I believe you are not making a successful connection to the RAG API. I will add a health check route to verify this. Try to access it in your browser: http://rag_api:port/docs
You're right, I can't access it. BTW, I observed multiple restarts of the container.
Happens when the API can't access the PSQL/pgvector db. Note that
I FOUND IT!!!! Now I have new logs:
Thanks, will look into it. I ask that you start a discussion here for further help as I'm locking this PR as merged.
Summary
Implements general file support with a basic implementation of RAG using an augmented prompt approach.
This commit introduces the feature mainly for testing, as it will not be part of the default setup for now.
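As a rough illustration of what an "augmented prompt approach" generally means, the sketch below prepends retrieved file excerpts to the user's question before it is sent to the model. It is not taken from this PR: the function name `build_augmented_prompt`, the `max_chunks` parameter, and the prompt wording are all hypothetical, and the retrieval step (embedding and vector search) is assumed to have already produced the `chunks` list.

```python
from typing import List


def build_augmented_prompt(question: str, chunks: List[str], max_chunks: int = 4) -> str:
    """Prepend retrieved file excerpts to the user's question.

    Hypothetical helper: `chunks` stands in for whatever text the
    retrieval step returned for the attached files.
    """
    # Drop empty excerpts and cap how many are injected into the prompt.
    selected = [c.strip() for c in chunks if c.strip()][:max_chunks]
    if not selected:
        # Nothing retrieved: fall back to the plain question.
        return question

    # Number each excerpt so the model can refer back to it.
    context = "\n\n".join(f"[{i}] {text}" for i, text in enumerate(selected, start=1))
    return (
        "Use the following excerpts from the attached files to answer.\n\n"
        f"{context}\n\n"
        f"Question: {question}"
    )
```

Calling `build_augmented_prompt("What is the refund policy?", retrieved)` returns a single string that can be sent as the user message. Capping the number of excerpts keeps the prompt within the model's context window.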
Setup
- Use rag.yml compose file: docker compose -f rag.yml up
- More info: https://github.com/danny-avila/rag_api
- Uses PSQL/PGVector + FastAPI
- Uses OpenAI embeddings by default
- Can use HuggingFace Sentence transformers but not thoroughly tested
- Open to contributions
- Current setup relies on sharing volume storage, namely the uploads directory at project root.
- Set RAG_API_URL accordingly
  - RAG_API_URL=http://localhost:8000
  - RAG_API_URL=http://host.docker.internal:8000
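Since a wrong RAG_API_URL or an unreachable RAG API is an easy mistake to make here, a small check along these lines may help confirm the connection before uploading files. It is only a sketch: the helper names are hypothetical, falling back to http://localhost:8000 when the variable is unset is an assumption made for illustration, and the /docs path is the FastAPI documentation page suggested in the conversation above.

```python
import os
import urllib.request

# Assumed fallback for illustration; it matches the first value in the setup notes.
DEFAULT_RAG_API_URL = "http://localhost:8000"


def rag_api_url() -> str:
    """Read RAG_API_URL from the environment, without a trailing slash."""
    return os.environ.get("RAG_API_URL", DEFAULT_RAG_API_URL).rstrip("/")


def rag_api_reachable(base_url: str, timeout: float = 3.0) -> bool:
    """Return True if the FastAPI docs page at <base_url>/docs answers with HTTP 200."""
    try:
        with urllib.request.urlopen(f"{base_url}/docs", timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, ValueError):
        # OSError covers refused connections, DNS failures, timeouts and HTTP errors;
        # ValueError covers malformed URLs (for example, a missing scheme).
        return False
```

For example, `rag_api_reachable(rag_api_url())` returning False from inside the LibreChat container would suggest that the URL points at the wrong host or that the RAG API container is not running.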
The current approach to RAG is subject to change as it is improved.
Resend Files
- Replaces resendImages with resendFiles, which now defaults to true.
Other Changes
Change Type
Checklist