With the official Ollama build of snowflake-arctic-embed-l (latest/335m - 21ab8b9b0545), input longer than 512 tokens causes an error instead of being truncated.
On an earlier Ollama version (0.3.9), passing more than 512 tokens returns an embedding of all zeros ([0,0,0...]).
In 0.5.4, Ollama returns a 500 error and the logs show "Process xxxxxx (ollama_llama_se) of user xxx dumped core".
I've checked my Ollama parameters, and this occurs when "truncate": true. Other embedding models truncate the input properly, and Ollama logs an INFO message saying "input truncated". I don't see this message with snowflake-arctic-embed-l.
When "truncate" is set to false, I get the expected error, "input length exceeds maximum context length".
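To make the two cases above concrete, here is a rough sketch of request bodies that should trigger them. It assumes the requests go to Ollama's `/api/embed` endpoint, that the model tag is `snowflake-arctic-embed:335m`, and that 1000 repeated words are enough to exceed 512 tokens; none of these details are stated in the report, and the helper names are made up for illustration.

```python
import json

# Assumption: the tag under which the reported model was pulled.
MODEL = "snowflake-arctic-embed:335m"


def long_input(n_words=1000):
    """Build a text that should exceed the 512-token limit (assumes >= 1 token per word)."""
    return " ".join(["token"] * n_words)


def repro_payload(truncate):
    """JSON body for an embed request with the given truncate setting."""
    return {"model": MODEL, "input": long_input(), "truncate": truncate}


if __name__ == "__main__":
    # truncate=True is the case reported as crashing; truncate=False should
    # give the "input length exceeds maximum context length" error.
    for truncate in (True, False):
        body = json.dumps(repro_payload(truncate))
        print(f"truncate={truncate}: {len(body)} bytes of JSON")
```

Sending each body as a POST to a local Ollama server should be enough to compare the two behaviours.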
The problem can be worked around by setting num_ctx for the model to its actual context length, rather than the default value of 2048 that Ollama uses. You can do that either by setting num_ctx in the API call ("options":{"num_ctx":512}) or by creating a copy of the model with that parameter set.
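A minimal sketch of both variants of this workaround follows. The endpoint URL (Ollama's default local address with `/api/embed`), the model tag, and the helper names are assumptions for illustration, not taken from the comment itself.

```python
import json
import urllib.request

# Assumptions: default local Ollama address and the tag of the reported model.
OLLAMA_URL = "http://localhost:11434/api/embed"
MODEL = "snowflake-arctic-embed:335m"
MODEL_CTX = 512  # actual context length of the model, per the report


def build_embed_payload(text, num_ctx=MODEL_CTX):
    """Request body that overrides num_ctx for this call only."""
    return {
        "model": MODEL,
        "input": text,
        "truncate": True,
        "options": {"num_ctx": num_ctx},
    }


def embed(text):
    """POST the payload to Ollama and return the embeddings (needs a running server)."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(build_embed_payload(text)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["embeddings"]


def build_modelfile(base=MODEL, num_ctx=MODEL_CTX):
    """Modelfile text for a copy of the model with num_ctx baked in."""
    return f"FROM {base}\nPARAMETER num_ctx {num_ctx}\n"
```

For the second variant, the output of `build_modelfile()` can be saved as `Modelfile` and turned into a model with `ollama create <new-name> -f Modelfile`, where `<new-name>` is whatever name you choose for the copy.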
https://ollama.com/library/snowflake-arctic-embed
OS
Linux
GPU
Nvidia
CPU
Intel
Ollama version
0.5.4