Model description
E.g., https://huggingface.co/CohereForAI/c4ai-command-r7b-12-2024
This should be a relatively simple addition. The key difference from earlier models seems to be that it interleaves layers using RoPE attention with layers using no positional encoding, which allows the model to attend to tokens at arbitrary distances. This may be beneficial for system prompts. I have no hands-on experience with this model and cannot provide more details than the links here; please contact the authors.
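To illustrate the idea, here is a minimal numpy sketch of interleaving attention layers with and without positional encoding. The 3-RoPE-then-1-NoPE pattern, the layer/head dimensions, and all function names are assumptions for illustration only, not the model's actual configuration:

```python
import numpy as np

def rope(x, positions, base=10000.0):
    """Apply rotary position embeddings to x of shape (seq, dim)."""
    seq, dim = x.shape
    half = dim // 2
    freqs = base ** (-np.arange(half) / half)      # per-pair rotation frequencies
    angles = positions[:, None] * freqs[None, :]   # (seq, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    # rotate each (x1, x2) pair by its position-dependent angle
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)

def attention_scores(q, k):
    """Scaled dot-product attention logits."""
    return q @ k.T / np.sqrt(q.shape[-1])

# Hypothetical interleaving pattern: three RoPE layers, then one layer
# with no positional encoding (NoPE).
LAYER_PATTERN = ["rope", "rope", "rope", "nope"]

def layer_scores(q, k, layer_idx, positions):
    """Compute attention logits for a given layer in the interleaved stack."""
    if LAYER_PATTERN[layer_idx % len(LAYER_PATTERN)] == "rope":
        q, k = rope(q, positions), rope(k, positions)
    return attention_scores(q, k)
```

The NoPE layers compute position-independent scores, so a token can attend equally to content at any distance, while RoPE layers keep a relative-position signal (shifting all positions by a constant leaves their scores unchanged).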
Open source status
Provide useful links for the implementation
Implementation in `transformers`: https://github.com/huggingface/transformers/blob/main/src/transformers/models/cohere2/modeling_cohere2.py

Pretrained model: https://huggingface.co/CohereForAI/c4ai-command-r7b-12-2024