Skip to content

Tags: mlc-ai/mlc-llm

Tags

v0.18.1

Toggle v0.18.1's commit message
Tag v0.18.1

v0.19.dev0

Toggle v0.19.dev0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[Model] Add support for OLMo architecture (#3046)

This PR add support for OLMo architecture.

Additional support: add support for clip-qkv.

Test: already tested on android(pixel 4) and cuda(setting tensor_parallel_shrads=2)

v0.17.2

Toggle v0.17.2's commit message
[Bench] Update API backend names (#2968)

This PR updates the backend names, introducing one name per
backend framework. These backends may refer to the same api endpoint.

v0.18.dev0

Toggle v0.18.dev0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Added hermes 3 support (#2886)

* added hermes 3 support

* modified format

* fixed lint

v0.17.1

Toggle v0.17.1's commit message
Tag v0.17.1

v0.1.dev0

Toggle v0.1.dev0's commit message
Initial commit

---------

Co-authored-by: Hongyi Jin <jinhongyi02@gmail.com>
Co-authored-by: Ruihang Lai <ruihangl@cs.cmu.edu>
Co-authored-by: Tianqi Chen <tqchen@cmu.edu>
Co-authored-by: Junru Shao <junrushao@apache.org>
Co-authored-by: Zihao Ye <zhye@cs.washington.edu>