Upstream merge oct31 #41

Merged
merged 75 commits into from
Oct 30, 2023
898db76
[API] Add GenerationConfig (#1024)
davidpissarra Oct 8, 2023
ad3a6b9
Fix two bugs in kv-cache backtrack loop (#856)
shenberg Oct 8, 2023
6e40c21
[Build] Added --pdb flag to build.py, drop into pdb on error (#1017)
Lunderberg Oct 8, 2023
bae37b3
[Android] Use `AlertDialog` instead of `Toast` (#1039)
cyx-6 Oct 8, 2023
b44f679
Add doc for ChatConfig, ConvConfig, GenerationConfig, BuildArgs (#1040)
CharlieFRuan Oct 9, 2023
3a9849a
[Android] Add Llama2 q4f16_0 (#1041)
spectrometerHBH Oct 9, 2023
bed9e60
[Docs] Model prebuilts tracking page revamp (#1000)
CharlieFRuan Oct 9, 2023
c02fdaf
Update compile_models.rst (#1038)
yongjer Oct 9, 2023
85001ed
Support for the Stable LM 3B model (#1008)
jeethu Oct 9, 2023
a032d40
[Docs] Iterate model prebuilts docs (#1043)
CharlieFRuan Oct 9, 2023
a58605f
Update README.md
junrushao Oct 9, 2023
bdd9d9b
[CPP] Separate common utils out from llm_chat.cc (#1044)
MasterJH5574 Oct 9, 2023
20131fb
Update README.md (#1045)
junrushao Oct 9, 2023
1e6fb11
add verbose stats to mlc-chat REST API (#1049)
denise-k Oct 11, 2023
b9179cf
[Transform] Apply split_rotary optimization on prefill (#1033)
Lunderberg Oct 12, 2023
98ebd28
[Docs] Add `mlc.ai/package` to `DEPENDENCY INSTALLATION` group (#1055)
LeshengJin Oct 12, 2023
bfaa5b9
Revert "[Transform] Apply split_rotary optimization on prefill (#1033…
MasterJH5574 Oct 12, 2023
ca8c11b
[BugFix] Set the right `max_sequence_length` for both Llama-1 and Lla…
sunggg Oct 13, 2023
edab9b5
[Doc] Use -U instead of --force-reinstall (#1062)
junrushao Oct 13, 2023
d854105
[Model] Initial batching support for Llama (#1048)
MasterJH5574 Oct 14, 2023
c2b8cbc
Fix Stable LM 3B build (#1061)
jeethu Oct 14, 2023
481cd92
[Core] Remove duplication in MODEL.get_model calls (#1054)
Lunderberg Oct 14, 2023
8184431
[ParamManager] Cleanup creation of quantization IRModule (#1053)
Lunderberg Oct 14, 2023
9010d48
Minor typo fix (#1064)
jeethu Oct 15, 2023
b0bfc88
Add links to Python API Reference (#1068)
junrushao Oct 15, 2023
204860b
[Fix] ChatModule incorrect temperature buffer shape (#1070)
MasterJH5574 Oct 15, 2023
d202077
[ParamManager] Added progress bar for get_item/set_item (#1063)
Lunderberg Oct 16, 2023
9872c48
[Python] Extract common device str parse function in ChatModule (#1074)
MasterJH5574 Oct 16, 2023
3aefd9f
[Bugfix] Compilation Error in q4f32_1 (#1078)
junrushao Oct 17, 2023
2625945
Establish `mlc_chat.compiler` (#1082)
junrushao Oct 19, 2023
56a8004
Update README.md for Multi-GPU (#1090)
junrushao Oct 19, 2023
b0373d1
Support lib_path override in C++. Improvements on docs and error mess…
rickzx Oct 19, 2023
830656f
StreamIterator (#1057)
varshith15 Oct 19, 2023
9bf5723
Update `benchmark.py` according to #1086 (#1091)
junrushao Oct 19, 2023
62d0c03
Disable Disco for q4f16_ft and q8f16_ft quantization (#1094)
LeshengJin Oct 20, 2023
cf39bf6
[Format] Apply isort and black for `python/` (#1097)
junrushao Oct 20, 2023
e9b85ce
More formatting (#1099)
junrushao Oct 21, 2023
03c641a
Enable Python Linter (#1098)
junrushao Oct 21, 2023
46d11e6
Add Basic Pylint and Mypy Tooling (#1100)
junrushao Oct 21, 2023
6159cc4
[CI] Add clang-format (#1103)
junrushao Oct 22, 2023
16dd2ae
[Slim-LM] Smart path finding for config and weight (#1088)
LeshengJin Oct 23, 2023
f57c9c9
[Transform] Provide IRModule transform for rewrite_attention (#1052)
Lunderberg Oct 23, 2023
e5927ce
[ParamManager] Use BundleModelParams for transform_dequantize (#1056)
Lunderberg Oct 23, 2023
7ae8c6d
[Slim-LM] Introduce HFLoad for loading Pytorch and SafeTensor weights…
LeshengJin Oct 23, 2023
5a7dcd8
[WINDOWS] reduce noise in windows build (#1115)
tqchen Oct 24, 2023
61179a0
Add CLI commands for compilation (#1109)
junrushao Oct 24, 2023
8ce7793
Auto updated submodule references
Oct 24, 2023
488017d
fix mismatched argument name (#1117)
Sing-Li Oct 24, 2023
206103b
[Docs] Add doc for max and mean gen len, shift factor; and buildArgs …
CharlieFRuan Oct 24, 2023
2aa6809
Revert "[ParamManager] Use BundleModelParams for transform_dequantize…
junrushao Oct 24, 2023
9cb8e8e
Remove inaccurate warning message (#1121)
junrushao Oct 24, 2023
9166edb
[REST] OpenAI compatible Rest API (#1107)
Kartik14 Oct 24, 2023
a4279e3
Add --opt flag parsing to CLI (#1123)
junrushao Oct 25, 2023
973f9fc
[ParamManager][Redo] Use BundleModelParams for transform_dequantize (…
Lunderberg Oct 25, 2023
24f795e
added details to windows installation (#1133)
goutham2688 Oct 27, 2023
2c492e5
Grammatical and Typographical improvements (#1139)
tmsagarofficial Oct 28, 2023
2ec0cc8
Minor enhancements to `ChatModule` (#1132)
YuchenJin Oct 28, 2023
27ac5ac
Updating tvm install docs (#1143)
David-Sharma Oct 29, 2023
2b6d832
Make the help info consistent with program name (#1137)
fennecJ Oct 29, 2023
878ae84
Support parameter packing (#1146)
junrushao Oct 29, 2023
c0c3a8d
[Slim-LM] Enable Group Quant (#1129)
zxybazh Oct 29, 2023
2193767
Enable Mypy and Pylint in mlc_chat Python Package (#1149)
junrushao Oct 29, 2023
0a25374
Migrate Compiler Passes (#1150)
junrushao Oct 30, 2023
1a79a53
Compile Model Preset without External `config.json` (#1151)
junrushao Oct 30, 2023
ba67835
Update attention layer (#1153)
junrushao Oct 30, 2023
fee2cb5
Add batched Llama model definition using vLLM paged attention (#1134)
masahi Oct 30, 2023
ece97b1
[Transform][Redo] Apply split_rotary optimization on prefill (#1125)
Lunderberg Oct 30, 2023
b190578
Apply rewrite for normal attention and MQA (#1138)
Lunderberg Oct 30, 2023
8ca0176
[Rest] Fix emoji handling in Rest API. (#1142)
YuchenJin Oct 30, 2023
3cf5605
[Utility] Check for isinstance(exc, Exception) before entering pdb (#…
Lunderberg Oct 30, 2023
0a9d6c7
[Utils] Remove conversion to numpy array in utils.save_params (#1083)
Lunderberg Oct 30, 2023
425a2cb
[Fix][REST] Use lowered-cased "app" (#1159)
junrushao Oct 30, 2023
ed10571
sync llama def
masahi Oct 30, 2023
bab2e50
Merge remote-tracking branch 'mlc-ai/main' into upstream-merge-oct31
masahi Oct 30, 2023
2252c66
fi
masahi Oct 30, 2023
Updating tvm install docs (mlc-ai#1143)
Updating the tvm install docs to assist a user in finding and copying zstd.dll to the correct folder.
David-Sharma authored Oct 29, 2023
commit 27ac5ac90bd7d7705b4258682273f19929a48cf1
2 changes: 1 addition & 1 deletion docs/install/tvm.rst
@@ -132,7 +132,7 @@ A nightly prebuilt Python package of Apache TVM Unity is provided.

FileNotFoundError: Could not find module 'path\to\site-packages\tvm\tvm.dll' (or one of its dependencies). Try using the full path with constructor syntax.

-It is likely `zstd`, a dependency to LLVM, was missing. Please `download <https://github.com/facebook/zstd/releases/tag/v1.5.5>`__ the precompiled binary, rename it to `zstd.dll` and copy to the same folder as `tvm.dll`.
+It is likely `zstd`, a dependency to LLVM, was missing. Please `download <https://github.com/facebook/zstd/releases/tag/v1.5.5>`__ the precompiled binary, rename it to `zstd.dll` and copy to the same folder as `tvm.dll`. Hint - To locate the "tvm.dll" file in Conda, navigate to your user home directory (e.g., "/users/xxxx"). Search for "tvm.dll" and find the folder whose path contains the name of the current environment, such as "mlc-chat-venv." Once located, copy "zstd.dll" to that specific folder.
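
The manual search described in the hint could also be scripted. The following is only a sketch using the Python standard library: the helper names `find_tvm_dll_dirs` and `copy_zstd_next_to_tvm` are hypothetical and are not part of TVM or MLC LLM. It assumes the precompiled zstd binary has already been downloaded and that the environment name (for example `mlc-chat-venv`) appears somewhere in the path of the folder holding `tvm.dll`.

```python
import os
import shutil
from pathlib import Path


def find_tvm_dll_dirs(search_root, env_name):
    """Return folders under search_root that contain tvm.dll and whose
    path mentions env_name (hypothetical helper, not a TVM API)."""
    matches = []
    # os.walk skips directories it cannot read, which suits a home-directory scan.
    for dirpath, _dirnames, filenames in os.walk(search_root):
        has_dll = any(name.lower() == "tvm.dll" for name in filenames)
        if has_dll and env_name.lower() in dirpath.lower():
            matches.append(Path(dirpath))
    return sorted(matches)


def copy_zstd_next_to_tvm(zstd_dll, search_root, env_name):
    """Copy the downloaded binary as zstd.dll into every matching folder
    and return the folders that were updated."""
    targets = find_tvm_dll_dirs(search_root, env_name)
    for folder in targets:
        # Naming the destination file explicitly also performs the rename step.
        shutil.copy2(zstd_dll, folder / "zstd.dll")
    return targets
```

A call such as `copy_zstd_next_to_tvm(r"C:\path\to\downloaded.dll", Path.home(), "mlc-chat-venv")` would then return the folders it touched; the source path here is a placeholder. An empty result suggests that the environment name or the search root needs adjusting.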


.. _tvm-unity-build-from-source: