Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add AWQ (Activation-aware Weight Quantization) for llama, llama2, mpt, and mistral models #4593
Add AWQ (Activation-aware Weight Quantization) for llama, llama2, mpt, and mistral models #4593
Changes from all commits
2ea3934
8a3cece
0adf4c7
e851199
eb9a790
576d28b
4cad8d7
f97c587
ef61a66
f8cf783
1b300cb
8fece75
8177ad4
d2e9d00
0610672
c02f6df
71c0a27
741b7fb
e04b8f0
48cd819
6fcdb07
b00e2d9
440cc2f
00f48ad
9b742c5
e8fae2d
a600c61
2187a8d
e9ad5fe
13f60c4
44f4ce2
d089842
278f3e9
9174699
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing