Describe the bug
```diff
diff --git a/swift/llm/export/quant.py b/swift/llm/export/quant.py
index 4e598c03..f3e17cde 100644
--- a/swift/llm/export/quant.py
+++ b/swift/llm/export/quant.py
@@ -210,6 +210,7 @@ class QuantEngine(ProcessorMixin):
             bits=args.quant_bits,
             dataset=','.join(args.dataset),
             batch_size=args.quant_batch_size,
+            group_size=args.group_size,
             block_name_to_quantize=self.get_block_name_to_quantize(self.model, args.model_type))
         gptq_quantizer.serialization_keys.append('block_name_to_quantize')
         logger.info('Start quantizing the model...')
```
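The effect of the patch is to thread the user-supplied `args.group_size` into the quantizer config instead of letting it fall back to the library default. A minimal sketch of that behavior, using a simplified stand-in class (the `GPTQConfig` here is hypothetical and only mirrors the keyword arguments visible in the diff; the real class lives in `transformers`):

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class GPTQConfig:
    # Simplified stand-in for the real quantizer config, showing only
    # the keyword arguments that appear in the diff above.
    bits: int
    dataset: str
    batch_size: int
    group_size: int = 128  # typical GPTQ default when not passed explicitly
    block_name_to_quantize: Optional[str] = None
    serialization_keys: List[str] = field(default_factory=list)


# Mirrors the patched call site: without the added `group_size=...` line,
# a user-requested group size would be silently ignored in favor of the
# default above.
gptq_quantizer = GPTQConfig(
    bits=4,
    dataset='sample-dataset',          # placeholder values for illustration
    batch_size=1,
    group_size=64,                     # now honoured because of the patch
    block_name_to_quantize='model.layers')
gptq_quantizer.serialization_keys.append('block_name_to_quantize')

print(gptq_quantizer.group_size)       # 64, not the implicit default of 128
```

This illustrates why the one-line change matters: the keyword is the only path from the CLI argument into the quantizer, so omitting it means every export quantizes with the default group size regardless of what the user requested.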
Your hardware and system info
Write your system info here, such as CUDA version, OS, GPU model, and torch version.
Additional context
Add any other context about the problem here.
Sorry, I didn't understand what you meant.