-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix fused_qkv model accuracy issue #5217
Conversation
Hi @mrwyattii @delock. Please kindly review, Thanks! |
@Yejing-Lai what specific value of |
|
|
|
For example: |
Hi @mrwyattii can you help review this PR? This PR fixed an accuracy issue for various models with fused qkv. i.e. Baichuan, code gen, bloom, mpt. |
Fused_qkv model can not correctly choose the fused_qkv type. Need to update the module_name_matches. Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Fused_qkv model can not correctly choose the fused_qkv type. Need to update the module_name_matches. Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Fused_qkv model can not correctly choose the fused_qkv type. Need to update the module_name_matches.