-
Notifications
You must be signed in to change notification settings - Fork 191
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added blobs support for Static LLM pipeline #811
Added blobs support for Static LLM pipeline #811
Conversation
c885b82
to
be7066f
Compare
be7066f
to
bf253f7
Compare
cf30b0e
to
980eebf
Compare
@ilya-lavrenov @Wovchena Could you have a look, please? |
// (1) Check that neither MAX_PROMPT_LEN nor MIN_RESPONSE_LEN is | ||
// exposed in the config | ||
if (pipeline_config.count("MAX_PROMPT_LEN") || | ||
pipeline_config.count("MIN_RESPONSE_LEN")) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
so many hardcoded strings.. how users would know that such strings should be passed?
will it be mentioned in NPU specific docs?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I believe these parameters can be reflected in "Additional configuration options": https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide/genai-guide-npu.html
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You refused to do that 2 weeks ago: #795 (review). In personal messages you stated that you don't want to advertise them. What changed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Didn't refuse to be honest, these parameters will be documented for 2024.4
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok
980eebf
to
bae4b46
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great! Thank you @AsyaPronina, LGTM 👍
Added support of importing blobs for `StaticLLMPipeline`
Added support of importing blobs for
StaticLLMPipeline