Transformers 4.44 support #1996
@@ -28,7 +28,7 @@

 class TokenClassificationProcessing(TaskProcessor):
     ACCEPTED_PREPROCESSOR_CLASSES = (PreTrainedTokenizerBase,)
-    DEFAULT_DATASET_ARGS = "conll2003"
+    DEFAULT_DATASET_ARGS = {"path": "conll2003", "trust_remote_code": True}
We shouldn't set trust_remote_code to True by default here. Where is this needed?
This dataset has relied on "remote code" (a script that generates the dataset) for as long as it has existed, but the datasets package only recently introduced the trust_remote_code argument. This preprocessing is used in the TFLite tests, so we can either find a similar dataset that doesn't use a loading script or allow the remote code (which was already being executed):
https://huggingface.co/datasets/eriktks/conll2003/blob/main/conll2003.py
In that case, I think we need to change the dataset to one that doesn't require trust_remote_code to be True by default (that is something users should set explicitly).
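One way to keep the default free of trust_remote_code while still letting a caller opt in is sketched below. This is only an illustration of the idea discussed in this thread, not code from this PR: resolve_dataset_args is a hypothetical helper, and it assumes DEFAULT_DATASET_ARGS may be either a plain dataset name or a dict of load_dataset keyword arguments, as in the diff above.

```python
from typing import Any, Dict, Union


def resolve_dataset_args(
    default_args: Union[str, Dict[str, Any]],
    trust_remote_code: bool = False,
) -> Dict[str, Any]:
    """Hypothetical helper: build keyword arguments for datasets.load_dataset.

    Accepts either a dataset name or a dict of keyword arguments, and only
    enables trust_remote_code when the caller asks for it explicitly.
    """
    if isinstance(default_args, str):
        kwargs: Dict[str, Any] = {"path": default_args}
    else:
        # Copy so the class-level default is never mutated.
        kwargs = dict(default_args)

    # Drop any trust_remote_code baked into the defaults; the decision
    # belongs to the caller, not to the library default.
    kwargs.pop("trust_remote_code", None)
    if trust_remote_code:
        kwargs["trust_remote_code"] = True
    return kwargs


# Default behaviour: no remote code is trusted.
print(resolve_dataset_args("conll2003"))

# Explicit opt-in by the user.
print(resolve_dataset_args("conll2003", trust_remote_code=True))
```

The resulting dict could then be unpacked into the loader, for example load_dataset(**resolve_dataset_args("conll2003", trust_remote_code=True)), on datasets versions that accept the trust_remote_code argument.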
Looks good to me. Not very knowledgeable on the bloom bettertransformer part though.
LGTM, thanks @IlyasMoutawwakil
What does this PR do?
Adds support for transformers 4.44. The biggest issue is the change to the Bloom model's attention block and its KV cache handling. Also fixes the failing TFLite tests (all green).