Update CLIP to a functional model #2393
Conversation
Is our test suite running with TF 2.16? I keep getting this error constantly when testing with TensorFlow 2.16.1:
This needs to be a conditional import; let me send a fix.
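A conditional import of the kind mentioned here typically wraps the optional dependency in a try/except and defers the error until the feature is actually used. A minimal sketch, assuming the dependency in question is `keras_nlp` (the helper name below is illustrative, not the actual fix in this PR):

```python
# Sketch of a conditional import: treat keras_nlp as an optional
# dependency and fail loudly only when a feature that needs it runs.
try:
    import keras_nlp  # optional; may not be installed
except ImportError:
    keras_nlp = None


def require_keras_nlp():
    # Hypothetical helper: raise a clear, actionable error on first use
    # instead of crashing at import time.
    if keras_nlp is None:
        raise ImportError(
            "This feature requires the `keras-nlp` package. "
            "Install it with `pip install keras-nlp`."
        )
    return keras_nlp
```

This keeps the package importable on installs without the optional dependency while still giving a clear error message the moment the dependent feature is touched.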
Force-pushed from 8fdabd5 to c6e84b9
Commits:
- update model input format
- update golden values
- update CLIP to functional model
- update tests
- code reformat
- use dict instead of list
- Update keras_cv/models/feature_extractor/clip/clip_model.py (Co-authored-by: Tirth Patel <tirthasheshpatel@gmail.com>)
- remove build and compute output shape
- update model input format
- update golden values
- Refactor CLIP. Refactor includes:
  - CLIPProcessor is now a Keras layer and uses some utilities from KerasNLP to support all types of Python types and array inputs
  - CLIPImageEncoder, CLIPTextEncoder, and CLIPEncoder now implement a `.compute_output_shape` method (required for CLIP to work with the functional API)
  - CLIPHead added to remove raw variables from the CLIP task models; having variables in a `keras.Model` class is tricky since the functional API doesn't allow state
  - The CLIP checkpointing script has been updated to work with the new API; new weights will be uploaded to Kaggle
  - TODO: attribute KerasNLP wherever relevant
  - TODO: upload new weights to Kaggle
  - TODO: refactor the CLIPProcessor class and the CLIP class to also pull tokenizer vocab and merges from Kaggle
- remove build and compute output shape
- Some fixes for the refactor
- Fix the tests, update presets
- update to layers instead of models
Force-pushed from c6e84b9 to 3c54540
In the latest commit, I have removed some numerics tests since we don't match the HF model. I also updated the presets and squashed everything down to one commit; sorry, I lost authorship when doing this. I will approve but leave merging up to @divyashreepathihalli since I have a lot of contributions here. Please let me know if you have any changes in mind; otherwise, feel free to merge. Thanks!
I verified the refactor has the same numerics as the model we had before. Note, though, that those numerics are broken too; we are very far from the equivalent HF model.
```python
# These values are NOT computed using HF as the reference model.
# Currently, the numerics of the CLIP model don't match the
# HF model exactly (for the same inputs). For the time being,
# these tests just confirm that unrelated changes don't affect
# the numerics. Once the fix for the numerics is in, we can remove
# this comment and the xfail below.
self.assertAllClose(
    outputs["image_logits"], [[10.246354, 10.246353, 10.246354]]
)
self.assertAllClose(
    outputs["text_logits"],
    ops.transpose([[10.246354, 10.246353, 10.246354]]),
)

# True reference values computed using HF:
# image_logits: [[17.8013, 17.8013, 17.8013]]
# text_logits: image_logits.T

# xfail after assertion
pytest.xfail("KerasCV CLIP doesn't match the HF model.")
```
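`pytest.xfail` used this way is the imperative form: the assertions above still execute (so a regression in the current numerics fails the test outright), and the call then raises an internal exception that reports the test as an expected failure, unlike the `@pytest.mark.xfail` decorator, which reclassifies the whole test. A small standalone sketch of the pattern:

```python
import pytest


def toy_numerics_test():
    # Real assertions run first; a regression here fails the test outright.
    assert round(10.246354, 3) == 10.246
    # Then mark the remainder as a known, expected failure.
    pytest.xfail("known mismatch with the reference model")
    # Anything after this line never executes.
```

Under a pytest runner the test shows up as `xfailed` rather than `failed`, which keeps the known-mismatch visible without turning the suite red.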
@divyashreepathihalli I have included the reference values here, computed using the same inputs to the HF model. Currently, I'm xfail-ing the test to indicate we need to fix this.
Unrelated Keras 2 failure. @divyashreepathihalli Do we need the initializer? I am inclined to remove it since we don't yet support training.
Yes! We can remove those.