Part 13 (5 points, coding task)
Do the following tasks:
-
Define your model by calling
model_CLIP = MyCLIP()
. -
Fix all parameter values in the ViT and Bert blocks in your model. That is, you are only allowed to train
-
Out-projection matrices in the image and text encoders.
-
Temperature.
-