Libraries for building AI applications, LLM integrations, and autonomous agents.
Abstract: Pretrained vision-language models (VLMs) like CLIP exhibit exceptional generalization across diverse downstream tasks. While recent studies reveal their vulnerability to adversarial attacks, ...