gautierdagcast

0

Cast

Vision Language Models (VLMs) are typically evaluated with Visual Question Answering (VQA) tasks which assess a model's understanding of scenes. Good VQA performance is taken as evidence that the model will perform well on a broader range of tasks that require both visual and language inputs.…

Features

Evaluation Benchmarks - Tests cross-modal alignment similarity for vision models.

Cast

Features

Star history