# gautierdag/cast

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/gautierdag-cast).**

2 stars · 1 forks · Python · MIT

## Links

- GitHub: https://github.com/gautierdag/cast
- awesome-repositories: https://awesome-repositories.com/repository/gautierdag-cast.md

## Description

Vision Language Models (VLMs) are typically evaluated with Visual Question Answering (VQA) tasks which assess a model's understanding of scenes. Good VQA performance is taken as evidence that the model will perform well on a broader range of tasks that require both visual and language inputs.…

## Tags

### Part of an Awesome List

- [Evaluation Benchmarks](https://awesome-repositories.com/f/awesome-lists/ai/evaluation-benchmarks.md) — Tests cross-modal alignment similarity for vision models.
