# yuliang-liu/monkey

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/yuliang-liu-monkey).**

1,948 stars · 140 forks · Python · MIT

## Links

- GitHub: https://github.com/Yuliang-Liu/Monkey
- awesome-repositories: https://awesome-repositories.com/repository/yuliang-liu-monkey.md

## Description

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

## Tags

### Part of an Awesome List

- [Foundation Models](https://awesome-repositories.com/f/awesome-lists/ai/foundation-models.md) — Multimodal model for high-resolution image understanding.
- [Specialized Multimodal Tasks](https://awesome-repositories.com/f/awesome-lists/ai/specialized-multimodal-tasks.md) — OCR-free model for document and text-rich understanding.
