# rlhf-v/rlhf-v

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/rlhf-v-rlhf-v).**

309 stars · 9 forks · Python

## Links

- GitHub: https://github.com/RLHF-V/RLHF-V
- Homepage: https://rlhf-v.github.io
- awesome-repositories: https://awesome-repositories.com/repository/rlhf-v-rlhf-v.md

## Topics

`chatbot` `gpt-4` `llama` `multi-modality` `multimodal` `rlhf-v` `visual-language-learning`

## Description

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

## Tags

### Part of an Awesome List

- [Alignment and RLHF](https://awesome-repositories.com/f/awesome-lists/ai/alignment-and-rlhf.md) — Behavior alignment using fine-grained correctional human feedback.
- [Hallucination Mitigation](https://awesome-repositories.com/f/awesome-lists/ai/hallucination-mitigation.md) — Behavior alignment from fine-grained correctional human feedback to reduce hallucination in multimodal LLMs.