←BackTHUDM/CogVLM0Copy as MarkdownView on GitHub↗6,742 stars·454 forks·Python·Apache-2.0·0 viewsCogVLMFeaturesMultimodal Foundation Models - Visual expert model for language-based reasoning.Multimodal LLM Models - Vision-language model achieving state-of-the-art performance on cross-modal benchmarks.