awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Multimodal Agent Capabilities · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesMultimodal Agent Capabilities

Integrations that enable agents to process and generate non-textual data like images, audio, and video.

Distinguishing note: Focuses on the capability to toggle and manage multimodal features (vision, speech) within an agentic framework.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Multimodal Agent Capabilities. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Multimodal Agent Capabilities

Awesome Multimodal Agent Capabilities GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • zhayujie/chatgpt-on-wechat

    zhayujie/chatgpt-on-wechat

    41,334View on GitHub↗

    This project is an autonomous agent framework designed to integrate large language models with popular messaging platforms. It functions as a middleware platform that enables automated, multimodal interactions by decomposing complex user goals into sequential plans, executing them through external tools, and maintaining persistent context across sessions. The framework distinguishes itself through a modular skill architecture and a hybrid memory system. Users can extend system capabilities by installing custom logic modules from community hubs or generating them through natural language. The

    Agent framework enables features like vision, image generation, and speech processing by selecting specific vendors and models through a centralized interface.

    Pythonaiai-agentchatgpt
    41,334View on GitHub↗