AgentBench | Awesome Repository

THUDM/AgentBench

3,502 stars · 263 forks · Python · Apache-2.0

AgentBench

Features

Evaluation And Benchmarks - Comprehensive benchmark for evaluating LLM agents across diverse environments.
General Agent Benchmarks - Comprehensive evaluation of LLMs as agents.

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)