1 repo

Awesome GitHub Repositories: Conversational Evaluation Suites

Frameworks designed to simulate and score multi-turn dialogues between users and AI agents.

Distinguishing note: Focuses on stateful, multi-turn interaction quality rather than single-shot prompt-response evaluation.
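To illustrate the difference from single-shot evaluation, here is a minimal sketch of a multi-turn evaluation loop in plain Python. It is not the API of any listed repository: `Turn`, `simulate`, `echo_agent`, `non_empty`, and `no_repeat` are hypothetical names invented for this example. It assumes the agent is a function that receives the conversation history plus the new user message, and that each scorer sees the full history after every turn.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Turn:
    user: str
    agent: str


# Hypothetical types: an agent maps (history, user message) -> reply,
# a scorer maps the history so far -> a float score.
Agent = Callable[[List[Turn], str], str]
Scorer = Callable[[List[Turn]], float]


def echo_agent(history: List[Turn], user_msg: str) -> str:
    """Stand-in for the agent under test; it uses the history length as state."""
    return f"(turn {len(history) + 1}) You said: {user_msg}"


def non_empty(history: List[Turn]) -> float:
    """Score 1.0 if the latest agent reply contains any non-whitespace text."""
    return 1.0 if history[-1].agent.strip() else 0.0


def no_repeat(history: List[Turn]) -> float:
    """Score 1.0 if the agent has not repeated any reply verbatim so far."""
    replies = [t.agent for t in history]
    return 1.0 if len(set(replies)) == len(replies) else 0.0


def simulate(
    agent: Agent,
    user_messages: List[str],
    scorers: Dict[str, Scorer],
) -> Tuple[List[Turn], List[Dict[str, float]]]:
    """Replay scripted user messages and score the dialogue after every turn."""
    history: List[Turn] = []
    per_turn_scores: List[Dict[str, float]] = []
    for msg in user_messages:
        reply = agent(history, msg)
        history.append(Turn(user=msg, agent=reply))
        # Scorers see the whole history, so they can judge stateful behavior.
        per_turn_scores.append({name: fn(history) for name, fn in scorers.items()})
    return history, per_turn_scores


scorers = {"non_empty": non_empty, "no_repeat": no_repeat}
history, scores = simulate(echo_agent, ["hi", "book a flight", "hi"], scorers)
```

A scorer such as `no_repeat` can only be computed over the whole conversation, which is the kind of check a single prompt-response evaluation cannot express.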

Explore 1 GitHub repository in Artificial Intelligence & ML · Conversational Evaluation Suites.


mlflow/mlflow
24,319 · View on GitHub
Assess conversational agents by simulating multi-turn dialogues and applying scorers to evaluate interaction quality and safety at every step.
Python · agentops · agents · ai