1 repo
Frameworks designed to simulate and score multi-turn dialogues between users and AI agents.
Distinguishing note: Focuses on stateful, multi-turn interaction quality rather than single-shot prompt-response evaluation.
Explore 1 GitHub repository in Artificial Intelligence & ML · Conversational Evaluation Suites.
Assess conversational agents by simulating multi-turn dialogues and applying scorers to evaluate interaction quality and safety at every step.
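As a rough illustration of the idea described above, the following minimal Python sketch runs a scripted user against an agent and applies every scorer after each turn. It does not reflect the API of any listed repository: `Turn`, `simulate`, `echo_agent`, `non_empty`, `safe`, and `BLOCKLIST` are all hypothetical names chosen for this example, and the keyword-based safety check is a deliberately simplistic stand-in for a real safety scorer.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Turn:
    """One user message, the agent's reply, and the scores assigned to that step."""
    user: str
    agent: str
    scores: Dict[str, float] = field(default_factory=dict)


# Hypothetical scorer type: receives the conversation so far (newest turn last)
# and returns a score in [0, 1].
Scorer = Callable[[List[Turn]], float]
# Hypothetical agent type: receives prior turns plus the new user message.
Agent = Callable[[List[Turn], str], str]


def simulate(user_script: List[str], agent: Agent,
             scorers: Dict[str, Scorer]) -> List[Turn]:
    """Play the scripted user messages against the agent, scoring every turn."""
    history: List[Turn] = []
    for user_msg in user_script:
        reply = agent(history, user_msg)  # the agent sees the full prior state
        turn = Turn(user=user_msg, agent=reply)
        history.append(turn)
        # Scorers run at every step, not only at the end of the dialogue.
        turn.scores = {name: scorer(history) for name, scorer in scorers.items()}
    return history


def echo_agent(history: List[Turn], user_msg: str) -> str:
    """Toy stateful agent: numbers its replies using the conversation length."""
    return f"(turn {len(history) + 1}) You said: {user_msg}"


def non_empty(history: List[Turn]) -> float:
    """Quality stand-in: 1.0 if the latest reply contains any text."""
    return 1.0 if history[-1].agent.strip() else 0.0


BLOCKLIST = {"password"}  # assumption: a toy list of words treated as unsafe


def safe(history: List[Turn]) -> float:
    """Safety stand-in: 0.0 if the latest reply contains a blocklisted word."""
    reply = history[-1].agent.lower()
    return 0.0 if any(word in reply for word in BLOCKLIST) else 1.0


transcript = simulate(
    ["Hi", "What is my password?"],
    echo_agent,
    {"non_empty": non_empty, "safe": safe},
)
for index, turn in enumerate(transcript, start=1):
    print(index, turn.scores)
```

Because each scorer receives the whole history rather than a single reply, a check can depend on earlier turns (for example, consistency with a previous answer), which is the difference from single-shot prompt-response evaluation.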