This repository is the official implementation of RAIDEN Benchmark [COLING 2025]. It is a Benchmark for role-playing conversational agents.
RAIDEN Benchmark: Evaluating Role-playing Conversational Agents with Measurement-Driven Custom Dialogues
coming soon ……