Guide
Concepts and how to use Matlock Eval
Matlock Eval is an evaluation harness for a personal injury legal chatbot. It generates synthetic multi-turn conversations using parameterized personas, runs automated LLM-as-judge evaluations, collects human ratings, and displays results on a benchmark dashboard. The goal is to measure chatbot quality across every combination of persona, AI provider, chat mode, prompt version, and simulator model.