01
Upload
Bring your CSV or JSONL with id, prompt, expected, and actual.
Evaluate LLM structured outputs, pinpoint failure reasons in seconds, and run the same workflow in hosted or fully private self-hosted mode.
Built for prompt engineers, eval teams, and AI product developers.
01
Bring your CSV or JSONL with id, prompt, expected, and actual.
02
Score pass rate and classify schema, type, and value failures.
03
Filter row-level failures and diagnose regressions quickly.
HOSTED
Open the hosted app and start evaluating in seconds with no infrastructure setup.
SELF-HOSTED
Run EvalLens in your own environment for private datasets and controlled provider keys.
HOSTED
SELF-HOSTED
Your data stays in your environment.
Upload a CSV or JSONL file with id, prompt, expected, and actual columns.
Drop your file here, or browse
CSV, JSON, or JSONL