LLM Evaluation Harness

Run evals as part of CI — measure LLM output quality on every change

5 packages · Health-checked and current

RolePackageHealthScoreAlternatives
Evaluation
Active
Hero Score 58
, ,
LLM Orchestration
Recently updated
Hero Score 77
,
Data Validation
Recently updated
Hero Score 88
, , ,
Monitoring
Recently updated
Hero Score 77
, , ,
Testing
Active
Hero Score 90
, , ,
Packages and scores are updated monthly. Alternatives link to full comparison pages.