LLM Evaluation Harness
Run evals as part of CI — measure LLM output quality on every change
5 packages · Health-checked and current
RolePackageHealthScoreAlternatives
Evaluation
Active
Hero Score 58
, ,
LLM Orchestration
Recently updated
Hero Score 77
,
Data Validation
Recently updated
Hero Score 88
, , ,
Monitoring
Recently updated
Hero Score 77
, , ,
Testing
Active
Hero Score 90
, , ,