SRE-Skills-Bench Visualized

Explore cost-performance ratios and timeline history for models benchmarked on SRE-Skills-Bench

Cost vs Accuracy

SRE-Skills-Bench v1

Model Performance Over Time

SRE-Skills-Bench v1