Interpret {{model}} scores on {{benchmark_suite}}.

Lens A - Absolute performance:
- Compare to fixed thresholds
- Identify capability levels
- Flag concerning gaps

Lens B - Relative performance:
- Compare to {{competitor_models}}
- Identify competitive position
- Flag relative weaknesses

Lens C - Trend analysis:
- Compare to {{previous_versions}}
- Identify improvement trajectory
- Flag regressions

Synthesize all lenses into actionable insights for {{stakeholders}}.
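The "flag" step of each lens can also be computed mechanically before the scores are handed to a model for interpretation. The following is a minimal sketch, not part of the original prompt: the function name `three_lens_flags`, its parameters, and the benchmark and model names in the usage lines are all hypothetical, and it assumes higher scores are better.

```python
from typing import Dict, List


def three_lens_flags(
    scores: Dict[str, float],
    thresholds: Dict[str, float],
    competitor_scores: Dict[str, Dict[str, float]],
    previous_scores: Dict[str, float],
) -> Dict[str, List[str]]:
    """Return the benchmarks flagged under each lens (hypothetical helper).

    Assumes higher scores are better on every benchmark.
    """
    flags: Dict[str, List[str]] = {"absolute": [], "relative": [], "trend": []}
    for bench, score in scores.items():
        # Lens A: score falls below the fixed threshold for this benchmark.
        if bench in thresholds and score < thresholds[bench]:
            flags["absolute"].append(bench)

        # Lens B: score falls below the best competitor on this benchmark.
        rivals = [s[bench] for s in competitor_scores.values() if bench in s]
        if rivals and score < max(rivals):
            flags["relative"].append(bench)

        # Lens C: score is lower than the previous version's score (regression).
        if bench in previous_scores and score < previous_scores[bench]:
            flags["trend"].append(bench)
    return flags


# Illustrative, made-up numbers; not real benchmark results.
example = three_lens_flags(
    scores={"bench_a": 0.80, "bench_b": 0.50},
    thresholds={"bench_a": 0.70, "bench_b": 0.60},
    competitor_scores={"rival_x": {"bench_a": 0.90, "bench_b": 0.40}},
    previous_scores={"bench_a": 0.85, "bench_b": 0.45},
)
print(example)
# {'absolute': ['bench_b'], 'relative': ['bench_a'], 'trend': ['bench_a']}
```

Keeping the three lenses as separate lists mirrors the prompt's structure, so the synthesis step can see which weaknesses show up under more than one lens.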
Details

Category: Analysis
Use Cases: Benchmark analysis, Multi-lens interpretation, Insight synthesis
Works Best With: claude-opus-4.5, gpt-5.2, gemini-2.0-flash