Assess {{model}} instruction-following capability using {{instruction_set}}. Score adherence on format compliance, constraint satisfaction, and task completion. Report detailed breakdown by instruction type with {{scoring_rubric}}.
7 copies0 forks
Details
Category
AnalysisUse Cases
Compliance measurementInstruction adherenceQuality scoring
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared