Create an evaluation rubric for assessing {{model}} on {{task_type}}.

Design the rubric with:
1. 5-point scale definitions for each {{quality_dimension}}
2. Concrete examples at each score level
3. Common pitfalls and how to score them
4. Edge case guidance

Output the rubric in a format suitable for {{evaluator_type}}, with inter-rater reliability notes.
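
The template above relies on four `{{placeholder}}` variables. As a minimal sketch of how they might be filled before the prompt is sent to a model, the snippet below substitutes values with a regular expression. The `fill_template` helper, the abbreviated `TEMPLATE` string, and the example values are all hypothetical and chosen only for illustration; the page does not specify how substitution is performed.

```python
import re

# Abbreviated stand-in for the prompt above (assumption: the full text
# would be pasted here in practice).
TEMPLATE = (
    "Create an evaluation rubric for assessing {{model}} on {{task_type}}. "
    "Include 5-point scale definitions for each {{quality_dimension}}. "
    "Output the rubric in a format suitable for {{evaluator_type}}."
)


def fill_template(template: str, values: dict[str, str]) -> str:
    """Replace every {{name}} placeholder; raise KeyError if a value is missing."""

    def substitute(match: re.Match) -> str:
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value supplied for placeholder '{name}'")
        return values[name]

    return re.sub(r"\{\{(\w+)\}\}", substitute, template)


# Hypothetical example values, not taken from the source.
prompt = fill_template(
    TEMPLATE,
    {
        "model": "claude-opus-4.5",
        "task_type": "code review",
        "quality_dimension": "accuracy",
        "evaluator_type": "human annotators",
    },
)
print(prompt)
```

Raising on a missing value, rather than leaving the placeholder in place, is one possible design choice: it keeps a half-filled prompt from reaching the model unnoticed.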
Details
Category: Analysis
Use Cases: Rubric design, Scoring guidance, Evaluator training
Works Best With: claude-opus-4.5, gpt-5.2, gemini-2.0-flash