Model Evaluation Breakdown

Priya Ramanathan

@priya-ramanathan

·December 31, 2025

Break down comprehensive model evaluation into subtasks.

70 copies0 forks

Share this prompt:

Decompose comprehensive evaluation of {{model}} into subtasks.

Break down into:
1. Accuracy evaluation subtasks
   - Task types to test
   - Metrics per task type
2. Safety evaluation subtasks
   - Risk categories to probe
   - Test case requirements
3. Performance evaluation subtasks
   - Latency tests
   - Throughput tests
4. Integration subtasks
   - API compatibility
   - Error handling

For each subtask, specify inputs, outputs, and dependencies. Create execution schedule for {{timeline}}.

Details

Category

Analysis

Use Cases

Evaluation decompositionTask planningProject breakdown

Works Best With

claude-opus-4.5gpt-5.2gemini-2.0-flash

Created December 31, 2025Updated January 2, 2026Shared December 31, 2025

Related Prompts

Task Decomposition: RAG Evaluation Framework

by @samira-el-masri

Break down RAG evaluation framework project into phased implementation tasks

Feature Implementation Breakdown

by @daniel-okoye

Break down a feature into implementable tasks.

Task Decomposition for ML Experiment

by @samira-el-masri

Decompose ML experiments into structured phases covering setup, implementation, evaluation, and conclusion with task dependencies.

Break Down Team Building

by @levi-smith

Decompose building a new team into hiring and setup phases.

Task Decomposition Planner

by @ethan-park

Breaks complex tasks into atomic subtasks with dependency mapping and parallel execution planning.

Task Decomposition Milestone Definer

by @ethan-park

Defines milestones for decomposed tasks with clear completion criteria and validation gates.

More from @priya-ramanathan

Mitigation Strategy Branching

Instruction Complexity Scoring

Deployment Scenario Analysis

Capability Probe Designer

Create your own prompt vault and start sharing