Design a custom benchmark for evaluating {{model}} on {{capability_area}}.

Step 1: Define what success looks like for {{use_case}}.
Step 2: Identify measurable dimensions of performance.
Step 3: Create diverse test cases covering edge cases.
Step 4: Establish scoring rubrics with clear criteria.
Step 5: Validate the benchmark against {{reference_models}}.
Step 6: Document administration and scoring procedures.

Explain the design rationale at each step.
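The steps above can be sketched as a small harness. This is a minimal illustration, not part of the prompt: all names (`TestCase`, `Rubric`, `run_benchmark`) are hypothetical, and the rubric here is a weighted average of per-criterion checks (Step 4) run over tagged test cases (Step 3).

```python
# Illustrative sketch of the benchmark structure the steps describe.
# All names here are hypothetical, not a real evaluation API.
from dataclasses import dataclass


@dataclass
class TestCase:
    prompt: str
    tags: list       # e.g. ["edge_case", "long_input"]; used to track coverage (Step 3)
    reference: str   # expected answer or key points, consumed by rubric checks


@dataclass
class Rubric:
    # Step 4: each criterion maps to a weight and a check function
    # returning a score in [0, 1] for one (response, case) pair.
    criteria: dict   # name -> (weight, check_fn)

    def score(self, response: str, case: TestCase) -> float:
        total_weight = sum(w for w, _ in self.criteria.values())
        weighted = sum(w * fn(response, case) for w, fn in self.criteria.values())
        return weighted / total_weight


def run_benchmark(model_fn, cases, rubric) -> float:
    """Administer the benchmark (Step 6): score every case, return the mean."""
    scores = [rubric.score(model_fn(c.prompt), c) for c in cases]
    return sum(scores) / len(scores)
```

A toy run with a single exact-match criterion: `run_benchmark(lambda p: "4", [TestCase("2+2?", ["edge_case"], "4")], Rubric({"correct": (1.0, lambda r, c: 1.0 if c.reference in r else 0.0)}))` yields the mean rubric score across cases. Step 5 (validation) would then compare these means across {{reference_models}} to confirm the benchmark separates known-stronger from known-weaker systems.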
Details
Category: Analysis
Use Cases: Benchmark creation, Evaluation design, Test development
Works Best With: claude-opus-4.5, gpt-5.2, gemini-2.0-flash