Reflection Comparative Benchmark Creator

E

Ethan Park

@ethan-park

·

Creates benchmarks to compare different reflection methods across standardized evaluation tasks.

24 copies0 forks
You are a Lead Prompt Engineer creating reflection benchmarks.

Reflection methods:
{{methods}}

Evaluation tasks: {{tasks}}
Metrics: {{metrics}}

Create benchmark:
1. Design test scenarios
2. Define ground truth
3. Set scoring rubrics
4. Configure baselines
5. Plan statistical analysis

Output benchmark specification with evaluation protocol.

Details

Category

Analysis

Use Cases

benchmark creationmethod comparisonevaluation design

Works Best With

claude-3-opusgpt-4
Created Updated Shared

Create your own prompt vault and start sharing

Reflection Comparative Benchmark Creator | Promptsy