Complete {{model}} assessment on {{test_suite}} within {{time_limit}}. Constraints: - Total evaluation time: {{max_hours}} hours - Per-sample time limit: {{seconds_per_sample}} seconds - No retries on timeouts Prioritize tests by {{priority_criteria}}. Report coverage achieved. Flag tests skipped due to time constraints.
90 copies0 forks
Details
Category
AnalysisUse Cases
Time-limited testingRapid assessmentPriority evaluation
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared