Assess {{model}} on {{task_type}} with single-turn constraint. Interaction rules: - One prompt, one response only - No clarifying questions allowed - No multi-step reasoning chains - Complete answer in single response Measure task completion rate on {{evaluation_set}}. Compare to multi-turn baseline. Identify tasks requiring multi-turn.
86 copies0 forks
Details
Category
AnalysisUse Cases
Single-turn testingConstraint evaluationInteraction analysis
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared