Investigate {{model}} failure on {{failure_case}}. Hypothesis 1 - Context overload: - Test with reduced context - Measure improvement - Evaluate likelihood Hypothesis 2 - Instruction ambiguity: - Test with clearer instructions - Measure improvement - Evaluate likelihood Hypothesis 3 - Knowledge gap: - Test with added context - Measure improvement - Evaluate likelihood Synthesize findings and identify root cause with confidence level.
31 copies0 forks
Details
Category
AnalysisUse Cases
Failure diagnosisHypothesis testingRoot cause analysis
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared