Evaluate {{model}} performance with context lengths from {{min_tokens}} to {{max_tokens}}. Measure accuracy degradation, response quality, and recall across the context window. Identify the optimal context size for {{task_type}}.
34 copies0 forks
Details
Category
AnalysisUse Cases
Context limit testingMemory evaluationCapacity planning
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared