Design an evaluation pipeline for {{model}} on {{evaluation_goals}}. Generate pipeline specification: 1. Data collection and preprocessing steps 2. Evaluation metric selection with rationale 3. Automated scoring implementation 4. Human review integration points 5. Reporting and visualization approach Output implementation guide with {{technology_stack}} recommendations.
98 copies0 forks
Details
Category
AnalysisUse Cases
Pipeline designEvaluation architectureAutomation planning
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared