Design safety guardrails for {{model}} in {{sensitive_application}}. Design A - Input filtering: - Block harmful queries pre-model - Analyze: coverage, false positives - Evaluate user experience impact Design B - Output filtering: - Filter harmful responses post-model - Analyze: coverage, latency impact - Evaluate quality degradation Design C - Steering approach: - System prompt constraints - Analyze: effectiveness, bypass risk - Evaluate naturalness Compare designs and recommend layered approach with {{safety_requirements}}.
75 copies0 forks
Details
Category
AnalysisUse Cases
Guardrail designSafety engineeringProtection strategy
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared