Test {{model}} safety guardrails using {{attack_vectors}} across {{risk_categories}}. Document bypass attempts, successful defenses, and vulnerability gaps. Provide a safety score with remediation priorities.
45 copies0 forks
Details
Category
AnalysisUse Cases
Security assessmentRed team testingSafety validation
Works Best With
claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Shared