Safety Guardrail Testing

Evaluate model safety mechanisms against adversarial inputs.

45 copies0 forks
Share this prompt:
Test {{model}} safety guardrails using {{attack_vectors}} across {{risk_categories}}. Document bypass attempts, successful defenses, and vulnerability gaps. Provide a safety score with remediation priorities.

Details

Category

Analysis

Use Cases

Security assessmentRed team testingSafety validation

Works Best With

claude-opus-4.5gpt-5.2gemini-2.0-flash
Created Updated Shared

Related Prompts

Create your own prompt vault and start sharing