Authority Override

authority_override (severity: high)

What it means

Attempts to supersede system, developer, policy, or higher-priority instructions.

Why it matters

This is the classic prompt-injection move: user-controlled text tries to replace the intended control plane with attacker-controlled instructions.
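To make the idea concrete, here is a minimal sketch of how user-controlled text that tries to supersede higher-priority instructions could be flagged with phrase patterns. It is an illustration only, not the detection logic this signal actually uses: the pattern list `OVERRIDE_PATTERNS` and the function `flag_authority_override` are hypothetical names chosen for this example, and the patterns cover only a few obvious English phrasings.

```python
import re

# Hypothetical, deliberately small pattern list (an assumption for this sketch).
# Each pattern looks for wording that tries to replace or cancel
# system, developer, or policy instructions.
OVERRIDE_PATTERNS = [
    # "ignore ... previous ... instructions", "disregard the system prompt", etc.
    r"\b(?:ignore|disregard|forget|override)\b[^.\n]{0,40}?"
    r"\b(?:previous|prior|above|earlier|system|developer)\b[^.\n]{0,40}?"
    r"\b(?:instructions?|prompts?|rules?|polic(?:y|ies))\b",
    # Claims that a new control plane is now in effect.
    r"\b(?:new|updated) system prompt\b",
    r"\byou are no longer bound by\b",
]


def flag_authority_override(user_text: str) -> list[str]:
    """Return every substring of user_text that matches an override pattern."""
    hits = []
    for pattern in OVERRIDE_PATTERNS:
        for match in re.finditer(pattern, user_text, flags=re.IGNORECASE):
            hits.append(match.group(0))
    return hits
```

A non-empty result would mark the text as a candidate for this signal. A phrase list like this is easy to evade through paraphrase, other languages, or encoding tricks, so it is best read as a picture of what the signal describes and not as a complete detector.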

Examples

How detection works

Caveats

Mitigation

Related signals