Prompt injection detection skill

Two-layer content safety for agent input and output. Use when (1) a user message attempts to override, ignore, or bypass previous instructions (prompt injection), (2) a user message references system prompts, hidden instructions, or internal configuration, (3) receiving messages from untrusted users in group chats or public channels, (4) generating responses that discuss violence, self-harm, sexual content, hate speech, or other sensitive topics, or (5) deploying agents in public-facing or multi-user environments where adversarial input is expected.
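As a rough illustration of what "two-layer" could mean here, the sketch below runs one check on incoming user messages (override attempts and system-prompt probing) and a second check on generated responses (sensitive topics). It is not the skill's actual implementation: the pattern lists, the `Verdict` type, and the `check_input` / `check_output` function names are all hypothetical, and simple keyword matching like this would miss many real attacks.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List, Pattern

# Hypothetical patterns for illustration only; a real detector would need
# far broader coverage than a few regular expressions.
INPUT_PATTERNS: Dict[str, Pattern[str]] = {
    # Layer 1a: attempts to override or bypass earlier instructions.
    "instruction_override": re.compile(
        r"\b(ignore|disregard|override|bypass)\b.{0,40}\b(instructions?|rules?)\b",
        re.IGNORECASE,
    ),
    # Layer 1b: references to system prompts or internal configuration.
    "system_prompt_probe": re.compile(
        r"\b(system prompt|hidden instructions?|internal configuration)\b",
        re.IGNORECASE,
    ),
}

# Layer 2: sensitive topics in generated output (assumed categories).
OUTPUT_PATTERNS: Dict[str, Pattern[str]] = {
    "self_harm": re.compile(r"\b(self-harm|suicide)\b", re.IGNORECASE),
    "violence": re.compile(r"\b(kill|attack|weapon)s?\b", re.IGNORECASE),
}


@dataclass
class Verdict:
    flagged: bool
    reasons: List[str] = field(default_factory=list)


def _scan(text: str, patterns: Dict[str, Pattern[str]]) -> Verdict:
    # Collect the name of every pattern that matches somewhere in the text.
    reasons = [name for name, pattern in patterns.items() if pattern.search(text)]
    return Verdict(flagged=bool(reasons), reasons=reasons)


def check_input(message: str) -> Verdict:
    """Input layer: screen an untrusted user message before the agent sees it."""
    return _scan(message, INPUT_PATTERNS)


def check_output(response: str) -> Verdict:
    """Output layer: screen the agent's response before it is sent."""
    return _scan(response, OUTPUT_PATTERNS)
```

In a group chat or public channel, an agent might call `check_input` on every message from an untrusted user and `check_output` on every draft reply, then refuse, redact, or escalate whenever `flagged` is true.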

Skill package URL: https://clawhub.ai/zskyx/detect-injection