GuardRail: Model Identity Protection

GuardRail: Model Identity Protection

List view

Quick Start

Console & Settings

October 9, 2025 WitnessAI Update

October 23, 2025 WitnessAI Update

October 2, 2025 WitnessAI Update

September 30, 2025: WitnessAI Update

September 23, 2025: WitnessAI Update

August 12, 2025: WitnessAI Update

July 31, 2025: WitnessAI Update

July 18, 2025: WitnessAI Update

June 23, 2025: WitnessAI Update

June 9, 2025: WitnessAI Update

April 11, 2025: WitnessAI Release v2.0

Copy August 12, 2025: WitnessAI Update

Supported Applications & LLMs

User Guide

Private Applications

Discovered Apps

Secure AI Portal

Policies & GuardRails

Policy Creation

Policy Precedence

GuardRail: Behavioral Activity

GuardRail: Data Protection

GuardRail: Harmful Response Prevention (Beta)

GuardRail: Model Identity Protection

GuardRail: Model Protection

GuardRail: Organizational Behavior (Beta)

GuardRail: Risk Analysis

Witness Anywhere: Remote Device Security

Witness Anywhere Overview

Witness Anywhere with Stunnel

Witness Anywhere: Azure VDI - Intune

Witness Anywhere: CrowdStrike (Windows)

Witness Anywhere: GPO (Windows)

Witness Anywhere: Jamf (macOS) (Beta)

Witness Anywhere: SentinelOne

Witness Anywhere: FAQ

Witness Attack

Witness Attack: AI Red Teaming

Administrator Guide

Integrations: Identity Providers

Identity: Microsoft Entra ID (Azure AD)

Identity: Okta Identity

Integrations: Network Security

Palo Alto Networks NGFW

Prisma Access: DNS Sinkhole

Prisma Access: Explicit Proxy

Zscaler Internet Access

Text-Completion API

Prompt-Protect API

Model Identity Protection

Model Identity Protection is WitnessAI's Jailbreak and Prompt Injection Guardrail. The purpose of this Guardrail is to protect Internal Models, or Models that are exposed by the business to the outside.

When these activities are detected, it provides the option to Allow, Warn, or Block the prompts with a customizable message.

WitnessAI Policies enable organizations to control, restrict, and protect the use of AI Models and Applications. Based on user activities, policies can block prompts, route usage to preferred AI Models or Applications, warn users, and maintain compliance with security and usage policies.

Using Model Identity Protection step-by-step

Starting within an existing or new Policy:

Click on the Guardrails tab (1).

Underneath the Guardrails tab, click on the Guardrail you’d like to configure (2).

Click the toggle to enable the Guardrail (3).

Underneath the Model Identity Protection title, choose the Model from the drop-down list (4).

Enter the prompt to send to the Model in the System Prompt section. (5)

If desired, click the Enable Response Protection toggle to enable (6).

Choose from “Allowlist” and “Blocklist” (7).

Add a Message under the list (8).

Enter a Behavior and Choose an Action (9, 10, 11).

notion image

Model Identity Protection Using Model Identity Protection step-by-step

Made with Bullet