The restricted-pilot pattern for risky AI systems
A restricted pilot is a formal disposition state: the AI system may operate within a defined scope, on the condition that named controls are in place and named triggers will initiate a re-review. This piece defines how to write a restricted-pilot disposition.
Some AI systems present residual risks that cannot be fully characterised before any deployment — because the risk only becomes visible at scale, in real operational conditions, with real user populations. For these systems, the choice is not between “approved for production” and “not approved.” It is between an uncontrolled informal pilot (which happens anyway) and a formal restricted-pilot disposition that creates accountability for the pilot scope and a defined path to full production clearance.
The restricted-pilot pattern formalises the pilot. It is not a compromise between approval and rejection — it is a distinct disposition state with its own governance requirements, its own controls, and its own criteria for success.
What a restricted pilot is
A restricted pilot is a formal disposition that authorises an AI system to operate within a precisely defined scope — a specific user population, a specific data class, a specific operational context — under named controls, with a defined path to full production clearance and a defined trigger for pilot hold.
The three defining characteristics that make a restricted pilot different from an informal pilot or a conditional approval:
- Explicitly bounded scope.The scope is defined in terms of who, what, and where — not just size. “Internal HR team (12 people), for non-regulated internal queries only, within the HR information system” is a bounded scope. “Limited deployment” is not.
- Pilot-specific controls. The restricted pilot may require controls that would not be required at full production scale — for example, a mandatory human review of all model outputs during the pilot phase that would be impractical at scale but is appropriate for the limited pilot population.
- Defined path forward. The pilot exists to gather evidence for a production clearance decision. The evidence requirements are specified in the pilot disposition — not determined after the pilot completes.
Restricted pilot pattern
Restricted pilot definition
- A formal disposition state — not an informal phase
- The system may operate within a stated boundary only
- Boundary defined by: user group, data class, workflow, and geography
- Full production approval has not been granted and requires separate decision
What it gates
- Access to production data classes not in the pilot scope
- Customer-facing or regulated-data populations not in the pilot scope
- Tool or capability expansions not in the original pilot disposition
- Removal of human oversight mechanisms defined as pilot controls
Exit criteria
- All before-production controls verified with documented evidence
- Pilot-phase monitoring completed without triggering a hold
- Red-team or adversarial testing completed for the full production scope
- Disposition committee reviews and approves the production transition
When to use it
The restricted-pilot pattern is appropriate in two scenarios:
Scenario 1: Risks that require operational evidence. The threat model identifies residual risks that can only be properly evaluated at limited scale — for example, risks related to how users interact with the system, risks related to edge cases that only emerge in real use, or risks whose severity depends on user behaviour that cannot be simulated in a design-time review. The pilot provides the operational evidence that the full production clearance decision requires.
Scenario 2: Controls that cannot all be verified before deployment.Some controls cannot be fully verified until the system is operating. A monitoring control that detects anomalous usage patterns cannot be verified until there is usage to monitor. An escalation process cannot be verified until there is something to escalate. The pilot phase validates these controls at limited scale before they are required to function at production scale.
The restricted pilot is not a way to deploy a system before it is ready. It is a way to acquire the evidence needed to determine whether the system can be made ready — while maintaining accountability for the scope within which it operates in the meantime.
Scope definition
Scope definition is the most important element of the restricted-pilot disposition. The scope must be defined across four dimensions:
- User population:The specific users, roles, or user groups authorised to use the system during the pilot. Named teams, specific roles, or a maximum user count. Not “a small group of users.”
- Data classes: The specific categories of data the system is authorised to process during the pilot. Any data class not listed is explicitly excluded. If the system may encounter excluded data, there must be a control that prevents it from processing that data.
- Operational context: The specific use cases, systems, or business processes the system is authorised to support during the pilot. Adjacent use cases not listed are explicitly excluded.
- Explicit boundaries: What the system is explicitly prohibited from doing during the pilot — capabilities disabled, data classes excluded, user-facing outputs limited. These boundaries define what is out of scope regardless of what the system is technically capable of.
Required controls at pilot
Pilot-phase controls are different from production controls in two ways: they may be more intensive (because the pilot is specifically designed to evaluate risks that cannot be assessed design-time), and they may include controls that are not scalable to full production.
Controls typically required specifically at pilot phase:
- Manual review of all or a sample of model outputs during the pilot period.
- Enhanced logging that captures inputs, outputs, and model decisions at a level of detail that would not be retained in production.
- Named pilot users who have been specifically briefed on the system's limitations and expected to report unexpected behaviour.
- A named pilot owner responsible for monitoring the pilot, collecting pilot-phase evidence, and escalating anomalies.
- A defined review interval — typically weekly — at which the pilot owner reports on pilot-phase observations to the governance committee.
Path to production
The path to production defines the evidence the pilot must produce and the gates that must be cleared before the system can receive full production clearance.
The evidence requirements should be specified at the time of the pilot disposition — not determined retrospectively. Typical evidence requirements for a restricted pilot:
- Pilot-phase incident log — a complete record of anomalous events observed during the pilot, their severity, and their resolution.
- Control verification evidence — confirmation that the pilot-phase controls operated as designed throughout the pilot period.
- Pilot-phase performance data — evidence relevant to the residual risks identified in the threat model, showing whether they materialised and at what rate.
- Pilot-phase user feedback — structured feedback from pilot users regarding the system's behaviour in edge cases.
The gates to full production clearance are the governance committee's review of this evidence and confirmation that the evidence supports production clearance. The gates must be specified — not just “pilot review.” What is the minimum evidence package, and what would cause the committee to decline production clearance after reviewing it?
Trigger for pilot hold
A pilot hold is the immediate suspension of the pilot in response to a specific event. The pilot hold trigger must be defined in the pilot disposition — not determined ad hoc when a problem occurs.
Standard pilot hold triggers:
- Any incident involving data outside the authorised data classes.
- Any model output that causes identifiable harm to a pilot user.
- Any breach of the pilot scope — users outside the authorised population, use cases outside the authorised context.
- Failure of any pilot-specific control to operate as designed.
- A vendor incident (for vendor AI) that affects the system during the pilot period.
A pilot hold requires governance committee review before the pilot can resume. This creates an automatic escalation mechanism for events that exceed the pilot owner's authority to manage independently.
Evidence requirements
The restricted-pilot disposition must produce a documented evidence record that includes:
- The full scope definition with explicit boundaries.
- The pilot-specific controls with owner and verification mechanism.
- The path to production evidence requirements.
- The pilot hold triggers and the response process.
- The named pilot owner and the pilot review interval.
- The pilot duration — the date by which the pilot must produce the required evidence, or the system returns to the review queue.
For the broader disposition record structure, see The anatomy of an AI evidence pack.
Blog
Get new posts in your inbox
AI security review, OWASP Agentic Top 10, ISO 42001 evidence, and what AI Committees actually need. No cadence promises — we publish when there's something worth reading.
Formalise your AI pilots
Drel provides a structured restricted-pilot disposition that defines scope, controls, path to production, and hold triggers — so pilots are accountable, not informal.
A note on scope: Drel reviews assessed systems against documented architecture, configuration and intent. It does not ingest live telemetry from production environments. Dispositions reflect the assessed system at the time of review and the re-assessment triggers that govern when the disposition must be revisited.