A new category of security tooling has arrived inside the AI coding assistant. IDE plugins, MCP servers, PR review bots — they inject security rules directly into the developer's workflow. When code is written that violates a rule, the tool flags it in context, before the pull request, before the review, before production. The category is real, the problem it addresses is real, and the timing is right. Developers building AI systems need security context earlier than they are getting it.
There is a prerequisite the tools themselves do not address. Every guardrail enforces a policy. That policy has to come from somewhere.
The source-of-rules problem
In practice, security rules injected into guardrail tools come from one of three places.
The first is a generic library: OWASP LLM Top 10 patterns, common prompt injection guards, credential exposure checks. These are useful across many systems. They are not specific to any system. An agentic procurement system that executes purchase orders has a different risk profile than a customer service RAG that summarizes tickets. Generic rules will catch common failure modes and miss the specific ones that define the actual risk of a given deployment.
The second is a security engineer's manual judgment. Accurate for the specific system, expensive to produce, not documented in terms of rationale, and not connected to any governance artifact. If an auditor asks what policy the rules enforce, the answer is “our security engineer decided.” That is not an answer that survives an AI governance review.
The third is a threat model: the right source, the right specificity, the right connection to the actual system. But a threat model only becomes a governance artifact when it has been reviewed, when someone has looked at it and decided the risk is acceptable under the stated controls, and when that decision is on record. A threat model produced in isolation, not connected to a clearance decision, has unknown status.
A guardrail without a reviewed source enforces confidence, not policy. There is a difference, and it shows up exactly when it matters most.
The guardrail is only as good as the model it enforces. The model is only as good as the review that produced it.
What happens when you skip the review
When guardrails run without a reviewed source, several things go wrong, quietly.
The rules become an average. They are tuned to what the tool knows about AI systems in general, not to what is true about this system. The agentic supply chain orchestrator that has write access to the ERP system, the RAG assistant with access to employee salary data, the coding copilot that can open and close tickets on behalf of any user — each of these has a specific threat profile that generic rules will partially miss and partially over-block.
The escalation path disappears. When a guardrail flags a pattern, the developer's question is: why is this rule here, who decided it, and does it apply to what I'm building? Without a reviewed source, the answer is opaque. The developer either overrides it (losing the safety benefit) or complies with it without understanding it (losing the understanding that would let them generalize the lesson). Neither produces a more secure developer.
The governance story collapses. An auditor or regulator reviewing an AI system will ask: what are your security controls, how were they derived, and who signed off that they are sufficient? The guardrail tool can answer the first question (here are the rules it enforces) but not the second or third. The review that would answer those questions was never done.
The enforcement layer can tell you what rules it is running. It cannot tell you whether those rules are right.
The right sequence
The sequence that produces defensible enforcement is not complicated, but it has to be respected as a sequence. Review, then rules, then enforce. Not in parallel. Not reversed.
The review is the point at which the specific AI system — its architecture, its data flows, its model and tool behavior, its actor permissions, its residual risk — is examined by people who can be held accountable for the conclusion. The outcome is not a score or a dashboard. It is a disposition: a signed record of what was found, what controls are required, what gaps remain open, and who accepted the residual risk under what conditions.
From that disposition, a rules set follows naturally. The required controls from the reviewed case — translated into patterns the guardrail can enforce — become the live policy for the development team. The guardrail now enforces a cleared baseline, not a generic one. When a developer violates a rule, the escalation path is also clear: this pattern was specifically identified in the review as a control requirement, and this deviation may constitute a material system change that triggers re-assessment.
This sequence also changes what the guardrail is. It is no longer a static rules file checked into a repository. It is a live link to the governance decision. When the decision is updated — because the system changed, because new threats emerged, because the risk was re-evaluated — the rules update too. The enforcement layer and the governance layer become the same artifact, consulted in different contexts.
Why the sequence is governance-load-bearing
This is not a technical argument. It is a governance argument, and it becomes a regulatory argument the moment the system reaches production in a regulated context.
The EU AI Act, ISO/IEC 42001, and the emerging enterprise AI governance frameworks all ask a version of the same question: was this AI system reviewed before deployment, by whom, under what controls, and is the record available for inspection? A guardrail tool produces enforcement logs. It does not produce a decision record. The two are different artifacts for different audiences.
“These rules were generated from the approved AI Security Review Case, signed by the CISO delegate and AI Governance, with residual risk accepted by the business owner.” That is a defensible statement. “Our security engineer wrote these” is not.
The distinction matters at every level of the governance stack. For the developer, it is the difference between following a rule they understand and following one they don't. For the security architect, it is the difference between enforcing a reviewed model and maintaining a ruleset in perpetuity. For the AI Committee, it is the difference between a live connection to the governance decision and an opaque enforcement layer they cannot inspect. For the auditor, it is the difference between a review they can trace and a control they cannot attribute.
An enforcement log tells you what was blocked. A decision record tells you why, who decided, and whether they can still be held to it.
What this asks of builders
If you are building an AI system — using Claude Code, Cursor, or any agentic framework to write the code — the right order is not to start with the guardrail. The right order is:
- Review the system before the code is written or while it is being written. What are the threats? What controls are required? What is the residual risk? Who accepts it?
- Get the review approved by the people who are accountable. Not reviewed for quality. Signed, with the residual risk attributed to a named role.
- Enforce the approved controls through guardrails, CI checks, and PR gates — derived from the review, not written independently of it.
- Re-assess when the system changes materially. A new model, a new tool, increased autonomy, a new data access path — each is a trigger to revisit the decision, not to update the guardrail file and continue.
This sequence is not overhead. It is the only order in which the enforcement layer actually means something. A guardrail that enforces a reviewed model is a genuine control. A guardrail that enforces an unreviewed one is a confidence signal. Governance, and eventually regulation, will demand to know which one you have.
The tools coming next
The AI coding guardrails category is building the enforcement layer. It is building it well. The layer it needs to sit on top of — a reviewed, approved AI security model — is the layer that is currently missing from most organizations deploying AI systems.
That gap is where the next useful tool lives. Not another enforcement layer, but the source of what the enforcement layer enforces: the review case that the guardrail reads from, the decision record that the auditor inspects, the governance artifact that travels with the AI system wherever it is deployed.
The right architecture is: review produces the cleared baseline. The cleared baseline is machine-readable and versioned. The guardrail tool pulls from it directly. The developer sees rules with provenance. The auditor sees enforcement connected to a decision. The AI Committee sees a live link between what they approved and what the team is building. This is not a product vision. It is a description of how these layers are supposed to fit together — and the only configuration in which they can all be held to what they promised.
Building the review layer is where Drel is built to live. Not as an alternative to the guardrail, but as the thing the guardrail reads from.