LLM supply-chain risk — models, weights, and dependencies
LLM applications have a supply chain that extends to pre-trained models, fine-tuning datasets, inference providers, and plugin ecosystems. This piece maps the supply-chain attack surface and the review questions for each layer.
The supply chain of an LLM application extends well beyond the software dependencies that a standard SCA scanner covers. It includes the pre-trained model, the fine-tuning datasets, the inference provider, any plugins or extensions the model uses, and the software packages that connect these components. Each layer introduces risks that the application team cannot fully control and must evaluate at design time.
OWASP LLM05 (supply chain vulnerabilities) is underweighted in most AI security reviews we have assessed — teams acknowledge the category but do not produce evidence that the individual layers have been evaluated. This piece provides the review framework for each layer. For the full OWASP context, see the OWASP LLM Top 10 Assessment.
The AI supply chain
Traditional software supply chains — code dependencies, build tools, base container images — are well understood and have mature tooling: SCA scanners, SBOM generation, dependency pinning, vulnerability feeds. The AI supply chain shares some of these properties and adds several that traditional tooling does not cover.
The additions: pre-trained model weights are opaque — you cannot read them the way you can read source code. Training data provenance cannot be inferred from the model artifact. Alignment and capability claims are made by the model provider and cannot be independently verified from the weights alone. The inference provider processes your prompts and completions — its data retention practices determine whether your data stays within your control boundary.
A standard software SBOM tells you what code is in your application. An AI supply chain review must also document what the model was trained on, who processed the inference, and what capabilities the plugins add — none of which appears in a traditional dependency manifest.
The five layers of the AI supply chain require different review questions and produce different evidence. The sections below address each in turn.
Supply chain layers — attack surface and required control
| Supply chain layer | Attack surface | Control required |
|---|---|---|
| Base model weights | Opaque training corpus; undisclosed capabilities; alignment claims unverifiable from weights | Evaluate provider safety documentation; assess capability profile against deployment scope |
| Fine-tuning dataset | Poisoned examples; memorisation of sensitive training data; alignment drift post-fine-tune | Data provenance review per source; post-fine-tune behavioural evaluation including safety cases |
| Model provider API | Prompt and completion data retained by provider; subprocessor chain; ToS constraints | Review data retention terms against data handling obligations; document subprocessors |
| Inference infrastructure | Side-channel exposure during inference; provider-side logging; shared-tenant isolation | Enterprise or zero-retention SLA where required; review provider security certifications |
| Plugin / tool dependencies | Third-party plugin behaviour changes on update; external service connections; manipulated tool descriptions | Pin plugin versions; review permissions and external connections per tool; verify tool descriptions |
| System prompt templates | Credentials or secrets embedded in prompt; extraction-vulnerable trust rules | Remove credentials from prompts; design for disclosure resilience; test for extraction |
Layer 1 — base model risks
The base model is the pre-trained foundation the application is built on. For most organisations this is a third-party model from a major AI provider — GPT-4, Claude, Llama, Gemini, or equivalent. The risks at this layer are not exploitable through the application code; they are properties of the model itself that the application must account for.
Training data provenance.What data was the model trained on? The provider's documentation may describe the general composition of the training corpus but rarely provides a complete enumeration. For most foundation models, the training corpus includes significant quantities of web-scraped content with unknown provenance. The implication: the model may have memorised content that appears in the training corpus, including PII that appeared in public web content. For most deployments this risk is manageable. For deployments handling regulated data or operating under strict copyright constraints, it is material.
Alignment and safety claims.Model providers publish safety evaluation results, red-team findings, and alignment documentation. These are self-assessments — they represent the provider's claims about the model's behaviour, not independent verification. A security review should document that these materials have been reviewed and assessed against the deployment context, not just acknowledged.
Capability claims.The model may have capabilities beyond what the deployment requires. A coding assistant built on a general-purpose model with medical knowledge, legal reasoning, and synthesis capabilities has excess capabilities that could be exploited if the model is manipulated. The base model's capability profile should be compared to the deployment scope.
Layer 2 — fine-tuning risks
Fine-tuning adapts a base model to a specific task or domain using additional training data. It adds two distinct risk surfaces: the fine-tuning dataset itself (which becomes part of the model's effective training corpus) and the alignment risk from fine-tuning altering the model's safety-relevant behaviour.
Fine-tuning dataset provenance.Every example in the fine-tuning dataset contributes to the model's learned behaviour. Poisoned examples — crafted to teach the model specific behaviours on specific trigger patterns — are indistinguishable from legitimate examples in the dataset unless the dataset is reviewed. The review question is: what is the provenance of each data source in the fine-tuning set, and was each source reviewed for anomalous content?
Alignment drift from fine-tuning. Fine-tuning on a narrow task corpus can weaken alignment properties present in the base model. A model fine-tuned to be maximally helpful at a customer service task may follow user instructions more aggressively than the base model, including instructions that cross safety boundaries. Post-fine-tuning behavioural evaluation should include safety-relevant test cases, not just task-performance metrics.
Training data as sensitive data. Fine-tuning corpora built from internal documents may contain sensitive information that the resulting model can reproduce. See the sensitive information disclosure article for the memorisation risk and controls.
Layer 3 — inference provider risks
Every prompt sent to a hosted inference provider and every completion returned is processed by the provider's infrastructure. The inference provider sees the full content of every interaction — including system prompts, user messages, and retrieved documents that are injected into the context. This has direct implications for data handling obligations.
Data retention.Does the provider retain prompt and completion data? For how long? Under what conditions? Providers vary significantly — some retain data for abuse monitoring, model improvement, or commercial purposes unless the customer opts out or purchases an enterprise agreement that includes a zero-retention SLA. The retention terms must be reviewed against the organisation's data handling obligations, particularly for any prompts that may contain personal data.
Subprocessors.Inference providers use subprocessors for infrastructure, logging, and operations. The subprocessor chain affects the geographic scope of data processing, which has implications for GDPR adequacy decisions and other jurisdictional requirements. The provider's subprocessor list should be documented and assessed.
Terms of service alignment. API terms of service impose constraints on how the model can be used — prohibited use cases, output restrictions, ownership of fine-tuned derivatives. These constraints must be evaluated against the deployment use case before production deployment.
Layer 4 — plugin and extension risks
Plugins and extensions extend the model's capabilities through tool calls. Each plugin is a dependency with its own supply chain: the plugin code, the external services it connects to, and the permissions it exercises. The tool manifest — the set of plugins available to the model — defines the model's effective capability scope.
Third-party plugins present specific risks. A plugin sourced from an external developer may change behaviour on update, may connect to external services that change their behaviour, or may contain intentional backdoors. The tool descriptions in a plugin manifest influence how the model decides when to invoke the plugin — a manipulated tool description can cause the model to invoke a tool in unintended contexts.
Review questions for each plugin: What is the plugin's source and update mechanism? What permissions does it exercise? What external services does it connect to? What data does it process? Is the tool description accurate and appropriately scoped?
Layer 5 — software dependency risks
The software dependencies that connect the application to the model — LLM SDKs, prompt engineering libraries, vector database clients, embedding libraries — are the layer most similar to traditional software supply chain risks and the layer most amenable to existing tooling.
Standard SCA scanning covers known CVEs in these packages. Beyond CVEs, the AI dependency surface has some additional considerations: prompt injection libraries that promise injection detection may have bypass rates that are not publicly documented; embedding libraries may process input data in ways that are not transparent; model fine-tuning frameworks may load weights from URLs specified in configuration files — a configuration injection vector.
Review checklist
A supply chain review for an LLM application must produce evidence across all five layers. The checklist below is the minimum required for each.
| Layer | Required evidence |
|---|---|
| Base model | Provider safety documentation reviewed; capability profile assessed against deployment scope; re-review trigger defined for model version changes |
| Fine-tuning | Fine-tuning dataset provenance record; behavioural evaluation results post-fine-tuning including safety test cases; memorisation evaluation results |
| Inference provider | Data retention terms reviewed and documented; subprocessor list reviewed against data handling obligations; API terms assessed against use case |
| Plugins / tools | Tool manifest reviewed — source, permissions, and external connections documented for each plugin; tool description accuracy verified |
| Software dependencies | SCA scan results; dependencies pinned to specific versions; known CVEs addressed; AI-specific library versions documented |
The OWASP LLM Top 10 Assessment provides a structured framework for collecting this evidence and incorporating it into the clearance decision for the assessed system.
Blog
Get new posts in your inbox
AI security review, OWASP Agentic Top 10, ISO 42001 evidence, and what AI Committees actually need. No cadence promises — we publish when there's something worth reading.
Review your LLM application supply chain
Drel maps the supply chain of assessed systems across all five layers — base model, fine-tuning, inference provider, plugins, and software dependencies — and documents the control gaps that affect the clearance decision.
A note on scope: Drel reviews assessed systems against documented architecture, configuration and intent. It does not ingest live telemetry from production environments. Dispositions reflect the assessed system at the time of review and the re-assessment triggers that govern when the disposition must be revisited.