S2.03 - Control
S2.03 ? Control
flowchart LR
A1[Policy without operational constraint]
A2[Undocumented prompt variation]
A3[Untracked tool and model settings]
A4[Weak retrieval boundaries]
A5[Inconsistent reviewer intervention]
B[RAIDT - run-level evidence framework]
C[[Control - bounded, approved, and reviewable run conditions]]
D[Governance move - evidence over assertion, reviewability, contestability, audit readiness]
E1[Evidence pack]
E2[RAIDT score profile]
E3[Reviewer reconstruction]
E4[Organisational learning]
E5[Policy alignment]
F1[Healthcare discharge drafting]
F2[Finance reporting support]
F3[Education feedback generation]
F4[Prompt registry]
F5[Configuration log]
F6[Retrieval whitelist]
F7[Reviewer checkpoint]
A1 --> B
A2 --> B
A3 --> B
A4 --> B
A5 --> B
B --> C
C --> D
C --> E1
C --> E2
C --> E3
C --> E4
C --> E5
F1 --> C
F2 --> C
F3 --> C
F4 --> C
F5 --> C
F6 --> C
F7 --> C? Star S2 - Governance Meaning and Problem Context
Star context: Clarifies governance as oversight, control, accountability, reviewability, and continuous improvement rather than a vague ethics label. Within RAIDT, control makes governance operational by showing how a run is deliberately bounded, configured, and checked.
Academic picture
Definition / background
Control refers to the extent to which a GenAI run is intentionally constrained, directed, and governed through approved prompts, policies, tool settings, retrieval boundaries, model configurations, access conditions, and review rules. In governance terms, it answers a basic question: what prevented this run from becoming an unconstrained, opaque, or improvised interaction?
Conceptually, control sits close to ideas such as oversight, assurance, internal governance, compliance, and quality management, but it is not identical to any of them. Oversight concerns who watches and intervenes. Accountability concerns who is answerable. Reviewability concerns whether a run can be examined. Control is more immediate: it concerns the mechanisms that shape the run before, during, and after execution so that the system behaves within an authorised envelope.
This matters especially in GenAI settings because the same model can produce very different outputs depending on prompt phrasing, tool invocation, retrieval context, temperature or decoding settings, and human intervention. Without explicit control, organisational use easily drifts from approved purpose into local improvisation. A policy may exist, but the run may still be effectively uncontrolled. RAIDT addresses this by treating the run itself as the unit of governance and asking what evidence shows that this specific use was properly bounded.
Within RAIDT, control belongs directly to run-level evidence. It is reflected in the evidence pack through configuration records, prompt registries, policy mappings, approval states, reviewer actions, and logs showing what was enabled or disabled. It also affects the five-pillar score profile because weak control tends to degrade responsibility, auditability, dependability, and traceability simultaneously. Control therefore links governance intent to operational proof.
Why this concept matters
Control matters because organisations often talk about governing AI at the level of principles while actual risk emerges at the level of concrete use. The problem is not only whether a model is generally safe or approved, but whether a particular run was performed under the right conditions for the right task with the right constraints.
This concept avoids a common confusion between having a policy and having a controlled process. A team may have an AI policy, approved tooling, and named owners, yet still allow staff to alter prompts freely, bypass retrieval restrictions, change model settings, or use outputs without required review. In that case, governance exists on paper but not in practice.
If control is missing, several risks appear: outputs become harder to justify, harmful drift becomes easier, audit trails become thinner, reviewers cannot tell whether non-compliance was due to policy failure or execution failure, and continuous improvement becomes guesswork rather than evidence-led learning. For organisations using GenAI in operational settings, this is not a minor issue. It determines whether governance can be defended to supervisors, auditors, regulators, partners, and internal decision-makers.
RAIDT uses control to move from broad principles to operational governance. It asks what controlled the run, how that control was documented, whether it was followed, and whether deviations can be identified and reviewed.
Key idea: Control matters because responsible GenAI governance is credible only when each run is visibly constrained by evidence-backed rules, settings, and review conditions.
What this item controls
- The authorised purpose and task boundary of a run.
- The prompt template, system instruction, or approved input structure used in execution.
- The model, model version, parameter settings, and tool configuration allowed for the task.
- The retrieval sources, knowledge boundaries, and external data inputs that may influence outputs.
- The human review checkpoints, escalation rules, and sign-off conditions attached to the run.
- The extent to which users can override, improvise, or bypass approved governance settings.
- The evidential traces needed to show that the run remained within an approved operational envelope.
Practical example / likely audience question
Audience question
Where is control in RAIDT, and how is it different from simply having an AI policy or a named reviewer?
Answer
The concern behind this question is usually that governance is being treated too abstractly. Many organisations can point to a policy, a committee, or a responsible manager, but none of those automatically shows that a particular GenAI run was actually controlled. The direct answer is that, in RAIDT, control is evidenced through the artefacts that shaped the run: approved prompt versions, policy-linked configuration records, model and tool settings, retrieval permissions, reviewer checkpoints, and logs of any deviations or overrides.
A practical example is a team using a GenAI assistant to draft internal policy summaries. A generic governance approach may state that staff must use approved tools and comply with policy. RAIDT goes further by showing whether the specific run used the approved prompt template, the authorised model configuration, the permitted document set, and the required reviewer decision before circulation. That makes control inspectable rather than assumed.
RAIDT handles this better than generic AI governance because it treats control as a run-level evidential property. Instead of asking only whether governance exists somewhere in the organisation, it asks whether this run can be shown to have been governed in a reconstructable way.
Practical example in RAIDT terms
Consider a healthcare setting in which a GenAI system helps draft discharge-summary language for clinicians. The use case appears routine, but the run-level issue is whether the system was used with the approved clinical prompt, the approved model version, the restricted patient data context, and mandatory clinician review before anything entered the record.
The evidence needed would include the prompt template version, model and temperature settings, retrieval boundaries, patient-data access conditions, the identity and role of the reviewer, and a record of whether the draft was accepted, amended, or rejected. If an unauthorised retrieval source or unapproved prompt variant was used, the run may no longer count as properly controlled even if the final text looked acceptable.
In RAIDT pillar terms, this strongly affects Responsibility, Auditability, Dependability, and Traceability, with a secondary effect on Interpretability. Good control improves governance readiness because it lets the organisation show not just that clinical AI use is governed in theory, but that this specific draft was produced under approved conditions and can be defended if challenged.
Detailed link to RAIDT
Control links to RAIDT in four ways.
First, it links to RAIDT's core idea by making governance concrete at the level of the run rather than leaving it at the level of abstract policy.
Second, it links to run-level evidence because control must be demonstrated through artefacts such as prompt versions, configuration states, tool permissions, and reviewer decisions.
Third, it links to the evidence pack and score profile because strong or weak control directly changes what evidence can be assembled and how confidently the run can be scored.
Fourth, it links to reviewability, contestability, audit readiness, and organisational learning because a controlled run is easier to reconstruct, challenge, compare, and improve.
Control ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
In this sense, control is one of the mechanisms through which RAIDT converts organisational intent into inspectable governance practice.
Link to the five RAIDT pillars
Control has its strongest effects on Responsibility, Auditability, Dependability, and Traceability, but it also supports Interpretability by clarifying the conditions under which outputs were produced.
Responsibility
Control supports responsibility by showing that roles, permissions, and decision rights were translated into actual run conditions rather than left implicit.
Example evidence / implication:
- Approved prompt owner, system owner, or reviewer role linked to the run.
- Evidence that escalation or sign-off rules were applied before downstream use.
Auditability
Control is central to auditability because auditors need to inspect what constrained the run and whether those constraints were followed.
Example evidence / implication:
- Configuration logs showing model version, tool access, and retrieval constraints.
- A recorded comparison between approved settings and actual settings used in the run.
Interpretability
Control improves interpretability indirectly by narrowing the space of possible causes behind an output and clarifying which inputs and settings shaped it.
Example evidence / implication:
- Stored prompt template and retrieval context explaining why the output took a particular form.
- Reviewer notes that connect output characteristics to known run conditions.
Dependability
Control strengthens dependability by reducing arbitrary variation and ensuring repeated runs occur within an approved operational envelope.
Example evidence / implication:
- Locked or versioned run templates that reduce uncontrolled drift.
- Rules preventing high-risk tasks from using unapproved tools or model settings.
Traceability
Control strengthens traceability because it creates the artefacts needed to follow a run from authorisation to output and subsequent review.
Example evidence / implication:
- Traceable links between policy requirements, run configuration, and reviewer action.
- Logs of overrides, exceptions, or deviations from the standard controlled pathway.
Why this item is more than a generic concept
In general AI governance, control may simply mean that an organisation has policies, access restrictions, or a broad approval process. In RAIDT, control means that a specific run can be shown to have been shaped by identifiable constraints and decisions that are evidenced, reviewable, and reconstructable.
The RAIDT meaning is more operational because it is tied to run-level evidence. It does not stop at saying that the organisation intended to control AI use. It asks what, exactly, controlled this run; what records prove that; and how those records affect governance judgement.
Common misunderstanding
Misunderstanding
Control means rigidly preventing users from exercising judgement or adapting the system to context.
Correction
In RAIDT, control does not mean eliminating human discretion. It means bounding discretion within an authorised and inspectable framework. For example, a reviewer may legitimately amend a model output or choose between approved prompt variants, but that discretion should occur within documented rules and leave an evidential trace. The issue is not flexibility itself; the issue is undocumented flexibility that makes governance impossible to verify.
Boundary and limitation
Control does not prove that a run was ethically good, legally compliant, or substantively correct. A tightly controlled run can still produce a poor output if the policy is weak, the prompt is badly designed, the retrieval source is incomplete, or the reviewer misses an error. Control also does not replace oversight, accountability, or reviewability; it works alongside them.
Its effectiveness depends on the quality of the controls themselves and on whether they are proportionate to the task. Overly weak controls create unmanaged risk, while overly rigid controls can generate workarounds or superficial compliance. RAIDT handles this limitation by asking not only whether controls existed, but whether they were appropriate, evidenced, followed, and open to review and improvement over time.
Implementation levels
Manual implementation
A researcher or small team can implement control manually through approved prompt sheets, written run protocols, simple configuration checklists, version-labelled templates, and recorded reviewer decisions. Even a spreadsheet-based control register can materially improve governance if it captures what was allowed, what was used, and who approved exceptions.
Semi-automated implementation
Semi-automated implementation can use structured metadata, form-based run submission, prompt registries, model selection controls, retrieval whitelists, and templated review checkpoints. This reduces omission risk and makes evidence collection more consistent across runs.
Fully automated implementation
At scale, control can be implemented through platform wrappers, orchestration layers, policy engines, permission-aware tool routing, immutable logging, dashboard-based approval states, and governance pipelines that enforce configuration rules before a run is executed. In this form, the system itself helps ensure that high-risk uses cannot proceed without the required controls and evidential capture.
Practical use in the RAIDT project
Within the RAIDT project, control is useful in several ways. In Paper 08 Foundations, it helps define how governance concepts become operational at the run level rather than remaining conceptual. In Paper 09 Empirical Validation, it can be examined through real runs to show whether stronger control corresponds to stronger evidence packs, more consistent scoring, or improved reviewer confidence. In Paper 10 Policy Pathways, it provides a bridge between policy aspiration and implementable governance mechanisms.
It is also directly relevant to sector playbooks, because different domains will operationalise control differently depending on risk, regulation, and workflow. In the evidence pack, control helps determine what artefacts must be collected. In the scoring rubric, it influences how governance quality is judged. In supervisor explanation, viva defence, and journal positioning, it helps show that RAIDT is not just another principles framework; it is a method for evidencing whether governance actually shaped a run.
Key audience questions to prepare for
Q1. Is control just another word for compliance?
No. Compliance is one possible outcome or frame, but control is broader. It concerns the actual mechanisms and constraints that shape the run, whether they arise from policy, risk management, quality assurance, or workflow design. RAIDT asks for evidence that those mechanisms were active in the run.
Q2. Why focus on control at the run level rather than system level?
Because many failures and governance gaps appear in specific uses rather than in abstract system descriptions. A model may be approved at system level yet still be used in an uncontrolled way in practice. RAIDT therefore inspects the run as the unit where governance either holds or fails.
Q3. Can a highly controlled run still be wrong?
Yes. Control improves governance discipline, not substantive infallibility. A controlled run may still contain hallucinations, bias, or task mismatch. What control adds is a defensible basis for review, reconstruction, and improvement.
Q4. Does stronger control always mean stricter restriction?
Not necessarily. Good control is proportionate. Low-risk uses may permit wider discretion, while high-risk uses require tighter boundaries. The key issue is whether the level of control is justified, documented, and reviewable.
Q5. How would you show control to an examiner or auditor?
By presenting run-level artefacts: approved prompt version, configuration records, model and tool settings, retrieval scope, user permissions, reviewer actions, any deviations or overrides, and a clear mapping from these artefacts to the governance judgement made in the evidence pack and score profile.
Suggested citation concepts to support this item
- AI governance operational control mechanisms
- run-level AI governance evidence
- configuration management in machine learning systems
- prompt governance and prompt version control
- policy enforcement in generative AI workflows
- human oversight and control in organisational AI use
- AI assurance artefacts and audit trails
- retrieval governance and data boundary controls
- socio-technical control in automated decision support
- internal controls for responsible AI deployment
Short explanation for presentation
Control in RAIDT means that a GenAI run is not treated as a free-form interaction but as a governed event with defined constraints. The key point is that control must be visible in evidence. That includes approved prompt versions, model and tool settings, retrieval boundaries, reviewer checkpoints, and records of deviations or overrides. This matters because organisations often claim to govern AI through policies, but policy alone does not show that a particular run was actually bounded in practice. RAIDT makes control operational by tying it to run-level evidence, the evidence pack, and the score profile. In supervision or viva terms, control helps explain how RAIDT moves from principle-based governance to inspectable governance that is reviewable, contestable, and capable of supporting continuous organisational learning.
One-line takeaway
Control is the evidence-backed constraint of a GenAI run because RAIDT governs not just AI systems in general, but specific runs in specific organisational contexts.
Related items in governance meaning and problem context
Anchored questions
No anchored questions were present in the source item.
Mentioned in reference-paper summaries (5)
Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.
REF-017__Bhat-2023.mdREF-019__Bodendorf-2025.mdREF-020__Bommasani-2021.mdREF-022__Breck-2017.mdREF-024__Charness-2009.md