S2.03 - Control

S2.03 ? Control

flowchart LR
    A1[Policy without operational constraint]
    A2[Undocumented prompt variation]
    A3[Untracked tool and model settings]
    A4[Weak retrieval boundaries]
    A5[Inconsistent reviewer intervention]

    B[RAIDT - run-level evidence framework]
    C[[Control - bounded, approved, and reviewable run conditions]]
    D[Governance move - evidence over assertion, reviewability, contestability, audit readiness]

    E1[Evidence pack]
    E2[RAIDT score profile]
    E3[Reviewer reconstruction]
    E4[Organisational learning]
    E5[Policy alignment]

    F1[Healthcare discharge drafting]
    F2[Finance reporting support]
    F3[Education feedback generation]
    F4[Prompt registry]
    F5[Configuration log]
    F6[Retrieval whitelist]
    F7[Reviewer checkpoint]

    A1 --> B
    A2 --> B
    A3 --> B
    A4 --> B
    A5 --> B

    B --> C
    C --> D
    C --> E1
    C --> E2
    C --> E3
    C --> E4
    C --> E5

    F1 --> C
    F2 --> C
    F3 --> C
    F4 --> C
    F5 --> C
    F6 --> C
    F7 --> C

? Star S2 - Governance Meaning and Problem Context

Star context: Clarifies governance as oversight, control, accountability, reviewability, and continuous improvement rather than a vague ethics label. Within RAIDT, control makes governance operational by showing how a run is deliberately bounded, configured, and checked.


Academic picture
Definition / background

Control refers to the extent to which a GenAI run is intentionally constrained, directed, and governed through approved prompts, policies, tool settings, retrieval boundaries, model configurations, access conditions, and review rules. In governance terms, it answers a basic question: what prevented this run from becoming an unconstrained, opaque, or improvised interaction?

Conceptually, control sits close to ideas such as oversight, assurance, internal governance, compliance, and quality management, but it is not identical to any of them. Oversight concerns who watches and intervenes. Accountability concerns who is answerable. Reviewability concerns whether a run can be examined. Control is more immediate: it concerns the mechanisms that shape the run before, during, and after execution so that the system behaves within an authorised envelope.

This matters especially in GenAI settings because the same model can produce very different outputs depending on prompt phrasing, tool invocation, retrieval context, temperature or decoding settings, and human intervention. Without explicit control, organisational use easily drifts from approved purpose into local improvisation. A policy may exist, but the run may still be effectively uncontrolled. RAIDT addresses this by treating the run itself as the unit of governance and asking what evidence shows that this specific use was properly bounded.

Within RAIDT, control belongs directly to run-level evidence. It is reflected in the evidence pack through configuration records, prompt registries, policy mappings, approval states, reviewer actions, and logs showing what was enabled or disabled. It also affects the five-pillar score profile because weak control tends to degrade responsibility, auditability, dependability, and traceability simultaneously. Control therefore links governance intent to operational proof.

Why this concept matters

Control matters because organisations often talk about governing AI at the level of principles while actual risk emerges at the level of concrete use. The problem is not only whether a model is generally safe or approved, but whether a particular run was performed under the right conditions for the right task with the right constraints.

This concept avoids a common confusion between having a policy and having a controlled process. A team may have an AI policy, approved tooling, and named owners, yet still allow staff to alter prompts freely, bypass retrieval restrictions, change model settings, or use outputs without required review. In that case, governance exists on paper but not in practice.

If control is missing, several risks appear: outputs become harder to justify, harmful drift becomes easier, audit trails become thinner, reviewers cannot tell whether non-compliance was due to policy failure or execution failure, and continuous improvement becomes guesswork rather than evidence-led learning. For organisations using GenAI in operational settings, this is not a minor issue. It determines whether governance can be defended to supervisors, auditors, regulators, partners, and internal decision-makers.

RAIDT uses control to move from broad principles to operational governance. It asks what controlled the run, how that control was documented, whether it was followed, and whether deviations can be identified and reviewed.

Key idea: Control matters because responsible GenAI governance is credible only when each run is visibly constrained by evidence-backed rules, settings, and review conditions.

What this item controls
Practical example / likely audience question

Audience question

Where is control in RAIDT, and how is it different from simply having an AI policy or a named reviewer?

Answer

The concern behind this question is usually that governance is being treated too abstractly. Many organisations can point to a policy, a committee, or a responsible manager, but none of those automatically shows that a particular GenAI run was actually controlled. The direct answer is that, in RAIDT, control is evidenced through the artefacts that shaped the run: approved prompt versions, policy-linked configuration records, model and tool settings, retrieval permissions, reviewer checkpoints, and logs of any deviations or overrides.

A practical example is a team using a GenAI assistant to draft internal policy summaries. A generic governance approach may state that staff must use approved tools and comply with policy. RAIDT goes further by showing whether the specific run used the approved prompt template, the authorised model configuration, the permitted document set, and the required reviewer decision before circulation. That makes control inspectable rather than assumed.

RAIDT handles this better than generic AI governance because it treats control as a run-level evidential property. Instead of asking only whether governance exists somewhere in the organisation, it asks whether this run can be shown to have been governed in a reconstructable way.

Practical example in RAIDT terms

Consider a healthcare setting in which a GenAI system helps draft discharge-summary language for clinicians. The use case appears routine, but the run-level issue is whether the system was used with the approved clinical prompt, the approved model version, the restricted patient data context, and mandatory clinician review before anything entered the record.

The evidence needed would include the prompt template version, model and temperature settings, retrieval boundaries, patient-data access conditions, the identity and role of the reviewer, and a record of whether the draft was accepted, amended, or rejected. If an unauthorised retrieval source or unapproved prompt variant was used, the run may no longer count as properly controlled even if the final text looked acceptable.

In RAIDT pillar terms, this strongly affects Responsibility, Auditability, Dependability, and Traceability, with a secondary effect on Interpretability. Good control improves governance readiness because it lets the organisation show not just that clinical AI use is governed in theory, but that this specific draft was produced under approved conditions and can be defended if challenged.

Detailed link to RAIDT

Control links to RAIDT in four ways.

First, it links to RAIDT's core idea by making governance concrete at the level of the run rather than leaving it at the level of abstract policy.
Second, it links to run-level evidence because control must be demonstrated through artefacts such as prompt versions, configuration states, tool permissions, and reviewer decisions.
Third, it links to the evidence pack and score profile because strong or weak control directly changes what evidence can be assembled and how confidently the run can be scored.
Fourth, it links to reviewability, contestability, audit readiness, and organisational learning because a controlled run is easier to reconstruct, challenge, compare, and improve.

Control ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

In this sense, control is one of the mechanisms through which RAIDT converts organisational intent into inspectable governance practice.

Link to the five RAIDT pillars

Control has its strongest effects on Responsibility, Auditability, Dependability, and Traceability, but it also supports Interpretability by clarifying the conditions under which outputs were produced.

Responsibility

Control supports responsibility by showing that roles, permissions, and decision rights were translated into actual run conditions rather than left implicit.

Example evidence / implication:

Auditability

Control is central to auditability because auditors need to inspect what constrained the run and whether those constraints were followed.

Example evidence / implication:

Interpretability

Control improves interpretability indirectly by narrowing the space of possible causes behind an output and clarifying which inputs and settings shaped it.

Example evidence / implication:

Dependability

Control strengthens dependability by reducing arbitrary variation and ensuring repeated runs occur within an approved operational envelope.

Example evidence / implication:

Traceability

Control strengthens traceability because it creates the artefacts needed to follow a run from authorisation to output and subsequent review.

Example evidence / implication:

Why this item is more than a generic concept

In general AI governance, control may simply mean that an organisation has policies, access restrictions, or a broad approval process. In RAIDT, control means that a specific run can be shown to have been shaped by identifiable constraints and decisions that are evidenced, reviewable, and reconstructable.

The RAIDT meaning is more operational because it is tied to run-level evidence. It does not stop at saying that the organisation intended to control AI use. It asks what, exactly, controlled this run; what records prove that; and how those records affect governance judgement.

Common misunderstanding

Misunderstanding

Control means rigidly preventing users from exercising judgement or adapting the system to context.

Correction

In RAIDT, control does not mean eliminating human discretion. It means bounding discretion within an authorised and inspectable framework. For example, a reviewer may legitimately amend a model output or choose between approved prompt variants, but that discretion should occur within documented rules and leave an evidential trace. The issue is not flexibility itself; the issue is undocumented flexibility that makes governance impossible to verify.

Boundary and limitation

Control does not prove that a run was ethically good, legally compliant, or substantively correct. A tightly controlled run can still produce a poor output if the policy is weak, the prompt is badly designed, the retrieval source is incomplete, or the reviewer misses an error. Control also does not replace oversight, accountability, or reviewability; it works alongside them.

Its effectiveness depends on the quality of the controls themselves and on whether they are proportionate to the task. Overly weak controls create unmanaged risk, while overly rigid controls can generate workarounds or superficial compliance. RAIDT handles this limitation by asking not only whether controls existed, but whether they were appropriate, evidenced, followed, and open to review and improvement over time.

Implementation levels

Manual implementation

A researcher or small team can implement control manually through approved prompt sheets, written run protocols, simple configuration checklists, version-labelled templates, and recorded reviewer decisions. Even a spreadsheet-based control register can materially improve governance if it captures what was allowed, what was used, and who approved exceptions.

Semi-automated implementation

Semi-automated implementation can use structured metadata, form-based run submission, prompt registries, model selection controls, retrieval whitelists, and templated review checkpoints. This reduces omission risk and makes evidence collection more consistent across runs.

Fully automated implementation

At scale, control can be implemented through platform wrappers, orchestration layers, policy engines, permission-aware tool routing, immutable logging, dashboard-based approval states, and governance pipelines that enforce configuration rules before a run is executed. In this form, the system itself helps ensure that high-risk uses cannot proceed without the required controls and evidential capture.

Practical use in the RAIDT project

Within the RAIDT project, control is useful in several ways. In Paper 08 Foundations, it helps define how governance concepts become operational at the run level rather than remaining conceptual. In Paper 09 Empirical Validation, it can be examined through real runs to show whether stronger control corresponds to stronger evidence packs, more consistent scoring, or improved reviewer confidence. In Paper 10 Policy Pathways, it provides a bridge between policy aspiration and implementable governance mechanisms.

It is also directly relevant to sector playbooks, because different domains will operationalise control differently depending on risk, regulation, and workflow. In the evidence pack, control helps determine what artefacts must be collected. In the scoring rubric, it influences how governance quality is judged. In supervisor explanation, viva defence, and journal positioning, it helps show that RAIDT is not just another principles framework; it is a method for evidencing whether governance actually shaped a run.

Key audience questions to prepare for

Q1. Is control just another word for compliance?

No. Compliance is one possible outcome or frame, but control is broader. It concerns the actual mechanisms and constraints that shape the run, whether they arise from policy, risk management, quality assurance, or workflow design. RAIDT asks for evidence that those mechanisms were active in the run.

Q2. Why focus on control at the run level rather than system level?

Because many failures and governance gaps appear in specific uses rather than in abstract system descriptions. A model may be approved at system level yet still be used in an uncontrolled way in practice. RAIDT therefore inspects the run as the unit where governance either holds or fails.

Q3. Can a highly controlled run still be wrong?

Yes. Control improves governance discipline, not substantive infallibility. A controlled run may still contain hallucinations, bias, or task mismatch. What control adds is a defensible basis for review, reconstruction, and improvement.

Q4. Does stronger control always mean stricter restriction?

Not necessarily. Good control is proportionate. Low-risk uses may permit wider discretion, while high-risk uses require tighter boundaries. The key issue is whether the level of control is justified, documented, and reviewable.

Q5. How would you show control to an examiner or auditor?

By presenting run-level artefacts: approved prompt version, configuration records, model and tool settings, retrieval scope, user permissions, reviewer actions, any deviations or overrides, and a clear mapping from these artefacts to the governance judgement made in the evidence pack and score profile.

Suggested citation concepts to support this item
Short explanation for presentation

Control in RAIDT means that a GenAI run is not treated as a free-form interaction but as a governed event with defined constraints. The key point is that control must be visible in evidence. That includes approved prompt versions, model and tool settings, retrieval boundaries, reviewer checkpoints, and records of deviations or overrides. This matters because organisations often claim to govern AI through policies, but policy alone does not show that a particular run was actually bounded in practice. RAIDT makes control operational by tying it to run-level evidence, the evidence pack, and the score profile. In supervision or viva terms, control helps explain how RAIDT moves from principle-based governance to inspectable governance that is reviewable, contestable, and capable of supporting continuous organisational learning.

One-line takeaway

Control is the evidence-backed constraint of a GenAI run because RAIDT governs not just AI systems in general, but specific runs in specific organisational contexts.

Related items in governance meaning and problem context
Anchored questions

No anchored questions were present in the source item.

Mentioned in reference-paper summaries (5)

Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.

Powered by Forestry.md