S11.04 - Proportionality
S11.04 ? Proportionality
flowchart LR
A[One-size-fits-all governance fails] --> B[RAIDT - run-level evidence framework]
A2[Low-risk uses can be over-burdened] --> B
A3[High-stakes uses can be under-governed] --> B
B --> C[[Proportionality - calibrates governance effort to each run]]
H1[Healthcare discharge summaries] --> C
H2[Public-service casework] --> C
H3[Educational feedback] --> C
H4[Internal drafting] --> C
C --> D[Right-sized evidence pack]
C --> E[Meaningful RAIDT score profile]
C --> F[Reviewer reconstruction]
C --> G[Governance readiness]
D --> I[Audit readiness]
E --> J[Reviewability and contestability]
F --> K[Organisational learning]? Star S11 - Boundaries, Limitations and Future Questions
Star context: Prevents overclaiming and explains what RAIDT can and cannot solve by matching governance effort to the seriousness, context, and consequences of each run.
Academic picture
Definition / background
Proportionality means that the depth, form, and scrutiny of governance evidence should match the level of risk, consequence, and organisational dependence associated with a particular run of a generative AI system. In RAIDT, this is not merely a policy preference; it is a design principle for deciding how much documentation, review, justification, and follow-up should accompany a run-level evidence pack.
The concept has roots in law, regulation, risk management, and public administration, where interventions are expected to be commensurate with the seriousness of the issue being addressed. In AI governance, proportionality matters because generative AI is used across tasks with very different stakes. An internal brainstorming prompt, a citizen-facing decision support tool, and a clinical summarisation workflow should not all be governed identically. Treating them as equivalent either creates wasteful process or leaves important risks unmanaged.
Proportionality differs from simple minimalism. It does not mean capturing as little evidence as possible. It means capturing enough evidence to justify trust, enable review, and support accountability for the actual use context. For RAIDT, that distinction is crucial because the framework is built around the run as the unit of governance. The relevant question is not whether a model is generally risky in the abstract, but what happened in this configured use, at this time, for this task, in this organisational setting.
This is why proportionality belongs centrally within RAIDT. Run-level evidence packs and five-pillar score profiles are only useful if their depth reflects the significance of the run they describe. A proportionate approach keeps RAIDT credible as a governance method: serious enough to support audit readiness and contestability, but flexible enough to be deployable in real organisational work.
Why this concept matters
Without proportionality, AI governance frameworks often fail in one of two ways. They either become too generic to guide action, or they become too burdensome to use consistently. RAIDT is explicitly designed to avoid both outcomes. Proportionality helps translate governance ambition into a usable operating logic for real tasks, teams, and decision environments.
This concept matters because organisations need a defensible basis for deciding when light-touch oversight is acceptable and when enhanced evidence capture is necessary. It helps prevent confusion between low-stakes experimentation and high-stakes operational reliance. It also avoids the false reassurance that a standard template or a single policy statement is sufficient for every case.
If proportionality is missing, three risks appear. First, staff may resist RAIDT as administratively excessive. Second, high-impact uses may be governed too weakly because the same limited process is applied everywhere. Third, review bodies may be unable to see why a run was treated as acceptable, especially when harm, challenge, or audit occurs later.
Key idea: Proportionality makes RAIDT usable and credible by matching governance effort to the actual stakes of each run.
What this item controls
- How much run-level evidence must be captured for a given use case.
- How much human review, sign-off, or escalation is required before use.
- How much justification is needed to explain model choice, prompt design, safeguards, and intended use.
- How detailed the evidence pack should be for later reconstruction and challenge.
- How scoring under the five RAIDT pillars should be interpreted in relation to risk and context.
- How organisations distinguish exploratory, advisory, and consequential uses of generative AI.
- How governance resources are allocated so that attention is concentrated where harm or reliance is greatest.
Practical example / likely audience question
Audience question
Is RAIDT too heavy?
Answer
Audience question: Is RAIDT too heavy? Answer: not if implemented proportionately according to task risk.
The concern behind the question is understandable. Many AI governance schemes appear to assume that every use of a model should produce the same volume of records, approvals, and review materials. That becomes impractical in organisations where generative AI is used across a wide range of tasks, from low-stakes drafting support to high-stakes decision preparation.
The direct answer is that RAIDT is not intended to be uniformly heavy. Its run-level structure allows governance effort to scale with the seriousness of the task. A low-risk internal ideation run may need basic metadata, prompt and output capture, and a short note on intended use. By contrast, a run that informs a welfare eligibility recommendation or a clinical summary would require stronger evidence, clearer reviewer roles, and a more developed explanation of safeguards, limitations, and verification.
RAIDT handles this better than a generic AI governance approach because it does not rely only on abstract policy tiers or model-level claims. It ties proportionality to evidence at the level where work actually happens: the individual run. That makes the justification inspectable. A reviewer can see not only that a lighter or heavier process was used, but why that level of governance was judged appropriate.
Practical example in RAIDT terms
Consider a healthcare organisation using a generative AI system to draft discharge summaries for clinicians. The use case is not fully automated decision-making, but it sits close to patient safety and continuity of care. The run-level issue is therefore not simply whether the model can generate fluent text; it is whether this particular run, for this patient context, was properly bounded, checked, and documented.
A proportionate RAIDT approach would require stronger evidence than would be expected for ordinary internal brainstorming. The evidence pack might include the task purpose, user role, model and version, prompt template, source data boundaries, human review confirmation, known error modes, and a record of whether any material corrections were needed before the summary was used. The most affected pillars would be Responsibility, Dependability, and Traceability, with Auditability also becoming important because later reconstruction may be necessary if a concern arises.
Proportionality improves governance readiness here because it justifies enhanced scrutiny without claiming that every healthcare-related run must be treated identically. A low-risk educational mock exercise for staff training and a live patient-facing summary workflow would sit at different evidence depths even if they use similar tools. RAIDT makes that distinction visible and defensible.
Detailed link to RAIDT
Proportionality links to RAIDT in four ways.
First, it supports RAIDT's core idea that governance should move from broad principle statements to inspectable evidence tied to actual use.
Second, it operates at the level of the run, because the amount of assurance required depends on the task, context, users, and consequences of that particular configured use.
Third, it shapes both the evidence pack and the score profile by determining what should be captured, how deeply it should be assessed, and how scores should be interpreted.
Fourth, it strengthens reviewability, contestability, audit readiness, and organisational learning by making the reasoning for lighter or heavier governance visible after the fact.
Proportionality ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
In this chain, proportionality is the calibration logic. It prevents the evidence pack from becoming either superficial or unnecessarily bloated, and it makes the resulting score profile more meaningful because the assessment is tied to the stakes of the run rather than to a one-size-fits-all template.
Link to the five RAIDT pillars
Responsibility
Proportionality helps define what level of human responsibility is needed for a run and what justification is expected from those deploying it. Higher-stakes uses require clearer ownership, stronger role definition, and more explicit acceptance of residual risk.
Example evidence / implication:
- Named accountable owner for the run and its outcome context.
- Rationale for why the level of review and sign-off was appropriate to the stakes.
Auditability
Proportionality determines how reconstructable a run needs to be. Not every run requires the same evidential granularity, but consequential runs require enough detail for later scrutiny, challenge, and explanation.
Example evidence / implication:
- Retention of prompts, outputs, configuration details, and reviewer notes for higher-risk runs.
- Clear statement of why the recorded evidence was judged sufficient for future review.
Interpretability
Proportionality affects how much explanation is needed about system behaviour, limitations, and user understanding. Where users may rely heavily on outputs, stronger interpretive support is needed.
Example evidence / implication:
- Notes on expected failure modes and user guidance for interpreting outputs.
- Explanation of when outputs should be treated as advisory rather than decision-determinative.
Dependability
This item strongly affects Dependability because higher-risk runs need stronger confidence that outputs are suitably checked, bounded, and reliable for the intended task.
Example evidence / implication:
- Verification steps proportionate to likely harm if the output is wrong.
- Record of fallback procedures when model output quality is uncertain or unstable.
Traceability
Proportionality shapes how far a run must be traceable back to inputs, settings, decision points, and post-run actions. This matters especially when outcomes may later be disputed.
Example evidence / implication:
- Trace of model version, prompt template, date, operator, and downstream use.
- Escalated logging for runs likely to influence external-facing or high-impact actions.
Proportionality affects all five pillars, but it has especially strong practical consequences for Responsibility, Dependability, and Traceability because these pillars are most sensitive to the seriousness of real-world use.
Why this item is more than a generic concept
In general AI governance, proportionality often means little more than the claim that controls should vary by risk category. That is a useful starting point, but it remains too abstract to govern day-to-day generative AI use. Categories alone do not show what evidence should exist for a specific deployment moment.
In RAIDT, proportionality becomes operational because it is tied to run-level evidence. The framework asks what evidence this run requires, what level of reconstruction should be possible later, what review burden is justified, and how the resulting score profile should be interpreted. That makes proportionality inspectable, not merely aspirational.
Common misunderstanding
Misunderstanding
Proportionality means lowering governance expectations so that organisations can use generative AI more easily.
Correction
Proportionality does not mean relaxing standards indiscriminately. It means applying the right level of governance to the actual seriousness of the run. For example, a team using a model to generate rough meeting titles does not need the same evidence depth as a team using a model to prepare case summaries for vulnerable service users. In the first case, minimal metadata and ordinary human judgment may be enough. In the second, stronger traceability, review, and justification are required. The principle is not convenience first; it is commensurate assurance.
Boundary and limitation
Proportionality does not prove that a run is safe, lawful, fair, or effective. It only helps determine how much evidence and scrutiny are appropriate for the context. If the underlying risk assessment is weak, proportionality can be misapplied and used to justify under-documentation. It also cannot remove the need for domain-specific obligations such as privacy, professional standards, or sector regulation.
Within RAIDT, this limitation is handled by combining proportionality with explicit run definition, evidence pack structure, and pillar-based scoring. In other words, proportionality must be anchored to a defensible account of task significance, affected stakeholders, and likely consequences. It is therefore a calibration principle, not a substitute for governance judgement.
Implementation levels
Manual implementation
A researcher or small team can apply proportionality manually by classifying the run context, noting likely consequences of error, and deciding what minimum evidence must be captured before the run is accepted for use. This can be done with structured templates and supervisor review.
Semi-automated implementation
Semi-automated implementation can use metadata fields, run-intake forms, and rule-based templates that expand or reduce evidence requirements depending on the declared use case, domain, user role, and risk characteristics.
Fully automated implementation
At scale, a governance platform or orchestration layer can automatically assign evidence depth, retention rules, approval workflows, and logging requirements based on run attributes. A dashboard can then show whether the evidence pack and score profile are proportionate to the declared context and whether escalation thresholds were triggered.
Practical use in the RAIDT project
Within the wider RAIDT project, proportionality helps explain why the framework is intended for real organisational adoption rather than as a purely theoretical model. In Paper 08 Foundations, it can justify the conceptual claim that run-level governance must be calibrated rather than uniform. In Paper 09 Empirical Validation, it offers a basis for examining whether users and reviewers see the framework as workable across tasks of differing stakes. In Paper 10 Policy Pathways, it helps position RAIDT as compatible with risk-based and context-sensitive governance expectations without collapsing into vague principle language.
The concept is also useful across sector playbooks because proportionality explains why the same RAIDT architecture can operate in healthcare, education, public services, law, cybersecurity, and enterprise productivity while still demanding different evidence depths. For viva defence and supervisor discussion, it is a strong answer to concerns that RAIDT may be either too burdensome or too weak. For journal positioning, it helps differentiate RAIDT from governance models that stay at the level of static policy statements or model-level claims.
Key audience questions to prepare for
Q1. How does proportionality stop RAIDT from becoming administratively excessive?
It allows evidence depth, review burden, and retention requirements to scale with the seriousness of the run. Low-risk uses can be documented lightly, while consequential uses receive stronger scrutiny.
Q2. Does proportionality make RAIDT subjective?
It introduces judgement, but not unchecked subjectivity. In RAIDT, the judgement is tied to explicit run characteristics, recorded reasoning, and inspectable evidence, which makes the calibration open to review and challenge.
Q3. Why is run-level proportionality better than model-level risk classification alone?
Because the same model may be used in very different contexts. Model-level classification cannot fully explain what evidence is needed for a specific task, user, and consequence environment at a particular moment.
Q4. Can proportionality be misused to justify weak governance?
Yes, if organisations understate risk or fail to record their reasoning. RAIDT addresses this by requiring the calibration decision itself to be visible in the evidence pack and reflected in the score profile.
Q5. What is the main contribution of proportionality to governance readiness?
It makes governance operationally sustainable. Organisations can apply stronger controls where needed without treating every run as equally consequential, which improves both compliance credibility and adoption feasibility.
Suggested citation concepts to support this item
- proportionality in AI governance
- risk-based governance for generative AI
- evidence-based assurance for high-stakes AI use
- run-level accountability in sociotechnical systems
- commensurate oversight in algorithmic governance
- audit readiness and documentation burden in AI systems
- context-sensitive assurance for automated decision support
- governance scalability for organisational AI deployment
- human oversight calibration in generative AI workflows
- traceability requirements by risk severity
Short explanation for presentation
Proportionality is the principle that RAIDT should require evidence in line with the stakes of the specific run being governed. The key point is that generative AI uses are not all alike: some are low-risk support tasks, while others sit close to decisions, services, or harms. RAIDT therefore does not assume one fixed level of documentation for every case. Instead, it calibrates evidence capture, review, and justification to the seriousness of the task, the likely consequences of error, and the level of organisational reliance on the output. This matters because it keeps the framework practical without making it superficial. In supervisory and viva terms, proportionality is the answer to the question of whether RAIDT is too heavy: it is only as demanding as the run requires, but it remains evidentially defensible.
One-line takeaway
Proportionality is the principle of matching governance effort to run-specific stakes because RAIDT turns that judgement into inspectable evidence.
Related items in boundaries, limitations and future questions
Anchored questions
No anchored questions were present in the original item.
Mentioned in reference-paper summaries (1)
Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.
REF-035__European-2024.md