S11.04 - Proportionality

S11.04 ? Proportionality

flowchart LR
    A[One-size-fits-all governance fails] --> B[RAIDT - run-level evidence framework]
    A2[Low-risk uses can be over-burdened] --> B
    A3[High-stakes uses can be under-governed] --> B
    B --> C[[Proportionality - calibrates governance effort to each run]]
    H1[Healthcare discharge summaries] --> C
    H2[Public-service casework] --> C
    H3[Educational feedback] --> C
    H4[Internal drafting] --> C
    C --> D[Right-sized evidence pack]
    C --> E[Meaningful RAIDT score profile]
    C --> F[Reviewer reconstruction]
    C --> G[Governance readiness]
    D --> I[Audit readiness]
    E --> J[Reviewability and contestability]
    F --> K[Organisational learning]

? Star S11 - Boundaries, Limitations and Future Questions

Star context: Prevents overclaiming and explains what RAIDT can and cannot solve by matching governance effort to the seriousness, context, and consequences of each run.


Academic picture
Definition / background

Proportionality means that the depth, form, and scrutiny of governance evidence should match the level of risk, consequence, and organisational dependence associated with a particular run of a generative AI system. In RAIDT, this is not merely a policy preference; it is a design principle for deciding how much documentation, review, justification, and follow-up should accompany a run-level evidence pack.

The concept has roots in law, regulation, risk management, and public administration, where interventions are expected to be commensurate with the seriousness of the issue being addressed. In AI governance, proportionality matters because generative AI is used across tasks with very different stakes. An internal brainstorming prompt, a citizen-facing decision support tool, and a clinical summarisation workflow should not all be governed identically. Treating them as equivalent either creates wasteful process or leaves important risks unmanaged.

Proportionality differs from simple minimalism. It does not mean capturing as little evidence as possible. It means capturing enough evidence to justify trust, enable review, and support accountability for the actual use context. For RAIDT, that distinction is crucial because the framework is built around the run as the unit of governance. The relevant question is not whether a model is generally risky in the abstract, but what happened in this configured use, at this time, for this task, in this organisational setting.

This is why proportionality belongs centrally within RAIDT. Run-level evidence packs and five-pillar score profiles are only useful if their depth reflects the significance of the run they describe. A proportionate approach keeps RAIDT credible as a governance method: serious enough to support audit readiness and contestability, but flexible enough to be deployable in real organisational work.

Why this concept matters

Without proportionality, AI governance frameworks often fail in one of two ways. They either become too generic to guide action, or they become too burdensome to use consistently. RAIDT is explicitly designed to avoid both outcomes. Proportionality helps translate governance ambition into a usable operating logic for real tasks, teams, and decision environments.

This concept matters because organisations need a defensible basis for deciding when light-touch oversight is acceptable and when enhanced evidence capture is necessary. It helps prevent confusion between low-stakes experimentation and high-stakes operational reliance. It also avoids the false reassurance that a standard template or a single policy statement is sufficient for every case.

If proportionality is missing, three risks appear. First, staff may resist RAIDT as administratively excessive. Second, high-impact uses may be governed too weakly because the same limited process is applied everywhere. Third, review bodies may be unable to see why a run was treated as acceptable, especially when harm, challenge, or audit occurs later.

Key idea: Proportionality makes RAIDT usable and credible by matching governance effort to the actual stakes of each run.

What this item controls
Practical example / likely audience question

Audience question

Is RAIDT too heavy?

Answer

Audience question: Is RAIDT too heavy? Answer: not if implemented proportionately according to task risk.

The concern behind the question is understandable. Many AI governance schemes appear to assume that every use of a model should produce the same volume of records, approvals, and review materials. That becomes impractical in organisations where generative AI is used across a wide range of tasks, from low-stakes drafting support to high-stakes decision preparation.

The direct answer is that RAIDT is not intended to be uniformly heavy. Its run-level structure allows governance effort to scale with the seriousness of the task. A low-risk internal ideation run may need basic metadata, prompt and output capture, and a short note on intended use. By contrast, a run that informs a welfare eligibility recommendation or a clinical summary would require stronger evidence, clearer reviewer roles, and a more developed explanation of safeguards, limitations, and verification.

RAIDT handles this better than a generic AI governance approach because it does not rely only on abstract policy tiers or model-level claims. It ties proportionality to evidence at the level where work actually happens: the individual run. That makes the justification inspectable. A reviewer can see not only that a lighter or heavier process was used, but why that level of governance was judged appropriate.

Practical example in RAIDT terms

Consider a healthcare organisation using a generative AI system to draft discharge summaries for clinicians. The use case is not fully automated decision-making, but it sits close to patient safety and continuity of care. The run-level issue is therefore not simply whether the model can generate fluent text; it is whether this particular run, for this patient context, was properly bounded, checked, and documented.

A proportionate RAIDT approach would require stronger evidence than would be expected for ordinary internal brainstorming. The evidence pack might include the task purpose, user role, model and version, prompt template, source data boundaries, human review confirmation, known error modes, and a record of whether any material corrections were needed before the summary was used. The most affected pillars would be Responsibility, Dependability, and Traceability, with Auditability also becoming important because later reconstruction may be necessary if a concern arises.

Proportionality improves governance readiness here because it justifies enhanced scrutiny without claiming that every healthcare-related run must be treated identically. A low-risk educational mock exercise for staff training and a live patient-facing summary workflow would sit at different evidence depths even if they use similar tools. RAIDT makes that distinction visible and defensible.

Detailed link to RAIDT

Proportionality links to RAIDT in four ways.

First, it supports RAIDT's core idea that governance should move from broad principle statements to inspectable evidence tied to actual use.
Second, it operates at the level of the run, because the amount of assurance required depends on the task, context, users, and consequences of that particular configured use.
Third, it shapes both the evidence pack and the score profile by determining what should be captured, how deeply it should be assessed, and how scores should be interpreted.
Fourth, it strengthens reviewability, contestability, audit readiness, and organisational learning by making the reasoning for lighter or heavier governance visible after the fact.

Proportionality ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

In this chain, proportionality is the calibration logic. It prevents the evidence pack from becoming either superficial or unnecessarily bloated, and it makes the resulting score profile more meaningful because the assessment is tied to the stakes of the run rather than to a one-size-fits-all template.

Link to the five RAIDT pillars

Responsibility

Proportionality helps define what level of human responsibility is needed for a run and what justification is expected from those deploying it. Higher-stakes uses require clearer ownership, stronger role definition, and more explicit acceptance of residual risk.

Example evidence / implication:

Auditability

Proportionality determines how reconstructable a run needs to be. Not every run requires the same evidential granularity, but consequential runs require enough detail for later scrutiny, challenge, and explanation.

Example evidence / implication:

Interpretability

Proportionality affects how much explanation is needed about system behaviour, limitations, and user understanding. Where users may rely heavily on outputs, stronger interpretive support is needed.

Example evidence / implication:

Dependability

This item strongly affects Dependability because higher-risk runs need stronger confidence that outputs are suitably checked, bounded, and reliable for the intended task.

Example evidence / implication:

Traceability

Proportionality shapes how far a run must be traceable back to inputs, settings, decision points, and post-run actions. This matters especially when outcomes may later be disputed.

Example evidence / implication:

Proportionality affects all five pillars, but it has especially strong practical consequences for Responsibility, Dependability, and Traceability because these pillars are most sensitive to the seriousness of real-world use.

Why this item is more than a generic concept

In general AI governance, proportionality often means little more than the claim that controls should vary by risk category. That is a useful starting point, but it remains too abstract to govern day-to-day generative AI use. Categories alone do not show what evidence should exist for a specific deployment moment.

In RAIDT, proportionality becomes operational because it is tied to run-level evidence. The framework asks what evidence this run requires, what level of reconstruction should be possible later, what review burden is justified, and how the resulting score profile should be interpreted. That makes proportionality inspectable, not merely aspirational.

Common misunderstanding

Misunderstanding

Proportionality means lowering governance expectations so that organisations can use generative AI more easily.

Correction

Proportionality does not mean relaxing standards indiscriminately. It means applying the right level of governance to the actual seriousness of the run. For example, a team using a model to generate rough meeting titles does not need the same evidence depth as a team using a model to prepare case summaries for vulnerable service users. In the first case, minimal metadata and ordinary human judgment may be enough. In the second, stronger traceability, review, and justification are required. The principle is not convenience first; it is commensurate assurance.

Boundary and limitation

Proportionality does not prove that a run is safe, lawful, fair, or effective. It only helps determine how much evidence and scrutiny are appropriate for the context. If the underlying risk assessment is weak, proportionality can be misapplied and used to justify under-documentation. It also cannot remove the need for domain-specific obligations such as privacy, professional standards, or sector regulation.

Within RAIDT, this limitation is handled by combining proportionality with explicit run definition, evidence pack structure, and pillar-based scoring. In other words, proportionality must be anchored to a defensible account of task significance, affected stakeholders, and likely consequences. It is therefore a calibration principle, not a substitute for governance judgement.

Implementation levels

Manual implementation

A researcher or small team can apply proportionality manually by classifying the run context, noting likely consequences of error, and deciding what minimum evidence must be captured before the run is accepted for use. This can be done with structured templates and supervisor review.

Semi-automated implementation

Semi-automated implementation can use metadata fields, run-intake forms, and rule-based templates that expand or reduce evidence requirements depending on the declared use case, domain, user role, and risk characteristics.

Fully automated implementation

At scale, a governance platform or orchestration layer can automatically assign evidence depth, retention rules, approval workflows, and logging requirements based on run attributes. A dashboard can then show whether the evidence pack and score profile are proportionate to the declared context and whether escalation thresholds were triggered.

Practical use in the RAIDT project

Within the wider RAIDT project, proportionality helps explain why the framework is intended for real organisational adoption rather than as a purely theoretical model. In Paper 08 Foundations, it can justify the conceptual claim that run-level governance must be calibrated rather than uniform. In Paper 09 Empirical Validation, it offers a basis for examining whether users and reviewers see the framework as workable across tasks of differing stakes. In Paper 10 Policy Pathways, it helps position RAIDT as compatible with risk-based and context-sensitive governance expectations without collapsing into vague principle language.

The concept is also useful across sector playbooks because proportionality explains why the same RAIDT architecture can operate in healthcare, education, public services, law, cybersecurity, and enterprise productivity while still demanding different evidence depths. For viva defence and supervisor discussion, it is a strong answer to concerns that RAIDT may be either too burdensome or too weak. For journal positioning, it helps differentiate RAIDT from governance models that stay at the level of static policy statements or model-level claims.

Key audience questions to prepare for

Q1. How does proportionality stop RAIDT from becoming administratively excessive?

It allows evidence depth, review burden, and retention requirements to scale with the seriousness of the run. Low-risk uses can be documented lightly, while consequential uses receive stronger scrutiny.

Q2. Does proportionality make RAIDT subjective?

It introduces judgement, but not unchecked subjectivity. In RAIDT, the judgement is tied to explicit run characteristics, recorded reasoning, and inspectable evidence, which makes the calibration open to review and challenge.

Q3. Why is run-level proportionality better than model-level risk classification alone?

Because the same model may be used in very different contexts. Model-level classification cannot fully explain what evidence is needed for a specific task, user, and consequence environment at a particular moment.

Q4. Can proportionality be misused to justify weak governance?

Yes, if organisations understate risk or fail to record their reasoning. RAIDT addresses this by requiring the calibration decision itself to be visible in the evidence pack and reflected in the score profile.

Q5. What is the main contribution of proportionality to governance readiness?

It makes governance operationally sustainable. Organisations can apply stronger controls where needed without treating every run as equally consequential, which improves both compliance credibility and adoption feasibility.

Suggested citation concepts to support this item
Short explanation for presentation

Proportionality is the principle that RAIDT should require evidence in line with the stakes of the specific run being governed. The key point is that generative AI uses are not all alike: some are low-risk support tasks, while others sit close to decisions, services, or harms. RAIDT therefore does not assume one fixed level of documentation for every case. Instead, it calibrates evidence capture, review, and justification to the seriousness of the task, the likely consequences of error, and the level of organisational reliance on the output. This matters because it keeps the framework practical without making it superficial. In supervisory and viva terms, proportionality is the answer to the question of whether RAIDT is too heavy: it is only as demanding as the run requires, but it remains evidentially defensible.

One-line takeaway

Proportionality is the principle of matching governance effort to run-specific stakes because RAIDT turns that judgement into inspectable evidence.

Related items in boundaries, limitations and future questions
Anchored questions

No anchored questions were present in the original item.

Mentioned in reference-paper summaries (1)

Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.

Powered by Forestry.md