S4.16 - Review_decision_and_reviewer_notes

S4.16 ? Review decision and reviewer notes

flowchart LR
    A[Traditional limitation:
output retained but review reasoning missing] --> B[RAIDT:
run-level evidence framework] B --> C[[S4.16 Review decision
and reviewer notes]] H[Decision category
reviewer role
timestamp
rationale note
escalation link] --> C C --> D[Evidence pack] C --> E[RAIDT score profile] C --> I[Reviewer reconstruction] D --> F[Reviewability and contestability] E --> G[Governance readiness] I --> J[Organisational learning and policy alignment]

? Star S4 - Evidence Architecture and Artefacts

Star context: Defines the concrete review fields and human oversight artefacts that make a RAIDT run record inspectable, challengeable, and governable rather than merely logged.


Academic picture
Definition / background

Review decision and reviewer notes record the formal human judgement applied to a generated output at the point of oversight. The item captures both the decision itself, such as accepted, accepted with edits, rejected, escalated, or linked to an incident, and the reviewer notes that explain the grounds for that decision. In governance terms, this is the evidential bridge between model behaviour and organisational action.

Conceptually, this item draws on established practices from quality assurance, editorial control, safety review, audit trails, and professional sign-off. Those traditions distinguish between what a system produced and what an authorised human decided to do with it. RAIDT brings that distinction into generative AI governance by making the review outcome a first-class part of the run record rather than a separate, informal, or forgotten step.

This item is not the same as the generated output, the output hash, or the final business outcome. The output shows what the model produced. The review decision shows how that output was treated. The reviewer notes explain why. That distinction matters because governance failures often arise not only from poor outputs, but also from weak review, inconsistent judgement, or absent documentation of human intervention.

Within RAIDT, the item belongs inside run-level evidence because the framework treats each run as the unit that must be reviewable, contestable, and reconstructable. It strengthens the evidence pack by showing how oversight operated in practice, and it informs the score profile by indicating whether responsibility, auditability, and traceability were actually supported by recorded human judgement.

Why this concept matters

Organisations often claim that a human reviewed AI output, but without a recorded decision and notes that claim remains weak. It is difficult to distinguish genuine oversight from a nominal sign-off, difficult to understand why an output was allowed through, and difficult to learn from near misses or harmful failures. This item solves that problem by turning review from an assertion into evidence.

It also avoids a common confusion in GenAI governance: the assumption that keeping the model output is enough. In reality, oversight is not visible from the output alone. A clean final document may conceal substantial reviewer edits, a serious concern, or an escalation path. Review records therefore preserve the organisational reasoning that sits between machine generation and operational use.

If this item is missing, several risks appear at once: inconsistent approval decisions across teams, poor defensibility during audit, weak contestability when users challenge outcomes, and limited organisational learning after incidents or complaints. RAIDT addresses these risks by locating review decisions inside the run record itself, where they can be connected to the wider evidence architecture.

Key idea: Review decision and reviewer notes matter because RAIDT treats human oversight as evidence that must be inspectable at run level, not as an informal claim made after the fact.

What this item captures
Practical example / likely audience question

Audience question

Why are reviewer notes part of the evidence pack rather than just internal working notes?

Answer

The concern behind this question is the belief that evidence should be limited to technical artefacts such as prompts, model identifiers, and outputs. That view is too narrow for responsible GenAI governance. A run becomes organisationally meaningful only when somebody decides whether the generated output is safe, accurate enough, policy-compliant, and fit for use. If that judgement is undocumented, a critical part of governance disappears from the record.

In RAIDT, reviewer notes are evidence because they document how oversight was actually exercised. Suppose a reviewer receives a generated draft policy summary and notices that the system omitted an important legal exception. The reviewer edits the text, marks the decision as accepted with edits, and records a note stating that the omission would otherwise have created compliance risk. That note is not incidental. It explains the governance intervention that changed the risk profile of the run.

RAIDT handles this better than a generic AI governance approach because it ties the review decision to the same run-level record as the prompt version, model configuration, and output trace. Rather than saying only that human review exists in principle, RAIDT shows how it occurred in a specific run, by a specific role, for a specific reason.

Practical example in RAIDT terms

Consider a healthcare trust using a GenAI assistant to draft discharge instructions after a clinician has completed the core medical record. In one run, the model generates instructions that incorrectly simplify a medication timing requirement. A pharmacist reviewer checks the draft before release.

The run-level issue is not only that the output contained a potentially harmful simplification, but also that the organisation must be able to show how the error was detected and addressed. The relevant evidence includes the output itself, the output hash, the reviewer role, the review decision of accepted with edits, the reviewer note stating that dosage timing was clinically ambiguous, and any escalation or incident linkage if the issue suggests a wider pattern.

This directly affects RAIDT pillars. Responsibility is implicated because a named review role applied professional judgement. Auditability is strengthened because the rationale is recorded. Dependability is improved because the error was corrected before deployment. Traceability is preserved because the decision can be linked to the precise run. In governance-readiness terms, this item shows that the organisation can demonstrate not only generation, but safe intervention and accountable approval.

Detailed link to RAIDT

Review decision and reviewer notes links to RAIDT in four ways.

First, it connects directly to RAIDT's core idea that governance should rest on inspectable evidence rather than broad assurances about responsible use.
Second, it links to the run because the review outcome is part of what happened in that specific configured use of the system, at that specific time, in that specific context.
Third, it enriches the evidence pack and informs the score profile by showing whether meaningful human oversight was documented and how review judgements were made.
Fourth, it supports reviewability, contestability, audit readiness, and organisational learning by making the human reasoning around a run visible and reconstructable.

Review decision and reviewer notes ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

Link to the five RAIDT pillars

Responsibility

This item is strongly relevant to Responsibility because it records who exercised oversight and how that responsibility was enacted in practice.

Example evidence / implication:

Auditability

This item is strongly relevant to Auditability because auditors need to understand not just what the system produced, but how reviewers responded to it.

Example evidence / implication:

Interpretability

This item contributes indirectly to Interpretability by revealing how human reviewers interpreted the adequacy, clarity, or meaning of outputs in context.

Example evidence / implication:

Dependability

This item contributes materially to Dependability because dependable systems require evidence that unsafe or unreliable outputs were identified and handled appropriately.

Example evidence / implication:

Traceability

This item is strongly relevant to Traceability because it links a final organisational action back to a specific run and its oversight pathway.

Example evidence / implication:

Why this item is more than a generic concept

In general AI governance, review decisions are often described vaguely as part of human-in-the-loop oversight. In RAIDT, the concept is narrower, more operational, and more evidential. It refers to a structured run-level field that records the outcome of review and the accompanying rationale in a form that can be inspected, compared, challenged, and used for governance scoring.

That RAIDT meaning is more operational because it does not stop at saying that humans review outputs. It asks what was decided, why it was decided, how it was documented, and whether that record can support audit, contestation, and improvement. The item therefore converts a broad governance principle into a concrete evidence artefact.

Common misunderstanding

Misunderstanding

If the final approved output is stored, reviewer notes are unnecessary because the organisation already knows what was used.

Correction

Storing only the final output shows the endpoint, not the oversight pathway. For example, a reviewer may have removed a fabricated statistic, softened a risky recommendation, or escalated a potentially harmful instruction before release. Without the review decision and notes, that intervention disappears from the evidence record. RAIDT treats this as a governance loss because the organisation can no longer explain how human judgement shaped the final artefact.

Boundary and limitation

This item does not prove that the review decision was correct, fair, or sufficiently expert. It records that a judgement was made and explains the stated rationale, but reviewer notes can still be incomplete, biased, inconsistent, or overly brief. Nor does it replace domain assurance processes, legal compliance checks, or substantive evaluation of output quality.

Its value depends on structured capture, reviewer discipline, and meaningful review criteria. If reviewer notes are unstandardised or purely performative, the evidence quality will be weak. RAIDT handles this limitation by treating the item as one part of a broader evidence architecture: it gains strength when connected to role data, timestamps, output hashes, policy identifiers, escalation pathways, and downstream scoring rules.

Implementation levels

Manual implementation

A researcher or small team can apply this item manually by requiring each reviewed run to include a simple decision label and a short free-text note explaining the reason for acceptance, editing, rejection, or escalation.

Semi-automated implementation

A semi-automated approach can use structured templates, review forms, drop-down decision categories, rationale codes, and mandatory note fields so that review records are easier to compare across runs and teams.

Fully automated implementation

At scale, a platform or governance pipeline can bind reviewer decisions to run IDs automatically, capture role and timestamp metadata, link notes to incidents or tickets, surface patterns in dashboards, and feed decision statistics into RAIDT evidence packs and score-profile generation.

Practical use in the RAIDT project

This item is useful across the RAIDT project because it gives a concrete example of how principles of responsible oversight become operational evidence. In Paper 08 Foundations, it helps define the evidential role of human review within the run-level model. In Paper 09 Empirical Validation, it provides an observable field for comparing governance maturity across cases. In Paper 10 Policy Pathways, it supports arguments about audit readiness, contestability, and accountability in institutional deployment. It is also useful in sector playbooks, the evidence-pack design, the scoring rubric, and viva defence because it gives supervisors and reviewers a precise answer to the question, "How do you know human oversight actually happened in a governable way?"

Key audience questions to prepare for

Q1. Does this item record only approval, or also uncertainty and disagreement?

It should record more than simple approval. A useful RAIDT implementation captures rejection, editing, escalation, and notes about uncertainty or disagreement because those states are often the most informative for governance and learning.

Q2. Why is a reviewer note needed if there is already a decision category?

A category alone shows the outcome, but not the reasoning. The note explains what concern triggered the decision, which makes later audit, challenge, and pattern analysis possible.

Q3. Is this mainly a compliance field or a safety field?

It is both, but not only those. The item supports compliance, safety, accountability, and learning because it records how human judgement mediated the use of generated content in context.

Q4. Could reviewer notes become burdensome in high-volume environments?

Yes, if poorly designed. That is why structured categories, templates, and proportionate note requirements matter. RAIDT does not require maximal prose; it requires sufficient evidence for reconstruction and governance.

Q5. What does this item add beyond a claim that there was human oversight?

It adds inspectable proof of how that oversight was exercised in a specific run. This is the difference between policy language and evidential governance.

Suggested citation concepts to support this item
Short explanation for presentation

Review decision and reviewer notes capture the human judgement applied after a GenAI system produces an output. In RAIDT, that matters because governance is not only about what the model generated, but also about how the organisation responded to it. A recorded decision such as accepted, edited, rejected, or escalated, together with the reviewer note explaining why, turns oversight into evidence rather than assertion. This strengthens the run-level evidence pack, supports scoring across the RAIDT pillars, and improves audit readiness, contestability, and organisational learning. Without this item, a team may be able to show the output, but not the reasoning that determined whether that output was safe, suitable, or policy-compliant for actual use.

One-line takeaway

Review decision and reviewer notes is the run-level record of human oversight outcome and rationale because RAIDT makes review judgement inspectable as evidence, not merely assumed as process.

Related items in evidence architecture and artefacts
Anchored questions
Powered by Forestry.md