S2.08 - Reconstructability

S2.08 ? Reconstructability

flowchart LR
    A[Traditional governance limitation
Outputs exist but run context is incomplete] --> B[RAIDT
Run-level evidence framework]
    B --> C[[Reconstructability
Rebuild the specific GenAI run in evidence terms]]
    H[Healthcare, finance, public services,
enterprise workflows, logging tools] --> C
    C --> D[Evidence pack]
    C --> E[RAIDT score profile]
    C --> F[Reviewer reconstruction and contestability]
    C --> G[Organisational learning and audit readiness]
    C --> I[Governance move:
evidence over assertion]

? Star S2 - Governance Meaning and Problem Context

Star context: Clarifies governance as oversight, control, accountability, reviewability, contestability and continuous improvement rather than a vague ethics label. In RAIDT, reconstructability makes that governance stance operational by showing whether a specific GenAI run can be rebuilt in evidence terms for later scrutiny.

Academic picture

Definition / background

Reconstructability means that a GenAI run can be rebuilt in evidence terms after the event. In RAIDT, this means a reviewer can recover enough structured information to understand what was input, how the system was configured, what data or retrieval context shaped the response, what the model generated, what checks were applied, what human decisions were made, and what evidence was retained. The concept is therefore practical rather than merely descriptive: it concerns the real ability to reconstruct a run, not just the aspiration to document one.

Conceptually, reconstructability sits close to auditability, traceability, reviewability, and provenance, but it is not identical to any of them. Auditability is the broader governance capacity to support formal inspection. Traceability is the ability to follow links across actors, systems, artefacts, and decisions. Provenance concerns origin and lineage. Reconstructability is narrower and more operational: it asks whether a specific run can be rebuilt well enough for meaningful scrutiny. In that sense, reconstructability is one of the practical conditions that makes auditability work.

This matters especially in generative AI governance because GenAI outputs are contingent on time, prompts, model versions, retrieval context, settings, workflow design, and human intervention. Without reconstructability, organisations may know that a problematic output occurred but remain unable to explain why. That weakness undermines incident review, responsibility assignment, regulatory response, and process improvement.

Reconstructability belongs centrally inside RAIDT because RAIDT treats the run as the unit of governance. A run-level evidence framework must make each governed event sufficiently recoverable for later review. Reconstructability therefore links directly to the run-level evidence pack and indirectly to the five-pillar score profile. If a run is poorly reconstructable, this lowers confidence in governance claims across multiple pillars, especially Auditability and Traceability, but also Responsibility, Interpretability, and Dependability.

Why this concept matters

Reconstructability solves a basic governance problem: organisations often want accountability after a GenAI-supported action, yet they have not retained enough evidence to understand what actually happened. In such cases, governance becomes rhetorical. Policies may exist, but post hoc review is weak because reviewers cannot rebuild the run in a disciplined way.

The concept also avoids a common confusion between having some logs and having usable reconstructive evidence. A system may retain timestamps and outputs but still fail to preserve prompts, retrieved passages, model version, user role, approval history, or validation checks. That kind of partial record creates a false sense of assurance. RAIDT treats reconstructability as a practical threshold for reviewability rather than a generic record-keeping ideal.

If reconstructability is missing, several risks appear. Contested outputs cannot be meaningfully examined. Errors become difficult to attribute to model behaviour, workflow design, human misuse, or data conditions. Audit preparation becomes expensive and reactive. Organisational learning is weakened because teams cannot compare runs in a structured way. In high-stakes settings, this can also undermine legitimacy when decisions affect patients, clients, employees, citizens, or regulated processes.

For organisations using GenAI, reconstructability helps move governance from principles to operational control. It allows questions such as "What happened in this run?", "What evidence supports that account?", and "What should change next time?" to be answered from retained evidence rather than recollection.

Key idea: Reconstructability matters because responsible GenAI governance depends on being able to rebuild a specific run well enough for review, challenge, and learning.

What this item enables

Rebuilding a specific GenAI run in evidence terms rather than relying on memory or informal explanation.
Distinguishing between prompt effects, system configuration effects, retrieval effects, and human workflow effects.
Supporting reviewability when a supervisor, auditor, manager, or regulator asks how an output was produced.
Making contestability practical by giving affected parties something concrete to examine and challenge.
Improving the evidential quality of the RAIDT evidence pack.
Strengthening the defensibility of RAIDT pillar scoring because scores can be tied to retained run evidence.
Enabling continuous improvement through comparison of problematic, typical, and exemplary runs.
Reducing the gap between governance policy language and operational governance practice.

Practical example / likely audience question

Audience question

What is the difference between auditability and reconstructability?

Answer

The concern behind this question is that the two terms appear to overlap, and in many governance discussions they are treated as interchangeable. The direct answer is that auditability is the broader governance capability to support inspection, whereas reconstructability is the practical ability to rebuild what happened in a specific run. In other words, auditability is the wider institutional condition; reconstructability is one of the concrete capacities that auditability relies on.

A useful practical example is a GenAI-supported drafting workflow in a regulated organisation. An internal reviewer may want to audit a decision-support output. If the organisation has a policy, role definitions, and an audit procedure, it may claim auditability at a governance level. But if it cannot recover the prompt, retrieval sources, model version, approval checks, and final edited output for the disputed run, then reconstructability is weak. The audit process exists in name, yet the practical evidential basis for the audit is missing.

RAIDT handles this better than a generic AI governance approach because it asks the question at the run level. Rather than stopping at policy statements such as "all AI activity is logged", RAIDT asks whether this specific run can be reconstructed from retained evidence. That shift from organisational assertion to run-level evidence is exactly why reconstructability is a useful distinction within the framework.

Practical example in RAIDT terms

Consider a healthcare administrative setting where a GenAI tool is used to draft a patient discharge summary from clinician notes and hospital records. The run-level issue is that a senior reviewer later notices a medication instruction that may not match the source notes. To govern the case properly, the organisation needs evidence showing the original prompt, the clinical notes available to the system, any retrieval layer or template used, the model and configuration, the generated draft, the clinician edits, and the approval checkpoint before release.

In RAIDT terms, reconstructability determines whether that run can be rebuilt as a reviewable event. The evidence needed would include prompt text, metadata about the model version and time of use, source-document identifiers, retrieval or context snippets, output versions, user actions, and records of human verification. The most affected pillars are Auditability, Traceability, and Dependability, with important implications for Responsibility and Interpretability as well.

If this evidence exists, the organisation can establish whether the error came from source ambiguity, retrieval failure, generation behaviour, or insufficient human checking. If it does not exist, governance readiness is weakened because the run cannot be examined in a disciplined way. Reconstructability therefore improves not only technical diagnosis but also oversight quality, accountability, and learning for future deployments.

Detailed link to RAIDT

Reconstructability links to RAIDT in four ways.

First, it supports RAIDT's core idea that governance should focus on the run as the unit of accountable analysis rather than on abstract claims about a system in general.

Second, it links directly to run-level evidence because reconstructability asks whether the evidence retained for a specific run is sufficient to rebuild what happened.

Third, it shapes the quality of both the evidence pack and the score profile. A well-reconstructed run makes the evidence pack more credible and allows scoring judgements to be justified from observable evidence rather than assumption.

Fourth, it underpins reviewability, contestability, audit readiness, and organisational learning. A contested or problematic output can only be reviewed seriously if the underlying run can be reconstructed with enough fidelity.

Reconstructability ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

Within RAIDT, reconstructability is therefore not an optional documentation preference. It is a governance bridge between the occurrence of a run and the organisation's ability to review, defend, contest, improve, and learn from that run later.

Link to the five RAIDT pillars

Responsibility

Reconstructability supports Responsibility by making it clearer who configured, reviewed, approved, or relied on a run. It does not by itself prove that responsibilities were well assigned, but it makes role attribution more evidence-based.