S3.05 - Reconstructability
S3.05 ? Reconstructability
flowchart LR
A[Fragmented records, memory dependence,
incomplete logs after a GenAI run] --> B[RAIDT
run-level evidence framework]
H[Run fields
identifier, prompt version, model config,
retrieved context, output, review, retention] --> C[[Reconstructability
determine what happened from stored evidence]]
B --> C
C --> D[Evidence pack]
C --> E[RAIDT score profile]
C --> F[Reviewer reconstruction]
D --> G[Governance readiness, contestability,
audit readiness, organisational learning]
E --> G
F --> G? Star S3 - Run-Level Evidence Logic
Star context: Explains the proof-object logic inside RAIDT by asking whether a run can be determined from evidence after the event, so that review, comparison, challenge, and governance judgement do not depend on memory or informal explanation.
Academic picture
Definition / background
Reconstructability is the ability to determine what happened in a run from stored records, even if the original user is unavailable. In RAIDT, this is a core evidential requirement inside run-level evidence logic: a run should leave behind enough structured material for another competent reviewer to understand the task, the conditions of use, the output produced, and the human and organisational handling around that output.
Conceptually, reconstructability sits between mere record keeping and full replay. It is stronger than simply having some artefacts, because the artefacts must be sufficient to support meaningful reconstruction. It is different from replayability, because RAIDT does not always require the run to be reproduced exactly; rather, it requires the run to be examinable after the fact. It is also different from comparability, because comparison across runs depends first on each run being reconstructable on its own terms.
Within GenAI governance, reconstructability matters because many organisational questions arise after a run has already happened. A manager, auditor, supervisor, regulator, or affected colleague may need to ask what the user was trying to do, what context was provided, which model configuration was active, what output was generated, what checks were applied, and why the output was accepted, edited, rejected, or escalated. If those questions cannot be answered from evidence, governance remains too dependent on memory, trust, or retrospective narrative.
Inside RAIDT, reconstructability belongs directly to the run-level evidence framework because it determines whether a run can become a credible proof object. It underpins the evidence pack, strengthens the basis of the five-pillar score profile, and helps translate the project?s central commitment to evidence over assertion into an operational governance test.
Why this concept matters
Reconstructability solves a fundamental governance problem: organisations often know that GenAI use should be reviewable, but they do not always know what minimum evidence is needed for review after the event. Without reconstructability, a run may leave traces without leaving enough proof. That creates confusion between having records and having usable evidence.
The concept also prevents a common failure mode in responsible AI practice. An organisation may have policy statements, training material, model documentation, or procurement assurance, yet still be unable to explain one disputed or high-impact run. RAIDT treats this as a serious gap because governance credibility depends on what can be shown when a real case is examined, not only on what was promised in advance.
If reconstructability is weak, several risks follow: incident review becomes partial, disputes become harder to resolve, score profiles become less defensible, and organisational learning is reduced because the run cannot be revisited in a disciplined way. Reconstructability therefore helps move governance from principle-level aspiration to operational evidence.
Key idea: Reconstructability matters because RAIDT can govern a run only if the run can later be determined from evidence rather than recalled from memory.
What this item enables
- It enables after-the-fact determination of what happened in one GenAI run.
- It enables independent review by someone other than the original user or operator.
- It enables a run to function as a proof object rather than as an anecdote.
- It enables evidence-pack assembly from structured run records rather than informal recollection.
- It enables more credible five-pillar scoring because judgements can be traced back to artefacts.
- It enables challenge, contestability, and supervisory questioning when outputs are disputed.
- It enables cross-run learning because comparable cases can be examined on a reconstructable basis.
- It enables retention and review policies to be tied to governance need rather than to ad hoc documentation habits.
Practical example / likely audience question
Audience question
What minimum does reconstructability need?
Answer
The concern behind this question is whether reconstructability is being defined so broadly that it becomes impractical. The direct answer is that reconstructability needs enough evidence to let another reviewer determine what happened in the run without relying on the original user?s memory. In RAIDT terms, that usually means identifiers, prompt/version, model/configuration, retrieved context, output, review and retention data.
A practical example is an enterprise productivity workflow in which a member of staff uses GenAI to draft a client-facing summary. If only the final text is retained, the organisation may know that GenAI was used but may not know which prompt template was used, what source material was supplied, which model version generated the draft, whether retrieval brought in additional context, who reviewed the result, or how long the evidence should remain available. In that situation, reconstructability is weak even though some records exist.
RAIDT handles this issue better than a generic AI governance approach because it defines reconstructability at the run level. It does not ask only whether policies or logs exist in general. It asks whether this specific run can be rebuilt into an evidential account that supports review, challenge, and scoring. That makes the concept both practical and auditable.
Practical example in RAIDT terms
Consider a finance setting in which a bank employee uses a GenAI assistant to draft an internal credit-risk briefing before a lending decision meeting. The GenAI use case is plausible and efficient, but the run-level issue arises when a reviewer later notices that a key risk factor was omitted from the briefing. The governance question is not merely whether the organisation had an AI policy. It is whether this specific run can be reconstructed in enough detail to determine what happened.
The evidence needed would include the run identifier, the prompt or template used, the prompt version, the source financial documents provided, any retrieved contextual material, the model and configuration, the generated draft, the employee?s edits, the reviewer?s annotations, the decision about whether the draft was accepted or rejected, and the retention record showing whether the evidence remains available for review. Responsibility is affected because reviewers need to know who initiated and signed off the run. Auditability is affected because the run must be examinable later. Interpretability is affected because the relationship between instructions, evidence, and output must be intelligible. Dependability is affected because repeated omissions may reveal an unstable process. Traceability is affected because each artefact must connect back to the specific run.
In governance-readiness terms, reconstructability improves the organisation?s position because the disputed case can be analysed as an evidence-based event rather than as a vague recollection. The evidence pack can be assembled credibly, the score profile can be justified, and lessons can be fed back into prompt controls, review thresholds, and retention policy.
Detailed link to RAIDT
Reconstructability links to RAIDT in four ways.
First, it supports RAIDT?s core idea that governance should attach to what happened in one actual use event rather than to abstract claims about systems in general.
Second, it is inseparable from the run because the run is the unit of governance and reconstructability is the test of whether the run remains examinable after completion.
Third, it strengthens both the evidence pack and the RAIDT score profile by ensuring that evidence-based judgement is grounded in recoverable run records.
Fourth, it supports reviewability, contestability, audit readiness, and organisational learning because a reconstructable run can be revisited, questioned, compared, and improved.
Reconstructability ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
Link to the five RAIDT pillars
Responsibility
Reconstructability supports Responsibility by showing whether accountable roles, decisions, and review actions can be tied back to the run rather than left implicit.
Example evidence / implication:
- User, reviewer, approver, or escalation roles linked to the run identifier.
- Evidence showing whether a human accepted, amended, rejected, or relied on the output.
Auditability
This item has a particularly strong effect on Auditability because auditability is weakened immediately if a reviewer cannot reconstruct the sequence and substance of the run.
Example evidence / implication:
- Preserved prompt, inputs, retrieved context, output, timestamps, and review notes.
- Evidence sufficient for an internal auditor, supervisor, or examiner to determine what occurred.
Interpretability
Reconstructability supports Interpretability by preserving enough context to explain how an output emerged in practice, even when the internal model remains only partially interpretable.
Example evidence / implication:
- Prompt wording and task instructions retained alongside the generated output.
- Notes showing why reviewers judged the output appropriate, risky, incomplete, or misleading.
Dependability
Reconstructability supports Dependability because recurring failures, quality variation, and process instability can only be analysed reliably when individual runs can be revisited in detail.
Example evidence / implication:
- Repeated patterns of error detectable across reconstructable runs.
- Evidence showing whether process controls worked consistently under comparable conditions.
Traceability
This item is also central to Traceability because the run must remain linked to actor, time, configuration, source material, and downstream artefacts.
Example evidence / implication:
- Stable identifiers and timestamps connecting the run to relevant documents and outputs.
- Clear links between source inputs, generated artefacts, human interventions, and final use.
Reconstructability affects all five pillars, but it is especially foundational for Auditability and Traceability because those pillars depend directly on whether the run can be rebuilt from evidence.
Why this item is more than a generic concept
In general AI governance, reconstructability may simply mean that someone can later describe what happened in broad terms. In RAIDT, it has a stricter and more operational meaning: a run is reconstructable only when evidence preserved at run level is sufficient for a reviewer to determine the relevant sequence, context, artefacts, and governance handling of that run.
The RAIDT meaning is therefore more practical than a generic call for transparency. It is tied to run-level evidence, it contributes to the evidence pack, it supports five-pillar scoring, and it determines whether governance claims can survive scrutiny after a concrete event. Reconstructability in RAIDT is not a rhetorical ideal; it is an evidential threshold.
Common misunderstanding
Misunderstanding
If a system keeps logs, the run is automatically reconstructable.
Correction
Logs may help, but they do not guarantee reconstructability. A technical log may show that an API call happened at a certain time, yet still fail to preserve the prompt version, the retrieved context, the organisational purpose, the reviewer?s judgement, or the final decision about use. For example, a public-service team might retain timestamped request records but still be unable to show why a draft advisory note was accepted or who checked it. In RAIDT, reconstructability requires evidence that is sufficient for governance review, not merely evidence that a system event occurred.
Boundary and limitation
Reconstructability does not prove that a run was correct, fair, safe, lawful, or well judged. It only shows that the run can be determined after the fact from evidence. A reconstructable run may still reveal poor practice, incomplete oversight, or harmful outcomes. The concept therefore supports governance judgement, but it does not replace that judgement.
It also depends on proportionate evidence design. If evidence capture is too thin, reconstruction fails; if it is too burdensome, staff may bypass or degrade the process. Reconstructability may also be limited by external tools, ephemeral context windows, privacy constraints, retention rules, or missing integrations across systems. RAIDT handles this by treating reconstructability as a design objective for evidence packs and governance workflows, not as an assumption that all relevant information is automatically available.
Implementation levels
Manual implementation
A researcher or small team can implement reconstructability manually by using a structured run record for significant GenAI uses. This can include a run identifier, task purpose, prompt text or template reference, model details, source context, output, reviewer comments, and a note on retention or disposal.
Semi-automated implementation
Semi-automated implementation can combine templates, forms, and metadata capture. A wrapper or workflow tool might automatically attach timestamps, user identifiers, model settings, and storage references while prompting the user or reviewer to complete context, justification, and approval fields.
Fully automated implementation
At scale, a governance layer, orchestration service, or evidence pipeline can capture and link run identifiers, prompts, versions, retrieved context, outputs, review actions, retention states, and scoring inputs automatically. This allows the platform to assemble evidence packs, support audit queries, and measure whether reconstructability is being achieved consistently across teams and use cases.
Practical use in the RAIDT project
Within the RAIDT project, reconstructability is especially useful for Paper 08 Foundations because it sharpens the framework?s claim that run-level governance depends on proof objects that remain reviewable after the event. It also matters for Paper 09 Empirical Validation because an empirical test of RAIDT should examine whether reviewers can in fact reconstruct runs from the available evidence and whether that improves the defensibility of scoring.
For Paper 10 Policy Pathways, reconstructability provides a policy-relevant bridge between abstract expectations of accountability and the operational question of what records organisations should keep for meaningful post hoc review. It is also directly relevant to sector playbooks, because the minimum reconstructive record may differ across healthcare, finance, education, social care, or public administration, even though the RAIDT logic remains stable.
In the evidence pack and scoring rubric, reconstructability helps define what counts as sufficient evidence and why some runs deserve stronger audit confidence than others. In supervisor explanation, viva defence, and journal positioning, it is useful because it shows that RAIDT is not merely asking for more documentation; it is specifying the evidential condition under which governance claims become reviewable and contestable.
Key audience questions to prepare for
Q1. What minimum does reconstructability need?
At minimum, it needs enough stored evidence for another reviewer to determine what happened in the run. In RAIDT terms, that usually means identifiers, prompt/version, model/configuration, retrieved context, output, review and retention data.
Q2. How is reconstructability different from replayability?
Replayability is about rerunning or reproducing the event, often under similar technical conditions. Reconstructability is about determining what happened from the preserved record, even if exact replay is impossible or unnecessary.
Q3. Why is reconstructability important if an organisation already has an audit trail?
An audit trail may show a sequence of events, but reconstructability asks whether the trail and associated artefacts are sufficient for substantive review. RAIDT therefore treats reconstructability as a higher evidential standard than simple event logging.
Q4. Can low-risk uses of GenAI require reconstructability as well?
Yes, although the required depth can be proportionate. Even low-risk organisational uses may need later review, quality assurance, or managerial explanation, so some reconstructive evidence remains useful.
Q5. What happens in RAIDT if reconstructability is weak?
The evidence pack becomes thinner, the score profile becomes less defensible, and governance readiness is reduced because reviewers cannot confidently determine what occurred in the run.
Suggested citation concepts to support this item
- AI audit trails and post hoc reconstruction
- Generative AI documentation for organisational accountability
- Evidence sufficiency in AI governance
- Traceability and reconstructability in sociotechnical systems
- Human oversight records in AI-assisted decision-making
- Logging versus governance evidence in AI systems
- Incident review and contestability in generative AI use
- Record retention and reviewability for AI-enabled workflows
- Operational accountability in enterprise GenAI deployment
- Evidence-based assurance for AI-supported organisational tasks
Short explanation for presentation
Reconstructability is the RAIDT idea that a GenAI run should remain understandable after it has happened. It means that stored records are sufficient for a reviewer to determine what the task was, what prompt and configuration were used, what context informed the run, what output was produced, and how the organisation handled that output. This matters because governance often fails not at the level of policy statements, but at the moment when someone asks what happened in one real case. RAIDT therefore treats reconstructability as a run-level evidential requirement. If a run is reconstructable, it can support a credible evidence pack, a defensible five-pillar score profile, and meaningful review, challenge, and learning. If it is not reconstructable, governance remains too dependent on memory, trust, and informal explanation.
One-line takeaway
Reconstructability is the evidential ability to determine what happened in a run because RAIDT treats reconstructable run records as the basis of reviewable governance.
Related items in run-level evidence logic
Anchored questions
No anchored questions are currently listed in the source note.