S3.02 - Evidence_object
S3.02 ? Evidence object
flowchart LR
A[Fragmented evidence landscape
logs, policy, screenshots, recollections] --> B[RAIDT
run-level evidence framework]
H[Practical run artefacts
prompt, inputs, outputs, metadata, review notes] --> C[[Evidence object
bounded proof artefact for one run]]
B --> C
C --> D[Evidence pack]
C --> E[RAIDT score profile]
C --> F[Reviewer reconstruction
and contestability]
D --> G[Governance readiness
and organisational learning]
E --> G? Star S3 - Run-Level Evidence Logic
Star context: Explains how RAIDT turns run-level evidence into a bounded governance artefact that can be inspected, reconstructed, compared, and challenged rather than treated as an abstract claim of assurance.
Academic picture
Definition / background
An evidence object in RAIDT is a bounded set of records assembled around one governed run so that a governance claim can be supported, inspected, compared, or challenged. The phrase matters because RAIDT does not treat evidence as a vague collection of anything that might later prove useful. Instead, it treats evidence as an identifiable object with scope, boundaries, contents, and a governance purpose.
Conceptually, the evidence object sits between raw traces and formal judgement. It is more structured than a loose archive of files, screenshots, and system logs, because it defines what belongs to the governed run and why. It is also more operational than a general assurance statement, because it contains the concrete artefacts from which reconstruction and review can actually occur. In that sense, the evidence object is the practical governance artefact that makes a run examinable.
This matters in generative AI governance because many existing frameworks speak about transparency, assurance, or documentation without clearly specifying the object that reviewers inspect when they test a real use event. RAIDT resolves that ambiguity by defining the run-level evidence object. The object can contain prompt material, inputs, outputs, metadata, human review notes, workflow decisions, and contextual markers, but its value lies in being bounded and inspectable rather than merely extensive.
Within RAIDT, the evidence object belongs directly to the logic of run-level evidence. It is the artefact from which a run-level evidence pack can be assembled and through which a five-pillar score profile can be justified. Without an evidence object, scoring becomes weakly grounded, comparisons across runs become inconsistent, and challenges to governance claims are harder to answer. The evidence object therefore links the run, the evidence pack, the score profile, and the practical goal of governance readiness.
Why this concept matters
The evidence object solves a persistent governance problem: organisations often have fragments of information about GenAI use, but not a defined object of proof that can be reviewed after the event. When an output is disputed, a decision is questioned, or a supervisor asks how one run should be assessed, the organisation needs more than policy and more than technical telemetry. It needs a bounded evidential unit that tells reviewers what they are inspecting.
The concept also avoids confusion between evidence collection and evidence governance. Collecting data is not the same as defining the artefact through which a claim can be examined. By naming the evidence object, RAIDT clarifies what must be assembled, retained, and reviewed in order to support reconstructability, comparability, and challenge.
If the evidence object is missing, governance becomes fragile. Reviewers may rely on partial logs, anecdotal recollection, or selectively retrieved records. Auditability weakens because no shared object exists for inspection. Traceability weakens because links between prompt, output, actor, and review step can break. Responsibility weakens because decision points and human interventions may not be evident. RAIDT uses the evidence object to prevent governance from collapsing into scattered documentation and post hoc interpretation.
Key idea: The evidence object matters because RAIDT needs a bounded, inspectable proof artefact for each governed run, not just a loose collection of records or high-level assurance claims.
What this item captures
- The bounded evidential scope of one governed GenAI run.
- The records that are necessary to support or challenge a claim about that run.
- The connection between prompts, inputs, outputs, settings, human actions, and review decisions.
- The minimum structure needed for reconstruction, comparison, and contestability.
- The practical object from which an evidence pack can be assembled.
- The evidential basis for justifying RAIDT pillar scores with something reviewable rather than impressionistic.
- The shift from scattered documentation to a governance artefact that can be inspected by supervisors, auditors, reviewers, or practitioners.
Practical example / likely audience question
Audience question
What object does RAIDT actually add that generic AI governance frameworks often leave undefined?
Answer
The concern behind this question is that many governance frameworks already ask for documentation, logging, or assurance, so it can seem unnecessary to introduce another term. The direct answer is that RAIDT adds a run-level evidence object: a bounded governance artefact containing the records needed to inspect one specific GenAI run.
That distinction matters because generic approaches often identify types of documentation but do not define the concrete object a reviewer examines when testing a claim about a real use event. In practice, this leaves organisations with fragments such as model documentation, workflow notes, screenshots, or system logs, but no agreed evidential unit. RAIDT closes that gap by saying that governance attaches to the run and that the run is accompanied by an evidence object that can be inspected and challenged.
A practical example is an enterprise team using GenAI to draft a client-facing risk summary. If concerns later arise about the accuracy of a claim in the draft, generic governance may point to policy, staff training, or vendor assurances. RAIDT goes further by asking for the evidence object for that run: the prompt, source material, model settings, output draft, human edits, approval notes, and contextual metadata. That gives reviewers something concrete to inspect. RAIDT therefore handles the issue better than a generic approach because it defines the inspectable object of governance rather than assuming evidence will somehow be available when needed.
Practical example in RAIDT terms
Consider a finance setting in which an analyst uses a GenAI assistant to draft a preliminary anti-money-laundering case summary for internal review. The use case is not the final compliance decision; it is a productivity-support run that still carries governance risk because the generated summary may shape how a human reviewer interprets transactions and flags concerns.
The run-level issue is whether the summary can later be justified if a case is escalated or criticised. The evidence needed includes the task purpose, the analyst's role, the source transaction data supplied to the model, the prompt template, any instructions about risk thresholds, the model version or tool configuration, the generated text, the analyst's corrections, the compliance review note, and the final disposition of the summary.
In RAIDT terms, those records become the evidence object for that run. Responsibility is affected because the organisation must show who initiated and checked the summary. Auditability is affected because a reviewer must be able to inspect what was generated and how it was handled. Interpretability is affected because the prompt and review notes help explain why the output took the form it did. Dependability is affected because repeated cases can be compared for consistency and failure patterns. Traceability is affected because the artefacts are tied to time, actor, source material, and downstream review. The evidence object therefore improves governance readiness by turning a potentially disputed AI-assisted action into an examinable case rather than an unstructured memory of what happened.
Detailed link to RAIDT
Evidence object links to RAIDT in four ways.
First, it gives practical form to RAIDT's core commitment to evidence over assertion by defining the actual artefact through which governance claims are tested.
Second, it links directly to the run because RAIDT treats the run as the unit of governance and the evidence object as the bounded proof artefact attached to that unit.
Third, it supports the evidence pack and the score profile because the evidence object provides the inspectable material from which packaging, review, and five-pillar scoring can be justified.
Fourth, it strengthens reviewability, contestability, audit readiness, and organisational learning because reviewers can inspect the same object when reconstructing a disputed or exemplary run.
Evidence object ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
Link to the five RAIDT pillars
Responsibility
The evidence object supports Responsibility by showing who was involved in the run, what role they held, and which review or approval actions they performed.
Example evidence / implication:
- Named actor or role attached to the run and its approval path.
- Review notes showing whether human judgement accepted, amended, escalated, or rejected the output.
Auditability
This item has a very strong effect on Auditability because it defines the object that an auditor, supervisor, or examiner can actually inspect.
Example evidence / implication:
- Preserved artefacts showing the run sequence from input and prompt to output and review.
- A bounded set of records that another reviewer can access without reconstructing the case from scattered systems.
Interpretability
The evidence object supports Interpretability by preserving enough contextual and procedural information to explain how an output emerged in practice.
Example evidence / implication:
- Prompt wording, task framing, and source references retained alongside the generated content.
- Reviewer annotations explaining why the output was considered acceptable, risky, misleading, or incomplete.
Dependability
The evidence object supports Dependability because repeated objects from comparable runs can be examined for quality consistency, failure modes, and control performance.
Example evidence / implication:
- Comparable run artefacts that reveal whether a workflow is reliably producing usable outputs.
- Records of corrections, exceptions, or repeated weaknesses across similar tasks.
Traceability
The evidence object is central to Traceability because it ties together the run's time, actor, source material, tool configuration, output, and downstream handling.
Example evidence / implication:
- Timestamped metadata connecting the run to a specific task instance and organisational context.
- Clear linkage from source inputs through generated output to final reviewed outcome.
The evidence object affects all five pillars, but it is especially foundational for Auditability and Traceability because both depend on the existence of a bounded, inspectable artefact.
Why this item is more than a generic concept
In general AI governance, an evidence object might simply mean any document bundle or artefact that supports a claim. In RAIDT, the term is narrower and more operational. It means the bounded run-level artefact that collects the records needed to inspect one governed use of GenAI.
The RAIDT meaning is more operational because it is tied to the run as the unit of governance, to the assembly of an evidence pack, to the justification of a score profile, and to the practical requirements of reviewability and contestability. The term therefore does more than label documentation. It specifies the governance object through which evidence becomes inspectable.
Common misunderstanding
Misunderstanding
An evidence object is just a folder of files or a dump of logs.
Correction
A folder of files may contain evidence, but it is not automatically an evidence object in the RAIDT sense. The RAIDT evidence object is bounded, purposeful, and review-oriented. For example, a raw log export may show that prompts were submitted and outputs returned, but it may not show the task purpose, the role of the user, the source materials relied upon, the human edits, or the decision taken after review. RAIDT turns scattered records into an inspectable governance artefact by defining what belongs to the object and why those contents matter for supporting or challenging a claim about the run.
Boundary and limitation
The evidence object does not by itself prove that a run was ethically acceptable, legally compliant, or substantively correct. It provides the inspectable basis on which such judgements can be made. A well-formed evidence object can still reveal a poor decision, incomplete review, or unreliable workflow. Equally, a high-quality model or policy does not compensate for a weak evidence object when a concrete run must be examined.
The concept also depends on proportionate design. If an organisation defines the evidence object too narrowly, critical context may be lost. If it defines it too broadly, capture burden may become excessive and evidence quality may deteriorate. RAIDT handles this limitation by treating the evidence object as a bounded and purpose-driven artefact: sufficient for reconstruction and challenge, but not an indiscriminate accumulation of every possible trace.
Implementation levels
Manual implementation
A researcher or small team can implement the evidence object manually by using a structured run template. For each important GenAI run, they can retain the prompt, input materials, output, timestamps, reviewer notes, decision outcome, and a short record of why the run mattered. The key is to define the object consistently so that another person can inspect it later.
Semi-automated implementation
Semi-automated implementation can use forms, wrappers, metadata templates, and review checkpoints to populate the evidence object more reliably. For example, a workflow tool can automatically capture timestamps, user identity, prompt text, and output files while still asking a human reviewer to complete contextual notes, approval status, and exception reasons.
Fully automated implementation
At scale, a platform or orchestration layer can generate evidence objects automatically by binding prompts, inputs, outputs, configuration data, review events, and workflow metadata into one governed artefact per run. Governance dashboards can then assemble evidence packs, support RAIDT scoring, surface missing fields, and enable audit or quality-review queries across many runs without losing the run-level unit of inspection.
Practical use in the RAIDT project
Within the RAIDT project, this item is especially useful for Paper 08 Foundations because it sharpens the framework's conceptual claim that governance needs an inspectable proof artefact at run level, not merely principles or documentation categories. It also supports Paper 09 Empirical Validation because empirical testing of RAIDT depends on whether evidence objects can be formed consistently enough to support reconstruction, scoring, and comparison across cases.
For Paper 10 Policy Pathways, the evidence object helps translate governance language into implementable organisational practice. It gives policy-facing work a clearer answer to what organisations should actually retain and review when GenAI is used in operational settings. The same logic is useful in sector playbooks, where the composition of the object may vary by context while the RAIDT structure remains stable.
The item also matters for the evidence pack and scoring rubric because those outputs need a bounded evidential source. For supervisor explanation, viva defence, and journal positioning, the concept is useful because it answers a precise question: when RAIDT says governance should be evidence-based, what is the evidence-bearing artefact? The answer is the run-level evidence object.
Key audience questions to prepare for
Q1. Why not rely on existing logs and documents instead of defining an evidence object?
Because scattered records do not automatically create a reviewable governance artefact. RAIDT defines the evidence object so that the material supporting a claim about one run is bounded, inspectable, and comparable across cases.
Q2. Is the evidence object the same as the evidence pack?
Not exactly. The evidence object is the bounded artefact containing the relevant records for a run. The evidence pack is the practical packaging or presentation of that evidence for review, assessment, or reporting. The pack draws on the object.
Q3. Does every low-risk GenAI interaction need a full evidence object?
Not at the same depth. RAIDT implies proportionate governance. The concept still applies, but the object can be lighter for low-risk or routine runs and richer for consequential, regulated, or contested ones.
Q4. How does this differ from a proof object?
They are closely related, but the evidence object emphasises the bounded set of inspectable records, whereas a proof object emphasises the evidential force of that artefact in supporting or challenging a governance claim. In practice, the evidence object is what makes proof possible.
Q5. What makes the RAIDT evidence object distinctive?
Its distinctiveness lies in being explicitly tied to one run, to reviewability and challenge, and to downstream outputs such as the evidence pack and five-pillar score profile. It is not generic documentation; it is the operational governance artefact for a concrete GenAI event.
Suggested citation concepts to support this item
- AI governance evidence artefacts and accountability
- Run-level documentation for generative AI use
- Auditability and traceability in AI-assisted organisational workflows
- Evidence-based governance for human-AI decision support
- Sociotechnical records for AI incident review
- Documentation boundaries and proof objects in AI governance
- Operational assurance for generative AI deployments
- Human oversight records in AI-supported professional work
- Organisational reconstruction of AI-assisted decisions
- Reviewability and contestability in responsible AI governance
Short explanation for presentation
An evidence object is the bounded governance artefact that RAIDT attaches to one run of generative AI. Instead of treating evidence as a loose collection of logs, screenshots, and documents, RAIDT defines an inspectable object containing the records needed to support or challenge a claim about what happened in that run. This matters because governance questions usually arise in concrete cases, not in the abstract. If an organisation can produce a coherent evidence object, it can reconstruct the run, assemble an evidence pack, justify a five-pillar score profile, and improve reviewability, contestability, and audit readiness. If it cannot, governance remains largely declarative. The evidence object therefore gives RAIDT a practical answer to the question of what exactly is being examined when a GenAI use event is reviewed.
One-line takeaway
Evidence object is the bounded, inspectable proof artefact for one governed run because RAIDT turns run-level evidence into something that can be reviewed, challenged, and scored.