S4.01 - run_id
S4.01 ? run_id
flowchart LR
A[Fragmented records
prompts, outputs, logs, notes] --> B[RAIDT
run-level evidence framework]
H[Practical fields
timestamp, prompt ID, hashes,
tool trace, reviewer notes] --> C[[run_id
unique linking spine for one run]]
B --> C
C --> D[Evidence pack]
C --> E[RAIDT score profile]
D --> F[Reviewer reconstruction
and contestability]
E --> G[Governance readiness
and organisational learning]? Star S4 - Evidence Architecture and Artefacts
Star context: Specifies the concrete fields and artefacts that make a run record inspectable, joinable, and reconstructable inside RAIDT's run-level evidence architecture.
Academic picture
Definition / background
run_id is the unique identifier assigned to one specific GenAI run so that all evidence generated in that run can be connected, retrieved, and reviewed as a single unit. In RAIDT, a run is one configured use of a generative AI system for a defined task, at a particular time, in a particular organisational context. The run_id is therefore the identifier for the unit of governance itself.
Conceptually, run_id sits at the boundary between record architecture and governance method. In ordinary data management, identifiers are often used for indexing or transaction handling. In RAIDT, the identifier has a more specific function: it makes the evidential unit inspectable by ensuring that prompts, system settings, retrieved materials, outputs, reviewer interventions, and decision records can be shown to belong to the same event of use.
This distinguishes run_id from nearby concepts. It is not the same as a timestamp, because many runs may occur close together in time. It is not the same as a prompt ID, because the same prompt template may be reused across many runs. It is not the same as a case ID, document ID, or user ID, because those refer to surrounding entities rather than to the run itself. run_id identifies the run as the core evidential container.
Within RAIDT, this item belongs inside Evidence Architecture and Artefacts because the framework depends on run-level evidence being linkable across multiple artefacts. Without a stable run_id, an evidence pack becomes fragile, the basis of a score profile becomes harder to defend, and the claim that a particular run can be reconstructed weakens significantly. The identifier is therefore a small field with large governance consequences.
Why this concept matters
run_id solves a basic but consequential governance problem: evidence can exist without being governable if there is no reliable way to prove which artefacts belong together. Organisations often capture many pieces of information around GenAI use, but when these are stored across prompts, tools, review forms, and logs, the absence of a common identifier makes later reconstruction uncertain.
The concept also prevents a common confusion between having data and having linked evidence. A folder full of outputs, screenshots, prompts, and logs may look comprehensive, yet if those artefacts cannot be tied to one specific run, reviewers cannot confidently determine what happened in that event. run_id converts scattered records into a coherent evidential chain.
If run_id is missing, risks appear quickly: duplicate or ambiguous records, weak audit trails, failed reviewer reconstruction, difficulty contesting a disputed output, and reduced confidence in any score assigned to the run. For organisations using GenAI in operational settings, this is not a cosmetic metadata issue. It is a precondition for evidence integrity.
RAIDT uses run_id to move governance from principles to operations. It provides the reference point that allows one run to be assembled, checked, challenged, compared, and learned from over time.
Key idea:
run_idmatters because RAIDT can only govern one run as one evidential unit if every artefact for that run is unambiguously linked together.
What this item enables
- It enables all artefacts from one run to be linked into a single evidence record.
- It enables reviewers to reconstruct the chronology and content of a run without guessing which files belong together.
- It enables evidence-pack assembly across prompts, outputs, logs, tool traces, and human review notes.
- It enables defensible scoring because the underlying evidence can be traced back to one identifiable run.
- It enables de-duplication and comparison between runs that use the same prompt, model, or workflow.
- It enables escalation, audit, and incident review by giving investigators a stable reference point.
- It enables organisational learning because lessons can be attached to a concrete run rather than to vague recollections of system use.
Practical example / likely audience question
Audience question
If a system already records timestamps, filenames, and user details, why does RAIDT still need a separate run_id?
Answer
The concern behind this question is that run_id may appear redundant if other metadata already exist. The direct answer is that timestamps, filenames, and user details describe aspects of a run, but they do not reliably define the run as a single governed event. Two runs may involve the same user, the same prompt template, similar filenames, and closely adjacent times. Without a distinct run identifier, the evidence boundary remains uncertain.
A practical example is a policy analyst using a GenAI drafting tool several times in one afternoon to generate alternative versions of a briefing note. The same operator, same task label, and same source folder may be involved in each attempt. If one output is later challenged, reviewers need to know exactly which prompt instance, model setting, retrieved material set, and review action belong to that contested run. run_id makes that possible.
RAIDT handles this better than a generic AI governance approach because it does not treat metadata as a loose collection of helpful fields. It treats the run as the unit of governance and therefore requires an explicit identifier for that unit. run_id is what turns many surrounding records into one inspectable object of review.
Practical example in RAIDT terms
Consider a finance setting in which a bank compliance analyst uses a GenAI assistant to draft a summary of unusual transaction activity for internal escalation. The GenAI use case is legitimate, but several similar drafts may be produced during the same case review as the analyst refines the request, checks source excerpts, and compares wording options.
The run-level issue is not simply whether the model performed well in general. The issue is whether one particular draft summary can be reconstructed and justified if it is later questioned by a senior reviewer or regulator. The evidence needed includes the run_id, timestamp, analyst role, task label, prompt version, prompt hash, model/version identifier, retrieved case documents, output hash, reviewer notes, and final escalation decision.
The most affected RAIDT pillars are Auditability and Traceability, because the organisation must show that the output under scrutiny is the output from this run rather than from a neighbouring attempt. Responsibility is also affected because the analyst and reviewer roles must be attached to the same run record. Governance readiness improves because the bank can present one coherent evidence pack rather than a bundle of partially related artefacts.
Detailed link to RAIDT
run_id links to RAIDT in four ways.
First, it gives operational form to RAIDT's core idea that governance should attach to one concrete run rather than to general claims about a model or policy.
Second, it links directly to the run as the unit of governance by naming that unit in a way that can be propagated across all relevant artefacts.
Third, it stabilises the evidence pack and the score profile because every prompt, output, review note, and trace element can be assembled under one run reference.
Fourth, it supports reviewability, contestability, audit readiness, and organisational learning because later reviewers can ask for one run_id and reconstruct the relevant event with less ambiguity.
run_id ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
Link to the five RAIDT pillars
Responsibility
run_id supports Responsibility by ensuring that accountability records attach to the correct run rather than to a general workflow or user account.
Example evidence / implication:
- Reviewer sign-off is tied to the exact run that produced the output.
- Escalation or override decisions can be attributed to the relevant run rather than to a broader task stream.
Auditability
This item has a particularly strong effect on Auditability because auditors need a stable reference for reconstructing one event of use.
Example evidence / implication:
- A reviewer can request one
run_idand retrieve the linked prompt, settings, output, and review notes. - Audit sampling becomes more reliable because each sampled run has a unique evidential anchor.
Interpretability
run_id supports Interpretability indirectly. It does not explain model internals, but it ensures that the contextual materials needed to interpret one output can be brought together.
Example evidence / implication:
- Prompt wording, retrieved sources, and reviewer commentary can be examined as one package for the identified run.
- Comparative analysis across runs becomes easier because each interpretation is tied to a distinct identifier.
Dependability
run_id supports Dependability by allowing repeated runs, failure cases, and corrections to be compared without confusion about which artefacts belong to which attempt.
Example evidence / implication:
- Reliability analysis can distinguish between a successful run and a failed neighbouring run using the same workflow.
- Incident reviews can attach corrective actions to the exact run that exposed a weakness.
Traceability
run_id is central to Traceability because it is the joining key that connects distributed artefacts into one traceable chain.
Example evidence / implication:
- Tool-chain events, output hashes, and reviewer notes can all be traced back to one run.
- Downstream references to the run remain stable even when artefacts are stored in different systems.
run_id affects all five pillars, but its strongest influence is on Auditability and Traceability because both depend on the existence of a stable evidential reference point.
Why this item is more than a generic concept
In general AI governance or software operations, an identifier may simply mean a convenient label for storage, indexing, or debugging. In RAIDT, run_id has a narrower and more operational meaning: it is the governance anchor for one run-level evidence record.
The RAIDT meaning is more operational because the identifier is tied to evidence-pack assembly, five-pillar scoring, reviewer reconstruction, and challengeability. In other words, run_id is not included merely for database hygiene. It is included because without it the framework cannot reliably show what belongs to one governed use event.
Common misunderstanding
Misunderstanding
run_id is just a technical logging detail with no real governance significance.
Correction
It is true that run_id is implemented as a technical field, but its governance significance is substantial. A technical field becomes a governance mechanism when it determines whether evidence can later be reconstructed, challenged, and audited. For example, a team may have excellent prompt logs and reviewer notes, but if those records are not linked by the same run_id, they may not be defensibly attributable to the same event. In RAIDT, the identifier is therefore a practical condition for accountable review, not just an engineering convenience.
Boundary and limitation
run_id does not prove that a run was lawful, fair, safe, or high quality. It does not replace prompt capture, output preservation, timestamps, hashes, reviewer notes, or substantive evaluation. It only provides the linking mechanism that allows those artefacts to be assembled around one run.
It can also fail if implemented badly. Poor namespace design, duplicate identifiers, inconsistent propagation across systems, or retrospective assignment can weaken trust in the record. RAIDT handles this limitation by treating run_id as one element in a wider evidence architecture that should also include timestamps, hashes, prompt/version controls, and review records. The identifier is necessary, but not sufficient, for robust governance evidence.
Implementation levels
Manual implementation
A researcher or small team can implement run_id manually by assigning a unique identifier at the start of each meaningful GenAI run and recording it in a structured template, spreadsheet, or note. The same identifier should then be copied into the prompt record, output file, review note, and any related evidence summary.
Semi-automated implementation
Semi-automated implementation can use forms, wrappers, or templates that generate a run_id automatically when a user opens a governed task. The identifier can then be inserted into prompt registries, filenames, review forms, and export metadata while still allowing humans to add contextual notes.
Fully automated implementation
At scale, a platform, orchestration layer, or governance pipeline can mint an immutable run_id for every governed interaction and propagate it across logs, retrieval components, tool calls, outputs, storage objects, dashboards, and review queues. This supports automated evidence-pack assembly, cross-system joins, and run-level audit queries.
Practical use in the RAIDT project
Within the RAIDT project, this item is particularly useful for Paper 08 Foundations because it helps specify how the framework's abstract commitment to run-level governance becomes a concrete evidence architecture. It shows that the unit of governance must also have a stable unit identifier.
For Paper 09 Empirical Validation, run_id matters because empirical testing depends on whether evidence from actual runs can be retrieved and grouped reliably. If run records cannot be joined consistently, validation of the evidence pack and score profile becomes methodologically weaker.
For Paper 10 Policy Pathways and sector playbooks, run_id is a practical policy design point. It is the kind of requirement that organisations can operationalise through templates, wrappers, procurement clauses, and internal governance controls. In supervision, viva defence, and journal positioning, it helps answer a simple but important question: how does RAIDT know that the artefacts being reviewed belong to the same run? The answer is that the framework requires a stable run identifier as part of its evidence architecture.
Key audience questions to prepare for
Q1. Is run_id different from a case ID or document ID?
Yes. A case ID identifies the wider matter, and a document ID identifies a particular file or artefact. run_id identifies the specific GenAI use event inside that wider context.
Q2. Why not rely on timestamps alone?
Because timestamps describe when something happened, not necessarily which artefacts belong to the same governed run. Multiple similar runs can occur close together and become difficult to separate.
Q3. Does every GenAI interaction need a run_id?
Not every trivial interaction needs the same level of governance, but every run that RAIDT treats as a governed unit should have one. The point is proportional evidence capture, not unnecessary bureaucracy.
Q4. What happens if one artefact is missing the run_id?
The run may still be partially reconstructable, but confidence in the evidence pack decreases. Missing propagation is itself a governance weakness because it breaks the evidential chain.
Q5. Why is this concept important for viva or reviewer discussion?
Because it shows that RAIDT is operationally serious. It demonstrates that the framework is designed not only around principles, but around the practical mechanics needed to reconstruct one real run.
Suggested citation concepts to support this item
- unique identifiers in audit trails for AI systems
- evidence linkage and provenance in generative AI governance
- run-level traceability in human-AI workflows
- metadata integrity for AI accountability systems
- event reconstruction in sociotechnical audit systems
- provenance identifiers in machine learning operations
- chain of custody for digital evidence in organisational AI use
- operational governance records for generative AI deployments
- cross-system evidence joins in AI assurance infrastructures
- logging architecture for accountable AI review
Short explanation for presentation
run_id is the unique identifier for one governed GenAI run in RAIDT. Its importance is not merely technical. RAIDT treats the run as the unit of governance, so the framework needs a stable way to tie prompts, settings, retrieved materials, outputs, review actions, and final decisions to that one unit. Without run_id, an organisation may have many useful artefacts but still struggle to prove which ones belong together. With it, a run can be reconstructed as a coherent evidence pack and assessed through RAIDT's five-pillar score profile. This makes run_id a small but foundational part of governance readiness: it turns distributed metadata into a reviewable record of one actual use event.
One-line takeaway
run_idis the unique identifier for one GenAI run because RAIDT can only build run-level evidence, scoring, and governance readiness when all artefacts for that run are unambiguously linked.
Related items in evidence architecture and artefacts
- S4.02 ? Timestamp
- S4.03 ? User role / operator role
- S4.04 ? Task and domain label
- S4.05 ? Prompt registry
- S4.06 ? Prompt ID and version
- S4.07 ? Prompt hash
- S4.08 ? Model/provider/version identifier
- S4.09 ? Decoding parameters
- S4.10 ? Retrieval query and index ID
- S4.11 ? Retrieved document IDs and hashes
- S4.12 ? Tool-chain trace
- S4.13 ? Adapter ID / PEFT lineage
- S4.14 ? Alignment policy ID
- S4.15 ? Output hash
- S4.16 ? Review decision and reviewer notes
- ? and 1 more