S4.01 - run_id

S4.01 ? run_id

flowchart LR
    A[Fragmented records
prompts, outputs, logs, notes] --> B[RAIDT
run-level evidence framework] H[Practical fields
timestamp, prompt ID, hashes,
tool trace, reviewer notes] --> C[[run_id
unique linking spine for one run]] B --> C C --> D[Evidence pack] C --> E[RAIDT score profile] D --> F[Reviewer reconstruction
and contestability] E --> G[Governance readiness
and organisational learning]

? Star S4 - Evidence Architecture and Artefacts

Star context: Specifies the concrete fields and artefacts that make a run record inspectable, joinable, and reconstructable inside RAIDT's run-level evidence architecture.


Academic picture
Definition / background

run_id is the unique identifier assigned to one specific GenAI run so that all evidence generated in that run can be connected, retrieved, and reviewed as a single unit. In RAIDT, a run is one configured use of a generative AI system for a defined task, at a particular time, in a particular organisational context. The run_id is therefore the identifier for the unit of governance itself.

Conceptually, run_id sits at the boundary between record architecture and governance method. In ordinary data management, identifiers are often used for indexing or transaction handling. In RAIDT, the identifier has a more specific function: it makes the evidential unit inspectable by ensuring that prompts, system settings, retrieved materials, outputs, reviewer interventions, and decision records can be shown to belong to the same event of use.

This distinguishes run_id from nearby concepts. It is not the same as a timestamp, because many runs may occur close together in time. It is not the same as a prompt ID, because the same prompt template may be reused across many runs. It is not the same as a case ID, document ID, or user ID, because those refer to surrounding entities rather than to the run itself. run_id identifies the run as the core evidential container.

Within RAIDT, this item belongs inside Evidence Architecture and Artefacts because the framework depends on run-level evidence being linkable across multiple artefacts. Without a stable run_id, an evidence pack becomes fragile, the basis of a score profile becomes harder to defend, and the claim that a particular run can be reconstructed weakens significantly. The identifier is therefore a small field with large governance consequences.

Why this concept matters

run_id solves a basic but consequential governance problem: evidence can exist without being governable if there is no reliable way to prove which artefacts belong together. Organisations often capture many pieces of information around GenAI use, but when these are stored across prompts, tools, review forms, and logs, the absence of a common identifier makes later reconstruction uncertain.

The concept also prevents a common confusion between having data and having linked evidence. A folder full of outputs, screenshots, prompts, and logs may look comprehensive, yet if those artefacts cannot be tied to one specific run, reviewers cannot confidently determine what happened in that event. run_id converts scattered records into a coherent evidential chain.

If run_id is missing, risks appear quickly: duplicate or ambiguous records, weak audit trails, failed reviewer reconstruction, difficulty contesting a disputed output, and reduced confidence in any score assigned to the run. For organisations using GenAI in operational settings, this is not a cosmetic metadata issue. It is a precondition for evidence integrity.

RAIDT uses run_id to move governance from principles to operations. It provides the reference point that allows one run to be assembled, checked, challenged, compared, and learned from over time.

Key idea: run_id matters because RAIDT can only govern one run as one evidential unit if every artefact for that run is unambiguously linked together.

What this item enables
Practical example / likely audience question

Audience question

If a system already records timestamps, filenames, and user details, why does RAIDT still need a separate run_id?

Answer

The concern behind this question is that run_id may appear redundant if other metadata already exist. The direct answer is that timestamps, filenames, and user details describe aspects of a run, but they do not reliably define the run as a single governed event. Two runs may involve the same user, the same prompt template, similar filenames, and closely adjacent times. Without a distinct run identifier, the evidence boundary remains uncertain.

A practical example is a policy analyst using a GenAI drafting tool several times in one afternoon to generate alternative versions of a briefing note. The same operator, same task label, and same source folder may be involved in each attempt. If one output is later challenged, reviewers need to know exactly which prompt instance, model setting, retrieved material set, and review action belong to that contested run. run_id makes that possible.

RAIDT handles this better than a generic AI governance approach because it does not treat metadata as a loose collection of helpful fields. It treats the run as the unit of governance and therefore requires an explicit identifier for that unit. run_id is what turns many surrounding records into one inspectable object of review.

Practical example in RAIDT terms

Consider a finance setting in which a bank compliance analyst uses a GenAI assistant to draft a summary of unusual transaction activity for internal escalation. The GenAI use case is legitimate, but several similar drafts may be produced during the same case review as the analyst refines the request, checks source excerpts, and compares wording options.

The run-level issue is not simply whether the model performed well in general. The issue is whether one particular draft summary can be reconstructed and justified if it is later questioned by a senior reviewer or regulator. The evidence needed includes the run_id, timestamp, analyst role, task label, prompt version, prompt hash, model/version identifier, retrieved case documents, output hash, reviewer notes, and final escalation decision.

The most affected RAIDT pillars are Auditability and Traceability, because the organisation must show that the output under scrutiny is the output from this run rather than from a neighbouring attempt. Responsibility is also affected because the analyst and reviewer roles must be attached to the same run record. Governance readiness improves because the bank can present one coherent evidence pack rather than a bundle of partially related artefacts.

Detailed link to RAIDT

run_id links to RAIDT in four ways.

First, it gives operational form to RAIDT's core idea that governance should attach to one concrete run rather than to general claims about a model or policy.

Second, it links directly to the run as the unit of governance by naming that unit in a way that can be propagated across all relevant artefacts.

Third, it stabilises the evidence pack and the score profile because every prompt, output, review note, and trace element can be assembled under one run reference.

Fourth, it supports reviewability, contestability, audit readiness, and organisational learning because later reviewers can ask for one run_id and reconstruct the relevant event with less ambiguity.

run_id ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

Link to the five RAIDT pillars

Responsibility

run_id supports Responsibility by ensuring that accountability records attach to the correct run rather than to a general workflow or user account.

Example evidence / implication:

Auditability

This item has a particularly strong effect on Auditability because auditors need a stable reference for reconstructing one event of use.

Example evidence / implication:

Interpretability

run_id supports Interpretability indirectly. It does not explain model internals, but it ensures that the contextual materials needed to interpret one output can be brought together.

Example evidence / implication:

Dependability

run_id supports Dependability by allowing repeated runs, failure cases, and corrections to be compared without confusion about which artefacts belong to which attempt.

Example evidence / implication:

Traceability

run_id is central to Traceability because it is the joining key that connects distributed artefacts into one traceable chain.

Example evidence / implication:

run_id affects all five pillars, but its strongest influence is on Auditability and Traceability because both depend on the existence of a stable evidential reference point.

Why this item is more than a generic concept

In general AI governance or software operations, an identifier may simply mean a convenient label for storage, indexing, or debugging. In RAIDT, run_id has a narrower and more operational meaning: it is the governance anchor for one run-level evidence record.

The RAIDT meaning is more operational because the identifier is tied to evidence-pack assembly, five-pillar scoring, reviewer reconstruction, and challengeability. In other words, run_id is not included merely for database hygiene. It is included because without it the framework cannot reliably show what belongs to one governed use event.

Common misunderstanding

Misunderstanding

run_id is just a technical logging detail with no real governance significance.

Correction

It is true that run_id is implemented as a technical field, but its governance significance is substantial. A technical field becomes a governance mechanism when it determines whether evidence can later be reconstructed, challenged, and audited. For example, a team may have excellent prompt logs and reviewer notes, but if those records are not linked by the same run_id, they may not be defensibly attributable to the same event. In RAIDT, the identifier is therefore a practical condition for accountable review, not just an engineering convenience.

Boundary and limitation

run_id does not prove that a run was lawful, fair, safe, or high quality. It does not replace prompt capture, output preservation, timestamps, hashes, reviewer notes, or substantive evaluation. It only provides the linking mechanism that allows those artefacts to be assembled around one run.

It can also fail if implemented badly. Poor namespace design, duplicate identifiers, inconsistent propagation across systems, or retrospective assignment can weaken trust in the record. RAIDT handles this limitation by treating run_id as one element in a wider evidence architecture that should also include timestamps, hashes, prompt/version controls, and review records. The identifier is necessary, but not sufficient, for robust governance evidence.

Implementation levels

Manual implementation

A researcher or small team can implement run_id manually by assigning a unique identifier at the start of each meaningful GenAI run and recording it in a structured template, spreadsheet, or note. The same identifier should then be copied into the prompt record, output file, review note, and any related evidence summary.

Semi-automated implementation

Semi-automated implementation can use forms, wrappers, or templates that generate a run_id automatically when a user opens a governed task. The identifier can then be inserted into prompt registries, filenames, review forms, and export metadata while still allowing humans to add contextual notes.

Fully automated implementation

At scale, a platform, orchestration layer, or governance pipeline can mint an immutable run_id for every governed interaction and propagate it across logs, retrieval components, tool calls, outputs, storage objects, dashboards, and review queues. This supports automated evidence-pack assembly, cross-system joins, and run-level audit queries.

Practical use in the RAIDT project

Within the RAIDT project, this item is particularly useful for Paper 08 Foundations because it helps specify how the framework's abstract commitment to run-level governance becomes a concrete evidence architecture. It shows that the unit of governance must also have a stable unit identifier.

For Paper 09 Empirical Validation, run_id matters because empirical testing depends on whether evidence from actual runs can be retrieved and grouped reliably. If run records cannot be joined consistently, validation of the evidence pack and score profile becomes methodologically weaker.

For Paper 10 Policy Pathways and sector playbooks, run_id is a practical policy design point. It is the kind of requirement that organisations can operationalise through templates, wrappers, procurement clauses, and internal governance controls. In supervision, viva defence, and journal positioning, it helps answer a simple but important question: how does RAIDT know that the artefacts being reviewed belong to the same run? The answer is that the framework requires a stable run identifier as part of its evidence architecture.

Key audience questions to prepare for

Q1. Is run_id different from a case ID or document ID?

Yes. A case ID identifies the wider matter, and a document ID identifies a particular file or artefact. run_id identifies the specific GenAI use event inside that wider context.

Q2. Why not rely on timestamps alone?

Because timestamps describe when something happened, not necessarily which artefacts belong to the same governed run. Multiple similar runs can occur close together and become difficult to separate.

Q3. Does every GenAI interaction need a run_id?

Not every trivial interaction needs the same level of governance, but every run that RAIDT treats as a governed unit should have one. The point is proportional evidence capture, not unnecessary bureaucracy.

Q4. What happens if one artefact is missing the run_id?

The run may still be partially reconstructable, but confidence in the evidence pack decreases. Missing propagation is itself a governance weakness because it breaks the evidential chain.

Q5. Why is this concept important for viva or reviewer discussion?

Because it shows that RAIDT is operationally serious. It demonstrates that the framework is designed not only around principles, but around the practical mechanics needed to reconstruct one real run.

Suggested citation concepts to support this item
Short explanation for presentation

run_id is the unique identifier for one governed GenAI run in RAIDT. Its importance is not merely technical. RAIDT treats the run as the unit of governance, so the framework needs a stable way to tie prompts, settings, retrieved materials, outputs, review actions, and final decisions to that one unit. Without run_id, an organisation may have many useful artefacts but still struggle to prove which ones belong together. With it, a run can be reconstructed as a coherent evidence pack and assessed through RAIDT's five-pillar score profile. This makes run_id a small but foundational part of governance readiness: it turns distributed metadata into a reviewable record of one actual use event.

One-line takeaway

run_id is the unique identifier for one GenAI run because RAIDT can only build run-level evidence, scoring, and governance readiness when all artefacts for that run are unambiguously linked.

Related items in evidence architecture and artefacts
Anchored questions
Powered by Forestry.md