RAIDT Core

flowchart LR
    A[Responsible AI gap] --> B[Principles lack run evidence]
    B --> C[RAIDT framework]
    C --> D[Circle 0: RAIDT Core]
    D --> E[Run evidence pack]
    D --> F[Five-pillar profile]
    E --> G[Reviewer reconstruction]
    F --> H[Governance intervention]
    G --> I[Organisational assurance]
    H --> I[Organisational assurance]

RAIDT

Ring: Core conceptual hub

Function

To define RAIDT as a run-level governance framework for generative AI in organisational work, clarify its central claims, specify the run as the unit of governance, and explain the two core outputs: the run-level evidence pack and the five-pillar RAIDT score profile.

Role in the project

This note is the conceptual anchor for the whole RAIDT project. It sits primarily in the foundations layer, but it also links theory, evidence design, implementation logic, empirical validation, and policy translation. In practice, Circle 0 explains what RAIDT is, why the project is organised around the run rather than around the model alone, and how the framework is intended to support responsible AI governance across real organisational settings. It therefore supports all downstream work on pillars, evidence capture, scoring, governance interventions, sector playbooks, and policy alignment.

Stars in this circle (1)
Main questions answered by this star
Workshop discussion prompts
Main message

RAIDT is the core organising idea of the project: a framework for governing generative AI through evidence about individual runs. In RAIDT, a run is one configured use of a generative AI system for a particular task, at a particular time, in a particular organisational context. This is a deliberate move away from treating governance as something that sits only at the level of the base model, the vendor, or a broad policy statement. Those levels still matter, but they do not tell decision-makers enough about what actually happened when a system was used to produce an output for work.

This matters because organisational use of generative AI is highly contingent. The same model can behave differently depending on prompt design, retrieved context, tool permissions, fine-tuning choices, safety settings, user expertise, domain constraints, and review practices. A procurement analyst drafting a supplier summary, a university administrator preparing a policy note, and a clinician using a summarisation support tool may all technically use the same model family, yet the governance questions are not the same. RAIDT therefore treats the run as the smallest practically meaningful unit at which evidence, accountability, and intervention can be assembled.

The framework produces two practical outputs. The first is a run-level evidence pack. This is the structured record of what the system was asked to do, how it was configured, what context it drew on, what output it produced, and what checking took place. The second is a five-pillar RAIDT score profile covering Responsibility, Auditability, Interpretability, Dependability, and Traceability. Together these outputs are intended to help organisations move from vague assurances about AI use to inspectable, contestable, and actionable governance evidence.

Responsibility concerns who designed, approved, operated, reviewed, or acted on a run. It makes explicit that governance is not only a technical matter; there must be named accountability, role clarity, and decision authority. Auditability asks whether an independent reviewer could reconstruct what happened and assess whether controls were followed. Interpretability concerns whether the rationale of the run is understandable enough for users and reviewers to make sense of the output, its basis, and its constraints. Dependability examines whether the run is sufficiently robust, reliable, and fit for purpose under the stated conditions of use. Traceability concerns the chain of evidence linking input, context, processing choices, output, review, and downstream action.

The problem RAIDT addresses is a recurring gap in responsible AI governance. Many governance approaches are strong at the level of principles, risk categories, or lifecycle controls, but weak at the point where an actual organisational use occurs. Generative AI creates particular pressure here because prompts are variable, retrieval pipelines can introduce hidden sources, output quality can shift across contexts, and human review is often inconsistent. As a result, organisations may claim that a system is aligned, documented, or policy-compliant while still lacking evidence about whether a specific output used in practice was produced responsibly.

RAIDT addresses this gap by connecting technical configuration to governance evidence. The run-level record can capture the instruction or prompt, the model and tool configuration, whether retrieval-augmented generation was used, what source materials were retrieved, whether a PEFT or LoRA variant was deployed, what alignment or RLHF-derived controls applied, what automated checks ran, what human review occurred, and whether escalation or override was necessary. This does not eliminate uncertainty, but it makes uncertainty visible and governable. For managerial settings, that is crucial. Managers often do not need metaphysical certainty about how a model ?thinks?; they need evidence about whether this use was appropriate, documented, reviewable, and defensible.

The framework also matters because it creates a bridge between academic theory and operational governance. In Information Systems terms, RAIDT can be read as a governance architecture for distributed sociotechnical action. In responsible AI terms, it operationalises accountability and oversight. In uncertainty terms, it reframes the problem from ?Can we fully trust this model?? to ?What can we evidence about this run, what uncertainty remains, and what governance response is proportionate?? This is especially relevant to domains where contestability matters. If a person challenges an AI-supported recommendation, an organisation needs more than a policy statement; it needs a reconstructable record.

A practical example helps. Suppose an HR team uses a GenAI tool to draft a first-pass job description. A RAIDT evidence pack would record the prompt, approved template, model version, whether internal competency documents were retrieved, what editing occurred, who approved release, and which checks ensured discriminatory or misleading language was removed. The score profile may show strong traceability and auditability but weaker interpretability if the source basis of some phrasing was unclear. That profile then informs a governance intervention, such as requiring mandatory source citation in future runs.

A second example is RAG-based policy drafting. A university team asks a model to produce a student-facing guidance note using internal regulations. Without run-level evidence, the team may only know that ?the model used the policy database?. With RAIDT, the run can record which documents were retrieved, their versions, the retrieval query, conflicts among sources, review steps, and whether the final output deviated from authoritative text. This makes governance concrete rather than rhetorical.

The framework has clear boundaries. RAIDT does not claim that every risk can be resolved at run level. Base model training data, vendor practices, upstream bias, and broader institutional power relations still require system-level and policy-level governance. Nor does RAIDT claim that scoring is fully objective. Scores are structured judgements supported by evidence, and they must be validated empirically. The value of RAIDT is not that it removes judgement, but that it disciplines judgement by tying it to a documented run.

Within the wider project, Circle 0 therefore establishes the central proposition to be tested and refined: responsible governance of generative AI in organisational work becomes more practical and defensible when the run is treated as the unit of evidence, assessment, and intervention. This proposition can then be explored methodologically in Paper 08, tested empirically in Paper 09, and translated into policy pathways and standards alignment in Paper 10.

Key questions and answers

Q1. What is RAIDT in one sentence?

Answer:
RAIDT is a run-level evidence framework for governing generative AI use in organisational work. Its purpose is to make each material AI-assisted task inspectable through structured evidence and a five-pillar score profile. Rather than asking only whether an AI system is generally safe or compliant, RAIDT asks whether a specific use was responsibly configured, documented, reviewed, and acted upon.

Practical example:
A policy officer uses a GenAI tool to draft a briefing note. RAIDT focuses on that single drafting event as the unit to document and assess.

Link to RAIDT:
This is the core project definition and explains why the run-level evidence pack and five-pillar scoring system are the main outputs.

Q2. Why is the run the unit of governance?

Answer:
The run is the point where model behaviour, prompt design, retrieved context, tools, and human oversight come together in practice. Governance at model or policy level remains necessary, but it is too abstract to explain whether one concrete organisational use was appropriate. The run is therefore the smallest unit at which responsibility and evidence can be assembled in a usable way.

Practical example:
The same model produces very different risk profiles when used for brainstorming workshop ideas versus summarising a legal contract.

Link to RAIDT:
The run anchors evidence capture, scoring, audit reconstruction, and targeted governance interventions.

Q3. What problem does RAIDT solve?

Answer:
RAIDT addresses the gap between high-level responsible AI principles and the messy reality of actual organisational use. Many organisations can show policies, procurement documents, or vendor claims, but cannot reconstruct what happened in a contested AI-assisted decision or output. RAIDT aims to close that gap.

Practical example:
An employee challenges an AI-assisted recommendation, but the organisation cannot show which prompt, context, or review process led to it.

Link to RAIDT:
The evidence pack is designed precisely to make such cases reconstructable and reviewable.

Q4. What belongs in a run-level evidence pack?

Answer:
At minimum, the pack should capture task purpose, user role, prompt or instruction, model and version, tool settings, retrieval sources where relevant, output artefacts, review steps, approvals, exceptions, and timestamps. In some contexts it should also include risk classification, escalation rules, and post-run outcomes.

Practical example:
A finance team run includes the prompt, model endpoint, retrieved policy documents, spreadsheet tool access, generated summary, and sign-off record.

Link to RAIDT:
The evidence pack is one of the two central practical outputs of the framework.

Q5. Why are the five pillars needed?

Answer:
The five pillars turn a broad governance ambition into assessable dimensions. Responsibility makes accountability visible; Auditability makes reconstruction possible; Interpretability supports understanding; Dependability addresses robustness and fitness for purpose; Traceability links evidence across the run lifecycle. Together they prevent governance from collapsing into a single vague judgement.

Practical example:
A run may be dependable in output quality but weak in traceability if the source basis cannot be reconstructed.

Link to RAIDT:
The RAIDT score profile uses these pillars to produce a structured assessment rather than a binary pass or fail.

Q6. How does RAIDT relate to prompt engineering?

Answer:
Prompt engineering is not just a performance technique; in RAIDT it is also a governance variable. Prompts shape task framing, output constraints, and potential failure modes, so they must be documented where they materially affect organisational outcomes.

Practical example:
A prompt that instructs a model to provide ?confident recommendations? may suppress uncertainty language that reviewers need to see.

Link to RAIDT:
Prompt design becomes part of run evidence and influences pillar scoring, especially Interpretability, Dependability, and Traceability.

Q7. How does RAIDT handle RAG and retrieved context?

Answer:
RAIDT treats retrieval as part of the run configuration because external or internal source selection strongly affects output quality and defensibility. Governance requires knowing not only that retrieval occurred, but which materials were retrieved, from where, and with what version status.

Practical example:
A student guidance answer is less defensible if it used an outdated policy PDF when a newer version existed.

Link to RAIDT:
Retrieved context is part of the evidence chain and is central to Traceability and Auditability.

Q8. Where do PEFT, LoRA, and alignment controls fit?

Answer:
These are configuration and assurance factors that shape how a model behaves in a given run. If an organisation uses a PEFT or LoRA-adapted variant, or relies on RLHF-derived alignment and safety controls, those choices should be captured as part of the technical and governance context. They do not replace run-level governance, but they materially affect it.

Practical example:
A customer-service assistant built on a LoRA-adapted model may behave differently from the base model in tone, constraints, and domain specificity.

Link to RAIDT:
Such factors belong in the run-level evidence pack and inform Dependability and Responsibility assessments.

Q9. How does RAIDT support standards and policy alignment?

Answer:
RAIDT can act as an operational layer that helps organisations show how abstract obligations are being enacted in day-to-day use. Standards and policy instruments often require documentation, accountability, risk management, and monitoring; RAIDT provides a concrete evidence structure through which such expectations can be demonstrated and tested.

Practical example:
An organisation maps evidence fields in the run pack to internal controls derived from the EU AI Act, ISO/IEC 42001, or the NIST AI RMF.

Link to RAIDT:
This is a major route from Circle 0 into Paper 10 and sector playbooks.

Q10. What would count as proof that RAIDT works?

Answer:
RAIDT is supported if empirical studies show that run-level evidence improves review quality, accountability clarity, contestability, and intervention design without creating unmanageable administrative burden. Evidence of usefulness would include better audit reconstruction, more consistent governance decisions, and clearer identification of where risks actually arise.

Practical example:
In a comparative study, teams using RAIDT produce more defensible records and detect more governance failures than teams relying only on general policy guidance.

Link to RAIDT:
This question directly connects Circle 0 to Paper 09 on empirical validation.

Practical examples
Evidence needed / what to capture
Link to RAIDT project
Citation ideas to support this note
Boundaries and limitations

RAIDT does not claim to solve all AI governance problems by documenting runs. It does not replace model evaluation, procurement scrutiny, organisational policy, legal review, or broader ethical analysis. It cannot guarantee that all outputs are correct, fair, or harmless. It also does not assume that the five-pillar score is mechanically objective; scoring remains a structured judgement that must be calibrated and validated. The framework is strongest when used as part of a wider governance stack that includes lifecycle controls, role design, standards alignment, and context-sensitive escalation.

Conclusion

RAIDT is the conceptual core of the project because it reframes responsible AI governance around the run rather than around abstract policy statements or model-level claims alone. A run is one concrete use of a generative AI system for a specific task, at a specific time, in a specific organisational setting. My argument is that this is the point at which governance becomes practically meaningful, because this is where prompts, retrieved context, model configuration, tool access, outputs, and review actually come together. RAIDT then produces two outputs: a run-level evidence pack and a five-pillar score profile covering Responsibility, Auditability, Interpretability, Dependability, and Traceability. The contribution is not that it removes uncertainty, but that it makes uncertainty governable through documentation, reconstruction, and targeted intervention. That matters for both theory and practice. Theoretically, it links responsible AI to Information Systems governance and uncertainty management. Practically, it offers a structure that supervisors, organisations, and policymakers can inspect, test, and adapt across sectors.

Slides
Slide 1 — why RAIDT exists

Purpose:
To frame the governance gap that RAIDT is designed to address.

Key message:
RAIDT exists because high-level AI principles do not by themselves explain whether a specific organisational use of GenAI was responsible.

Slide content:

  • Generative AI use is highly variable in practice
  • Policies and model claims are not enough
  • Organisations need evidence about actual use
  • RAIDT answers this at run level

Speaker note:
This slide introduces the central problem. Existing governance approaches often operate at the level of principles, procurement, or lifecycle controls, but organisational disputes usually arise around a particular output or decision support event. RAIDT focuses on that practical gap by asking what can be evidenced about one concrete use of a GenAI system.

Visual idea:
Comparison graphic showing policy-level governance on one side and run-level evidence on the other.

Link to RAIDT:
This is the project rationale for treating the run as the unit of governance.

Citation support to mention if asked:
Responsible AI governance literature; GenAI assurance and uncertainty sources.

Slide 2 — what a run means

Purpose:
To define the core unit of analysis used by RAIDT.

Key message:
A run is one configured use of a GenAI system for a specific task, time, and context.

Slide content:

  • Specific task in organisational work
  • Prompt or instruction used
  • Model, tools, and context configured
  • Output and checks captured

Speaker note:
Emphasise that RAIDT is not only about the model output. A run includes the instruction, the model or endpoint, tool permissions, retrieved sources where relevant, the output produced, and any human or automated checks. This definition makes governance concrete and inspectable.

Visual idea:
Simple process flow: task -> prompt/configuration -> model/tools/context -> output -> review.

Link to RAIDT:
The run is the anchor for evidence capture, scoring, and intervention.

Citation support to mention if asked:
Human-AI interaction workflow research; audit trail and documentation concepts.

Slide 3 — the two outputs of RAIDT

Purpose:
To explain what RAIDT produces in practical terms.

Key message:
RAIDT produces a run-level evidence pack and a five-pillar score profile.

Slide content:

  • Evidence pack records what happened
  • Score profile assesses governance quality
  • Both support review and contestability
  • Both enable targeted intervention

Speaker note:
This slide should clarify that RAIDT is designed as an operational framework, not only a conceptual model. The evidence pack creates a reconstructable record, while the score profile turns that record into a structured governance judgement. Together they support assurance, audit, and improvement.

Visual idea:
Two-column diagram: evidence pack on the left, score profile on the right, feeding governance decisions.

Link to RAIDT:
These are the two flagship outputs named in the core definition of the project.

Citation support to mention if asked:
Assurance, auditability, and AI documentation source categories.

Slide 4 — the five RAIDT pillars

Purpose:
To introduce the assessment logic of the framework.

Key message:
The five pillars translate responsible GenAI governance into assessable dimensions.

Slide content:

  • Responsibility
  • Auditability
  • Interpretability
  • Dependability
  • Traceability

Speaker note:
Briefly define each pillar in spoken form rather than overloading the slide. Responsibility concerns role clarity and accountability. Auditability concerns reconstruction. Interpretability concerns meaningful understanding. Dependability concerns reliability and fitness for purpose. Traceability concerns the evidence chain across the run.

Visual idea:
Five-part circle or radar profile.

Link to RAIDT:
The pillars are the basis of RAIDT scoring and governance diagnosis.

Citation support to mention if asked:
Responsible AI accountability frameworks; IS governance and assurance concepts.

Slide 5 — what the evidence pack captures

Purpose:
To show the minimum evidence logic behind a run record.

Key message:
Run-level governance depends on capturing configuration, context, output, and review evidence.

Slide content:

  • Task, roles, and risk level
  • Prompt, model, and tool settings
  • Retrieval sources and variants used
  • Output, review, and approval trail

Speaker note:
Explain that the evidence pack is not merely a log file. It is a governance record. It captures enough structured information to reconstruct the run, understand its constraints, and judge whether controls were proportionate to risk. Mention RAG, PEFT or LoRA variants, and human review where relevant.

Visual idea:
Evidence chain or table with four columns: task, configuration, output, review.

Link to RAIDT:
This slide connects directly to the run-level evidence pack and to Auditability and Traceability.

Citation support to mention if asked:
Documentation, provenance, RAG governance, and assurance source categories.

Slide 6 — example in organisational use

Purpose:
To make the framework concrete through a realistic organisational scenario.

Key message:
RAIDT turns an ordinary GenAI task into an evidence-based governance event.

Slide content:

  • Example: policy drafting with RAG
  • Retrieved sources and versions logged
  • Human review and edits recorded
  • Weak areas trigger intervention

Speaker note:
Walk through one example, such as a university policy team generating student guidance from internal rules. Without RAIDT, the team may only know that AI was used. With RAIDT, they can show which sources were retrieved, who reviewed the output, what changed, and where uncertainty remained. That makes the use contestable and governable.

Visual idea:
Step-by-step case vignette or before/after comparison.

Link to RAIDT:
Demonstrates how the framework supports evidence packs, pillar scoring, and governance interventions in practice.

Citation support to mention if asked:
Case-study literature on organisational GenAI use; RAG documentation practices.

Slide 7 — standards and policy relevance

Purpose:
To show why RAIDT matters beyond academic definition.

Key message:
RAIDT provides an operational bridge between day-to-day GenAI use and broader policy or standards expectations.

Slide content:

  • Supports accountable documentation
  • Helps map controls to standards
  • Improves review and assurance readiness
  • Informs sector playbooks and policy pathways

Speaker note:
Make clear that RAIDT is not itself a legal regime, but it can help organisations enact and evidence obligations drawn from frameworks such as the EU AI Act, ISO/IEC 42001, and the NIST AI RMF. The contribution is operationalisation: turning high-level expectations into inspectable run-level practice.

Visual idea:
Bridge diagram from run evidence to standards and policy frameworks.

Link to RAIDT:
This slide links Circle 0 to Paper 10 and to sector playbook development.

Citation support to mention if asked:
EU AI Act materials; ISO/IEC 42001; NIST AI RMF.

Slide 8 — contribution, limits, and next steps

Purpose:
To close the presentation by positioning RAIDT as a testable project contribution.

Key message:
RAIDT offers a defensible governance unit and evidence structure, but its claims still require empirical validation.

Slide content:

  • Stronger than principle-only governance
  • Does not replace system-level controls
  • Scoring requires calibration and testing
  • Papers 08, 09, and 10 carry this forward

Speaker note:
Conclude by stressing both ambition and restraint. RAIDT claims that run-level evidence improves the governance of organisational GenAI use. It does not claim to solve every problem of bias, legality, or power. The next step is empirical and policy work: validating the framework, refining scoring, and adapting it for sectoral use.

Visual idea:
Three-part closing graphic: foundations, validation, policy pathways.

Link to RAIDT:
This slide positions Circle 0 as the entry point to the wider RAIDT research programme.

Citation support to mention if asked:
Methodology, validation, and policy-translation source categories.

Powered by Forestry.md