C0.10 - Core_claim

C0.10 ? Core claim

flowchart LR
    A[High-level governance only:
principles, policies, model cards,
system cards, periodic audits] --> B[RAIDT:
run-level evidence framework] B --> C[[Core claim:
responsible GenAI governance
must be operationalised at the run level]] C --> D[Run-level evidence] D --> E[Evidence pack] D --> F[RAIDT score profile] E --> G[Reviewer reconstruction
and contestability] F --> H[Governance readiness
and organisational learning] I[Healthcare, finance, education,
public services, enterprise productivity] --> C

? Star C0 - RAIDT Core, Definition, Values, Claims and Innovation

Star context: Defines the project identity: RAIDT argues that responsible governance of GenAI in organisational work becomes credible only when grounded in run-level evidence, not only in principles, policies, model descriptions, or periodic audits.


Definition / background

The core claim of RAIDT is that responsible governance of generative AI in organisational work must be evidenced at the level where risk, judgement, and consequence actually materialise: the run. A run is one configured use of a GenAI system for a specific task, at a specific time, in a specific context. This claim does not deny the importance of model governance, organisational policy, or lifecycle assurance. Rather, it argues that these higher-level controls are insufficient on their own because they do not show what happened in a particular episode of use.

Conceptually, the claim responds to a persistent weakness in AI governance discourse. Many governance instruments describe what ought to happen, what a model is broadly capable of, or what an organisation intends to control. They are often less effective at showing what a human reviewer can inspect after a specific use event. RAIDT therefore repositions governance around the unit that can be reconstructed, evaluated, compared, and improved: the run.

Inside RAIDT, the core claim is the framework's central proposition. It justifies run-level evidence as the unit of proof, the evidence pack as the unit of review, and the score profile as the unit of structured judgement across Responsibility, Auditability, Interpretability, Dependability, and Traceability. Without this claim, RAIDT would remain a broad governance aspiration; with it, RAIDT becomes a method for producing practical evidence of governance readiness.

The item belongs in Star C0 because it states the thesis that holds the rest of the star together. The definition of RAIDT, the meaning of the run, the role of evidence over assertion, the emphasis on reviewability and contestability, and the account of innovation all depend on this claim being clear. It is therefore both a conceptual summary and a design rule for the whole project.

Why this concept matters

The core claim matters because it answers a simple but foundational question: where should responsible governance of GenAI be demonstrated? RAIDT's answer is that governance must be demonstrated where the system is actually used, configured, reviewed, and relied upon. If this is left vague, organisations can appear well governed at the policy level while remaining weak at the level of concrete operational use.

The concept also prevents a common confusion between governance declaration and governance evidence. A principle can declare that a system should be fair, safe, or accountable. A model card can describe intended use and known limitations. A periodic audit can review selected controls. None of these, by themselves, provide a reviewer with enough detail to reconstruct one situated use episode and assess whether governance expectations were actually met. The core claim therefore guards against over-claiming.

For organisations using GenAI, this matters in practical terms. Real harms and failures arise through specific uses: a misleading summary, an inappropriate recommendation, an untraceable prompt chain, an undocumented override, or a missing human review step. By focusing attention on runs, RAIDT provides a pathway from general governance ambition to operational governance practice.

Key idea: The core claim matters because it locates responsible GenAI governance at the point where use can be evidenced, reviewed, challenged, and improved.

What this item explains
Practical example / likely audience question

Audience question

Why is RAIDT's core claim stronger than simply saying that organisations should use AI responsibly?

Answer

The concern behind this question is that many governance frameworks already endorse responsible AI, so RAIDT may appear to be restating a familiar principle. The direct answer is that RAIDT does not stop at endorsement. Its core claim specifies where responsible governance must be made visible and how it can be evaluated. It argues that responsible use is not adequately demonstrated by policy commitments, model-level documentation, or occasional audit alone; it must be supported by evidence from the run in which the system was actually used.

A practical example makes the distinction clearer. An organisation may publish a strong policy on human oversight for GenAI-assisted report drafting. That policy is useful, but it does not show whether one particular report was generated with the approved prompt template, whether the correct model version was used, whether a human checked the output, whether sensitive data were handled properly, or whether downstream edits were logged. RAIDT's core claim insists that this level of detail matters because it is the level at which governance succeeds or fails in practice.

RAIDT handles this issue better than a generic AI governance approach because it links the claim directly to operational artefacts: run-level evidence, evidence packs, and the five-pillar score profile. The result is not merely a statement that governance should exist, but a framework for showing that governance was enacted in a particular use episode.

Practical example in RAIDT terms

Consider a hospital using a GenAI system to draft discharge summaries for clinicians. The use case appears low-friction, but the governance question is not whether the hospital has an AI policy in general. The run-level question is whether one specific discharge-summary draft, produced for one patient encounter by one clinician at one time, was generated under the correct safeguards.

In RAIDT terms, the run-level issue includes the prompt template used, the model and version invoked, the clinical context, whether patient data were minimised or masked appropriately, whether a clinician reviewed the output before release, and whether any material corrections were made. The evidence needed would include run metadata, prompt and output records where policy permits, reviewer notes, approval status, exception flags, and provenance links to the surrounding workflow.

The most affected RAIDT pillars would be Responsibility, Auditability, Dependability, and Traceability, with Interpretability supporting explanation of why the output was accepted or amended. The core claim improves governance readiness here because it shifts the hospital from saying "we govern GenAI responsibly" to showing how a concrete clinical use can be reconstructed, assessed, and improved.

Detailed link to RAIDT

Core claim links to RAIDT in four ways.

First, it states RAIDT's central thesis: responsible governance becomes credible only when it is demonstrated at the level of actual GenAI use.
Second, it anchors the framework in the run and makes run-level evidence the basic unit of governance proof.
Third, it explains why RAIDT produces evidence packs and score profiles rather than relying on unstructured assurance claims.
Fourth, it connects the framework to reviewability, contestability, audit readiness, and organisational learning by making specific uses reconstructable and comparable.

Core claim ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

This chain is central to RAIDT because the claim is not an isolated statement. It is the logic that turns a theory of governance into a practical method for documenting, scoring, reviewing, and improving organisational GenAI use.

Link to the five RAIDT pillars

The core claim shapes all five RAIDT pillars, but it has its strongest direct force in Auditability and Traceability because both depend on being able to reconstruct a run.

Responsibility

The claim supports Responsibility by requiring that each run has a defined purpose, user role, decision context, and accountability structure. Responsibility is weakened when AI use is discussed only in generic organisational terms.

Example evidence / implication:

Auditability

The claim strongly supports Auditability because a reviewer can audit only what has been captured in a reconstructable form. Run-level evidence makes it possible to inspect whether a specific use complied with policy and procedure.

Example evidence / implication:

Interpretability

The claim supports Interpretability by requiring that a run can be understood in context: why the system was used, how the output was judged, and what reasoning accompanied acceptance, revision, or rejection.

Example evidence / implication:

Dependability

The claim supports Dependability because dependable governance depends on consistent control of repeated runs, not on isolated claims about system quality. Run-level evidence allows patterns of reliable or unreliable use to be identified.

Example evidence / implication:

Traceability

The claim strongly supports Traceability by insisting that each run can be followed across inputs, system components, human actions, outputs, and subsequent decisions. Without traceability, governance claims remain difficult to verify.

Example evidence / implication:

Why this item is more than a generic concept

In general AI governance discussion, a core claim may function as a mission statement or abstract thesis. In RAIDT, the core claim is more operational than that. It acts as a design principle stating that governance claims should be tested against evidence from specific runs.

That difference matters. A generic claim can be rhetorically persuasive without changing organisational practice. RAIDT's meaning is stricter: if a governance statement cannot be grounded in run-level evidence, it remains incomplete. The RAIDT interpretation is therefore more operational because it is tied directly to evidence capture, evidence-pack assembly, score profiling, and governance readiness.

Common misunderstanding

Misunderstanding

The core claim means that only the run matters, so model-level governance and organisational policy become secondary or unnecessary.

Correction

This is incorrect. RAIDT does not replace model governance, policy governance, procurement controls, or lifecycle assurance. It complements them by identifying the level at which those broader controls must become inspectable in practice. For example, a model card may describe intended use and limitations, but a run still needs evidence showing which model version was used, for what task, under what human oversight, and with what outcome. The core claim therefore narrows the operational focus of governance without narrowing governance itself.

Boundary and limitation

The core claim does not prove that a run was ethically acceptable merely because evidence exists. Good evidence can still reveal poor practice. Nor does the claim imply that run-level governance replaces legal compliance, sectoral regulation, technical testing, or organisational culture. It identifies the level at which governance should become reviewable; it does not exhaust every requirement of responsible AI governance.

The claim also depends on adequate evidence capture. If prompt records, reviewer actions, contextual metadata, or decision links are missing, then the practical force of the claim is weakened. RAIDT handles this limitation by coupling the claim to structured evidence-pack design and scoring criteria rather than treating the claim as self-sufficient.

Finally, the claim works best when runs are sufficiently well-defined. In highly fluid or informal workflows, organisations may need to formalise task boundaries, logging practices, and review checkpoints before the claim can be implemented robustly.

Implementation levels

Manual implementation

A researcher or small team can apply the core claim manually by defining what counts as a run, documenting each important GenAI use episode, and recording the minimum evidence needed for later review. This may involve templates, research logs, reviewer notes, and manually assembled evidence packs.

Semi-automated implementation

Semi-automated implementation adds structured metadata forms, workflow templates, and scoring rubrics that prompt users to capture key run details consistently. This level supports comparison across runs and reduces omission of important governance information.

Fully automated implementation

At scale, the core claim can be implemented through a platform layer, wrapper, orchestration service, or governance pipeline that automatically logs run metadata, model/version details, prompt chains, reviewer interactions, policy checks, and scoring inputs. In this form, the claim becomes an operational property of the system architecture rather than only a procedural expectation.

Practical use in the RAIDT project

Within the RAIDT project, this item has a foundational role. In Paper 08 Foundations, it helps articulate the framework's central thesis and distinguish RAIDT from principle-led or model-centred governance approaches. In Paper 09 Empirical Validation, it provides the rationale for testing whether run-level evidence and scoring improve review quality, agreement, and practical governance readiness. In Paper 10 Policy Pathways, it offers a bridge from conceptual governance language to implementable institutional controls.

The item is also useful for sector playbooks, evidence-pack design, and scoring-rubric explanation because it states why those artefacts exist in the first place. For supervisor explanation and viva defence, it gives a concise answer to the question of what RAIDT is really claiming. For journal positioning, it sharpens the contribution by showing that RAIDT is not merely another responsible AI checklist but a framework that relocates governance to the level of demonstrable use.

Key audience questions to prepare for

Q1. Is the core claim mainly conceptual, or does it have operational consequences?

It has direct operational consequences. The claim determines the unit of analysis, the structure of evidence collection, the design of the evidence pack, and the logic of the score profile. In RAIDT, a conceptual claim is only valuable if it changes how governance is evidenced and reviewed.

Q2. Why choose the run rather than the model as the main governance unit?

Because models are used differently across tasks, users, settings, and times. Many governance risks arise from situated use rather than from model properties alone. The run captures that situated use in a way that a model-level description cannot.

Q3. Does this claim imply that principles and policies are unimportant?

No. Principles and policies remain necessary because they define expectations and boundaries. RAIDT's point is that they are insufficient on their own unless they can be connected to evidence from actual use episodes.

Q4. How does the core claim help with audit readiness?

It helps by insisting that governance be documented in a reconstructable form. Auditors, reviewers, or supervisors need more than intent statements; they need evidence showing what happened in a specific run and how that run was judged.

Q5. What is the main contribution of this claim to the wider AI governance debate?

Its main contribution is to shift attention from abstract governance commitments to evidence-bearing use episodes. That shift makes governance more inspectable, contestable, and capable of continuous improvement in organisational practice.

Suggested citation concepts to support this item
Short explanation for presentation

The core claim of RAIDT is that responsible governance of generative AI must be demonstrated at the level of the run, not only at the level of policy, model description, or periodic audit. A run is one configured use of a GenAI system for a particular task in a particular context. This matters because real governance failures occur in situated use episodes: a misleading output, a missing review step, an undocumented override, or an untraceable prompt chain. RAIDT therefore argues that governance becomes meaningful only when those episodes can be evidenced, reconstructed, and scored. That is why the framework produces run-level evidence, assembles it into an evidence pack, and translates it into a five-pillar score profile. The claim is central because it turns responsible AI from aspiration into something operational, reviewable, and improvable.

One-line takeaway

Core claim is RAIDT's central thesis because it states that responsible GenAI governance becomes credible only when tied to run-level evidence.

Related items in RAIDT core, definition, values, claims and innovation
Mentioned in reference-paper summaries (2)

Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.

Anchored questions
Powered by Forestry.md