Academic Theory and Design Logic

flowchart LR
    A[Responsible AI principles] --> B[Model-level governance gap]
    B --> C[RAIDT run-level evidence]
    C --> D[Star S7 theory logic]
    D --> E[Evidence pack]
    D --> F[Five-pillar scoring]
    E --> G[Governance action]
    F --> G
    H[Organisational use cases] --> D

Circle 3 - Academic, adoption and boundary layer

Ring: Academic star

Function

Positions RAIDT as an Information Systems design-science and mid-range theory contribution for governing generative AI in organisational work. This star explains the design logic behind treating the run as the unit of governance, and shows how run-level evidence, pillar scoring, and governance interventions turn responsible AI principles into inspectable organisational practice.

Role in the project

This star sits in the academic and conceptual layer of the RAIDT project. Its role is to explain why RAIDT is not only a practical framework, but also a defensible theory-led contribution to Information Systems governance. It connects foundational concepts to the project's artefacts, mechanisms, and expected outcomes. In project terms, it supports:

Main questions answered by this star
Workshop discussion prompts
Items in this star (12)
Main message

RAIDT addresses a problem that is increasingly central to organisational use of generative AI: organisations are expected to govern systems whose behaviour is probabilistic, context-sensitive, and operationally shaped by many small design choices. A model card, a policy statement, or a high-level risk category may say something important about an AI system, but they do not adequately explain what happened in a particular use of that system at a particular moment. In practice, organisational risk often emerges not from the model in the abstract, but from the configured run: who used it, for what task, with which prompt, retrieval context, tools, settings, safeguards, and checks. RAIDT makes that operational insight the centre of its theory and design logic.

In academic terms, this positions RAIDT within Information Systems rather than within machine learning theory alone. Information Systems research is concerned with how technologies are configured, embedded, governed, and used within organisational settings. That is important because generative AI in organisations is never just a model. It is a socio-technical arrangement involving users, workflows, access controls, prompts, retrieved documents, tool connections, escalation routes, and accountability structures. RAIDT therefore treats governance as a problem of organisationally situated system use, not only of model performance.

The design-science contribution of RAIDT lies in the purposeful creation of governance artefacts intended to solve a recognised class of problems. The problem class is clear: organisations need a practical way to inspect, compare, justify, and improve individual uses of generative AI under uncertainty. RAIDT responds by producing two core artefacts. The first is a run-level evidence pack, which records the relevant elements of a specific run, including instructions, model and tool configuration, retrieved context where applicable, output, and human or automated checks. The second is a five-pillar RAIDT score profile, which converts the collected evidence into an interpretable governance view across Responsibility, Auditability, Interpretability, Dependability, and Traceability. Together, these artefacts support intervention rather than passive documentation.

The theory contribution is stronger when RAIDT is framed as a mid-range design theory. It is not a universal theory of all AI governance, nor is it merely a local case description. Instead, it proposes a bounded explanation of how certain governance outcomes can be improved under identifiable conditions. Its core constructs include the run, the evidence pack, the pillar scores, governance interventions, actors, tasks, controls, and boundary conditions. Its mechanisms explain how evidence capture and structured review change organisational capability. For example, making prompts, retrieval sources, model settings, LoRA or PEFT adapters, tool use, and checks visible creates audit lineage. Audit lineage then supports review, contestability, and targeted correction. Similarly, converting evidence into a standardised score profile allows cross-run comparison and highlights where uncertainty remains unacceptable.

This mechanism-based account matters because many responsible AI discussions remain at the level of principles. Principles such as accountability, transparency, or robustness are valuable, but they often become weak in operational settings because organisations cannot easily connect them to a specific decision episode. RAIDT translates those principles into governable objects. Responsibility becomes clearer when a run records user role, approval route, and intended task. Auditability improves when there is a stable evidence pack. Interpretability becomes more practical when the basis of the output can be examined through prompt structure, retrieved context, and stated checks, even if the underlying model remains only partially interpretable. Dependability is addressed by documenting checks, repeatability conditions, and failure patterns. Traceability is strengthened through audit lineage across input, configuration, output, review, and intervention.

The run-level focus also helps explain why RAIDT is relevant to AI and uncertainty, including managerial uncertainty. Managers are often not deciding whether a model is perfect. They are deciding whether a particular use is acceptable, contestable, and governable in context. A procurement manager may tolerate lower interpretability for low-risk drafting support but require stronger traceability and human sign-off for contract risk analysis. A public-sector team using retrieval-augmented generation may accept a narrow task if the retrieved documents are logged and the review process is explicit. In both cases, RAIDT provides a way to govern uncertainty by evidencing the concrete run and aligning responses to the risk of the task.

This also explains the relevance of technical foundations such as prompt engineering, RAG, PEFT or LoRA, and RLHF or alignment controls. These are not peripheral technical details. They are part of what shapes the behaviour of a run. A retrieval pipeline changes what contextual evidence is brought into the system. A domain adapter may improve task fit but introduce new versioning and assurance questions. Alignment controls may reduce certain harmful outputs while obscuring how edge cases are handled. RAIDT's design logic does not require a full causal model of internal weights. Instead, it requires enough run-level evidence to make governance judgements inspectable and actionable.

For the project as a whole, this star helps justify why RAIDT belongs in Information Systems governance scholarship. It contributes a theory of governable AI use in organisations, not only a technical method. It shows how the framework links concepts, artefacts, and outcomes: from socio-technical conditions, to run configuration, to evidence capture, to score formation, to governance action. That chain is especially important for Paper 08, where the project must explain its conceptual foundations and methodological pathway. It is equally important for Paper 09, because the theory implies empirical questions that can be tested: do evidence packs improve review quality, do pillar scores improve intervention prioritisation, and do sectoral contexts change which mechanisms matter most? It also supports Paper 10 by showing how policy frameworks such as the EU AI Act, ISO/IEC 42001, and the NIST AI RMF can be operationalised at run level rather than treated as external compliance labels.

The limitations of the theory should be stated clearly. RAIDT does not claim to solve all questions of AI safety, fairness, or legal compliance. It does not eliminate uncertainty, and it does not make opaque models fully transparent. It is best understood as a governance theory and artefact set for organisational use episodes. Its value depends on disciplined evidence capture, reasonable scoring criteria, and institutional willingness to act on the findings. Even so, that is a meaningful contribution: RAIDT explains how responsible AI governance can move from abstract aspiration to auditable practice, one run at a time.

Key questions and answers

Q1. What is the academic theory contribution of RAIDT?

Answer:
RAIDT contributes a design-science, mechanism-based mid-range theory for governing generative AI use in organisations. Its theoretical novelty lies in shifting attention from the model as a static object to the run as a situated governance unit. This matters because organisational risk, accountability, and assurance arise from specific uses of GenAI, not from model capability statements alone.

Practical example:
A university uses the same model for marketing copy, student support triage drafts, and research administration summaries. The model is constant, but the governance requirements differ by run.

Link to RAIDT:
RAIDT explains why each run should produce an evidence pack and a five-pillar score profile so that governance is tied to actual use, not abstract model claims.

Q2. Why is design science the right framing?

Answer:
Design science is appropriate because RAIDT creates purposeful artefacts to solve a real organisational problem. It does not only describe governance; it designs tools for governance, namely the run-level evidence pack, score profile, and linked intervention logic.

Practical example:
A public body needs a standard way to review GenAI use across departments. RAIDT provides a repeatable structure for documenting runs and comparing them.

Link to RAIDT:
The evidence pack and scoring model are the designed artefacts through which RAIDT produces practical value and researchable outcomes.

Q3. What makes RAIDT a mid-range theory rather than a grand theory?

Answer:
RAIDT explains a bounded class of phenomena: how run-level evidence and structured scoring support better governance of generative AI in organisational settings. It does not attempt to explain all technology adoption, all AI behaviour, or all institutional regulation.

Practical example:
RAIDT can explain why documenting retrieval sources and reviewer checks improves contestability in a legal drafting workflow. It does not claim to explain every aspect of legal organisational change.

Link to RAIDT:
This boundedness makes the framework testable in Paper 09 and adaptable in sector playbooks without overstating its scope.

Q4. What problem does the run-level focus solve?

Answer:
The run-level focus solves the governance gap between high-level AI principles and concrete operational decisions. Without a run-based unit, organisations struggle to determine what exactly happened, who was accountable, what context shaped the output, and what should be changed after a failure or near miss.

Practical example:
A claims analyst receives a flawed GenAI summary. If the organisation only knows the model name, it cannot diagnose the issue. If it records the prompt, retrieved files, settings, output, and checks, the failure becomes investigable.

Link to RAIDT:
The run-level evidence pack is the core mechanism through which RAIDT makes failure analysis, scoring, and governance intervention possible.

Q5. What are the main mechanisms proposed by RAIDT?

Answer:
RAIDT proposes mechanisms of visibility, comparability, contestability, and intervention. Evidence capture makes the run visible. Standardised scoring makes runs comparable. Audit lineage and documented checks make outputs contestable. Governance routes then turn those insights into intervention, such as escalation, redesign, retraining, tighter access controls, or prohibition.

Practical example:
Two departments use RAG differently. One logs retrieved documents and reviewer actions; the other does not. The first can explain and contest outputs; the second cannot.

Link to RAIDT:
These mechanisms map directly onto the evidence pack, pillar scoring, and governance interventions that define RAIDT's practical architecture.

Q6. How do the five pillars function in the theory?

Answer:
The pillars are not only reporting categories; they are governance dimensions that operationalise what a good run should demonstrate. Responsibility asks who owns and approves the use. Auditability asks whether the run can be reviewed. Interpretability asks whether there is sufficient explanatory basis for the use context. Dependability asks whether the run is reliable enough for the task. Traceability asks whether the history and lineage of the run can be reconstructed.

Practical example:
A finance drafting run may score well on traceability and auditability because inputs and reviews are logged, but poorly on dependability if outputs vary too much under similar conditions.

Link to RAIDT:
The five pillars are the scoring lens through which run evidence is transformed into a governance judgement.

Q7. Why does this count as Information Systems governance rather than only AI ethics?

Answer:
RAIDT examines how technology, organisational process, human roles, and control structures interact in real work settings. That makes it an Information Systems governance contribution. AI ethics provides normative direction, but RAIDT focuses on how governance is instantiated in systems, workflows, and accountability arrangements.

Practical example:
In a procurement workflow, the issue is not only whether fairness matters in principle. The issue is how prompts, document retrieval, reviewer sign-off, and audit logs are structured in the information system.

Link to RAIDT:
RAIDT connects responsible AI values to operational IS governance through evidence capture, scoring, and intervention design.

Q8. What evidence would show that RAIDT works?

Answer:
Useful evidence would show that RAIDT improves review quality, speeds root-cause analysis, supports clearer accountability, and helps organisations choose proportionate interventions. It may also show improved policy alignment or reduced ambiguity in supervision and audit.

Practical example:
In a pilot study, reviewers using RAIDT might detect missing approval steps or unstable RAG inputs more consistently than reviewers relying on informal notes.

Link to RAIDT:
Paper 09 can test these effects by comparing governance quality before and after adoption of the evidence pack and score profile.

Q9. How does this star connect to standards and policy?

Answer:
This star provides the theory for translating broad policy requirements into run-level operational evidence. Standards and laws often specify what organisations should be able to demonstrate, but they do not always specify how to evidence a particular use episode.

Practical example:
An organisation seeking alignment with ISO/IEC 42001 or the NIST AI RMF can use RAIDT to record controls, review actions, and residual uncertainties for specific high-stakes runs.

Link to RAIDT:
Paper 10 can use this design logic to show how RAIDT becomes a policy pathway, especially when mapping run evidence to control expectations and sector guidance.

Q10. What are the main boundary conditions of the theory?

Answer:
RAIDT is most suitable where organisations use generative AI in identifiable tasks and can capture meaningful evidence about each run. It is less suitable where runs are impossible to reconstruct, where governance authority is absent, or where the primary concern is low-level model internals beyond available evidence.

Practical example:
RAIDT is useful for assisted drafting, triage support, and summarisation workflows with logs and reviewers. It is less sufficient by itself for highly autonomous systems with weak trace capture.

Link to RAIDT:
Boundary conditions help define where scoring is defensible, where additional controls are needed, and where sector playbooks should narrow or extend the framework.

Practical examples
Evidence needed / what to capture
Link to RAIDT project
Citation ideas to support this note
Boundaries and limitations
Conclusion

This star explains the academic core of RAIDT. The argument is that RAIDT is not just a practical checklist for responsible AI, but a design-science contribution to Information Systems governance. Its novelty is to treat the run as the unit of governance. By run, I mean one configured use of a generative AI system for a specific task, at a specific time, in a specific context. That matters because most governance failures do not arise from the model in abstraction; they arise from a concrete use episode shaped by prompts, retrieval context, tools, settings, and checks.

The theory contribution is mid-range and mechanism-based. RAIDT proposes that if organisations capture structured run evidence and convert it into a five-pillar score profile, they improve visibility, auditability, contestability, and intervention capacity. The framework therefore links abstract responsible AI principles to operational governance action. This helps position Paper 08 conceptually, gives Paper 09 testable propositions, and supports Paper 10 by showing how standards and policy expectations can be translated into run-level evidence and governance practice.

Slides
Slide 1 — why academic theory and design logic matter

Purpose:
Frame why this star exists and why supervisors should care about it.

Key message:
RAIDT needs a clear academic theory contribution so it is understood as more than a practical governance checklist.

Slide content:

  • Organisations govern GenAI under uncertainty
  • High-level principles are not enough
  • RAIDT treats the run as the key governance unit
  • This creates a defensible Information Systems contribution

Speaker note:
Introduce the star as the note that explains what kind of academic contribution RAIDT is making. Emphasise that the project must show conceptual rigour as well as practical usefulness. The central move is to govern runs, not just models, because concrete organisational risk appears in situated use.

Visual idea:
Problem-to-contribution flow showing principles, governance gap, run-level unit, and academic contribution.

Link to RAIDT:
This slide frames the overall design logic behind the evidence pack, score profile, and governance interventions.

Citation support to mention if asked:
IS design science, responsible AI governance, socio-technical systems.

Slide 2 — the problem: model-level governance is too abstract

Purpose:
Explain the problem RAIDT is designed to solve.

Key message:
Model-level documentation does not tell an organisation what happened in a specific GenAI use episode.

Slide content:

  • Same model can support very different tasks
  • Risk depends on prompt, context, tools, and checks
  • Managers need evidence about a specific run
  • Governance fails when the run is not reconstructable

Speaker note:
Explain that organisations often rely on broad policies, model cards, or vendor claims, but these do not explain a particular output in a particular workflow. The governance issue is operational uncertainty: who used the system, for what purpose, with what context, and under what controls.

Visual idea:
Comparison table: model-level view versus run-level view.

Link to RAIDT:
This is the problem that justifies the run-level evidence pack and the five-pillar score profile.

Citation support to mention if asked:
AI uncertainty, auditability, organisational accountability, AI risk governance.

Slide 3 — RAIDT as a design-science contribution

Purpose:
Show that RAIDT is a designed solution to a class of governance problems.

Key message:
RAIDT creates purposeful artefacts for inspecting, comparing, and improving GenAI runs.

Slide content:

  • Problem class: govern GenAI use in organisations
  • Artefact 1: run-level evidence pack
  • Artefact 2: five-pillar score profile
  • Outcome: basis for governance intervention

Speaker note:
Position RAIDT within design science by stressing purposeful artefact creation. The project is not only describing governance challenges; it is designing a practical governance apparatus that can be studied, evaluated, and iterated.

Visual idea:
Artefact stack showing input run, evidence pack, score profile, intervention.

Link to RAIDT:
This slide directly names the two practical outputs that define the framework.

Citation support to mention if asked:
Design science research in Information Systems; artefact evaluation literature.

Slide 4 — RAIDT as a mid-range theory

Purpose:
Clarify the type and scope of theory RAIDT offers.

Key message:
RAIDT is a bounded theory of how run-level evidence and scoring improve governance in organisational contexts.

Slide content:

  • Not a grand theory of all AI governance
  • Not just a local checklist
  • Explains constructs, mechanisms, and outcomes
  • Testable across sectors and tasks

Speaker note:
Explain that mid-range theory is the right level of abstraction. RAIDT identifies stable constructs such as run, evidence pack, score profile, and intervention, while allowing boundary conditions to vary by sector, risk, and workflow.

Visual idea:
Three-level pyramid: grand theory, mid-range theory, local case description.

Link to RAIDT:
This is the conceptual basis for Paper 08 and for the empirical testing strategy in Paper 09.

Citation support to mention if asked:
Mid-range theory, design theory, mechanism-based explanation.

Slide 5 — mechanisms: how RAIDT produces governance value

Purpose:
Explain the causal logic of the framework.

Key message:
RAIDT works by making runs visible, comparable, contestable, and actionable.

Slide content:

  • Evidence capture creates visibility
  • Audit lineage supports contestability
  • Scoring supports comparison and prioritisation
  • Interventions convert evidence into action

Speaker note:
Walk through the mechanism chain. Once the organisation can see what happened in a run, it can compare runs, challenge outputs, identify weak controls, and decide what intervention is proportionate. This is where the five pillars become practically meaningful.

Visual idea:
Evidence chain or process arrow from run to intervention.

Link to RAIDT:
Maps directly to Responsibility, Auditability, Interpretability, Dependability, and Traceability.

Citation support to mention if asked:
Audit lineage, contestability, AI assurance, IS governance controls.

Slide 6 — what the evidence pack must capture

Purpose:
Make the design concrete and show what counts as governance evidence.

Key message:
A run is governable only if its configuration, context, output, and checks are captured systematically.

Slide content:

  • Prompt, task, user role, and approvals
  • Model, settings, tools, and adapters
  • Retrieved context and knowledge source versions
  • Output, checks, scores, and interventions

Speaker note:
Use this slide to show that RAIDT is operational rather than abstract. Mention prompt engineering, RAG, PEFT or LoRA, and alignment controls as governance-relevant features because they shape the behaviour of a run. The evidence pack is what turns these features into auditable organisational knowledge.

Visual idea:
Structured evidence table or evidence pack template.

Link to RAIDT:
This is the core of the run-level evidence pack and the input to the five-pillar score profile.

Citation support to mention if asked:
Documentation practices, prompt engineering, RAG governance, AI assurance methods.

Slide 7 — why this matters for validation and policy

Purpose:
Connect the theory to the later papers and to external governance frameworks.

Key message:
RAIDT provides a bridge between empirical validation and policy operationalisation.

Slide content:

  • Paper 09 can test the proposed mechanisms
  • Paper 10 can map policy to run-level evidence
  • Sector playbooks can adapt boundary conditions
  • Standards become operational rather than symbolic

Speaker note:
Explain that this star is not only conceptual. It structures later project stages. The empirical work can test whether evidence packs and score profiles improve governance outcomes. The policy work can show how EU AI Act, ISO/IEC 42001, and NIST AI RMF expectations are translated into actual evidence and review practice.

Visual idea:
Bridge graphic linking theory, empirical validation, and policy pathways.

Link to RAIDT:
This slide ties the academic theory star to Papers 08, 09, and 10, as well as to sector playbooks.

Citation support to mention if asked:
Policy and standards documents; empirical AI governance studies.

Slide 8 — boundaries, limits, and project significance

Purpose:
Close with a balanced statement of value and limitation.

Key message:
RAIDT does not solve all AI governance problems, but it makes organisational GenAI use substantially more governable.

Slide content:

  • Does not remove uncertainty
  • Does not replace legal or domain judgement
  • Depends on evidence quality and institutional uptake
  • Offers a practical theory of governable AI use

Speaker note:
End by being explicit about scope. RAIDT is not claiming perfect transparency or universal applicability. Its significance lies in offering a rigorous, practical, and researchable way to move from responsible AI principles to auditable organisational action.

Visual idea:
Balanced two-column slide: contribution versus limitation.

Link to RAIDT:
Reinforces RAIDT as a run-level governance framework with clear academic, empirical, and policy relevance.

Citation support to mention if asked:
Responsible AI limitations literature; governance and assurance scholarship.

Powered by Forestry.md