S9.10 - Incident_response

S9.10 ? Incident response

flowchart LR
    A[Fragmented handling of harmful or disputed outputs] --> B[RAIDT - run-level evidence framework]
    A2[Generic policy without reconstructable evidence] --> B
    B --> C[[Incident response]]
    C --> D[Run-level reconstruction]
    D --> E[Evidence pack]
    D --> F[RAIDT score profile]
    C --> G[Corrective action]
    C --> H[Reviewer reconstruction]
    E --> I[Reviewability]
    F --> J[Governance readiness]
    G --> K[Organisational learning]
    H --> K
    L[Healthcare] --> C
    M[Public services] --> C
    N[Finance] --> C
    O[Cybersecurity] --> C
    P[Enterprise productivity] --> C

? Star S9 - Policy, Standards and Assurance

Star context: Connects RAIDT to policy instruments, standards, assurance, procurement, audit and organisational accountability, showing how governance claims are tested when a specific GenAI run causes concern.


Academic picture
Definition / background

Incident response is the structured process used to identify, triage, investigate, remediate and learn from a problematic GenAI run. In a RAIDT context, the concept is narrower and more operational than a broad organisational crisis response, yet broader than a simple technical bug fix. It covers harmful outputs, policy breaches, unsafe recommendations, data-handling concerns, failed review steps, retrieval faults, prompt manipulation and other events in which a particular run requires formal reconstruction and corrective action.

Conceptually, incident response sits at the point where governance principles are tested by concrete events. Many AI governance frameworks state that systems should be safe, accountable and auditable, but incident response asks what an organisation can actually do when an output is contested or harm is alleged. RAIDT makes this actionable by treating the run as the unit of governance. A run is a specific use of a GenAI system for a defined task, at a specific time, in a specific context. That framing allows investigators to examine what was configured, what evidence was produced and how decisions were made.

This item differs from adjacent concepts. It is not the same as general risk management, because it addresses an actual or suspected event rather than a prospective risk register. It is not identical to post-market monitoring, because incident response is often immediate and case-specific, whereas monitoring is broader and longitudinal. It is not just internal audit, because the purpose is not only assurance after the fact but timely reconstruction, containment and improvement. Inside RAIDT, incident response belongs centrally because run-level evidence, evidence packs and score profiles provide the practical substrate for reviewability, contestability and continuous improvement.

Why this concept matters

Incident response matters because organisations using GenAI need a credible way to move from abstract commitments to operational accountability. When something goes wrong, the key governance question is not whether principles existed in policy, but whether the organisation can reconstruct the event, explain what happened, identify control weaknesses and implement proportionate corrective action. Without that capability, governance remains declarative.

The concept also prevents a common failure mode in GenAI deployment: treating a disputed output as an isolated anomaly with no structured learning loop. If incident response is weak, organisations may over-blame the model, over-blame the user, or settle for anecdotal explanation. RAIDT reduces that ambiguity by anchoring the inquiry in run-level evidence, so the discussion can move from opinion to documented sequence, context and control performance.

Key idea: Incident response matters because RAIDT turns a problematic GenAI event into a reconstructable, reviewable and improvable governance case at the level of the individual run.

What this item enables
Practical example / likely audience question

Audience question

What if a harmful output occurs? Is that simply a model failure, or can RAIDT actually support incident response in a useful way?

Answer

The concern behind this question is that many organisations assume incident response for GenAI is either too technical for governance teams or too ambiguous for meaningful review. The direct answer is that RAIDT is useful precisely because it narrows the scope of inquiry to a specific run and the evidence associated with it. Instead of debating the system in the abstract, the organisation can reconstruct what the prompt asked for, what context was retrieved, which model or configuration was used, what output was produced, whether a human review step occurred and what happened next.

For example, if a drafting assistant generates a harmful recommendation, a generic governance approach may only record that "the AI produced an unsafe answer". RAIDT supports a more precise response: investigators can examine whether the retrieval source was outdated, whether the prompt omitted a safeguard, whether the reviewer missed a warning, or whether the use context itself was inappropriate. That is better than a generic approach because it creates evidence-backed causal explanation rather than broad speculation, and it supports both immediate remediation and control redesign.

Practical example in RAIDT terms

In healthcare, a hospital uses a GenAI assistant to draft discharge summaries for clinicians. In one run, the draft includes an incorrect medication instruction that is almost sent to a patient. The run-level issue is not merely that the model was "wrong"; it is that a specific configured use, in a specific clinical context, produced a near miss requiring investigation. RAIDT would require evidence such as the prompt template, retrieved clinical guidance, model version, time stamp, user role, draft output, edits made by the clinician, review checkpoints and escalation record. The most affected pillars are Dependability, Traceability, Auditability and Responsibility, with Interpretability also relevant if the rationale for the suggestion is unclear. Incident response improves governance readiness here because the organisation can reconstruct the event, classify the control weakness, document the remedial action and show that learning feeds back into safer future runs.

Detailed link to RAIDT

Incident response links to RAIDT in four ways.

First, it expresses RAIDT's core idea that governance should be based on evidence from actual use rather than only on policy statements or design-time claims.
Second, it depends on the run as the unit of analysis, because incident response becomes precise only when a single configured use can be reconstructed.
Third, it turns run-level evidence into a usable evidence pack and a meaningful RAIDT score profile, showing where governance performance was strong or weak.
Fourth, it connects individual events to reviewability, contestability, audit readiness and organisational learning, so the outcome of an incident is not only correction but improved governance capacity.

Incident response -> Run-level evidence -> Evidence pack -> RAIDT score profile -> Governance readiness

This chain matters because incident response is one of the clearest demonstrations that RAIDT is not just descriptive. It creates a route from a contested event to documented review, score-informed diagnosis and governance improvement.

Link to the five RAIDT pillars

Responsibility

Incident response clarifies who is accountable for receiving, triaging, investigating and resolving a problematic run.

Example evidence / implication:

Auditability

This item strongly affects Auditability because incident response depends on whether the event can be reconstructed in a disciplined and inspectable way.

Example evidence / implication:

Interpretability

Incident response supports Interpretability by helping reviewers understand why an output may have appeared plausible, misleading or unsafe in context.

Example evidence / implication:

Dependability

This item strongly affects Dependability because repeated incidents often reveal unstable performance, weak controls or failure under realistic operating conditions.

Example evidence / implication:

Traceability

Incident response also strongly affects Traceability because investigators must follow the sequence from input conditions to output and downstream action.

Example evidence / implication:

Why this item is more than a generic concept

In general AI governance, incident response may simply mean having a policy that says issues should be reported, reviewed and escalated. In RAIDT, it means being able to reconstruct and evaluate a specific GenAI run through documented evidence. The RAIDT meaning is more operational because the concept is tied to run-level evidence capture, evidence packs, score interpretation and concrete governance readiness rather than broad procedural aspiration.

Common misunderstanding

Misunderstanding

Incident response only matters after serious harm has already occurred.

Correction

In RAIDT, incident response also covers near misses, contested outputs and repeated anomalies that reveal weak controls before major harm materialises. For example, if a policy-drafting assistant repeatedly invents sources that are caught during review, each event may be low impact in isolation, but together they justify structured incident handling because they expose a dependable governance weakness.

Boundary and limitation

Incident response does not guarantee that every harmful or problematic run will be detected, and it does not replace domain judgement, legal review or specialist investigation in high-stakes contexts. It also does not prove causality on its own; evidence may still be incomplete, ambiguous or contested. Its effectiveness depends on whether relevant run-level data, review decisions and workflow context were captured in the first place. RAIDT handles this limitation by making evidence capture and structured scoring part of ordinary governance practice, so incident response does not begin from a blank page when something goes wrong.

Implementation levels

Manual implementation

A researcher or small team can apply incident response manually by recording the run, preserving the disputed output, noting the prompt and context, documenting who reviewed the event and writing a short corrective-action summary linked to the evidence pack.

Semi-automated implementation

Metadata templates, structured review forms and standard incident categories can support faster and more consistent handling. At this level, RAIDT fields can prompt reviewers to capture the same core evidence each time and to map findings against the five pillars.

Fully automated implementation

At scale, a wrapper, orchestration layer or governance dashboard can automatically log run metadata, preserve relevant artefacts, trigger escalation workflows, generate incident case files and update score-profile indicators when thresholds or patterns suggest control failure. This does not automate judgement, but it automates evidence collection, routing and governance visibility.

Practical use in the RAIDT project

This item is useful across the RAIDT project because it shows how the framework handles adverse or contested real-world events rather than idealised use. In Paper 08 Foundations, it helps justify why the run is the correct unit for governance action. In Paper 09 Empirical Validation, it offers a concrete test of whether evidence packs and score profiles genuinely support reconstruction and review. In Paper 10 Policy Pathways, it helps explain how RAIDT can satisfy organisational expectations around assurance, accountability and continuous improvement. It is also valuable in sector playbooks, scoring-rubric development, governance interventions and viva defence because incident response is a practical question that supervisors, reviewers and practitioners immediately recognise.

Key audience questions to prepare for

Q1. What counts as an incident in RAIDT?

An incident is a run that triggers justified concern because the output, process or review pathway suggests harm, breach, failure or material contestability. RAIDT also recognises near misses and repeated anomalies when they reveal a governance weakness worth formal investigation.

Q2. How is incident response different from post-market monitoring?

Incident response is event-specific and often immediate; post-market monitoring is broader, more longitudinal and pattern-oriented. RAIDT supports both, but incident response focuses on reconstructing and acting on a particular run.

Q3. Why use the run rather than the whole system as the unit of response?

Because many governance failures in GenAI arise from context-specific use rather than from system design alone. The run captures the exact configuration, task and context that made the event meaningful.

Q4. Does incident response require storing everything a user does?

No. It requires proportionate evidence sufficient for reconstruction, review and accountability. Good RAIDT design balances traceability with privacy, confidentiality and data-minimisation obligations.

Q5. How does this support continuous improvement rather than one-off remediation?

Each incident can update controls, templates, reviewer guidance, escalation rules and pillar scoring. In that way, incident response becomes a learning mechanism that strengthens future governance performance.

Suggested citation concepts to support this item
Short explanation for presentation

Incident response in RAIDT is the mechanism that allows an organisation to investigate a problematic GenAI run as a governance event rather than as an anecdote. If a harmful, misleading or policy-breaching output appears, RAIDT makes it possible to reconstruct the prompt, context, model pathway, review steps and resulting action for that specific run. That matters because responsible AI is not demonstrated by principles alone; it is demonstrated by whether the organisation can explain what happened, identify weak controls and implement corrective action. In this sense, incident response links run-level evidence to evidence packs, score profiles, audit readiness and continuous improvement. It is one of the clearest examples of how RAIDT moves AI governance from assertion toward reviewable operational evidence.

One-line takeaway

Incident response is the structured investigation and remediation of a problematic GenAI run because RAIDT ties that process to run-level evidence, evidence packs and governance readiness.

Related items in policy, standards and assurance
Mentioned in reference-paper summaries (3)

Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.

Anchored questions
Powered by Forestry.md