S10.13 - Crisis and emergency response

flowchart LR
    A[Fast-moving events<br/>Incomplete information<br/>Public impact] --> B[RAIDT<br/>Run-level evidence framework]
    H[Generic AI governance is too abstract for urgent use] --> B
    B --> C[[Crisis and emergency response<br/>High-stakes RAIDT domain playbook]]
    I[Emergency messaging<br/>Incident summaries<br/>Health and cyber alerts<br/>Local authority briefings] --> C
    C --> D[Run-level evidence pack]
    C --> E[Five-pillar score profile]
    C --> F[Reviewer reconstruction<br/>Contestability<br/>Audit readiness]
    D --> G[Governance readiness]
    E --> G
    F --> G

Star S10 - Empirical Programme, Domains and Sector Playbooks

Star context: Positions crisis and emergency response as a high-stakes domain playbook within RAIDT, showing how run-level evidence becomes especially important when outputs shape urgent decisions, public messaging, and time-critical organisational action.

Academic picture

Definition / background

Typical crisis use cases include drafting public communications and incident summaries. Responsibility, clarity, and provenance are critical during fast-moving events.

Within RAIDT, crisis and emergency response refers to the use of generative AI in situations where outputs may influence urgent organisational decisions, public understanding, escalation pathways, or immediate protective action. The concept therefore covers more than disaster response in a narrow sense. It includes any high-pressure context in which time is compressed, consequences are material, and the tolerance for ambiguity, fabrication, or undocumented intervention is low.

This matters conceptually because crisis settings expose the limits of generic AI governance language. A broad statement such as "human oversight should be maintained" is too vague when a system is drafting an evacuation update, summarising a live cyber incident, or producing a briefing for emergency coordinators. RAIDT places the emphasis on the run as the unit of governance, so that each crisis-related use can be evidenced, reviewed, and scored in relation to the exact task, timing, prompts, source inputs, and checking process.

The item belongs inside RAIDT because crisis response magnifies all five pillars at once. Responsibility is tested because accountability must remain clear under pressure. Auditability and Traceability are tested because later review may need to reconstruct why a message or recommendation was produced. Interpretability matters because users must understand what the model has done and where uncertainty remains. Dependability matters because instability, omission, or hallucination can have immediate operational consequences. In this sense, crisis and emergency response is a demanding application domain through which RAIDT demonstrates why run-level evidence is necessary for governance readiness.

Why this concept matters

Treating crisis and emergency response as a distinct domain addresses a practical governance problem: organisations increasingly want AI assistance in urgent workflows, but the very features that make generative AI attractive, such as speed and fluency, can become liabilities when people act on outputs before they are properly checked. This concept helps distinguish low-stakes automation from high-stakes support where provenance, reviewability, and role clarity are non-negotiable.

Without this concept, organisations can easily confuse fast output generation with effective emergency support. They may assume that a plausible draft is operationally safe, or that a human in the loop automatically guarantees responsible use. RAIDT avoids that confusion by asking what evidence exists for the specific run, what controls operated at the point of use, and how the run can be reconstructed if challenged later.

For organisations using GenAI, the concept matters because emergency environments compress decision time but increase the need for scrutiny. A framework that remains only principle-based is weakest precisely where it is most needed. RAIDT makes the concept operational by tying crisis uses to evidence packs, pillar-based scoring, and governance interventions that can be calibrated, tested, and improved across scenarios.

Key idea: Crisis and emergency response matters in RAIDT because urgent AI-assisted outputs must be governable at the level of the individual run, not trusted on the basis of general policy claims.

What this item captures

High-stakes GenAI use contexts in which outputs can influence protective action, public behaviour, escalation decisions, or institutional credibility.
The governance demand for clear ownership, source-grounding, and review steps even when response time is limited.
The need to document run conditions, prompts, source inputs, reviewer actions, and release decisions for crisis-related outputs.
The difference between fluent emergency drafting and dependable emergency support.
How sector playbooks can adapt RAIDT to domains such as health incidents, public services, cybersecurity, and environmental emergencies.
Why governance readiness in crisis settings depends on evidence that survives retrospective scrutiny.

Practical example / likely audience question

Audience question

Why are crisis and emergency response use cases treated as especially high stakes in RAIDT rather than simply another application area for GenAI?

Answer

The concern behind the question is that many organisations already use drafting tools in routine communications, so crisis drafting can appear to be just a faster version of an existing task. RAIDT treats it differently because the combination of urgency, uncertainty, public impact, and compressed checking time changes the governance burden. In a crisis, a misleading summary, omitted qualification, or unsupported recommendation can shape behaviour before corrections are possible.

The direct answer is that crisis use is high stakes because output quality is not the only issue. What matters equally is whether the organisation can show who initiated the run, what evidence informed it, how uncertainty was handled, what review took place, and why the output was released or rejected. For example, if a local authority uses a model to draft a flood update, the draft may sound authoritative while still misrepresenting affected postcodes or exaggerating the certainty of timing. RAIDT requires that the run be evidenced so that the communication can be checked before release and reconstructed afterwards.

RAIDT handles this better than a generic AI governance approach because it does not stop at broad commitments such as safety, oversight, or transparency. It asks for run-level evidence: the prompt, the source material, the model configuration, the human review, the final decision, and the resulting score profile. That makes the governance claim inspectable rather than merely aspirational.

Practical example in RAIDT terms

A public services resilience team uses a generative AI system to draft an emergency public update after a chemical spill near a residential area. The use case is a time-critical communication that must summarise the incident, state protective advice, and remain aligned with verified operational information.

The run-level issue is that the model may produce a coherent message that blends confirmed facts with inferred details, for example implying that evacuation is mandatory when the current instruction is only to shelter indoors. The evidence needed includes the exact prompt, the verified incident log or source briefing provided to the model, the model version and settings, the time of generation, the human reviewer identity, the edits made before release, and the approval record showing whether the message was published.
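The evidence items listed above could be captured as one record per run. The following is a minimal sketch in Python and is not part of RAIDT itself: the class name `CrisisRunEvidence`, its field names, the example values, and the `missing_evidence` helper are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, fields
from datetime import datetime, timezone
from typing import Optional


@dataclass
class CrisisRunEvidence:
    """Hypothetical record of one crisis-related run (illustrative field names)."""
    prompt: Optional[str] = None
    source_briefing: Optional[str] = None       # verified incident log or briefing
    model_version: Optional[str] = None
    model_settings: Optional[dict] = None
    generated_at: Optional[datetime] = None
    reviewer_id: Optional[str] = None
    edits_before_release: Optional[str] = None  # "" can record "no edits made"
    approval_record: Optional[str] = None       # e.g. "published" or "rejected"


def missing_evidence(run: CrisisRunEvidence) -> list[str]:
    """Return the names of evidence items that have not been recorded."""
    return [f.name for f in fields(run) if getattr(run, f.name) is None]


# Example values are invented for illustration only.
run = CrisisRunEvidence(
    prompt="Draft a public update using only the attached incident log.",
    source_briefing="Incident log: shelter indoors; no evacuation order.",
    model_version="example-model-1",
    model_settings={"temperature": 0.2},
    generated_at=datetime(2024, 1, 1, 9, 30, tzinfo=timezone.utc),
)
print(missing_evidence(run))
# ['reviewer_id', 'edits_before_release', 'approval_record']
```

In this sketch the run has been generated but not yet reviewed, so the reviewer, edit, and approval items are reported as outstanding before any release decision.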

The most affected RAIDT pillars are Responsibility, Dependability, and Traceability, with Auditability also critical for post-incident review. This item improves governance readiness because it turns an urgent communication workflow into a process that can be evidenced: reviewers can test whether the system remained within source bounds, whether responsibility for release was clear, and whether the organisation can defend its use of AI if the communication is later challenged.

Detailed link to RAIDT

Crisis and emergency response links to RAIDT in four ways.

First, it expresses RAIDT's core idea that governance must be attached to specific uses of generative AI rather than to the system in the abstract. A model may appear acceptable in general, yet still be unsuitable or insufficiently controlled in a live emergency communication task.

Second, it makes the run central. In crisis settings, what matters is the exact run carried out at a specific time, by a specific actor, for a specific purpose, against specific source material and constraints. RAIDT makes that run visible and assessable.
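One way to keep "the exact run" identifiable for later review is to fingerprint its defining details. The sketch below is an assumption rather than a RAIDT requirement: the `run_fingerprint` function, the dictionary keys, and the example values are hypothetical, and SHA-256 over sorted-key JSON is simply one common way to detect later changes to a record.

```python
import hashlib
import json


def run_fingerprint(run: dict) -> str:
    """Return a SHA-256 hex digest of the run's identifying details."""
    # Sorted keys give a stable serialisation, so equal content yields an equal digest.
    canonical = json.dumps(run, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Example values are invented for illustration only.
run = {
    "time": "2024-01-01T09:30:00Z",
    "actor": "resilience-officer-01",
    "purpose": "public update on chemical spill",
    "source_material": "incident log v3",
    "constraints": "use confirmed facts only",
}
recorded = run_fingerprint(run)

# Any later change to the record produces a different digest.
altered = dict(run, source_material="incident log v4")
print(recorded == run_fingerprint(run))      # True
print(recorded == run_fingerprint(altered))  # False
```

Storing the digest alongside the run record would let a later reviewer confirm that the time, actor, purpose, source material, and constraints under review are the ones recorded at event time.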

Third, it connects directly to RAIDT's practical outputs. The evidence pack gathers the artefacts needed to understand and review the crisis run, while the five-pillar score profile helps translate those artefacts into a structured view of governance strengths and weaknesses.
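A five-pillar score profile can be pictured as a small mapping from pillar to score. The sketch below uses the five pillar names given in this section; the 0 to 4 scale, the threshold of 2, the example scores, and the `weak_pillars` helper are assumptions made for illustration, not RAIDT's published scoring method.

```python
PILLARS = ("Responsibility", "Auditability", "Interpretability",
           "Dependability", "Traceability")


def weak_pillars(profile: dict[str, int], threshold: int = 2) -> list[str]:
    """List pillars scoring below the threshold, in the fixed pillar order."""
    mismatched = set(profile) ^ set(PILLARS)
    if mismatched:
        raise ValueError(f"profile must score exactly the five pillars: {sorted(mismatched)}")
    return [p for p in PILLARS if profile[p] < threshold]


# Hypothetical scores for one crisis run on an assumed 0-4 scale.
profile = {
    "Responsibility": 3,
    "Auditability": 2,
    "Interpretability": 3,
    "Dependability": 1,
    "Traceability": 1,
}
print(weak_pillars(profile))
# ['Dependability', 'Traceability']
```

Reporting the weak pillars by name, rather than a single averaged figure, keeps the profile aligned with its purpose here: showing where governance strengths and weaknesses lie for the specific run.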

Fourth, it supports reviewability, contestability, audit readiness, and organisational learning. Crisis use is one of the clearest settings in which an organisation may later need to explain, defend, or improve an AI-assisted action. RAIDT provides a disciplined route from event-time usage to retrospective scrutiny and future refinement.

Crisis and emergency response -> Run-level evidence -> Evidence pack -> RAIDT score profile -> Governance readiness

Link to the five RAIDT pillars

Responsibility

Responsibility is central because crisis outputs can trigger action, reassure or alarm the public, and shape inter-agency coordination. RAIDT asks who owned the run, who checked it, and who authorised any downstream use.
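The three Responsibility questions can be checked mechanically before any downstream use. This is a minimal sketch under stated assumptions: the `responsibility_gaps` function, its parameter names, and the example identifiers are hypothetical, and a real workflow would draw these values from its own run records.

```python
from typing import Optional


def responsibility_gaps(owner: Optional[str],
                        checker: Optional[str],
                        authoriser: Optional[str]) -> list[str]:
    """Return the responsibility questions that the run record leaves unanswered."""
    questions = {
        "who owned the run": owner,
        "who checked it": checker,
        "who authorised downstream use": authoriser,
    }
    # A missing, blank, or whitespace-only name counts as unanswered.
    return [q for q, name in questions.items() if not (name and name.strip())]


# Example identifiers are invented for illustration only.
gaps = responsibility_gaps(owner="duty-officer-07",
                           checker="comms-lead-02",
                           authoriser=None)
print(gaps)
# ['who authorised downstream use']
```

An empty result means all three questions have a named answer; it says nothing about whether the review itself was adequate, which remains a matter for the evidence pack.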