S9.05 - Interoperability
S9.05 ? Interoperability
flowchart LR
A[Fragmented governance demands
procurement, audit, assurance, policy] --> B[RAIDT
run-level evidence framework]
H[Practical fields
healthcare, finance, public services, education] --> C[[Interoperability
one run evidence base reused across contexts]]
B --> C
C --> D[Evidence pack]
C --> E[RAIDT score profile]
C --> F[Reviewer reconstruction]
C --> G[Governance readiness
reviewability, contestability, audit readiness]
D --> G
E --> G
F --> G? Star S9 - Policy, Standards and Assurance
Star context: Positions RAIDT within policy instruments, standards, assurance, procurement, audit, and organisational accountability by showing how one run-level evidence base can be interpreted across multiple governance settings without forcing those settings to become identical.
Academic picture
Definition / background
Interoperability, in the RAIDT context, means that the same run evidence can support multiple governance frameworks, review processes, and assurance audiences without requiring those frameworks to be collapsed into a single standard. It is therefore not merely technical compatibility between systems, nor a loose claim that different policies are "similar enough". It is an evidential and governance property: run-level records are structured in a way that lets different actors interpret the same underlying evidence for their own legitimate purposes.
Conceptually, interoperability matters because organisations rarely face only one governance regime. A generative AI deployment may simultaneously be subject to internal policy, procurement controls, audit requirements, sector guidance, management-system standards, and regulatory expectations. Without interoperability, each audience asks for separate documentation, often restating the same facts in different formats. This creates duplication, inconsistency, and a drift back towards assertion-heavy governance, where teams write documents for audiences rather than maintain evidence about actual runs.
Within RAIDT, interoperability belongs naturally because RAIDT treats the run as the unit of governance. A run records a specific use of a configured GenAI system for a specific task, time, and context. If that run is documented properly, the resulting evidence pack can be mapped outward to different frameworks, while the five-pillar score profile provides a concise view of governance performance. Interoperability therefore connects run-level evidence to broader policy and assurance ecosystems.
This also distinguishes interoperability from harmonisation. Harmonisation seeks alignment between standards or legal regimes themselves. Interoperability is less ambitious but more practical: it allows one evidence base to be used across distinct governance settings even when those settings retain different vocabularies, thresholds, and institutional logics. RAIDT benefits from this distinction because it focuses on operational evidence rather than pretending that all frameworks ask identical questions.
Why this concept matters
Interoperability solves a practical governance problem: organisations using GenAI are often asked to answer similar questions repeatedly for different audiences. When each audience demands a bespoke narrative, governance becomes slower, more expensive, and less reliable. Teams spend time rewriting compliance stories instead of improving the quality of evidence captured at the point of use.
It also avoids a common confusion in AI governance. High-level principles often look transferable, but review processes are not automatically interoperable. A policy statement that says a system is "safe" or "fair" does not help much if audit, procurement, assurance, and regulatory teams each need different forms of evidence. RAIDT addresses this by making the run record sufficiently structured that different reviewers can examine the same event from different angles.
If interoperability is missing, several risks appear. Evidence becomes duplicated across teams, contradictory versions of the same run can circulate, reviewers cannot easily reconstruct what happened, and assurance becomes dependent on presentation quality rather than evidential quality. In that setting, organisations may appear compliant while still being poorly prepared for challenge, audit, or incident review.
For organisations using generative AI, interoperability matters because governance has to operate across operational, managerial, and external settings. RAIDT uses run-level evidence to move from principle statements to operational governance. Interoperability is the mechanism that lets one well-structured evidence base support that move across multiple institutional contexts.
Key idea: Interoperability matters because RAIDT makes one run-level evidence base reusable across different governance audiences, reducing duplication while increasing consistency, reviewability, and audit readiness.
What this item enables
- Reuse of the same run-level evidence across audit, procurement, standards review, and policy scrutiny.
- Consistent interpretation of a GenAI run without rewriting its history for each governance audience.
- Mapping between RAIDT evidence packs and external frameworks such as internal controls, assurance reviews, and standards-based assessments.
- More defensible governance conversations because claims can be traced back to specific runs, settings, outputs, and human decisions.
- Lower documentation burden for organisations that would otherwise maintain multiple overlapping evidence artefacts.
- Better organisational learning because evidence gathered for one review can inform later reviews, incidents, procurement cycles, or policy updates.
Practical example / likely audience question
Audience question
If procurement, internal audit, and a standards review all ask different questions, why not just prepare separate documents for each instead of designing for interoperability?
Answer
The concern behind the question is understandable: separate documents seem easier because each audience uses its own vocabulary. However, that approach treats governance as a writing exercise rather than an evidence exercise. Separate documents usually restate the same underlying facts about the model, prompts, safeguards, outputs, human oversight, and exceptions, but they do so in slightly different ways. Over time, those versions drift apart.
The direct answer is that interoperability improves reliability as much as efficiency. If one run-level evidence pack is the shared source, then procurement can examine supplier and control implications, audit can test reconstructability and exception handling, and a standards review can assess whether the organisation followed a defined management process. The audiences differ, but the factual substrate remains the same.
A practical example is a GenAI assistant used to draft board briefings. Procurement may want to know which model is being relied upon and under what contractual controls. Internal audit may want to know whether prompts, outputs, approvals, and overrides were logged. An ISO-style review may want to know whether the organisation followed documented procedures for risk treatment and oversight. RAIDT handles this better than a generic AI governance approach because it starts from a run-level record rather than from generic policy claims. The evidence is captured once, then interpreted many times.
Practical example in RAIDT terms
Consider a public-service organisation using a generative AI assistant to draft responses to citizen enquiries. A single run involves a specific staff member, a configured model, a prompt template, an operational context, and an output that is reviewed before sending. The run-level governance issue is not just whether the tool exists, but whether this particular use can be reconstructed and justified later.
The evidence needed would include the task context, prompt or prompt template, model and version, data-sensitivity classification, applicable policy constraints, human reviewer identity, edits made before release, and any flags raised during quality or risk review. RAIDT would also record pillar-relevant signals such as whether the process was documented, whether the output could be explained to a reviewer, whether the run behaved consistently, and whether the trace from input to decision remained intact.
The affected RAIDT pillars are all five, but Auditability and Traceability are especially prominent because different oversight audiences must be able to inspect the same run. Responsibility matters because staff roles and approvals must be clear; Interpretability matters because reviewers need to understand why the output was accepted; Dependability matters because repeated runs should behave within known operational tolerances.
Interoperability improves governance readiness here because the same evidence pack can support a service manager, an internal auditor, a procurement renewal discussion, or an external policy review. Instead of recreating the story of the run each time, the organisation presents one structured evidential account that different audiences can read for their own purposes.
Detailed link to RAIDT
Interoperability links to RAIDT in four ways.
First, it supports RAIDT's core idea that governance should be grounded in evidence about actual uses of generative AI, not only in high-level principles or supplier claims.
Second, it depends on the run as the unit of governance, because a run is the smallest practical point at which context, configuration, action, review, and outcome can be captured together.
Third, it makes the evidence pack and score profile useful beyond RAIDT itself, since both can inform audit, procurement, assurance, and policy-facing review.
Fourth, it strengthens reviewability, contestability, audit readiness, and organisational learning by ensuring that multiple stakeholders can interrogate the same evidential base rather than parallel narratives.
Interoperability ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
In that chain, interoperability is the outward-facing property that allows RAIDT evidence to travel across institutional settings while still remaining anchored in the specific run from which the evidence originated.
Link to the five RAIDT pillars
Responsibility
Interoperability supports Responsibility because shared evidence clarifies who configured, approved, reviewed, and relied upon a given run. It reduces the temptation to blur accountability when documents are rewritten for different audiences.
Example evidence / implication:
- Role assignments for operator, reviewer, approver, and accountable owner can be reused across procurement, audit, and assurance review.
- Evidence of escalation, override, or sign-off helps show that responsibility was exercised rather than assumed.
Auditability
Interoperability strongly affects Auditability because the same run record must be inspectable by different reviewers without losing meaning. Auditability is therefore one of the pillars most directly strengthened by this concept.
Example evidence / implication:
- A reviewer can reconstruct what happened in the run from timestamps, prompts, outputs, controls, and human interventions.
- Different assurance audiences can test the same evidence rather than asking for separate bespoke artefacts.
Interpretability
Interoperability contributes to Interpretability by making evidence understandable across governance audiences with different backgrounds. A run record has to be intelligible enough for technical, managerial, and assurance readers to draw justified conclusions.
Example evidence / implication:
- Plain-language explanations of task purpose, model use, and decision points support cross-functional review.
- Score rationales help reviewers understand why a run was assessed in a particular way.
Dependability
Interoperability supports Dependability indirectly. If evidence is structured consistently across runs, reviewers can compare performance, identify recurring weaknesses, and determine whether operational safeguards are working reliably over time.
Example evidence / implication:
- Repeated evidence patterns can reveal whether the same safeguard is consistently applied across similar runs.
- Cross-framework review can expose gaps between claimed reliability and observed operational behaviour.
Traceability
Interoperability strongly affects Traceability because evidence must remain linked to the original run even when it is reused in different governance contexts. Without traceability, interoperability degenerates into detached summaries.
Example evidence / implication:
- Run identifiers, timestamps, version details, and document links allow external review demands to be traced back to the original event.
- Evidence mappings can point from policy or standards questions back to concrete run artefacts rather than abstract statements.
Interoperability is therefore especially strong for Auditability and Traceability, while still providing meaningful support to Responsibility, Interpretability, and Dependability.
Why this item is more than a generic concept
In general AI governance, interoperability may refer to compatibility between tools, alignment between standards, or the ability to compare policy frameworks at a high level. Those meanings are useful, but they often remain abstract and do not specify what evidence actually moves between contexts.
In RAIDT, interoperability has a more operational meaning. It is the capacity for one run-level evidence base to support multiple governance audiences because the evidence pack is structured, contextualised, and reviewable. The concept is therefore not only about framework comparison; it is about evidential portability anchored in specific runs.
That RAIDT meaning is more operational because it ties interoperability to concrete artefacts: run records, evidence packs, score profiles, review logs, and mappings to governance questions. Instead of saying that frameworks overlap in principle, RAIDT asks whether the same documented run can satisfy different legitimate review needs without losing fidelity.
Common misunderstanding
Misunderstanding
Interoperability means that one checklist or one document can automatically satisfy every policy, standard, audit, or legal requirement.
Correction
Interoperability does not mean that all governance regimes are equivalent, nor that one evidence pack removes the need for interpretation. Different audiences still apply different thresholds, obligations, and decision criteria. What interoperability means in RAIDT is that the same underlying run evidence can be reused as a common factual base.
For example, a run record for a GenAI-supported hiring workflow may help both internal audit and a policy review, but the audit team may focus on control effectiveness while the policy team focuses on procedural legitimacy and fairness safeguards. The evidence base can be shared; the judgement applied to it may still differ.
Boundary and limitation
Interoperability does not prove that a system is compliant, safe, fair, or lawful. It only improves the ability to reuse and interpret relevant evidence across governance settings. A weak evidence pack remains weak even if it is portable.
It also does not replace sector-specific requirements, legal analysis, or substantive governance judgement. Some frameworks require unique documentation, thresholds, or reporting formats that cannot be inferred automatically from a generic run record. Organisations still need human interpretation and, in some settings, specialist legal or domain review.
The concept may fail if evidence is incomplete, poorly structured, or detached from the original run. It may also fail where governance frameworks ask fundamentally different questions that require additional contextualisation. RAIDT handles this limitation by treating interoperability as a design objective supported by evidence grammar, mappings, and review processes rather than as a guaranteed property of any documentation set.
Implementation levels
Manual implementation
A researcher or small team can implement interoperability manually by using a standard RAIDT note template for each run, keeping consistent field names, recording the same categories of evidence, and maintaining a simple crosswalk showing how each evidence element answers different governance questions.
Semi-automated implementation
Semi-automated implementation adds structured metadata, template-driven evidence capture, and review checklists that map run fields to procurement, audit, standards, or policy questions. This reduces duplication while preserving human judgement in interpretation and scoring.
Fully automated implementation
At scale, a governance platform, orchestration layer, wrapper, or logging pipeline can capture run metadata automatically, link it to control libraries and policy mappings, populate evidence packs, and generate audience-specific views from the same underlying record. In that model, interoperability is implemented through structured schemas, traceable identifiers, dashboards, and workflow rules rather than through manual document rewriting.
Practical use in the RAIDT project
In Paper 08 Foundations, interoperability helps justify why the run is the correct unit for governance analysis: it is small enough to capture operational reality and rich enough to be mapped outward to multiple frameworks. In Paper 09 Empirical Validation, it provides a testable claim about whether reviewers from different backgrounds can use the same evidence pack consistently. In Paper 10 Policy Pathways, it becomes central to the argument that RAIDT can travel between organisational governance and external policy instruments without collapsing them into one regime.
The concept is also useful in sector playbooks because each sector faces multiple oversight audiences. It supports the design of the evidence pack, the scoring rubric, and any governance intervention that depends on evidence travelling across organisational boundaries. For supervision meetings, viva defence, and journal positioning, interoperability helps explain why RAIDT is not just another ethics framework: it offers an operational bridge between evidence, assurance, and policy-facing review.
Key audience questions to prepare for
Q1. Is interoperability mainly about standards mapping or about operational evidence?
In RAIDT it is mainly about operational evidence. Standards mapping is useful, but it only becomes meaningful when the mapping is anchored in actual run-level records rather than general policy statements.
Q2. Why is interoperability especially important for generative AI governance?
Generative AI is often deployed across fast-moving organisational contexts with multiple reviewers, suppliers, and oversight demands. Interoperability prevents governance from fragmenting into separate, inconsistent narratives for each audience.
Q3. Does interoperability reduce the need for legal or policy expertise?
No. It reduces duplication in evidence preparation, but expert interpretation is still needed because different frameworks apply different criteria to the same facts.
Q4. How would you know whether RAIDT has achieved interoperability in practice?
You would test whether different governance audiences can use the same evidence pack to answer their own questions with minimal re-documentation, while still tracing claims back to the original run and reaching defensible conclusions.
Q5. What is the practical gain for an organisation?
The practical gain is a better balance of rigour and efficiency: fewer duplicated artefacts, more consistent accounts of GenAI use, stronger audit readiness, and better organisational learning from the same body of evidence.
Suggested citation concepts to support this item
- AI governance interoperability evidence frameworks
- crosswalks between AI standards and regulatory requirements
- run-level documentation for AI assurance
- audit-ready evidence for generative AI deployments
- interoperable assurance artefacts in AI governance
- policy-to-practice translation in AI governance
- management system standards for AI and evidence reuse
- traceability and reviewability in responsible AI
- procurement, audit, and regulatory alignment for AI systems
- evidence portability across governance frameworks
Short explanation for presentation
Interoperability in RAIDT means that one well-structured run record can support several governance audiences without rewriting the facts each time. That matters because organisations using generative AI rarely answer to only one framework: they may face procurement review, internal audit, standards-based assurance, and external policy scrutiny at the same time. RAIDT addresses this by treating the run as the unit of governance and producing an evidence pack plus a five-pillar score profile. If those artefacts are structured properly, the same underlying evidence can travel across different review contexts while preserving traceability and accountability. So interoperability is not just abstract alignment between frameworks; it is the practical ability to reuse run-level evidence for reviewability, contestability, and audit readiness.
One-line takeaway
Interoperability is the ability for one RAIDT run evidence base to serve multiple governance audiences because RAIDT ties governance to structured, reviewable run-level evidence.
Related items in policy, standards and assurance
Mentioned in reference-paper summaries (4)
Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.
REF-027__Currie-2025.mdREF-110__United-2024.mdREF-118__World-2024.mdREF-120__Zaharia-2018.md
Anchored questions
- Q087: Why does RAIDT need standards and policy interoperability?
- Q092: How can one evidence pack work across the EU AI Act, ISO/IEC 42001, and NIST AI RMF?
- Q170: What does interoperability mean in RAIDT, and why does it matter for policy, audit, and procurement?
- Q260: H. Policy, Empirical & Adoption ? branch overview
- Q264: Interoperability ? definition, example, and why it matters in RAIDT
- Q284: How can it be implemented and mapped to audit, procurement and policy?