S8.02 - Semi-automated_implementation

S8.02 ? Semi-automated implementation

flowchart LR
    A[Manual burden and inconsistent documentation] --> B[RAIDT - run-level evidence framework]
    A2[Risk of unsupported governance summaries] --> B
    B --> C[[Semi-automated implementation]]
    C --> D[Structured metadata capture]
    C --> E[Human-verified evidence summaries]
    C --> F[Evidence pack completeness]
    C --> G[Score profile consistency]
    F --> H[Reviewer reconstruction]
    G --> I[Governance readiness]
    J[Healthcare, finance, education, public services] --> C
    K[Templates, wrappers, forms, dashboards] --> C

? Star S8 - Implementation and Operations

Star context: Shows how RAIDT can be adopted manually, semi-automatically or through orchestration, and how it becomes part of real governance routines. This item explains the middle ground in which structured tooling supports evidence production, but accountable human review still determines whether a run is governance-ready.


Academic picture
Definition / background

Semi-automated implementation is an operational mode in which parts of RAIDT evidence production are assisted by software, templates, wrappers, or model-supported summarisation, while final evidential judgement remains with a human reviewer or responsible team member. It sits between fully manual implementation, where people record and assemble all evidence by hand, and fully automated orchestration, where governance controls and evidence flows are embedded more extensively into platforms, pipelines, or workflow systems.

Conceptually, this matters because governance systems often fail at the point of routine execution. Organisations may have policies, principles, and review expectations, yet the practical burden of collecting run-level evidence can still be too high if every record, summary, and field must be produced manually. Semi-automated implementation reduces that burden by allowing structured systems to capture timestamps, prompts, model identifiers, task metadata, user roles, document references, review checkpoints, and draft summaries in a consistent form.

Within RAIDT, the concept belongs directly to implementation and operations because RAIDT is designed to convert governance from assertion into inspectable evidence at the level of the run. Semi-automated implementation helps make that feasible. It supports the creation of a run-level evidence pack and improves the consistency of inputs used to judge Responsibility, Auditability, Interpretability, Dependability, and Traceability. Its value is therefore not merely procedural efficiency; it is the operational strengthening of evidence quality.

Semi-automated implementation is not the same as automated governance. A model may assist with formatting or first-pass summarisation, but it must not invent evidence, suppress uncertainty, or replace accountable review. In RAIDT terms, assistance is acceptable only when it remains anchored to real run records, source identifiers, and reconstructable artefacts. The concept therefore combines efficiency with explicit evidential discipline.

Why this concept matters

Semi-automated implementation solves a basic but consequential governance problem: if evidence capture is too manual, it is often skipped, delayed, or completed inconsistently; if it is too automated too early, organisations may create a false impression of control without adequate scrutiny. This middle mode provides a workable path for teams that need stronger governance routines before they are ready for full orchestration.

It also avoids a common confusion in AI governance, namely the belief that better policy language automatically leads to better operational accountability. RAIDT assumes the opposite: governance quality depends on whether a specific run can be reconstructed, reviewed, challenged, and learned from. Semi-automated implementation matters because it makes those activities sustainable across repeated uses of generative AI in real work settings.

If this concept is missing, organisations risk fragmented logs, weak reviewer confidence, inconsistent scoring, and an inability to defend why one run was acceptable while another was not. In practice, that means lower audit readiness and poorer organisational learning. Semi-automated implementation helps move governance from aspiration to repeatable operational practice.

Key idea: Semi-automated implementation matters because it makes RAIDT evidence collection scalable without surrendering human accountability for what the evidence actually shows.

What this item enables
Practical example / likely audience question

Audience question

If semi-automated implementation uses a model to help produce governance documentation, how do you stop the governance record itself from becoming hallucinated or unreliable?

Answer

The concern behind the question is that a model-assisted governance process might appear efficient while quietly weakening evidential integrity. That concern is valid, because a system that invents evidence summaries, omits uncertainty, or rewrites what happened in a run would undermine the very purpose of RAIDT.

The direct answer is that semi-automated implementation in RAIDT is acceptable only when the automation layer is evidentially subordinate to the underlying run records. In other words, logs, identifiers, timestamps, prompt references, system settings, reviewer notes, and linked artefacts must remain the authoritative basis of the evidence pack. A model may help organise or summarise those materials, but it must not become the source of truth.

A practical example would be a wrapper around an internal GenAI drafting tool that automatically records model version, prompt template ID, user role, data source class, time of execution, and output location. After the run, a model-assisted template generates a draft summary of the run for a reviewer. The reviewer then checks that every claim in the summary is supported by actual records before approving inclusion in the evidence pack.

RAIDT handles this better than generic AI governance because RAIDT is not satisfied with a statement that a review occurred. It asks whether a specific run can be reconstructed and assessed using evidence. That run-level discipline creates a stronger safeguard against governance-by-summary alone.

Practical example in RAIDT terms

Consider an NHS-adjacent administrative workflow in which a generative AI tool drafts patient appointment letters from structured scheduling data and clinician instructions. The run-level issue is not simply whether the output text is fluent; it is whether the specific run can be reviewed for appropriateness, data handling, model configuration, and human oversight.

In a semi-automated RAIDT implementation, the system automatically records the run timestamp, user identity or role, model version, prompt template reference, source document identifiers, approval checkpoint, and final output destination. A structured template then prepares a draft evidence summary highlighting the task purpose, the safeguards applied, and any exceptions. A human reviewer confirms that the summary matches the underlying records and that no unsupported statement has been inserted.

The evidence needed would include prompt and template identifiers, model and version details, access and role information, source-document references, reviewer confirmation, exception flags, and any correction or escalation note. The most affected RAIDT pillars are Auditability, Traceability, and Responsibility, though Dependability also matters where repeated use of the workflow must remain stable over time.

This improves governance readiness because the organisation is no longer relying on a vague claim that the workflow is "supervised". Instead, it can show how each run generated structured evidence, how that evidence was checked, and how reviewers could contest or approve the resulting record.

Detailed link to RAIDT

Semi-automated implementation links to RAIDT in four ways.

First, it supports RAIDT's core idea that governance should attach to the individual run rather than to abstract system claims.
Second, it strengthens the run by ensuring that key metadata and review signals are captured consistently at the point of use.
Third, it improves the quality and completeness of the evidence pack and makes scoring across the five-pillar profile more defensible.
Fourth, it enhances reviewability, contestability, audit readiness, and organisational learning by making evidence production repeatable without removing accountable human judgement.

Semi-automated implementation ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

In this chain, semi-automated implementation is the operational bridge that turns a run from a transient event into a reviewable governance object.

Link to the five RAIDT pillars

Responsibility

Semi-automated implementation strengthens Responsibility when it makes human roles, review checkpoints, and approval duties more explicit rather than less visible. It should clarify who checked the run and who accepted the governance record.

Example evidence / implication:

Auditability

This item has a particularly strong effect on Auditability because semi-automation can make the evidence trail more consistent, retrievable, and comparable across runs. It reduces the chance that key governance details exist only in informal notes or memory.

Example evidence / implication:

Interpretability

Semi-automated implementation can improve Interpretability when it helps explain what was done in a run, what settings were used, and how reviewers understood the output. Its effect is indirect but valuable.

Example evidence / implication:

Dependability

Semi-automation supports Dependability when it makes governance routines more repeatable and less sensitive to individual variation in documentation quality. However, poor automation can also reproduce errors at scale if not checked.

Example evidence / implication:

Traceability

This item strongly affects Traceability because it helps preserve links between the run, its inputs, its outputs, and the governance record produced around it. Traceability is weakened if summaries lose their connection to source identifiers.

Example evidence / implication:

Semi-automated implementation most strongly affects Auditability and Traceability, with Responsibility and Dependability also significantly influenced. Its effect on Interpretability is meaningful when summaries are explanatory but still evidence-grounded.

Why this item is more than a generic concept

In general AI governance, semi-automation may simply mean using software to make compliance or documentation work easier. In RAIDT, it has a narrower and more operational meaning: semi-automated implementation is valuable only insofar as it improves the production, checking, and reuse of run-level evidence.

That RAIDT meaning is more operational because it is tied to reconstructable runs, evidence packs, scoring inputs, and review decisions. The concept is therefore not just about efficiency tooling. It is about building a governance pathway in which partial automation supports evidential discipline rather than replacing it.

Common misunderstanding

Misunderstanding

If evidence capture is partly automated, the governance process is effectively automated too, so human review becomes a formality.

Correction

Semi-automated implementation does not make human review optional. It changes the division of labour. Systems can gather fields, prepare templates, and draft summaries faster than people can, but they cannot be assumed to determine whether the resulting governance record is accurate, sufficient, or contestable.

For example, a platform may auto-populate a run summary with timestamps and identifiers. That is useful. But if the summary also states that the run used only approved data sources, that claim still needs validation against the underlying records. In RAIDT, the purpose of semi-automation is to support accountable review, not to bypass it.

Boundary and limitation

Semi-automated implementation does not prove that a run was responsible, safe, or compliant merely because a structured record exists. A well-formatted evidence pack can still reflect weak controls, incomplete data, or mistaken reviewer assumptions. The concept therefore improves the conditions for governance, but it does not replace governance judgement.

It may fail where source logs are poor, identifiers are inconsistent, automation templates are badly designed, or reviewers rely uncritically on machine-generated summaries. It is also less effective in environments where evidence cannot be linked back to stable artefacts or where operational ownership is unclear.

RAIDT handles this limitation by keeping the run as the unit of governance and by requiring that evidence remain contestable. Semi-automation is useful only when humans can still inspect, challenge, and, if necessary, correct the record.

Implementation levels

Manual implementation

A researcher or small team can apply this concept manually by recording each run in a structured form, attaching logs and outputs by hand, and writing a short reviewer-checked summary. This is viable for low-volume or exploratory use, but it is labour-intensive and prone to inconsistency.

Semi-automated implementation

At this level, metadata capture, evidence templates, wrapper tools, and model-assisted summary drafting reduce the burden of documentation while preserving human verification. The organisation gains consistency and speed without assuming that governance judgement can be fully delegated.

Fully automated implementation

At scale, a platform, orchestration layer, dashboard, or governance pipeline may automatically generate run records, validate required fields, trigger approvals, assemble evidence artefacts, and feed scoring workflows. Even then, the strongest implementations retain oversight checkpoints for exceptions, disputes, or high-risk runs.

Practical use in the RAIDT project

This item is useful across the RAIDT project because it explains how the framework can be adopted in realistic stages rather than only as a fully matured governance system. In Paper 08 Foundations, it helps position RAIDT as operationally feasible rather than conceptually idealised. In Paper 09 Empirical Validation, it supports the study of whether structured support improves completeness, consistency, and reviewer confidence in evidence collection.

In Paper 10 Policy Pathways, semi-automated implementation helps explain how organisations can move from governance principles to workable internal routines without waiting for enterprise-scale orchestration. It is also relevant to sector playbooks, because many real deployments will begin with templates, wrappers, forms, and partial logging rather than with a fully integrated governance platform.

For supervision, viva defence, and journal positioning, this item is especially useful because it shows that RAIDT is sensitive to implementation realism. It demonstrates that the framework can accommodate resource-constrained environments while still insisting on run-level evidence, evidence-pack discipline, and structured review.

Key audience questions to prepare for

Q1. Why not keep RAIDT entirely manual if human review is still required?

Because the practical burden of manual evidence capture often leads to inconsistency, omission, and weak comparability across runs. Semi-automated implementation preserves human accountability while making governance routines sustainable.

Q2. How is semi-automated implementation different from ordinary workflow tooling?

Ordinary workflow tooling may improve efficiency without improving governance quality. In RAIDT, semi-automated implementation is specifically tied to run reconstruction, evidence-pack assembly, and defensible scoring inputs.

Q3. What is the main risk of getting semi-automation wrong?

The main risk is creating persuasive but unreliable governance summaries that are no longer anchored to logs, identifiers, and source artefacts. That would weaken auditability while giving the appearance of maturity.

Q4. When is semi-automated implementation the right maturity level?

It is appropriate when manual governance is becoming too burdensome, but the organisation does not yet have the infrastructure, confidence, or control environment needed for full orchestration. It is a transitional but often durable operating mode.

Q5. Does semi-automated implementation improve every RAIDT pillar equally?

No. Its strongest effects are usually on Auditability and Traceability, with significant implications for Responsibility and Dependability. Interpretability benefits when explanatory summaries remain grounded in verified evidence.

Suggested citation concepts to support this item
Short explanation for presentation

Semi-automated implementation is the middle operating mode in RAIDT between fully manual governance and fully orchestrated governance. It allows systems to capture key run metadata, populate structured evidence templates, and even prepare draft summaries, but it keeps human reviewers responsible for checking that every governance claim is supported by real records. This matters because most organisations need a practical way to make run-level evidence collection repeatable before they can build end-to-end automation. In RAIDT, semi-automation is not just an efficiency feature. It is a governance design choice that helps produce stronger evidence packs, more consistent five-pillar scoring inputs, and better reviewability, contestability, and audit readiness across repeated uses of generative AI in organisational work.

One-line takeaway

Semi-automated implementation is the structured use of partial automation to produce and organise run-level governance evidence because RAIDT depends on scalable evidence collection without abandoning human accountability.

Related items in star s8 (11)
Anchored questions (4)
Powered by Forestry.md