S8.01 - Manual_implementation
S8.01 ? Manual implementation
flowchart LR
A[Fragmented prompts and informal AI use] --> B[RAIDT - run-level evidence framework]
A2[Policy aspirations without operational proof] --> B
B --> C[[Manual implementation]]
H[Controlled spreadsheets
Reviewer forms
Saved prompts and outputs
Scoring sheets
Timestamps] --> C
C --> D[Run-level evidence pack]
C --> E[RAIDT score profile]
C --> I[Reviewer reconstruction]
D --> F[Reviewability and contestability]
E --> G[Governance readiness]
I --> J[Organisational learning and policy alignment]? Star S8 - Implementation and Operations
Star context: Shows how RAIDT can be adopted manually, semi-automatically or through orchestration, and how it becomes part of real governance routines. Manual implementation is the entry point that proves RAIDT can operate as a disciplined governance method before dedicated tooling is introduced.
Academic picture
Definition / background
Manual implementation refers to the use of human-operated but structured processes to apply RAIDT without requiring a dedicated software platform. In practice, this means using controlled spreadsheets, saved prompt and output records, reviewer forms, scoring sheets, naming conventions, and simple evidence folders to document each run of a generative AI system. The concept matters because many organisations need a credible starting point for governance before they can justify more advanced infrastructure.
Conceptually, manual implementation sits at the operational end of RAIDT's governance model. RAIDT treats the run as the unit of governance, so the question is not whether a workflow is high-tech or low-tech, but whether each run is captured in a way that supports reviewability, reconstruction, and scoring. Manual implementation therefore differs from ad hoc manual working. A user casually saving a screenshot or copying a prompt into a notebook is not enough. Manual implementation in RAIDT requires structured evidence fields and repeatable handling rules.
This item belongs in RAIDT because it demonstrates that run-level evidence can be produced even when automation is absent. That matters for pilots, supervision workshops, proof-of-concept studies, small teams, and early-stage institutional adoption. It also clarifies the relationship between implementation mode and governance output: the evidence pack and the five-pillar score profile do not depend on sophisticated tooling in principle, but they do depend on disciplined capture of the right evidence at the right level.
Manual implementation is therefore best understood as a transitional but valid governance mode. It enables organisations to establish evidence habits, test reviewer workflows, refine rubric interpretation, and identify where later semi-automation or orchestration would add value. In RAIDT terms, it is not a weaker idea of governance; it is the minimum operational configuration that still takes evidence seriously.
Why this concept matters
Manual implementation solves a practical adoption problem. Organisations often agree that generative AI use should be governed, but they delay implementation because they assume governance only becomes real after dashboards, wrappers, logging pipelines, or enterprise platforms are available. RAIDT avoids that deadlock by showing that governance can begin with disciplined manual processes, provided that runs are documented systematically.
The concept also prevents a common confusion between principles and operations. A team may say that it values accountability, transparency, and oversight, yet still have no reproducible record of how a specific AI-assisted decision or output was produced. Manual implementation closes that gap by creating a tangible route from policy aspiration to evidence capture. It gives teams a manageable first step while preserving the core logic of RAIDT.
If this concept is missing, organisations risk treating governance as a future technical project rather than a present operational responsibility. That creates blind spots around contested outputs, reviewer disagreements, weak audit trails, and inconsistent scoring. Manual implementation matters because it makes governance start now, not after an ideal technical architecture appears.
Key idea: Manual implementation matters because RAIDT can begin as a disciplined evidence practice before it becomes a software-supported governance system.
What this item enables
- Adoption of RAIDT in pilots, workshops, early-stage studies, and low-resource organisational settings.
- Structured capture of prompts, outputs, reviewer judgements, timestamps, and contextual notes for each run.
- Construction of a run-level evidence pack without requiring a bespoke technical platform.
- Production of a five-pillar score profile from manually assembled but controlled evidence.
- Human review, contestation, and reconstruction of runs when questions arise later.
- Comparison of runs across teams or use cases using common templates and scoring sheets.
- Identification of where semi-automation or orchestration would reduce burden or improve consistency.
Practical example / likely audience question
Audience question
Can RAIDT start without software, or does manual implementation make the framework too weak for serious governance?
Answer
The concern behind this question is understandable: many people assume that if governance is manual, it is informal, fragile, or impossible to scale. The direct answer is that RAIDT can start manually without losing its core governance value, so long as the evidence fields are captured reliably and the review process is controlled. Manual implementation is not equivalent to improvised note-taking; it is a structured method for documenting runs before platform-level support exists.
A practical example would be a research team trialling a generative AI assistant for drafting internal policy summaries. The team may not yet have a wrapper application or integrated logging layer, but it can still require each run to be recorded in a spreadsheet, save the prompt and output in a versioned folder, use a reviewer form to assess quality and risk, and complete a RAIDT scoring sheet. That creates a usable evidence trail for later inspection.
RAIDT handles this better than a generic AI governance approach because it specifies what needs to be captured at run level and why. A generic policy might say that outputs should be reviewed by a human. RAIDT goes further by making that review evidential: which prompt was used, which output was produced, who reviewed it, what concerns were raised, how the run scored against the five pillars, and what follow-up action was taken.
Practical example in RAIDT terms
Consider a public-sector benefits team using a generative AI tool to draft first-pass response letters to citizens. The use case is administrative productivity, not autonomous decision-making, but each run still affects communication quality, fairness, and accountability. In a manual implementation of RAIDT, the officer records the task context, prompt, model used, date and time, source material, generated draft, and reviewer comments in a controlled evidence template.
The run-level issue is that a disputed or misleading letter may later need to be reconstructed. Without structured capture, the team may know that AI was used but be unable to show how the wording emerged or whether the draft was properly checked. Manual implementation addresses this by preserving the prompt-output pair, the reviewer sign-off, any edits made before issue, and the reasoning used for the RAIDT score.
The evidence needed includes the task description, prompt text, output text, reviewer identity, revision notes, applicable policy constraints, scoring sheet, and any escalation decision. Responsibility is affected because a human owner must remain accountable for the run. Auditability and Traceability are affected because the sequence of actions must be reconstructable. Interpretability matters because reviewers must explain why the output was acceptable or problematic. Dependability matters because repeated use should show stable review practice rather than arbitrary handling. The result is improved governance readiness even though the implementation remains manual.
Detailed link to RAIDT
Manual implementation links to RAIDT in four ways.
First, it links to RAIDT's core idea by showing that governance begins with evidence discipline around a specific run rather than with general statements about AI principles.
Second, it links to the run because manual implementation determines how the inputs, outputs, context, reviewers, and decisions for that run are captured and preserved.
Third, it links to the evidence pack and score profile because a manually assembled record can still support structured scoring across Responsibility, Auditability, Interpretability, Dependability, and Traceability.
Fourth, it links to reviewability, contestability, audit readiness, and organisational learning because the preserved record allows later inspection, challenge, comparison, and improvement.
Manual implementation ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
In other words, manual implementation is the operational bridge that turns RAIDT from a conceptual framework into a reproducible governance routine.
Link to the five RAIDT pillars
Responsibility
Manual implementation strengthens Responsibility by making clear who initiated the run, who reviewed it, and who accepted the final output for use. It prevents the diffusion of accountability that often occurs when AI assistance is treated as an informal background tool.
Example evidence / implication:
- Named run owner and reviewer recorded on the form.
- Sign-off or escalation field showing who accepted or rejected the output.
Auditability
Manual implementation supports Auditability when the evidence captured is sufficient for another reviewer to inspect how the run was conducted. It does not require automation, but it does require an orderly record that can be checked later.
Example evidence / implication:
- Saved prompt-output pair stored with timestamp and version label.
- Reviewer checklist and scoring sheet retained with the run artefacts.
Interpretability
Manual implementation contributes to Interpretability by requiring reviewers to explain why an output was considered acceptable, unclear, risky, or in need of revision. This creates a practical interpretive record rather than relying on unexamined judgement.
Example evidence / implication:
- Free-text rationale explaining why a score was assigned.
- Notes on misleading phrasing, ambiguity, or missing justification in the output.
Dependability
Manual implementation affects Dependability by showing whether runs are handled consistently enough to support reliable organisational practice. It is especially useful in early deployment because it reveals process variance before scale is introduced.
Example evidence / implication:
- Repeated use of the same template across comparable runs.
- Documentation of corrections, exceptions, and recurring failure modes.
Traceability
Manual implementation is particularly strong on Traceability when naming conventions, folders, forms, and timestamps are used well. Traceability is weakened quickly if artefacts are scattered or inconsistently labelled, so disciplined handling is essential.
Example evidence / implication:
- Unique run identifier connecting prompt, output, review form, and score sheet.
- Record of revisions showing how the final output differed from the initial generation.
Manual implementation touches all five pillars, but its strongest immediate effects are usually on Auditability and Traceability because those pillars depend most directly on preserving a reconstructable run record.
Why this item is more than a generic concept
In general AI governance, manual implementation might simply mean that people review outputs without automation or that a team starts with spreadsheets before buying software. In RAIDT, the meaning is more specific and more operational. Manual implementation is a defined mode of applying run-level governance in which evidence fields, review steps, and scoring logic are preserved even when the workflow is low-tech.
The RAIDT meaning is therefore more rigorous than a generic reference to manual process. It is not enough that humans are involved; the run must still become inspectable. This is what makes manual implementation meaningful inside RAIDT: it retains the evidential architecture of the framework rather than reducing governance to informal oversight.
Common misunderstanding
Misunderstanding
If RAIDT is implemented manually, it is only a temporary workshop exercise and does not count as serious governance.
Correction
Manual implementation can certainly be used in workshops and pilots, but its importance is broader. It is a legitimate governance mode when the organisation has not yet built automation, when the number of runs is modest, or when a higher level of human scrutiny is needed. For example, a small legal team using generative AI to draft internal memos may prefer a manual evidence workflow precisely because it makes each run visible and reviewable. The weakness is not manuality itself; the weakness would be poor control over documentation, storage, and review discipline.
Boundary and limitation
Manual implementation does not guarantee good governance merely because records exist. It does not prove that reviewers made the right decision, remove the possibility of human inconsistency, or scale efficiently to high-volume environments. It can also become burdensome if too many fields are collected without clear purpose, or unreliable if users fail to save artefacts consistently.
The concept therefore has clear operating conditions. It works best when templates are standardised, responsibilities are assigned, storage is controlled, and the scope of reviewed runs is manageable. RAIDT handles the limitation by treating manual implementation as one level within a broader implementation pathway. Once pain points appear, teams can move toward semi-automation or orchestration while keeping the same run-level governance logic.
Implementation levels
Manual implementation
A researcher or small team can apply RAIDT manually by using a standard run template, a controlled spreadsheet, saved prompt-output files, reviewer forms, and a scoring sheet. This is suitable for pilots, early-stage fieldwork, doctoral demonstrations, supervision meetings, and low-volume organisational use.
Semi-automated implementation
Semi-automated implementation adds structure through metadata fields, shared templates, simple scripts, form-based capture, and partially pre-filled review workflows. This reduces administrative burden while retaining close human oversight and is often the next step after manual piloting.
Fully automated implementation
Fully automated implementation uses wrappers, dashboards, orchestration layers, logging systems, governance pipelines, or platform integrations to capture runs and generate review artefacts at scale. In RAIDT terms, automation changes the efficiency and consistency of evidence capture, but not the underlying requirement that each run remain reviewable and evidentially grounded.
Practical use in the RAIDT project
Within the RAIDT project, manual implementation is useful in several ways. For Paper 08 Foundations, it demonstrates that RAIDT is conceptually operationalisable even without specialised infrastructure. For Paper 09 Empirical Validation, it offers a realistic deployment mode for pilot studies, workshops, and comparative scoring exercises in live organisational settings. For Paper 10 Policy Pathways, it shows policymakers and institutional leaders that governance adoption need not wait for enterprise tooling.
The item also supports sector playbooks because many domains begin with constrained resources and cautious experimentation. It links directly to the evidence pack and scoring rubric by showing how those outputs can be assembled and defended in small-scale studies. For viva defence and supervisor explanation, manual implementation is especially valuable because it answers the practical challenge, ?What would this look like on Monday morning in a real organisation??
Finally, the concept helps journal positioning and audience communication. It shows that RAIDT is not dependent on a proprietary platform or a technically privileged environment. That makes the framework more transferable, more testable, and more relevant to organisations that need governance now rather than later.
Key audience questions to prepare for
Q1. Is manual implementation just a stopgap before the real system exists?
It can be a transitional stage, but it is also a valid governance mode in its own right for smaller-scale or high-scrutiny contexts. What matters is whether the run is documented in a structured and reviewable way.
Q2. What is the minimum evidence needed for manual RAIDT implementation?
At minimum, the run should preserve the task context, prompt, output, model or system used, reviewer judgement, timestamp, and scoring rationale. Without those elements, later reconstruction becomes weak.
Q3. Does manual implementation reduce the credibility of the RAIDT score profile?
No, not if the underlying evidence is captured consistently. The credibility of the score depends more on disciplined evidence and transparent judgement than on whether a platform generated the record automatically.
Q4. When should an organisation move beyond manual implementation?
The shift usually becomes necessary when run volume increases, review burden becomes excessive, or inconsistency appears across reviewers or teams. At that point, semi-automation improves efficiency without changing the core governance logic.
Q5. Why is manual implementation important in academic and supervisory settings?
It makes RAIDT demonstrable, teachable, and testable. Supervisors, examiners, and workshop participants can inspect the full governance process without needing access to a specialised system stack.
Suggested citation concepts to support this item
- run-level AI governance documentation
- manual audit trail methods for AI systems
- human-in-the-loop governance workflows for generative AI
- evidence capture for responsible AI deployment
- sociotechnical implementation of AI governance in organisations
- low-resource AI governance and compliance practices
- operationalising accountability in generative AI use
- documentation and traceability in AI-assisted decision support
- staged adoption models for AI governance
- reviewer-based evaluation frameworks for generative AI outputs
Short explanation for presentation
Manual implementation means applying RAIDT through structured human-operated processes rather than through a dedicated software platform. A team can use controlled spreadsheets, saved prompts and outputs, reviewer forms, and scoring sheets to capture each run as a unit of governance. This matters because RAIDT is designed around evidential discipline, not around a particular technical stack. If the right fields are preserved, a manual workflow can still produce a run-level evidence pack, a five-pillar score profile, and a clear basis for review, contestation, and audit preparation. In supervision, policy, and pilot settings, this is important because it shows that governance can begin immediately. The organisation does not need to wait for full automation before it starts generating accountable and reviewable evidence about how generative AI is being used.
One-line takeaway
Manual implementation is the structured low-tech application of RAIDT because it turns individual runs into reviewable evidence, evidence packs, and governance-ready score profiles.