S10.11 - Education
S10.11 ? Education
flowchart LR
A[Educational pressures
feedback at scale, adaptive learning,
assessment integrity] --> B[RAIDT
run-level evidence framework]
H[Educational run fields
rubric, prompt, source work,
model settings, tutor edits, timestamps] --> C[[Education
domain playbook for governable GenAI use]]
B --> C
C --> D[Evidence pack]
C --> E[RAIDT score profile]
D --> F[Reviewer reconstruction
moderation and challenge]
E --> G[Governance readiness
organisational learning and policy alignment]? Star S10 - Empirical Programme, Domains and Sector Playbooks
Star context: Shows how RAIDT is applied in the education domain, where teaching, learning, assessment, and feedback workflows require evidence that GenAI use was fair, reviewable, interpretable, and open to challenge.
Academic picture
Definition / background
Education, in this RAIDT item, refers to the domain-specific application of the framework to teaching, learning, assessment, feedback, student support, and related institutional workflows in schools, colleges, universities, and professional learning settings. It is not simply a label for AI in education as a broad research field. Within RAIDT, Education functions as a sector playbook that specifies how run-level evidence should be understood when GenAI outputs may shape learner understanding, progression, fairness, or academic judgement.
The concept matters because educational settings combine pedagogical goals with governance obligations. A GenAI output in education may appear low stakes when compared with clinical or legal advice, yet it can still influence grades, feedback quality, confidence, access to support, and perceptions of fairness. Educational use therefore raises distinctive questions about assessment integrity, tutor accountability, explainability of feedback, accessibility, and the developmental impact of automation on learners.
Conceptually, this item differs from generic discussions of educational technology adoption. A great deal of AI-in-education discourse focuses on innovation, efficiency, or learning outcomes at a system level. RAIDT instead asks what evidence exists for one concrete run. That shift matters because educational disputes and governance questions usually arise from specific events: one generated feedback report, one adaptive recommendation, one flagged assessment anomaly, one student-facing explanation, or one tutor-assisted drafting interaction.
This item belongs inside RAIDT because the framework's central claim is that responsible GenAI governance should attach to actual use events. In education, that means connecting a specific run to the run-level evidence pack and then to the five-pillar score profile across Responsibility, Auditability, Interpretability, Dependability, and Traceability. The Education item therefore translates RAIDT's general evidential logic into a domain where fairness, reviewability, and human educational judgement remain essential.
Why this concept matters
Education matters in RAIDT because educational organisations are under pressure to use GenAI productively while still protecting fairness, academic standards, and learner welfare. Without a domain-specific treatment, governance can remain too generic. Institutions may have an AI policy, a plagiarism policy, or a digital learning strategy, yet still struggle to explain whether one particular use of GenAI in feedback, tutoring, or assessment support was appropriate and reviewable.
The concept also prevents an important confusion: educational value is not the same as governance adequacy. A tool may appear helpful, save staff time, or even improve student satisfaction, but those benefits do not remove the need to examine what evidence exists for a specific run. RAIDT helps avoid the mistake of treating educational usefulness as sufficient proof of responsible use.
If this item is missing, organisations risk treating education as just another deployment context rather than a domain with its own accountability pressures. That can lead to weak moderation, poor documentation of AI-assisted feedback, unclear responsibility between teacher and tool, inconsistent handling of assessment integrity, and limited ability to defend practice to students, supervisors, quality offices, or regulators. RAIDT uses the Education item to move from broad principle to operational governance in a recognisably educational setting.
Key idea: Education matters in RAIDT because educational uses of GenAI affect learners and assessment in ways that must be evidenced at the level of the individual run, not assumed from policy alone.
What this item enables
- Translation of RAIDT's general framework into educational workflows such as feedback generation, adaptive support, assessment assistance, and student-facing communication.
- Identification of which educational GenAI runs require stronger evidence capture because they influence marks, progression, fairness, or academic integrity.
- Specification of the contextual information needed to review educational uses, including rubric alignment, source use, disclosure, moderation, and tutor intervention.
- Comparison of educational runs across the five RAIDT pillars so that governance readiness can be assessed rather than merely asserted.
- Clearer challenge and review when a student, marker, supervisor, or quality reviewer questions a GenAI-assisted output.
- Organisational learning about where GenAI adds value in education and where stronger controls or redesign are needed.
Practical example / likely audience question
Audience question
How does RAIDT help?
Answer
The concern behind this question is usually that education already has review processes, marking rubrics, moderation practices, and academic regulations, so a separate framework may appear unnecessary. The direct answer is that RAIDT helps by making one educational use of GenAI reconstructable. It records the evidence needed to examine feedback quality, source use, review, and challenge in a way that generic governance documents do not.
Consider a university module leader using GenAI to draft formative feedback comments on a large cohort's essays. If a student later argues that the feedback was inconsistent, misleading, or insufficiently grounded in the submission, the institution needs more than a general statement that staff were allowed to use AI. It needs evidence of the prompt template, the rubric or marking criteria supplied, the student work or excerpt used as input, the generated feedback, the tutor's edits, the moderation step, and any disclosure to students.
RAIDT handles this better than a generic AI governance approach because it does not stop at principle-level approval. It ties educational use to one run, one evidence pack, and one score profile. That allows the institution to reconstruct what happened, test whether safeguards were followed, and improve the workflow rather than relying on anecdote or assumption.
Practical example in RAIDT terms
A university uses a GenAI assistant to draft formative feedback on first-year social science essays. The educational use case is legitimate: tutors want faster, more consistent comments while retaining academic judgement. The run-level issue is whether the generated feedback is accurate, aligned with the rubric, appropriate to the student's work, and clearly subject to human review before release.
The evidence needed for that run includes the task definition, the essay or excerpt supplied, the rubric or criteria used, the prompt template, the model or tool version, any system settings, the generated feedback, the tutor's edits, the final released feedback, and the moderation or sign-off record. Responsibility is affected because the tutor and institution must remain accountable for what students receive. Auditability is affected because a reviewer must be able to reconstruct the run after a complaint or quality review. Interpretability is affected because the pathway from source material and prompt to feedback must be understandable enough to evaluate. Dependability is affected because repeated use should not produce erratic or low-quality guidance. Traceability is affected because the run must be linked to actor, time, artefacts, and outcome.
In governance-readiness terms, the Education item improves the organisation's position by turning AI-assisted feedback into a reviewable process rather than an informal convenience. It supports moderation, contestability, student assurance, and evidence-based refinement of institutional practice.
Detailed link to RAIDT
Education links to RAIDT in four ways.
First, it gives RAIDT a concrete domain in which the framework's core claim can be tested: governance should attach to actual organisational use of GenAI, not only to broad tool descriptions or institutional policy.
Second, it translates the run into educational terms by asking what happened in one teaching, feedback, assessment, or student-support event and what domain-specific context must be recorded for that run to be reviewable.
Third, it shapes the evidence pack and score profile by clarifying what evidence matters in education, such as rubric alignment, human moderation, learner impact, disclosure, and challenge pathways.
Fourth, it strengthens reviewability, contestability, audit readiness, and organisational learning by enabling educational institutions to examine disputed or high-impact AI-assisted events as concrete cases rather than abstract policy questions.
Education playbook ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
Link to the five RAIDT pillars
Responsibility
In education, Responsibility is especially important because AI assistance must not obscure who is accountable for feedback, recommendations, or assessment-related actions. Educational judgement remains a human and institutional responsibility even when drafting or analytic support is automated.
Example evidence / implication:
- Record of the staff role responsible for initiating, reviewing, and approving the AI-assisted output.
- Evidence that accountability for the educational decision remained with the tutor, marker, or institution rather than being displaced onto the tool.
Auditability
Education has a strong Auditability requirement because students, supervisors, quality teams, and external reviewers may need to reconstruct how an output was produced and whether relevant procedures were followed.
Example evidence / implication:
- Preserved prompt, rubric, source material, output, and moderation notes for the specific run.
- Sufficient documentation to support a complaint review, internal moderation exercise, or quality assurance audit.
Interpretability
Interpretability matters because educational users need to understand how an output relates to the task, the source material, and the learning objective. A useful educational output is not enough if its rationale cannot be reasonably examined.
Example evidence / implication:
- Prompt wording and task instructions linked clearly to the generated recommendation or feedback.
- Reviewer notes explaining why the output was accepted, amended, or rejected in educational terms.
Dependability
Dependability matters because repeated educational use should produce outputs that are stable enough to support fair and reliable practice. If similar runs produce unpredictable feedback quality, the educational workflow becomes hard to defend.
Example evidence / implication:
- Comparison of generated feedback quality across repeated runs or across similar assignments.
- Record of errors, inappropriate suggestions, or drift from rubric standards that required correction.
Traceability
Traceability is essential because educational governance depends on linking the run to the relevant learner, task, timing, materials, and downstream action without losing the history of human intervention.
Example evidence / implication:
- Timestamped link between the run, the educational artefacts used, and the final feedback or decision released.
- Clear chain showing how source material, prompt, output, and human edits relate to each other.
Education affects all five pillars, but it is especially consequential for Responsibility, Auditability, and Traceability because educational disputes often turn on who approved an output, what evidence exists, and whether the case can be reconstructed fairly.
Why this item is more than a generic concept
In general AI governance, education may be treated as one sector among many, often discussed at the level of policy, innovation strategy, academic integrity concern, or edtech adoption. In RAIDT, Education has a more specific meaning. It is the domain playbook that operationalises how run-level governance should work when GenAI is used in teaching, learning, assessment, feedback, or student support.
The RAIDT meaning is more operational because it ties educational concerns to specific runs, evidence capture requirements, evidence packs, and score profiles. It therefore turns broad educational AI debates into reviewable governance practice. The point is not simply that education is an important field. The point is that educational uses become governable when they are evidenced and assessed at run level.
Common misunderstanding
Misunderstanding
Education is only a medium-risk or low-risk domain unless AI is directly assigning final grades.
Correction
That view is too narrow. Even when a GenAI system is used only for formative feedback, adaptive hints, student-facing explanations, or draft communications, it can still affect learner confidence, study choices, opportunity, and perceived fairness. For example, if AI-generated feedback repeatedly gives weaker guidance to certain students, or if a tutoring assistant offers misleading support before an assessment, the effect may be educationally significant even without automated final grading. RAIDT therefore treats education as a domain where run-level evidence still matters because developmental consequences and accountability demands can be substantial.
Boundary and limitation
This item does not prove that GenAI improves learning, guarantees pedagogical quality, or resolves broader debates about assessment design, academic misconduct, accessibility, privacy, or safeguarding. Nor does it replace institutional governance mechanisms such as ethics review, curriculum governance, procurement checks, staff development, or legal compliance processes. Its role is narrower and more practical: to specify how educational uses of GenAI should be evidenced and reviewed at run level.
The concept can also fail if it is applied too loosely or too rigidly. If institutions treat every trivial use as requiring exhaustive capture, the process may become burdensome and performative. If they capture too little, disputed educational runs remain opaque. RAIDT handles this limitation by emphasising proportionate, risk-sensitive evidence capture tied to the significance of the educational task and the likely consequence for learners or assessment.
Implementation levels
Manual implementation
A researcher, lecturer, or small educational team can apply this item manually by using a structured template for significant GenAI-assisted runs. The template can capture task purpose, learner context, prompt, source materials, output, reviewer comments, disclosure, and the final action taken.
Semi-automated implementation
Semi-automated implementation can use forms, LMS fields, prompt templates, and moderation checklists so that key metadata are captured consistently without requiring fully bespoke record-keeping each time. For example, a feedback workflow might auto-fill timestamps and user roles while still requiring staff to record rubric use and approval status.
Fully automated implementation
At scale, a governance wrapper around an educational platform, virtual learning environment, or institutional GenAI service can capture run metadata, prompts, artefacts, reviewer actions, and scoring inputs automatically. A dashboard or evidence service can then assemble education-specific evidence packs, support moderation and complaint review, and monitor governance readiness across courses or departments.
Practical use in the RAIDT project
Within the RAIDT project, the Education item is useful in Paper 08 Foundations because it shows that the framework is not merely abstract. It demonstrates how the run-level idea travels into a domain where fairness, explanation, and human judgement matter but where many uses are also routine and scalable. That helps clarify RAIDT's conceptual position between general AI governance and domain-specific implementation.
For Paper 09 Empirical Validation, education provides a strong test setting because it contains a mix of lower-stakes and higher-stakes tasks, repeated workflows, and visible challenge mechanisms such as moderation, appeals, and quality review. This makes it a useful domain for testing what evidence can actually be captured and how consistently scoring can be applied.
For Paper 10 Policy Pathways and the sector playbooks, this item helps show how institutional policy can move from broad statements about acceptable AI use toward operational controls, evidence thresholds, and review procedures. It is also useful for explaining the evidence pack, scoring rubric, governance interventions, and viva positioning because it gives a familiar, concrete domain through which the RAIDT logic can be defended to supervisors, reviewers, and practitioner audiences.
Key audience questions to prepare for
Q1. Does RAIDT require every educational use of GenAI to be documented in full detail?
No. RAIDT supports proportionate capture. A trivial drafting aid for low-consequence internal work may need minimal recording, whereas AI-assisted feedback, assessment support, or student-facing advice may require richer evidence because the governance consequences are higher.
Q2. How does this item relate to academic integrity?
It helps separate broad concern from specific evidence. Instead of only stating that integrity matters, RAIDT asks what happened in one run, what materials were used, whether the process was disclosed and reviewed, and whether the educational workflow remained defensible.
Q3. Why is human review not enough on its own?
Human review matters, but without evidence it can become a vague assurance claim. RAIDT asks whether the review can be shown, reconstructed, and assessed after the event, including what the reviewer saw, changed, approved, or rejected.
Q4. Could existing LMS or platform logs already solve this problem?
Usually only in part. Platform logs may show access events or timestamps, but they often do not capture prompt rationale, rubric alignment, review quality, educational purpose, or the reasoning behind accepting or revising an output. RAIDT adds governance-relevant context.
Q5. Why include Education as a distinct RAIDT item rather than treating it as just another example?
Because the domain changes what evidence matters. Educational runs bring distinctive issues around learning impact, assessment fairness, tutor accountability, and challenge procedures. A dedicated item shows how RAIDT is calibrated to domain-specific governance needs rather than applied in a one-size-fits-all way.
Suggested citation concepts to support this item
- generative AI in education governance and accountability
- AI-assisted feedback quality assurance in higher education
- academic integrity and generative AI evidence practices
- human oversight in AI-supported assessment workflows
- explainability and transparency in educational AI systems
- audit trails and traceability for AI in universities
- fairness and bias in automated educational feedback
- sociotechnical governance of AI in teaching and learning
- documentation practices for AI-supported marking and moderation
- organisational adoption of generative AI in education
Short explanation for presentation
Education is the RAIDT domain playbook for teaching, learning, assessment, feedback, and student-support uses of generative AI. Its importance is that educational AI governance cannot rely only on general policy or platform approval. One concrete run, such as AI-assisted feedback on an essay, may affect fairness, learner understanding, and confidence in assessment. RAIDT therefore asks whether that run can be reconstructed through evidence: the task, prompt, source materials, output, human edits, moderation, and final release. Once that evidence exists, the organisation can build an evidence pack, justify a five-pillar score profile, and improve governance readiness through review and learning. In this way, Education shows how RAIDT becomes operational in a domain where human judgement remains central and where accountability to learners is unavoidable.
One-line takeaway
Education is the RAIDT sector playbook for teaching, learning, assessment, and feedback because it ties educational GenAI use to run-level evidence, scoreable governance, and reviewable accountability.
Related items in empirical programme, domains and sector playbooks
- S10.01 ? Empirical programme
- S10.02 ? 14 domains
- S10.03 ? 20 scenarios per domain
- S10.04 ? 6 configurations
- S10.05 ? Repeated runs
- S10.06 ? Governance readiness as outcome
- S10.07 ? Healthcare
- S10.08 ? Finance
- S10.09 ? Law and public services
- S10.10 ? Cybersecurity
- S10.12 ? Environment
- S10.13 ? Crisis and emergency response
- S10.14 ? Supply chain
- S10.15 ? Ageing calibration
Anchored questions
The source note does not currently include anchored questions for this item.
Mentioned in reference-paper summaries (5)
Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.
REF-020__Bommasani-2021.mdREF-024__Charness-2009.mdREF-027__Currie-2025.mdREF-045__Haki-2025.mdREF-068__Marjanovic-2021.md