S10.11 - Education

S10.11 ? Education

flowchart LR
    A[Educational pressures
feedback at scale, adaptive learning,
assessment integrity] --> B[RAIDT
run-level evidence framework] H[Educational run fields
rubric, prompt, source work,
model settings, tutor edits, timestamps] --> C[[Education
domain playbook for governable GenAI use]] B --> C C --> D[Evidence pack] C --> E[RAIDT score profile] D --> F[Reviewer reconstruction
moderation and challenge] E --> G[Governance readiness
organisational learning and policy alignment]

? Star S10 - Empirical Programme, Domains and Sector Playbooks

Star context: Shows how RAIDT is applied in the education domain, where teaching, learning, assessment, and feedback workflows require evidence that GenAI use was fair, reviewable, interpretable, and open to challenge.


Academic picture
Definition / background

Education, in this RAIDT item, refers to the domain-specific application of the framework to teaching, learning, assessment, feedback, student support, and related institutional workflows in schools, colleges, universities, and professional learning settings. It is not simply a label for AI in education as a broad research field. Within RAIDT, Education functions as a sector playbook that specifies how run-level evidence should be understood when GenAI outputs may shape learner understanding, progression, fairness, or academic judgement.

The concept matters because educational settings combine pedagogical goals with governance obligations. A GenAI output in education may appear low stakes when compared with clinical or legal advice, yet it can still influence grades, feedback quality, confidence, access to support, and perceptions of fairness. Educational use therefore raises distinctive questions about assessment integrity, tutor accountability, explainability of feedback, accessibility, and the developmental impact of automation on learners.

Conceptually, this item differs from generic discussions of educational technology adoption. A great deal of AI-in-education discourse focuses on innovation, efficiency, or learning outcomes at a system level. RAIDT instead asks what evidence exists for one concrete run. That shift matters because educational disputes and governance questions usually arise from specific events: one generated feedback report, one adaptive recommendation, one flagged assessment anomaly, one student-facing explanation, or one tutor-assisted drafting interaction.

This item belongs inside RAIDT because the framework's central claim is that responsible GenAI governance should attach to actual use events. In education, that means connecting a specific run to the run-level evidence pack and then to the five-pillar score profile across Responsibility, Auditability, Interpretability, Dependability, and Traceability. The Education item therefore translates RAIDT's general evidential logic into a domain where fairness, reviewability, and human educational judgement remain essential.

Why this concept matters

Education matters in RAIDT because educational organisations are under pressure to use GenAI productively while still protecting fairness, academic standards, and learner welfare. Without a domain-specific treatment, governance can remain too generic. Institutions may have an AI policy, a plagiarism policy, or a digital learning strategy, yet still struggle to explain whether one particular use of GenAI in feedback, tutoring, or assessment support was appropriate and reviewable.

The concept also prevents an important confusion: educational value is not the same as governance adequacy. A tool may appear helpful, save staff time, or even improve student satisfaction, but those benefits do not remove the need to examine what evidence exists for a specific run. RAIDT helps avoid the mistake of treating educational usefulness as sufficient proof of responsible use.

If this item is missing, organisations risk treating education as just another deployment context rather than a domain with its own accountability pressures. That can lead to weak moderation, poor documentation of AI-assisted feedback, unclear responsibility between teacher and tool, inconsistent handling of assessment integrity, and limited ability to defend practice to students, supervisors, quality offices, or regulators. RAIDT uses the Education item to move from broad principle to operational governance in a recognisably educational setting.

Key idea: Education matters in RAIDT because educational uses of GenAI affect learners and assessment in ways that must be evidenced at the level of the individual run, not assumed from policy alone.

What this item enables
Practical example / likely audience question

Audience question

How does RAIDT help?

Answer

The concern behind this question is usually that education already has review processes, marking rubrics, moderation practices, and academic regulations, so a separate framework may appear unnecessary. The direct answer is that RAIDT helps by making one educational use of GenAI reconstructable. It records the evidence needed to examine feedback quality, source use, review, and challenge in a way that generic governance documents do not.

Consider a university module leader using GenAI to draft formative feedback comments on a large cohort's essays. If a student later argues that the feedback was inconsistent, misleading, or insufficiently grounded in the submission, the institution needs more than a general statement that staff were allowed to use AI. It needs evidence of the prompt template, the rubric or marking criteria supplied, the student work or excerpt used as input, the generated feedback, the tutor's edits, the moderation step, and any disclosure to students.

RAIDT handles this better than a generic AI governance approach because it does not stop at principle-level approval. It ties educational use to one run, one evidence pack, and one score profile. That allows the institution to reconstruct what happened, test whether safeguards were followed, and improve the workflow rather than relying on anecdote or assumption.

Practical example in RAIDT terms

A university uses a GenAI assistant to draft formative feedback on first-year social science essays. The educational use case is legitimate: tutors want faster, more consistent comments while retaining academic judgement. The run-level issue is whether the generated feedback is accurate, aligned with the rubric, appropriate to the student's work, and clearly subject to human review before release.

The evidence needed for that run includes the task definition, the essay or excerpt supplied, the rubric or criteria used, the prompt template, the model or tool version, any system settings, the generated feedback, the tutor's edits, the final released feedback, and the moderation or sign-off record. Responsibility is affected because the tutor and institution must remain accountable for what students receive. Auditability is affected because a reviewer must be able to reconstruct the run after a complaint or quality review. Interpretability is affected because the pathway from source material and prompt to feedback must be understandable enough to evaluate. Dependability is affected because repeated use should not produce erratic or low-quality guidance. Traceability is affected because the run must be linked to actor, time, artefacts, and outcome.

In governance-readiness terms, the Education item improves the organisation's position by turning AI-assisted feedback into a reviewable process rather than an informal convenience. It supports moderation, contestability, student assurance, and evidence-based refinement of institutional practice.

Detailed link to RAIDT

Education links to RAIDT in four ways.

First, it gives RAIDT a concrete domain in which the framework's core claim can be tested: governance should attach to actual organisational use of GenAI, not only to broad tool descriptions or institutional policy.

Second, it translates the run into educational terms by asking what happened in one teaching, feedback, assessment, or student-support event and what domain-specific context must be recorded for that run to be reviewable.

Third, it shapes the evidence pack and score profile by clarifying what evidence matters in education, such as rubric alignment, human moderation, learner impact, disclosure, and challenge pathways.

Fourth, it strengthens reviewability, contestability, audit readiness, and organisational learning by enabling educational institutions to examine disputed or high-impact AI-assisted events as concrete cases rather than abstract policy questions.

Education playbook ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

Link to the five RAIDT pillars

Responsibility

In education, Responsibility is especially important because AI assistance must not obscure who is accountable for feedback, recommendations, or assessment-related actions. Educational judgement remains a human and institutional responsibility even when drafting or analytic support is automated.

Example evidence / implication:

Auditability

Education has a strong Auditability requirement because students, supervisors, quality teams, and external reviewers may need to reconstruct how an output was produced and whether relevant procedures were followed.

Example evidence / implication:

Interpretability

Interpretability matters because educational users need to understand how an output relates to the task, the source material, and the learning objective. A useful educational output is not enough if its rationale cannot be reasonably examined.

Example evidence / implication:

Dependability

Dependability matters because repeated educational use should produce outputs that are stable enough to support fair and reliable practice. If similar runs produce unpredictable feedback quality, the educational workflow becomes hard to defend.

Example evidence / implication:

Traceability

Traceability is essential because educational governance depends on linking the run to the relevant learner, task, timing, materials, and downstream action without losing the history of human intervention.

Example evidence / implication:

Education affects all five pillars, but it is especially consequential for Responsibility, Auditability, and Traceability because educational disputes often turn on who approved an output, what evidence exists, and whether the case can be reconstructed fairly.

Why this item is more than a generic concept

In general AI governance, education may be treated as one sector among many, often discussed at the level of policy, innovation strategy, academic integrity concern, or edtech adoption. In RAIDT, Education has a more specific meaning. It is the domain playbook that operationalises how run-level governance should work when GenAI is used in teaching, learning, assessment, feedback, or student support.

The RAIDT meaning is more operational because it ties educational concerns to specific runs, evidence capture requirements, evidence packs, and score profiles. It therefore turns broad educational AI debates into reviewable governance practice. The point is not simply that education is an important field. The point is that educational uses become governable when they are evidenced and assessed at run level.

Common misunderstanding

Misunderstanding

Education is only a medium-risk or low-risk domain unless AI is directly assigning final grades.

Correction

That view is too narrow. Even when a GenAI system is used only for formative feedback, adaptive hints, student-facing explanations, or draft communications, it can still affect learner confidence, study choices, opportunity, and perceived fairness. For example, if AI-generated feedback repeatedly gives weaker guidance to certain students, or if a tutoring assistant offers misleading support before an assessment, the effect may be educationally significant even without automated final grading. RAIDT therefore treats education as a domain where run-level evidence still matters because developmental consequences and accountability demands can be substantial.

Boundary and limitation

This item does not prove that GenAI improves learning, guarantees pedagogical quality, or resolves broader debates about assessment design, academic misconduct, accessibility, privacy, or safeguarding. Nor does it replace institutional governance mechanisms such as ethics review, curriculum governance, procurement checks, staff development, or legal compliance processes. Its role is narrower and more practical: to specify how educational uses of GenAI should be evidenced and reviewed at run level.

The concept can also fail if it is applied too loosely or too rigidly. If institutions treat every trivial use as requiring exhaustive capture, the process may become burdensome and performative. If they capture too little, disputed educational runs remain opaque. RAIDT handles this limitation by emphasising proportionate, risk-sensitive evidence capture tied to the significance of the educational task and the likely consequence for learners or assessment.

Implementation levels

Manual implementation

A researcher, lecturer, or small educational team can apply this item manually by using a structured template for significant GenAI-assisted runs. The template can capture task purpose, learner context, prompt, source materials, output, reviewer comments, disclosure, and the final action taken.

Semi-automated implementation

Semi-automated implementation can use forms, LMS fields, prompt templates, and moderation checklists so that key metadata are captured consistently without requiring fully bespoke record-keeping each time. For example, a feedback workflow might auto-fill timestamps and user roles while still requiring staff to record rubric use and approval status.

Fully automated implementation

At scale, a governance wrapper around an educational platform, virtual learning environment, or institutional GenAI service can capture run metadata, prompts, artefacts, reviewer actions, and scoring inputs automatically. A dashboard or evidence service can then assemble education-specific evidence packs, support moderation and complaint review, and monitor governance readiness across courses or departments.

Practical use in the RAIDT project

Within the RAIDT project, the Education item is useful in Paper 08 Foundations because it shows that the framework is not merely abstract. It demonstrates how the run-level idea travels into a domain where fairness, explanation, and human judgement matter but where many uses are also routine and scalable. That helps clarify RAIDT's conceptual position between general AI governance and domain-specific implementation.

For Paper 09 Empirical Validation, education provides a strong test setting because it contains a mix of lower-stakes and higher-stakes tasks, repeated workflows, and visible challenge mechanisms such as moderation, appeals, and quality review. This makes it a useful domain for testing what evidence can actually be captured and how consistently scoring can be applied.

For Paper 10 Policy Pathways and the sector playbooks, this item helps show how institutional policy can move from broad statements about acceptable AI use toward operational controls, evidence thresholds, and review procedures. It is also useful for explaining the evidence pack, scoring rubric, governance interventions, and viva positioning because it gives a familiar, concrete domain through which the RAIDT logic can be defended to supervisors, reviewers, and practitioner audiences.

Key audience questions to prepare for

Q1. Does RAIDT require every educational use of GenAI to be documented in full detail?

No. RAIDT supports proportionate capture. A trivial drafting aid for low-consequence internal work may need minimal recording, whereas AI-assisted feedback, assessment support, or student-facing advice may require richer evidence because the governance consequences are higher.

Q2. How does this item relate to academic integrity?

It helps separate broad concern from specific evidence. Instead of only stating that integrity matters, RAIDT asks what happened in one run, what materials were used, whether the process was disclosed and reviewed, and whether the educational workflow remained defensible.

Q3. Why is human review not enough on its own?

Human review matters, but without evidence it can become a vague assurance claim. RAIDT asks whether the review can be shown, reconstructed, and assessed after the event, including what the reviewer saw, changed, approved, or rejected.

Q4. Could existing LMS or platform logs already solve this problem?

Usually only in part. Platform logs may show access events or timestamps, but they often do not capture prompt rationale, rubric alignment, review quality, educational purpose, or the reasoning behind accepting or revising an output. RAIDT adds governance-relevant context.

Q5. Why include Education as a distinct RAIDT item rather than treating it as just another example?

Because the domain changes what evidence matters. Educational runs bring distinctive issues around learning impact, assessment fairness, tutor accountability, and challenge procedures. A dedicated item shows how RAIDT is calibrated to domain-specific governance needs rather than applied in a one-size-fits-all way.

Suggested citation concepts to support this item
Short explanation for presentation

Education is the RAIDT domain playbook for teaching, learning, assessment, feedback, and student-support uses of generative AI. Its importance is that educational AI governance cannot rely only on general policy or platform approval. One concrete run, such as AI-assisted feedback on an essay, may affect fairness, learner understanding, and confidence in assessment. RAIDT therefore asks whether that run can be reconstructed through evidence: the task, prompt, source materials, output, human edits, moderation, and final release. Once that evidence exists, the organisation can build an evidence pack, justify a five-pillar score profile, and improve governance readiness through review and learning. In this way, Education shows how RAIDT becomes operational in a domain where human judgement remains central and where accountability to learners is unavoidable.

One-line takeaway

Education is the RAIDT sector playbook for teaching, learning, assessment, and feedback because it ties educational GenAI use to run-level evidence, scoreable governance, and reviewable accountability.

Related items in empirical programme, domains and sector playbooks
Anchored questions

The source note does not currently include anchored questions for this item.

Mentioned in reference-paper summaries (5)

Paper summaries live in Port/93-References/pdf_summaries/. Each file listed below contains the key term at least once.

Powered by Forestry.md