S6.11 - Adapter_lineage

S6.11 ? Adapter lineage

flowchart LR
    A[Background problem:
adapted behaviour hidden behind
generic base-model labels] --> B[RAIDT:
run-level evidence framework] B --> C[[Adapter lineage:
exact base model + adapter + checkpoint
+ training-data reference + deployment config]] H[Healthcare, finance, public services,
enterprise productivity, cybersecurity] --> C I[MLOps registry, deployment logs,
release metadata, wrapper records] --> C C --> D[Evidence pack] C --> E[Score profile] C --> F[Reviewer reconstruction] D --> G[Reviewability and contestability] E --> J[Governance readiness] F --> J C --> K[Organisational learning and policy alignment]

? Star S6 - Influence Methods as Governance Interventions

Star context: Positions prompting, RAG, PEFT/LoRA, RLHF/DPO and stacked influence as governance-relevant influence methods whose effects should be evidenced at run level inside RAIDT, rather than treated as the project core in themselves.


Academic picture
Definition / background

Adapter lineage is the documented chain that links a specific run to the exact adapted model state used in that run. In practice, it records the base model, adapter identifier, checkpoint or version, relevant training-data reference, and deployment configuration. The concept matters because many contemporary GenAI systems are not used in their original base form: they are altered through PEFT, LoRA-style adapters, domain tuning, task tuning, safety tuning, or other parameter-efficient interventions that change behaviour without necessarily changing the headline model name.

Conceptually, adapter lineage sits between model provenance and deployment provenance. It is narrower than full data lineage because it does not attempt to describe every historical transformation in a model pipeline. It is also more precise than simply logging a model version, because two runs may use the same base model but materially different adapters and therefore exhibit different behaviour, risk profiles, and governance implications.

Inside RAIDT, this belongs squarely within run-level evidence because RAIDT treats the run as the unit of governance. If a run is to be reviewable, then the reviewer must know not only what prompt was used or what retrieval source was attached, but also which adapted behavioural layer was active. Adapter lineage therefore strengthens the integrity of the evidence pack and improves the defensibility of the resulting score profile across the five pillars.

Why this concept matters

Adapter lineage solves a common governance failure: organisations often document that they used ?Model X? while omitting the adapter or checkpoint that actually shaped the generated output. That creates confusion when outputs are challenged, when performance drifts, when different teams report inconsistent results, or when a regulator or internal reviewer asks what exactly was deployed.

Without adapter lineage, it becomes harder to distinguish whether an issue arose from the base model, the adapter, the prompt, the retrieval context, or the surrounding workflow. That weakens accountability and makes remediation slower and less credible. With adapter lineage, organisations can move from broad claims about responsible AI to specific, inspectable evidence about which behavioural modification was active at the moment of use.

Key idea: Adapter lineage matters because RAIDT can only govern an adapted model run properly if the exact behavioural modification applied in that run is explicitly evidenced.

What this item captures
Practical example / likely audience question

Audience question

Why log it?

Answer

The concern behind the question is often that logging adapter information sounds like an overly technical detail with little governance value. The direct answer is that adapter lineage is what allows a reviewer to know exactly which behavioural delta was active in the run. If a system produced an unsafe recommendation, a biased summary, or an unexpectedly confident answer, it is not enough to know only the base model family. The adapted layer may be the decisive factor.

A practical example is a domain-tuned assistant built on a common base model for internal policy interpretation. Two departments may both report using the same base model, yet one uses an adapter tuned on legacy policy documents and the other uses a newer compliance-focused adapter. If a disputed output appears, RAIDT handles the issue better than a generic AI governance approach because RAIDT expects the run-level evidence pack to distinguish those two configurations explicitly, rather than hiding them inside a vague ?model used? field.

Practical example in RAIDT terms

Consider a healthcare administration setting where a GenAI assistant drafts referral-priority summaries from clinician notes. The organisation uses a base model combined with a lightweight adapter trained to reflect local triage language and referral conventions. A problematic run occurs when the assistant overstates urgency for a non-urgent case.

The run-level issue is not simply whether the base model is suitable. The governance question is whether a specific adapter checkpoint introduced an over-assertive behavioural pattern after a recent tuning cycle. The evidence needed therefore includes the base model ID, adapter ID, adapter checkpoint, training-data reference for the tuning set, deployment timestamp, and any wrapper configuration showing which adapter was loaded.

The most affected RAIDT pillars are Auditability, Dependability, and Traceability, with secondary relevance to Responsibility and Interpretability. Adapter lineage improves governance readiness because reviewers can isolate whether the problem should trigger prompt redesign, adapter rollback, retraining review, or stronger human oversight, instead of relying on speculation.

Detailed link to RAIDT

Adapter lineage links to RAIDT in four ways.

First, it supports RAIDT's core idea that governance should focus on the actual configured use of GenAI rather than abstract descriptions of a system.
Second, it connects directly to the run by recording which adapted model state was active in that specific use event.
Third, it strengthens the evidence pack and informs the score profile by making model-configuration evidence inspectable rather than assumed.
Fourth, it improves reviewability, contestability, audit readiness, and organisational learning because later reviewers can reconstruct and compare runs with greater precision.

Adapter lineage ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness

Link to the five RAIDT pillars

Responsibility

Adapter lineage supports responsibility by showing who approved, selected, or deployed a specific behavioural modification and under what governance conditions. It helps prevent responsibility from being blurred behind a generic model label.

Example evidence / implication:

Auditability

This item is especially strong for auditability because it lets reviewers inspect which adapted configuration was used when a contested output was produced. Audit trails become materially more useful when they refer to the exact adapted state rather than only to a base model family.

Example evidence / implication:

Interpretability

Adapter lineage does not make a model intrinsically interpretable, but it improves interpretive context by narrowing the source of behavioural change. It helps reviewers understand whether an observed shift is plausibly due to tuning rather than prompt variation alone.

Example evidence / implication:

Dependability

This item strongly affects dependability because stable and safe performance depends on knowing which adapted state was actually deployed. If performance changes across runs, adapter lineage helps determine whether the variation is due to a changed adapter, checkpoint drift, or deployment inconsistency.

Example evidence / implication:

Traceability

Adapter lineage is central to traceability because it records the chain from run to model state to deployment configuration. It allows the organisation to trace outputs back to the specific adapted artefact that shaped them.

Example evidence / implication:

If prioritised comparatively, adapter lineage has its strongest direct effects on Auditability, Dependability, and Traceability.

Why this item is more than a generic concept

In general AI governance, adapter lineage may be treated as a technical provenance detail relevant mainly to ML operations or reproducibility. In RAIDT, it means something more operational: a run-level governance record that shows which behavioural modification was active when a real organisational decision-support output was produced. The RAIDT meaning is more practical because it ties adapter information to evidence packs, reviewer reconstruction, pillar scoring, and governance readiness rather than leaving it as background engineering metadata.

Common misunderstanding

Misunderstanding

If the base model is logged, the adapter details are optional.

Correction

That is incorrect because the adapter may be the most important source of behavioural change in the run. For example, two teams might both say they used the same base model for compliance summarisation, yet one team used a safety-tuned adapter and the other used a speed-optimised adapter. The governance implications are not equivalent. RAIDT therefore treats adapter lineage as necessary run evidence when adapted models materially shape output behaviour.

Boundary and limitation

Adapter lineage does not prove that a run was fair, accurate, safe, or lawful. It also does not explain every causal reason for a model output, because prompt wording, retrieval content, surrounding software logic, user behaviour, and environmental conditions may also matter. In addition, lineage records are only as reliable as the logging and configuration discipline behind them; weak release management or undocumented hotfixes can undermine the value of the record.

RAIDT handles this limitation by treating adapter lineage as one evidence component among others. It should be read alongside prompt records, retrieval evidence, task context, human oversight evidence, evaluation results, and incident review materials rather than as a standalone proof of good governance.

Implementation levels

Manual implementation

A researcher or small team can maintain adapter lineage manually by recording the base model, adapter name, version, checkpoint, tuning-data reference, and deployment notes in a structured run log or evidence-pack template.

Semi-automated implementation

Metadata templates, model registries, experiment trackers, or deployment forms can pre-fill adapter identifiers and version fields so that reviewers capture them consistently without relying on memory.

Fully automated implementation

At scale, a platform wrapper, orchestration layer, governance dashboard, or logging pipeline can automatically record the loaded base model, adapter artefact hash, checkpoint, release metadata, and deployment environment at run time, then attach those data to the RAIDT evidence pack and scoring workflow.

Practical use in the RAIDT project

Within the RAIDT project, adapter lineage is useful in at least five ways. In Paper 08 Foundations, it helps clarify why influence methods should be governed through run-level evidence rather than discussed only as abstract technical categories. In Paper 09 Empirical Validation, it supports comparison between runs, especially where behavioural differences may arise from lightweight tuning choices. In Paper 10 Policy Pathways, it offers a practical way to translate model provenance expectations into auditable organisational procedure.

It is also relevant to sector playbooks because adapted models are likely to appear in domain-specific deployments across healthcare, finance, public services, and enterprise productivity. In the evidence pack and scoring rubric, adapter lineage provides a concrete field for documentation and assessment. For supervisor explanation, viva defence, and journal positioning, it helps show that RAIDT is attentive to real deployment complexity without losing its core focus on governable evidence at the level of the run.

Key audience questions to prepare for

Q1. Is adapter lineage really necessary if we already log the model version?

Yes, because the model version may describe only the base model. If an adapter materially changes behaviour, then the run cannot be reconstructed properly without knowing that adapted layer.

Q2. How is adapter lineage different from ordinary model provenance?

Ordinary model provenance is broader and may describe development history at a high level. Adapter lineage is more specific to the exact adapted state active in a particular run and therefore has direct governance value inside RAIDT.

Q3. Does adapter lineage guarantee reproducibility?

No. It improves the conditions for reconstruction and comparison, but full reproducibility may still depend on prompts, retrieval content, system settings, external tools, and time-sensitive dependencies.

Q4. Why is this relevant to governance rather than only engineering?

Because when an output is challenged, the organisation must explain what was actually deployed. Adapter lineage makes that explanation evidence-based and reviewable rather than informal or retrospective.

Q5. What happens if an organisation cannot capture full adapter metadata yet?

RAIDT can still be applied, but the evidence pack should record that limitation explicitly and the weaker evidential position should influence review confidence and, where appropriate, the score profile.

Suggested citation concepts to support this item
Short explanation for presentation

Adapter lineage is the record of which adapted model state was actually active in a given RAIDT run. That matters because many organisations do not use a foundation model in its untouched form; they use a base model plus an adapter, checkpoint, or lightweight tuning layer that changes behaviour. If that layer is not logged, then later reviewers cannot tell which behavioural delta shaped the output. In RAIDT, adapter lineage therefore becomes part of run-level evidence. It strengthens the evidence pack, improves the credibility of scoring across the five pillars, and helps reviewers distinguish whether a problem arose from the base model, the adapter, the prompt, or the wider deployment setup. In short, it turns model customisation from an opaque engineering detail into a governable evidence object.

One-line takeaway

Adapter lineage is the run-level record of which adapted model state shaped an output because RAIDT governs real configured uses of GenAI through evidence, not assumptions.

Related items in influence methods as governance interventions
Anchored questions
Powered by Forestry.md