S5.09 - Composite_vs_profile
S5.09 ? Composite vs profile
flowchart LR
A[Pressure for one headline score] --> B[RAIDT
Run-level evidence framework]
A2[Risk: weak pillars hidden by averaging] --> B
H[Healthcare, finance, public services, enterprise use] --> C
B --> C[[Composite vs profile
Profile first, composite second]]
C --> D[Run-level evidence pack]
C --> E[Five-pillar score profile]
C --> F[Optional composite summary]
D --> G[Reviewer reconstruction]
E --> I[Weak pillar visibility]
E --> J[Targeted governance improvement]
F --> K[High-level reporting]
G --> L[Audit readiness]
I --> L
J --> M[Organisational learning]
B --> N[Evidence over assertion
Reviewability and contestability]
C --> N? Star S5 - RAIDT Pillars and Scoring
Star context: Situates the five RAIDT pillars within a scoring logic that keeps governance measurable without collapsing important differences between pillars into a misleadingly simple headline number.
Academic picture
Definition / background
In RAIDT, composite vs profile refers to the distinction between two ways of presenting governance scoring results for a run of a generative AI system. A composite score is a single summary value, typically derived by averaging the five RAIDT pillars. A profile is the full five-pillar pattern showing separate results for Responsibility, Auditability, Interpretability, Dependability, and Traceability.
The distinction matters because the two outputs do different jobs. The composite supports concise communication and coarse comparison. The profile supports judgement, diagnosis, and governance action. RAIDT therefore allows the composite as a shorthand, but treats the profile as the primary result. This is because organisational risk is often driven not by the average condition of a run, but by a weakness in one specific governance dimension.
Conceptually, this item sits at the intersection of scoring theory and governance design. Many assessment systems produce an aggregate measure for reporting convenience, but governance frameworks also need to preserve internal structure so that trade-offs remain visible. In generative AI governance, this is particularly important because a run may be dependable in output stability while still being weak in traceability, or auditable in logging while still lacking interpretability for affected reviewers.
Inside RAIDT, the concept belongs directly to run-level evidence because the score profile is meant to remain anchored to concrete evidence within the run-level evidence pack. The profile is not just a visual summary; it is an organised representation of how the evidence supports or limits governance confidence across the five pillars. In that sense, composite vs profile is not only a reporting choice but a design principle for evidence-based governance.
Why this concept matters
This concept matters because it prevents governance simplification from becoming governance distortion. If a single composite score is treated as the main result, organisations may overlook serious weaknesses that are hidden by stronger scores elsewhere. A run that appears acceptable on average may still be difficult to contest, poorly documented, or operationally fragile.
It also reduces confusion between measurement convenience and governance adequacy. In practice, managers often ask for one number because it feels easier to track. RAIDT accepts that need but avoids letting it dominate the interpretation of readiness. By retaining the profile as the main output, RAIDT keeps attention on the actual shape of governance performance rather than only its arithmetic average.
For organisations using generative AI, that distinction supports better escalation, remediation, and prioritisation. It helps reviewers identify whether a governance problem is about ownership, documentation, explanation, reliability, or reconstruction. This is one of the ways RAIDT moves AI governance away from broad principles and towards operational reviewability.
Key idea: The profile matters more than the composite because governance readiness depends on the pattern of evidence across pillars, not just on the average score.
What this item measures
- It measures whether governance readiness is being interpreted as a structured five-pillar pattern rather than a single flattened number.
- It measures whether scoring outputs preserve pillar-specific weaknesses that may require targeted intervention.
- It measures the difference between summary reporting value and decision-making value.
- It measures how far a run-level assessment remains connected to evidence rather than becoming an abstract metric.
- It measures whether reviewers can diagnose trade-offs across Responsibility, Auditability, Interpretability, Dependability, and Traceability.
Practical example / likely audience question
Audience question
Why not simplify RAIDT to one composite score if decision-makers usually want a single number?
Answer
The concern behind this question is understandable: organisations often need concise reporting, benchmarking, and dashboard indicators. A composite score can help with that, and RAIDT does not reject it. The problem arises when the composite is treated as the main governance result rather than as a secondary summary.
The direct answer is that a single score can hide meaningful governance weaknesses. Imagine a run that scores strongly on Responsibility, Auditability, Interpretability, and Dependability, but poorly on Traceability because the chain of prompts, model settings, and review steps cannot be reconstructed. Its average may still look respectable. Yet from a governance perspective, that run remains problematic because an important part of the evidence trail is weak.
RAIDT handles this better than a generic AI governance approach because it ties the score to run-level evidence. Instead of saying only that a system is ?moderately governed?, RAIDT shows where the governance condition is strong, where it is weak, and what evidence underpins that judgement. That makes remediation more precise, contestation more credible, and audit preparation more realistic.
Practical example in RAIDT terms
Consider a hospital using a generative AI system to draft discharge summaries for clinicians. One specific run involves a configured prompt template, a named model version, patient-context constraints, staff review procedures, and a timestamped output. The run-level issue is that the generated summary is clinically plausible and the reviewer signs it off, but the underlying prompt revision and approval route were not captured consistently.
The evidence needed includes the exact prompt template used, model and parameter settings, reviewer identity and sign-off, workflow logs, exception notes, and the version history of the template. In RAIDT terms, Dependability may be relatively strong because outputs are stable and reviewed, while Traceability and Auditability may be weaker because the reconstruction trail is incomplete.
If the organisation relied only on a composite score, the run might appear broadly acceptable. The five-pillar profile, however, would show that governance readiness is uneven. That makes the next action clear: improve logging and reconstruction controls rather than assuming the run is fully ready because the average score is adequate. The item therefore strengthens readiness by exposing where evidence-pack improvements are actually required.
Detailed link to RAIDT
Composite vs profile links to RAIDT in four ways.
First, it supports RAIDT?s core idea that generative AI governance should be based on structured evidence rather than broad claims of compliance or responsibility.
Second, it connects directly to the run because the shape of the profile is generated from evidence about one configured use in one real context.
Third, it links to the evidence pack and score profile because the profile is the interpretable surface through which evidence is reviewed, while the composite is only a condensed summary.
Fourth, it strengthens reviewability, contestability, audit readiness, and organisational learning by making weak pillars visible instead of averaging them away.
Composite vs profile ? Run-level evidence ? Evidence pack ? RAIDT score profile ? Governance readiness
This chain matters because RAIDT is not trying merely to score AI systems; it is trying to make governance judgements inspectable, reconstructable, and actionable at the level where real organisational use occurs.
Link to the five RAIDT pillars
Responsibility
A profile makes it easier to see whether responsibility arrangements are genuinely defined or merely assumed. A composite score can obscure a weak assignment of ownership if other pillars score well.
Example evidence / implication:
- Named role ownership for the run, including approval and escalation responsibility.
- Evidence that responsibility for prompt design, review, and exception handling is explicitly allocated.
Auditability
This item is strongly linked to Auditability because audit depends on being able to inspect the basis of a score, not just the score itself. A profile reveals which audit-relevant dimensions require scrutiny.
Example evidence / implication:
- Scoring notes that explain why one pillar is lower than others.
- Documentation showing how reviewers derived both the five-pillar profile and any composite value.
Interpretability
The profile supports interpretability by making the internal logic of the governance judgement easier to understand. It shows what the assessment means rather than hiding that meaning inside one number.
Example evidence / implication:
- Pillar-level explanations describing why a run is strong in some areas and weak in others.
- Narrative interpretation accompanying the score profile for reviewers, managers, or auditors.
Dependability
Dependability should remain visible as one pillar among five rather than being treated as the whole governance story. A run can be operationally stable without being fully governable.
Example evidence / implication:
- Evidence of output consistency, testing, or review reliability for the run.
- Clear separation between dependable performance and other governance qualities that still need improvement.
Traceability
This item is also strongly linked to Traceability because weak reconstruction capacity is exactly the kind of issue a composite may hide. The profile preserves visibility of the evidence chain.
Example evidence / implication:
- Prompt, model, parameter, and workflow records that permit later reconstruction of the run.
- Explicit indication when trace records are incomplete even if other pillars are comparatively strong.
If this item has strongest effects anywhere, they are on Auditability, Interpretability, and Traceability, because profile-first scoring preserves the structure needed for review, explanation, and reconstruction.
Why this item is more than a generic concept
In general AI governance, composite versus profile may refer simply to a choice between an aggregate metric and a dashboard of sub-metrics. In RAIDT, the distinction is more specific and more operational. It refers to how governance readiness for a particular run should be represented when the underlying evidence is organised around five named pillars.
The RAIDT meaning is more operational because the profile is not just an analytics preference. It is tied to run-level evidence, evidence-pack structure, reviewer reconstruction, and the practical need to identify where governance weakness actually sits. In RAIDT, the profile is therefore part of the method, not merely part of the presentation.
Common misunderstanding
Misunderstanding
If a composite score is high enough, the run is governance-ready even if one pillar is noticeably weak.
Correction
That is incorrect because governance failure often emerges through the weakest pillar rather than the average. For example, a customer-service GenAI run may be well owned, stable, and understandable, yet still lack a reliable trace of prompt versions and approval steps. Its composite might look comfortable, but the run would remain difficult to reconstruct during an incident review. RAIDT corrects this by treating the profile as the main result and the composite as secondary.
Boundary and limitation
This item does not mean that composite scores are useless. A composite can still be helpful for trend tracking, high-level reporting, or coarse comparison across runs. Nor does the profile by itself prove that governance is adequate; it only makes the pattern of assessed readiness more visible.
The concept may also fail if pillar scores are poorly calibrated, weakly evidenced, or interpreted without context. A five-pillar profile is only as good as the evidence and scoring discipline behind it. RAIDT handles this limitation by connecting profile interpretation to evidence packs, scoring anchors, repeat review, and evidence-based scoring rather than treating any visual profile as self-justifying.
Implementation levels
Manual implementation
A researcher or small team can apply this manually by scoring each pillar separately for a run, recording short justifications, and only then calculating a composite as an optional summary. In manual use, the key discipline is to discuss the pattern before quoting the average.
Semi-automated implementation
Semi-automated implementation can use templates, metadata forms, and structured review sheets that force reviewers to enter pillar-level evidence before any aggregate score is displayed. Dashboards can then present both outputs while visually prioritising the profile.
Fully automated implementation
At scale, a wrapper, governance platform, orchestration layer, or logging pipeline can generate pillar-level evidence views automatically from run metadata, reviewer inputs, and workflow traces. The system can compute a composite for summary reporting, but should preserve the profile as the default governance display and flag materially uneven pillar patterns for escalation.
Practical use in the RAIDT project
This item is useful across the RAIDT project because it explains why the framework reports a five-pillar governance pattern rather than only a single readiness score. In Paper 08 Foundations, it helps justify the methodological choice to preserve internal structure in the scoring model. In Paper 09 Empirical Validation, it supports analysis of whether reviewers find profile-first scoring more informative and actionable than a single index.
In Paper 10 Policy Pathways, the concept helps explain to policymakers and organisational leaders why evidence-based governance should not collapse into a single compliance-like headline value. It also supports sector playbooks by showing practitioners how to identify which pillar requires intervention in healthcare, finance, education, public services, or enterprise settings.
For the evidence pack and scoring rubric, this item clarifies presentation logic. For viva defence and supervisor explanation, it provides a strong answer to the challenge that ?decision-makers only want one score?. RAIDT?s position is that one score may be reported, but five-pillar structure must remain available if governance is to be reviewable and defensible.
Key audience questions to prepare for
Q1. Why does RAIDT not treat the composite as the primary output?
Because the primary governance need is diagnostic visibility. The composite compresses information, whereas the profile preserves where governance strength and weakness actually lie.
Q2. Does keeping a profile make RAIDT harder for non-experts to use?
It adds some complexity, but that complexity reflects the real structure of governance. RAIDT manages this by allowing a composite for summary communication while retaining the profile for serious review.
Q3. Can two runs have the same composite score but different governance implications?
Yes. One run may be balanced across all five pillars, while another may combine very high scores in some pillars with a serious weakness in one. RAIDT treats those as meaningfully different because their governance risk is different.
Q4. Is the profile only useful for auditors?
No. It is also useful for managers, designers, reviewers, and policy teams because it points directly to where governance improvement is needed rather than merely indicating that improvement is needed somewhere.
Q5. How does this concept support organisational learning?
Over time, profile patterns across repeated runs can show where an organisation systematically underperforms, such as traceability gaps or interpretability weaknesses. That supports targeted process improvement rather than vague calls to ?improve governance?.
Suggested citation concepts to support this item
- composite indicators and governance measurement
- multidimensional assessment versus aggregate scoring
- dashboard metrics versus single-index reporting
- weak-link effects in organisational risk assessment
- evidence-based AI governance measurement
- interpretable scoring frameworks for responsible AI
- auditability and traceability in generative AI governance
- run-level evaluation of AI system use
- governance profiles for socio-technical systems
- trade-offs in responsible AI assessment frameworks
Short explanation for presentation
Composite vs profile explains why RAIDT does not reduce generative AI governance to one number. RAIDT may calculate a composite score across the five pillars, but it treats the five-pillar profile as the main result because governance weaknesses are often pillar-specific. A run can look acceptable on average while still being weak in traceability, auditability, or interpretability. By keeping the profile visible, RAIDT preserves the structure of governance judgement and ties it back to run-level evidence in the evidence pack. That makes the framework more useful for diagnosis, contestation, audit readiness, and organisational learning. In short, the composite helps summarise, but the profile helps govern.
One-line takeaway
Composite vs profile is the distinction between a summary score and a five-pillar governance pattern because RAIDT needs run-level evidence to remain diagnosable, reviewable, and actionable.
Related items in RAIDT pillars and scoring
Anchored questions
- Q059: Why does RAIDT keep interpretability, dependability, and traceability separate in the score profile?
- Q062: Why keep a score profile instead of relying on one composite?
- Q142: Why keep a five-pillar profile rather than a single score?
- Q240: Composite vs profile ? definition, example, and why it matters in RAIDT