The capabilities approach
Sen, Nussbaum
Wellbeing is the freedom to do and be what one has reason to value, and that freedom needs specific functional capacities, not abstract entitlements.
AI System Psychology is the discipline that measures an AI system as a behavioural subject and protects human agency across the full life of a deployment. With Prof. Leon De Beer, I am defining its scope of practice.

Before I argue for a new discipline, look at four findings from the last two years. Each one is a measurable loss of a human capacity that AI was supposed to support, and not one of them was read by a psychologist tasked with reading it.
Education
High-school students who practised mathematics with an unguarded AI tutor scored 17 percentage points below their peers once the tool was taken away. The capacity was borrowed, and the loan came due.
Bastani et al., 2025, PNAS
Clinical
Endoscopists' unaided detection rate fell from 28.4 percent to 22.4 percent after routine exposure to AI-assisted colonoscopy. That is the kind of drop that triggers an urgent safety review in any other clinical adjunct.
Budzyń et al., 2025, Lancet Gastroenterology and Hepatology
Diagnostic
Physicians who correctly judged when to trust the AI reached substantially higher diagnostic accuracy than those who did not, on the same task with the same AI output. The controllable variable lives on the human side.
Sakamoto et al., 2024, JMIR Formative Research
Social
Across 29 mental-health chatbots evaluated against the Columbia Suicide Severity Rating Scale, none met the adequate-response threshold. Emotional dependence on commercial chatbots is now a documented harm.
Pichowicz et al., 2025; Laestadius et al., 2024
In plain terms, I do two jobs at once: I study the AI as a subject whose behaviour can be measured, and I protect the people on the receiving end of it. AI System Psychology works both sides of the human-AI relation at the same time. Each side on its own is useful research. Neither side on its own finishes the job the discipline is being asked to do.
The boundary
I claim that these systems show stable, measurable behavioural regularities. I do not claim they have minds, beliefs, or experience. The position is methodological, and its boundary is explicit.
I am not inventing this from nothing. Several adjacent fields each cover a slice of what AI System Psychology must do. I show what each one contributes and where it stops, because the gap between them is the discipline.
The integration
AI System Psychology
Unit of analysis: AI behaviour and the humans interacting with it, across the lifecycle
Six-stage applied scope of practiceThe work of an AI System Psychologist is organised around six lifecycle stages, each with its own inputs, activities, outputs, and a success criterion you can hold me to. Open any stage to see what the work is.
design phase
The conceptual design phase, before any code is written.
Behavioural targets are measurable and falsifiable, and agency-preservation choices are explicit in the architecture rather than retrofitted.
I translate complex psychological processes into computational sub-tasks before engineering decisions foreclose what the system can become. After build, those decisions are hard to recover.
training phase
validation phase
rollout phase
operations phase
sunset phase
Four cross-cutting dimensions surface in every stage
The AI as a behavioural entity with measurable traits and dispositions.
The composed deployment artefact: orchestration, digital twins, multi-agent structure.
Ethics, regulation, and AI-IARA capacity protection.
Workforce, deployment context, and change management.
The six stages are sequential, but in production they are not strictly linear. They overlap, iterate, and feed back into one another as systems are revised, retrained, and redeployed.
Everything in the lifecycle is held together by one framework. AI-IARA names human agency as six capacities. Each one is trainable. Each one is erodable through routine AI use. And each one is a construct my psychometric toolkit can measure, at the level of an individual and a population.
The theoretical lineage
These capacities are not philosophical primitives. They are constructs anchored in established psychology.
Sen, Nussbaum
Wellbeing is the freedom to do and be what one has reason to value, and that freedom needs specific functional capacities, not abstract entitlements.
Bandura
Agency is the exercise of intentional influence over one's own functioning and life circumstances. This is what separates agentic capacity from passive disposition.
Ryan and Deci
Autonomy, competence, and relatedness are basic psychological needs whose support or thwarting is consequential for wellbeing.
Agency Debt is my one-number answer to a plain question: how much of a person's own capability has quietly eroded while they leaned on the AI, and for whom. If you measure one thing about a deployed AI system, measure this. Formally, it is the cumulative shortfall in the six capacities against a pre-exposure baseline, measured at fixed points in time and floored at zero so a recovery in one place never hides a loss in another. I report it three ways on purpose, because any single number conceals part of the picture a governance body needs.
Current capacity on a 0 to 100 scale across the six AI-IARA strata
Low debt
Pre-exposure baseline. Nothing has been borrowed yet.
The share of users with at least one capacity below the clinical threshold. An average can stay green while this tail accumulates.
Quantity 1
For each capacity c, the average gap between where a user started (baseline B) and where they are now (current C), with recovery floored at zero. This is the FGT poverty-gap measure (the same maths economists use to size how far the poor fall below a poverty line) moved from income to capacity space.
It preserves which capacity is degrading, which is the information a clinician needs to target an intervention.
Quantity 2
One weighted summary figure for governance reporting and cross-deployment comparison. The weights are derived by expert consensus or by regressing functional outcomes on capacity scores, not assumed equal.
It deliberately loses information for the sake of comparability, so the six-vector must stay available beside it.
Quantity 3
The proportion of users with at least one capacity below a clinical threshold. This is the union-deprivation headcount (the share of people who fall short on at least one dimension) from multidimensional poverty measurement.
The compound index is an average, so it can stay green while a clinically significant tail accumulates. The same logic drives pharmacovigilance to track adverse events, not means alone.
A scope of practice is empty without a competency model. The AI System Psychologist is a T-shape: broad working literacy across the AI technical stack, crossed with three deep specialty pillars, each mapped to the lifecycle stages where it does the most work.
Breadth
Enough depth to specify requirements, evaluate outputs, and intervene credibly at the design table. Not equivalence with a machine-learning engineer.
The value sits in the intersection of breadth and depth, not in either alone. Deep psychometrics without technical breadth produces a researcher who cannot sit at the design table. Technical breadth without psychometric depth produces another safety engineer.
I would rather anticipate the strongest critiques than wait for them. Here are five, including the one I find most serious, and the terms on which I answer each.
“We are the last generation of psychologists who will have studied humans whose minds were not yet substantially formed under conditions of routine AI exposure. The difference is measurable. The gap is intervenable. And the responsibility is psychology's, whether the field accepts it or not.”

Prof. Llewellyn E. van Zyl (Ph.D)
Chief Solutions Architect, Psynalytics
Cornerstone Hub
The buyer-facing hub. What an AI assessment audit produces, with a worked example through the Validity Stack.
Cornerstone Hub
The longitudinal application. A continuously-updated computational model of a person, audited the way an assessment is.
IPPA AI Summit, 2026
Why good intentions are not enough when AI systems measure and influence human behaviour. A call for psychological product safety standards.
6-12 week engagement
Build and assure AI systems that measure or influence human outcomes, with construct clarity, harm analysis, and lifecycle monitoring.
Article
AI therapy bots are moving from beta product to clinical product without the validation any other clinical tool would require. The class-action wave is twelve months out. Here is what an AI-IARA audit catches before it lands.
Article
Construct drift is the gradual shift in what an AI assessment is actually measuring after deployment, even when the model weights are frozen. It is the most expensive failure mode in deployed people-impact AI, and almost no one is watching for it.
AI System Psychology is a foundational proposal, co-authored with Prof. Leon De Beer. Read the AI-IARA paper for the full argument, run the AI-IARA audit on your own system in about fifteen minutes, or contact me to discuss a deployment across the lifecycle.