The Science of AI That Decides About People
AI psychology is the discipline that keeps AI systems valid, fair, and defensible when they make decisions about human beings. Prof. Llewellyn E. van Zyl (Ph.D) defines the field, and the AI-IARA framework operationalises it.

AI psychology is the science of designing, measuring, and assuring AI systems that make decisions about human beings. It treats people-impact AI as a measurement problem first and a technology problem second. The work asks four questions before a system is allowed near a hiring panel, a wellbeing programme, or a clinical workflow: does the system perceive the right context, interpret it correctly, act with proportionate authority, and preserve the user's autonomy and social agency. When any of those fail, the system harms people quietly and at scale. AI psychology names the failure modes, audits them, and turns the gaps into a defensible fix list.
AI psychology is not AI ethics, AI governance, or behavioural science. It is the measurement discipline behind all three when the decisions are about people.
AI-IARA. Six capacities every people-impact AI must demonstrate.
The AI-IARA framework (Awareness, Interpretation, Intention, Action, Relational Agency, Autonomy) is the canonical methodology this site teaches and applies. Each capacity surfaces a specific class of audit signal. Together they form the validity stack that determines whether an AI system that decides about people will hold up under scrutiny.
Five layers of audit evidence
Every AI psychology audit produces evidence at five layers. A system that passes one layer but fails another is not deployable. The layers are sequential, not optional.
Construct
Define the construct the system claims to measure or decide on, in language that an independent psychometrician can review. Wellbeing, engagement, fit, risk, and burnout are not interchangeable. The construct must be named, scoped, and tied to a published theoretical model. Without this layer the rest of the audit has nothing to anchor on.
Calibration
Cohort
Drift
Contestability
Common questions about AI psychology
The Authority Behind This Page
Every claim on this page is anchored in two or more independent proof types: peer-reviewed publications, third-party speaking engagements, formal standards, and named institutional roles.
Publications
Keynotes
Standards Cited
- AERA, APA, and NCME Standards for Educational and Psychological Testing
- ITC Guidelines on Psychological Testing
- EU AI Act, high-risk people-impact provisions
- ISO/IEC 42001 AI Management Systems
- NIST AI Risk Management Framework
Institutions
- Optentia Research Unit, North-West University
- Centre for Behavioural Engineering and Insight, University of Twente
- Frontiers in Psychology, Editorial Board
- Psynalytics (Chief Solutions Architect)
- Springer Nature, Editorial Affiliations
Related work and engagements
AI-Driven Assessments
Cornerstone Hub
The buyer-facing hub. What an AI assessment audit produces, with a worked example walking through all five Validity Stack layers.
Digital Twins for Wellbeing
Cornerstone Hub
The longitudinal application of AI psychology. A continuously-updated computational model of a person, audited the same way an assessment is.
The Malpractice of Good Intentions
IPPA AI Summit, 2026
Why good intentions are not enough when AI systems measure and influence human behaviour. A call for psychological product safety standards.
Psychologically Safe AI Infrastructure
6-12 week engagement
Build and assure AI systems that measure or influence human outcomes, with construct clarity, harm analysis, and lifecycle monitoring.
When AI Becomes Your Therapist: The Audit Nobody Is Running
Article
AI therapy bots are moving from beta product to clinical product without the validation any other clinical tool would require. The class-action wave is twelve months out. Here is what an AI-IARA audit catches before it lands.
Construct Drift: The Silent Failure Mode in Deployed AI Assessment
Article
Construct drift is the gradual shift in what an AI assessment is actually measuring after deployment, even when the model weights are frozen. It is the most expensive failure mode in deployed people-impact AI, and almost no one is watching for it.
Subscribe to the AI Psychology newsletter
Lessons from peer-reviewed publications, deployed audits, and live case studies on why most people-impact AI fails, and how to build systems that hold up under scrutiny.