Understanding the IAPT dataset#
Last modified: 13 Mar 2026
1. Introduction#
The IAPT dataset contains information about adults in England referred to the NHS Talking Therapies for Anxiety and Depression service. This service offers psychological therapy for anxiety and/or depression as well as other related mental health conditions (e.g. obsessive compulsive disorder, body dysmorphic disorder). It was launched in 2008 as Improving Access to Psychological Therapies (IAPT) and was rebranded in January 2023 though the associated dataset retains the IAPT name until its next formal revision. We therefore continue to refer to this dataset as IAPT.
The NHS Talking Therapies for Anxiety and Depression service offers evidence-based psychological treatments delivered using a stepped-care model. People with less severe depression and anxiety are typically offered a course of low-intensity treatment (e.g. guided self-help) delivered by a Psychological Wellbeing Practitioner over 6-8 sessions. Those who do not respond at this level can be stepped up to a course of high-intensity treatment. People with more severe depression and/or anxiety may receive high-intensity treatment from the outset. People with specific disorders (e.g. post-traumatic stress disorder or social anxiety) should also receive high-intensity treatment. High-intensity treatment (e.g. cognitive behavioural therapy) is delivered by an accredited therapist, involves more intensive clinician input and a greater number of sessions (e.g. 15-20). Following initial assessment some patients are not considered suitable for a course of treatment and will instead be signposted onto another service.
Employment advice and support is also provided as part of the service to help support the patient’s employment goals.
Central data collection has been in place for IAPT since 2012 by which time the service had been rolled out nationally across England.
2. Strengths of IAPT#
Although not designed for research, the IAPT dataset can be a useful research resource as its strengths include:
Comprehensive coverage – captures care pathway (e.g. referral, appointments, treatment type), sociodemographic information (e.g. age, gender, ethnicity, employment/benefit status), long term physical health conditions, and patient outcomes (e.g. recovery, improvement, deterioration).
Standardised and validated measures of depression, anxiety, and functional impairment. Specific thresholds are used for the symptom measures to calculate key outcomes such as reliable recovery and reliable improvement.
Standardised coding systems – for example SNOMED CT and ICD-10 codes.
Repeated assessment of symptoms – patients complete measures of symptoms at every session rather than only at the start and end of treatment. This design provides a last available score to calculate outcome metrics even if treatment ends earlier than expected. Outcome data has been obtained for more than 98% of all patients who have a course of treatment.
Large-scale naturalistic data about psychological treatment which enhances generalisability.
National consistency – standardised data collection enables comparison across services and geographical areas of England.
3. Limitations of IAPT#
Scope – limited to NHS-funded talking therapies for anxiety, depression and related conditions. Does not capture private mental health care, other NHS mental health services, or treatment outside of England.
Selection bias – includes only those people who are referred (by GP, other professional) or who self-refer. Excludes people whose problems are not recognised by referring professionals, those unaware of the service, lacking motivation or capacity to self-refer or seek help, or facing other barriers to access including perceived stigma. Treatment access barriers affect some sociodemographic groups disproportionately.
Missing data – while outcome data completion is high, incomplete date may arise due to patient distress or objection to its collection, language or literacy barriers, perceived administrative burden, or data collection errors.
Self-report data – many items, including concurrent medication use and long term conditions, rely on self-reported information without clinical verification.
Absence of formal diagnoses – patients are not formally diagnosed so they may not meet clinical criteria for their treated condition or may have unrecognised comorbidities.
4. Scope and coverage#
Population: The IAPT dataset includes adults referred for NHS psychological therapies for anxiety, depression and related conditions in England. A formal diagnosis is not required and therefore the dataset includes patients who may or may not meet clinical criteria for their treated condition. The IAPT dataset includes some people under the age of 18 who access adult services. It does not include activity related to the provision of Children and Young People’s (CYP) IAPT services which is included in MHSDS. Please also see FAQs for information about the differences between IAPT and MHSDS.
Included conditions: Anxiety, depression and other related conditions such as post-traumatic stress disorder, obsessive-compulsive disorder and body dysmorphic disorder. These conditions may occur in the context of comorbid long-term physical health conditions or symptoms such as irritable bowl syndrome, as well as some drug or alcohol use.
Excluded conditions: The service is not designed to treat other conditions including psychosis, bipolar disorder, ‘personality disorders’ or severe eating disorders (MHSDS captures these). However, individuals with these conditions may be accepted if their primary presenting problem is an included condition and they have no active symptoms that would interfere with treatment.
Referral routes: Patients can self-refer to the service or be referred by a health professional or voluntary organisation.
Data collected:
The IAPT dataset contains patient-level information about:
Sociodemogrphics including age, gender, ethnicity, language, religious affiliation
Area level deprivation, employment status, benefit status
Clinical presentation - problem descriptor (informal diagnosis) , long term physical health conditions, disabilities, medically unexplained symptoms, use of medication for mental health problems
Care pathway - referral source and dates, appointment dates/times/type, waiting time pauses, attendance
Treatment - therapy type (e.g. cognitive behavioural therapy), internet enabled therapies, professional qualification of the care provider
Clinical measures - symptom and impairment scpres from each appointment, and derived outcomes such as recovery
Coverage of specific items is dependant on whether the item is mandatory or optional in the dataset specification, and when the item was introduced (See Evolution of the dataset below).
5. Data collection methodology#
Patients complete self-report measures of their symptoms at every session, typically before or at the beginning. Depression is measured using the Patient Health Questionnaire-9 (PHQ-9). Anxiety is measured with the Generalised Anxiety Disorder 7-item assessment (GAD-7) or an anxiety disorder specific measure where available and appropriate for the patient’s problem descriptor. The PHQ-9 and GAD-7 assess the frequency of symptoms over the previous two weeks. Patients also complete the Work and Social Adjustment Scale to measure how their anxiety and depression symptoms impair their social, work and private life functioning. At the end of assessment and of treatment patients complete patient experience questionnaires (PEQs) about the quality of their care.
NHS England collects data for the IAPT dataset through monthly submissions from service providers via the Strategic Data Collection Service (SDCS) Cloud. Providers send data on referrals (received, seen or closed), contacts, and patient outcomes for that reporting period. NHS England carries out validation checks to ensure data quality, pseudonymisation to protect patient confidentiality, and linkage to previous months’ submissions.
6. Structure of the dataset#
The IAPT dataset is comprised of several tables (i.e. individual datasets). These constituent tables flow from NHS England to UK LLC grouped into 9 themed tables capturing patient demographics and referral, care activities, care cluster, scored assessments, employment status, internet enabled therapies, mental and physical health conditions, onward referrals and waiting time pauses. See Availability in the UK LLC TRE below for more detail.
7. Coding systems used#
The IAPT dataset uses ICD-10 and SNOMED CT codes according to the NHS Data Model and Dictionary. Understanding which tables use ICD-10, SNOMED CT, or both, helps in identifying the appropriate coding system when working with the data. Table 1 summarises the key variables and presents the percentage of records coded using each coding system. Coding system I denotes ICD-10, and Coding system II denotes SNOMED CT. Only variables with coding system usage greater than 10% in at least one coding system are included in the table. The variables are grouped into four main categories to make the table easier to interpret. The categories and their descriptions are as follows.
Care / Contact Activity#
Variables used to identify information about clinical activities, procedures, or observations captured during care contact.
Coded Assessment Tool#
Variables used to identify information about clinical assessment tools used to measure patient outcomes.
Patient Demographics#
Variables used to identify lifestyle or social factors captured during assessments.
Clinical Terminology / Diagnosis#
Variables used to identify diagnosis codes used to record clinical information about the health-related conditions of patients.
Table 1 Key variables and the coding systems used with percentage of records codes
Table_name |
Column_name |
Coding_system_1 |
Coding_system_2 |
%_coding_system_1 |
%_coding_system_2 |
Category |
|---|---|---|---|---|---|---|
nhsd.IAPT_care_activities_v0003 |
codeprocandprocstatus |
NO |
SNOMED CT |
0 |
100 |
Care/Contact Activity |
nhsd.IAPT_care_activities_v0003 |
codeobs |
NO |
SNOMED CT |
0 |
100 |
Care/Contact Activity |
nhsd.IAPT_coded_scored_assessments_v0003 |
codedasstooltype |
NO |
SNOMED CT |
0 |
99.7 |
Coded Assessment Tool |
nhsd.IAPT_demographics_and_referral_v0003 |
religion_socpercircumstance |
NO |
SNOMED CT |
0 |
100 |
Patient Demographics |
nhsd.IAPT_demographics_and_referral_v0003 |
sexorien_socpercircumstance |
NO |
SNOMED CT |
0 |
100 |
Patient Demographics |
nhsd.IAPT_demographics_and_referral_v0003 |
smokingstatus_socpercircumstance |
NO |
SNOMED CT |
0 |
73 |
Patient Demographics |
nhsd.IAPT_demographics_and_referral_v0003 |
asylumstatus_socpercircumstance |
NO |
SNOMED CT |
0 |
100 |
Patient Demographics |
nhsd.IAPT_demographics_and_referral_v0003 |
alcoholusestatus_socpercircumstance |
NO |
SNOMED CT |
0 |
100 |
Patient Demographics |
nhsd.IAPT_demographics_and_referral_v0003 |
drugusestatus_socpercircumstance |
NO |
SNOMED CT |
0 |
100 |
Patient Demographics |
nhsd.IAPT_demographics_and_referral_v0003 |
gamblingstatus_socpercircumstance |
NO |
SNOMED CT |
0 |
100 |
Patient Demographics |
nhsd.IAPT_demographics_and_referral_v0003 |
other_socpercircumstance |
NO |
SNOMED CT |
0 |
98.6 |
Patient Demographics |
nhsd.IAPT_mental_and_physical_health_conditions_v0003 |
menphyshealthcondition |
ICD-10 |
SNOMED CT |
43.6 |
33.8 |
Clinical Terminology/Diagnosis |
nhsd.IAPT_mental_and_physical_health_conditions_v0003 |
menphyshealthconddescription_icd10_master |
ICD-10 |
SNOMED CT |
54.7 |
41.8 |
Clinical Terminology/Diagnosis |
For SNOMED CT lookups and definitions see SNOMED International – service migration.
Further information about coding systems can be found in the coded variables guide.
8. Evolution of the dataset#
The IAPT dataset has been collected since 2012 and has evolved through incremental changes over time (summarised in Table 2). These have included: mandating previously optional items, introducing or discontinuing variables, alterations to response options, incorporating pilot datsets, and aligning with NHS-wide standards such as SNOMED CT coding and the NHS data model and dictionary.
The current version of the IAPT dataset is v2.1 which was released 20 July 2021 and implemented in 2022.
Important: When working with IAPT data spanning multiple years, researchers should ensure they check for differences in variables availability, coding and response options. For full details of the changes across versions see NHS Engalnd’s Change Specification documents published by NHS England for current and previous versions.
Table 2 A summary of IAPT dataset versions since 2012.
Version |
Year released1 |
Year implemented |
Summary of major changes from previous version |
|---|---|---|---|
1.0 |
2012 |
2012 |
Initial version developed to support consistent data collection and reporting across IAPT services. It was mandated for central return from April 2012 |
1.5 |
2013 |
2014 |
Addition of waiting time pauses, time from patient opt-in to treatment and patient experience questionnaires (PEQs) for assessment and treatment, appointment purpose made mandatory, new code list for reason for end of treatment |
2.0 |
2019 |
2020 |
Major structural changes to align with other NHS datasets (e.g. MHSDS. See NHS data model and dictionary) including expanded use of clinical terminology (e.g. SNOMED CT). Added/updated variables relating to internet-enabled therapies, language, qualifications of care personnel, treatment location, and group treatment delivery. Changes to anxiety disorder specific measures. Incorporated the Employment Advisors Pilot Data Set2 and items from the Long Term Conditions and Medically Unexplained Symptoms Pilot Data Set3. Clarified scope regarding the inclusion of under-18s who access adult IAPT services |
2.1 |
2021 |
2022 |
Updated the recording of gender to align with MHSDS, addition of medical history (previous diagnosis), accommodation type information, and added/updated items relevant to employment support activity |
1 The year of release is the date specified in the guidance reports.
2 This pilot increased the number of Employment Advisors embedded in IAPT services. Collection of Employment Advisors in IAPT pilot data from those services involved in the pilot started in October 2017 and was submitted alongside the core IAPT dataset v1.5. Version 2.0 incorporated this into the main dataset.
3 This pilot integrated IAPT care into existing medical pathways and services for patients with long term conditions and/or medically unexplained symptoms. Collection of IAPT integrated services pilot data from those services involved in the pilot started in January 2017 and was submitted alongside the core IAPT dataset v1.5. Version 2.0 incorporated this into the main dataset.
9. Availability in the UK LLC TRE#
The UK LLC TRE currently holds an extract of the IAPT dataset from 2020 onwards organised into 9 linked tables. A detailed overview of these tables is provided below (Table 3) as well as a pictorial representation of the relationships between these tables (Figure 1).
The UK LLC TRE includes IAPT records for participants in partner LPS and only where individual or LPS permissions allow linkage to NHS data. UK LLC does not hold any information about people who are not part of a partner LPS or about LPS participants who have requested that their NHS data not be shared via UK LLC.
Figure 1 Relationships between the IAPT tables available in the UK LLC TRE.
Table 3 IAPT tables available in the UK LLC TRE, including summary and key variables.
Table name |
Summary of constituent tables |
Key variables |
|---|---|---|
IAPT demographics and referral |
Master patient index (IDS001): personal details of the patient |
Patient ID, Organisational ID, Gender, Ethnicity, Language, Age at start/end of reporting, Start/end date of GP registration, GP distance from home, Indices of deprivation (decile & quartile), Accommodation type, Disability code, Social and personal circumstances code (religion, sexual orientation, smoking status, alcohol status, asylum status, debt status, drug use status, gambling status, other), Date social and personal circumstance code recorded, Overseas visitor charging category, Date of referral, Source of the referral, Age at referral received/discharge, Onset date of mental health symptoms, Previous diagnosed condition indicator, Age at service discharge, Date of discharge, Reason for discharge, First and last recorded outcome scores for Anxiety Disorder Specific Measures, First and last recorded scores for GAD-7, First and last recorded scores for PHQ-9, Indicator of caseness, Indicator of recovery, Indictor of no change, Indicator of reliably deterioration, Indictor of reliable improvement, Count of contacts & treatment type (including CBT, self-help, therapy, employment support) |
IAPT employment status |
Employment status (IDS004): patient employment details |
Employment status, Date employment status recorded, Hours worked per week, Benefit receipt indicator, Sickness absence indicator, Statutory sick pay receipt indicator, Universal credit receipt indicator, Personal Independent Payment receipt indicator, Employment and Support Allowance receipt indicator, Date referred for/discharged from employment support |
IAPT internet enabled therapies |
Internet enabled therapy care professional activity log (IDS205): details of the activity for the care professional supporting internet enabled therapy for the patient |
Start/end date for the period of internet enabled therapy log, Time spent supporting the internet enabled therapy, Name of the internet enabled therapy programme delivered, Qualification/accreditation attained or planned to be attained by the care personnel, Date qualification awarded/expected to be awarded |
IAPT care activities |
Care contact (IDS201): details of any contacts with a patient which have taken place as part of a referral |
Date and time of contact, Appointment attended indicator, Duration of contact, Location of contact, Type of appointment (e.g. assessment, treatment, review etc), Method of appointment delivery (e.g. face-to-face, telephone, email), Type of therapy, Psychotropic medication indicator, Duration of a care activity, Coded procedure, Coded finding, Coded observation |
IAPT coded scored assessments |
Coded scored assessment (referral) (IDS606): details of scored assessments that are issued and completed as part of a service request, but do not take place at a specific contact |
Assessment code, Assessment score, Date and time of assessment, Age of patient when assessment completed |
IAPT care cluster |
Care cluster (IDS803): information about needs-based groupings |
Start/end date & time a patient is assigned to a care cluster, Assigned adult mental health care cluster code |
IAPT mental and physical health conditions |
Long term physical health condition (IDS602): details of any long term physical health conditions for a patient which are stated by the patient or recorded in medical notes |
Long-term physical health condition or presenting complaint code, Description of the long term physical health condition, Date presenting complaint recorded |
IAPT onward referrals |
Onward referral (IDS105): onward referrals to other services both within and outside of the organisation |
Date & time of referral, Reason for referral, Organisation patient is referred to |
IAPT waiting time pauses |
Waiting times pauses (IDS108): details of breaks in activity initiated by the patient with the intention of resuming at some point |
Start/end date of activity break, Reason for the activity break |
10. Missing information#
Variable and value labels
UK LLC is infilling missing variable and value labels in the NHSE datasets in the TRE. Where variable labels have been added by UK LLC, rather than being found in NHSE documentation, this is made apparent by the phrase ‘label added by UK LLC’ being included in the variable label.Missing data
The amount of missing data varies widely between variables and across datasets. Throughout 2026, we will update this section with information about missingness in the IAPT dataset.
11. Tips for researchers using IAPT in the UK LLC TRE#
Please also see the list of mental health FAQs
Descriptions of the variables included in the dataset can be downloaded from the Enhanced Technical Output Specification available from NHS England.
Linking between tables: Use the cohort key variable (cohortkey_e) to link patients across IAPT tables and use the pathway ID variable (pathwayid) to link pathways (i.e. distinct referrals/episodes of care for a patient) across IAPT tables.
Multiple referrals per patient: Some patients have been referred on more than one occasion and therefore have records for multiple referrals (each referral has its own unique pathway ID). It may be appropriate for your analysis to select one per patient (e.g. first referral, most recent referral, or meeting other specific criteria). To understand more and to read practical tips about how to de-duplicate data, please read our detailed Multiple rows per participant guide.
Defining a course of treatment: A patient is deemed to have had a ‘course’ of treatment if they had at least two clinical sessions before discharge. Patients with only one session or assessment-only contacts are not considered to have received a course of treatment. Inspecting the type of contact that occured in each session is therefore important for understanding a patient’s care pathway.
Interpreting problem descriptors: Problem descriptors in IAPT use ICD-10 diagnotic codes but do not represent formal diagnoses. They are used to assign patients to appropriate treatment protocols, as specified by NICE guidelines. A patient may not meet full clinical criteria for for the condition they are treated for and it is possible that they could meet diagnostic criteria for other conditions not captured by their problem descriptor.
Understanding outcome metrics: Key outcomes including recovery are calculated based on symptom scores at a patient’s first and final sessions using specific thresholds. Not all participants are eligible for all outcome calculations. For example, a patient who scores below the clinical thresholds on both the PHQ-9 and GAD-7 at baseline cannot be assessed for recovery as they were not in the ‘caseness’ range to begin with.
We will regularly update these tips as we gather more information.
12. Useful syntax#
Below we will include syntax that may be helpful to other researchers in the UK LLC TRE. For longer scripts, we will include a snippet of the code plus a link to the UK LLC Github repository where you can find the full scripts.
13. Further reading#
Information in this section will be added in due course.