Understanding the CSDS#
Last modified: 20 Mar 2026
1. Introduction#
The CSDS comprises clinical and operational data for monitoring local and national service use and outcomes, reporting for commissioning, and to help address health inequalities. The CSDS is not used for direct patient care. Data are collected from publicly-funded community services in England such as health centres, day care facilities, schools, mobile facilities and patients’ homes.
2. Strengths of CSDS#
Although not designed for research, benefits of using the CSDS for research include:
National consistency – standardised data collection enables comparisons across all publicly-funded community health services and all regions of England.
Comprehensive coverage of community care ensures the dataset provides a full picture of community service activity.
The CSDS includes individual-level diagnostic, personal circumstance, screening and care event data, and can be used to supplement data recorded in primary and secondary care.
3. Limitations of CSDS#
Data completeness and consistency varies between, and within, organisations. Inconsistencies include missing data, variations in coding, and inconsistent adoption of national standards. Information about the quality of data submitted to the CSDS is available on the CSDS Data Quality Dashboard.
Published statistics from CSDS are labelled ‘experimental’ as they are still being evaluated, which means that the dataset may not yet be truly reliable.
Not all service providers submit data consistently, or submit complete returns, affecting representativeness and limiting comparative analyses.
The CSDS has a complex multi-table structure, with variables spread across topic-specific tables such as ‘care activities’, ‘referrals’, etc. This is described more fully below.
As a secondary uses dataset, the data in the CSDS has been ingested from multiple systems not necessarily designed for detailed analysis. Some clinically relevant data may not be captured with data quality dependent on local documentation practices.
4. Scope and coverage#
The CSDS is a long-standing dataset which re-uses clinical and operational data for purposes other than direct patient care. It comprises data about care events occurring in community settings such as health centres, day care facilities, schools, mobile facilities and patients’ own homes.
The CSDS includes patient-level data about adults and children including:
personal and demographic information
social and personal circumstances
breastfeeding and nutrition
screening and assessments
diagnoses, including long-term conditions and disabilities
weight management services
These are spread across 9 tables, each focusing on an aspect of community care.
5. Data collection methodology#
No data are collected specifically for the CSDS.
The dataset re-uses clinical and operational data extracted from electronic records held by providers such as community health trusts, mental health trusts, local authorities, and the independent sector. Providers are required to extract a structured set of data items and definitions from their Electronic Health Records (EHRs) or clinical systems according to the national CSDS specifications. Data are collated and processed centrally by NHS England.
6. Structure of the dataset#
The CSDS is comprised of multiple tables (i.e. individual datasets) which flow from NHS England to UK LLC grouped into 9 themed tables:
care activities
care plans
coded scored assessments
demographics and referral
diagnoses
group sessions
immunisations
onward referrals
rrt (referrals to treatment)
See ‘Availability in the UK LLC TRE’ below for more detail. All 9 tables contain multiple lines per participant.
7. Coding systems used#
The CSDS uses four coding systems across the 9 tables: ICD-10, SNOMED CT, National and Read Codes.
Understanding which tables use which codes helps in identifying the appropriate coding system when working with the data. Table 1 shows some of the key coded variables and the coding system(s) used in each.
The variables are grouped into four main categories to make the table easier to interpret. The categories and their descriptions are as follows.
Care / Contact Activity: variables used to identify information about clinical activities, procedures, or observations captured during care contact.
Coded Assessment Tool: variables used to identify information about clinical assessment tools used to measure patient outcomes.
Patient Demographics: variables used to identify lifestyle or social factors captured during assessments.
Clinical Terminology / Diagnosis: variables used to identify diagnosis codes used to record clinical information about the health-related conditions of patients.
Table 1: Key variables and the coding systems used
Table name (nhsd.CSDS_) |
Variable name |
Variable label |
Coding system |
Category |
|---|---|---|---|---|
care_activities_v0003 |
codedfinding |
Coded clinical entry for a finding |
ICD-10, Read, SNOMED |
Clinical Terminology / Diagnosis |
care_activities_v0003 |
codedobservation |
Clinical terminology code for an observable entity |
ICD-10, Read, SNOMED |
Clinical Terminology / Diagnosis |
care_activities_v0003 |
codedprocedure |
Clinical terminology code for a procedure |
Read, SNOMED |
Clinical Terminology / Diagnosis |
care_plans_v0003 |
plan_type |
Care plan type |
National |
Care / Contact Activity |
demographics_and_referral_v0003 |
employmentstatus |
Current employment status of a person |
National |
Patient Demographics |
coded_scored_assessments_v0003 |
snomed_id |
SNOMED CT concept ID used to identify an assessment |
SNOMED |
Coded Assessment Tool |
demographics_and_referral_v0003 |
accommstatus |
Type of accommodation that a patient currently lives in |
National |
Patient Demographics |
demographics_and_referral_v0003 |
carecontact_attended_count |
Cumulative count of the number of care contacts that were attended |
n/a |
Care / Contact Activity |
immunisations_v003 |
childhoodimmunisation_type |
Type of childhood immunisation |
National |
Clinical Terminology / Diagnosis |
onward_referrals_v0003 |
onward_referralreason |
Reason the patient was referred to another service |
National |
Care / Contact Activity |
Further information about coding systems can be found in Guidebook’s coded variables guide.
8. Evolution of the dataset#
The CSDS has been collected since 2015, with the most recent version of the dataset (v1.6) introduced in January 2023. The uplift to v1.6 introduced the several changes to the dataset including:
Demographics and referral table: a new variable ethniccategory2021 was added for voluntary inclusion once the value list is available.
Care activities table: the variable attendornot was renamed to attendancestatus to align with other NHS England datasets. Both the old and new versions of the variable exist in the CSDS in the UK LLC TRE.
9. Availability in the UK LLC TRE#
The UK LLC TRE currently holds an extract of the CSDS from 2015 onwards organised into 9 linked tables. An overview of these tables is provided in Table 2 below.
The UK LLC TRE includes CSDS records for participants in partner LPS and only where individual or LPS permissions allow linkage to NHS data. UK LLC does not hold any information about people who are not part of a partner LPS or about LPS participants who have requested that their NHS data not be shared via UK LLC.
Table 2: CSDS tables available in the UK LLC TRE
Table name (nhsd.CSDS_)1 |
Summary of contents |
No. variables |
|---|---|---|
care_activities |
Appointments, referrals, patient observations and |
98 |
care_plans |
Type and timings of care plans |
17 |
coded_scored_assessments |
Type of assessment, age of participant |
19 |
demographics_and_referral |
Demographics, personal circumstances, summary of |
153 |
diagnoses |
Dates and clinical codes of diagnoses |
16 |
group_sessions |
Dates and types of group sessions |
19 |
immunisations |
Dates and clinical codes of immunisations |
21 |
onward_referrals |
Dates and reasons for referrals |
11 |
rrt |
Metrics on patient age groups, referral dates and waiting times |
28 |
1 The number of LPS participants included in each table in the TRE is summarised on the Guidebook CSDS page.
10. Missing information#
Variable and value labels
UK LLC is infilling missing variable and value labels in the NHSE datasets in the TRE. Where variable labels have been added by UK LLC, rather than being found in NHSE documentation, this is made apparent by the phrase ‘label added by UK LLC’ being included in the variable label.Missing data
The amount of missing data varies widely between variables and across datasets. Throughout 2026, we will update this section with information about missingness in the CSDS.
11. Tips for researchers using the CSDS in the UK LLC TRE#
Descriptions of the variables included in the dataset can be downloaded from the Enhanced Technical Output Specification available from NHS England.
Use the individual identifer cohortkey_e to link participants across CSDS tables.
Throughout the CSDS tables, variables suffixed x_scheme indicate which coding system is used in the associated variable x (e.g. diagnosis, procedure)
We will regularly update these tips as we gather more information.
12. Useful syntax#
Below we will include syntax that may be helpful to other researchers in the UK LLC TRE. For longer scripts, we will include a snippet of the code plus a link to the UK LLC Github repository where you can find the full scripts.
13. Further reading#
The interactive CSDS Data Quality Dashboard is available online: CSDS Data Quality Dashboard