Understanding: Index of Multiple Deprivation (IMD)#

Last modified: 18 Aug 2025

IMD, urban/rural classification and population estimates#

This file is intended to be linked to the health record dataset, CORE_NHSD_LSOA11. It adds geographical indicators to each encrypted small area geography (Lower Super Output Area for England and Wales, Data Zone for Scotland or Super Output Area for Northern Ireland). The indicators include measures of deprivation for all 4 nations, population estimates and urban-rural classification. The small area geographies are those developed using the 2011 census. The associated data for each geographical unit is the most up to date available data that uses the 2011 boundaries, this is due to not all of the indicators yet available using the new 2021 census boundaries.

Methodology#

Each of the 4 nations has its own versions of the Index of Multiple Deprivation, please see below for the individual statistical release information:

England: The English Indices of Deprivation (2019)

Wales: Welsh Index of Multiple Deprivation (2019)

Scotland: Scottish Index of Multiple Deprivation (2020)

Northern Ireland: Northern Ireland Multiple Deprivation Measures (2017)

England and Wales have a combined urban-rural classification, while Scotland and Northern Ireland have a separate classification system. The supporting documentation for each classification can be found below:

England and Wales: Rural Urban Clasification of Statistical Geographies, England and Wales (2021)

Scotland: Scottish Government Urban Rural Classification (2020)

Northern Ireland: Northern Ireland Statistis and Research Agency Urban-Rural Classification (2015)

Information on the quality and methdology information for mid-year population estimates in the UK including the strengths and limitations of the data has been developed by the Office of National Statistics.

Data Harmonisation:

Indices of deprivation:

To harmonise the Indices of Multiple Deprivation across England, Wales, Scotland and Northern Ireland, use the ‘imd2019har’ variable that is in the IMD datasets for all 4 nations. The variable uses the Geographic Data Service’s Harmonised Index of Multiple Deprivation 2019. The indicator is a simple (non-reweighted) combination of the English IMD 2019, Welsh IMD 2019, Northern Irish Multiple Deprivation Measure (MDM) 2017 and the Scottish IMD 2020. The index is presented as quintiles.

For more information, click here

Urban-Rural Classification:

To harmonise the urban-rural classifications across England, Wales, Scotland and Northern Ireland, use the ‘ur_harm_2021’ variable that is in the IMD datasets for all 4 nations. The variable is derived from the My Society’s UK Composite Rural Urban Classification. The indicator is a 3-fold metric that describes various small areas (LSOAs/Datazones/SOAs) on a rural/urbal scale that is comparable across the UK. For more information, click here.

Caveats:#

  • The associated data for each geographical unit is the most up to date available data that uses the 2011 boundaries, this is due to not all of the indicators yet available using the new 2021 census boundaries.

Limitations:#

  • Each of the 4 nations has its own versions of the Index of Multiple Deprivation and Urban-Rural classification in order to fit the local context and meet their local needs. As a result each nation’s data source for IMD and urban-rural classification will have its own set of biases. Please refer to the individual statistical release documents in the Methodology section of this page for the relevant limitations to each source of data.

Useful Syntax#

The UKLLC_nhse_lsoa11_dataset_mapping dataset can be linked to PLACE_IMD_XXX_v0000_YYYYMMDD to add in geographical indicator variables associated with encrypted LSOA11 (lsoa11cd_e) from health records

England, Wales, Scotland and Northern Ireland each have their own Index of Multiple Deprivation dataset.

To link UKLLC_nhse_lsoa11_dataset_mapping_v0000_YYYYMMDD with PLACE_IMD_england_v0000_YYYYMMDD:

  1. Retrieve data from database via helper syntax

  2. Link datasets on the lsoa11cd_e field. Example of STATA syntax linking to England:

* load NHSE lsoa11 dataset
use “S:\LLC_9999\data\stata_w_labs\UKLLC_nhse_lsoa11_dataset_mapping_v0000_YYYYMMDD.dta”, clear

* merge in IMD
merge m:1 lsoa11cd_e using “S:\LLC_9999\data\stata_w_labs\PLACE_IMD_england_v0000_YYYYMMDD.dta”

*drop geographical units not linked to any participant health record 
drop if _merge == 2

UKLLC_nhse_lsoa11_dataset_mapping is a long dataset typically with millions of rows, depending on size of data request. It is therefore recommended that you subset both or either of these datasets before linking/processing/saving. An example of this would be to select the quintile from the Index of Multiple Deprivation that you are going to use and keep these variables only. This will ensure the dataset size remains as manageable as possible. It is also advised to only link to the country/ countries that you require.