2 Background

2.1 Overview

The section begins with an overview of the spatial methods for mapping mortality for small areas and describes some examples of applications from the UK and beyond looking at mortality for small subnational regions. There is a brief history of the Small Area Health Statistics Unit (SAHSU), who manage the mortality data used in the thesis and have developed and applied many of the spatial methods discussed.

This is followed by a history of separating total mortality into different causes of death and the epidemiologic transition theory.

The chapter finishes by exploring the picture of inequalities in the UK over the past few decades through to the present, focussing on class, income, geography, and deprivation.

2.2 Mapping mortality and disease at small areas

Many studies compare the prevalence of diseases or mortality in different subgroups of the population by dividing the population geographically into small areas. The number of cases, or number of deaths, in an area is likely to be small. This sparseness issue is even more pertinent when the population is further stratified by age group. When calculating rates of incidence from the observed data, there is an apparent variability between spatial units, which is often larger than the true differences in risk due to the noise in the data. To overcome these issues, we can use statistical smoothing techniques to obtain robust estimates of rates by sharing information between strata.

2.2.1 Disease mapping methods

In small-area studies, it is common to smooth data using models with explicit spatial dependence, which are designed to give more weight to nearby areas than those further away. There are three main categories for modelling spatial effects. First, we can treat space as a continuous surface using Gaussian processes or splines. Second, we can use hierarchical models for areal data, which make use of the spatial neighbourhood structure of the units. Third, we can again use hierarchical models for areal data but instead we can exploit a nested hierarchy of geographical units, for example between state, county and census tract in the US. Each of these methods, which can be used separately or in combination if the context of the problem allows, rely on assumptions which may make them more or less appropriate in different applications.

Space as a continuous process

In the context of disease mapping, events are usually aggregated to areas rather than assigned specific geographical coordinates. Wakefield and Elliott (1999) model aggregated counts as realisations of a Poisson process, in which the expected number of cases is calculated by integrating a continuous surface that generates the cases over the area of the spatial unit. The surface was a function of spatially-referenced covariates. Kelsall and Wakefield (2002) describe an alternative model, where the log-transformed risk surface is modelled by a Gaussian process, whose correlation (or “kernel”) function depends on distance.

Best et al. (2005) provide a review of the use of hierarchical models with spatial dependence for disease mapping. In particular, the authors focus on Bayesian estimation, and different classes of spatial prior distributions.

The first prior proposed for spatial effects \(\mathbf{S} = {S_1, ..., S_n}\) is the multivariate normal \[ \mathbf{S} \sim \mathcal{N}(\pmb{\mu}, \pmb{\Sigma}), \tag{2.1}\]

where \(\pmb{\mu}\) is the mean effect vector, \(\pmb{\Sigma} = \sigma^2 \pmb{\Omega}\) and \(\pmb{\Omega}\) is a symmetric, positive semi-definite matrix defining the correlation between spatial units. A common choice when specifying the structure of the correlation matrix is to assume a kernel function that decays with the distance between the centroids of the areas, so that places nearby in space share similar disease profiles. Note, this is mathematically equivalent to the practical implementation of a Gaussian process for a specified set of spatial locations, which uses a finite set of points. An example in Elliott et al. (2001b) chooses an exponential decay function to map cancer risk in northwest England. Kernel functions based on distance, however, do not allow the variability of the spatial surface to change with location. Paciorek and Schervish (2006) describe nonstationary extensions to common kernel choices in spatial settings with this property.

Space as discrete units

A more popular prior is the conditional autoregressive (CAR) prior, also known as a Gaussian Markov random field (GMRF), which was first introduced by Besag et al. (1991). These form a joint distribution as in Equation 2.1, but the covariance is usually defined instead in terms of the precision matrix \[ \mathbf{P} = \pmb{\Sigma}^{-1} = \tau(\mathbf{D} - \rho \mathbf{A}), \tag{2.2}\] where \(\tau\) controls the overall precision of the effects, \(\mathbf{A}\) is the spatial adjacency matrix formed by the small areas, \(\mathbf{D}\) is a diagonal matrix with entries equal to the number of neighbours for each spatial unit, and the autocorrelation parameter \(\rho\) describes the amount of correlation. This can be seen as tuning the degree of spatial dependence, where \(\rho = 0\) implies independence between areas, and \(\rho = 1\) full dependence. The case with \(\rho = 1\) is called the intrinsic conditional autoregressive (ICAR) model. There sometimes exists further overdispersion in the residuals that cannot be modelled by purely spatially-structured random effects. Besag et al. (1991) proposed the model (hereafter called BYM) \[ S_i = U_i + V_i, \tag{2.3}\] where \(U_i\) follow an ICAR distribution, and \(V_i\) are independent and identically distributed random effects. The addition of the spatially-unstructured component \(V\) accounts for any non-spatial heterogeneity.

Space as a nested hierarchy of geographies

The relationships between different levels of a hierarchy of geographical units are often incorporated into models as a nested hierarchy of random effects. These models account for when spatial units lie within common administrative boundaries. This is often a desirable property of the model for certain geographies, like states in the US, which are administrative. Policy is decided at these geographies, so there is reason to believe these boundaries may have a greater effect on health outcomes than spatial structure. Finucane et al. (2014) demonstrate how country-level blood pressure can be modelled by exploiting the hierarchy of global, super-region, region and country. Note, although these models group by geographical region, these models are not spatial as they do not contain any information on the relative position of the areas.

Of the two specifications that are spatial, either as a continuous process or discrete units, the Markov random field priors are often preferred for computational reasons, as we can exploit the sparseness of the adjacency matrix in our inference algorithms rather than computing the covariance between each pair of spatial units as in the general case of Equation 2.1. There are concerns, however, that the GMRF representation of space as an adjacency matrix, which was originally proposed for a regular lattice of pixels in image analysis (Besag et al., 1991), is reductive for more complicated spatial problems. Despite this, in an epidemiological context, Duncan et al. (2017) found the standard ICAR model with binary, first-order neighbour weights outperformed models with a variety of different weighting schemes, including matrix weights based on higher-order degrees of neighbours, distance between neighbours, and distance between covariate values.

In applications to disease mapping, spatial models are the natural choice when the disease exhibits a spatial pattern. This is the case for mortality from infectious diseases, particularly on short timescales like Covid-19 (Konstantinoudis et al., 2022). Nested hierarchies are a more suitable choice when administrative areas are meaningful and have an effect on the health outcomes of the population. For example, state-specific abortion laws in the USA could affect maternal mortality, and so a model should include an effect for each state.

Modelling variation beyond space

As computational power has improved, it has become feasible to model patterns over other features of the population, such as time period and age group. Trends over time can be modelled as linear through slopes, or using nonlinear effects which allow neighbouring time points to be alike, the simplest of which is a first-order Gaussian random walk process. All-cause mortality varies smoothly over ages, following a characteristic J-shape with higher mortality in the infant and older age groups (Preston et al., 2001), and therefore can be modelled using a nonlinear process such as a random walk.

Difficulties arise when considering interactions between the space, age, and time variables. One can imagine situations in which different spatial units will have different age patterns in disease rates. For example, if certain age groups were vaccinated against disease in that spatial unit before others. There are also social or behavioural risk factors, such as alcohol consumption or smoking rates, that are likely to exhibit different age patterns over space. After implementing a base model with the main effects, the question is how to model additional terms which account for the interactions between the variables. Space-time interactions could range from fully independent, to each spatial unit having independent temporal patterns, to inseparable space-time variation where interactions borrow strength across neighbouring spatial units and neighbouring time periods (Knorr-Held, 2000).

However, it should be considered that by breaking the population down into smaller and smaller subgroups through space, age and time period, the counts of cases become more sparse and there is a need for stronger smoothing to produce robust estimates, particularly for data that are already at the small-area level. Although interaction effects are plausible, modellers should consider whether there is evidence for the interaction in the data or whether they can simplify the model if the interaction effect turns out to be negligible.

It should be noted that there are situations where statistical smoothing would not be appropriate. There might be true variability in the data which a smoothing model would conceal. For example, certain spatial units might contain isolated populations with high mortality over a sustained period, such as counties with Native American reservations in the USA (Dwyer-Lindgren et al., 2017a). There can also be spatially- and temporally-specific events that cause a spike in mortality such as the Grenfell Tower fire in 2017. Without accounting for these events, the models described above would either attenuate their effect on mortality, or a spike in deaths would cause estimates of mortality in nearby spatial units or years to be erroneously high. Beyond the use of subject matter experts, posterior predictive checks and plots of modelled death rates against the observed data can help to identify outlier spikes in mortality which are specific to a particular time or place, and which we do not want our model to smooth.

2.2.2 Applications of disease mapping methods

Small-area analyses of mortality

In order to compare the health status between areas, health authorities require a measure of mortality that collapses age-specific information into a single number. Indirectly standardised measures such as the standardised mortality ratio – the ratio between total deaths and expected deaths in an area – are easy to calculate, but are not easily understood by laypeople. Directly standardised methods, in contrast, require knowledge of the full age structure of death rates rather than just the total number of deaths. Age-standardised death rates, however, suffer the same interpretability issue as the standardised mortality ratio, and are only comparable between studies if the same reference population is used. An alternative choice is life expectancy. Silcocks et al. (2001) explain that life expectancy is a “more intuitive and immediate measure of the mortality experience of a population, [and] is likely to have greater impact… than other measures that are incomprehensible to most people.” However, although the metric appears more interpretable, life expectancy at birth constructed from a period life table is often misinterpreted as the mean length of life of the cohort into which the newborn is born. In fact, it measures the expectation of life assuming that the newborn will be exposed to age-specific mortality conditions throughout their life that are exactly the same as the current population.

The estimation of death rates requires two data sources: deaths counts and populations. Modern death registration systems, such as that of the UK, are almost entirely complete and accurate. On the other hand, although usually treated as a known quantity, the population denominator is often problematic. Populations for small geographies are only recorded during a decennial census, and estimates are generated for the years in-between using limited survey data on births, deaths and migration. And although the census is considered the “gold standard”, it is subject to enumeration errors, particularly for areas with special populations such as students or armed forces (Elliott et al., 2001b).

Beyond the population issue, finer scale studies are restricted by data availability. Where data are available, there is still the need to overcome small number issues before feeding death rates through the life table to calculate life expectancy. Eayres and Williams (2004) recommend a minimum population size of 5000 when using traditional life table methods, below which the calculation of life expectancy is unstable¹, or the error estimates become so large that any comparison between subgroups becomes meaningless. One approach, often taken by statistical agencies, is to build larger populations by either aggregating multiple years of data (Bahk et al., 2020; Office for National Statistics, 2015; Public Health England, 2021) or combining spatial units (Ezzati et al., 2008). Here, we focus on studies using Bayesian hierarchical models to generate robust estimates of age-specific death rates by recognising the correlations between spatial units and age groups, which produce more accurate estimates for small population studies of life expectancy (Congdon, 2009; Jonker et al., 2012).

Jonker et al. (2012) demonstrated the advantages of the Bayesian approach for 89 small areas in Rotterdam using a joint model for sex, space and age effects, finding a 8.2 year and 9.2 year gap between the neighbourhoods with the highest and lowest life expectancies for women and men. Stephens et al. (2013) employed the same model for 153 administrative areas in New South Wales, Australia.

Bayesian spatial models for mortality have been scaled to small areas for entire countries, and also consider trends in these regions over time. Bennett et al. (2015) forecasted life expectancy for 375 districts in England and Wales using a spatiotemporal model trained over a 31 year period, and Dwyer-Lindgren et al. (2017a) explored mortality trends in 3110 US counties from 1980 to 2014.

There have also been studies on specific cities at a finer resolution. In order to improve estimates for disability-free life expectancy, Congdon (2014) considered both ill-health and mortality in a joint likelihood with spatial effects for 625 wards in London, finding more than a two-fold variation in the percent of life spent in disability for men. Bilal et al. (2019) looked at 266 subcity units for six large cities in Latin America. As there is no contiguous boundary in this case, a random effects model for each city was used instead of a spatial model. The largest difference between the top and bottom decile of life expectancy at birth was 17.7 years for women in Santiago, Chile.

Two studies in North America have looked below the county level, at census tracts, with wide-ranging population sizes as small as 40. Dwyer-Lindgren et al. (2017b), using a model that relied heavily on sociodemographic covariates, studied trends for life expectancy and many causes of death for 397 tracts in King County, Washington, uncovering an 18.3 year gap in life expectancy for men. Using the same model for Vancouver, Canada, Yu et al. (2021) found widening inequalities over time and a difference of 9.5 years for men.

Small Area Health Statistics Unit

In 1983, a documentary on the radioactive fallout from a fire at the Sellafield nuclear site in Cumbria claimed that there was a ten-fold increase in cases of childhood leukaemia in the surrounding community. This anomaly had gone undetected by public health authorities, raising concern that routinely collected data were not able to identify local clusters of disease. The subsequent enquiry confirmed the excess, and recommended that a research unit was set up to monitor small-area statistics and respond quickly to ad hoc queries on local health hazards. The Small Area Health Statistics Unit (SAHSU) was established in 1987 (Elliott et al., 1992).

Beyond producing substantive research on environment and health, a core aim of SAHSU is to develop small-area statistical methodology (Wakefield and Elliott, 1999) for:

Point source type studies. Is there an increased risk close to an environmental hazard? SAHSU has investigated increased mortality from mesothelioma and asbestosis near Plymouth docks (Elliott et al., 1992); excess respiratory disease mortality near two factories in Barking and Havering (Aylin et al., 1999); kidney disease mortality near chemical plants in Runcorn (Hodgson et al., 2004); and possible excess of several morbidities near landfill sites (Elliott et al., 2001a; Jarup et al., 2007, 2002b).
Geographic correlation studies. Is there a correlation between disease risk and spatially-varying environmental variables? SAHSU have looked at several exposures, including a plume of mercury pollution (Hodgson et al., 2007); mobile phone base stations during pregnancy (Elliott et al., 2010); noise from aircraft near Heathrow (Hansell et al., 2013); road traffic noise in London (Halonen et al., 2015); and particulate matter from incinerators during pregnancy (Parkes et al., 2020).
Clustering. Does a disease produce non-random spatial patterns of incidence? If the aetiology is unknown, this could suggest that the disease is infectious.
Disease mapping. Summarising the spatial variation in risk.

SAHSU has been at the forefront of both methodology and applications in disease mapping. Aylin et al. (1999) mapped diseases for wards in Kensington, Chelsea and Westminster using a model that smoothed rates towards the mean risk across the region. Thereafter, SAHSU published a plethora of studies for disease mapping models with explicit spatial dependence, including using the BYM model (Equation 2.3) to map spatial variation in the relative risk of testicular (Toledano et al., 2001) and prostate (Jarup et al., 2002a) cancers for small areas in regions of England. In a landmark piece bringing together work on disease mapping and environmental exposures, SAHSU published an environment and health atlas for England and Wales, showing the spatial patterns of 14 health conditions at census ward level over an aggregated 25 year period alongside five environmental exposure surfaces (Hansell et al., 2014).

Further disease mapping studies at SAHSU using spatially structured effects have also extended the methodology to look at age patterns and trends over time. Asaria et al. (2012) analysed cardiovascular disease death rates by fitting a spatial model for all wards in England separately for each age group and time period. Bennett et al. (2015) designed a model to jointly forecast all-cause mortality for districts in England by age group and year. The model used BYM spatial effects and random walk effects over age and time to capture nonlinear relationships.

2.3 Mortality from specific causes of death

2.3.1 The Epidemiologic Transition

In the mid-twentieth century, a team in the US Public Health Service, led by Iwao Moriyama, began investigating the cause-specific composition of mortality into all diseases and injuries for the first half of the century. Moriyama and Gover (1948) grouped vital registration data into primary causes. Notably, they found, as the US saw an overall downward trend in mortality, the leading causes of death changed from communicable diseases, such as tuberculosis and diphtheria, toward non-communicable, “chronic diseases of older ages”, such as heart diseases and cancers. The success of the reduction – and in the case of typhoid fever, near-elimination – of infectious diseases was attributed to the strategy of the health officer in the early 1900s, who was focussed on improving water and sanitation, and public health interventions such as immunisation and quarantines.

By comparing vital registration data over several centuries, Abdel Omran observed this shift of mortality from communicable to non-communicable diseases (NCDs) in many countries (Omran, 1977, 1971). Although the pace and determinants of the transition varied between countries, Omran was able to formalise three common successive stages of the shift in mortality:

The Age of Pestilence and Famine. Mortality is high and largely governed by Malthusian “positive checks” – epidemics, famines, and wars.
The Age of Receding Pandemics. Mortality decreases as epidemics become less frequent, but infectious diseases remain the leading causes of death.
The Age of Degenerative and Man-made diseases. Mortality declines further along with fertility, increasing the average age of population and NCDs take over as the leading causes of death.

He termed this the Epidemiologic Transition theory. Omran (1971) explained that England and Wales took the classic transition path followed by western societies, whereby socioeconomic factors such as improvements to living standards are crucial in causing easily preventable diseases to subside and shifting towards the third phase of the transition, whilst medical and other public health technology only help society much later in the final stage. Later, Olshansky and Ault (1986) would propose a fourth stage to the theory, the Age of Delayed Degenerative Diseases, in which the structure of causes of death is stable, but the age at which degenerative diseases kill is postponed, thus decreasing older age mortality. There are, however, questions around the universality and unidirectionality of the theory, with many examples in which age-specific death rates for population subgroups have risen over time, most notably the HIV/AIDS pandemic (Gaylin and Kates, 1997). Gersten and Wilmoth (2002) also criticise the lack of attention Omran’s theory pays towards the role of infection in chronic and degenerative diseases, in particular certain cancers.

Around the same time as Omran, Preston collated cause-specific mortality data for a large number of populations, spanning 48 nations and nearly a century (Preston, 1970; Preston and Nelson, 1974). This would enable international comparisons of groups of causes of death over different time periods, and a deeper understanding of the upward trends in life expectancy. In particular, by plotting cause-specific disease rates against overall mortality, Preston and Nelson (1974) saw that, over time, the contribution of infectious diseases to a particular level of mortality had become ever smaller. That is to say, as mortality declined, the contribution from infectious diseases also declined. Preston attributed this to an accelerating rate of medical progress guided by the “germ theory of disease”, which public health and science were not able to replicate for NCDs. Preston also traced the excess deaths in older males observed in western societies to cardiovascular diseases, cancer and bronchitis – a direct result of dramatic increases in cigarette smoking (Preston, 1970).

Since its first edition in 1990, the subject of international comparisons of the cause-specific composition of mortality has been the remit of the Global Burden of Disease (GBD) studies (Murray and Lopez, 1996). The studies aim to quantify and compare the burden of diseases, injuries, and risk factors, usually through cross-sectional methods but occasionally by examining trends and subnational populations (Dwyer-Lindgren et al., 2017a; Ezzati et al., 2008). An important innovation of the GBD study was the introduction of a hierarchical classification of groups of causes, with the broadest level divided into three groups: communicable, maternal, perinatal, and nutritional diseases (Group 1), NCDs (Group 2), and injuries (Group 3). Salomon and Murray (2002) made use of the wide-ranging dataset and grouping from the GBD to revisit the epidemiologic transition for the second half of the twentieth century. They found the majority of the change in cause structure occurred among children, with a shift from Group 1 to Groups 2 and 3, and in young adults, where the role of injuries is more dominant for men.

2.3.2 Modelling cause-specific mortality

In studies looking at multiple causes of death it may be desirable to extend disease mapping models to capture the interdependence between causes of death. Cause-specific mortality can exhibit complex correlation structures, with correlations for diseases with common risk factors but anti-correlations for competing causes of death. For studies that already look at mortality for small areas and narrow age groups, breaking the population down further by cause of death increases the sparseness in the data, and it may be necessary to introduce terms to the smoothing model which share information between causes in order to stabilise the estimation of death rates.

The simplest and most computationally-scalable method is to ignore any correlations between diseases and run separate regression models for each cause of death. This is the approach taken by GBD studies for a vast array of causes of death. When looking at both total mortality and a mutually exclusive, collectively exhaustive list of causes of death, studies typically constrain the cause-specific death rate estimates from separate regressions to sum to the all-cause mortality value. This is because estimates of all-cause mortality are more robust, as the death counts are larger and therefore have lower variance, and the data do not suffer from errors in assignment of cause of death. The GBD studies scale death rates so that the proportions of each cause of death sum to unity. In the context of mortality projections, Wilmoth (1995) also points out that aggregating forecasts from multiple causes of death often leads to upward bias when compared to the total mortality forecast.

A joint modelling approach would allow the borrowing of strength across causes to express correlations between diseases that share common risk factors and similar aetiologies. There is also some redundancy in separate regression models as a unique spatial surface is specified for each cause. In the disease mapping literature, studies have built spatial models to look at a small number of diseases which share spatial components between diseases. Firstly, Knorr-Held and Best (2001) considered a model (also described in Best et al. (2005)) for two diseases, with disease-specific spatial components and one shared component. Held et al. (2005) generalised this approach to model any number of diseases as a weighted sum of shared components. Rather than a general approach, both Downing et al. (2008) and Mahaki et al. (2018) allocated the spatial components to the diseases a priori based on knowledge of the common risk factors between cancers such as smoking, obesity and alcohol consumption. However, unless the components are pre-specified based on prior knowledge, the required number of shared components to capture variation in the data needs to be determined through trial and error. This is especially problematic when the number of diseases in the study, and hence the possible number of combinations of shared components, increases.

Foreman et al. (2017) modelled a larger number of causes of death to jointly forecast cause-specific mortality for states in the US. The model was similar to the spatiotemporal models described earlier, with random walk effects for temporal nonlinearities and a CAR prior for spatial effects, but with the introduction of a multivariate normal prior for causes of death whereby the covariance matrix describes the correlation structure between the 15 cause groups. The model did not, however, share information between age groups.

In studies of the cause composition of total mortality, rather than estimating the absolute death rate for each cause of death, it is possible to reframe the problem using a compositional model which considers the fraction of each cause of death composing total mortality. This was the approach taken by Salomon and Murray (2002) to investigate the dynamics of the proportions of mortality from GBD Groups 1, 2, and 3. The benefit of a compositional model is that the proportions are constrained to sum to unity, and the model can capture covariance between the component causes of death. However, it is not possible to recover absolute cause-specific death rates using the compositional approach without estimating the overall death rate.

2.4 Health inequalities in the UK

While the UK is, by global standards, a wealthy nation with relatively high life expectancy, the nation still suffers vast, preventable inequalities in mortality and morbidity. Health inequalities can be reduced through, amongst other initiatives, progressive social and economic policies, better nutrition programmes, and improved health care. It is important to estimate and understand differences in health outcomes between population subgroups to aid the design of such policies. There are several ways to stratify the UK population and compare inequalities between subgroups. Here, I focus on class, income, geography, and deprivation.

Class and income inequality

The notion of class is prominent in UK society, but health outcomes between classes are difficult to separate from other risk factors such as hazards in manual labour or smoking rates. The Whitehall study of 1967 followed 17,530 men working in the civil service and recorded their mortality over a 10-year period. Marmot et al. (1984) found, by classifying the civil servants into social class according to their employment grade, there was a three-fold difference in mortality between the highest class, administrators, and men in the lowest class, mainly messengers and manual workers. They found, in general, a strong inverse association between grade and mortality, which Marmot described as a “social gradient”. The men were working stable, sedentary jobs in the same office building in London, so the gradient could not be explained by industrial exposure alone, and the gradient remained even after controlling for smoking. The authors concluded there must be other factors inherent to social class (defined here by employment grade), which explain the mortality differences. A second cohort of Whitehall employees from 1985 to 1988, this time including women as well as men, were screened and asked to answer questions on self-reported ill-health. Marmot et al. (1991) found the social gradient in health had persisted in the 20 years separating the studies. In 2008, Marmot was asked by the Secretary of State to conduct a review into the state of health inequalities in the UK and to use the evidence to design policy for reducing these inequalities. A key plot in the first Marmot Review, released in 2010, depicted the social gradient in mortality for regions in England by socio-economic classification of employment (Marmot et al., 2010).

Income is not a routinely collected statistic in the UK. Nevertheless, using a small survey of 7000 people on three measures of morbidity, Wilkinson (1992) showed health improved sharply from the lowest to the middle of the income range.

Spatial inequality

In 2015, the GBD study released its first subnational estimates of mortality, starting with the UK and Japan. Steel et al. (2018) assessed these data, which divided the UK into 150 regions, finding mortality from all-causes varied twofold across the country, with the highest years of life lost in Blackpool and the lowest in Wokingham. In a study on forecasting subnational life expectancy in England and Wales, Bennett et al. (2015) estimated a 8.2 year range in life expectancy for men and 7.1 year range for women in 2012 between 375 districts. The lowest life expectancies were seen in urban northern England, and the highest in the south and London’s affluent districts. Within London itself, Cheshire (2012) visualised the heterogeneity of mortality in London by assigning tube stops the life expectancy of the nearest ward, revealing that 10 years are lost between two consecutive stops, Canary Wharf and North Greenwich, on the Jubilee line.

Deprivation

There have been substantial efforts in the UK to measure the deprivation of an area. Since 2004, the standard deprivation indicator in England has been the Index of Multiple Deprivation (IMD) – a composite indicator for each Lower-layer Super Output Area (LSOA²) covering income, unemployment, health, crime and environmental data sources (Ministry of Housing, Communities & Local Government, 2019). The Marmot Review presented life expectancy and disability-free life expectancy against IMD at the Middle-layer Super Output Area, which exhibit strong social gradients (Marmot et al., 2010). The GBD study found the 15 most deprived areas had consistently raised mortality, especially for all causes, lung cancer and chronic obstructive pulmonary disease. Deprived areas in London, such as Tower Hamlets, Hackney, Barking and Dagenham had lower rates of premature mortality than expected for that level of deprivation (Steel et al., 2018). Bennett et al. (2018) jointly estimated death rates by age, year and deprivation decile. They found that since 2011, although national life expectancy has continued to increase, the rise in female life expectancy has reversed in the two most deprived deciles. Using data from Public Health England, the second Marmot Review in 2020 also reported that female life expectancy declined in the most deprived decile between the periods 2010-12 and 2016-18 (Marmot et al., 2020). Digging further into these trends by region, the report found this trend was seen in all regions except London, the West Midlands and the North West, and that male life expectancy in the bottom decile also decreased in the North East, Yorkshire and the Humber, and the East of England.

Recent trends in health outcomes

Since the turn of the millennium, there have been two periods of contrasting public health policy in the UK. The early 2000s saw the implementation of the English health inequalities strategy under New Labour, with explicit goals of reducing geographical inequalities in life expectancy. The strategy saw a large increase in public spending targeting the social determinants of health, with policies on supporting families, tackling deprivation, and preventative healthcare.

Following the change in government in 2010, the strategy came to an end. The Conservative government implemented a widespread series of cuts to public services, collectively known as austerity. This included both tight restrictions on the healthcare budget as well as a sweeping reorganisation of the NHS in hope of improving the organisation’s efficiency, and cuts to the wider determinants of social health, such as housing and education (Ham, 2023).

Although it is difficult to isolate the causal effect of the period of austerity on health outcomes, there is some evidence of differences in health outcomes in these periods. Barr et al. (2017) analysed the trends in life expectancy for different quantiles of deprivation and provided evidence that the English health inequalities strategy achieved its aim of reducing the gap in life expectancy between the 20% most-deprived areas and the rest of the English population, and that the trends were reversing since 2012. These trends have been found across the life course: with rising infant mortality associated with childhood poverty (Taylor-Robinson et al., 2019); increases in “deaths of despair” (drug overdose, suicide, alcoholic liver disease) for those in middle ages (Angus et al., 2023; Hiam et al., 2020); and falls in female life expectancy at 65 and 85 (Hiam et al., 2018). Alexiou et al. (2021) found strong associations between cuts to local government and the change in district-level life expectancy from 2013 to 2017. As written in the The New York Times, “after eight years of budget cutting, Britain is looking less like the rest of Europe and more like the United States, with a shrinking welfare state and spreading poverty” (Goodman, 2018).

Although fiscal policies of austerity were adopted by many countries in response to the global financial crash of 2008, there is evidence that public health has deteriorated more in the UK than in other countries. In an international study comparing mortality trends in England and Wales to 22 industrialised countries, Leon et al. (2019) showed that although there was a general slowdown in improvement of life expectancy across many nations, the slowdown in the most recent period of the study, 2011-16, was more pronounced in England and Wales. More recently, The Economist found the same evidence, comparing the long-run trend from 1980-2011 through to 2022 for 12 European countries: “longer-run slowdowns in life expectancy are observable in other European countries… but none has stalled quite as much as Britain” (The Economist, 2023). The UK has also performed worse as measured by cancer survival rates and infant mortality compared to other industrialised countries (OECD, 2016).

After a decade of cuts, the UK entered the 2020s facing the greatest public health challenge for a generation: the Covid-19 pandemic. Unsurprisingly, England and Wales suffered one of the highest excess deaths tolls relative to other high-income countries (Kontis et al., 2020).

It is important to estimate how health inequalities have changed in different areas of the country through this period of substantial change in economic, social, and healthcare policy. Small-area health statistics, and in particular those at high resolutions, not only reveal the extent of the mortality differences between neighbourhoods, but can also identify the areas at highest risk, allowing public health interventions to target the most disadvantaged groups.

2.5 Summary

Death rates vary by sex, age group and across time and space. Studying mortality variations at the small-area level introduces sparsity in the data, and statistical smoothing models must be used to obtain robust estimates of death rates. These models should be flexible to allow for variation across age, space and time, and should consider interactions between each of the dimensions. Modelling mortality from specific causes of death introduces further challenges, both through increased sparsity in the death counts and through correlations between diseases with common risk factors.

Following a change in government in 2010, the UK abandoned its strategy to reduce health inequalities. Since then, there has been a decline in female life expectancy in the most deprived areas.

Alexiou A, Fahy K, Mason K, Bennett D, Brown H, Bambra C, Taylor-Robinson D, Barr B. 2021. Local government funding and life expectancy in England: A longitudinal ecological study. The Lancet Public Health 6:e641–e647. doi:10.1016/S2468-2667(21)00110-9

Angus C, Buckley C, Tilstra AM, Dowd JB. 2023. Increases in “deaths of despair” during the COVID-19 pandemic in the United States and the United Kingdom. Public Health 218:92–96. doi:10.1016/j.puhe.2023.02.019

Asaria P, Fortunato L, Fecht D, Tzoulaki I, Abellan JJ, Hambly P, de Hoogh K, Ezzati M, Elliott P. 2012. Trends and inequalities in cardiovascular disease mortality across 7932 English electoral wards, 19822006: Bayesian spatial analysis. International Journal of Epidemiology 41:1737–1749. doi:10.1093/ije/dys151

Aylin P, Maheswaran R, Wakefield J, Cockings S, Jarup L, Arnold R, Wheeler G, Elliott P. 1999. A national facility for small area disease mapping and rapid initial assessment of apparent disease clusters around a point source: The UK Small Area Health Statistics Unit. Journal of Public Health 21:289–298. doi:10.1093/pubmed/21.3.289

Bahk J, Kang H-Y, Khang Y-H. 2020. Life expectancy and inequalities therein by income from 2016 to 2018 across the 253 electoral constituencies of the National Assembly of the Republic of Korea. Journal of Preventive Medicine and Public Health 53:143–148. doi:10.3961/jpmph.20.050

Barr B, Higgerson J, Whitehead M. 2017. Investigating the impact of the English health inequalities strategy: Time trend analysis. BMJ 358:j3310. doi:10.1136/bmj.j3310

Bennett JE, Li G, Foreman K, Best N, Kontis V, Pearson C, Hambly P, Ezzati M. 2015. The future of life expectancy and life expectancy inequalities in England and Wales: Bayesian spatiotemporal forecasting. The Lancet 386:163–170. doi:10.1016/S0140-6736(15)60296-3

Bennett JE, Pearson-Stuttard J, Kontis V, Capewell S, Wolfe I, Ezzati M. 2018. Contributions of diseases and injuries to widening life expectancy inequalities in England from 2001 to 2016: A population-based analysis of vital registration data. The Lancet Public Health 3:e586–e597. doi:10.1016/S2468-2667(18)30214-7

Besag J, York J, Mollié A. 1991. Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics 43:1–20. doi:10.1007/BF00116466

Best N, Richardson S, Thomson A. 2005. A comparison of Bayesian spatial models for disease mapping. Statistical Methods in Medical Research 14:35–59. doi:10.1191/0962280205sm388oa

Bilal U, Alazraqui M, Caiaffa WT, Lopez-Olmedo N, Martinez-Folgar K, Miranda JJ, Rodriguez DA, Vives A, Diez-Roux AV. 2019. Inequalities in life expectancy in six large Latin American cities from the SALURBAL study: An ecological analysis. The Lancet Planetary Health 3:e503–e510. doi:10.1016/S2542-5196(19)30235-9

Cheshire J. 2012. Featured Graphic. Lives on the Line: Mapping Life Expectancy along the London Tube Network. Environment and Planning A: Economy and Space 44:1525–1528. doi:10.1068/a45341

Congdon P. 2014. Modelling changes in small area disability free life expectancy: Trends in London wards between 2001 and 2011. Statistics in Medicine 33:5138–5150. doi:10.1002/sim.6298

Congdon P. 2009. Life Expectancies for Small Areas: A Bayesian Random Effects Methodology. International Statistical Review 77:222–240. doi:10.1111/j.1751-5823.2009.00080.x

Downing A, Forman D, Gilthorpe MS, Edwards KL, Manda SO. 2008. Joint disease mapping using six cancers in the Yorkshire region of England. International Journal of Health Geographics 7:41. doi:10.1186/1476-072X-7-41

Duncan EW, White NM, Mengersen K. 2017. Spatial smoothing in Bayesian models: A comparison of weights matrix specifications and their impact on inference. International Journal of Health Geographics 16:47. doi:10.1186/s12942-017-0120-x

Dwyer-Lindgren L, Bertozzi-Villa A, Stubbs RW, Morozoff C, Mackenbach JP, van Lenthe FJ, Mokdad AH, Murray CJL. 2017a. Inequalities in Life Expectancy Among US Counties, 1980 to 2014: Temporal Trends and Key Drivers. JAMA Internal Medicine 177:1003–1011. doi:10.1001/jamainternmed.2017.0918

Dwyer-Lindgren L, Stubbs RW, Bertozzi-Villa A, Morozoff C, Callender C, Finegold SB, Shirude S, Flaxman AD, Laurent A, Kern E, Duchin JS, Fleming D, Mokdad AH, Murray CJL. 2017b. Variation in life expectancy and mortality by cause among neighbourhoods in King County, WA, USA, 19902014: A census tract-level analysis for the Global Burden of Disease Study 2015. The Lancet Public Health 2:e400–e410. doi:10.1016/S2468-2667(17)30165-2

Eayres D, Williams ES. 2004. Evaluation of methodologies for small area life expectancy estimation. Journal of Epidemiology & Community Health 58:243–249. doi:10.1136/jech.2003.009654

Elliott P, Briggs D, Morris S, Hoogh C de, Hurt C, Jensen TK, Maitland I, Richardson S, Wakefield J, Jarup L. 2001a. Risk of adverse birth outcomes in populations living near landfill sites. BMJ 323:363–368. doi:10.1136/bmj.323.7309.363

Elliott P, Toledano MB, Bennett JE, Beale L, Hoogh C de, Best N, Briggs D. 2010. Mobile phone base stations and early childhood cancers: Case-control study. BMJ 340:c3077. doi:10.1136/bmj.c3077

Elliott P, Wakefield J, Best N, Briggs D. 2001b. Spatial epidemiology: Methods and applications. Oxford University Press.

Elliott P, Westlake AJ, Hills M, Kleinschmidt I, Rodrigues L, McGale P, Marshall K, Rose G. 1992. The Small Area Health Statistics Unit: A national facility for investigating health around point sources of environmental pollution in the United Kingdom. Journal of Epidemiology & Community Health 46:345–349. doi:10.1136/jech.46.4.345

Ezzati M, Friedman AB, Kulkarni SC, Murray CJL. 2008. The Reversal of Fortunes: Trends in County Mortality and Cross-County Mortality Disparities in the United States. PLOS Medicine 5:e66. doi:10.1371/journal.pmed.0050066

Finucane MM, Paciorek CJ, Danaei G, Ezzati M. 2014. Bayesian Estimation of Population-Level Trends in Measures of Health Status. Statistical Science 29:18–25. doi:10.1214/13-STS427

Foreman KJ, Li G, Best N, Ezzati M. 2017. Small area forecasts of cause-specific mortality: Application of a Bayesian hierarchical model to US vital registration data. Journal of the Royal Statistical Society Series C (Applied Statistics) 66:121–139. doi:10.1111/rssc.12157

Gaylin DS, Kates J. 1997. Refocusing the lens: Epidemiologic transition theory, mortality differentials, and the AIDS pandemic. Social Science & Medicine 44:609–621. doi:10.1016/S0277-9536(96)00212-2

Gersten O, Wilmoth JR. 2002. The Cancer Transition in Japan since 1951. Demographic Research 7:271–306. doi:10.4054/DemRes.2002.7.5

Goodman PS. 2018. In Britain, Austerity Is Changing Everything. The New York Times.

Halonen JI, Hansell AL, Gulliver J, Morley D, Blangiardo M, Fecht D, Toledano MB, Beevers SD, Anderson HR, Kelly FJ, Tonne C. 2015. Road traffic noise is associated with increased cardiovascular morbidity and mortality and all-cause mortality in London. European Heart Journal 36:2653–2661. doi:10.1093/eurheartj/ehv216

Ham C. 2023. The Rise and Decline of the NHS in England 2000-20. The King’s Fund.

Hansell AL, Beale L, Ghosh RE, Fortunato L, Fecht D, Jarup L, Elliott P. 2014. The Environment and Health Atlas for England and Wales. Oxford University Press.

Hansell AL, Blangiardo M, Fortunato L, Floud S, Hoogh K de, Fecht D, Ghosh RE, Laszlo HE, Pearson C, Beale L, Beevers S, Gulliver J, Best N, Richardson S, Elliott P. 2013. Aircraft noise and cardiovascular disease near Heathrow airport in London: Small area study. BMJ 347:f5432. doi:10.1136/bmj.f5432

Held L, Natário I, Fenton SE, Rue H, Becker N. 2005. Towards joint disease mapping. Statistical Methods in Medical Research 14:61–82. doi:10.1191/0962280205sm389oa

Hiam L, Dorling D, McKee M. 2020. Things Fall Apart: The British Health Crisis 20102020. British Medical Bulletin 133:4–15. doi:10.1093/bmb/ldz041

Hiam L, Harrison D, McKee M, Dorling D. 2018. Why is life expectancy in England and Wales “stalling”? J Epidemiol Community Health 72:404–408. doi:10.1136/jech-2017-210401

Hodgson S, Nieuwenhuijsen MJ, Colvile R, Jarup L. 2007. Assessment of exposure to mercury from industrial emissions: Comparing “distance as a proxy” and dispersion modelling approaches. Occupational and Environmental Medicine 64:380–388. doi:10.1136/oem.2006.026781

Hodgson S, Nieuwenhuijsen MJ, Hansell A, Shepperd S, Flute T, Staples B, Elliott P, Jarup L. 2004. Excess risk of kidney disease in a population living near industrial plants. Occupational and Environmental Medicine 61:717–719. doi:10.1136/oem.2003.010629

Jarup L, Best N, Toledano MB, Wakefield J, Elliott P. 2002a. Geographical epidemiology of prostate cancer in Great Britain. International Journal of Cancer 97:695–699. doi:10.1002/ijc.10113

Jarup L, Briggs D, de Hoogh C, Morris S, Hurt C, Lewin A, Maitland I, Richardson S, Wakefield J, Elliott P. 2002b. Cancer risks in populations living near landfill sites in Great Britain. British Journal of Cancer 86:1732–1736. doi:10.1038/sj.bjc.6600311

Jarup L, Morris S, Richardson S, Briggs D, Cobley N, de Hoogh C, Gorog K, Elliott P. 2007. Down syndrome in births near landfill sites. Prenatal Diagnosis 27:1191–1196. doi:10.1002/pd.1873

Jonker MF, van Lenthe FJ, Congdon PD, Donkers B, Burdorf A, Mackenbach JP. 2012. Comparison of Bayesian Random-Effects and Traditional Life Expectancy Estimations in Small-Area Applications. American Journal of Epidemiology 176:929–937. doi:10.1093/aje/kws152

Kelsall J, Wakefield J. 2002. Modeling Spatial Variation in Disease Risk. Journal of the American Statistical Association 97:692–701. doi:10.1198/016214502388618438

Knorr-Held L. 2000. Bayesian modelling of inseparable space-time variation in disease risk. Statistics in Medicine 19:2555–2567. doi:10.1002/1097-0258(20000915/30)19:17/18<2555::AID-SIM587>3.0.CO;2-%23

Knorr-Held L, Best NG. 2001. A Shared Component Model for Detecting Joint and Selective Clustering of Two Diseases. Journal of the Royal Statistical Society Series A (Statistics in Society) 164:73–85.

Konstantinoudis G, Cameletti M, Gómez-Rubio V, Gómez IL, Pirani M, Baio G, Larrauri A, Riou J, Egger M, Vineis P, Blangiardo M. 2022. Regional excess mortality during the 2020 COVID-19 pandemic in five European countries. Nature Communications 13:482. doi:10.1038/s41467-022-28157-3

Kontis V, Bennett JE, Rashid T, Parks RM, Pearson-Stuttard J, Guillot M, Asaria P, Zhou B, Battaglini M, Corsetti G, McKee M, Di Cesare M, Mathers CD, Ezzati M. 2020. Magnitude, demographics and dynamics of the effect of the first wave of the COVID-19 pandemic on all-cause mortality in 21 industrialized countries. Nature Medicine 26:1919–1928. doi:10.1038/s41591-020-1112-0

Leon DA, Jdanov DA, Shkolnikov VM. 2019. Trends in life expectancy and age-specific mortality in England and Wales, 19702016, in comparison with a set of 22 high-income countries: An analysis of vital statistics data. The Lancet Public Health 4:e575–e582. doi:10.1016/S2468-2667(19)30177-X

Mahaki B, Mehrabi Y, Kavousi A, Schmid VJ. 2018. Joint Spatio-temporal Shared Component Model with an Application in Iran Cancer Data. Asian Pacific Journal of Cancer Prevention 19:1553–1560. doi:10.22034/APJCP.2018.19.6.1553

Marmot MG, Allen J, Boyce T, Goldblatt P, Morrison J. 2020. Marmot Review: 10 years on. Institute of Health Equity.

Marmot MG, Davey Smith G, Stansfeld S, Patel C, North F, Head J, White I, Brunner E, Feeney A. 1991. Health inequalities among British civil servants: The Whitehall II study. The Lancet 337:1387–1393. doi:10.1016/0140-6736(91)93068-K

Marmot MG, Goldblatt P, Allen J, Boyce T, McNeish D, Grady M, Strelitz J, Geddes I, Friel S, Porritt F, Reinertsen E, Bell R, Allen M. 2010. The Marmot Review: Fair society, healthy lives. Institute of Health Equity.

Marmot MG, Shipley MJ, Rose G. 1984. Inequalities in death - specific explanations of a general pattern? The Lancet 323:1003–1006. doi:10.1016/S0140-6736(84)92337-7

Ministry of Housing, Communities & Local Government. 2019. English indices of deprivation 2019. Ministry of Housing, Communities & Local Government.

Moriyama IM, Gover M. 1948. Statistical Studies of Heart Diseases: I. Heart Diseases and Allied Causes of Death in Relation to Age Changes in the Population. Public Health Reports (1896-1970) 63:537–545. doi:10.2307/4586527

Murray CJL, Lopez AD. 1996. The Global Burden of Disease: A comprehensive assessment of mortality and disability from diseases, injuries, and risk factors in 1990 and projected in 2020. World Health Organization.

OECD. 2016. OECD Reviews of Health Care Quality: United Kingdom 2016: Raising Standards. Paris: Organisation for Economic Co-operation and Development.

Office for National Statistics. 2015. Health Expectancies at Birth for Middle Layer Super Output Areas (MSOAs), England. Office for National Statistics.

Olshansky SJ, Ault AB. 1986. The Fourth Stage of the Epidemiologic Transition: The Age of Delayed Degenerative Diseases. The Milbank Quarterly 64:355–391. doi:10.2307/3350025

Omran AR. 1977. A century of epidemiologic transition in the United States. Preventive Medicine 6:30–51. doi:10.1016/0091-7435(77)90003-2

Omran AR. 1971. The Epidemiologic Transition: A Theory of the Epidemiology of Population Change. The Milbank Memorial Fund Quarterly 49:509–538. doi:10.2307/3349375

Paciorek CJ, Schervish MJ. 2006. Spatial modelling using a new class of nonstationary covariance functions. Environmetrics 17:483–506. doi:10.1002/env.785

Parkes B, Hansell AL, Ghosh RE, Douglas P, Fecht D, Wellesley D, Kurinczuk JJ, Rankin J, de Hoogh K, Fuller GW, Elliott P, Toledano MB. 2020. Risk of congenital anomalies near municipal waste incinerators in England and Scotland: Retrospective population-based cohort study. Environment International 134:104845. doi:10.1016/j.envint.2019.05.039

Preston SH. 1970. An International Comparison of Excessive Adult Mortality. Population Studies 24:5–20. doi:10.2307/2173259

Preston SH, Heuveline P, Guillot M. 2001. Demography: Measuring and Modeling Population Processes. Blackwell Publishing.

Preston SH, Nelson VE. 1974. Structure and change in causes of death: An international summary. Population Studies 28:19–51. doi:10.1080/00324728.1974.10404577

Public Health England. 2021. Local Health - Small Area Public Health Data. Public Health England.

Salomon JA, Murray CJL. 2002. The Epidemiologic Transition Revisited: Compositional Models for Causes of Death by Age and Sex. Population and Development Review 28:205–228. doi:10.1111/j.1728-4457.2002.00205.x

Silcocks PBS, Jenner DA, Reza R. 2001. Life expectancy as a summary of mortality in a population: Statistical considerations and suitability for use by health authorities. Journal of Epidemiology & Community Health 55:38–43. doi:10.1136/jech.55.1.38

Steel N, Ford JA, Newton JN, Davis ACJ, Vos T, Naghavi M, Glenn S, Hughes A, Dalton AM, Stockton D, Humphreys C, Dallat M, Schmidt J, Flowers J, Fox S, Abubakar I, Aldridge RW, Baker A, Brayne C, Brugha T, Capewell S, Car J, Cooper C, Ezzati M, Fitzpatrick J, Greaves F, Hay R, Hay S, Kee F, Larson HJ, Lyons RA, Majeed A, McKee M, Rawaf S, Rutter H, Saxena S, Sheikh A, Smeeth L, Viner RM, Vollset SE, Williams HC, Wolfe C, Woolf A, Murray CJL. 2018. Changes in health in the countries of the UK and 150 English Local Authority areas 19902016: A systematic analysis for the Global Burden of Disease Study 2016. The Lancet 392:1647–1661. doi:10.1016/S0140-6736(18)32207-4

Stephens AS, Purdie S, Yang B, Moore H. 2013. Life expectancy estimation in small administrative areas with non-uniform population sizes: Application to Australian New South Wales local government areas. BMJ Open 3:e003710. doi:10.1136/bmjopen-2013-003710

Taylor-Robinson D, Lai ETC, Wickham S, Rose T, Norman P, Bambra C, Whitehead M, Barr B. 2019. Assessing the impact of rising child poverty on the unprecedented rise in infant mortality in England, 20002017: Time trend analysis. BMJ Open 9:e029424. doi:10.1136/bmjopen-2019-029424

The Economist. 2023. Britain has endured a decade of early deaths. Why? The Economist 19–21.

Toledano MB, Jarup L, Best N, Wakefield J, Elliott P. 2001. Spatial variation and temporal trends of testicular cancer in Great Britain. British Journal of Cancer 84:1482–1487. doi:10.1054/bjoc.2001.1739

Wakefield J, Elliott P. 1999. Issues in the statistical analysis of small area health data. Statistics in Medicine 18:2377–2399. doi:10.1002/(SICI)1097-0258(19990915/30)18:17/18<2377::AID-SIM263>3.0.CO;2-G

Wilkinson RG. 1992. Income distribution and life expectancy. British Medical Journal 304:165–168. doi:10.1136/bmj.304.6820.165

Wilmoth JR. 1995. Are mortality projections always more pessimistic when disaggregated by cause of death? Mathematical Population Studies 5:293–319. doi:10.1080/08898489509525409

Yu J, Dwyer-Lindgren L, Bennett J, Ezzati M, Gustafson P, Tran M, Brauer M. 2021. A spatiotemporal analysis of inequalities in life expectancy and 20 causes of mortality in sub-neighbourhoods of Metro Vancouver, British Columbia, Canada, 19902016. Health & Place 72:102692. doi:10.1016/j.healthplace.2021.102692

Or, if the open-ended age group contains zero deaths, the calculation of life expectancy is impossible. This is because the probability of dying in the final age group will be zero, so the life table cannot be closed and the life expectancy will be infinite.↩︎
See Chapter 3 for descriptions of spatial units.↩︎