NCDS response and missingness

NCDS response

The table below presents statistics about response in NCDS at every major sweep from birth.

Of the 17,415 cohort members who participated in the first sweep, 4,497 (25.8%) have participated in all 11 major sweeps.

Of all 18,558 cohort members, 11,232 (60.5%) have taken part in seven or more sweeps.

Table: Participation in NCDS from birth to 55 years

Total cohort Dead Emigrants Eligible sample Participants (% of eligible sample)
Birth – 1958 17638 0 0 17638 17415 98.7
Age 7 – 1965 18016a 821 475 16720 15425 92.3
Age 11 – 1969 18287a 840 701 16746 15337 91.6
Age 16 – 1974 18558a 873 799 16886 14654 86.8
Age 23 – 1981 18558 960 1196 16402 12357 75.3
Age 33 – 1991 18558 1049 1335 16174 11469 70.9
Age 42 – 2000 18558 1199 1268 16091 11419 71.0
Age 44 – 2002 18558 1321 1234 16003 9377 58.6
Age 46 – 2004 18558 1323 1272 15963 9534 59.7
Age 50 – 2008 18558 1459 1293 15806 9790 61.9
Age 55 – 2013 18558 1659 1286 15613 9137 58.5

a The original sample was supplemented by migrants born in 1958

Missingness in NCDS

NCDS variables

We have used a systematic data-driven approach to identify variables that are important predictors of non-response at each NCDS sweep.

A list of these variables is provided in the appendix of the Handling missing data in the CLS cohort studies user guide.

These variables can then be considered for inclusion in analyses – for example as auxiliary variable when using multiple imputation – in order to maximise the plausibility of the missing at random (MAR) assumption.

Read more about the process in the article: Missing at random assumption made more plausible: evidence from the 1958 British birth cohort.

Example: restoring the composition of NCDS at age 50 and 55

We have been able to restore the composition of the NCDS samples at ages 50 and 55 to be representative of the study’s target population. We did this by including predictors of non-response at age 50 and age 55 as auxiliary variables in multiple imputation analyses.

For example, we were able to replicate:

  • the original distribution of paternal social class observed at the birth survey
  • the distribution of cognitive ability at age seven
  • and the known population distribution of educational attainment and marital status at age 50.

Figure 1: Social class of mother’s husband at birth before and after adjustment for missing data

Graph showing social class of mother's husband at birth before and after adjustment for missing data

 

 

 

 

 

 

 

The imputation phase of this analysis included predictors of non-response at age 55 and social class at birth only for cohort members that participated at age 55.

Figure 2: Percentage of those with degree or equivalent at age 50 in the Annual Population Survey and NCDS before and after adjustment for missing data.

graph

 

 

 

 

 

 

 

APS GB: Annual Population Survey = Born in Great Britain in 1958, derived by the Office for National Statistics (ONS)

APS All: Annual Population Survey = Born in Great Britain or elsewhere in 1958 (derived by ONS)

NCDS50 MI: Estimate after multiple imputation using predictors of educational attainment at age 50 and predictors of non-response at age 50 as auxiliary variables.

Useful documents

Mostafa, T., Narayanan, M., Pongiglione, B., Dodgeon, B., Goodman, A., Silverwood, R.J., & Ploubidis, G.B. (2021)
Missing at random assumption made more plausible: evidence from the 1958 British birth cohort
Journal of Clinical Epidemiology
Read the full paper
Silverwood, R.J., Goodman, A., Ploubidis, G.B. (2022)
Letter to the editor: Don’t forget survey data: ‘healthy cohorts’ are ‘real-world’ relevant if missing data are handled appropriately
Longitudinal and Life Course Studies
Read the full paper
Rajah, N., Calderwood, L., De Stavola, B.L., Harron, K., Ploubidis G.B. & Silverwood, R.J. (2023)
Using linked administrative data to aid the handling of non-response and restore sample representativeness in cohort studies
BMC Medical Research Methodology
Read the full paper
Narayanan, M.K., Dodgeon, B., Katsoulis, M., Ploubidis, G.B. & Silverwood, R.J. (2024)
How to mitigate selection bias in COVID-19 surveys: evidence from five national cohorts
European Journal of Epidemiology
Read the full paper

Handling missing data

Find out more about the approaches that CLS recommends for the handling of missing data, and resources available, on the Handling missing data page.

Contact us

Centre for Longitudinal Studies
UCL Social Research Institute

20 Bedford Way
London WC1H 0AL

Email: clsdata@ucl.ac.uk

Follow us