Applied statistical methods

On this page:

Detail Outputs Featured scientific publications

Background

CLS’ applied statistical methods research programme supports and enables users to tackle some of the important challenges in using longitudinal data, including:

missing data
causal inferences
measurement error and
survey mode effects.

We bring together ideas and methods from a number of disciplines, such as statistics, econometrics, psychometrics, epidemiology and computer science.

We publish applied methodological papers in peer-reviewed journals and are developing a series of step-by-step user guides and training to help users apply these methods in their own research, using widely available software such as Stata and R.

Here you can find out more about our work on applied statistical methods.

Handling missing data

We know different types of people tend to drop out of longitudinal studies over time, depending on their individual circumstances and characteristics.

To support researchers of CLS cohort data deal with this common problem, we have developed comprehensive advice on how to deal with missing data and reduce bias.

Find out more on our Handling missing data page.

Causal inference

Causal inference in observational data is far from straightforward. However, the wealth of information we have collected about cohort members about the whole of their lives, and even about the circumstances of their birth, gives data users the opportunity to select rich controls for multivariable adjustment.

There are also circumstances in which causal identification may also be achieved with approaches such as instrumental variable modelling/Mendelian randomisation, regression discontinuity, and fixed effects/correlated random effects methods.

We are developing a programme of work on causal inference in our cohorts, which will use a range of methods. We will use directed acyclic graphs to represent assumptions about potential confounders, and apply techniques such as negative controls and simulations of unmeasured confounders to test the degree to which omitted variables might lead to bias.

Measurement error

Data from self-reported measures can be biased due to processes driven by cohort members’ personalities and circumstances. On the other hand, data from objective measures may also be affected by instrumental errors, for example the precision of the blood pressure device used by the nurse or laboratory variations in blood analysis. Additional sources of error arise when comparing data from multiple studies as there can be variation in how different groups interpret the same question and in response tendencies.

In our work on measurement error, we use the latest extensions of the generalised latent variable modelling framework to specify complex error structures. This allows us to investigate the properties of some key areas of measurement of our cohort data, including on physical health, mental health and cognition, and to establish within and between cohort equivalent measures.

Two ongoing projects funded by Closer investigate the measurement properties of mental health and cognitive ability measures in British birth cohorts.

Survey mode effects

Each of CLS’ cohort studies contains elements of mixed mode data collection. This can include: carrying out interviews via face-to-face, telephone, video and/or web survey.

The potential advantages of mixed mode data collection are lower costs, increased efficiency, and higher participation rates.

However, participants’ responses may differ systematically between survey modes used – this is termed “mode effects”. For instance, the presentation of a survey item either aurally or visually can influence responses and sensitive information may be reported more accurately when given anonymously.

Unaccounted for, mode effects may lead to bias in analyses.

User guide

To help data users work with mixed mode data, we have developed a comprehensive Handling Mode Effects user guide.

This guide:

provides frameworks and relevant empirical evidence to help researchers think about the possible consequences of mode effects in their own analyses
describes methods for handling mode effects, including their strengths and limitations
highlights sensitivity analysis as a particularly promising approach
provides walkthroughs for these methods with code in R and Stata
contains recommendations data users may want to follow in their own work.

Download the Handling mode effects user guide summary.

Download the Handling mode effects user guide.

Research projects and outputs

USER GUIDE SUMMARY

Handling mode effects in the CLS cohort studies user guide summary (Nov 2024)

26 November 2024

A summary to a user guide which provides guidance and recommendations for handling mode effects in CLS’ cohort studies through applied examples, using data from…

Download

User guide

Handling mode effects in the CLS cohort studies user guide (Nov 2024)

26 November 2024

A user guide which provides guidance and recommendations for handling mode effects in CLS’ cohort studies through applied examples, using data from the National Child…

Download

USER GUIDE

Handling missing data in the CLS cohort studies - User Guide

This user guide aims to describe and illustrate a straightforward approach to missing data handling, while detailing some more general considerations around missing data along the way.

Download

Webinar recording

Handling missing data in the 1970 British Cohort Study

Watch our webinar on CLS’ missing data strategy and how to tackle missing data in BCS70.

Webinar recording

Handling missing data in the British cohort studies (2023)

Watch our webinar on ways to address missing data in the CLS cohort studies.

Featured scientific publications

Silverwood, R.J., Calderwood, L., Henderson, M., Sakshaug, J.W. & Ploubidis, G.B.(2024)

A data-driven approach to understanding non-response and restoring sample representativeness in the UK Next Steps cohort

Longitudinal and Life Course Studies

Read the full paper

Narayanan, M.K., Dodgeon, B., Katsoulis, M., Ploubidis, G.B. & Silverwood, R.J. (2024)

How to mitigate selection bias in COVID-19 surveys: evidence from five national cohorts

European Journal of Epidemiology

Read the full paper

Katsoulis, M., Narayanan, M.K., Dodgeon, B., Ploubidis, G.B. & Silverwood, R.J. (2024)

A data driven approach to address missing data in the 1970 British birth cohort

medRxiv

Read the full paper

Rajah, N., Calderwood, L., De Stavola, B.L., Harron, K., Ploubidis G.B. & Silverwood, R.J. (2023)

Using linked administrative data to aid the handling of non-response and restore sample representativeness in cohort studies: the 1958 national child development study and hospital episode statistics data

BMC Medical Research Methodology

Read the full paper

Goodman, A., Brown, M., Silverwood, R.J., Sakshaug, J.W., Calderwood, L., Williams, J. & Ploubidis, G.B. (2022)

The Impact of Using the Web in a Mixed-Mode Follow-up of a Longitudinal Birth Cohort Study: Evidence from the National Child Development Study

Journal of the Royal Statistical Society

Read the full paper

Mostafa, T., Narayanan, M., Pongiglione, B., Dodgeon, B., Goodman, A., Silverwood, R.J., & G.B. Ploubidis, G.B. (2021)

Missing at random assumption made more plausible: evidence from the 1958 British birth cohort

Journal of Clinical Epidemiology

Read the full paper

Contact us

Centre for Longitudinal Studies
UCL Social Research Institute

20 Bedford Way
London WC1H 0AL

Email: clsdata@ucl.ac.uk

Applied statistical methods

Background

Handling missing data

Causal inference

Measurement error

Survey mode effects

User guide

Research projects and outputs

Handling mode effects in the CLS cohort studies user guide summary (Nov 2024)

Handling mode effects in the CLS cohort studies user guide (Nov 2024)

Handling missing data in the CLS cohort studies - User Guide

Handling missing data in the 1970 British Cohort Study

Handling missing data in the British cohort studies (2023)

Featured scientific publications

Contact us

Funded by

Follow us