Abstract
Objectives To address the lack of individual-level socioeconomic information in electronic health care records, we linked the 2011 census of England and Wales to patient records from a large mental healthcare provider. This paper describes the linkage process and methods for mitigating bias due to non-matching.
Setting South London and Maudsley NHS Foundation Trust (SLaM), a mental health care provider in southeast London.
Design Clinical records from SLaM were supplied to the Office of National Statistics (ONS) for link-age to the census through a deterministic matching algorithm. We examined clinical (ICD-10 diagnosis, history of hospitalisation, frequency of service contact) and sociodemographic (age, gender, ethnicity, deprivation) information recorded in CRIS as predictors of linkage success with the 2011 Census. To assess and adjust for potential biases caused by non-matching, we evaluated inverse probability weighting for mortality associations.
Participants Individuals of all ages in contact with SLaM up until December 2019 (N=459,374).
Outcome measures Likelihood of mental health records’ linkage to census.
Results 220,864 (50.4%) records from CRIS linked to the 2011 census. Young adults (Prevalence ratio (PR) 0.80, 95% CI 0.80-0.81), individuals living in more deprived areas (PR 0.78,0.78-0.79), and minority ethnic groups (e.g., Black African, PR 0.67, 0.66-0.68) were less likely to match to census. After implementing inverse probability weighting, we observed little change in the strength of association between clinical/demographic characteristics and mortality (e.g., presence of any psychiatric disorder: unweighted PR 2.66, 95% CI 2.52, 2.80; weighted PR 2.70, 95% CI 2.56, 2.84)
Conclusions Lower response rates to the 2011 census amongst people with psychiatric disorders may have contributed to lower match rates, a potential concern as the census informs service planning and allocation of resources. Due to its size and unique characteristics, the linked dataset will enable novel investigations into the relationship between socioeconomic factors and psychiatric disorders.
Strengths and limitations of this study
This is the first time mental healthcare electronic records have been linked to ONS census at the individual-level in England. Due to its scale, ethnic diversity and demographic characteristics, and abundance of detailed information on a variety of socioeconomic and demographic indicators acquired through the linkage to census records, this dataset will enable novel investigations into the causes, trajectories and outcomes of psychiatric disorders.
A significant strength of the study is that we could assess and adjust for potential biases caused by non-matching related to age, gender and deprivation.
Whilst we observed differences between individuals that matched to census, and those that did not, our weighted analyses were able to show that these differences did not substantially alter associations with mortality outcomes.
Due to the nature of the deterministic linkage algorithm, we could not determine the causes of non-linkage.
Competing Interest Statement
MH is principal investigator of the RADAR-CNS, a pre-competitive public-private collaboration on mobile health funded by the Innovative Medicine Initiative with cash and in-kind contributions paid to the university from Janssen, Lundbeck, UCB, MSD and Biogen. RS declares research support in the last 3 years from Janssen, GSK and Takeda. All other authors have no conflicts of interest to declare.
Funding Statement
This paper represents independent research part-funded by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London. LC and MW are supported by a grant from the ESRC (ES/S002715/1). RH is currently funded by a doctoral studentship granted by the UKRI ESRC LISS-DTP managed by King's College London. JD and CM are part supported by the ESRC Centre for Society and Mental Health at King's College London (ESRC Reference: ES/S012567/1) and by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London and the National Institute for Health Research (NIHR) Applied Research Collaboration South London (NIHR ARC South London) at King's College Hospital NHS Foundation Trust. MH is a NIHR Senior Investigator. RS is part-funded by: i) the National Institute for Health Research (NIHR) Maudsley Biomedical Research Centre at the South London and Maudsley NHS Foundation Trust and King's College London; ii) the NIHR Applied Research Collaboration South London (NIHR ARC South London) at King's College Hospital NHS Foundation Trust; iii) UKRI - Medical Research Council through the DATAMIND HDR UK Mental Health Data Hub (MRC reference: MR/W014386); iv) the UK Prevention Research Partnership (Violence, Health and Society; MR-VO49879/1), an initiative funded by UK Research and Innovation Councils, the Department of Health and Social Care (England) and the UK devolved administrations, and leading health research charities. The views expressed are those of the authors and not necessarily those of the ESRC, NHS, the NIHR or the Department of Health and Social Care or King's College London.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
CRIS has Research Ethics Committee approval as a source of anonymised data for secondary analysis (Oxford REC C, reference 18/SC/0372). The current CRIS-Census linkage was supported through: REC reference for CRIS-Census Linkage: 18/SC/0003. Additional approvals from the Confidential Advisory Group to access patient information without consent, for the purposes of linkage, were obtained (CAG S251 reference: 17/CAG/0204). Approvals were also sought and obtained from the National Statistician's Data Ethics Advisory Committee (NSDEC) for approvals to use linked CRIS-census data for specified projects.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
Data from SLaM are owned by a 3rd party SlaM BRC CRIS tool which provides access to anonymised data derived from SlaM electronic medical records. These data can only be accessed by permitted individuals from within a secure firewall (i.e., remote access is not possible and the data cannot be sent elsewhere) in the same manner as the authors. Our team is interested in supporting collaboration with interested researchers, subject to appropriate approvals and accreditation status. Requests to access data can be directed to jayati.das-munshi@kcl.ac.uk