Data Availability
Access to the underlying identifiable and potentially re-identifiable pseudonymised electronic health record data is tightly governed by various legislative and regulatory frameworks, and restricted by best practice. The data in OpenSAFELY is drawn from General Practice data across England where TPP is the Data Processor. TPP developers (Chris Bates, Jonathan Cockburn, John Parry, Frank Hester, and Sam Harper) initiate an automated process to create pseudonymised records in the core OpenSAFELY database, which are copies of key structured data tables in the identifiable records. These are linked onto key external data resources that have also been pseudonymised via SHA-512 one-way hashing of NHS numbers using a shared salt. DataLab developers and PIs (Ben Goldacre, Liam Smeeth, Caroline E Morton, Seb Bacon, Alex J Walker, William Hulme, Helen J Curtis, David Evans, Peter Inglesby, Simon Davy, George Hickman, Krishnan Bhaskaran and Christopher T Rentsch) holding contracts with NHS England have access to the OpenSAFELY pseudonymised data tables as needed to develop the OpenSAFELY tools. These tools in turn enable researchers with OpenSAFELY Data Access Agreements to write and execute code for data management and data analysis without direct access to the underlying raw pseudonymised patient data, and to review the outputs of this code. All code for the full data management pipeline, from raw data to completed results for this analysis, and for the OpenSAFELY platform as a whole is available for review at github.com/OpenSAFELY. The data management and analysis code for this paper was led by MDR and JBG.