Good TREs work
Raw data
This page provides links to the data structures and data sources that were used to create TheySoldItAnyway.com and GoodTRESWork.com. These can be reused by anyone wishing to experiment with other ways to present similar information.
N.B. Please do not fetch these files each time you wish to use them. The files are very large, and not cached by our CDN. They are updated (approximately) every month, after NHS Digital has published its updated register.
If you wish to cite the source of this data, the citation can be to GoodTREsWork.com; there is no requirement to mention medConfidential, or TheySoldItAnyway.com.
Start here: Standard format files
These files are for input to an ETL process, and not for remote loading from webpages, but have a consistent structure
- UKSA (this is the simplest file)
- ONS
- IDS
- CPRD
- NHSD 2018 onwards
- ADRUK
- London SDE
- OpenSAFELY/NHSEngland
- through NHS Digital and NHS England COVID special process registers
Each file is structured thus:
{projects: { projectid : { publicbody:"...", projectid:"..." , who:, what, where, when, why, dataset, url, optouts, outcomes, htmlblob }, ... }}
The htmlblob is what we display on TSIA.
Some unpublished fields in some sources may have generic text.
The optouts field is the best available information on consent/dissent for a particular project or study, and will need some editorial processing to be readily comparable.
We are happy to add files for other bodies that publish authoritative, accessible data registers. (We note that HDR UK does neither of these things, having IBM 'help with' the gateway to avoid HDR needing to understand or take responsibility for implementations.)
Raw files
These files are much closer to the original information, as released by the relevant body. They follow almost no standard, and can contain much more detail than the standard format files linked above. These files are close to the data structures we use internally, and as a result are always subject to change. They do, however, enable you to do things that the standard JSON does not allow, and you can map between one and the other via projectIDs.
- NHS Digital register 2018 onwards
- CPRD as published (xls)
- UKSA DEA as published (xls)
- ADRUK
- ONS
- ONS
- SDE London
- OpenSAFELY
- through COVID special process registers
Due to their simplicity, the UKSA raw and standard format files are identical, and this one file is also available as a spreadsheet.
Please send any suggestions / bugs to sam @ medconfidential.org