This is an outdated version published on 2026-06-12. Read the most recent version.

Cochrane Data Extractor: Automated Dataset Harvesting from Cochrane Systematic Reviews

Authors

DOI:

https://doi.org/10.66040/7w81q996

Abstract

Is automated extraction the way forward for extracting from the large-scale Cochrane datasets that Cochrane has started publishing? We developed an automated Python pipeline to download and organise the study-level data that Cochrane has put up on its website (pairwise, DTA, NMA). We designed it to check the data links and then download, validate, and organise the data into standardised datasets with covariates. The test set showed 11 pairwise datasets with 1,295 data rows at high accuracy. Uninterrupted runs were then carried out until 501 Cochrane reviews had been downloaded and their data organised into the Pairwise70 repository. We converted the Cochrane open data, which is distributed and hard to use, into a single data artefact to support meta-analysis research. This tool is limited to recent Cochrane reviews and does not work on other databases.

Published

2026-06-06 — Updated on 2026-06-12

Versions

Issue

Section

Methods Note

How to Cite

Cochrane Data Extractor: Automated Dataset Harvesting from Cochrane Systematic Reviews. (2026). Synthēsis, 2(4). https://www.synthesis-medicine.org/index.php/journal/article/view/108 (Original work published 2026)

Most read articles by the same author(s)

<< < 1 2 3