Retrospective case study to test performance of machine learning: results from Cochrane Heart

Article type
Year
Authors
Martin N1, Thomas J2, Casas J1, Huffman M3, Jonnalagadda S4
1Cochrane Heart, University College London, UK
2Institute of Education, University College London, UK
3Cochrane Heart, Northwestern University Feinberg School of Medicine, Chicago, USA
4Northwestern University, Chicago, USA
Abstract
Background: Screening search results to identify eligible studies for inclusion in systematic reviews is time consuming. Machine learning aims to reduce the workload of screening, but data evaluating the performance are limited.

Project outline: We are therefore conducting a retrospective case study by comparing the performance of machine learning technology to the ‘gold standard’ of duplicate manual screening.

Methods: We included data from published Cochrane Heart Reviews for which search results are available to Cochrane Heart.

Results: Preliminary results for six (out of 40) reviews were presented at the Cochrane UK and Ireland Symposium in Birmingham, UK, in March 2016. These showed that at least 60% of the screening workload could have been saved with no loss in recall. Final results for 40 reviews will be presented at the Colloquium.

Conclusions: Machine learning represents a potential strategy to reduce the workload of screening for systematic reviews. Further research evaluating the performance of machine learning systems and in other fields are needed before this method can be widely adopted