Article type
Abstract
Background:
The identification of RCTs for inclusion in Cochrane reviews is an extremely labour-intensive task. Machine learning may be able to reduce the manual burden of study identification, and previous evaluations have shown that recall of more than 99% is attainable whilst excluding more than 75% of citations automatically; leaving less than 25% for manual checking.Objectives:
To evaluate the performance of a machine-learning classifier to reduce manual workload in screening in terms of the burden of screening saved and the ‘cost’ in terms of relevant studies being erroneously excluded.Methods:
A machine-learning classifier was built to distinguish between RCTs and non-RCTs using more than 280 000 records from the Cochrane Crowd.In January 2017, the classifier was applied to all 94 305 citations to studies included in published Cochrane reviews that had an inclusion criterion of including RCTs only. Records with no abstract were counted as ‘identified’ on the assumption they would have been manually checked.