“Facts are stubborn, but statistics are more pliable.” (Mark Twain)

Introduction

Whatever the intervention of interest for a given clinical condition or public health problem, it is likely that previous studies have been performed to address it. It is therefore logical that such prior knowledge should be understood and analysed in order to inform decisions and help plan, design and justify future studies. Unfortunately, the heterogeneity of prior knowledge in terms of settings, teams and patient features creates uncertainty about its validity, robustness, significance, relevance and implications for both current knowledge and future investigations. Moreover, it creates opportunities for evidence distortion driven by confirmation, financial and academic biases, among many others.

Faced with such uncertainty, even as far back as 1904, Karl Pearson [1] proposed the use of formal techniques to combine data from different studies in order to examine the preventive effect of serum inoculations against enteric fever. Since then, the prospect of systematically assessing prior knowledge has become even more seductive, given the current global research output, with 20,000 journals publishing more than 2 million articles per year, often with unclear or conflicting results. This task, however, is challenging, no matter how theoretically desirable. In particular, most modern literature remains heterogeneous in quality, design, patient characteristics, diagnoses and outcomes. In addition, many studies involve small sample sizes, are at high risk of type I and type II errors, and lack blinding. In response to such limitations, systematic reviews and meta-analyses (SRMAs) have been promoted as the “path to scientific salvation” by providing the statistical magic that will allow clinicians, public health officials or investigators to determine the possible efficacy or harm of a given intervention [2], despite the use of data from poor-quality, unblinded, heterogeneous trials. Moreover, even though a systematic review does not necessarily involve quantitative statistical pooling (meta-analysis), it is frequently argued that it will provide an important synthesis of prior knowledge and identify significant (and allegedly previously unnoticed) knowledge gaps. Such SRMAs have become particularly fashionable in the era of evidence-based medicine.
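To make the pooling step concrete, consider a minimal sketch of the inverse-variance weighting that underlies most meta-analyses (the notation here is generic and is not drawn from the cited works): given $k$ studies with effect estimates $\hat{\theta}_i$ and within-study variances $v_i$, the fixed-effect pooled estimate is

\[
\hat{\theta} = \frac{\sum_{i=1}^{k} w_i \, \hat{\theta}_i}{\sum_{i=1}^{k} w_i},
\qquad w_i = \frac{1}{v_i},
\qquad \operatorname{Var}(\hat{\theta}) = \frac{1}{\sum_{i=1}^{k} w_i}.
\]

Under a random-effects model, the weights become $w_i = 1/(v_i + \tau^2)$, where $\tau^2$ estimates the between-study variance. Note that nothing in this machinery inspects the quality of the trials supplying the $\hat{\theta}_i$: biased inputs are pooled with the same arithmetic as unbiased ones.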

Evidence-based medicine and the evolution of SRMAs

The advent of the evidence-based medicine and healthcare paradigm in the 1990s provided strong impetus for the use of SRMAs, placing them at the top of the hierarchical pyramid of evidence [3]. In particular, the Cochrane Collaboration was established in 1993 to support the generation and dissemination of SRMAs [4]. Since then, SRMA specialists have attempted to develop standards for how to conduct and report SRMAs, and have created tools to assess the risk of bias or quality of SRMAs [5]. Although such frenetic corrective statistical activity implies that SRMAs carry the same flaws as the data they dredge, SRMAs have proliferated [6], so much so that the concept of synthesizing data from two or more SRMAs has gained traction in healthcare research, yielding SRMAs of SRMAs, also named “overviews” [7] or “umbrella reviews” [8]. New methods have been proposed for dealing with multiple SRMAs published on the same topic and for displaying outcome data in overviews [9]. In addition, network meta-analyses now allow the inclusion of both direct and indirect evidence from individual studies to describe the relative benefits (or harms) of a range of interventions even in the absence of head-to-head comparisons [10]. Lastly, updating the evidence for timely decision-making has been proposed, whether through continuous or trial sequential meta-analysis (TSA) of SRMAs [11] or through cumulative network meta-analysis [12]. Many believe that such approaches will lead to improvements in the quality of SRMAs and to more reliable estimates of intervention effects.
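The indirect-comparison logic behind network meta-analysis can be illustrated with a generic sketch of the adjusted indirect comparison (notation assumed here for illustration, not taken from [10]): if trials supply a direct estimate $d_{AB}$ of treatment A versus B, and other trials supply $d_{CB}$ of treatment C versus B, then an indirect estimate of A versus C is

\[
d_{AC}^{\mathrm{ind}} = d_{AB} - d_{CB},
\qquad
\operatorname{Var}\!\left(d_{AC}^{\mathrm{ind}}\right) = \operatorname{Var}(d_{AB}) + \operatorname{Var}(d_{CB}),
\]

assuming the two direct comparisons are independent. The variances add, so the indirect estimate is less precise than either direct comparison, and any bias in either arm of the network propagates into every estimate derived from it.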

Irrespective of the SRMA technique, all of the above developments now ensure that a veritable tsunami of SRMAs will drown clinicians, public health officials and investigators in perpetuity. Indeed, their production has reached “epidemic proportions” [13]. We are thus in the middle of an ideological, publication-fuelled bubble which, like the alchemists of centuries past, promises to make gold out of clods of earth. Faced with this onslaught, it is legitimate to ask the key question: are SRMAs “useful research”?

Why SRMAs are not useful research

There are several reasons why SRMAs are not useful research. First, they are not new research at all; rather, they are summaries and statistical transformations of existing research based on empirically untested assumptions. Such analytical processes are often managed by SRMA professionals who, academically and somewhat symbiotically, benefit from the years of effort expended by the investigators and research teams who conducted the clinical trials. Second, SRMAs do not consider, and cannot estimate, the biological plausibility, reproducibility and external validity of their results [14]. Third, they have limited ability to predict the results of subsequent randomized controlled trials (RCTs), often producing a misleading assessment of knowledge [15]. Fourth, understanding prior knowledge in a useful way requires, as with all science, a systematic approach that delivers reproducible outcomes. SRMAs often fail this test: SRMAs drawing on seemingly almost identical sources are sometimes published with different conclusions. Fifth, the conclusions of SRMAs are typically unhelpful, amounting to sentences such as “better studies are needed” or “there is insufficient evidence” (Table 1). Conclusions of this kind are often also reported after the non-pivotal, underpowered RCTs for whose poor-quality evidence the SRMAs are supposed to compensate; expert assessment of such prior knowledge could easily have reached the same verdict in a single line. In this regard, the concerns associated with SRMAs are similar to those associated with low-quality research. Unfortunately, SRMAs sometimes amplify the reach of such low-quality work rather than correct its shortcomings.

Table 1 The 10 main issues of systematic reviews and meta-analyses (SRMAs)

Conclusions

The synthesis of prior knowledge and the assessment of its quality are essential to scientific progress. SRMAs offer an (often flawed) means of delivering such synthesis and assessment, but they are not useful research in themselves because they provide no novel information and deliver no experimental results. SRMAs are often based on poor studies and low-quality primary evidence and thus cannot deliver useful, robust information or insights (garbage in = garbage out). SRMAs often confer a whiff of legitimacy on work that should be dismissed and are used as tools to support specific agendas. Finally, SRMAs promote misleading views among readers by placing a single-centre, unblinded study of 20 patients in the same forest plot as a 2000-patient, multicentre, double-blind RCT. Any rational consideration of the consequences of such an approach must lead to the logical conclusion that, in general, SRMAs are not useful research and can never be a substitute for reasoned and careful assessment of the literature.