Abstract
The statistical analysis of results from inter-laboratory comparisons (for example Key Comparisons, or Supplemental Comparisons) produces an estimate of the measurand (reference value) and statements of equivalence of the results from the participating laboratories. Methods to estimate the reference value have been proposed that rest on the idea of finding a so-called consistent subset of laboratories, that is, eliminating allegedly outlying participants. We propose an alternative statistical model that accommodates all participant data and incorporates the dispersion of the measurement values obtained by different laboratories into the total uncertainty of the various estimates. This model recognizes the fact that the dispersion of values between laboratories often is substantially larger than the measurement uncertainties provided by the participating laboratories. We illustrate the methods on data from key comparison CCQM–K25.
Similar content being viewed by others
References
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B 57:289–300
Bureau International des Poids et Mesures (BIPM), International Electrotechnical Commission (IEC), International Federation of Clinical Chemistry (IFCC), International Organization for Standardization (ISO), International Union of Pure and Applied Chemistry (IUPAC), International Union of Pure and Applied Physics (IUPAP), and International Organization of Legal Metrology (OIML) (1995) Guide to the expression of uncertainty in measurement. ISO/IEC Guide 98:1995. International Organization for Standardization (ISO), Geneva, Switzerland
Chunovkina AG, Elster C, Lira I, Wöger W (2008) Analysis of key comparison data and laboratory biases. Metrologia 45(2):211–216
Comité international des poids et mesures (CIPM). Mutual recognition of national measurement standards and of calibration and measurement certificates issued by national metrology institutes. Bureau International des Poids et Mesures (BIPM), Pavillon de Breteuil, Sèvres, France, October 14th 1999. Technical Supplement revised in October 2003
Cox MG (2002) The evaluation of key comparison data. Metrologia 39:589–595
Cox MG (2007) The evaluation of key comparison data: determining the largest consistent subset. Metrologia 44:187–200
Davison AC, Hinkley D (1997) Bootstrap methods and their applications. Cambridge University Press, New York
Decker JE, Brown N, Cox MG, Steele AG, Douglas RJ (2006) Recent recommendations of the consultative committee for length (CCL) regarding strategies for evaluating key comparison data. Metrologia 43:L51–L55
Hochberg Y, Tamhane A (1987) Multiple comparison procedures. Wiley, New York
Iyer HK, Wang CM, Vecchia DF (2004) Consistency tests for key comparison data. Metrologia 41:223–230
Kacker R, Forbes A, Kessel R, Sommer K-D (2008) Classical and Bayesian interpretation of the Birge test of consistency and its generalized version for correlated results from interlaboratory evaluations. Metrologia 45(3):257–264
Lunn DJ, Thomas A, Best N, Spiegelhalter D (2000) WinBUGS—a Bayesian modelling framework: concepts, structure, and extensibility. Stat Comput 10:325–337
Martin AD, Quinn KM, Park JH (2008) MCMCpack: Markov chain Monte Carlo (MCMC) Package, (2008) URL http://mcmcpack.wustl.ed.. R package version 0.9-5
Pinheiro JC, Bates DM (2000) Mixed-effects models in S and S-Plus. Springer, New York
Possolo A, Toman B (2007) Assessment of measurement uncertainty via observation equations. Metrologia 44:464–475
R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, (2008) URL http://www.R-project.or. ISBN 3-900051-07-0
Schantz M, Wise S (2004) CCQM–K25: determination of PCB congeners in sediment. Metrologia 41(Technical Supplement):08001
Searle SR (1971) Linear models. Wiley, New York
Searle SR, Casella G, McCulloch CE (1992) Variance components. Wiley, New York
Toman B (2007a) Bayesian approaches to calculating a reference value in key comparison experiments. Technometrics 49(1):81–87
Toman B (2007b) Statistical interpretation of key comparison degrees of equivalence based on distributions of belief. Metrologia 44(2):L14–L17
Wasserman L (2004) All of statistics, a concise course in statistical inference. Springer, New York
Zhang NF (2006) The uncertainty associated with the weighted mean of measurement data. Metrologia 43(3):195–204
Acknowledgments
The authors are grateful to Paul De Bièvre for his interest in a discussion of laboratory effects models in the context of a CCQM Key Comparison, and also wish to express their appreciation for the many useful comments that Michele Schantz, Will Guthrie, Hung-kung Liu, and Nien-fan Zhang (all from NIST) made on an early draft of this contribution.
Author information
Authors and Affiliations
Corresponding author
Additional information
Papers published in this section do not necessarily reflect the opinion of the Editors, the Editorial Board and the Publisher.
An erratum to this article is available at http://dx.doi.org/10.1007/s00769-010-0707-4.
Rights and permissions
About this article
Cite this article
Toman, B., Possolo, A. Laboratory effects models for interlaboratory comparisons. Accred Qual Assur 14, 553–563 (2009). https://doi.org/10.1007/s00769-009-0547-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00769-009-0547-2