Abstract
In frontier analysis, most of the nonparametric approaches (DEA, FDH) are based on envelopment ideas which suppose that with probability one, all the observed units belong to the attainable set. In these “deterministic” frontier models, statistical theory is now mostly available (Simar and Wilson, 2000a). In the presence of super-efficient outliers, envelopment estimators could behave dramatically since they are very sensitive to extreme observations. Some recent results from Cazals et al. (2002) on robust nonparametric frontier estimators may be used in order to detect outliers by defining a new DEA/FDH “deterministic” type estimator which does not envelop all the data points and so is more robust to extreme data points. In this paper, we summarize the main results of Cazals et al. (2002) and we show how this tool can be used for detecting outliers when using the classical DEA/FDH estimators or any parametric techniques. We propose a methodology implementing the tool and we illustrate through some numerical examples with simulated and real data. The method should be used in a first step, as an exploratory data analysis, before using any frontier estimation.
Similar content being viewed by others
References
Barnett, V. and T. Lewis. (1995). Outliers in Statistical Data. Chichester: Wiley.
Cazals, C., J. P. Florens and L. Simar. (2002).“Nonparametric Frontier Estimation: A Robust Approach”Journal of Econometrics 106, 1–25.
Charnes, A., W. W. Cooper and E. Rhodes. (1978).“Measuring the Inefficiency of Decision Making Units”European Journal of Operational Research 2, 429–444.
Charnes, A., W. W. Cooper and E. Rhodes. (1981).“Evaluating Program and Managerial Efficiency: An Application of Data Envelopment Analysis to Program Follow Through”Management Science 27, 668–697.
Christensen, L. and R. Greene. (1976).“Economies of Scale in U.S. Electric Power Generation”Journal of Political Economy 84, 653–667.
Davies, P. L. and U. Gather. (1993).“The Identification of Multiple Outliers”Journal of the American Statistical Association 88(423), 782–801.
Debreu, G. (1951).“The Coefficient of Resource Utilization”Econometrica 19(3), 273–292.
Deprins, D., L. Simar and H. Tulkens. (1984).“Measuring Labor Inefficiency in Post Offices”In M. Marchand, P. Pestieau and H. Tulkens (eds.), The Performance of Public Enterprises: Concepts and measurements. North-Holland: Amsterdam, 243–267.
Farrell, M. J. (1957).“The Measurement of Productive Efficiency”Journal of the Royal Statistical Society, Series A 120, 253–281.
Gijbels, I., E. Mammen, B. U. Park and L. Simar. (1999).“On Estimation of Monotone and Concave Frontier Functions”Journal of the American Statistical Association 94(445), 220–228.
Greene, W. H. (1990).“A Gamma-Distributed Stochastic Frontier Model”Journal of Econometrics 46, 141–163.
Hall, P. and L. Simar. (2002).“Estimating a change point, boundary or frontier in the presence of observation errors”Journal of the American Statistical Association 97, 523–534.
Kneip, A., B. U. Park and L. Simar. (1998).“A Note on the Convergence of Nonparametric DEA Estimators for Production Efficiency Scores”Econometric Theory 14, 783–793.
Koopmans, T. C. (1951).“An Analysis of Production as an Efficient Combination of Activities”In T. C. Koopmans (ed.), Activity Analysis of Production and Allocation. Cowles Commission for Research in Economics, Monograph 13. New York: John Wiley and Sons, Inc.
Olesen, O. B., N. C. Petersen and C. A. K. Lovell (eds.) (1996).“Summary of Workshop Discussion”Journal of Productivity Analysis 7, 341–345.
Park, B. L. Simar and Ch. Weiner. (2000).“The FDH Estimator for Productivity Efficiency Scores: Asymptotic Properties”Econometric Theory 16, 855–877.
Shephard, R. W. (1970). Theory of Cost and Production Function. Princeton: Princeton University Press.
Simar, L. (2003). How to improve the performances of DEA/FDH estimators in the presence of noise? Discussion paper no. 0323, Institut de Statistique, UCL, Louvende, Neuve, Belgium.
Simar, L. and P. W. Wilson. (1998).“Sensitivity Analysis of Efficiency Scores: How to Bootstrap in Nonparametric Frontier Models”Management Science 44(11), 49–61.
Simar, L. and P. W. Wilson. (2000a).“Statistical Inference in Nonparametric Frontier Models: The State of the Art”Journal of Productivity Analysis 13, 49–78.
Simar, L. and P. W. Wilson. (2000b).“A General Methodology for Bootstrapping in Nonparametric Frontier Models”Journal of Applied Statistics 27(6), 779–802.
Wilson, P. W. (1993).“Detecting Outliers in Deterministic Nonparametric Frontier Models with Multiple Outputs”Journal of Business and Economic Statistics 11, 319–323.
Wilson, P. W. (1995).“Detecting Influential Observations in Data Envelopment Analysis”Journal of Productivity Analysis 6, 27–45.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Simar, L. Detecting Outliers in Frontier Models: A Simple Approach. Journal of Productivity Analysis 20, 391–424 (2003). https://doi.org/10.1023/A:1027308001925
Issue Date:
DOI: https://doi.org/10.1023/A:1027308001925