Abstract
The measurement of well-being of people is very difficult because it is characterized by a multiplicity of aspects or dimensions. Principal Components Analysis (PCA) is probably the most popular multivariate statistical technique for reducing data with many dimensions and, often, well-being indicators are reduced to a single index of well-being by using PCA. However, PCA is implicitly based on a reflective measurement model that is not suitable for all types of indicators. In this paper, we discuss the use and misuse of PCA for measuring well-being, and we show some applications to real data.
Similar content being viewed by others
Notes
For the sake of simplicity, only linear models will be considered.
Because the formative measurement model is based on a multiple regression, the stability of the coefficients λi is affected by the strength of the indicator intercorrelations. Therefore, multicollinearity must be avoided. (Diamantopoulos and Winklhofer 2001).
Individual indicators must have at least an interval level of measurement. For variables measured on nominal or ordinal scale, we recommend the use of Categorical Principal Components Analysis (CATPCA). For a introduction and application of CATPCA, see Linting et al. (2007).
Often, the use of the first principal component as the ‘only’ composite index is a bad practice that reduces the PCA potentials.
Normalization aims to make individual indicators comparable, as they often have different measurement units and/or different polarities. Normalized indicators are calculated by transforming individual indicators into pure, dimensionless, numbers, with positive polarity (Mazziotta and Pareto 2017).
Principal components can be real features of the data, or more or less convenient fictions and summaries. That they are real is a hypothesis for which PCA can provide only a very weak evidence (Shalizi 2009).
STATIS is an exploratory technique of multivariate data analysis for handling three-way matrices, where the same units have measures on a set of indicators under a number of conditions (Lavit et al. 1994).
There are two types of FA: exploratory and confirmatory. In this paper, we consider exploratory factor analysis (Fabrigar and Wegener 2011).
Individual indicators were normalized as z-scores. The signs were reversed if the polarity is negative.
The first factor of PCA accounts for 72.4% of the variance in the data.
Note that, for constructing a composite index, all the normalized indicators must have positive polarity, so that an increase in each of them corresponds to an increase in the composite index (Mazziotta and Pareto 2013).
Influence Analysis is a particular case of Uncertainty Analysis that aims to empirically quantify the ‘weight’ of each individual indicator in the calculation of the composite index (Mazziotta and Pareto 2017).
Note that only individual indicators are released by Istat at the provincial level.
A pillar describes a particular aspect—not directly observable—of the latent phenomenon by a set of individual indicators which are assumed to be related to it.
References
Biswas, B., & Caliendo, F. (2002). A multivariate analysis of the human development index. Indian Economic Journal, 49, 96–100.
Bleys, B. (2012). Beyond GDP: Classifying alternative measures for progress. Social Indicators Research, 109, 355–376.
Boarini, R., Kolev, A., & McGregor, A. (2014). Measuring well-being and progress in countries at different stages of development: Towards a more universal conceptual framework. Working Paper No. 325. OECD Development Centre.
Bohrnstedt, G. W. (1970). Reliability and validity assessment in attitude measurement. In G. F. Summers (Ed.), Attitude measurement (pp. 80–99). London: Rand McNally.
Booysen, F. (2002). An overview and evaluation of composite indices of development. Social Indicators Research, 59, 115–151.
Borsboom, D., Mellenbergh, G. J., & Heerden, J. V. (2003). The theoretical status of latent variables. Psychological Review, 110, 203–219.
Cadogan, J. W., & Lee, N. (2013). Improper use of endogenous formative variables. Journal of Business Research, 66, 233–241.
Chelli, F., Ciommi, M., Emili, A., Gigliarano, C., & Taralli, S. (2017). A new class of composite indicators for measuring well-being at the local level: An application to the Equitable and Sustainable Well-being (BES) of the Italian Provinces. Ecological Indicators, 76, 281–296.
Coltman, T., Devinney, T. M., Midgley, D. F., & Venaik, S. (2008). Formative versus reflective measurement models: Two applications of formative measurement. Journal of Business Research, 61, 1250–1262.
Diamantopoulos, A. (2006). The error term in formative measurement models: Interpretation and modeling implications. Journal of Modeling in Management, 1, 7–17.
Diamantopoulos, A., Riefler, P., & Roth, K. P. (2008). Advancing formative measurement models. Journal of Business Research, 61, 1203–1218.
Diamantopoulos, A., & Winklhofer, H. M. (2001). Index construction with formative indicators: An alternative to scale development. Journal of Marketing Research, 38, 269–277.
Dunteman, G. H. (1989). Principal components analysis. Newbury Park: Sage.
Edwards, J. R., & Bagozzi, R. P. (2000). On the nature and direction of relationships between constructs and measures. Psychological Methods, 5, 155–174.
Fabrigar, L. F., & Wegener, D. T. (2011). Exploratory factor analysis. New York: Oxford University Press.
Fayers, P. M., & Hand, D. J. (2002). Causal variables, indicator variables and measurement scales: An example from quality of life. Journal of the Royal Statistical Society, Series A, 165, 233–261.
Ferrara, A. R., & Nisticò, R. (2014). Measuring well-being in a multidimensional perspective: A multivariate statistical application to Italian regions. Working Paper, 6. Dipartimento di Economia, Statistica e Finanza, Università della Calabria.
Filzmoser, P. (1999). Robust principal components and factor analysis in the geostatistical treatment of environmental data. Environmetrics, 10, 363–375.
Götz, O., Liehr-Gobbers, K., & Krafft, M. (2010). Evaluation of structural equation models using the partial least squares (PLS) approach. In V. Esposito Vinzi, W. W. Chin, J. Henseler, & H. Wang (Eds.), Handbook of partial least squares: Concepts, methods, and applications (pp. 691–711). Berlin: Springer.
Guttman, L. (1954). Some necessary conditions for common factor analysis. Psychometrika, 19, 149–161.
Haq, R., & Zia, U. (2013). Multidimensional wellbeing: An index of quality of life in a developing economy. Social Indicators Research, 114, 997–1012.
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417–441.
Howell, R. D., Breivik, E., & Wilcox, J. B. (2007). Reconsidering formative measurement. Psychological Methods, 12, 205–218.
Hubert, M., Rousseeuw, P. J., & Vanden Branden, K. (2005). Robpca: A new approach to robust principal component analysis. Technometrics, 47, 64–79.
Istat (2015a). BES 2015. Il benessere equo e sostenibile in Italia. http://www.istat.it/it/files/2015/12/Rapporto_BES_2015.pdf. Accessed 21 May 2018.
Istat (2015b). Il benessere equo e sostenibile delle province. http://www.besdelleprovince.it/fileadmin/grpmnt/1225/pubblicazione_nazionale.pdf. Accessed 21 May 2018.
Jarvis, C. B., Mackenzie, S. B., & Podsakoff, P. M. (2003). A critical review of construct indicators and measurement model misspecification in marketing and consumer research. Journal of Consumer Research, 30, 199–218.
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical Transactions of the Royal Society, A, 374, 20150202. https://doi.org/10.1098/rsta.2015.0202.
Kaiser, H. F. (1961). A note on Guttman’s lower bound for the number of common factors. British Journal of Mathematical ans Statistical Psychology, 14, 1–2.
Kendall, M. G., & Stuart, A. (1968). The advanced theory of statistics (Vol. 3). London: Charles Griffin & Co.
Krishnakumar, J., & Nagar, A. L. (2008). On exact statistical properties of multidimensional indices based on principal components, factor analysis, MIMIC and structural equation models. Social Indicators Research, 86, 481–496.
Lai, D. (2003). Principal component analysis on human development indicators of China. Social Indicators Research, 61, 319–330.
Lavit, C., Escoufier, Y., Sabatier, R., & Traissac, P. (1994). The ACT (STATIS method). Computational Statistics & Data Analysis, 18, 97–119.
Linting, M., Meulman, J. J., Groenen, P. J. F., & Van der Kooij, A. J. (2007). Nonlinear principal components analysis: Introduction and application. Psychological Methods, 12, 336–358.
Maggino, F. (2017). Developing indicators and managing the complexity. In F. Maggino (Ed.), Complexity in society: From indicators construction to their synthesis (Vol. 70, pp. 87–114)., Social indicators research series Cham: Springer.
Maggino, F., & Zumbo, B. D. (2012). Measuring the quality of life and the construction of social indicators. In K. C. Land, A. C. Michalos, & M. J. Sirgy (Eds.), Handbook of social indicators and quality-of-life research (pp. 201–238). Dordrecht: Springer.
Markus, K. A., & Borsboom, D. (2013). Frontiers of test validity theory. Measurement, causation, and meaning. New York: Routledge.
Mazziotta, M., & Pareto, A. (2013). Methods for constructing composite indices: One for all or all for one. Rivista Italiana di Economia Demografia e Statistica, LXVII(2), 67–80.
Mazziotta, M., & Pareto, A. (2016a). On a generalized non-compensatory composite index for measuring socio-economic phenomena. Social Indicators Research, 127, 983–1003.
Mazziotta, M., & Pareto, A. (2016b). On the construction of composite indices by principal components analysis. Rivista Italiana di Economia Demografia e Statistica, LXX(1), 103–109.
Mazziotta, M., & Pareto, A. (2017). Synthesis of indicators: The composite indicators approach. In F. Maggino (Ed.), Complexity in society: From indicators construction to their synthesis (Vol. 70, pp. 159–191)., Social indicators research series Cham: Springer.
McGillivray, M. (2005). Measuring non-economic well-being achievement. Review of Income and Wealth, 51, 337–364.
Michalos, A. C. (2014). Encyclopedia of quality of life and well-being research. Dordrecht: Springer.
Michalos, A. C., Smale, B., Labonté, R., Muharjarine, N., Scott, K., Moore, K., et al. (2011). The Canadian Index of wellbeing. Technical report 1.0. Waterloo, ON: Canadian Index of Wellbeing and University of Waterloo.
Mishra, S. K. (2007). A comparative study of various inclusive indices and the index constructed by the principal components analysis. MPRA Paper, No. 3377. MPRA. http://mpra.ub.uni-muenchen.de/3377. Accessed 21 May 2018.
Mishra, S. K. (2008). On Construction of Robust Composite Indices by Linear Aggregation. SSRN. http://ssrn.com/abstract=1147964. Accessed 21 May 2018.
Nahman, A., Mahumani, B. K., & De Lange, W. J. (2016). Beyond GDP: Towards a green economy index. Development Southern Africa. https://doi.org/10.1080/0376835X.2015.1120649.
OECD. (2004). The OECD-JRC handbook on practices for developing composite indicators. Paper presented at the OECD Committee on Statistics, 7–8 June 2004, OECD, Paris.
OECD. (2008). Handbook on constructing composite indicators. Methodology and user guide. Paris: OECD Publications.
OECD. (2015). How’s life? 2015: Measuring well-being. Paris: OECD Publishing.
Osborne, J. W. (2014). Best practices in exploratory factor analysis. Newbury Park: Jason W. Osborne.
Ram, R. (1982). Composite indices of physical quality of life, basic needs fulfilment, and income: A principal component representation. Journal of Development Economics, 11, 227–247.
Salzman, J. (2003). Methodological choices encountered in the construction of composite indices of economic and social well-Being. Technical Report. Center for the Study of Living Standards, Ottawa.
Sen, A. K. (1985). Commodities and capabilities. Amsterdam: North Holland Publishing Company.
Shalizi C. R. (2009). The truth about principal components and factor analysis. http://www.stat.cmu.edu/~cshalizi/350/lectures/13/lecture-13.pdf.
Shwartz, M., Restuccia, J. D., & Rosen, A. K. (2015). Composite measures of health care provider performance: A description of approaches. The Milbank Quarterly, 93, 788–825.
Simonetto, A. (2012). Formative and reflective models: State of the art. Electronic Journal of Applied Statistical Analysis, 5, 452–457.
Slottje, D. J. (1991). Measuring the quality of life across countries. The Review of Economics and Statistics, 73, 684–693.
Somarriba, N., & Pena, B. (2009). Synthetic indicators of quality of life in Europe. Social Indicators Research, 94, 115–133.
Stiglitz, J., Sen, A. K., & Fitoussi, J. P. (2009). Report of the commission on the measurement of economic performance and social progress. Paris. Available online from the Commission on the Measurement of Economic Performance and Social Progress: http://www.stiglitz-sen-fitoussi.fr/en/index.htm. Accessed 21 May 2018.
UNDP. (1990). Human development report 1990. New York: Oxford University Press.
UNDP. (2010). Human development report 2010. New York: Palgrave Macmillan.
Van Beuningen, J., Van der Houwen, K., & Moonen, L. (2014). Measuring well-being. An analysis of different response scales. Discussion Paper, 3. Statistics Netherlands.
Vinzi, V. E., Lauro, C., & Tenenhaus, M. (2003). PLS path modeling. Working paper. DMS – University of Naples, HEC – School of Management, Jouy-en-Josas.
Wong, K. M. (2012). Well-being and economic development: A principal components analysis. International Journal of Happiness and Development, 1, 131–141.
Zumbo, B. D. (2007). Validity: Foundational issues and statistical methodology. In C. R. Rao & S. Sinharay (Eds.), Handbook of statistics (Vol. 26, pp. 45–79)., Psychometrics Boston: Elsevier.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mazziotta, M., Pareto, A. Use and Misuse of PCA for Measuring Well-Being. Soc Indic Res 142, 451–476 (2019). https://doi.org/10.1007/s11205-018-1933-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11205-018-1933-0