Skip to main content
Log in

Use and Misuse of PCA for Measuring Well-Being

  • Published:
Social Indicators Research Aims and scope Submit manuscript

Abstract

The measurement of well-being of people is very difficult because it is characterized by a multiplicity of aspects or dimensions. Principal Components Analysis (PCA) is probably the most popular multivariate statistical technique for reducing data with many dimensions and, often, well-being indicators are reduced to a single index of well-being by using PCA. However, PCA is implicitly based on a reflective measurement model that is not suitable for all types of indicators. In this paper, we discuss the use and misuse of PCA for measuring well-being, and we show some applications to real data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. For the sake of simplicity, only linear models will be considered.

  2. Some authors exclude the error term so that Eq. (2) reduces to a weighted linear combination of the Xi (Diamantopoulos 2006).

  3. Experts suggest that weights could be determined a priori, according to the theoretical contribution of the indicators to the concept (Howell et al. 2007). For Cadogan and Lee (2013), if there is no theory suggesting the contrary, individual indicators should have equal weightings.

  4. Because the formative measurement model is based on a multiple regression, the stability of the coefficients λi is affected by the strength of the indicator intercorrelations. Therefore, multicollinearity must be avoided. (Diamantopoulos and Winklhofer 2001).

  5. Individual indicators must have at least an interval level of measurement. For variables measured on nominal or ordinal scale, we recommend the use of Categorical Principal Components Analysis (CATPCA). For a introduction and application of CATPCA, see Linting et al. (2007).

  6. Often, the use of the first principal component as the ‘only’ composite index is a bad practice that reduces the PCA potentials.

  7. Normalization aims to make individual indicators comparable, as they often have different measurement units and/or different polarities. Normalized indicators are calculated by transforming individual indicators into pure, dimensionless, numbers, with positive polarity (Mazziotta and Pareto 2017).

  8. Principal components can be real features of the data, or more or less convenient fictions and summaries. That they are real is a hypothesis for which PCA can provide only a very weak evidence (Shalizi 2009).

  9. STATIS is an exploratory technique of multivariate data analysis for handling three-way matrices, where the same units have measures on a set of indicators under a number of conditions (Lavit et al. 1994).

  10. Several robust PCA methods have been introduced in the literature (Filzmoser 1999; Hubert et al. 2005), but they make the analysis resistant to outlying observations.

  11. There are two types of FA: exploratory and confirmatory. In this paper, we consider exploratory factor analysis (Fabrigar and Wegener 2011).

  12. Individual indicators were normalized as z-scores. The signs were reversed if the polarity is negative.

  13. The first factor of PCA accounts for 72.4% of the variance in the data.

  14. Note that, for constructing a composite index, all the normalized indicators must have positive polarity, so that an increase in each of them corresponds to an increase in the composite index (Mazziotta and Pareto 2013).

  15. Influence Analysis is a particular case of Uncertainty Analysis that aims to empirically quantify the ‘weight’ of each individual indicator in the calculation of the composite index (Mazziotta and Pareto 2017).

  16. Note that only individual indicators are released by Istat at the provincial level.

  17. A pillar describes a particular aspect—not directly observable—of the latent phenomenon by a set of individual indicators which are assumed to be related to it.

References

  • Biswas, B., & Caliendo, F. (2002). A multivariate analysis of the human development index. Indian Economic Journal, 49, 96–100.

    Google Scholar 

  • Bleys, B. (2012). Beyond GDP: Classifying alternative measures for progress. Social Indicators Research, 109, 355–376.

    Article  Google Scholar 

  • Boarini, R., Kolev, A., & McGregor, A. (2014). Measuring well-being and progress in countries at different stages of development: Towards a more universal conceptual framework. Working Paper No. 325. OECD Development Centre.

  • Bohrnstedt, G. W. (1970). Reliability and validity assessment in attitude measurement. In G. F. Summers (Ed.), Attitude measurement (pp. 80–99). London: Rand McNally.

    Google Scholar 

  • Booysen, F. (2002). An overview and evaluation of composite indices of development. Social Indicators Research, 59, 115–151.

    Article  Google Scholar 

  • Borsboom, D., Mellenbergh, G. J., & Heerden, J. V. (2003). The theoretical status of latent variables. Psychological Review, 110, 203–219.

    Article  Google Scholar 

  • Cadogan, J. W., & Lee, N. (2013). Improper use of endogenous formative variables. Journal of Business Research, 66, 233–241.

    Article  Google Scholar 

  • Chelli, F., Ciommi, M., Emili, A., Gigliarano, C., & Taralli, S. (2017). A new class of composite indicators for measuring well-being at the local level: An application to the Equitable and Sustainable Well-being (BES) of the Italian Provinces. Ecological Indicators, 76, 281–296.

    Article  Google Scholar 

  • Coltman, T., Devinney, T. M., Midgley, D. F., & Venaik, S. (2008). Formative versus reflective measurement models: Two applications of formative measurement. Journal of Business Research, 61, 1250–1262.

    Article  Google Scholar 

  • Diamantopoulos, A. (2006). The error term in formative measurement models: Interpretation and modeling implications. Journal of Modeling in Management, 1, 7–17.

    Article  Google Scholar 

  • Diamantopoulos, A., Riefler, P., & Roth, K. P. (2008). Advancing formative measurement models. Journal of Business Research, 61, 1203–1218.

    Article  Google Scholar 

  • Diamantopoulos, A., & Winklhofer, H. M. (2001). Index construction with formative indicators: An alternative to scale development. Journal of Marketing Research, 38, 269–277.

    Article  Google Scholar 

  • Dunteman, G. H. (1989). Principal components analysis. Newbury Park: Sage.

    Book  Google Scholar 

  • Edwards, J. R., & Bagozzi, R. P. (2000). On the nature and direction of relationships between constructs and measures. Psychological Methods, 5, 155–174.

    Article  Google Scholar 

  • Fabrigar, L. F., & Wegener, D. T. (2011). Exploratory factor analysis. New York: Oxford University Press.

    Book  Google Scholar 

  • Fayers, P. M., & Hand, D. J. (2002). Causal variables, indicator variables and measurement scales: An example from quality of life. Journal of the Royal Statistical Society, Series A, 165, 233–261.

    Article  Google Scholar 

  • Ferrara, A. R., & Nisticò, R. (2014). Measuring well-being in a multidimensional perspective: A multivariate statistical application to Italian regions. Working Paper, 6. Dipartimento di Economia, Statistica e Finanza, Università della Calabria.

  • Filzmoser, P. (1999). Robust principal components and factor analysis in the geostatistical treatment of environmental data. Environmetrics, 10, 363–375.

    Article  Google Scholar 

  • Götz, O., Liehr-Gobbers, K., & Krafft, M. (2010). Evaluation of structural equation models using the partial least squares (PLS) approach. In V. Esposito Vinzi, W. W. Chin, J. Henseler, & H. Wang (Eds.), Handbook of partial least squares: Concepts, methods, and applications (pp. 691–711). Berlin: Springer.

    Chapter  Google Scholar 

  • Guttman, L. (1954). Some necessary conditions for common factor analysis. Psychometrika, 19, 149–161.

    Article  Google Scholar 

  • Haq, R., & Zia, U. (2013). Multidimensional wellbeing: An index of quality of life in a developing economy. Social Indicators Research, 114, 997–1012.

    Article  Google Scholar 

  • Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417–441.

    Article  Google Scholar 

  • Howell, R. D., Breivik, E., & Wilcox, J. B. (2007). Reconsidering formative measurement. Psychological Methods, 12, 205–218.

    Article  Google Scholar 

  • Hubert, M., Rousseeuw, P. J., & Vanden Branden, K. (2005). Robpca: A new approach to robust principal component analysis. Technometrics, 47, 64–79.

    Article  Google Scholar 

  • Istat (2015a). BES 2015. Il benessere equo e sostenibile in Italia. http://www.istat.it/it/files/2015/12/Rapporto_BES_2015.pdf. Accessed 21 May 2018.

  • Istat (2015b). Il benessere equo e sostenibile delle province. http://www.besdelleprovince.it/fileadmin/grpmnt/1225/pubblicazione_nazionale.pdf. Accessed 21 May 2018.

  • Jarvis, C. B., Mackenzie, S. B., & Podsakoff, P. M. (2003). A critical review of construct indicators and measurement model misspecification in marketing and consumer research. Journal of Consumer Research, 30, 199–218.

    Article  Google Scholar 

  • Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical Transactions of the Royal Society, A, 374, 20150202. https://doi.org/10.1098/rsta.2015.0202.

    Article  Google Scholar 

  • Kaiser, H. F. (1961). A note on Guttman’s lower bound for the number of common factors. British Journal of Mathematical ans Statistical Psychology, 14, 1–2.

    Article  Google Scholar 

  • Kendall, M. G., & Stuart, A. (1968). The advanced theory of statistics (Vol. 3). London: Charles Griffin & Co.

    Google Scholar 

  • Krishnakumar, J., & Nagar, A. L. (2008). On exact statistical properties of multidimensional indices based on principal components, factor analysis, MIMIC and structural equation models. Social Indicators Research, 86, 481–496.

    Article  Google Scholar 

  • Lai, D. (2003). Principal component analysis on human development indicators of China. Social Indicators Research, 61, 319–330.

    Article  Google Scholar 

  • Lavit, C., Escoufier, Y., Sabatier, R., & Traissac, P. (1994). The ACT (STATIS method). Computational Statistics & Data Analysis, 18, 97–119.

    Article  Google Scholar 

  • Linting, M., Meulman, J. J., Groenen, P. J. F., & Van der Kooij, A. J. (2007). Nonlinear principal components analysis: Introduction and application. Psychological Methods, 12, 336–358.

    Article  Google Scholar 

  • Maggino, F. (2017). Developing indicators and managing the complexity. In F. Maggino (Ed.), Complexity in society: From indicators construction to their synthesis (Vol. 70, pp. 87–114)., Social indicators research series Cham: Springer.

    Chapter  Google Scholar 

  • Maggino, F., & Zumbo, B. D. (2012). Measuring the quality of life and the construction of social indicators. In K. C. Land, A. C. Michalos, & M. J. Sirgy (Eds.), Handbook of social indicators and quality-of-life research (pp. 201–238). Dordrecht: Springer.

    Chapter  Google Scholar 

  • Markus, K. A., & Borsboom, D. (2013). Frontiers of test validity theory. Measurement, causation, and meaning. New York: Routledge.

    Google Scholar 

  • Mazziotta, M., & Pareto, A. (2013). Methods for constructing composite indices: One for all or all for one. Rivista Italiana di Economia Demografia e Statistica, LXVII(2), 67–80.

    Google Scholar 

  • Mazziotta, M., & Pareto, A. (2016a). On a generalized non-compensatory composite index for measuring socio-economic phenomena. Social Indicators Research, 127, 983–1003.

    Article  Google Scholar 

  • Mazziotta, M., & Pareto, A. (2016b). On the construction of composite indices by principal components analysis. Rivista Italiana di Economia Demografia e Statistica, LXX(1), 103–109.

    Google Scholar 

  • Mazziotta, M., & Pareto, A. (2017). Synthesis of indicators: The composite indicators approach. In F. Maggino (Ed.), Complexity in society: From indicators construction to their synthesis (Vol. 70, pp. 159–191)., Social indicators research series Cham: Springer.

    Chapter  Google Scholar 

  • McGillivray, M. (2005). Measuring non-economic well-being achievement. Review of Income and Wealth, 51, 337–364.

    Article  Google Scholar 

  • Michalos, A. C. (2014). Encyclopedia of quality of life and well-being research. Dordrecht: Springer.

    Book  Google Scholar 

  • Michalos, A. C., Smale, B., Labonté, R., Muharjarine, N., Scott, K., Moore, K., et al. (2011). The Canadian Index of wellbeing. Technical report 1.0. Waterloo, ON: Canadian Index of Wellbeing and University of Waterloo.

  • Mishra, S. K. (2007). A comparative study of various inclusive indices and the index constructed by the principal components analysis. MPRA Paper, No. 3377. MPRA. http://mpra.ub.uni-muenchen.de/3377. Accessed 21 May 2018.

  • Mishra, S. K. (2008). On Construction of Robust Composite Indices by Linear Aggregation. SSRN. http://ssrn.com/abstract=1147964. Accessed 21 May 2018.

  • Nahman, A., Mahumani, B. K., & De Lange, W. J. (2016). Beyond GDP: Towards a green economy index. Development Southern Africa. https://doi.org/10.1080/0376835X.2015.1120649.

    Google Scholar 

  • OECD. (2004). The OECD-JRC handbook on practices for developing composite indicators. Paper presented at the OECD Committee on Statistics, 7–8 June 2004, OECD, Paris.

  • OECD. (2008). Handbook on constructing composite indicators. Methodology and user guide. Paris: OECD Publications.

    Book  Google Scholar 

  • OECD. (2015). How’s life? 2015: Measuring well-being. Paris: OECD Publishing.

    Book  Google Scholar 

  • Osborne, J. W. (2014). Best practices in exploratory factor analysis. Newbury Park: Jason W. Osborne.

    Google Scholar 

  • Ram, R. (1982). Composite indices of physical quality of life, basic needs fulfilment, and income: A principal component representation. Journal of Development Economics, 11, 227–247.

    Article  Google Scholar 

  • Salzman, J. (2003). Methodological choices encountered in the construction of composite indices of economic and social well-Being. Technical Report. Center for the Study of Living Standards, Ottawa.

  • Sen, A. K. (1985). Commodities and capabilities. Amsterdam: North Holland Publishing Company.

    Google Scholar 

  • Shalizi C. R. (2009). The truth about principal components and factor analysis. http://www.stat.cmu.edu/~cshalizi/350/lectures/13/lecture-13.pdf.

  • Shwartz, M., Restuccia, J. D., & Rosen, A. K. (2015). Composite measures of health care provider performance: A description of approaches. The Milbank Quarterly, 93, 788–825.

    Article  Google Scholar 

  • Simonetto, A. (2012). Formative and reflective models: State of the art. Electronic Journal of Applied Statistical Analysis, 5, 452–457.

    Google Scholar 

  • Slottje, D. J. (1991). Measuring the quality of life across countries. The Review of Economics and Statistics, 73, 684–693.

    Article  Google Scholar 

  • Somarriba, N., & Pena, B. (2009). Synthetic indicators of quality of life in Europe. Social Indicators Research, 94, 115–133.

    Article  Google Scholar 

  • Stiglitz, J., Sen, A. K., & Fitoussi, J. P. (2009). Report of the commission on the measurement of economic performance and social progress. Paris. Available online from the Commission on the Measurement of Economic Performance and Social Progress: http://www.stiglitz-sen-fitoussi.fr/en/index.htm. Accessed 21 May 2018.

  • UNDP. (1990). Human development report 1990. New York: Oxford University Press.

    Google Scholar 

  • UNDP. (2010). Human development report 2010. New York: Palgrave Macmillan.

    Google Scholar 

  • Van Beuningen, J., Van der Houwen, K., & Moonen, L. (2014). Measuring well-being. An analysis of different response scales. Discussion Paper, 3. Statistics Netherlands.

  • Vinzi, V. E., Lauro, C., & Tenenhaus, M. (2003). PLS path modeling. Working paper. DMS – University of Naples, HEC – School of Management, Jouy-en-Josas.

  • Wong, K. M. (2012). Well-being and economic development: A principal components analysis. International Journal of Happiness and Development, 1, 131–141.

    Article  Google Scholar 

  • Zumbo, B. D. (2007). Validity: Foundational issues and statistical methodology. In C. R. Rao & S. Sinharay (Eds.), Handbook of statistics (Vol. 26, pp. 45–79)., Psychometrics Boston: Elsevier.

    Google Scholar 

Download references

Acknowledgements

The paper is the result of the common work of the authors: in particular M. Mazziotta has written Sects. 2.1, 3.2, 4 and A. Pareto has written Sects. 1, 2.2, 2.3, 2.4, 3.1.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adriano Pareto.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mazziotta, M., Pareto, A. Use and Misuse of PCA for Measuring Well-Being. Soc Indic Res 142, 451–476 (2019). https://doi.org/10.1007/s11205-018-1933-0

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11205-018-1933-0

Keywords

Navigation