Use and Misuse of PCA for Measuring Well-Being

Mazziotta, Matteo; Pareto, Adriano

doi:10.1007/s11205-018-1933-0

Use and Misuse of PCA for Measuring Well-Being

Published: 25 May 2018

Volume 142, pages 451–476, (2019)
Cite this article

Social Indicators Research Aims and scope Submit manuscript

Matteo Mazziotta¹ &
Adriano Pareto²

3092 Accesses
84 Citations
Explore all metrics

Abstract

The measurement of well-being of people is very difficult because it is characterized by a multiplicity of aspects or dimensions. Principal Components Analysis (PCA) is probably the most popular multivariate statistical technique for reducing data with many dimensions and, often, well-being indicators are reduced to a single index of well-being by using PCA. However, PCA is implicitly based on a reflective measurement model that is not suitable for all types of indicators. In this paper, we discuss the use and misuse of PCA for measuring well-being, and we show some applications to real data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dimensions of Well-Being and Their Statistical Measurements

Exploring Multidimensional Well-Being in Switzerland: Comparing Three Synthesizing Approaches

Article 17 September 2016

Katia Iglesias, Christian Suter, … B. P. Vani

Fifty Years After the Social Indicators Movement: Has the Promise Been Fulfilled?

Article 02 March 2017

Kenneth C. Land & Alex C. Michalos

Notes

For the sake of simplicity, only linear models will be considered.
Some authors exclude the error term so that Eq. (2) reduces to a weighted linear combination of the X_i (Diamantopoulos 2006).
Experts suggest that weights could be determined a priori, according to the theoretical contribution of the indicators to the concept (Howell et al. 2007). For Cadogan and Lee (2013), if there is no theory suggesting the contrary, individual indicators should have equal weightings.
Because the formative measurement model is based on a multiple regression, the stability of the coefficients λ_i is affected by the strength of the indicator intercorrelations. Therefore, multicollinearity must be avoided. (Diamantopoulos and Winklhofer 2001).
Individual indicators must have at least an interval level of measurement. For variables measured on nominal or ordinal scale, we recommend the use of Categorical Principal Components Analysis (CATPCA). For a introduction and application of CATPCA, see Linting et al. (2007).
Often, the use of the first principal component as the ‘only’ composite index is a bad practice that reduces the PCA potentials.
Normalization aims to make individual indicators comparable, as they often have different measurement units and/or different polarities. Normalized indicators are calculated by transforming individual indicators into pure, dimensionless, numbers, with positive polarity (Mazziotta and Pareto 2017).
Principal components can be real features of the data, or more or less convenient fictions and summaries. That they are real is a hypothesis for which PCA can provide only a very weak evidence (Shalizi 2009).
STATIS is an exploratory technique of multivariate data analysis for handling three-way matrices, where the same units have measures on a set of indicators under a number of conditions (Lavit et al. 1994).
Several robust PCA methods have been introduced in the literature (Filzmoser 1999; Hubert et al. 2005), but they make the analysis resistant to outlying observations.
There are two types of FA: exploratory and confirmatory. In this paper, we consider exploratory factor analysis (Fabrigar and Wegener 2011).
Individual indicators were normalized as z-scores. The signs were reversed if the polarity is negative.
The first factor of PCA accounts for 72.4% of the variance in the data.
Note that, for constructing a composite index, all the normalized indicators must have positive polarity, so that an increase in each of them corresponds to an increase in the composite index (Mazziotta and Pareto 2013).
Influence Analysis is a particular case of Uncertainty Analysis that aims to empirically quantify the ‘weight’ of each individual indicator in the calculation of the composite index (Mazziotta and Pareto 2017).
Note that only individual indicators are released by Istat at the provincial level.
A pillar describes a particular aspect—not directly observable—of the latent phenomenon by a set of individual indicators which are assumed to be related to it.

References

Biswas, B., & Caliendo, F. (2002). A multivariate analysis of the human development index. Indian Economic Journal, 49, 96–100.
Google Scholar
Bleys, B. (2012). Beyond GDP: Classifying alternative measures for progress. Social Indicators Research, 109, 355–376.
Article Google Scholar
Boarini, R., Kolev, A., & McGregor, A. (2014). Measuring well-being and progress in countries at different stages of development: Towards a more universal conceptual framework. Working Paper No. 325. OECD Development Centre.
Bohrnstedt, G. W. (1970). Reliability and validity assessment in attitude measurement. In G. F. Summers (Ed.), Attitude measurement (pp. 80–99). London: Rand McNally.
Google Scholar
Booysen, F. (2002). An overview and evaluation of composite indices of development. Social Indicators Research, 59, 115–151.
Article Google Scholar
Borsboom, D., Mellenbergh, G. J., & Heerden, J. V. (2003). The theoretical status of latent variables. Psychological Review, 110, 203–219.
Article Google Scholar
Cadogan, J. W., & Lee, N. (2013). Improper use of endogenous formative variables. Journal of Business Research, 66, 233–241.
Article Google Scholar
Chelli, F., Ciommi, M., Emili, A., Gigliarano, C., & Taralli, S. (2017). A new class of composite indicators for measuring well-being at the local level: An application to the Equitable and Sustainable Well-being (BES) of the Italian Provinces. Ecological Indicators, 76, 281–296.
Article Google Scholar
Coltman, T., Devinney, T. M., Midgley, D. F., & Venaik, S. (2008). Formative versus reflective measurement models: Two applications of formative measurement. Journal of Business Research, 61, 1250–1262.
Article Google Scholar
Diamantopoulos, A. (2006). The error term in formative measurement models: Interpretation and modeling implications. Journal of Modeling in Management, 1, 7–17.
Article Google Scholar
Diamantopoulos, A., Riefler, P., & Roth, K. P. (2008). Advancing formative measurement models. Journal of Business Research, 61, 1203–1218.
Article Google Scholar
Diamantopoulos, A., & Winklhofer, H. M. (2001). Index construction with formative indicators: An alternative to scale development. Journal of Marketing Research, 38, 269–277.
Article Google Scholar
Dunteman, G. H. (1989). Principal components analysis. Newbury Park: Sage.
Book Google Scholar
Edwards, J. R., & Bagozzi, R. P. (2000). On the nature and direction of relationships between constructs and measures. Psychological Methods, 5, 155–174.
Article Google Scholar
Fabrigar, L. F., & Wegener, D. T. (2011). Exploratory factor analysis. New York: Oxford University Press.
Book Google Scholar
Fayers, P. M., & Hand, D. J. (2002). Causal variables, indicator variables and measurement scales: An example from quality of life. Journal of the Royal Statistical Society, Series A, 165, 233–261.
Article Google Scholar
Ferrara, A. R., & Nisticò, R. (2014). Measuring well-being in a multidimensional perspective: A multivariate statistical application to Italian regions. Working Paper, 6. Dipartimento di Economia, Statistica e Finanza, Università della Calabria.
Filzmoser, P. (1999). Robust principal components and factor analysis in the geostatistical treatment of environmental data. Environmetrics, 10, 363–375.
Article Google Scholar
Götz, O., Liehr-Gobbers, K., & Krafft, M. (2010). Evaluation of structural equation models using the partial least squares (PLS) approach. In V. Esposito Vinzi, W. W. Chin, J. Henseler, & H. Wang (Eds.), Handbook of partial least squares: Concepts, methods, and applications (pp. 691–711). Berlin: Springer.
Chapter Google Scholar
Guttman, L. (1954). Some necessary conditions for common factor analysis. Psychometrika, 19, 149–161.
Article Google Scholar
Haq, R., & Zia, U. (2013). Multidimensional wellbeing: An index of quality of life in a developing economy. Social Indicators Research, 114, 997–1012.
Article Google Scholar
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417–441.
Article Google Scholar
Howell, R. D., Breivik, E., & Wilcox, J. B. (2007). Reconsidering formative measurement. Psychological Methods, 12, 205–218.
Article Google Scholar
Hubert, M., Rousseeuw, P. J., & Vanden Branden, K. (2005). Robpca: A new approach to robust principal component analysis. Technometrics, 47, 64–79.
Article Google Scholar
Istat (2015a). BES 2015. Il benessere equo e sostenibile in Italia. http://www.istat.it/it/files/2015/12/Rapporto_BES_2015.pdf. Accessed 21 May 2018.
Istat (2015b). Il benessere equo e sostenibile delle province. http://www.besdelleprovince.it/fileadmin/grpmnt/1225/pubblicazione_nazionale.pdf. Accessed 21 May 2018.
Jarvis, C. B., Mackenzie, S. B., & Podsakoff, P. M. (2003). A critical review of construct indicators and measurement model misspecification in marketing and consumer research. Journal of Consumer Research, 30, 199–218.
Article Google Scholar
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical Transactions of the Royal Society, A, 374, 20150202. https://doi.org/10.1098/rsta.2015.0202.
Article Google Scholar
Kaiser, H. F. (1961). A note on Guttman’s lower bound for the number of common factors. British Journal of Mathematical ans Statistical Psychology, 14, 1–2.
Article Google Scholar
Kendall, M. G., & Stuart, A. (1968). The advanced theory of statistics (Vol. 3). London: Charles Griffin & Co.
Google Scholar
Krishnakumar, J., & Nagar, A. L. (2008). On exact statistical properties of multidimensional indices based on principal components, factor analysis, MIMIC and structural equation models. Social Indicators Research, 86, 481–496.
Article Google Scholar
Lai, D. (2003). Principal component analysis on human development indicators of China. Social Indicators Research, 61, 319–330.
Article Google Scholar
Lavit, C., Escoufier, Y., Sabatier, R., & Traissac, P. (1994). The ACT (STATIS method). Computational Statistics & Data Analysis, 18, 97–119.
Article Google Scholar
Linting, M., Meulman, J. J., Groenen, P. J. F., & Van der Kooij, A. J. (2007). Nonlinear principal components analysis: Introduction and application. Psychological Methods, 12, 336–358.
Article Google Scholar
Maggino, F. (2017). Developing indicators and managing the complexity. In F. Maggino (Ed.), Complexity in society: From indicators construction to their synthesis (Vol. 70, pp. 87–114)., Social indicators research series Cham: Springer.
Chapter Google Scholar
Maggino, F., & Zumbo, B. D. (2012). Measuring the quality of life and the construction of social indicators. In K. C. Land, A. C. Michalos, & M. J. Sirgy (Eds.), Handbook of social indicators and quality-of-life research (pp. 201–238). Dordrecht: Springer.
Chapter Google Scholar
Markus, K. A., & Borsboom, D. (2013). Frontiers of test validity theory. Measurement, causation, and meaning. New York: Routledge.
Google Scholar
Mazziotta, M., & Pareto, A. (2013). Methods for constructing composite indices: One for all or all for one. Rivista Italiana di Economia Demografia e Statistica, LXVII(2), 67–80.
Google Scholar
Mazziotta, M., & Pareto, A. (2016a). On a generalized non-compensatory composite index for measuring socio-economic phenomena. Social Indicators Research, 127, 983–1003.
Article Google Scholar
Mazziotta, M., & Pareto, A. (2016b). On the construction of composite indices by principal components analysis. Rivista Italiana di Economia Demografia e Statistica, LXX(1), 103–109.
Google Scholar
Mazziotta, M., & Pareto, A. (2017). Synthesis of indicators: The composite indicators approach. In F. Maggino (Ed.), Complexity in society: From indicators construction to their synthesis (Vol. 70, pp. 159–191)., Social indicators research series Cham: Springer.
Chapter Google Scholar
McGillivray, M. (2005). Measuring non-economic well-being achievement. Review of Income and Wealth, 51, 337–364.
Article Google Scholar
Michalos, A. C. (2014). Encyclopedia of quality of life and well-being research. Dordrecht: Springer.
Book Google Scholar
Michalos, A. C., Smale, B., Labonté, R., Muharjarine, N., Scott, K., Moore, K., et al. (2011). The Canadian Index of wellbeing. Technical report 1.0. Waterloo, ON: Canadian Index of Wellbeing and University of Waterloo.
Mishra, S. K. (2007). A comparative study of various inclusive indices and the index constructed by the principal components analysis. MPRA Paper, No. 3377. MPRA. http://mpra.ub.uni-muenchen.de/3377. Accessed 21 May 2018.
Mishra, S. K. (2008). On Construction of Robust Composite Indices by Linear Aggregation. SSRN. http://ssrn.com/abstract=1147964. Accessed 21 May 2018.
Nahman, A., Mahumani, B. K., & De Lange, W. J. (2016). Beyond GDP: Towards a green economy index. Development Southern Africa. https://doi.org/10.1080/0376835X.2015.1120649.
Google Scholar
OECD. (2004). The OECD-JRC handbook on practices for developing composite indicators. Paper presented at the OECD Committee on Statistics, 7–8 June 2004, OECD, Paris.
OECD. (2008). Handbook on constructing composite indicators. Methodology and user guide. Paris: OECD Publications.
Book Google Scholar
OECD. (2015). How’s life? 2015: Measuring well-being. Paris: OECD Publishing.
Book Google Scholar
Osborne, J. W. (2014). Best practices in exploratory factor analysis. Newbury Park: Jason W. Osborne.
Google Scholar
Ram, R. (1982). Composite indices of physical quality of life, basic needs fulfilment, and income: A principal component representation. Journal of Development Economics, 11, 227–247.
Article Google Scholar
Salzman, J. (2003). Methodological choices encountered in the construction of composite indices of economic and social well-Being. Technical Report. Center for the Study of Living Standards, Ottawa.
Sen, A. K. (1985). Commodities and capabilities. Amsterdam: North Holland Publishing Company.
Google Scholar
Shalizi C. R. (2009). The truth about principal components and factor analysis. http://www.stat.cmu.edu/~cshalizi/350/lectures/13/lecture-13.pdf.
Shwartz, M., Restuccia, J. D., & Rosen, A. K. (2015). Composite measures of health care provider performance: A description of approaches. The Milbank Quarterly, 93, 788–825.
Article Google Scholar
Simonetto, A. (2012). Formative and reflective models: State of the art. Electronic Journal of Applied Statistical Analysis, 5, 452–457.
Google Scholar
Slottje, D. J. (1991). Measuring the quality of life across countries. The Review of Economics and Statistics, 73, 684–693.
Article Google Scholar
Somarriba, N., & Pena, B. (2009). Synthetic indicators of quality of life in Europe. Social Indicators Research, 94, 115–133.
Article Google Scholar
Stiglitz, J., Sen, A. K., & Fitoussi, J. P. (2009). Report of the commission on the measurement of economic performance and social progress. Paris. Available online from the Commission on the Measurement of Economic Performance and Social Progress: http://www.stiglitz-sen-fitoussi.fr/en/index.htm. Accessed 21 May 2018.
UNDP. (1990). Human development report 1990. New York: Oxford University Press.
Google Scholar
UNDP. (2010). Human development report 2010. New York: Palgrave Macmillan.
Google Scholar
Van Beuningen, J., Van der Houwen, K., & Moonen, L. (2014). Measuring well-being. An analysis of different response scales. Discussion Paper, 3. Statistics Netherlands.
Vinzi, V. E., Lauro, C., & Tenenhaus, M. (2003). PLS path modeling. Working paper. DMS – University of Naples, HEC – School of Management, Jouy-en-Josas.
Wong, K. M. (2012). Well-being and economic development: A principal components analysis. International Journal of Happiness and Development, 1, 131–141.
Article Google Scholar
Zumbo, B. D. (2007). Validity: Foundational issues and statistical methodology. In C. R. Rao & S. Sinharay (Eds.), Handbook of statistics (Vol. 26, pp. 45–79)., Psychometrics Boston: Elsevier.
Google Scholar

Download references

Acknowledgements

The paper is the result of the common work of the authors: in particular M. Mazziotta has written Sects. 2.1, 3.2, 4 and A. Pareto has written Sects. 1, 2.2, 2.3, 2.4, 3.1.

Author information

Authors and Affiliations

Italian National Institute of Statistics, Viale Oceano Pacifico 171, 00144, Rome, Italy
Matteo Mazziotta
Italian National Institute of Statistics, Via A. Depretis 77, 00184, Rome, Italy
Adriano Pareto

Authors

Matteo Mazziotta
View author publications
You can also search for this author in PubMed Google Scholar
Adriano Pareto
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Adriano Pareto.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mazziotta, M., Pareto, A. Use and Misuse of PCA for Measuring Well-Being. Soc Indic Res 142, 451–476 (2019). https://doi.org/10.1007/s11205-018-1933-0

Download citation

Accepted: 22 May 2018
Published: 25 May 2018
Issue Date: 15 April 2019
DOI: https://doi.org/10.1007/s11205-018-1933-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Use and Misuse of PCA for Measuring Well-Being

Abstract

Access this article

Similar content being viewed by others

Dimensions of Well-Being and Their Statistical Measurements

Exploring Multidimensional Well-Being in Switzerland: Comparing Three Synthesizing Approaches

Fifty Years After the Social Indicators Movement: Has the Promise Been Fulfilled?

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Use and Misuse of PCA for Measuring Well-Being

Abstract

Access this article

Similar content being viewed by others

Dimensions of Well-Being and Their Statistical Measurements

Exploring Multidimensional Well-Being in Switzerland: Comparing Three Synthesizing Approaches

Fifty Years After the Social Indicators Movement: Has the Promise Been Fulfilled?

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation