Introduction

In recent years, empirical studies of the meaning of life construct have increased as it has become an important concept within the theories of well-being especially driven by the development of Positive Psychology (García-Alandete, 2014; García-Alandete et al., 2017). This provides a better understanding of the potentialities and resources people have (Crea, 2016). For Frankl (2014), experiencing the meaning of life is the most important motivational characteristic for human beings. There is evidence that the meaning of life is related to a greater experience of freedom, satisfaction with life, optimism, hope and well-being (Akbari et al., 2019; Karataş et al., 2021; Stoyles et al., 2015; Yalçın & Malkoç, 2015). In addition, it has a protective role against depression, hopelessness, aimlessness, self-injurious behaviors and suicide, among other mental health problems (Disabato et al., 2017; George & Park, 2016; Lew et al., 2020; Marco et al., 2017).

Meaning in life has been defined in many ways, such as an experience of fulfillment in life, sense of purpose, authentic life, etc. (Wong, 2016). Thus, it has been suggested that meaning in life comprises both meaning and purpose, which are closely related constructs that can be used interchangeably (García-Alandete et al., 2019). However, others indicate that value and importance are synonymous with meaning; while goals and intentions are synonymous with purpose (Steger et al., 2006). From other theoretical perspectives, the cognitive impulses that give meaning to personal experiences are important (Heintzelman & King, 2014a; Proulx & Inzlicht, 2012; Waytz et al., 2015). Similarly, importance is given to the emotional aspects of the meaning of life (Baumeister et al., 2013; King et al., 2016). Others suggest that meaning is part of a system of social relations (Lambert et al., 2013; Klein, 2017). Finally, there are studies that consider the meaning of life as a permanent search process or that try to identify the levels of meaning (McAdams, 2013; Schnell, 2009).

The difficulty to define the meaning of life also generates problems to evaluate it in a valid and reliable way. In logotherapy, there is an empirical orientation that has led to the development of instruments to quantitatively evaluate the meaning of life (Batthyany & Guttman, 2006; Schulenberg, 2004). From this approach, the Purpose in Life Test was developed with the objective of assessing meaning and purpose in life. (PIL; Crumbaugh & Maholick, 1964). The PIL is probably the most widely used and researched instrument based on logotherapy (Schulenberg & Melton, 2010). According to logotherapy, people have the ability to resist the impact of external situations and their own limitations, both physical and psychological. This would allow them to develop meaning in their own lives whenever they are able to reflect on themselves (Frankl, 2014). In its original form, the PIL is comprised of 20 items that assess a set of life experiences, such as the presence of life goals, satisfaction with daily activities, boredom and enthusiasm, among others (Shuv-Ami & Bareket-Bojmel, 2021). Numerous studies have reported that the original 20-item PIL presents evidence of validity and reliability in different populations and countries (e.g., Brunelli et al., 2012; García-Alandete et al., 2016; Haugan & Moksnes, 2013; Ortiz et al., 2012; Schulenberg & Melton, 2010; Simkin et al., 2018; Zhang et al., 2021).

However, there are certain concerns regarding the PIL, such as the criticism related to its dimensionality. Different studies question whether the PIL has a unidimensional structure (e.g., Marsh et al., 2003; Simkin et al., 2018), two dimensions (e.g., García-Alandete et al., 2013; Hayashi & Esmerelles, 2017; Morgan & Farsides, 2009), three dimensions (e.g., Armas et al., 2018; Gottfried, 2016; Magaña Valladares et al., 2004) or even up to six factors (Reker & Cousins, 1979). It has even been proposed that both two-factor and five-factor solutions work adequately (Shek, 1988). On the other hand, it has been indicated that the PIL format is cumbersome and bulky at the time of evaluation, even more so if other tests are administered in addition (Harlow et al., 1987). This is largely due to the fact that each of the PIL items has different response scales; for example, item 5 (“Every day is:” [“Cada día es:”]) has response options ranging from “exactly the same” [“exactamente lo mismo”] to “always new and different” [“siempre nuevo y diferente”]; while, in item 8 (“In terms of reaching my life goals, I:” [“En términos de alcanzar mis metas de vida, yo:”]), the response options are “haven’t made any progress” [“no he hecho ningún progreso”] to “have achieved all of them completely” [“los he logrado todos por completo”] (Schulenberg et al., 2011). This can lead to confusion and difficulty in understanding the test in a unitary way (Francis et al., 2019). There is also concern about the extreme nature of some items, which refer to suicidal thoughts or preparation for death, and the influence it may receive from certain momentary emotions, such as boredom and enthusiasm (Heintzelman & King, 2014b). It has been suggested that the content of some items may be mismeasuring other constructs, such as depression (Schulenberg et al., 2011). In addition, some items, such as those referring to life goals, or reason for existence, may be too abstract for some age groups, such as adolescents (Law, 2012). In this sense, for example, a factor structure has been proposed without the presence of three items that were not representative and did not load on any factor (Jonsén et al., 2010). This has led to the suggestion of shorter versions of the PIL by eliminating some items. Thus, there are, for example, versions of 17 items (Jonsén et al., 2010) and 19 items divided into three factors (Halama, 2009), 10 items divided into two factors (García-Alandete et al., 2013; García-Alandete, 2014; Hayashi & Esmerelles, 2017) and even a 4-item version (Schulenberg et al., 2011).

Specifically, the 4-item version of the PIL, called Purpose in Life Test - Short Form (PIL-SF; Schulenberg et al., 2011) is made up of items 3, 4, 8 and 20 of the original version. For the development of this version, items 3, 8 and 20, which make up the factor called “purposeful life” as it appeared in some studies were joined with item 4 that has specific content on meaning in life. This improved the reliability of the factor from .75 to .81 (Schulenberg et al., 2011). In addition, the 4 items have formed the same factor in different studies, which assess the presence and realization of life goals/purposes (Molcar & Stuempfig, 1988; Steger, 2006). The original psychometric study of the PIL-SF (Schulenberg et al., 2011) showed that the 4-item undimensional model has an excellent fit and produces reliable scores. In addition, PIL-SF scores showed expected and significant correlations with other measures of life purpose and meaning, search for noetic goals or goals to find meaning in life (from logotherapy, the term noetic can be interpreted as meaning, Crumbaugh, 1977), life satisfaction, and boredom proneness. Likewise, the PIL-SF significantly predicted psychological distress. The results were similar whether the PIL-SF was administered independently or in conjunction with the original 20-item PIL. Subsequent studies have also reported an adequate fit of the unidimensional model and good reliability in Spain (García-Alandete et al., 2017) and Denmark (Pacak-Vedel et al., 2021) using confirmatory factor analysis models. From Rasch analysis, support has also been provided for the unidimensionality and good reliability of the PIL-SF in a pooled sample of people with spinal cord injuries from Germany, France and Italy (Peter et al., 2016). The degree of similarity between the psychometric properties of the PIL-SF in the previous studies seems to support the validity of the translations into the different languages in which the scale was applied. This would suggest the absence of linguistic bias in previous studies. However, no cross-cultural studies have been conducted with the PIL-SF in different languages to assess the presence of measurement biases due to cultural differences, problems in the understanding of the items or inconsistencies in the translation of the scale (Bader et al., 2021). Although the study by Peter et al. (2016) used samples from Germany, France, and Italy, the analyses were performed considering them as a single sample, without making comparisons across countries.

The PIL-SF has been used in different countries, including Spain (García-Alandete et al., 2017), the United States (Aiena et al., 2016; Schulenberg et al., 2011; Schulenberg et al., 2016; Weber et al., 2020), China (Lew et al., 2021; Wang et al., 2016), Iran (Cheraghifard et al., 2021), Germany, France and Italy (Peter et al., 2016) and Denmark (Pacak-Vedel et al., 2021). However, although the PIL-SF was proposed more than 10 years ago, there is no evidence of it having been used in studies in Latin America, which limits the global generalization of the results. Nowadays, in-depth research on the protective constructs of mental health and well-being in all countries is increasingly valuable (Bieda et al., 2017). For some years now, studies on meaning in life in cross-cultural contexts have increased in popularity (Temane et al., 2014), since cultural differences provide a guide that helps to better understand the way in which we give meaning to our lives (Kitayama et al., 2010; Mascaro & Rosen, 2008; Steger et al., 2008a). Culture enables the shaping of values and expectations that impact emotional experiences, the way people perceive themselves, and how they experience presence and search for meaning in their lives (Fischer et al., 2021; Koltko-Rivera, 2004). However, most of the studies on the meaning people find in life have been conducted in North American and European samples, which does not provide information on how people live in other cultures (Steger et al., 2008b).

Studies in different cultures suggest that the meaning of life presents certain variations or similarities between countries (Dogra et al., 2008; Fischer et al., 2021For example, it has been observed that in Japanese and American college samples, the presence of meaning in life and happiness are similarly related (Steger et al., 2008b). The same study, however, indicated that the search for meaning in life may differ between cultures. Thus, the search for meaning was negatively related to happiness in American college students, but the two variables were not related in the Japanese sample. Another study, which included countries in the Americas and Asia, indicated that Cambodian participants had a higher level of meaning in life, followed by Mexican and Chinese participants (Wȩziak-Białowolska et al., 2019). These findings suggest that cultural differences such as individualism vs. collectivism (Hofstede, 2001a, b), can influence the meaning people find in life. The individualism-collectivism continuum expresses the degree to which members of a culture see themselves as independent of or interdependent on the social context in which they live (Hofstede & Bond, 1984). This translates into people’s self-concept of “I” or “we” and indicates how much people care only about themselves and their immediate family, or care about the whole community to which they belong (Maaravi et al., 2021).

In this sense, people belonging to more collectivist cultures, such as Latin American countries (Minkov, 2018), seem to experience a better association between the presence and search for meaning in life (Fischer et al., 2021). Another study indicated that people from collectivist cultures generally had similar levels of meaning in life, as opposed to U.S. individuals (Wȩziak-Białowolska et al., 2019). This could be related to the presence of traditional values that include close association with family and friends. However, there is evidence that not all collectivist cultures show a positive relationship between presence and the search for meaning in life. Thus, in cultures that adopt collectivist aspects, such as Turkey and India, a negative association between presence and the search for meaning in life has been reported (Boyraz et al., 2013; Singh et al., 2016). These different results seem to suggest the presence of differences in the meaning found in life among collectivist cultures. There are also other factors that transcend culture and may explain these differences. In this sense, some developmental processes, characteristic of different stages of life, may have an influence on the development and search for the meaning in life (Atak & Çok, 2008). For example, in emerging adulthood, there is a predominance of the search for meaning in life (Steger et al., 2009). Thus, the explanatory power of the individualism-collectivism continuum in the relationship between presence and the search for meaning in life may vary according to the cultures and the period of development of the participants (Fischer et al., 2021). It is possible, therefore, that cultural differences may have less impact on how presence and meaning-seeking relate to well-being.

This leads to the need for further studies that seek to clarify the way in which presence of, and the search for, meaning in life are experienced in different cultures. Recently, there is a growing interest in conducting studies in different countries (Boer et al., 2018; Fischer & Karl, 2019). However, this type of research poses different challenges, especially with the measurement instruments used and, specifically, with the lack of research assessing measurement invariance across countries (Matos et al., 2021). Nowadays, it is increasingly necessary to develop and adapt measurement instruments that can be used in different countries. In view of this, although the PIL-SF has been used in different countries, it is not clear whether it shows measurement invariance in Spanish-speaking Latin American countries. Therefore, the main objective of this study was to test the invariance of the PIL-SF measurement in a sample of people from 7 Latin American countries. Additionally, the characteristics of the PIL-SF items were evaluated using an Item Response Theory (IRT) model. It is expected that the unidimensional structure of the PIL-SF will also be present in this group of Latin American countries and will present an adequate reliability estimate, as previously reported (García-Alandete et al., 2017; Pacak-Vedel et al., 2021). Likewise, although no measurement invariance studies of the PIL-SF have been carried out among different countries, it would be expected that the measure is invariant, considering that the undimensional structure has shown good fit in previous psychometric studies. Finally, it is expected that the PIL-SF items will present good discrimination and difficulty parameters as has been previously observed (Peter et al., 2016).

Testing measurement invariance is important as it could then assess the applicability of the purpose in life construct in different countries (van de Vijver, 2013). This is valuable as people may have different interpretations of certain words or the entire scale due to variations in cultural characteristics (Veenhoven, 1996). In this sense, the instrument may operate differently and the underlying construct may have different theoretical structures in various countries, leading to biased estimates (Dimitrov, 2010). If the underlying factor structure of an instrument developed in one country and language is the same in different countries with different languages, this would ensure that the instrument performs equally across groups and can be useful for bias-free comparisons (Byrne & Watkins, 2003). For this to occur, it must be shown that the underlying construct is measured equally in each country, i.e., demonstrate that the factor structure is invariant when assessed simultaneously across different countries (Borsboom, 2006; Byrne & Watkins, 2003; He & van de Vijver, 2012; Milfont & Fisher, 2010). The absence of invariance would indicate that individuals or groups respond differently to the items of an instrument. This would lead to the inability to make a reasonable comparison between group means (Dimitrov, 2010; van de Schoot et al., 2012). Providing evidence of measurement invariance of the PIL-SF could improve the accuracy of comparative estimation of the meaning people find in life at the Latin American level and better track the progress of interventions in multinational contexts. As mentioned above, to our knowledge, the PIL-SF has not been applied to Latin American samples. Therefore, this would be the first study using the PIL-SF in this region and the first also to examine its measurement invariance in different Latin American countries.

Finally, IRT-based analyses would make it possible to evaluate item parameters, estimate measurement errors, and describe the relationship between the latent trait and the probability that a person would choose a particular response (Thomas, 2011, 2019). In addition, the IRT model provides a graphical representation of item functioning, which gives the opportunity for a deeper analysis, both at the item and person level, using the same metric. While the PIL-SF has previously been analyzed using IRT models in Germany, France and Italy as well as in the United States (Peter et al., 2016), this is the first time that this analysis has been performed with the Spanish version.

Method

Participants

A total of 4306 people from the general population of seven Latin American countries (Argentina, Colombia, Ecuador, El Salvador, Mexico, Paraguay and Uruguay) selected by non-probabilistic purposive sampling participated in the study. The inclusion criteria were: 1) to be of legal age (in all participating countries the age of adulthood is 18 years or older), and 2) to have given informed consent to participate. The number of participants ranged from 244 in Paraguay to 1360 in Argentina. The countries participating in this study were part of the Epidemiological Project on the mental health of the population in a pandemic situation, designed by the Neuroscience Research Center of the School of Psychology at the National University of Rosario in Argentina, together with the School of Health Sciences of the Universidad Privada del Norte in Peru. The countries were selected to try to provide a variety of contextual settings within Latin America. Although an attempt was made to include as many countries as possible, most countries from South America and only one from Central America and the Caribbean participated.

Table 1 shows that Uruguay has the participants with the highest mean age (M = 41.8; SD = 16.6 years); while Ecuador has the lowest mean age (M = 24.6; SD = 7.8 years). Likewise, the majority of participants, in all countries, were women (> 64%). More than 60% of the participants have completed university studies and have a professional career (> 50%), except in Ecuador (38.5%) and El Salvador (27.8%), where there is a higher proportion of people with unskilled work (45.1% and 37.5%, respectively). Table 1 reports more detailed information on the sociodemographic characteristics of the participants in each country.

Table 1 Sociodemographic characteristics of the participants from the seven countries

Instruments

Sociodemographic Questionnaire

An ad hoc questionnaire was created to collect information on some sociodemographic variables such as nationality, age, sex, educational level and occupation.

The Purpose in Life Test-Short Form (PIL-SF; Schulenberg et al., 2011)

The PIL-SF is a short version of four items, derived from the original 20-item test, which measures the degree to which people perceive that their lives had meaning. Items 3, 4, 8, and 20 from the Spanish version by Simkin et al. (2018) were used in this study. Each of the PIL-SF items has seven different Likert-type response options. The total score for the PIL-SF ranges from 4 to 28 and is obtained from the sum of the scores for each item. Higher scores indicate meaning and purpose in life.

Procedure

The data was collected between the months of June and September 2020 at the height of the COVID-19 pandemic. The data collection process was the same in each of the participating countries. Argentina initiated and coordinated the Epidemiological Project on the mental health of the population in a pandemic situation. A general coordinator together with a team of researchers carried out the work in each participating country. An online survey was used, elaborated on the Google Forms platform, with had three distinct parts. The first consisted of the objective of the study and informed consent. If the participants gave their consent to participate in the study, they could access the second (sociodemographic questionnaire) and third (PIL-SF questions, FCV-19S and other measures that were not taken into account in this study) parts of the questionnaire. The survey was distributed by email and social networks, seeking to reach the largest number of people. The data collected were confidential and participants were free to stop answering the questions at any time. The project was reviewed and approved by the Ethics Committee of the Universidad Privada del Norte in Peru (registration number: 20213002).

Data Analysis

Descriptive Analysis

Mean (M) and standard deviation (SD) were calculated, as well as skewness (g1) and kurtosis (g2). Values within the range of ±2, for g1, and ± 7 for g2 are considered adequate (Finney & DiStefano, 2013).

Confirmatory Factor Analysis

First, the Confirmatory Factor Analysis (CFA) was performed with the Diagonally Weighted Least Squares with Mean and Variance Corrected (WLSMV) estimator. This estimator was chosen since the PIL-SF items had an ordinal nature (Brown, 2015). The chi-square test (χ2; absolute fit test), Root Mean Square Error of Approximation (RMSEA, estimate of the approximation error of the proposed model), Standardized Root Mean Squared Residual (SRMR, estimate of the average size of the residuals between the sample and the hypothesized covariance matrices), Comparative Fit Index (CFI) and Tucker-Lewis Index (TLI) were used to evaluate model fit. Both the CFI and TLI are incremental fit indices that compare the fit of a hypothetical model with the fit of a reference model. Values above .95 for the CFI and TLI would indicate a good fit; whereas, values above .90 indicate an acceptable fit. (Schumacker & Lomax, 2015). For the RMSEA and SRMR, values below .05 are indicative of a good fit; while values between .05 and .08 indicate an acceptable fit (Kline, 2015).

Reliability

Reliability was estimated by calculating Cronbach’s alpha (Cronbach, 1951) and omega for categorical items (Green & Yang, 2009). In both cases, values greater than .70 indicate adequate reliability (Viladrich et al., 2017).

Measurement Invariance

Measurement invariance assessment was performed using the approximate measurement invariance (AMI) approach. This method considers that the factorial loads and the intercepts are not identical between the groups and, therefore, the presence of small differences between parameters is acceptable (Byrne & van de Vijver, 2017; Fischer & Karl, 2019; Lomazzi, 2018). Within this approach, multigroup factor analysis alignment was chosen to test for invariance (Asparouhov & Muthén, 2014). First, the fit of a configurational model that did not present restrictions between groups was evaluated. Second, the configurational model was optimized with a component loss function to minimize the invariance between the means of each factor and the variances of the groups (Asparouhov & Muthén, 2014). Invariance tolerance criteria were established for factor loadings (λ = .40) and intercepts (ν = .20) (Robitzsch, 2020). The alignment power for the parameters was .25 (Fischer & Karl, 2019). The evaluation of the equivalence of the parameters was carried out based on the interpretation of the R2 index, where values ​​close to 1 express greater invariance; while values ​​close to 0 express less invariance (Asparouhov & Muthén, 2014). The presence of up to 25% of non-invariant parameters (λ and ν) is adequate to consider the lack of invariance of a scale (Asparouhov & Muthén, 2014).

Analysis Based on Item Response Theory

Analyzes based on Item Response Theory (IRT) were performed with the 2-Parameter Graded Response Model (2-PLM) for ordinal items with three or more response alternatives (GRM, Samejima, 1997) (Hambleton et al., 2010). Discrimination (a) and difficulty (b) parameters were estimated. The parameter a indicates the variation of the answers of the items according to the level of the trait. The parameter b indicates the amount of trait necessary for the item to be answered in a specific way. Because the PIL-SF has 7 response categories, there are 6 difficulty estimates, one for each threshold. Item and Test Information Curves (IIC and TIC respectively) were calculated.

Statistical Package Used

Statistical analysis was performed in the RStudio environment for R. For the CFA, the “lavaan” package was used (Rosseel, 2012); while the “sirt” (Robitzsch, 2020) and “ltm” (Rizopoulos, 2006) packages were used for the Alignment and GRM methods, respectively.

Results

Descriptive Analysis

Table 2 shows that the mean scores of the PIL-SF items indicate a tendency toward high levels of purpose in life in all country samples. All mean scores were greater than 5, on a scale of 1 to 7. This is especially observed in items 1 (Enthusiasm) and 2 (Excitement in life), which have the highest mean scores in most countries. Skewness and kurtosis values are adequate in most countries (g1 < ±2; g2 < ±7). This gives evidence of the univariate normality of the data. On the other hand, the items present moderate and high correlations in all countries. Thus, evidence of convergent internal validity is provided. Furthermore, since the correlations are not greater than .80, the items are not redundant measures of the purpose in life construct (internal discriminant validity).

Table 2 Descriptive analysis of the items by nationality of the participants

Validity Based on Internal Structure

Table 3 shows that the unidimensional PIL-SF model presents adequate fit indices in all countries, especially in Ecuador (RMSEA = .000 [.000–.056]; CFI = 1.00; TLI = 1.00), Mexico (RMSEA = .000 [.000–.059]; CFI = 1.00; TLI = 1.00) and Paraguay (RMSEA = .000 [.000–.083]; CFI = 1.00; TLI = 1.00). In addition, all items have high factor weights in all countries.

Table 3 Fit indices, factorial weights and reliability of the models by nationality of the participants

Scale Reliability

As mentioned, the alpha and omega coefficients for categorical items were used to assess the reliability of the PIL-SF. The scale shows adequate reliability indices in all countries (α = .83 ̶ .88; ω = .84 ̶ .87) (Table 3).

Factor Invariance by Country

The Alignment method showed that the factorial structure of the PIL-SF was invariant for the factorial loads (R2 = .99) and the intersections of the items (R2 = .99), as can be seen in Table 4. The finding of a single non-invariant parameter indicates that the low percentage of non-invariant parameters (3.6%). Therefore, these findings demonstrate the metric and scalar invariance of the scale.

Table 4 ML Invariance alignment (IA) in all seven countries

Item Response Theory Model

The discrimination parameters of the items have values greater than 1, which indicates adequate discrimination (Table 5) (Zickar et al., 2002). As for the difficulty parameters, all the threshold estimators increased monotonically.

Table 5 Discrimination and difficulty parameters for scale items

Figure 1 allows observing the IIC where it is indicated that item 4 is the most precise to measure the purpose of life. In addition, the TIC would indicate that the PIL-SF is more reliable in a scale range between −3 and .5.

Fig. 1
figure 1

Item and Test Information Curves for the Scale

Discussion

The aim of this study was to examine the cross-cultural measurement invariance of the PIL-SF in samples from seven Latin American countries. Rigorous approaches were adopted to assess whether a single latent factor model would explain the covariances between PIL-SF items across countries. First, the unidimensional model of the PIL-SF was tested and shown to fit the data very well across countries. This provides evidence that the PIL-SF measures a single underlying construct. In this sense, the sum of the PIL-SF item scores provides an overall measure of purpose in life. In addition, the factor loadings of the items were high, which provided further support for the internal structure-based validity of the PIL-SF. Thus, it appears that the PIL-SF is a unidimensional and brief instrument for assessing meaning in life in the participating countries. It has been suggested that instruments with more items may have better psychometric evidence than shorter ones (Gosling et al., 2003). However, in many situations it is not practical to use longer instruments, for example, in Internet-based studies with a longitudinal design or those in which many variables are used (Robins et al., 2001). Short instruments, such as the PIL-SF, take less time to answer, are cheaper, easier to administer, and reduce boredom (Herzberg & Brähler, 2006; Joseph et al., 2004). Also, the PIL-SF was shown to be equally reliable in all 7 countries. These results are similar to those reported in previous studies (García-Alandete et al., 2017; Pacak-Vedel et al., 2021; Schulenberg et al., 2011). The findings support the factorial structure and reliability in different countries and contribute to expanding the use of the PIL-SF, both for use in research and in clinical settings.

In addition, the invariance of the PIL-SF factorial structure in the seven countries with a new alignment optimization method. This method identifies non-invariant parameters and does not require exact invariance or different model modifications to make reliable comparisons between means (Asparouhov & Muthén, 2014). Overall, the results provided evidence of metric and scalar invariance of the PIL-SF. The findings suggest that the PIL-SF evaluates the same purpose in life construct in the participating Latin American countries. This is the first research study to test the measurement invariance of the PIL-SF across countries. The findings would make it possible to avoid errors when comparing the purpose of life between different countries. This allows for more solid conclusions in cross-cultural research on the meaning people find in life in Latin America. Therefore, those who wish to use the PIL-SF to assess purpose in life across Latin American countries can be assured that the differences obtained are a true reflection of the differences in the purpose in life construct and not a measurement error. In addition, there is also more evidence to evaluate the relationships between purpose in life and other theoretically and culturally relevant variables in all participating countries. Using the alignment optimization method, item intersections were reported to have a greater amount of invariance compared to factor loadings. Furthermore, the results may provide important information for the continued development of the PIL-SF. For example, item 3 was the least invariant in El Salvador and dealt with the achievement of life goals. This could be explained by the fact that personal goals vary according to different developmental stages, roles and life situations (Salmela-Aro et al., 2007; Zhang et al., 2009). Similarly, it is possible that this lack of invariance is also explained by differences in the characteristics of the sample across countries. For example, El Salvador was one of the countries with the lowest average age. This is important because personal goals change over time. Younger people tend to have more self-centered and education-related goals. Older people have goals more related to classic developmental tasks, such as starting a family (Krings et al., 2008).

In this study, Item Response Theory (IRT) was used to assess the characteristics of the PIL-SF items. IRT analysis allows for item and latent trait estimates that are independent of sample characteristics, trait-level standard errors, and item content (Hays et al., 2000). Evidence was provided about the high informativeness and adequate discrimination provided by each item of the PIL-SF, covering a wide range of the latent purpose in life construct. The findings indicate that people who respond to the highest response categories need a greater purpose in life. That is, the PIL-SF items are useful for determining people with a relatively high degree of purpose in life. Identifying people with higher levels of purpose in life would allow them to be part of intervention programs, being a support and model for those who have low levels. Likewise, identifying these people would allow them to be guided to maintain or reinforce their purpose in life. In addition, the information function of the PIL-SF indicated an adequate degree of accuracy when the level of purpose in life was medium to high. Specifically, item 4, which refers to the discovery of clear goals and a satisfactory life purpose, is the most accurate item of the PIL-SF to assess the latent trait life purpose. This can be explained by the fact that the study of the meaning people find in life is based on two main factors: the presence of a life purpose, which is experienced when a person understands himself/herself, understands the world around him/her and identifies his/her purpose in it, and the search for meaning (Steger et al., 2006).

Strengths and Limitations

This study has several strengths. First, it was the first to examine the measurement invariance of the PIL-SF in different Latin American countries. Second, it has made an important contribution to increasing the scope of application of the PIL-SF in the Latin American context. This is valuable because of the evidence that the scale is related to well-being (Pacak-Vedel et al., 2021). In addition, samples from different countries have been used, which helps to make the findings more robust. Also, robust statistical analyses have been performed, which have considered the ordinal level of items, as well as good and novel practices for assessing invariance across many groups. However, it is important to consider the limitations of the study. First, samples from only seven Latin American countries were accessed, mostly from South America (Argentina, Colombia, Ecuador, Paraguay and Uruguay) and only one each from North America (Mexico) and Central America (El Salvador). This leads to the possibility of examining in the future whether the results can be generalized to other Latin American countries. Similarly, it would be ideal to test the measurement invariance of the PIL-SF in more diverse countries and languages, such as those in little-studied contexts like Africa and the Middle East. Second, the use of non-probabilistic convenience sampling has not allowed us to obtain representative samples from each country. This has therefore made it impossible to generalize the results to the general population. Third, possible differences in the meaning of life between subgroups within countries (e.g., between people of different sexes and ages) were not assessed. Previously, it has been reported that women have higher meaning of life scores than men (Hamama & Hamama-Raz, 2021). Additionally, there is greater meaning found in life as age increases (Dhanjal, 2019). Fourth, for the most part, the participants in each country had completed university studies. Forthcoming studies should examine whether the invariance of PIL-SF holds in samples with lower educational attainment in different countries. It has been suggested that university students do not necessarily represent the general population of a country, often being a privileged group (Vogel et al., 2013). This could influence the degree to which people give meaning to their lives compared to people of other educational levels. In addition, people with lower educational levels were likely to have limited access to the internet. Fifth, because the study was conducted during the COVID-19 pandemic, infection and death rates were different in each participating country. This may have affected the meaning people gave to their lives. This leads to the suggestion that, future work could compare the structure of PIL-SF within countries, but at different time intervals. Sixth, the study used a cross-sectional design, so future research should use longitudinal designs to assess the temporal stability of the PIL-SF. Finally, the data were self-reported, so responses could have been affected by recall bias or social desirability.

Theoretical Implications

At the theoretical level, the findings confirm the unidimensionality of the purpose in life construct measured with the PIL-SF. As mentioned before, the 4 items have been grouped in the same factor that assesses the presence and realization of life goals/purposes (Molcar & Stuempfig, 1988; Steger, 2006). The fact that the 4 items are grouped into one factor is consistent with Frankl’s proposal, who mentions that when we are aware of our goals or purposes in life and our way of living is consistent with those goals, we are more likely to perceive our life as meaningful (Schulenberg et al., 2008). This finding is also consistent with other studies that recommend evaluating life purpose as a unidimensional concept (Marsh et al., 2003; Simkin et al., 2018; Steger, 2006). Similarly, having a one-dimensional measure indicates that each PIL-SF item reflects a single latent construct (Gefen, 2003). If the PIL-SF had one or more items that measure other aspects related to purpose in life, your total score should also include information on these other aspects. This would lead to a misinterpretation of the PIL-SF (Ziegler & Hagemann, 2015). Likewise, identifying that item three is the least invariant among all the items would lead to this item being chosen as the anchor or reference item when constructing a cross-country CFA model.

Practical Implications

The findings have several practical implications. First, the PIL-SF can be used in epidemiological studies to quickly determine the relationship between purpose in life and other constructs in the countries that participated in this study. Second, the PIL-SF can be used as an early screening measure to identify groups of people with low purpose in life or to identify those who are likely to benefit most from increased purpose in life. Third, having a consistent measure across a group of Latin American countries can be useful to assess whether or not an intervention has had a desired effect on the development of purpose in life. In this way, evidence-based information can be obtained to support the development of policies to improve the mental health and well-being of the population. Fourth, the findings provide the opportunity for meaningful comparisons of latent means to detect variables that may influence meaning in life. However, the measurement and comparison of purpose in life across countries should be done on the basis of approximate invariance.

Conclusion

In conclusion, this study provided evidence of invariance of the PIL-SF, a psychometric measure of the purpose in life construct, in seven Latin American countries. Thus, the PIL-SF items are interpreted in the same way by people regardless of the country to which they belong. However, future studies could gather more evidence on the nature of the PIL-SF items by testing the one-factor model in other cultures less and more distant from those of the countries evaluated here.