Articles

RECONSTRUCTING THE HISTORY OF ENERGY CONDITION VIOLATION FROM OBSERVATIONAL DATA

, , and

Published 2012 June 18 © 2012. The American Astronomical Society. All rights reserved.
, , Citation Chao-Jian Wu et al 2012 ApJ 753 97 DOI 10.1088/0004-637X/753/2/97

0004-637X/753/2/97

ABSTRACT

We study the likelihood of energy condition violations in the history of the universe. Our method is based on a set of functions that characterize energy condition violation. Friedmann–Lemaître–Robertson–Walker cosmological models are built around these "indication functions." By computing the Fisher matrix of model parameters using Type Ia supernova and Hubble parameter data, we extract the principal modes of these functions' redshift evolution. These modes allow us to obtain general reconstructions of energy condition violation history independent of the dark energy model. We find that the data suggest a history of strong energy condition violation, but the null and dominant energy conditions are likely to be fulfilled. Implications for dark energy models are discussed.

Export citation and abstract BibTeX RIS

1. INTRODUCTION

The energy conditions introduced by Hawking & Ellis (1973, chapter 4) are coordinate-invariant inequality constraints on the energy-momentum tensor that appears in the Einstein field equation. They play an essential role in the proof of theorems in the general theory of relativity concerning the existence of spacetime singularities (see Wald 1984). Due to their simplicity and model independence, the energy conditions are listed as one of many approaches to understand the evolution of the universe.

Of the many proposed energy conditions, the ones we analyze in this paper are the null, the strong, and the dominant energy conditions (abbreviated, respectively, as NEC, SEC, and DEC). These energy conditions can be expressed as follows (Carroll 2004, chapter 4) for a general stress-energy tensor T:

  • 1.  
    NEC: Tαβnαnβ ⩾ 0 for all null vectors n;
  • 2.  
    DEC: Tαβtαtβ ⩾ 0 and TαβTβγtαtγ ⩽ 0 for all timelike vectors t; and
  • 3.  
    SEC: $T_{\alpha \beta } t^{\alpha } t^{\beta } \ge \frac{1}{2} T^{\gamma }_{\gamma } t^{\delta } t_{\delta }$ for all timelike vectors t.

The energy conditions are of great relevance to the study of cosmology. For example, the NEC is an important condition of stability for fluids (Dubovsky et al. 2006; Buniy et al. 2006), hence it is useful for the analysis of cosmological models. The DEC guarantees stability of a source obeying it and imposes on the dark energy equation of state parameter w a lower bound w ⩾ −1 (Carroll et al. 2003). Violation of the w ⩾ −1 bound can lead to the "big rip doomsday" scenario of the universe (Caldwell et al. 2003). It can also be shown that sudden future singularity solutions of the Friedmann–Lemaître–Robertson–Walker (FLRW) cosmology always violate the DEC (Lake 2004). Finally, SEC violation is a typical trait of a positive cosmological constant Λ (for example, see Li et al. 2011) and other dark energy models (see Schuecker et al. 2003, and references therein). We therefore find an investigation of these energy conditions useful for our understanding of the universe's evolution.

The knowledge of cosmic energy condition violation has been greatly facilitated by precise observational data. Visser (1997a, 1997b) shows that Hubble constant and stellar age measurements, which bound the age of the universe, suggest a history of SEC violation (see also the review by Krauss & Chaboyer 2003). X-ray galaxy cluster number count and Type Ia supernova (SNIa) luminosity distance data have been applied to a constant-w model, which also suggest SEC violation without significant indication for NEC violation (Schuecker et al. 2003). Similar results are also obtained by Lima et al. (2008a) using available SNIa data at that time. On the other hand, if one assumes the energy conditions, then constraints on a variety of cosmological observables or parameters can be predicted, such as the Hubble parameter, luminosity and angular diameter distances, look-back time, total density parameter Ω(z), energy density ρ(z), and pressure p(z) (Cattoën & Visser 2008).

In this paper, we study the issue of energy condition violations from a data-driven perspective. Specifically, we aim at reconstructing what we can say about the likelihood of energy condition fulfillment (or violation) given the data. To achieve this, we use the energy conditions themselves to construct a family of descriptive cosmological models for the recent history of cosmic expansion, and subject them to a statistical test. Throughout our paper, we assume a universe of perfect fluids that lead to the spacetime solution characterized by the FLRW metric. We use the Union2 SNIa luminosity distance data compilation (Amanullah et al. 2010) and the observational Hubble parameter data from differential ages of passively evolving galaxies (Simon et al. 2005; Stern et al. 2010) and radial baryon acoustic oscillation measurements (Gaztañaga et al. 2009). The cosmological parameters used in this paper are Ωm = 0.274, ΩΛ = 0.726, and Hubble constant H0 = 74.2 km s−1 Mpc−1.

This paper is organized as follows. In Section 2, we specify our models built from our indication functions of energy condition violation and provide solutions for the cosmic expansion rate. We proceed to Section 3, where we lay out the procedures of analysis using luminosity distance and Hubble parameter data. Our main results are presented in Section 4 and are discussed in Section 5.

2. ENERGY CONDITIONS AND COSMOLOGICAL MODELS

2.1. Overview

As summarized by Carroll (2004, chapter 4), the energy conditions can be expressed by simple inequalities in terms of the energy density ρ and the pressure p in the setting of a homogeneous and isotropic FLRW universe:

  • 1.  
    NEC: ρ + p ⩾ 0,
  • 2.  
    DEC: ρ ⩾ 0 and −ρ ⩽ p ⩽ ρ, and
  • 3.  
    SEC: ρ + 3p ⩾ 0 and ρ + p ⩾ 0.

By virtue of the Friedmann equation, the energy condition bounds can be expressed in terms of the Hubble constant-normalized, dimensionless expansion rate E(z) = H(z)/H0 and its first derivative, or the deceleration parameter defined by

Equation (1)

The results, following Lima et al. (2008a), are given below:

  • 1.  
    NEC:
    Equation (2)
  • 2.  
    DEC:
    Equation (3)
  • 3.  
    SEC:
    Equation (4)

A notable feature of these bounds is that they are only explicitly dependent on one arbitrary constant, namely, the curvature parameter Ωk, although the matter content of the FLRW universe does control the functional form of E(z). However, for our purpose, we do not need to make assumptions on the constituent matter of the universe being modeled.

2.2. Models and Their Solutions for E(z)

We are therefore motivated to introduce the "indication functions" F(z) that quantify whether an energy condition has been violated. Fulfillment (or violation) of an energy condition should be indicated by F(z) ⩾ 0 (or F(z) < 0). The simplest choices are just the left-hand sides of inequalities (2)–(4). We denote them as FNEC, FDEC, and FSEC, respectively:

  • 1.  
    NEC:
    Equation (5)
  • 2.  
    DEC:
    Equation (6)
  • 3.  
    SEC:
    Equation (7)

With the introduction of these indication functions, inequalities (2)–(4) become ordinary differential equations that can be integrated to find E(z). We may look at them as mappings from the set of dimensionless Hubble expansion rates E(z), to that of indication functions F(z), and vice versa. These mappings enable us to reconstruct F(z) from observables related to E(z).

Our formalism so far remains general enough for any model that makes a prediction on E(z). In particular, we will concentrate on the underlying evolution of E(z) that is favored by observational data.4 This approach has been used on the study of possible evolution of dark energy (Huterer & Starkman 2003; Huterer & Cooray 2005) and the reconstruction of historical deceleration parameter q(z) (Shapiro & Turner 2006).

Following this line of inquiry, we can use a similar method to reconstruct the indication functions F(z), where the function to be reconstructed is "coarse-grained" as piecewise constant. While this can be done using more elaborate functions such as the smooth, hyperbolic tangent function (Crittenden et al. 2009; Zhao & Zhang 2010), we find that the piecewise-constant functions greatly simplify the integration of Equations (5)–(7). In the interval in which F(z) is constant, F(z) = r, the general solutions are given by

  • 1.  
    NEC:
    Equation (8)
  • 2.  
    DEC:
    Equation (9)
  • 3.  
    SEC:
    Equation (10)

where C stands for the integration constant. To obtain the special solutions of E(z) for the piecewise-constant F(z) models extending from z = 0 to an arbitrary redshift, we can recursively apply the above solutions and the continuity condition of E(z) across subinterval endpoints starting with E(z = 0) = 1, thereby fixing the integration constants.

3. ANALYSIS

As we have briefly stated in the preceding section, we use the Fisher matrix of the full array of model parameters to extract a small number of principal components that preserve the information contained in the data without introducing serious overparameterization. The Fisher matrix elements Fij for the model parameters (not to be confused with the symbol for the indication functions F) are expressed by

Equation (11)

where P is the posterior probability density function (PDF) in the parameter space and θi is the ith model parameter, and the angled brackets stand for statistical averaging. The principal components correspond to the eigenvectors of the Fisher matrix. The posterior probability P is to be found from observational data.

3.1. Posterior Probability

To construct the posterior probability P from observational data, we assume that the uncertainties assigned to the measurement results are Gaussian. Under this assumption, the posterior probability can be expressed by an additive χ2 statistic with χ2 = −2ln P. The generic form of χ2 under this assumption can be laid out as follows:

Equation (12)

where the symbols Xth, obs denote, respectively, the theoretical or observed value of the ith observable (direct or indirect), and σi is the ith Gaussian variance or uncertainty given by the data. The summation is done over all individual data entries.

In dealing with parameterized models, it is natural to introduce nuisance parameters to be eliminated later by marginalization. The nuisance parameters and their marginalization distort the simple form of Equation (12), as well as introduce computational complexities that often call for Monte Carlo techniques. Nevertheless, before engaging any numerical computations, we can eliminate certain nuisance parameters by analytical marginalization, as illustrated in the following sections.

3.1.1. Luminosity Distance

The luminosity distance data, such as the Union2 data set that we use in this paper, are usually presented as tabulated distance moduli with errors. Physically, the luminosity distance modulus is the difference of apparent magnitude m and the absolute magnitude M,

Equation (13)

with dL measured in units of 10 pc. It can be related to the modeled expansion rate of the universe, E(z), by the formula

Equation (14)

where the sinn function is a shorthand for the definition

Equation (15)

For notational convenience, we employ a variable

Equation (16)

where the parametric dependency on the Hubble constant H0 has been separated out, leaving a dimensionless quantity as our model prediction. With this definition, the modeled distance modulus can be expressed as

Equation (17)

In this expression, we introduce a numerical constant M0 to accommodate the numerical constants arising from unit conversions, as well as an additive variable characterizing the uncertain variability or spread of the standard candles' absolute magnitudes. By doing so, we include M0 as another nuisance parameter which affects the overall scaling of the modeled cosmological distance.

The χ2 statistic for a luminosity distance modulus data set, according to Equation (12), is therefore

Equation (18)

The above expression is quadratic in the combination of nuisance parameters (5 lg H0 + M0):

Equation (19)

where, for notational simplicity, we have defined the symbols

Equation (20)

The nuisance parameters H0 and M0 can now be marginalized over by Gaussian integration over (− , ), leaving a "reduced χ2" in the form of

Equation (21)

This expression of $\tilde{\chi }^2$ is a general result for luminosity distance data (di Pietro & Claeskens 2003; Nesseris & Perivolaropoulos 2004).

3.1.2. Hubble Parameter

The observational Hubble parameter data we use are tabulated in Zhang et al. (2010) and Ma & Zhang (2011) and form an independent data set. As noted by Zhang & Zhu (2008), the Hubble parameter data could help us catch possible "wiggles" of E(z) predicted by certain dark energy models that would have been flattened out by the luminosity distance test. For the Hubble parameter data, our model requires Equation (12) to be in the form of

Equation (22)

We adopt the result of H0 = 74.2  ±  3.6 km s−1 Mpc−1 measured from nearby Cepheids by the Hubble Space Telescope (Riess et al. 2009) as a Gaussian prior, and use the analytic expression given by Ma & Zhang (2011) to construct the $\tilde{\chi }^2$ with H0 marginalized over. This statistic from H(z) data is added to the one from SNIa data (Equation (21)) to give the final posterior log-PDF.

3.2. Redshift Binning

Ideally, the coarse-graining approximation of the F(z) evolution should approach continuity as its limit. However, for our analysis using the data of 557 SNeIa from Union2 and the 13 H(z) measurements, the redshift distribution of data poses a limit on how fine we can dissect the redshift range into bins, and hence the resolution of our reconstruction of F(z). Each bin should cover enough data to make the value of F(z) well constrained inside of it. To this end, we introduce 11 bins equally spaced to cover the redshift range, shown in Figure 1. The first 10 bins are kept to be equal in width, but the last one is widened to cover the diminishing tail of the SNIa redshift distribution.

Figure 1.

Figure 1. Binning of the redshift range to accommodate enough data in each of the bins. The histogram shows the number of SNIa data points distributed across the redshift range, with each bin covering width 0.1.

Standard image High-resolution image

3.3. Evaluation of the Fisher Matrix

The Fisher matrix elements as defined by Equation (11) are statistical averages involving unknown parameters. However, for our purpose, we cannot evaluate them by averaging in the absence of a known posterior. In fact, it is neither practical nor logical to do this, as our final models to assess the evaluation of energy conditions are not the model described so far per se, but the models to be built from the principal components extracted from the Fisher matrix.

In practice, the Fisher matrix elements can be estimated at a point in the parameter space where the posterior PDF is close to the maximum. In other words, we can approximate the numerical values of the Fisher matrix elements using a fiducial model, which we choose to be the concordance flat ΛCDM model with Ωm = 0.274, ΩΛ = 0.726 as favored by combined observational data (Amanullah et al. 2010; Komatsu et al. 2011) and H0 = 74.2 km s−1 Mpc−1 from (Riess et al. 2009). In a similar setting, Shapiro & Turner (2006) used an even simpler, constant-input model as the fiducial one in their analysis, and found the method to be robust against the introduction of a simplified fiducial. Our own findings are in agreement with their claim.

To add robustness and safeguard against limited sampling in the parameter space, we use a resampling technique (Efron 1982) to check for any possible bias resulting from not treating the averaging in Equation (11) rigorously. The result confirms the robustness of the Fisher matrix estimation, and the average from the resampled Fisher matrices are used as the "reference stack" in subsequent steps.

The Fisher matrix elements are thus calculated using the $\tilde{\chi }^2$ (or posterior log-PDF) expressions given in the preceding Sections 3.1.1 and 3.1.2. Note that the normalization constants discarded in the previous steps do not contribute to the matrix elements because they are additive constants in the log-PDF independent of the parameters, and their derivatives vanish.

4. RESULTS

We numerically compute the (N + 1)  ×  (N + 1) Fisher matrix for F(z) in each of the N bins and Ωk. At this stage, the Ωk parameter can be effectively marginalized over by projecting the matrix onto the N × N subspace (Dodelson 2003, chapter 11). The eigenvectors of the resulting matrix, fi, are then retrieved. In this manner, we can thus build a chain of models with successively more parameters. The eigenvectors are the principal components we are after, and they are shown in Figures 24 for three energy conditions. Then, the reconstruction we need can be expressed as a linear combination of the principal components,

Equation (23)

Notably, the first three principal components appears stable regardless of which energy condition the reconstruction is for. However, the higher-order and noisier modes show greater change in the form with respect to the energy condition being reconstructed. Furthermore, when reconstructing F(z) by fitting the linear expression (Equation (23)), the curvature parameter Ωk appearing in Equations (5) and (6) has been marginalized with numerical integration.

Figure 2.

Figure 2. Four well-constrained principal components of FNEC(z). They are normalized so that ∫f2i(z)dz = 1. fi's (i = 1, 2, 3, 4) are the first four eigenvectors of the Fisher matrix.

Standard image High-resolution image
Figure 3.

Figure 3. Same as Figure 2 but for FDEC(z).

Standard image High-resolution image
Figure 4.

Figure 4. Same as Figure 2 but for FSEC(z).

Standard image High-resolution image

The reconstructed F(z) for the three energy conditions are shown in Figures 5, 6, and 7. We have used only the first three principal components in the fitting of F(z). The fitted values of the coefficients are listed in Table 1.

Figure 5.

Figure 5. Reconstruction of FNEC(z) using the first three eigenvectors. The central line in each bin is the best three-mode fitting result. The shaded bands show 1σ and 2σ uncertainties.

Standard image High-resolution image
Figure 6.

Figure 6. Same as Figure 5 but for FDEC(z).

Standard image High-resolution image
Figure 7.

Figure 7. Same as Figure 5 but for FSEC(z).

Standard image High-resolution image

Table 1. Best-fit Coefficients of the First Three Principal Components

ai NEC DEC SEC
a1 −0.51 ± 0.08 2.40 ± 0.10 0.88 ± 0.08
a2 0.12 ± 0.21 −0.31 ± 0.22 −0.18 ± 0.21
a3 −0.16 ± 0.45 −0.25 ± 0.50 −0.43 ± 0.40

Download table as:  ASCIITypeset image

This cutoff from the full spectrum of principal components introduces a well-known artifact, namely, a bias at the higher end of the redshift (Shapiro & Turner 2006) that suppresses the reconstructed value and its error bars toward zero. Nevertheless, we note that a value of F(z) close to zero is consistent with our lack of knowledge about the violation of energy conditions by the very construction of F(z). In other words, a vanishing value of F(z) tells us that our chance of finding energy condition fulfillment is not significantly higher or lower than that of finding its violation. Therefore, this bias in the value of F(z) does not tempt us with false claims about energy condition violations, but merely restates our ignorance about it at redshift ranges where there are insufficient data.

5. CONCLUSION AND DISCUSSIONS

We have explored the likelihood of energy condition violation during the evolution of the universe using the indication functions we proposed in Section 2.1. The indication functions are reconstructed from SNIa and H(z) data.

Our result for NEC does not prefer a history of violation, as is evident from Figure 5. A slight but rising trend can be seen from z ≈ 0.7 up to now. This trend is more pronounced in our result for DEC presented in Figure 6, and we can be fairly certain that there has been no DEC violation for z ⩽ 0.5.

However, the result for SEC shows fairly strong indication of violation. This shows the view of an accelerating cosmic expansion is compatible with our combination of data, especially for low redshift (z ⩽ 0.3). Although we have noted the diminishing ability to distinguish energy condition fulfillment from violation at the high-z end, we nevertheless find that our result for SEC hints at a recent transition from deceleration (FSEC(z) = q(z) > 0) to acceleration (FSEC(z) = q(z) < 0) with the transition redshift z ≈ 0.5, if we ignore the probably biased result at the high-redshift end. This estimation of transition redshift is consistent with the transition redshifts reported by Riess et al. (2004) and Shapiro & Turner (2006) using a kinematic model with q(z) linear in z.

For all energy conditions considered, our results show qualitative agreement with previous studies (Visser 1997a, 1997b; Visser & Barceló 2000; Schuecker et al. 2003; Cattoën & Visser 2008). In particular, the clear trend of DEC fulfillment at low redshift is similar to what was found by Lima et al. (2008a, 2008b).

These results are obtained without reference to any specific form of dark energy. With this in mind, we can use these results to assess the feasibility of theoretical dark energy models.

Some models predict that the dark energy component undergoes a transition into the "phantom" phase with the equation of state parameter w < −1 (Štefančić 2005 and references therein). It has been noted that NEC (hence necessarily DEC) violation is a feature of these models, and some oscillating universe models predict episodes of NEC violation throughout the history (Cai et al. 2010). Even if certain phenomenological models could offer a fairly good fit (Xia et al. 2005), our data-driven result for NEC suggests that there is not enough motivation to consider them as a viable explanation for cosmic acceleration, at least not in the redshift range investigated in this paper.

On the other hand, a family of dark energy models exist that do not violate NEC or DEC, yet display transient periods of SEC violation (Barrow et al. 2000; Blais & Polarski 2004; Barenboim et al. 2006). While such scenarios are unlikely to have been realized from the early universe up to z ≳ 1 (Linder 2010; Linder & Smith 2011), from an energy condition point of view they are not ruled out by our independent results in low redshift. Whether the "returning" FSEC curve in Figure 7 within z < 0.4 suggests a coming back to SEC fulfillment, which has been independently noted earlier (Shafieloo et al. 2009; Guimarães & Lima 2011) for spatially flat models, may be worth examining using future data.

As we have already noted in Section 4, our test is sensitive in low redshift but not in high redshift. In the future, this could be partially alleviated by the availability of high-quality cosmological data that could populate higher redshift.

Nevertheless, our work addresses several deficiencies in previous literature and has some special merits.

First, by proposing a set of new models based on our indication functions, we are able to visualize the evolution of the universe as a history of energy condition violation or fulfillment in a straightforward manner. By construction, we can avoid ambiguous statements such as "SEC implies H0(zf ≈ 15) ⩽ 58 ± 7 km s−1 Mpc−1" (paraphrasing Equation (79) of Visser 1997b), where the inequality nature of the energy conditions finds some difficulty being used with error bounds from observational data. Compared with the research done by Lima et al. (2008a), our models and their presentations can be more readily interpreted as answers of "whether an energy condition was violated for a given epoch," because our indication functions stay constant in each redshift bin.

Second, our models are general and are free from overly restrictive presumptions on the nature of the dark energy. It stands in contrast to previous studies by Schuecker et al. (2003), where a constant equation of state parameter model was used to study the possibility of energy condition violation.

Third, by incorporating the Fisher matrix formalism in our analysis, we find an optimal set of bases for the analysis of energy condition violation in the sense that the coefficients for different modes are nearly uncorrelated. When we generate this set of bases, the effects of Ωk are naturally and explicitly taken into account. Compared with the parameterization of Shapiro & Turner (2006), our method is more generally applicable because we do not presume spatial flatness, and has a unified description for all the energy conditions considered. We apply this formalism to the combination of SNIa and H(z) data, and this can be extended to other suitable cosmological probes when the data become available.

C.-J. Wu is grateful to Liang Gao, Xiang-Ping Wu, Hong Wu, and Ming-Jian Zhang for their kind help. C. Ma thanks Hao Wang and Kai Liao for valuable discussions. This work was supported by the National Science Foundation of China (grant No. 11173006), the Ministry of Science and Technology National Basic Science program (project 973) under grant No. 2012CB821804, and the Fundamental Research Funds for the Central Universities.

Footnotes

  • The practice of deriving a model of cosmic expansion by phenomenologically treating the model parameters without physical prescriptions is widely known as the "model-independent" approach, which does not actually exclude the use of a model.

Please wait… references are loading.
10.1088/0004-637X/753/2/97