Characterization and Modeling of Contamination for Lyman Break Galaxy Samples at High Redshift

, , , , , , and

Published 2017 February 24 © 2017. The American Astronomical Society. All rights reserved.
, , Citation Benedetta Vulcani et al 2017 ApJ 836 239 DOI 10.3847/1538-4357/aa5caf

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/836/2/239

Abstract

The selection of high-redshift sources from broadband photometry using the Lyman-break galaxy (LBG) technique is a well established methodology, but the characterization of its contamination for the faintest sources is still incomplete. We use the optical and near-IR data from four (ultra)deep Hubble Space Telescope legacy fields to investigate the contamination fraction of LBG samples at $z\sim 5\mbox{--}8$ selected using a color–color method. Our approach is based on characterizing the number count distribution of interloper sources, that is, galaxies with colors similar to those of LBGs, but showing detection at wavelengths shorter than the spectral break. Without sufficient sensitivity at bluer wavelengths, a subset of interlopers may not be properly classified, and contaminate the LBG selection. The surface density of interlopers in the sky gets steeper with increasing redshift of LBG selections. Since the intrinsic number of dropouts decreases significantly with increasing redshift, this implies increasing contamination from misclassified interlopers with increasing redshift, primarily by intermediate redshift sources with unremarkable properties (intermediate ages, lack of ongoing star formation and low/moderate dust content). Using Monte-Carlo simulations, we estimate that the CANDELS deep data have contamination induced by photometric scatter increasing from $\sim 2 \% $ at $z\sim 5$ to $\sim 6 \% $ at $z\sim 8$ for a typical dropout color $\geqslant 1$ mag, with contamination naturally decreasing for a more stringent dropout selection. Contaminants are expected to be located preferentially near the detection limit of surveys, ranging from 0.1 to 0.4 contaminants per arcmin2 at ${J}_{125}$ = 30, depending on the field considered. This analysis suggests that the impact of contamination in future studies of $z\gt 10$ galaxies needs to be carefully considered.

Export citation and abstract BibTeX RIS

1. Introduction

The Lyman-break technique, first proposed by Steidel et al. (1996), transformed the identification of reliable samples of galaxy candidates at high redshift from broadband imaging, and it is now routinely used to study galaxy formation and evolution as early as 500 Myr after the big bang, at redshift $z\sim 10$ (e.g., see Bouwens et al. 2015; Coe et al. 2015; McLeod et al. 2016; Oesch et al. 2016). While one could consider selecting high-redshift samples based on the best-fit photometric redshift or redshift likelihood contours (e.g., McLure et al. 2010; Finkelstein et al. 2012; Bradley et al. 2014), Lyman-break selection procedures utilizing cuts in color space can be simpler to apply and offer a slight advantage in terms of operational transparency. This makes such a selection procedure easier to reproduce by both theorists and observers, as follow-up studies by Shimizu et al. (2014), Lorenzoni et al. (2013), and Schenker et al. (2013) show.

The idea of the method rests on the identification of the strong spectral break introduced by neutral hydrogen atoms along the line of sight at wavelengths shorter than Lyα (1216 Å rest frame).6 Thus, to identify probable sources at high redshift with high confidence, the Lyman-break selection typically resorts to three crucial ingredients: (1) color information from two adjacent passbands to locate the wavelength location and measure the amplitude of the break, (2) color information redward of the break to characterize the intrinsic color of the source, and (3) evidence that sources have no flux blueward of the break.

Many studies have used different color selections, also depending on the availability of the photometric bands (e.g., Giavalisco et al. 2004; Bouwens et al. 2007, 2012a, 2015; Bradley et al. 2012; Castellano et al. 2012; Oesch et al. 2012, 2014, to cite a few), showing how different choices can still lead to comparable results and assessing the strength of the method.

The Lyman-break technique has been applied very successfully to build large samples of galaxies, especially from Hubble Space Telescope (HST) imaging (e.g., more than 10,000 sources identified at $3.5\lesssim z\lesssim 11$ from HST legacy fields to date; see Bouwens et al. 2015). Also, substantial spectroscopic follow-up work has shown that samples are generally reliable, and that contamination from sources with similar colors but different redshift is generally under control (Steidel et al. 1999; Bunker et al. 2003; Malhotra et al. 2005; Dow-Hygelund et al. 2007; Popesso et al. 2009; Vanzella et al. 2009; Stark et al. 2010). Nonetheless, photometrically defined samples are intrinsically affected by contamination (see, e.g, Le Fèvre et al. 2005; Paltani et al. 2007; Le Fèvre et al. 2015; Thomas et al. 2017). While this possibility is universally acknowledged in the literature and specific studies estimate the contamination rate of the samples presented (e.g., Su et al. 2011; Pirzkal et al. 2013; Bouwens et al. 2015), surprizingly few studies have been devoted to a detailed characterization of the contamination rate and of its dependence on survey parameters and redshift of the galaxy population. Potential classes of contaminants that have been identified include stellar sources, low-redshift galaxies with prominent 4000 Å/Balmer breaks and dust, extreme emission line galaxies, time-variable sources such as supernovae, with the first two classes of objects representing the major risks (Stanway et al. 2008; Atek et al. 2011; Bowler et al. 2012).

Dwarf stars have colors similar to those of  high-redshift galaxies because of their low surface temperatures, and can thus enter dropout samples, especially at $z\geqslant 7$ in data that lack sufficient angular resolution to discriminate point sources from extended light profiles (Stanway et al. 2003; Bouwens et al. 2006; Ouchi et al. 2009; Tilvi et al. 2013; Wilkins et al. 2014). At these redshifts, very-low-temperature stars (sub-types M, L, T, and Y) result in sources that are intrinsically faint, and spectra in which the continuum is interrupted may show large breaks across narrow-wavelength ranges, or in which the flux peaks in narrow regions. While deep medium-band observations are efficient at identifying these stellar contaminants in seeing-limited ground-based observations (Wilkins et al. 2014), HST imaging is generally effective at identifying stellar objects that are detected at signal-to-noise ratios of ${\rm{S}}/{\rm{N}}\gtrsim 10$ (Finkelstein et al. 2010; Bouwens et al. 2011b). In addition, we note that at $z\gt 9$, the contamination from stars is negligible, since there are essentially no observed stars with spectral energy distributions (SEDs) that peak at >1.4 μm and are undetectable in the optical for typical HST surveys (e.g., Oesch et al. 2014).

The main source of contamination for space-based surveys is thus that of low/intermediate redshift galaxies that have a deep break around a 4000 Å rest frame. The nature of these contaminants has not been investigated in detail, but they are likely low-mass, moderate-age, Balmer break galaxies at $z\sim 1\mbox{--}3$ (Wilkins et al. 2010; Hayes et al. 2012), possibly with strong emission lines that contribute, or even dominate, the flux redward of the spectral break (Atek et al. 2011; van der Wel et al. 2011). To effectively discriminate between the high-z Lyman-break and the 4000 Å/Balmer break, Stanway et al. (2008) recommend using a set of non-overlapping, but adjacent, filters, so that a clear color cut can be imposed on the selection. Another key requirement to build a clean sample is the availability of very deep observations blueward of the spectral break to distinguish between a true non-detection for an high-z object and a faint continuum for an interloper (Bouwens et al. 2015).

The goal of this paper is to focus on this class of intermediate redshift interlopers and to empirically quantify their impact on high-z Lyman-break galaxy (LBG) samples selected via a color cut method and characterize how their incidence varies with depth and adopted selection cut. For this, we resort to the optical and near-infrared imaging on the GOODS south deep (GSd), GOODS north wide (GNw) fields observed by the CANDLES program (Grogin et al. 2011) and the XDF (Illingworth et al. 2013) and HUDF09-2 (Bouwens et al. 2012b) fields. These data sets provide us with high-quality multi-wavelength observations over different areas of the sky (from ∼4.7 arcmin2 to ∼64.5 arcmin2). Specifically, we focus on LBG samples from $z\sim 5$ to $z\sim 8$, and investigate the population of galaxies that satisfy the color–color requirements to be included in the LBG selection based on imaging at wavelengths starting from the spectral break, but show a clear detection in bluer filters. We define this class of objects as interlopers and characterize (1) their surface density in the sky depending on luminosity and on the redshift of the dropout selection; (2) the likelihood that fainter counterparts of the known population of interlopers enter an LBG sample because of a lack of sufficiently deep imaging in the blue. We define this population as contaminants.

The results of our analysis, based on some of the deepest Hubble observations available, have multiple applications. In particular, they find applications to the estimation of the contamination rate of other surveys, which may lack the multi-wavelength, multi-observatory coverage, such as random pointings and/or parallel observations (e.g., see Trenti et al. 2011, 2012; Bradley et al. 2012; Schmidt et al. 2014; Calvi et al. 2016). Another important application includes planning and optimization of future observations (e.g., see Mason et al. 2015 for JWST and WFIRST surveys at high-z).

This paper is organized as follows. In Section 2, we introduce our data set and construct the samples of dropouts and interlopers. In Section 3, we analyze and discuss the properties of the contaminants and the expected impact on LBG samples. In Section 4, we discuss how results depend on the selection criteria. We summarize and conclude in Section 5. Throughout the paper, we assume ${{\rm{\Omega }}}_{0}=0.3$, ${{\rm{\Omega }}}_{{\rm{\Lambda }}}=0.7$, and ${H}_{0}=70$ km s−1 Mpc−1. All magnitudes are in the AB system (Oke & Gunn 1983).

2. Data Set and Sample Selection

We base our analysis on four different samples, in order to test how results change with the field used for selection. We use the CANDELS/GSd and CANDELS/GNw imaging (Grogin et al. 2011), the entire XDF data set (Illingworth et al. 2013) and the HUDF09-2 (Bouwens et al. 2012b). A summary of all the data sets used in the present study is provided in Table 1, along with the covered area and the 5σ depths. The latter are drawn from Bouwens et al. (2015) and are based on the median uncertainties in the total fluxes (after correction to total), as found for the faintest 20% of sources in the catalog. As discussed by Bouwens et al. (2015), these depths reflect the actual sensitivity achieved in science images, as established through artificial source recovery simulations (see Bouwens et al. 2015, for details).

Table 1.  Data Sets Used

Field Area 5σ Depth
  (arcmin2) ${B}_{435}\,$ ${V}_{606}$ ${i}_{775}$ ${I}_{814}$ ${z}_{850}$ ${Y}_{105}$ ${J}_{125}$ ${H}_{160}$
CANDELS GOODS 64.5 27.7 28.0 27.5 28.0 27.3 27.5 27.8 27.5
South Deep (GSd)                  
CANDELS GOODS 60.9 27.5 27.7 27.2 27.0 27.2 26.7 26.8 26.7
North Wide (GNw)                  
XDF 4.7 29.6 30.0 29.8 28.7 29.2 29.7 29.3 29.4
HUDF09-2 4.7 28.3 29.3 28.8 28.3 28.8 28.6 28.9 28.7

Note. Data sets used in the analysis along with area covered by each survey and $5\sigma $ depth for the HST observations, obtained from Bouwens et al. (2015), based on median uncertainty in the flux measurements for faint sources.

Download table as:  ASCIITypeset image

We exploit the data reduction and source catalog derived by Bouwens et al. (2015). Data were processed using the ACS GTO pipeline APSIS (Blakeslee et al. 2003) and the WFC3/IR pipeline WFC3RED.PY (Magee et al. 2011), with final science imaging drizzled to a 0farcs03 pixel scale. The photometric catalog has been constructed using SourceExtractor (Bertin & Arnouts 1996) after PSF-matching imaging to the F160W filter. Multi-band photometric information is available in the following optical bands: F435W, F606W, F775W, F814W, and F850LP (hereafter ${B}_{435}$, ${V}_{606}$, ${i}_{775}$, I814, ${z}_{850}$, respectively), as well as in the following near-IR bands: F098M, F105W, F125W, F140W, and F160W (hereafter Y098, ${Y}_{105}$, ${J}_{125}$, JH140, ${H}_{160}$, respectively.) Complete details on data analysis and catalog construction can be found in Bouwens et al. (2015).

To ensure robust results, we limit our analysis to sources with detection in the ${J}_{125}$+${H}_{160}$ bands at high signal-to-noise ratios $[{\rm{S}}/{\rm{N}}({{JH}}_{\det })\gt 6$], defined as

Equation (1)

(Stiavelli 2009), where FLUX and FLUXERR are the isophotal flux and its associated error in the combined detection band, which we indicate as ${{JH}}_{\det }$.7 We note that adopting an even higher S/N limit $[{\rm{S}}/{\rm{N}}({{JH}}_{\det })\gt 8]$ samples would be even purer, but to the detriment of sample statistics.8

In addition, with the goal of focusing on contamination from extended sources, we remove stellar-like sources, that is, all sources with CLASSTAR > 0.85 measured from the detection image (where SourceExtractor assigns CLASSTAR = 0 to (very) extended sources and CLASSTAR = 1 to point sources). We then proceed to select LBG sources at high redshift (or interlopers with similar colors at low redshift). We apply a color cut selection, which is as uniform as possible across samples with different median redshifts, to ensure a consistency in the analysis. The adopted criteria can be summarized as follows.

For $z\sim 5$ candidates

Equation (2)

For $z\sim 6$ candidates

Equation (3)

For $z\sim 7$ candidates

Equation (4)

For $z\sim 8$ candidates

Equation (5)

The color–color selection criteria listed above are not sufficient to construct a sample of galaxies that are confidently at $z\gtrsim 5$ because intermediate redshift galaxies with a prominent spectral break such as the 4000 Å break may also fall into the color–color selection regions typical of LBGs at higher redshift.

Following the standard practice, we use the photometry in the bands bluer than the putative Lyman break to separate high-z sources, which in the following we indicate as dropouts, from lower-redshift galaxies, which we label as interlopers. Specifically, $z\sim 5$ dropouts (named as ${V}_{606}$-dropouts) are selected as sources with S/N(${B}_{435}$) < 2, $z\sim 6$ dropouts (named as ${i}_{775}$-dropouts) with S/N $({B}_{435})\lt 2$ and S/N $({V}_{606})\lt 2$, $z\sim 7$ dropouts (named as ${z}_{850}$-dropouts) and $z\sim 8$ dropouts (named as ${Y}_{105}$ dropouts) with S/N $(x)\lt 2$ and ${\chi }_{x}^{2}\lt 3$, where ${\chi }_{\mathrm{opt}}^{2}$ is defined as

Equation (6)

In the equation FLUXx is the isophotal flux measured in a given band, FLUXERRx the uncertainty associated to the flux, and x is intended to be ${B}_{435}$, ${V}_{606}$, and ${i}_{775}$ bands for ${z}_{850}$-dropouts and ${B}_{435}$, ${V}_{606}$, ${i}_{775}$, and ${I}_{814}$ bands for ${Y}_{105}$-dropouts (see also Bouwens et al. 2011a). In addition, following Bouwens et al. (2015), ${z}_{850}$-dropouts are selected as sources with S/N(${I}_{814}$) < 2, but ${I}_{814}$ is not used for computing the ${\chi }_{\mathrm{opt}}^{2}$.

Finally, if a dropout satisfies more than one dropout selection, we assign it to the highest redshift sample. This additional cut removes only a few percent of the sources (for example, in the GSd data set, we identified 38 cases out of 870 dropouts). In contrast, we do not apply this restriction to interlopers, which thus may enter multiple selections. On average, at most two interlopers appear in two selections, and none appear at the same time in all the samples.

Finally, we highlight that the dropout sample may, in general, contain a residual (small) fraction of low-z galaxies that have not been identified through the photometric analysis, because of the lack of sufficiently deep imaging in the blue. Hereafter, we call them contaminants. Interlopers and contaminants are the focus of our investigation.

Note that the separation between dropouts and contaminants for sources with a low ${\chi }_{\mathrm{opt}}^{2}$ is arbitrary to a certain extent; for example, Bouwens et al. (2015) impose a cut at ${\rm{S}}/{\rm{N}}\lt 2$, while the Brightest of Reionizing Galaxies survey (BoRG, Trenti et al. 2011) resorts to a more conservative threshold of ${\rm{S}}/{\rm{N}}\lt 1.5$ in the bluest bands. Obviously, more conservative cuts entail the exclusion of a higher number of real high-z sources from the selections, thus different investigators may decide to give priority either to sample purity or to selection completeness.

3. Results

In this section, we present and discuss our results obtained separately for the four fields we analyze (GSd, GNW, XDF, HUDF09-2). However, we only show plots for GSd, to avoid unnecessary repetitions.

3.1. Numbers and Redshift Distribution of Dropouts and Interlopers

The color–color selection of dropouts and interlopers is shown in Figure 1 for samples of ${V}_{606}$, ${i}_{775}$, ${z}_{850}$, and ${Y}_{105}$-dropout sources drawn from the GSd field, with discrimination between the two classes based on the S/N and optical ${\chi }^{2}$ (Equation (6)). We note that photometric scatter is likely to play a significant role in the selection of faint objects. Indeed, more than half of the 1σ error bars for the interlopers intersect at least one boundary of the color–color selection box. Therefore, to carry out a more comprehensive analysis, we enlarge the color–color selection box by 0.2 mag, to check for both candidate high-z LBGs and interlopers that slightly fail to meet the adopted selection criteria (see also Su et al. 2011 for an alternative approach based on assigning a probability that a source belongs to the color–color selection). In the following, we will consider the original selection the one given in Section 2 (dotted line in Figure 1), and the enlarged selection the one introduced in this section (dashed–dotted line in Figure 1).

Figure 1.

Figure 1. Color–color selection box used to identify ${V}_{606}$-(upper left), ${i}_{775}$-(upper right), ${z}_{850}$-(bottom left) and ${Y}_{105}$-dropouts (bottom right) over the GSd field. Red squares represent dropouts, i.e., high-z sources with no flux blueward of the Lyman-break; blue circles represent interlopers, i.e., high-z candidates showing a detection in the blue bands. Dashed lines represent the boundaries of the original sample selection; dashed–dotted lines represent the boundaries of the enlarged sample (see the text for details). Darker symbols refer to the original selection, lighter ones refer to the enlarged selection.

Standard image High-resolution image

The most striking feature of Figure 1 is the relative weight of interlopers versus dropouts, which is quantified in Table 2 for all the fields considered. We first focus on the GSd field. At lower redshift ($z\sim 5$), dropouts dominate the sample within the original selection, while the opposite situation is present at higher redshift ($z\sim 8$), when the interloper fraction is much higher. We stress that this percentage is not giving a level of contamination in our dropout sample, since the presence of sufficiently deep data at bluer wavelengths allows us to identify the interlopers. Interestingly, the situation remains qualitatively similar to our enlarged selection; though, as expected, the enlarged samples contain a larger fraction of interlopers. If we adopted a more conservative S/N in the sample selection (S/N < 1.5), percentages of dropouts would be systematically smaller, but comparable within 2σ uncertainty.

Table 2.  Statistics of Dropouts and Interlopers

  GSd GNw XDF HUDF09-2
Population Original Sample Enlarged Sample Original Sample Enlarged Sample Original Sample Enlarged Sample Original Sample Enlarged Sample
  Number $ \% $ Number $ \% $ Number $ \% $ Number $ \% $ Number $ \% $ Number $ \% $ Number $ \% $ Number $ \% $
${V}_{606}$-dropouts 648 90 ± 2 882 81 ± 2 392 87 ± 2 510 73 ± 2 132 93 ± 3 165 85 ± 4 102 94 ± 4 127 88 ± 4
${V}_{606}$-interlopers 72 10 ± 2 205 19 ± 2 58 13 ± 2 189 27 ± 2 10 7 ± 3 28 15 ± 4 6 6 ± 4 17 12 ± 4
${i}_{775}$-dropouts 172 62 ± 4 239 52 ± 3 72 65 ± 7 105 47 ± 4 69 77 ± 7 78 68 ± 6 28 60 ± 10 37 51 ± 8
${i}_{775}$-interlopers 106 38 ± 4 223 48 ± 3 39 35 ± 7 118 53 ± 4 20 23 ± 7 37 32 ± 6 16 40 ± 10 35 49 ± 8
${z}_{850}$-dropouts 33 17 ± 4 42 11 ± 2 29 26 ± 6 35 16 ± 4 31 40 ± 8 36 30 ± 6 17 30 ± 10 22 23 ± 6
${z}_{850}$-interlopers 157 83 ± 4 322 89 ± 2 82 74 ± 6 180 84 ± 4 47 60 ± 8 85 70 ± 6 37 70 ± 10 73 76 ± 6
${Y}_{105}$-dropouts 17 10 ± 3 27 8 ± 2 54 36 ± 6 62 26 ± 4 6 50 ± 20 10 40 ± 20 12 21 ± 8 15 17 ± 6
${Y}_{105}$-interlopers 150 90 ± 3 329 92 ± 2 96 64 ± 6 177 74 ± 4 6 50 ± 20 15 60 ± 20 45 79 ± 8 71 83 ± 6

Note. Errors are defined as binomial errors (Gehrels 1986).

Download table as:  ASCIITypeset image

The observed behavior is mainly due to the fact that the population of dropouts steady decreases in number for the higher redshift selections. This is primarily determined by the evolution of the luminosity function, which decreases significantly from $z\sim 5$ to $z\sim 8$ at all luminosities. In contrast, the number of interlopers in the sky remains approximately constant over a wide range of dropout selection windows. Second order effects in the evolution of the interloper population with the redshift of the Lyman-break selection are complex to model, and include intrinsic evolution of their luminosity functions, change in the distance modulus and in the comoving volume of the selection, with partial offsets among them (e.g., the decrease in sensitivity because of an increase in the distance modulus is offset by an increase of the comoving volume).

Similar findings are obtained when we analyze the other fields, even though the results from the different fields highlight the presence of sample (or "cosmic") variance, naturally expected because of galaxy clustering (see, e.g., Trenti & Stiavelli 2008). In addition, fields such as the XDF and HUDF09-2 have small areas, resulting in significant Poisson uncertainty. Finally, the difference in relative depths reached by the different surveys plays a role, which we discuss further in Section 3.4.1.

To further investigate the properties of the interlopers, and test whether these are indeed 4000 Å break sources, we resort to the photometric redshift catalogs from the 3D-HST survey (Skelton et al. 2014), which we matched to our sources based on coordinates and ${H}_{160}$ band magnitudes. The expectation is that given zdropouts as the redshift of the Lyman-break selection, the interlopers should be peaked at zinterlopers given by

Equation (7)

So, for example, for z = 5 selection, the interlopers are predicted to be found at $z\sim 0.7\mbox{--}1.0$ corresponding to the Balmer and 4000 Å breaks; for z = 8 selection, the interlopers are expected at $z\sim 1.6\mbox{--}1.9$. Taking into account uncertainties, this is broadly the case based on the photo-$z$ analysis, as shown in Figure 2 for the GSd field. For this field, after the match with the 3D-HST survey, we recover $\sim 85 \% $ of our sources. From this figure, and from Equation (7), it is clear that as zdropouts increases, $\langle {z}_{\mathrm{interlopers}}\rangle $ changes relatively little (${\rm{\Delta }}(z)\lt 1$). The error on the median values narrows as we go from lower to higher redshift, but this is mainly due to the larger sample statistics provided by the ${Y}_{105}$-interlopers with respect to the ${V}_{606}$-interlopers. The fact that not all interlopers of a given selection fall exactly in the expected redshift window, and the lower than expected median redshift of the Y-interlopers highlight the limitations of both our selection method and photo-z techniques. Indeed, some real dropouts might be misclassified due to photometric scatter and/or the photo-z estimates might not be reliable. Similar trends are also obtained for the other fields even though uncertainties are very large.

Figure 2.

Figure 2. Redshift distribution for the interlopers at the different dropout selections, as indicated in the labels, for the original sample over the GSd field. Photometric redshifts are drawn from the 3D-HST survey (Skelton et al. 2014). Median values along with their associated uncertainty (defined as $1.235\times \sigma /\sqrt{n}$, with n number of objects) are shown above the histograms as horizontal points with error bars.

Standard image High-resolution image

Thus, extrapolating the trend to even higher redshift samples of dropouts, such as those accessible by JWST observations, one expects that the number of interlopers in the color–color selection will remain relatively constant, while dropout numbers will decrease very rapidly for ${z}_{\mathrm{dropouts}}\gt 10$ based on theoretical modeling (see, e.g., Mason et al. 2015).

We note that, according to the Madau–Lilly plot (e.g., Madau & Dickinson 2014), the star-formation rate peaks at intermediate redshift ($z\sim 2$). This means that the number density of interlopers for ${z}_{\mathrm{interlopers}}\gtrsim 1.85$ (corresponding to ${z}_{\mathrm{dropouts}}\sim 9.5$ from Equation (7)) may likely slightly decrease with increasing redshift, though the decrease of the interloper density will still be less steep than that of the dropouts because the latter have significantly higher redshift.

3.2. Surface Densities of Dropouts and Interlopers

In the previous section, we have investigated the incidence of dropouts and interlopers at the different redshifts. We now aim at characterizing the distribution of luminosity for these populations, to study how they compare. Thus, we derive the surface density distributions of dropouts and interlopers by counting the number of objects in each bin of 0.5 mag and dividing it by the area of the survey, as shown in Figure 3 for the GSd field. For each population, the surface density is plotted as a function of ${i}_{775}$ for the $z\sim 5$ samples, ${z}_{850}$ for the $z\sim 6$ samples, ${Y}_{105}$ for the $z\sim 7$ samples, and ${J}_{125}$ for the $z\sim 8$ samples. These are the magnitudes in the band that best match the 1600 Å rest frame at that redshift for the dropouts, as was done in Bouwens et al. (2015). We note that for ${m}_{{AB}}\gtrsim 27$ (see the exact value for each magnitude as the dotted line in Figure 3) all of our samples suffer from incompleteness, which is the cause of the apparent decline in the number counts of faint objects.

Figure 3.

Figure 3. Surface density distribution of dropouts (red) and interlopers (blue) in the original (left) and enlarged (right) samples for the selections at the different redshifts, as indicated in the labels, for the GSd field. The black vertical line indicates the formal 5σ magnitude limit (Bouwens et al. 2015).

Standard image High-resolution image

Surface density distributions strongly depend on the redshift and on the population considered. At the lowest redshift, the surface density distribution of interlopers is relatively flat with magnitude for 20 ≤ ${i}_{775}$ ≤ 28 and there are about 0.1 interlopers per arcmin2 in bins of magnitude. In contrast, the distribution of dropouts rises very steeply. As expected, due to the well established exponential cut off of the luminosity function at the bright end (e.g., McLure et al. 2013; Schenker et al. 2013; Bowler et al. 2014; Oesch et al. 2014; Bouwens et al. 2015), there are essentially no dropouts brighter than ${i}_{775}$ ∼ 24. Overall, dropouts are much more numerous than interlopers. Similar conclusions are reached in both the original and enlarged samples.

Moving to higher redshift, the shape of the distribution of dropouts stays almost constant, just showing a modest steepening, but that of interlopers considerably changes. At $z\sim 6$, interlopers and dropouts have similar distributions, with the exception that interlopers extend toward brighter magnitudes. At $z\sim 7$ and $z\sim 8$ interlopers are more numerous than dropouts at all luminosities, and have a tail of objects at the bright end as well. Overall, the interloper distribution appears as steep as the dropout one. This holds both for the original and the enlarged samples. Similar results are also found in the other fields.

It is reasonable to expect that interlopers are a subpopulation of BzK color-selected galaxies (Daddi et al. 2004, 2007). This method is designed to find red, dusty or passively evolving older galaxies at $z\gt 1.5$. We can therefore compare our derived surface densities of interlopers to those of BzK selected samples. We use, as a reference, the data set presented by Conselice et al. (2011) for galaxies at $1.5\lt z\lt 3$ drawn from GOODS north and south fields and the GOODS NICMOS Survey. That study analyzes two of the same fields considered in our work and includes HST imaging, therefore reaching a deeper magnitude limit compared to the many studies of BzK samples conducted from the ground (e.g., Cirasuolo et al. 2007, Hartley et al. 2008). Conselice et al. (2011) quote the ${H}_{160}$-magnitude distribution of all galaxies at $1.5\lt z\lt 3$, without splitting them into redshift bins, so a direct comparison to our results is not possible because our interloper samples are more localized in redshift (see Figure 2). Still, to have a first-order approximation, we treat the Conselice et al. (2011) sample as uniform in redshift, and thus we simply rescale the observed number counts to take into consideration the difference in volume with our selections.

Figure 4 compares the Conselice et al. (2011) scaled distribution to the ${H}_{160}$-band number counts for our interlopers samples in the GSd field. At each magnitude and in each redshift bin, the BzK population is up to a factor of 10 larger than that of interlopers. This is consistent with the assumption that not all galaxies at $z\sim 1.5\mbox{--}2$ are interlopers of high-redshift selections, but only the subset with a particular combination of colors. Interestingly, we observe that the interloper counts get steeper at faint luminosities with increasing redshift compared to the general BzK sample. This might suggest that interlopers evolve differently compared to the general population, but investigating this trend in more detail is beyond the scope of this work.

Figure 4.

Figure 4. H160 surface density distribution of interlopers (blue) in the original sample for the selections at the different redshifts, as indicated in the labels, for the GSd field. Samples of BzK selected samples, drawn from Conselice et al. (2011), are shown from comparison. The black vertical line indicates the formal 5σ magnitude limit (Bouwens et al. 2015).

Standard image High-resolution image

3.3. Distribution of Optical ${\chi }^{2}$ for Interlopers

One of the aims of our analysis is to derive an estimate of the contamination in dropout samples. Before proceeding, we need to characterize the distribution of the optical ${\chi }^{2}$ values for interlopers as a function of their S/N in the detection bands. It is evident that the robustness of the detection is a key quantity to distinguish between interlopers and dropouts. In fact, while less deep observations in the redder bands give smaller ${\rm{S}}/{\rm{N}}({{JH}}_{\det })$ and simply exclude galaxies both from the dropout and interloper populations, less deep observations in the bluer bands produce lower ${\chi }_{\mathrm{opt}}^{2}$ and may induce a misclassification, moving sources from the interloper to the dropout population.

Figure 5 plots the ${\rm{S}}/{\rm{N}}({{JH}}_{\det })$ to the ${\chi }_{\mathrm{opt}}^{2}$ for both dropouts and interlopers from the $z\sim 8$ selection (${Y}_{105}$ Lyman-break selection), for the GSd field. As expected, the interlopers have a positive correlation between the two quantities, reflecting the finite amplitude of the 4000 Å break. Similar results also hold for samples from selections at lower $z$ and drawn from the other fields.

Figure 5.

Figure 5. Comparison between the ${\rm{S}}/{\rm{N}}({{JH}}_{\det })$ and the optical ${\chi }^{2}$ for ${Y}_{105}$-dropouts for the GSd field. Colors and symbols are as in Figure 1. The dotted vertical line indicates the separation between interlopers and dropouts adopted in this work.

Standard image High-resolution image

3.4. Contamination in Dropout Samples

Now that we characterized the properties of interlopers, we can use them to investigate dropout sample contamination induced by interlopers that are misclassified as dropouts in absence of sufficiently deep data at bluer wavelengths. The main causes of contamination are the impact of noise in the measurement of the optical ${\chi }^{2}$ and photometric scatter in the color–color selection.

To estimate the impact of noise in the measurements on the data sets we analyzed, we perform a resampling Monte-Carlo (MC) simulation on the entire photometric catalogs and add zero-mean noise in the fluxes sampling from a Gaussian distribution with width determined by the S/N of the simulated broadband fluxes. We then apply the dropout selection criteria given in Section 2 and quantify the number of interlopers and dropouts in the simulated sample. We repeat the procedure 500 times to collect statistics and we find that, on average, increasing the noise we obtain systematically larger fractions of interlopers at any redshift than those obtained with the original catalogs (Table 2). The average statistics are given in Table 3 for each field separately. This test emphasizes the need of precise photometry to robustly distinguish between dropouts and interlopers.

Table 3.  Statistics of Dropouts and Interlopers after the MCMC Experiment that Introduces Spurious Noise on the Observations

  GSd GNw XDF HUDF09-2
Population Original Sample Enlarged Sample Original Sample Enlarged Sample Original Sample Enlarged Sample Original Sample Enlarged Sample
  % $ \% $ $ \% $ $ \% $ $ \% $ $ \% $ $ \% $ $ \% $
${V}_{606}$-dropouts 72 ± 2 69 ± 2 71 ± 3 66 ± 3 75 ± 6 72 ± 5 81 ± 6 78 ± 6
${V}_{606}$-interlopers 28 ± 2 31 ± 2 29 ± 3 34 ± 3 25 ± 6 28 ± 5 19 ± 6 22 ± 6
${i}_{775}$-dropouts 32 ± 3 30 ± 3 34 ± 6 30 ± 4 52 ± 8 47 ± 7 33 ± 9 29 ± 7
${i}_{775}$-interlopers 68 ± 3 70 ± 3 66 ± 6 60 ± 4 48 ± 8 53 ± 7 67 ± 9 71 ± 7
${z}_{850}$-dropouts 48 ± 2 4 ± 1 9 ± 3 7 ± 2 9 ± 5 8 ± 4 8 ± 5 7 ± 4
${z}_{850}$-interlopers 52 ± 2 96 ± 1 91 ± 3 93 ± 2 91 ± 5 92 ± 4 92 ± 5 93 ± 4
${Y}_{105}$-dropouts 2 ± 1 2.2 ± 0.1 4 ± 2 3 ± 1 4 ± 7 5 ± 5 5 ± 4 4 ± 3
${Y}_{105}$-interlopers 98 ± 1 97.8 ± 0.1 96 ± 2 97 ± 1 96 ± 7 95 ± 5 95 ± 4 96 ± 3

Note. Errors are defined as binomial errors (Gehrels 1986).

Download table as:  ASCIITypeset image

We note that if instead of using the entire catalogs as a starting point of the MC experiment we used only a combination of the dropout and interloper enlarged samples, we would get results in agreement within the errors, indicating that actually only the sources close to the boundaries of our selection boxes can contaminate the samples.

As the next step, we also consider the photometric scatter and perform a more sophisticated resampling MC simulation on the photometric catalogs. Specifically, for each dropout selection, we uniformly sample with repetition the luminosity in the detection band from the catalog of enlarged interlopers, extracting a simulated catalog with the same size as the original one. Next, we assign to each of these objects the broadband colors of a random galaxy from the same catalog (again using uniform sampling probability with repetition), and we add zero-mean noise in the flux sampling from a Gaussian distribution with width determined by the S/N of the simulated broadband fluxes. We use as our starting point a catalog that includes all the ${Y}_{105}$-interlopers detected in our four fields (enlarged samples), in order to consider a population that is relatively homogenous, but statistically significant. Note that for this second test it would not be appropriate to resort to the photometric catalogs of all sources since the MC procedure effectively "re-shuffles" colors of galaxies, thus a relatively uniform starting sample is needed. Finally, we perform the photometric analysis of the catalog to quantify the number of interlopers in the enlarged sample that are classified as dropouts. After repeating the procedure 500 times to collect statistics, for example, for the GSd field we find that, on average,

  • 1.  
    the $z\sim 5$ selection has 17 ± 4 interlopers entering the ${V}_{606}$-dropout sample as contaminants, for an estimated contamination rate of ${f}_{c}\sim 17/648\sim 2.6 \% ;$
  • 2.  
    the $z\sim 6$ selection has 7 ± 2 interlopers entering the ${i}_{775}$-dropout sample as contaminants, for an estimated contamination rate of ${f}_{c}\sim 7/172\sim 4.0 \% ;$
  • 3.  
    the $z\sim 7$ selection has 2 ± 1 interlopers entering the ${z}_{850}$-dropout sample as contaminants, for an estimated contamination rate of ${f}_{c}\sim 2/33\sim 6.0 \% ;$ and
  • 4.  
    the $z\sim 8$ selection has 1 ± 1 interlopers entering the ${Y}_{105}$-dropout sample as contaminants, for an estimated contamination rate of ${f}_{c}\sim 1/17\sim 5.9 \% $.

Overall, results from the different fields are in agreement, indicating that the contamination is always only a few percent in all samples, and it increases with increasing redshift. These results are also in broad agreement with other literature estimates, as will be discussed in Section 4.

These results are clearly illustrating that while the number of misclassified interlopers remains relatively constant across different samples, as the redshift increases, the relative weight compared to the number of dropouts grows significantly. Interestingly, these estimates are consistent with the predictions from the contamination model based on source simulations from an extensive SED library encompassing a wide range of star-formation histories (SFHs), metallicities, and dust content and a combination of an old and a young population (Oesch et al. 2007), and used for the BoRG survey sample purity analysis (e.g., see Trenti et al. 2011; Bradley et al. 2012; Calvi et al. 2016). Applying the color cuts adopted in the current work, the model predicts a contamination of 0.7% at $z\sim 5$, 1.6% at $z\sim 6$, 3.5% at $z\sim 7$, and 7.3% at $z\sim 8$.

3.4.1. Contamination at $z\sim 8$

We now focus only on the highest redshift selection ($z\sim 8$), since it contains the largest number of interlopers, and investigate the level of contamination in the different surveys in greater detail.

Using a similar approach to that presented in the previous section, we can use the multi-band photometric catalog of all the interlopers identified in the different fields (enlarged sample), combined with an extrapolation of the surface number densities of interlopers, in order to estimate the contamination in different surveys, assuming that the SEDs of the interlopers are representative of the general population. We simulate a series of surveys with relative depths in the different bands similar to those used in our analysis, but different values of 5σ depth.

For the simulation, we compute the brightness distribution of all the ${Y}_{105}$-interlopers in the enlarged sample (Figure 6). For a basic characterization of the luminosity distribution, we fit the populations using a power law through a Markov Chain MC method. The best power-law fit of the sample is $\mathrm{log}(N/{\mathrm{arcmin}}^{2}/{\rm{\Delta }}(\mathrm{mag}))=(0.35\pm 0.1)$ × ${J}_{125}$ + $(-9.0\pm 0.4)$. As shown in Figure 6, while trends for the GSd and HUDF09-2 are compatible within the errors, the GNw field seems to have a systematically larger number of interlopers, while the XDF has a systematically lower number. This plot confirms that there is significant cosmic variance across fields. Quite interestingly, GNw not only has an excess of interlopers, but there is also an excess of genuine high-redshift candidates reported by many studies (Finkelstein et al. 2013; Oesch et al. 2014; Bouwens et al. 2015) and across a range of redshifts. Further medium/deep lines of sight beyond those available in the HST archive would be very interesting to use to investigate this correlation in greater detail.

Figure 6.

Figure 6. Surface density distribution of ${Y}_{105}$-interlopers in the enlarged samples for all the fields considered in this study, as indicated in the labels. Each sample is plotted down to its formal 5σ magnitude limit (from Bouwens et al. 2015). The best power-law fit of the sample is $\mathrm{log}(N/{\mathrm{arcmin}}^{2}/{\rm{\Delta }}(\mathrm{mag}))\,=(0.35\pm 0.1)$ × ${J}_{125}$ + $(-9.0\pm 0.4)$.

Standard image High-resolution image

We then assume that all interlopers in the sky follow a power-law fit to the number counts distribution, extrapolated in the magnitude range of ${J}_{125}$ = 22-31, and we sample from this distribution a catalog of object luminosities. Next, we proceed to estimate the contamination in each field separately. We assign to each simulated object the broadband colors of a random galaxy from one of our interloper sample (GSd, GSw, XDF, HUDF-092), and add to the signal in each band a Gaussian noise drawn from a distribution with dispersion associated with the S/N that would be achieved in the simulated survey (following the depths reported in Table 1). Finally, we apply the dropout selection criteria given in Section 2, and quantify the number of simulated interlopers that satisfy the dropout selection. This gives us our best estimate of the surface number counts of contaminants in each simulated survey.

For the GSd simulation, based on the extrapolated number density of interlopers for 22 ≤ ${J}_{125}$ ≤ 31 and an assumed area of 64.5 arcmin2, we extract 4390 simulated galaxies per realization, and we repeated the simulation 500 times. On average, we find that a realization has 4 ± 2 of these simulated interlopers appearing as dropouts, 120 ± 10 are correctly classified as interlopers, while the remaining either turn out to have ${\rm{S}}/{\rm{N}}({{JH}}_{\det })\lt 6$, or colors outside the selection box, and do not enter in the interloper or dropout sample. To illustrate the MC experiment, results are shown in Figure 7 for one of the 500 realizations. This implies that the probability of misclassifying a specific (enlarged sample) interloper as dropout is very small $p\sim 0.001$.

Figure 7.

Figure 7. Results of one realization of our Monte-Carlo extraction aimed at testing the reliability of the interloper-dropout selection in deeper surveys (${J}_{125}$ $\leqslant $ 30) for an XDF-like survey (see the text for details). Left: comparison between the ${\rm{S}}/{\rm{N}}({{JH}}_{\det })$ and the optical ${\chi }^{2}$. Dotted lines represent the detection thresholds. Right: color–color selection box used to identify ${Y}_{105}$-dropouts. Blue squares represent interlopers, red squares represent dropouts and black dots represent galaxies that no longer enter the selection after the dimming procedure. Dotted and dashed–dotted lines are the same as in Figure 1.

Standard image High-resolution image

Figure 8 shows the ${J}_{125}$ magnitude distribution for observed dropouts in the GSd, selected with the criteria given in Section 2. Overplotted is also the average magnitude distribution of the simulated contaminants (i.e., interlopers appearing as dropouts after the MC experiment). It can be clearly seen that the fraction of contamination increases at fainter magnitudes, consistent with the explanation that photometric scatter is the main cause of contamination.

Figure 8.

Figure 8.  ${J}_{125}$ magnitude distribution for the sample of ${Y}_{105}$-dropouts in the GSd as derived from the cuts in Section 2 (blue) and for the average contamination from dropouts as obtained from our Monte-Carlo simulation (red; see the text for details).

Standard image High-resolution image

Repeating the MC experiment for the other fields, we found that for the GNw/XDF/HUDF09-2 simulation, on average, a realization has 1 ± 1/1 ± 1/1 ± 1 of the simulated interlopers appearing as dropouts, and 53 ± 7/45 ± 6/25 ± 5 that are correctly classified as interlopers. This implies that the probability of misclassifying a specific (enlarged sample) interloper as dropout is very small in all of the fields ($p\sim 0.0002/0.003/0.003$). The GNw field is the one with the lowest contamination. As shown in Table 1, this field has the deepest relative depth blueward of the dropout band compared to the detection band, and clearly this allows more efficient identification of interlopers, minimizing contamination of the dropout sample. In fact, even though the other fields have deeper photometry, their photometric limits in the blue bands are relatively shallower compared to the limits of the detection (red) bands, inducing a higher probability of misclassification of interlopers as dropouts (and therefore higher contamination).

Overall, our analysis also suggests that, given a survey, the higher the S/N in the detection, the higher the likelihood that the dropout is an LBG. Thus we are fully consistent with the high sample purity inferred from spectroscopic follow-up studies of LBG samples in ultradeep surveys (e.g., Malhotra et al. 2005) since spectroscopic investigations are limited to the brighter galaxies (e.g., $m\lesssim 27.5$).

To explore how contamination changes as a function of the limiting depth of the survey, we repeat the MC experiments varying the ${J}_{125}$-band magnitude limit and scaling the limits in all other bands, keeping the relative depths constant. The results are shown in Figure 9, which summarizes the level of contamination per arcmin2 versus limiting magnitude. As expected, the contamination increases toward fainter magnitudes, because there is a higher number of potential contaminants, and strongly depends on the relative limiting depths of the different bands. Overall, the level of contamination ranges between ∼0 and ∼0.4 contaminants per arcmin2 in the magnitude range of ${J}_{125}$ = 26.5–30. As already mentioned, the relative depth of the GNw seems to do the best job of discriminating interlopers and dropouts; therefore, the contamination is the smallest.

Figure 9.

Figure 9. Estimated level of contamination in ${Y}_{105}$-dropout surveys due to misidentified interlopers depending on survey magnitude limits. Different colors refer to different observational choices (relative depths and filter types) used as reference, as indicated in the label. Errors are the errors on the mean of the 500 realizations.

Standard image High-resolution image

Our conclusion of the presence of a significant level of contamination near the detection limit of a survey because of significant photometric scatter is indirectly supported by a cross-matching analysis of the catalogs for ${i}_{775}$- and ${Y}_{105}$-dropouts in the XDF/GOODS south published by McLure et al. (2013) and Bouwens et al. (2015), which shows that less than 50% of the sources appear in both catalogs within one magnitude of the survey detection limit, even though the derived luminosity functions are similar (see Barone-Nugent et al. 2014).

3.5. Properties of the ${Y}_{105}$-contaminants

To investigate what the properties of the objects are that can migrate from the interloper to the dropout sample when their photometry is rescaled to fainter fluxes (and therefore lower S/N), we report in Figure 10 some examples of interlopers in the enlarged sample that after the MC dimming experiment appear as dropouts at $z\sim 8$. As it is clear from the SEDs, these objects are bright in the band of detection, but their 4000 Å is sufficiently deep that the faint flux at bluer wavelengths is not detected after the typical dimming of $\sim 3\mbox{--}4$ mag that our MC experiment assigns to simulated objects near the XDF detection limit. The figure also highlights the key assumption (and potential limitation) of our approach, that is the use of SEDs observed in brighter galaxies for modeling the colors of fainter sources.

Figure 10.

Figure 10. Examples of SEDs and images in the band of detection for ${Y}_{105}$-interlopers that appear as dropouts at $z\sim 8$ (right) after the MC experiment. SED fitting has been obtained by the photometry with FAST (Kriek et al. 2009).

Standard image High-resolution image

To further characterize the interlopers and especially those that after the dimming appear as dropouts, we derive their stellar population properties by fitting the observed SEDs from the F435W to the F160W or to the Spitzer-IRAC 8 μm photometry,9 depending on availability, using FAST (Kriek et al. 2009). We adopt Bruzual & Charlot (2003) models assuming exponentially declining SFHs of the form $\mathrm{SFR}\propto \exp -t/\tau $, where SFR is the star-formation rate, t is the time since the onset of star formation, and τ sets the timescale of the decline in the SFR, solar metallicity, a Calzetti et al. (2000) dust law, and a Chabrier (2003) Inital Mass Function. We allow $\mathrm{log}(\tau /\mathrm{Gyr})$ to range between 7.0 and 10.0 Gyr, $\mathrm{log}(t/\mathrm{Gyr})$ between 7.0 and 10.1 Gyr, and AV between 0 and 4 mag. When possible, we also use photometric redshifts from the 3D-HST survey (Skelton et al. 2014), to further constrain the fits.

Overall, across the different field, 273 ${Y}_{105}$-interlopers appear as contaminants in at least one out of the 500 MC realizations. We expect this sample to be representative of the entire contaminant population.

A summary of the typical properties of the interlopers and of those that might contaminate the dropout samples at $z\sim 8$ is given in Table 4. The distributions of some properties are also presented in Figure 11. Interestingly, both interlopers and contaminants have intermediate ages, low levels of ongoing star formation, and only moderate dust content. Both median values and Kolmogorov–Smirnov tests support the similarity of the distributions. As expected, given the fact that our contaminants are drawn from the enlarged sample, which by construction includes objects up to 0.2 mag bluer than interlopers, interlopers have a noticeably redder ${Y}_{105}$${J}_{125}$ color than contaminants. These estimates are consistent with the typical values of dust content and ages obtained from the contamination model based on source simulations from the extensive SED library used in Section 4.1.3. This result suggests that it is reasonable to expect that such properties can scale from the intermediate mass objects used as templates to the lower mass and fainter sources that would be contaminants in actual data sets.

Figure 11.

Figure 11. Distribution of redshifts, stellar masses, star-formation rates, dust extinctions, and colors for Y-interlopers in the original sample ("I," black lines), Y-interlopers in the enlarged sample ("eI," green lines), and for interlopers appearing as contaminants after the MC experiment ("C," red lines). Stellar population properties have been estimated from the photometry with FAST (Kriek et al. 2009). Values indicated are the probabilities that two distributions are drawn from the same parent distribution, according to the Kolmogorov–Smirnov test.

Standard image High-resolution image

Table 4.  Stellar Population Properties of All ${Y}_{105}$-interlopers and of Those Appearing as Dropouts after the MC Experiment

Property ${Y}_{105}$-interlopers ${Y}_{105}$-contaminants
z 1.51 ± 0.07 1.51 ± 0.07
(${Y}_{105}$${J}_{125}$)AB 1.33 ± 0.03 1.21 ± 0.04
(${J}_{125}$${H}_{160}$)AB −0.03 ± 0.04 −0.02 ± 0.04
$\mathrm{log}({M}_{* }/{M}_{\odot })$ 8.07 ± 0.08 8.06 ± 0.06
$\mathrm{log}(\mathrm{SFR}/({M}_{\odot }\,{\mathrm{yr}}^{-1})$ −1.0 ± 1 −1.5 ± 0.9
$\mathrm{log}(\mathrm{SSFR}/{\mathrm{yr}}^{-1})$ −9.5 ± 0.9 −9.5 ± 0.8
AV 0.00 ± 0.03 0.00 ± 0.03
$\mathrm{log}(\tau /G{\mathrm{yr}}^{-1})$ 8.0 ± 0.1 8.0 ± 0.1
$\mathrm{log}(t/G{\mathrm{yr}}^{-1})$ 8.60 ± 0.04 8.60 ± 0.04

Note. Median values along with errors are listed.

Download table as:  ASCIITypeset image

4. Contamination Estimates in the Literature

In the literature, there have been various studies that tried to give an estimate of the contamination in the dropout sample, with the intent to correct the estimates of the luminosity functions, but not to characterize the properties of the contaminants. Each of these studies has used a different definition for the dropout/interloper sample and evaluated the contamination in a different way, so a direct comparison among the different findings is not always possible and has to be considered carefully.

Here, we present a summary of some important literature results and then we will redo our analysis using the same selection criteria adopted by Bouwens et al. (2015), with the aim of directly comparing our findings with theirs.

Bouwens et al. (2015) have estimated the impact of scattering onto color selection windows owing to the impact of noise by repeatedly adding noise to the imaging data from the deepest fields, creating catalogs, and then attempting to reselect sources from these fields in exactly the same manner as the real observations. Sources that were found with the same selection criteria as the real searches in the degraded data but that show detections blueward of the break in the original observations were classified as contaminants. They estimated the likely contamination by using brighter, higher-S/N sources in the XDF to model contamination in fainter sources. They estimated contamination rates of 2 ± 1%, 3 ± 1%, 6 ± 2%, 10 ± 3%, and 8 ± 2% at $z\sim 4$, $z\sim 5$, $z\sim 6$, $z\sim 7$, and $z\sim 8$, respectively.

These results are in agreement with ours (see also Section 4.1.1) and with those found in other recent selections of sources in the high-redshift universe (e.g., Giavalisco et al. 2004; Bouwens et al. 2006, 2007, 2012b; Wilkins et al. 2011; Schenker et al. 2013).

Finkelstein et al. (2015) found instead larger values of contamination. They estimated the contamination by artificially dimming lower-redshift sources in their catalog to see if the increased photometric scatter allows them to be selected as high-redshift candidates. For sources with 25 < ${H}_{160}$ < 27, they estimated a contamination fraction of 4.5%, 8.1%, 11.4%, 11.1%, and 16.0% at $z\sim 4$, $z\sim 5$, $z\sim 6$, $z\sim 7$, and $z\sim 8$, respectively. For fainter sources with 26 < ${H}_{160}$ < 29, the contamination fraction increased to 9.1%, 11.6%, 6.2%, 14.7%, and <4.9% at $z\sim 4$, $z\sim 5$, $z\sim 6$, $z\sim 7$, and $z\sim 8$, respectively. These fractions are in line with the estimates from the stacked probability distribution curves (e.g., Malhotra et al. 2005).

By studying the space density of the potentially contaminating sources, Casey et al. (2014) found that dusty star-forming galaxies at $z\lt 5$ might contaminate $z\gt 5$ galaxy samples at a rate of $\lt 1 \% $. Such fraction might increase when photometric scatter is applied to faint, red galaxies, making it easier for them to scatter into high-z samples (Finkelstein et al. 2015).

To minimize the probability of contamination by low-redshift interlopers, the BoRG strategy was to impose a conservative non-detection threshold of 1.5σ on the optical-band data (Trenti et al. 2011; Bradley et al. 2012). In order to estimate the residual contamination, from the Bouwens et al. (2010) data reduction they first identified F098M dropouts with F125W < 27 considering a version of the GOODS F606W image degraded to a 5σ limit F606W = 27.2 to match the relative F125W versus F606W BoRG depth. They then checked for contaminants by rejecting F098M dropouts with ${\rm{S}}/{\rm{N}}\gt 2$ in either B, V, or i (at their full depth). They estimated approximately 30% contamination, which is much higher to what we found, but in good agreement with the estimate based on the application of the color selection to libraries of SED models (Oesch et al. 2007).

Note that the key difference between BoRG and other surveys is that BoRG only has one blue band, making the identification of contaminants more difficult.

4.1. The Bouwens et al. (2015) Cuts

4.1.1. Sample Selection

We now repeat our analysis adopting the cuts proposed by Bouwens et al. (2015) in order to test how a different sample selection may alter our conclusions. For the sake of brevity, we report our analysis performed only on the CANDELS/GSd imaging (Grogin et al. 2011). The parent catalog is the one presented in Section 2. We apply the same cut in ${\rm{S}}/{\rm{N}}({{JH}}_{\det })$ and stellarity index described in Section 2.

We then apply the following color selection criteria for samples of LBGs in the redshift range of $z\sim 5\mbox{--}8$, based on Bouwens et al. (2015).

For $z\sim 5$ candidates

Equation (8)

For $z\sim 6$ candidates

Equation (9)

For $z\sim 7$ candidates

Equation (10)

For $z\sim 8$ candidates

Equation (11)

To distinguish between interlopers and dropouts, we use the following cuts in S/N. According to Bouwens et al. (2015), ${V}_{606}$-dropouts are selected as sources with S/N(${B}_{435}$) < 2, ${i}_{775}$-dropouts with S/N(${B}_{435}$$\lt \,2$ and either (${V}_{606}$${z}_{850}$) > 2.7 or S/N(${V}_{606}$$\lt \,2$, ${z}_{850}$- and ${Y}_{105}$ dropouts with S/N(x$\lt \,2$ and ${\chi }_{x}^{2}\lt 3$, where x is intended to be ${B}_{435}$, ${V}_{606}$, and ${i}_{775}$ bands for ${z}_{850}$-dropouts and ${B}_{435}$, ${V}_{606}$, ${i}_{775}$, and ${I}_{814}$ bands for ${Y}_{105}$-dropouts (Equation (6)). In addition, ${z}_{850}$-dropouts are also selected as sources with either (${I}_{814}$${J}_{125}$$\gt \,1.0$ or S/N(${I}_{814}$) < 1.5.

As before, if a dropout satisfies more than one dropout selection, we assign it to the highest redshift sample. This additional cut removes ∼30 sources from the ${z}_{850}$-selection, while in the other cases at most a few sources are removed. In contrast, we do not apply this restriction to interlopers, which thus may enter multiple selections. However, only very few galaxies enter simultaneously more than one selection; therefore, results are not driven by this subpopulation of duplicates.

4.1.2. Results

The color–color selection of dropouts and interlopers adopting the Bouwens et al. (2015) selection is shown in Figure 12 for samples of ${V}_{606}$-, ${i}_{775}$-, ${z}_{850}$-, and ${Y}_{105}$-dropout and interloper sources. As done in Section 3.1, we define an original and an enlarged selection, by simply enlarging the color–color selection box by 0.2 mag, to check for both candidate high-z LBGs and interlopers that slightly fail to meet the usual selection criteria.

Figure 12.

Figure 12. Color–color selection box used to identify ${V}_{606}$-(upper left), ${i}_{775}$-(upper right), ${z}_{850}$-(bottom left) and ${Y}_{105}$-dropouts (bottom right), following the color selection presented in Bouwens et al. (2015). Red squares represent dropouts, i.e., high-z sources with no flux blueward of the Lyman-break; blue circles represent interlopers, i.e., high-z candidates showing a detection in the blue bands. Dashed lines represent the boundaries of the original sample selection, following Bouwens et al. (2015); dashed–dotted lines represent the boundaries of the enlarged sample (see the text for details). Darker symbols refer to the original selection, lighter ones refer to the enlarged selection.

Standard image High-resolution image

Given the fact that the Bouwens et al. (2015) criteria on the Lyman-break color are less strict than those presented in Section 1, many more galaxies enter both the dropout and the interloper samples, at any redshift. Table 5 presents a summary of the incidence of each population. Comparing the fractions to those presented for the same field in Table 2, we find that the fraction of ${V}_{606}$-dropouts is the same (the changes to the selection are really minor), while at higher redshifts the dropout fraction is considerably smaller. This indicates that a more strict selection criteria does reduce the number of interlopers, even though it simultaneously reduces the sample of dropouts. Therefore, each selection should be a good compromise between purity and completeness.

Table 5.  Statistics of Dropouts and Interlopers

Population Original Sample Enlarged Sample
  Number % Number %
${V}_{606}$-dropouts 446 93 ± 2 601 90 ± 2
${V}_{606}$-interlopers 33 7 ± 2 69 10 ± 2
${i}_{775}$-dropouts 167 62 ± 4 225 53 ± 4
${i}_{775}$-interlopers 102 48 ± 4 215 47 ± 4
${z}_{850}$-dropouts 53 11 ± 2 74 7 ± 1
${z}_{850}$-interlopers 443 89 ± 2 999 93 ± 1
${Y}_{105}$-dropouts 45 4.1 ± 0.9 61 2.5 ± 0.5
${Y}_{105}$-interlopers 1054 95.9 ± 0.9 2418 97.5 ± 0.5

Note. Errors are defined as binomial errors (Gehrels 1986).

Download table as:  ASCIITypeset image

Similarly to what we did in the previous section, we derive the surface density distributions of dropouts and interlopers in our Bouwens et al. (2015) like selection. Results are qualitatively similar to those found for a constant color cut, with the shape of the distribution of dropouts staying almost constant with increasing redshift, while that of interlopers is considerably steepening.

Inspecting a ${\rm{S}}/{\rm{N}}({{JH}}_{\det })$ versus ${\chi }_{\mathrm{opt}}^{2}$ diagram for the Bouwens et al. (2015) sample selection, it emerges that the enlarged sample appears to have objects distributed along two different sequences. To further explore this population, in Figure 13, we focus on interlopers only and add the information on their near-IR colors. It appears evident that most of the objects in the second sequence are characterized by intermediate colors in ${Y}_{105}$${J}_{125}$ and red colors in ${J}_{125}$${H}_{160}$ (0.5 < ${J}_{125}$${H}_{160}$ < 0.7). This demonstrates the utility of excluding candidates that are too red (e.g., Giavalisco et al. 2004; Bouwens et al. 2007, 2015). Similar results also hold for samples from selections at lower z.

Figure 13.

Figure 13. Comparison between the ${\rm{S}}/{\rm{N}}({{JH}}_{\det })$ and the optical ${\chi }^{2}$ for interlopers in the enlarged sample at $z\sim 8$. In the right panel, the ${Y}_{105}$${J}_{125}$ color is color coded, while in the left panel the ${J}_{125}$${H}_{160}$ color is color coded.

Standard image High-resolution image

4.1.3. Contamination in Dropout Samples

Mimicking the analysis in the previous section, we investigate the level of contamination in the Bouwens et al. (2015) dropout sample induced by interlopers that are misclassified as dropouts in absence of sufficiently deep data at bluer wavelengths.

First, we estimate the impact of noise on the measurement of the optical ${\chi }^{2}$ and photometric scatter in the color–color selection performing a resampling MC simulation on the photometric catalogs. As before, for each dropout selection, we uniformly sample with repetition the luminosity in the detection band from the catalog of enlarged interlopers, extracting a simulated catalog with the same size as the original one. Next, we assign to each of these objects the broadband colors of a random galaxy from the original parent catalog (again using uniform sampling probability with repetition), and we add zero-mean noise in the fluxes sampling from a Gaussian distribution with width determined by the S/N of the simulated broadband fluxes. Finally, we perform the photometric analysis of the catalog to quantify the number of interlopers in the enlarged sample that are classified as dropouts. After repeating the procedure 500 times to collect statistics, we find that, on average,

  • 1.  
    the $z\sim 5$ selection has 7 ± 1 interlopers entering the ${V}_{606}$-dropout sample as contaminant, for an estimated contamination rate of ${f}_{c}\sim 7/446\sim 1.5 \% ;$
  • 2.  
    the $z\sim 6$ selection has 4 ± 1 interlopers entering the ${i}_{775}$-dropout sample as contaminant, for an estimated contamination rate of ${f}_{c}\sim 4/167\sim 2.5 \% ;$
  • 3.  
    the $z\sim 7$ selection has 5 ± 1 interlopers entering the ${z}_{850}$-dropout sample as contaminant, for an estimated contamination rate of ${f}_{c}\sim 5/53\sim 9.4 \% ;$ and
  • 4.  
    the $z\sim 8$ selection has 7 ± 1 interlopers entering the ${Y}_{105}$-dropout sample as contaminant, for an estimated contamination rate of ${f}_{c}\sim 7/45\sim 15.3 \% $.

These results are clearly illustrating that while the number of misclassified interlopers remains relatively constant across different samples, as the redshift increases, the relative weight compared to the number of dropouts grows significantly. These estimates are systematically larger than those presented in the previous section, indicating how in the selection cuts proposed Bouwens et al. (2015) many more interlopers might be incorrectly classified as dropouts. Nonetheless, as the sample presented in the previous section, these estimates are consistent with the predictions from the contamination model based on source simulations from an extensive SED library (Oesch et al. 2007). The model predicts a contamination of 3.1% at $z\sim 5$, 0.5% at $z\sim 6$, 6.3% at $z\sim 7$, and 12.4% at $z\sim 8$. It forecasts a higher contamination at $z\sim 5$ compared to $z\sim 6$ that our MC experiment does not capture. This is likely due to the fact that, being based on the observed data, the MC at $z\sim 5$ is able to identify contaminants only when the objects show a detection in the single band (${B}_{435}$) blueward of the break, unlike the model based on a template library.

Therefore, it appears evident that the choice of the color cuts noticeably alters the fraction of dropouts and interlopers and the estimates of contamination. The Bouwens et al. (2015) selection criteria ensure a larger number of dropouts at all redshifts, but unavoidably also a larger number of interlopers. As a consequence, the estimated contamination is also considerably higher.

5. Summary and Conclusions

In this paper, we investigated the contamination of photometrically selected samples of high-redshift galaxies. Our focus has been on the widely adopted Lyman-break technique, using high-quality multi-band imaging from the Hubble CANDELS surveys (GOODS deep south and GOODS wide north), the XDF and the HUDF09-2. In our analysis, we distinguished between dropouts, that is, sources that formally satisfy all the selection criteria of LBGs, and interlopers, that is, sources with similar colors redwards of the spectral break, but showing a detection at bluer wavelengths. Because of finite photometric precision, when no sufficiently deep data at bluer wavelengths are available, a (small) fraction of interlopers can be misclassified as dropouts, and contaminate the selection. Hence we indicated these objects as contaminants.

The class of interlopers/contaminants that we studied is that of intermediate redshift galaxies with a prominent 4000 Å/Balmer break, which are the most common among interlopers based on redshift estimates from the 3D-HST survey (see Figure 2).

Our key results are the following.

  • 1.  
    Adopting a constant cut on the strength of the Lyman break across different redshifts, the number counts of interlopers shows an increase in number with z of at most a factor of two. In contrast, in selections where the cut on the strength of the Lyman break varies with redshift (e.g., Bouwens et al. 2015), the number counts of interlopers increase significantly from $z\sim 5$ to $z\sim 8$. This suggests that cleaner samples of dropouts can be achieved by requesting a clear spectral break, which reduces the number of interlopers more significantly than the number of dropouts.
  • 2.  
    The surface density of interlopers in the sky remains approximately constant over the range of dropout selection windows considered in this study (that is dropouts from $z\sim 5$ to $z\sim 8$), for a given depth of the survey and for a uniform cut in the color containing the Lyman break. This is because the population of interlopers resides at lower redshift and its average redshift evolves more slowly (by a factor of ∼0.3) compared to that of the dropouts (see Equation (7) and Figure 2). Thus, since the number of dropouts evolves rapidly with redshift, the ratio of interlopers to dropouts grows significantly with increasing redshift.
  • 3.  
    While the shape of the surface density distribution of dropouts stays relatively constant with increasing redshift, that of interlopers possibly gets steeper. Interlopers also tend to have a tail at the bright end.
  • 4.  
    Using an MC resampling of the interloper population, we estimate a contamination of the dropout samples in all the fields, ranging from $\sim 2 \% $ at $z\sim 5$ to $\sim 6 \% $ at $z\sim 8$ for the GSd field, with a clear trend of increasing contamination for higher redshift dropout samples. In the other fields, the contamination is similar, but systematically lower. The lowest level of contamination is found for GDw, indicating that having relatively deeper blue bands compared to red bands is the most effective circumstance to properly separate interlopers from dropouts.
  • 5.  
    Extrapolating with a power law the interloper number count distribution at the faint end to simulate ultradeep surveys, we derive that the contamination increases toward fainter magnitudes and ranges from 0.1 to 0.4 contaminants per arcmin2 at ${J}_{125}$ = 30, depending on the field considered. Generally, we find that these contaminants are located near the detection limit of the survey.
  • 6.  
    By means of SED modeling, we characterized the stellar population properties of the interlopers that may contaminate the dropout sample, and found objects with intermediate ages (∼1 Gyr at $z\sim 1.5\mbox{--}2$), very-low levels of ongoing star formation, and relatively low dust content.

Our results and contamination estimates are limited by restrictions to galaxy-like sources and to Gaussian noise. The former is not likely to be an issue for space-based observations with high angular resolution, but it might affect ground-based surveys that do not have the ability to discriminate between compact galaxies and stars. The assumption of normally distributed errors is again likely to underestimate the occurrence of rare, extreme events of photometric scatter, since data are likely to have an excess of noise compared to a normal distribution in their tails (Schmidt et al. 2014). Thus, our results are to be considered lower limits for the contamination of dropout samples. Also, while we focused on Lyman-break selection, a similar analysis would be expected to hold qualitatively if we had considered photometric redshift estimates to construct the sample of dropouts and interlopers, with the added complication of leaving more degrees of freedom in defining the selection and the separation between the two samples.

Overall, our key conclusion is that the dropout selection of high-redshift sources currently leads to samples with high purity, but the purity degrades when the number of dropouts becomes much smaller than the number of interlopers. We demonstrated this clearly for the ${Y}_{105}$-dropout sample from space observations over deep fields. A qualitatively similar conclusion on an increase of the contamination fraction is expected to hold for ground-based surveys over large areas as well, targeting the bright end ($m\sim 24\mbox{--}26$) of the galaxy luminosity function at high-z, since the relative number of dropouts versus interlopers is significantly suppressed. However, in this case, the objects are so bright that targeted follow-ups, such as spectroscopic observations, should be able to discriminate between high-z sources and contaminants.

Finally, extrapolating our results to future surveys at $z\gt 10$, we highlight the need to consider carefully the contamination of the dropout samples, since the number of objects expected at such early times will be orders of magnitude smaller than the number of interlopers with similar colors, and thus the contamination might exceed 50%. Fortunately, in this respect, the capability of JWST to observe efficiently at rest-frame optical wavelengths for sources at $z\gt 10$ will greatly help in continuing to select samples of photometrically selected objects with high purity, similar to the role played currently by Spitzer IRAC imaging to validate samples of bright dropouts at $z\sim 8\mbox{--}10$ identified by HST (Bouwens et al. 2015).

We thank the anonymous referee for insightful remarks that helped us to improve the paper. B.V. acknowledges the support from an Australian Research Council Discovery Early Career Researcher Award (PD0028506). This work was partially supported by grants ARC FT130101593, and HST/GO 13767, 12905, and 12572.

Facilities: HST(ACS), HST(WFC3).

Footnotes

  • Note that there is a further suppression of the flux in the region across the 912 Å rest-frame Lyman-continuum discontinuity, but in practice for galaxies at $z\gtrsim 6$ the non-detection starts at a $\lambda \leqslant 1216$ Å rest frame.

  • Note that this is distinct from the F140W image, indicated as JH140.

  • We note that within uncertainties, applying a more stringent S/N cut yields the same results, thus we opted for S/N > 6 to include in the analysis a larger number of objects.

  • We resort to the IRAC photometry from CANDELS (Guo et al. 2013), which we matched to our sources based on coordinates. IRAC photometry is available only for galaxies in GSd.

Please wait… references are loading.
10.3847/1538-4357/aa5caf