A publishing partnership

The Unusual Initial Mass Function of the Arches Cluster

, , , , , , , and

Published 2019 January 4 © 2019. The American Astronomical Society. All rights reserved.
, , Citation Matthew W. Hosek Jr. et al 2019 ApJ 870 44 DOI 10.3847/1538-4357/aaef90

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/870/1/44

Abstract

As a young massive cluster in the central molecular zone, the Arches cluster is a valuable probe of the stellar initial mass function (IMF) in the extreme Galactic center environment. We use multi-epoch Hubble Space Telescope observations to obtain high-precision proper-motion and photometric measurements of the cluster, calculating cluster membership probabilities for stars down to ∼1.8 M between cluster radii of 0.25 and 3.0 pc. We achieve a cluster sample with just ∼6% field contamination, a significant improvement over photometrically selected samples that are severely compromised by the differential extinction across the field. Combining this sample with K-band spectroscopy of five cluster members, we forward model the Arches cluster to simultaneously constrain its IMF and other properties (such as age and total mass) while accounting for observational uncertainties, completeness, mass segregation, and stellar multiplicity. We find that the Arches IMF is best described by a one-segment power law that is significantly top-heavy: α = 1.80 ± 0.05 (stat) ± 0.06 (sys), where dN/dm ∝ mα, though we cannot discount a two-segment power-law model with a high-mass slope only slightly shallower than local star-forming regions $(\alpha ={2.04}_{-0.19}^{+0.14}\pm 0.04)$ but with a break at ${5.8}_{-1.2}^{+3.2}\pm 0.02\,{M}_{\odot }$. In either case, the Arches IMF is significantly different than the standard IMF. Comparing the Arches to other young massive clusters in the Milky Way, we find tentative evidence for a systematically top-heavy IMF at the Galactic center.

Export citation and abstract BibTeX RIS

1. Introduction

A fundamental quantity in star formation is the initial mass function (IMF), which describes the distribution of stellar masses created during star formation. Though its functional form is debated (e.g., Chabrier 2005), the IMF is often represented as a multipart power law given by dN/dm ∝ mα, where

Equation (1)

as discussed in Kroupa (2002). Stellar populations in the Milky Way and nearby galaxies have been found to be consistent with this "local IMF," leading to the suggestion that it may be a universal property of star formation (see reviews by Bastian et al. 2010, Offner et al. 2014, and references therein). Thus, the local IMF is often used to describe stellar populations throughout the universe.

However, it is unknown whether the local IMF is applicable to environments other than those found in local star formation regions. Of particular interest are starburst environments, which exhibit extremely high gas densities and temperatures, radiation fields, and turbulence (e.g., Swinbank et al. 2011). Some studies predict that the increased thermal Jeans mass results in an overabundance of high-mass stars and a "top-heavy" IMF (e.g., Larson 2005; Bonnell et al. 2006; Klessen et al. 2007; Bonnell & Rice 2008; Papadopoulos et al. 2011; Narayanan & Davé 2013). Alternatively, others claim that the IMF is set by the mass distribution of prestellar cores within a molecular cloud (the core mass function (CMF)), which itself is set by turbulence (e.g., Padoan & Nordlund 2002; Hopkins 2012). These theories predict that the increased turbulence in starburst environments would favor the formation of low-mass stars and a "bottom-heavy" IMF (Hopkins 2013; Chabrier et al. 2014). However, recent simulations suggest that the CMF cannot be directly mapped to the IMF (e.g., Bertelli Motta et al. 2016; Liptai et al. 2017). A third set of studies contend that the IMF is driven by local processes such as radiative feedback (e.g., Bate 2009; Offner et al. 2009; Krumholz 2011; Krumholz et al. 2012) and is largely independent of environment (e.g., Guszejnov et al. 2016). Thus, understanding how the IMF behaves in starburst environments yields critical insight into the underlying physics driving star formation (e.g., Krumholz 2014).

There is some observational evidence that the IMF changes in starburst environments, though these results are debated. Studies of massive elliptical galaxies have found that the IMF becomes increasingly bottom-heavy with increasing velocity dispersion and/or α-element enhancement, conditions that reflect starburst-like conditions (e.g., Cappellari et al. 2012, 2013; Conroy & van Dokkum 2012; Conroy et al. 2013; La Barbera et al. 2013; Spiniello et al. 2014; Li et al. 2017). Further studies suggest that the cores of massive galaxies, which are thought to have formed rapidly in starburst-like environments at high redshift (e.g., Oser et al. 2010), are systematically bottom-heavy relative to the rest of the galaxy (e.g., Martín-Navarro et al. 2015; Conroy et al. 2017; van Dokkum et al. 2017; Parikh et al. 2018). However, these results rely on modeling stellar populations from unresolved stellar spectra, which is prone to systematic effects such as elemental abundance gradients (e.g., Zieleniewski et al. 2015, 2017; McConnell et al. 2016; Vaughan et al. 2018). Overall, the consistency of IMF determinations for a single galaxy using spectroscopic, kinematic, and lensing methods has not yet been established, with some galaxies showing agreement and others showing significant discrepancies (Lyubenova et al. 2016; Newman et al. 2017). This highlights the difficulty of measuring the IMF from these complex and unresolved stellar populations.

Massive star clusters in starburst galaxies (also known as super star clusters, or starburst clusters) also offer a probe into starburst environments. Still unresolved with current observing facilities, their mass functions are inferred from the light-to-mass ratios (e.g., Ho & Filippenko 1996). This analysis also faces many challenges, including the need for virial equilibrium, uncertainties in stellar models and extinction corrections, the impact of mass segregation and multiplicity, and anisotropy in the velocity dispersion (e.g., Bastian et al. 2007). A range of both bottom-heavy and top-heavy IMFs has been reported for these clusters, perhaps as a result of these difficulties (Larsen et al. 2004; McCrady et al. 2005; Bastian et al. 2006).

Ideally, one would directly measure the IMF of starburst environments using resolved stellar populations. Such investigations are possible at the Milky Way Galactic center (GC), which has been shown to exhibit similar densities, temperatures, and kinematics to those in starburst galaxies (Kruijssen & Longmore 2013; Ginsburg et al. 2016). The GC contains several young massive clusters (YMCs) whose youth and high mass make them ideal tools for measuring the IMF (Morris & Serabyn 1996). The young nuclear cluster (YNC; ∼2.5–5.8 Myr, M ≳ 2 × 104 M), which lies within the central parsec of the galaxy, has been found to have a top-heavy IMF with α = 1.7 ± 0.2 (Lu et al. 2013). The Arches cluster (2–4 Myr, M ∼ 4–6 × 104 M; Martins et al. 2008; Clarkson et al. 2012), located within the central molecular zone (CMZ) and at a projected distance of ∼26 pc from the central supermassive black hole, offers an additional opportunity to probe the IMF in this extreme environment.

Despite many efforts, the IMF of the Arches cluster has not yet been established. This is due to two significant challenges: mass segregation and differential extinction. As a result of mass segregation, the present-day mass function (PDMF) of the inner region (r ≲ 0.5 pc) has been measured to be top-heavy (Figer et al. 1999; Stolte et al. 2002, 2005; Kim et al. 2006), while the outer regions (r ≳ 0.5 pc) have been found to be either consistent with the local IMF or bottom-heavy (Espinoza et al. 2009; Habibi et al. 2013). Dynamical modeling is required to determine whether the observed PDMF is consistent with the local IMF (e.g., Kim et al. 2000; Harfst et al. 2010; Park et al. 2018), though the uncertainty in cluster orbit (Stolte et al. 2008) and initial conditions requires that a large parameter space must be considered.

In addition, inferring the IMF from the PDMF depends heavily on the PDMF at large cluster radii, where the differences between dynamical models are the largest (e.g., Figure 13 of Habibi et al. 2013). However, significant differential extinction (ΔAV ∼ 15 mag; Habibi et al. 2013) makes it challenging to separate the cluster from field populations via photometry, especially at large radii, where field star contamination can be high (e.g., Stolte et al. 2005). Measurements of the internal velocity dispersion of the cluster indicate that its mass function is top-heavy and/or truncated at low masses (Clarkson et al. 2012), but this has yet to be confirmed by direct star counts.

In this paper, we combine multi-epoch Hubble Space Telescope (HST) Wide Field Camera 3 (WFC3) IR observations with Keck OH-Suppressing Infrared Integral Field Spectrograph (OSIRIS) K-band spectroscopy to measure the IMF of the Arches cluster for M > 1.8 M. We describe our observations in Section 2 and our methods for calculating cluster membership probabilities, correcting for extinction, and measuring observational completeness in Section 3. In Section 4, we detail our forward-modeling technique for constraining the IMF, and in Section 5, we present our result that the Arches cluster IMF is inconsistent with the local IMF. We compare our result to past Arches IMF measurements and place it in context with other YMCs in the Milky Way in Section 6. We present our conclusions in Section 7.

2. Observations and Measurements

2.1. HST Photometry and Astrometry

Astrometry and photometry of the Arches cluster were obtained from observations with the infrared channel of the WFC3 on the HST for four epochs between 2010 and 2016. The 2010 epoch contains images in the F127M, F139M, and F153M filters (GO-11671, PI: A. M. Ghez), while the 2011, 2012, and 2016 epochs have images only in the F153M filter (GO-12318 and GO-12667, PI: A. M. Ghez; GO-14613, PI: J. R. Lu). A detailed description of the 2010–2012 observations is provided in Hosek et al. (2015, hereafter H15). The 2016 observations were designed to mimic the earlier F153M epochs in order to maximize the astrometric precision between the data sets. These observations have a field of view (FOV) of 120'' × 120'', providing coverage of at least 30% of the cluster area within successive circular annuli of width 0.25 pc out to 3 pc (Figure 1).

Figure 1.

Figure 1. Three-color HST image of the Arches cluster, with F127M = blue, F139M = green, and F153M = red. The inner and outer green circles represent cluster radii of 0.25 and 3.0 pc, which define the inner and outer boundaries of our HST sample, respectively. The yellow box near the center of the cluster corresponds to the Keck OSIRIS field, where K-band spectroscopy of five cluster members was obtained. The hole in the lower left side of the image is due to a known defect in the WFC3-IR chip.

Standard image High-resolution image

We extract high-precision astrometry and photometry using the FORTRAN codes img2xym_wfc3ir, a version of the img2xym_WFC package for WFC3-IR (Anderson & King 2006), and KS2, a generalization of the software developed for the Globular Cluster Treasury Program (Anderson et al. 2008; see also Bellini et al. 2018). A detailed description of this procedure and the analysis of the subsequent astrometric and photometric errors is provided in Appendix A of H15. In short, point-spread function (PSF) fit astrometry and photometry are extracted using a grid of spatially varying PSF models across the field. No significant differences in measurement precision were found for the 2016 epoch compared to the previous epochs, with average astrometric and photometric errors of 0.16 mas (1.3 × 10−3 pixel) and 0.008 mag, respectively, for the brightest nonsaturated stars (error on the mean calculated from 21 individual measurements). The photometry is calibrated to the Vega magnitude system using the improved KS2 zero points derived in Hosek et al. (2018, hereafter H18), which uses significantly more stars than the original zero-point derivation in H15.

The stellar positions in each epoch are transformed into a master astrometric reference frame using a second-order polynomial transformation in both X and Y (12 free parameters). The master frame is constructed such that there is no net motion of the cluster, as only high-probability cluster members (≥0.7) in the H15 catalog are used as reference stars. An iterative process is used to match stars, calculate initial proper motions, and rematch stars using those proper motions to identify stars across the epochs. The star matching is done by position, using a search radius of 0.5 pixel (0farcs06). Proper motions are calculated for stars detected in at least three F153M epochs using a linear fit to the X and Y positions as a function of time, weighted by their astrometric errors. The final star catalog contains ∼45,000 stars with proper-motion errors 3 times smaller than those of H15 on account of the increased time baseline, reaching a precision of ∼0.03 mas yr−1 at the bright end (Figure 2).

Figure 2.

Figure 2. Proper-motion error as a function of F153M magnitude in the final star catalog. For each star, the error shown is the average between the X and Y directions. The red dotted line denotes the proper-motion error limit of 1.42 mas yr−1 required for membership analysis (Section 3.1). The solid blue line shows the completeness limit of F153M = 21.18 mag, which corresponds to ∼1.8 M (Section 3.3). These errors are ∼3× lower than those reported in H15 due to the increased time baseline.

Standard image High-resolution image

2.2. Keck OSIRIS Spectroscopy

The K-band spectroscopy of a sample of Arches cluster members was obtained using OSIRIS (Larkin et al. 2006) with laser guide star adaptive optics (Wizinowich et al. 2006) on the Keck I telescope on 2014 May 16. The Kbb filter was used with the 0farcs10 pixel scale, which provides spectral coverage of 1.965–2.381 μm at R ∼ 3800 over a 1farcs× 6farcs4 FOV. A single field was observed near the core of the cluster (J2000: α = 17:45:50.7, δ = −28:49:23.4; Figure 1) at a position angle of 28° using 10 dithered exposures of 900 s for a total integration time of 9000 s. This field was chosen to maximize the number of non-Wolf-Rayet (WR) stars (F153M ≥ 14.5 mag; see Section 3.4) while avoiding the densest inner region of the cluster. The spectroscopic sample contains five stars, as described in Table 1.

Table 1.  OSIRIS Spectroscopic Sample

Namea R.A.b Decl.b Spectral Typec F127M F153M AKsd Teff log g
  (J2000) (J2000) (literature) (mag) (mag) (mag) (K) (cgs)
47 17:45:50.68 −28:49:24.39 O4–5 Ia 17.01 ± 0.01 14.85 ± 0.01 2.43 ${34750}_{-1500}^{+3000}$ ${3.50}_{-0.15}^{+0.30}$
44 17:45:50.62 −28:49:24.77 17.18 ± 0.01 14.91 ± 0.01 2.43 ${34500}_{-1500}^{+3000}$ ${3.75}_{-0.25}^{+0.15}$
53 17:45:50.64 −28:49:24.14 O4–5 Ia 17.10 ± 0.01 14.91 ± 0.01 2.43 ${37000}_{-2000}^{+2000}$ ${3.50}_{-0.10}^{+0.30}$
55 17:45:50.73 −28:49:24.54 O5.5–6 I–III 17.09 ± 0.01 14.97 ± 0.01 2.40 ${34500}_{-1500}^{+3000}$ ${3.85}_{-0.15}^{+0.25}$
60 17:45:50.74 −28:49:21.08 O4–5 Ia 17.20 ± 0.01 15.03 ± 0.01 2.39 ${36000}_{-1500}^{+2500}$ ${3.60}_{-0.15}^{+0.20}$

Notes.

aAs defined in the catalog from Figer et al. (2002). bMeasured in 2010 F153M epoch. cFrom Clark et al. (2018). dDerived using the extinction map in Section 3.2.

Download table as:  ASCIITypeset image

The OSIRIS data cubes were reduced using version 4.1.0 of the OSIRIS data reduction pipeline7 (ODRP; Krabbe et al. 2004). The ODRP corrects for dark current, electronic biases and crosstalk, and cosmic rays and properly extracts the wavelength-calibrated spectrum at each spaxel (spatial pixel). The science data cubes were averaged together using the "Mosaic Frames" module to create the master science data cube. One-dimensional science spectra were extracted using a 3 × 3 aperture box centered on the spaxel with the highest integrated flux for the star. This aperture size was chosen to maximize the signal-to-noise while minimizing contamination from nearby stars.

After extraction, the raw science spectra need to be corrected for contamination from sky features such as continuum, OH emission lines, and telluric absorption lines. The standard set of calibration observations (sky frames and telluric standards) were obtained at the telescope, but we found that the sky features were better corrected using the Skycorr8 (Noll et al. 2014) and molecfit9 (Kausch et al. 2015; Smette et al. 2015) software packages. Skycorr removes sky emission lines by fitting physically related OH line groups in a reference sky spectrum and scaling them to match the science spectrum (e.g., Davies 2007). The sky continuum is measured by a linear interpolation of the wavelength channels without line emission and then combined with the OH line model to produce the final sky spectrum that is subtracted from the science spectrum. In this case, a reference sky spectrum for each star is extracted using a box annulus formed by a 5 × 5 and 7 × 7 spaxel box centered on the star itself and then rescaled to science spectrum aperture size. Once Skycorr has removed the sky emission and continuum, the telluric absorption lines are modeled using molecfit, which uses a radiative transfer code and an atmospheric profile based on the date and location of the observations to predict atmospheric lines caused by molecules such as H2O, CO2, and CH4. The telluric model is then divided out of the science spectrum to produce the final reduced science spectrum.

However, as discussed by Lockhart et al. (2018), OSIRIS introduces a shape to the stellar continuum due to its varying sensitivity as a function of wavelength that cannot be modeled by molecfit. This requires the extra step of creating an OSIRIS "flat" free of sky, telluric, and stellar-flux contributions. We construct this flat using the observed telluric standards, empirically subtracting the sky, and using molecfit to remove the telluric lines. In the A0 V spectrum, the only remaining feature is the Br-γ line. To remove this line, we combine the A0 V and G2 V spectra using the technique described in Do et al. (2009), replacing the A0 V spectrum between 2.155 and 2.175 μm with the spectrum of the G2 V star after it has been divided by the solar spectrum. Finally, we smooth the resulting spectrum using a median filter (kernel size = 51 pixels) to create the OSIRIS flat. The science spectra are divided by this flat and normalized to produce the final science spectra (Figure 3).

Figure 3.

Figure 3. Reduced OSIRIS spectra of Arches cluster members. The gray regions mark wavelengths with high telluric absorption, while the red dotted lines denote several useful spectral features.

Standard image High-resolution image

3. Methods

3.1. Proper-motion-based Cluster Membership

Cluster membership probabilities are calculated using the proper motions derived in Section 2.1 and the Gaussian mixture-model technique described in H15. This approach provides the flexibility needed to fit the complex kinematics of the cluster and field populations while taking the proper-motion errors into account. To reduce outliers, an error cut of 1.42 mas yr−1 (one-third of the difference between the average cluster and field population proper motions in H15) is adopted, resulting in a membership catalog of 29,895 stars. This is significantly larger than the sample analyzed in H15 (∼6000 stars) because we adopt a proper-motion error cut that is 2.2× larger, do not impose a magnitude error cut, and generally have improved proper-motion errors due to the extra epoch of data. As a result, a five-Gaussian mixture model is required to fit the cluster and field populations (Figure 4), as opposed to the four-Gaussian model used in H15. This is confirmed by the Bayesian information criterion (BIC; see Equation (20) and Section 5.2 for description), which significantly favors the five-Gaussian model.

Figure 4.

Figure 4. Gaussian mixture-model fit to the observed cluster and field proper-motion distributions. Top: vector point diagram of the proper motions with the 1σ Gaussian contours overlaid. The red Gaussian corresponds to the cluster, while the blue, green, cyan, and magenta Gaussians describe the field population. The right panel is a zoomed-in version of the left panel, focusing on the cluster distribution. Bottom: observed (black) vs. predicted (red) proper-motion distributions in the R.A. and decl. directions (left and right panels, respectively). Good agreement is found between the observations and model.

Standard image High-resolution image

Individual cluster membership probabilities are calculated as

Equation (2)

where πc and πk are the fraction of total stars in the cluster and kth field Gaussian, respectively, and ${P}_{c}^{i}$ and ${P}_{k}^{i}$ are the probability of the ith star being part of the cluster and kth field Gaussian, respectively. A table describing the parameters of the Gaussian mixture-model fit is provided in Appendix A.

3.2. Extinction Correction

Red clump (RC) stars are used to correct for differential extinction across the field. The intrinsic magnitudes and colors of these stars do not vary significantly with age or metallicity, making them useful "standard crayons" with which to measure extinction (Girardi 2016). While not associated with the Arches cluster itself, RC stars are numerous in the Galactic bulge and have a density distribution that is sharply peaked at the GC (Wegg & Gerhard 2013). Thus, we assume that the extinction of the RC stars is similar to that of the cluster, so an extinction map derived using RC stars can be used for cluster stars. This approach was validated in H15, who showed that an RC extinction map significantly reduced the differential extinction in proper-motion-selected Arches members.

We improve the extinction map presented in H15 by using a refined sample of RC stars identified using an unsharp-masking technique (e.g., De Marchi et al. 2016) and adopting a revised version of the optical/near-infrared extinction law presented in H18 (Appendix B). The advantage of the unsharp-masking technique is that it increases the contrast of high-density features, such as the RC population, while reducing low-frequency noise. We select RC stars using the criteria described in H18: we calculate a best-fit line to the high-density RC feature in the color magnitude diagram (CMD) after unsharp masking and identify stars within ΔF153M = 0.3 mag of the best-fit line as the RC population (see Figure 7 from H18). This width is selected to encompass the RC feature and is likely caused by the distribution of stellar distances, metallicities, and ages within the population, all of which alter their location in the CMD. In addition, we consider only stars with Pclust ≤ 0.02 in order to eliminate cluster members from the sample (which is necessary since the populations overlap in CMD space) and require a photometric error better than 0.05 mag in both the F127M and F153M filters in order to remove field interlopers that scatter into the selection space. Ultimately, 875 RC stars are used in the final extinction map.

The Arches extinction map is created using a spatial interpolation of the RC star sample with a fifth-order bivariate spline10 (Figure 5). All pixels with rcl < 0.25 pc are removed from the map, since high stellar crowding prevents an adequate number of RC stars from being detected at these radii. Ignoring the extreme values at the edge of the field where the interpolation becomes invalid, the extinction map values are in the range 1.9 mag < AKs < 2.65 mag, with a median extinction of AKs = 2.38 mag for stars with Ppm ≥ 0.5. We will adopt this as an initial estimate for the average extinction of the cluster and include a term in the IMF analysis to capture residual differential extinction in the cluster due to errors in the extinction map (Section 4).

Figure 5.

Figure 5. RC-interpolated extinction map for the Arches cluster field, with the positions shown in arcseconds relative to the cluster center. No measurement is made for rcl < 0.25 pc due to the low HST completeness in the area.

Standard image High-resolution image

3.3. Completeness

Observational completeness is determined using artificial-star planting and recovery tests. We plant a total of 675,000 artificial stars and run them through the same detection pipeline as the real stars. These stars are generated in three sets. The first set contains 400,000 artificial stars with magnitudes drawn from the observed CMD, perturbed by a random amount drawn from a Gaussian distribution with a width equal to the photometric uncertainty. These stars are planted uniformly across the field. The second set contains 175,000 artificial stars that are assigned to a grid of magnitudes and colors in order to cover sparsely populated regions of the CMD (e.g., the brightest and faintest observed magnitudes) in order to improve the confidence of the completeness corrections in these regions. These stars are also given a uniform spatial distribution. The final set of 100,000 artificial stars is generated based on the brighter stars in the observed CMD (F153M ≤ 18 mag) and planted according to the radial profile of the Arches cluster from H15. This increases the confidence of the completeness correction near the cluster center, where the effects of stellar crowding are strongest.

After the artificial stars are extracted by the detection pipeline, their photometric and astrometric errors are lower than the real data errors because they do not account for PSF uncertainty. Following H15, a magnitude-dependent error term is added in quadrature to the artificial-star errors so their distribution matches those of the real-star errors. Proper motions are then calculated and photometry differentially dereddened for the artificial stars in the same manner as the real stars. To be successfully recovered, an artificial star must be detected within 0.5 mag of its planted magnitude and 0.5 pixels of its planted position in at least three of the four F153M epochs and the F127M epoch and have a proper-motion error ≤1.42 mas yr−1. The resulting F127M and F153M completeness curves as a function of differentially dereddened magnitudes in different cluster radius bins (0 pc ≤ R ≤ 3 pc in steps of 0.25 pc) are shown in Figure 6.

Figure 6.

Figure 6. Observational completeness as a function of cluster radius and differentially dereddened F153M (left panel) and F127M (right panel) magnitude. At the average color of the cluster in the CMD, the F153M curve sets the completeness. Due to the low completeness in the innermost radius bin (0–0.25 pc), we exclude stars at these radii from the IMF analysis. We require a minimum of 30% completeness across the sample (red horizontal line) and thus adopt an F153M magnitude cut at F153M = 21.18 mag (Section 3.4).

Standard image High-resolution image

For the IMF analysis, we calculate the completeness for each star based on its cluster radius and position in the CMD. Within a given radius bin, the CMD is binned in steps of 0.15 mag in F153M (range: 24.5–12.3 mag) and 0.2 mag in F127M–F153M (range: 0–5 mag). The completeness in each bin is assigned to the lowest value from the F127M and F153M completeness curves at the respective F153M and F127M magnitudes at the center of the bin. At the average color of the cluster, the F153M curve sets the completeness limit.

3.4. Final Sample

Starting with the cluster membership catalog described in Section 3.1 (29,895 stars), we apply a series of cuts in order to produce a high-quality sample for the IMF analysis. We require the following.

  • 1.  
    Ppm ≥ 0.3, in an effort to reduce the number of field stars in our sample.
  • 2.  
    A minimum of 30% completeness, as determined in Section 3.3. Due to the limited HST completeness at small cluster radii, we only consider stars with rcl > 0.25 pc. We thus achieve a depth of F153M ≤ 21.18 mag, corresponding to M ≥ ∼1.8 M. We note that the analysis is not sensitive to this choice of the completeness limit; adopting a minimum completeness of 50% does not significantly impact the results, other than changing the lower mag limit to F153M ≤ 20 mag (M ≥ ∼2.5 M).
  • 3.  
    A minimum of 30% area coverage within successive circular annuli of width 0.25 pc. As discussed in H15, this is achieved for rcl ≤ 3.0 pc.
  • 4.  
    All F153M measurements for a given star to agree with its median F153M magnitude within 0.5 mag. This was found to remove situations where a faint star is misidentified as a nearby bright star.
  • 5.  
    WR stars removed from our sample, given the uncertainty in their stellar models and thus stellar masses. We use the population of spectroscopically identified WR stars in the Arches cluster (Figer et al. 2002; Martins et al. 2008; Clark et al. 2018) to determine their F153M magnitudes at the average cluster extinction of AKs = 2.38 mag. The faintest of these stars, star B1 in Clark et al. (2018), is found to have a differentially dereddened magnitude of F153M = 14.1 mag (observed F153M = 14.01 mag; the star is less extinguished than the cluster average), so we adopt a conservative magnitude cut of F153M ≥ 14.5 mag.

Finally, a photometric color cut is used to remove obvious field contaminants from the sample. High-probability cluster members (Ppm ≥ 0.6) are corrected for differential extinction as described in Section 3.2, and a 3σ clipping algorithm is used to calculate the average F127M–F153M color and standard deviation as a function of F153M magnitude. For the entire sample, stars with differentially dereddened colors larger than 2σ to the blue or 3σ to the red of the cluster sequence are automatically assigned Ppm = 0, while all others are unchanged. This color cut is more conservative to the red in order to account for the fact that some stars may have intrinsic reddening due to circumstellar disk material due to the cluster's young age (e.g., Stolte et al. 2015).

After these cuts, we are left with a sample of 981 stars with $\sum {P}_{\mathrm{pm}}=638.0$. The CMD of this sample before and after the differential extinction correction is shown in Figure 7, and a summary of the cuts and their impact on the sample size is given in Table 2.

Figure 7.

Figure 7. Left: observed CMD of the proper-motion-selected sample (Ppm ≥ 0.3; red) vs. the field stars (black). Due to the significant overlap between the populations, proper-motion analysis is required to obtain an accurate cluster sample. Right: differentially dereddened CMD of the stars used in the IMF analysis. The bright red points are stars with Ppm ≥ 0.3 and F153M magnitudes within the adopted magnitude limits (blue dashed lines). Stars eliminated by the color or magnitude cuts are shown as faded red points. The cluster sequence significantly tightens after the differential extinction correction, though a term for residual differential extinction is still required in the IMF analysis.

Standard image High-resolution image

Table 2.  Sample Selection

Selection Description Criterion Nstars $\sum {P}_{\mathrm{pm}}$
Original sample   29,895 1290.7
    Cut from Sample  
Membership Ppm ≥ 0.3 28,237  
Completeness  ≥0.3 539  
F153M mag diff.  ≤0.5 mag 45  
WR stars F153M ≥ 14.5 mag 16  
Color cut See Section 3.4 78  
Final sample   980 636.7

Download table as:  ASCIITypeset image

Despite these efforts, some field contamination inevitably remains in our sample. This is due to stars with similar proper motions and colors as the cluster, so their membership probabilities are artificially inflated. In Section 5.1, we derive revised cluster membership probabilities after the IMF analysis using the best-fit cluster and field model in order to take full advantage of the photometric information. We find that the number of cluster stars based on Ppm is ∼6% larger than the number of cluster stars based on the revised membership probabilities, and we thus conclude that the sample contains approximately this amount of field contamination.

3.5. Spectroscopic Analysis

Effective temperatures and surface gravities are derived for the spectroscopic stars by comparing the spectra to non-LTE CMFGEN model atmospheres (Hillier & Miller 1998; Hillier & Lanz 2001). Non-LTE treatment is required due to the high temperatures of the stars and the presence of significant stellar winds, as evidenced by the Br-γ emission inferred from the weak Br-γ photospheric absorption line. Uncertainties in the stellar parameters are estimated by adjusting the models until they no longer provide good fits to the main diagnostic lines. Throughout the analysis, we assume a terminal velocity (Vinf) of 2000 km s−1, since this cannot be constrained from the spectra.

The best-fit model spectra are shown in Figure 8, and the corresponding Teff and log g values are reported in Table 1. Here Teff is constrained to within ±3000 K or better and is determined primarily from the He ii/He i line ratios, as well as the absorption component of the He i 2.113 μm line. Stars 47, 55, and 60 were recently classified as O4–5 Ia stars and star 53 as an O5.5–6 I–III star by Clark et al. (2018). Our derived temperatures are consistent with the observed Teff–versus–spectral type relation for galactic O-type stars within the uncertainties (Martins et al. 2005). The log g values are less well constrained since they rely on the weak Br-γ lines and thus are not used in the IMF analysis.

Figure 8.

Figure 8. Best-fit CMFGEN models (red) compared to the observed spectra (black).

Standard image High-resolution image

4. Modeling the Cluster

We use a forward-modeling approach to derive the IMF of the Arches cluster, comparing the observations to a cluster and field model within a Bayesian framework. The methodology described in Lu et al. (2013) is expanded to simultaneously fit the IMF and other cluster parameters while taking into account degeneracies between cluster parameters, observational uncertainties, stellar multiplicity, and the empirical field population. Two IMF models are used: a one-segment power law and a two-segment power law. In the one-segment IMF model, the free parameters are the high-mass IMF slope α1, cluster mass (Mcl), age (log t), distance (d), average extinction (AKs), and residual differential extinction after the extinction map correction (ΔAKs). The two-segment IMF model has additional free parameters mbreak and Xα, where mbreak is the mass at which the IMF slope is α2 = Xα ∗ α1 for m ≤ mbreak and α1 for m > mbreak. We require that 0 ≤ Xα ≤ 1 to enforce that α2 < = α1 (i.e., the low-mass IMF slope is more shallow than the high-mass IMF slope). The model parameters and their adopted priors are presented in Table 3.

Table 3.  IMF Model Parameters

Parameter Description Priora Units
α1 High-mass IMF slope U(1.0, 3.0)
Xα α2/α1b U(0, 1)
mbreak Break massb U(2, 14) M
Mcl Massc U(3000, 50,000) M
log t Age U(6.2, 7.0) log (yr)
d Distance G(8000, 250) pc
AKs Average extinction U(1.5, 2.7) AKs (mag)
ΔAKs Differential extinction U(0, 0.5) AKs (mag)

Notes.

aUniform distributions: U(min, max), where min and max are bounds of the distribution; Gaussian distributions: G(μ, σ), where μ is the mean and σ is the standard deviation. bOnly used in two-segment IMF model. cFormally, Mcl is the cluster mass between mmin and mmax (0.8 and 150 M, respectively), since this is the mass range over which the IMF is sampled when constructing the cluster.

Download table as:  ASCIITypeset image

To create a synthetic cluster, a population of stellar masses is stochastically generated based on the input IMF and the total cluster mass. We use the numerical formulation described by Pflamm-Altenburg & Kroupa (2006) to efficiently generate masses from the IMF between 0.8 and 150 M. Note that this is the mass range over which Mcl is valid, since masses above and below these values are not generated in the synthetic cluster. The multiplicity of each star is determined using the mass-dependent multiplicity fraction, companion-star fraction, and mass ratio empirically derived by Lu et al. (2013) from studies of nearby young clusters in the literature. Stars and their companions are generated in batches until the cumulative stellar mass is larger than the designated mass of the cluster. Then, the population is trimmed to the star at which the cumulative mass is closest to the overall cluster mass, and then one additional star is drawn from the IMF and added to the sample.

Stellar evolution models are used to determine the physical properties of each star in the population. For a given age, a stellar evolution model provides the effective temperature (Teff) and surface gravity (log g) at each stellar mass. We use two sets of stellar evolution models: the Pisa evolution models (Tognelli et al. 2011) for the pre-main-sequence stars and the most recent Geneva models with rotation (Ekström et al. 2012) for the main-sequence and evolved stars. The Pisa models have been shown to be consistent with observations of eclipsing binaries (Stassun et al. 2014) and nearby moving groups (Herczeg & Hillenbrand 2015) for stars above 1 M and are advantageous in that they model pre-main-sequence stars to high masses (∼7 M). High-mass pre-main-sequence stars are necessary due to the young age of the Arches cluster. The Geneva models have been shown to match observations for all but the most massive stars (M > 60 M; Martins & Palacios 2013), where stellar evolution models become uncertain.

The physical properties are fed into a stellar atmosphere model, which returns a spectral energy distribution (SED) for each star. We assume solar metallicity, consistent with spectroscopic studies of the bright WR stars that find the Arches metallicity to be solar (Najarro et al. 2004) or slightly supersolar (Z = 1.3–1.4 Z; Martins et al. 2008). Two sets of atmosphere models are used: an ATLAS9 grid (Castelli & Kurucz 2004) for Teff > 5500 K and a PHOENIX grid (version 16; Husser et al. 2013) for Teff < 5000 K. An average between the two model grids is used in the transition region between 5000 and 5500 K. Both model grids assume local thermodynamic equilibrium (LTE), an assumption that begins to fail for massive stars. However, synthetic photometry calculated with ATLAS9 models compared to non-LTE CMFGEN models (Fierro et al. 2015) shows differences of ≤∼0.017 mag in F153M up to temperatures of 31,000 K.

The choice of stellar evolution and atmosphere models is an unavoidable source of systematic uncertainty in our analysis. To assess the impact of our model selections, we also run our IMF analysis using the recent MIST v1.0 evolution models (Choi 2016; Dotter 2016), which are computed using the Modules for Experiments in Stellar Astrophysics (MESA) code (Paxton et al. 2011, 2013, 2015). These analyses are discussed in Section 6.4.

We use Pysynphot (STScI Development Team 2013) to calculate synthetic photometry for the individual stars in the cluster population. The SEDs are reddened to the model AKs according to the extinction law and then convolved with the WFC3-IR F127M and F153M filter transmission functions. Multiple systems are treated as unresolved, with the total flux in each filter calculated as the sum of the system components. To simulate differential extinction, the photometry of each star system is perturbed by a random amount drawn from a Gaussian distribution centered at zero with a width corresponding to the given ΔAKs in that particular filter.

Finally, the synthetic stars are assigned cluster radii based on the observed radial density profile of the Arches. We combine the radial profile for R < 0.25 pc from Espinoza et al. (2009) with the magnitude-dependent profiles in the range 0.25 pc ≤R ≤ 3.0 pc from H15 (one profile for F153M > 17 mag, the other for F153M ≤ 17 mag) for complete radial coverage over our data range. Each star's cluster radius is drawn from the following probability density distribution:

Equation (3)

where Σb(r) and Σf(r) are the bright-star (F153M ≤ 17 mag) and faint-star (F153M > 17 mag) radial profiles, respectively; cb and cf are constants such that ${\int }_{r=0\,\mathrm{pc}}^{r=3\mathrm{pc}}P(r)=1$; and a(r) is the fraction of the observed area at radius r (a(r) = 1.0 for 0 pc < r ≤ 2.3 pc, a(r) < 1.0 for r > 2.3 pc). Thus, we are able to simulate mass segregation in the synthetic cluster and can properly account for the fact that all stars with r < 0.25 pc are removed from the observed sample due to low completeness. The synthetic cluster stars are then binned using the same radius, color, and magnitude bins as the completeness calculations (Section 3.3) in preparation for the IMF analysis.

4.1. Bayesian Analysis

For a cluster model Θ, we adopt a likelihood function with four components,

Equation (4)

where $p({{\boldsymbol{k}}}_{\mathrm{obs}}| {\rm{\Theta }})$ is the probability of obtaining the observed distribution of stars in CMD space, with ${{\boldsymbol{k}}}_{\mathrm{obs}}$ representing the set of observed F153M magnitudes and F127M–F153M colors; $p({N}_{\mathrm{cl}}| {\rm{\Theta }})$ is the probability of detecting the number of observed cluster stars Ncl; $p({N}_{W}| {\rm{\Theta }})$ is the probability of detecting the observed number of WR stars; and $p(\{{T}_{\mathrm{eff}},{m}_{\mathrm{obs}}\}| {\rm{\Theta }})$ is the probability of measuring the observed Teff values for the spectroscopic stars given their F153M magnitudes mobs.

To calculate $p({{\boldsymbol{k}}}_{\mathrm{obs}}| {\rm{\Theta }})$, we must first calculate the CMD probability distribution for the cluster model and the field. The intrinsic CMD probability distribution for cluster stars generated by the model Θ, $p{({{\boldsymbol{k}}}_{\mathrm{int}}| {\rm{\Theta }})}_{\mathrm{cl}}$, is calculated according to the procedure described in Section 4. Here ${{\boldsymbol{k}}}_{\mathrm{int}}$ is the distribution of synthetic star magnitude and colors in the model cluster. To reduce the impact of stochastic effects in the synthetic CMD, the model cluster is generated with a total mass of 5 × 106 M (∼500 times more massive than the expected mass of the Arches), regardless of the Mcl designated by the model. To calculate the observed CMD probability distribution for the model cluster, we apply the observational completeness and make the same magnitude cuts as the observed sample (Section 3.4),

Equation (5)

where $p{({{\boldsymbol{k}}}_{\mathrm{int},r}| {\rm{\Theta }})}_{\mathrm{cl}}$ and C(r) are the intrinsic model cluster CMD probability distribution and observational completeness at a cluster radius r, Nr is the number of radius bins, and Nk is the total number of magnitude–color bins in the CMD itself.

In addition to the synthetic cluster, we construct a CMD probability distribution for the field stars. We select all stars with Ppm ≤ 0.03; apply the same differential extinction correction, magnitude, and color cuts as the IMF analysis sample; and normalize to calculate the field CMD probability distribution $p({{\boldsymbol{k}}}_{\mathrm{obs},f})$,

Equation (6)

where ${{\boldsymbol{k}}}_{\mathrm{obs},f}$ is the observed field CMD. Note that we do not apply a completeness correction, since the CMD is already "observed" and thus already inherently included, and that $p({{\boldsymbol{k}}}_{\mathrm{obs},f})$ is not dependent on the cluster model.

With the cluster and field CMD probability distributions in place, we can calculate the probability of observing the ith star given its color and magnitude ($p({k}_{\mathrm{obs},i}| {\rm{\Theta }})$). We infer that the field membership probability for a given star is Pf = 1 − Ppm. To incorporate observational error, we assume that ${k}_{i}={k}_{i}^{{\prime} }+{\epsilon }_{i}$, where epsiloni is drawn from a normal distribution centered at zero and with standard deviation drawn from the set of observational errors σk,i. Thus,

Equation (7)

The final CMD likelihood is calculated by multiplying the individual likelihoods for the observed stars together,

Equation (8)

where Nobs is the number of stars in the sample.

The second component of the likelihood, $p({N}_{\mathrm{cl}}| {\rm{\Theta }})$, is calculated from the number of cluster stars we would predict to observe given the cluster model. Returning to the intrinsic synthetic cluster CMD ${{\boldsymbol{k}}}_{\mathrm{int}}$, we perturb the photometry of each star by a random amount drawn from the photometric error of the observations at its magnitude and then apply the magnitude cuts and observational completeness. Following Lu et al. (2013), we linearly scale the number of stars in the simulated cluster after it is convolved with the observational completeness (Nsim) to the cluster model mass in order to obtain the expected number of observed stars Ne,

Equation (9)

where Mcl is the cluster model mass. The probability of obtaining the observed number of cluster stars ${N}_{\mathrm{cl}}=\sum {P}_{\mathrm{pm}}$ is calculated from a Poisson distribution:

Equation (10)

The purpose of applying the observational errors to ${{\boldsymbol{k}}}_{\mathrm{int}}$ for this calculation is to account for any potential Malmquist bias that is introduced by our magnitude cuts. Note that this is not done in Equation (5) for the CMD component of the likelihood, since the observational errors are already accounted for in Equation (7).

The third component of the likelihood is based on the predicted number of WR stars in the cluster model, which serves as a constraint on the cluster age (e.g., Lu et al. 2013). The brightest stars in the inner region of the cluster (rcl <0.75 pc) were cataloged by Figer et al. (2002), and later spectroscopic studies identified 13 WR stars among this sample (Martins et al. 2008; Clark et al. 2018). In the cluster model, we calculate the number of predicted WR stars within this radius range and, similarly scaling that number to cluster model mass, calculate the probability of obtaining the observed number of WR stars,

Equation (11)

where NW = 13 and is the number of WR stars in the observations within rcl < 0.75 pc, and ${N}_{{W}_{0}}$ is the number of WR stars predicted by the scaled cluster model in that same radius range.

The final component of the likelihood comes from from the Teff measurements from the spectroscopic sample. For each star, we calculate $\overline{{T}_{{\mathrm{eff}}_{0}}}$ and ${\sigma }_{T{\mathrm{eff}}_{0}}$, which represent the median Teff and its standard deviation for all stars in the cluster model with $({m}_{\mathrm{obs}}-{\sigma }_{{m}_{\mathrm{obs}}})\leqslant m\leqslant ({m}_{\mathrm{obs}}+{\sigma }_{{m}_{\mathrm{obs}}})$ and $({\mathrm{col}}_{\mathrm{obs}}-{\sigma }_{{\mathrm{col}}_{\mathrm{obs}}})\,\leqslant \mathrm{col}\leqslant ({\mathrm{col}}_{\mathrm{obs}}+{\sigma }_{{\mathrm{col}}_{\mathrm{obs}}})$, where mobs, ${\sigma }_{{m}_{\mathrm{obs}}}$, colobs, and ${\sigma }_{{\mathrm{col}}_{\mathrm{obs}}}$ are the F153M magnitude and F127M–F153M color of the observed star and its respective errors. The likelihood of measuring Teff for the star is then

Equation (12)

where Teff and ${\sigma }_{{T}_{\mathrm{eff}}}$ are the measured effective temperature and associated error of the star and ${\sigma }_{\mathrm{tot}}=\sqrt{{\sigma }_{{T}_{\mathrm{eff}}}^{2}+{\sigma }_{T{\mathrm{eff}}_{0}}^{2}}$. The likelihood of the spectroscopic sample is calculated by multiplying the individual likelihoods together,

Equation (13)

where Nspec is the number of stars in the spectroscopic sample.

We derive the best-fit cluster model using the Bayes theorem,

Equation (14)

where $P({\rm{\Theta }}| {{\boldsymbol{k}}}_{\mathrm{obs}},{N}_{\mathrm{cl}},{N}_{\mathrm{WR}},{T}_{\mathrm{eff}})$ is the posterior probability for the given model Θ, ${ \mathcal L }({{\boldsymbol{k}}}_{\mathrm{obs}},{N}_{\mathrm{cl}},{N}_{\mathrm{WR}},{T}_{\mathrm{eff}}| {\rm{\Theta }})$ is the likelihood equation, P(Θ) is the priors on the model free parameters, and $P({{\boldsymbol{k}}}_{\mathrm{obs}},{N}_{\mathrm{cl}},{N}_{\mathrm{WR}},{T}_{\mathrm{eff}})$ is the sample evidence. To sample the parameter space to find the best-fit model, we use Multinest, a publicly available multimodal sampling algorithm shown to be more efficient that Markov chain Monte Carlo algorithms when exploring complex parameter spaces (Feroz et al. 2009). We adopt an evidence tolerance of 0.5, a sampling efficiency of 0.8, and 1000 live points to run the analysis. The algorithm is run using the python wrapper module PyMultinest (Buchner et al. 2014).

We test the accuracy of this procedure by running the analysis on simulated clusters of known properties. A discussion of how the simulated clusters are created and the results of the tests are provided in Appendix D. We find that the analysis is able to recover the input values to within 1σ for all parameters for both the one-segment and two-segment IMF models.

4.2. Model-dependent Membership Probabilities and Stellar Properties

After the best-fit cluster model is determined, we calculate revised cluster membership probabilities that take full advantage of the available kinematic and photometric information. The cluster model provides the distribution of cluster stars in CMD space, from which stars with proper motions similar to the cluster but with photometry similar to the field can be deweighted. First, we calculate the expected cluster CMD ${{\boldsymbol{k}}}_{{\rm{\Theta }},{cl}}$ and field star CMD ${{\boldsymbol{k}}}_{{\rm{f}}}$,

Equation (15)

where $p{({{\boldsymbol{k}}}_{\mathrm{int}}| {\rm{\Theta }})}_{\mathrm{cl},\mathrm{obs}}$ and $p({{\boldsymbol{k}}}_{\mathrm{obs},f})$ are as defined in Equations (5) and (6). The revised membership probability for a given star then becomes

Equation (16)

and Pclust is thus a combination of the proper-motion membership (which sets the scale of cluster and field CMD components) and the cluster and field CMDs themselves.

We also use the best-fit cluster model to infer the intrinsic properties (e.g., mass) for each star in the observed sample. These values are often estimated by tracing the star to a theoretical cluster isochrone along the reddening vector, but this approach is challenging near the pre-main-sequence turn-on where multiple intersections between the reddening vector and isochrone can occur. Instead, we calculate a probability distribution for the desired stellar property from ${{\boldsymbol{k}}}_{\mathrm{int}}$, based on the stars located at the observed star's location in the CMD. For example, the mass probability distribution within a given CMD bin k is

Equation (17)

where mi,b,k is the mass of the ith star in mass bin b in the CMD bin k, Ni is the number of stars in mass bin b, and Nb is the total number of mass bins. The mass bins are chosen to be 20 equal log-spaced bins between 0.8 and 70 M, which are the minimum and maximum masses in the cluster model.11

For a given star, we calculate its mass probability distribution by multiplying $p{(m| {\rm{\Theta }})}_{k}$ by the position of the star in the CMD convolved with its photometric error:

Equation (18)

We construct the observed IMF Φobs by summing the mass probability distributions over the sample, taking into account each star's revised cluster membership probability, observational completeness, and area completeness,

Equation (19)

where C(k, r) is the completeness as a function of CMD position and radius and a(r) is the area completeness. We reiterate that Φobs is dependent on the synthetic cluster and calculated after the best-fit model is found. It thus serves as a check that the IMF derived in the analysis is indeed a good match to the observations.

5. Results

We find that the Arches cluster is best described by a one-segment IMF model that is top-heavy (α = 1.80 ± 0.05 (stat) ± 0.06 (sys)). However, we cannot discount a two-segment IMF model with a high-mass slope closer to the local IMF value $(\alpha ={2.04}_{-0.19}^{+0.14}\pm 0.04)$ but with a break at ${5.8}_{-1.2}^{+3.2}\pm 0.02\,{M}_{\odot }$. This section is organized as follows. We describe the best-fit IMF model in Section 5.1 and compare the one- and two-segment IMF model solutions in Section 5.2. In Section 5.3, we discuss the impact of our assumptions regarding stellar evolution models and stellar multiplicity.

5.1. The Arches Cluster IMF: Best-fit Model

The best-fit cluster models for each of the different cases examined in this analysis (one-segment versus two-segment IMF, Pisa/Geneva versus MIST evolution models, with versus without multiplicity) are given in Table 4, and a breakdown of the corresponding likelihoods is given in Table 5. A detailed comparison of these cases is presented in Sections 5.2 and 5.3, but in summary: (1) the one-segment IMF model is slightly favored over the two-segment IMF model, but we cannot rule out the two-segment IMF model; (2) we cannot distinguish between the Pisa/Geneva and MIST evolution models in the one-segment IMF case, but the MIST models are favored in the two-segment IMF case; and (3) the fits without multiplicity are strongly disfavored. As a result, we adopt the one-segment IMF fit with the Pisa/Geneva models and multiplicity as the best-fit IMF for the Arches cluster, and we use the MIST model solution to estimate the systematic error. When discussing the two-segment IMF fit, we adopt the MIST model solution with multiplicity and use the Pisa/Geneva model solution to estimate the systematic error.

Table 4.  Best-fit Cluster Models

  One-segment IMF Two-segment IMF
Parametera Pisa/Genevab MIST v1.0c Pisa/Genevab MIST v1.0c
α1 1.80 ± 0.05 1.68 ± 0.05 2.12 ± 0.11 ${2.04}_{-0.19}^{+0.14}$
α2 0.95 ± 0.45 ${1.10}_{-0.31}^{+0.39}$
mbreak ${5.4}_{-0.8}^{+2.4}$ ${5.8}_{-1.2}^{+3.2}$
Mcl ${24400}_{-1600}^{+2000}$ ${28600}_{-2800}^{+3000}$ ${19600}_{-1600}^{+2000}$ ${21000}_{-2800}^{+3400}$
log t 6.57 ± 0.02 6.56 ± 0.02 6.60 ± 0.05 ${6.55}_{-0.04}^{+0.02}$
d 7900 ± 158 7900 ± 160 8030 ± 160 8100 ± 160
AKs 2.44 ± 0.01 2.44 ± 0.01 2.46 ± 0.02 2.45 ± 0.01
ΔAKs 0.15 ± 0.01 0.15 ± 0.01 0.13 ± 0.01 0.15 ± 0.01

Notes.

aPriors and units are the same as described in Table 3. Note that only statistical uncertainties are reported in this table. Systematic uncertainties are estimated to be half the difference between the parameter values derived using different stellar evolution models. bPisa: Tognelli et al. (2011); Geneva: Ekström et al. (2012). cChoi (2016), Dotter (2016).

Download table as:  ASCIITypeset image

Table 5.  IMF Model Likelihoods

  One-segment IMF Two-segment IMF
Component Pisa/Genevaa MIST v1.0b No Multiples Pisa/Genevaa MIST v1.0b No Multiples
CMD −5058.6 −5060.2 −5067.0 −5055.8 −5051.9 −5057.4
Nstars −4.48 −4.15 −4.23 −4.23 −4.22 −4.18
NWR −3.24 −2.46 −4.23 −2.21 −3.45 −2.26
Spectroscopy −19.48 −19.45 −21.08 −19.48 −19.54 −19.35
log(${ \mathcal L }$) −5103.1 −5103.5 −5116.9 −5101.4 −5098.6 −5102.9
BIC 10247.5 10248.3 10275.1 10257.9 10252.3 10260.9

Notes.

aPisa: Tognelli et al. (2011); Geneva: Ekström et al. (2012). bChoi (2016), Dotter (2016).

Download table as:  ASCIITypeset image

The posteriors for the IMF model parameters are provided in Appendix C. A comparison between the observed and model CMD is shown in Figure 9, and the subsequent F153M luminosity function shown in Figure 10. Good agreement is generally found between the observations and model, though perhaps with a slight excess of model stars at the bright end of the sample (F153M ≲ 16 mag). The agreement between the spectroscopic Teff measurements and those predicted by the model is shown in a Hertzsprung–Russell (HR) diagram, where the (model-dependent) luminosity for each of the observed stars has been derived in the manner described in Section 4.2 (Figure 11). The luminosities $(\mathrm{log}L/{L}_{\odot }\sim 5.0\mbox{--}5.2)$ are noticeably smaller than what has been measured for O-type supergiants of similar spectral type $(\mathrm{log}L/{L}_{\odot }\sim 5.6\mbox{--}5.95$; Najarro et al. 2011; Bouret et al. 2012), though further work is required to determine if these stars are truly anomalous. The total number of cluster stars predicted by the model (618.9 ± 33) is in good agreement with the observed value $(\sum {P}_{\mathrm{pm}}=636.7)$, though the expected number of WR stars is ∼1.3σ higher than observed (18.4 ± 1.75, compared to NWR = 13).

Figure 9.

Figure 9. Comparison between the observed and predicted CMD from the best-fit cluster model. The left panel shows the Hess diagram for the observed cluster, the middle panel shows the Hess diagram of the best-fit cluster model, and the right panel shows the residuals between the two. The cluster model has been convolved with observational uncertainties in this comparison. In all panels, the isochrone associated with the best-fit model is plotted as a red line, and the F153M magnitude limits are represented by the cyan dashed lines. Note that the cluster model contains both cluster and field components; the impact of the RC is particularly evident by the slight high-density diagonal feature near F153M ∼ 18 mag.

Standard image High-resolution image
Figure 10.

Figure 10. Comparison of the observed F153M luminosity function (black points) vs. the best-fit model (red line). The 1σ envelope of possible models sampled from the posterior distribution is shown by the red shaded area. Good agreement is found, with the exception of a possible excess of model stars in the brightest magnitude bins (F153M ≲ 16 mag).

Standard image High-resolution image
Figure 11.

Figure 11. Measured Teff and inferred luminosity of the spectroscopic stars (black points) compared to the best-fit model (red line).

Standard image High-resolution image

We obtain a high-mass power-law slope of α = 1.80 ± 0.05, which is either ∼1.6σ or ∼10σ lower than the local IMF value, depending on whether the uncertainty on the local IMF is considered. Perhaps a more informative comparison is that our result is ∼8.3σ lower than the measured IMF of young clusters in M31 ($\alpha ={2.45}_{-0.06}^{+0.03};$ Weisz et al. 2015). A comparison of these values is shown in Figure 12. This suggests that the Arches has a top-heavy IMF with an overabundance of high-mass stars relative to low-mass stars for M > ∼1.8 M. The α we derive does somewhat depend on which stellar evolution model we adopt, as the best-fit cluster with the MIST models has α = 1.68 ± 0.05. We thus add a systematic error term of 0.06 to our α measurement (the difference between the α values of the two fits divided by 2), so the final constraint becomes α = 1.80 ± 0.05 (stat) ± 0.06 (sys). Note that when the statistical and systematic errors are added in quadrature, the Arches result remains 6.6σ lower than the M31 result.

Figure 12.

Figure 12. Posterior probability distribution for the high-mass IMF slope α in the Arches cluster (red) compared to the local IMF (black dotted line; Kroupa 2002) and the IMF of young clusters in M31 (blue dotted line; Weisz et al. 2015), with the 1σ statistical uncertainties shown by the respective shaded regions. The Arches IMF slope is significantly lower than that of the Milky Way or M31, indicating that the cluster has a top-heavy IMF. Note that the uncertainties shown in this figure are statistical in nature. We estimate a systematic uncertainty of ±0.06 in our measurement of α.

Standard image High-resolution image

The best-fit cluster age is log t = 6.57 ± 0.02 (∼3.7 ± 0.17 Myr), with negligible systematic error. This is generally older than previous ages reported in the literature. Past estimates come primarily from spectroscopic studies of the massive stars, with values of 2–2.5 Myr based on the observed nitrogen abundances and luminosities of WR stars (Najarro et al. 2004), 2–4 Myr based on the locations of WR + O stars on the HR diagram (Martins et al. 2008), 2–3.3 Myr based on the spectral types of candidate main-sequence stars (Clark et al. 2018), and ${2.6}_{-0.2}^{+0.4}\,\mathrm{Myr}$ based on the properties of an eclipsing binary in the cluster (Lohr et al. 2018). However, a cluster age of 3.7 ± 0.7 was obtained by Schneider et al. (2014) based on the shape of the PDMF relative to stellar population models with binary star evolution, which is more consistent with our result.

We infer a cluster mass of ${M}_{\mathrm{cl}}=({2.44}_{-0.16}^{+0.20}\pm 0.21)\,\times {10}^{4}\,{M}_{\odot }$, which represents the intrinsic mass between 0.8 and 150 M out to a cluster radius of 3 pc. This assumes that the one-segment IMF model is valid over the entire mass range and that the radial profile is adequately modeled for r < 0.25 pc, which is outside the boundary of the observed sample (Section 6.4). However, the advantage of this result is that it is jointly constrained with the IMF, while previous photometric mass estimates of the cluster are needed to adopt an IMF and extrapolate it to achieve a similar depth (e.g., Serabyn et al. 1998; Figer et al. 1999; Espinoza et al. 2009).

As a consistency check, we compare the best-fit cluster model to dynamical mass estimates of the cluster made by Clarkson et al. (2012). Using the velocity dispersion of the cluster core region, they estimate the dynamical mass of the cluster to be ${0.9}_{-0.35}^{+0.40}\times {10}^{4}\,{M}_{\odot }$ for rcl < 0.4 pc and ${1.5}_{-0.60}^{+0.74}\times {10}^{4}\,{M}_{\odot }$ for rcl < 1.0 pc. Since our model only contains stellar masses down to 0.8 M, we would expect the enclosed mass at these radii to be lower than the dynamical estimate. This is indeed the case, with model enclosed masses of (0.74 ± 0.10) × 104 and (1.2 ± 0.14) × 104 M for rcl <0.4 and 1.0 pc, respectively.

We use the procedure outlined in Section 4.2 to calculate revised membership probabilities and Φobs. Figure 13 shows the Ppm and Pclust for the individual stars in the CMD. A comparison of the panels reveals the regions where Ppm > Pclust, suggesting that Ppm is overestimated due to field contamination, which is especially evident near the RC (the diagonal distribution of stars to the red of the cluster sequence at F153M ∼ 18 mag) and faint field star distribution (the stars to the blue of the cluster sequence at F153M ≥ 20 mag). The total number of cluster stars based on Pclust is 601.3, which is ∼6% smaller than what is calculated from Ppm. Thus, we estimate that Ppm (which was used in the IMF analysis) contains ∼6% field contamination.

Figure 13.

Figure 13. Ppm (left) and Pclust (right) for the observed sample, plotted in the CMD. Here Pclust is a more accurate determination of the cluster membership probability, since it uses both proper-motion and photometric information but is dependent on the best-fit cluster model from the IMF analysis. Regions where Ppm > Pclust reveal field contamination in the proper-motion memberships, in particular around the RC (F153M ∼ 18 mag, F127M–F153M > ∼2.5 mag) and faint field stars (F153M ≥ 20 mag, F127M–F153M < ∼2.5 mag). All magnitudes have been differentially dereddened to AKs = 2.38 mag using the extinction map.

Standard image High-resolution image

The observed IMF Φobs is shown in Figure 14. Also plotted is the Φobs we would obtain if we adopted a cluster model identical to the best fit but with the local IMF. The mass function obtained with the local IMF is significantly inconsistent with the observations, while the mass function obtained from the best-fit model is a good match to the observations.

Figure 14.

Figure 14. IMF of the Arches cluster constructed using Pfinal and the stellar mass probability distributions derived using the best-fit cluster model. The red points represent the IMF constructed using the observed stellar masses calculated with the model, while the red line is the best-fit IMF itself. The 1σ uncertainty in the best-fit cluster model is represented by the red shaded region, which is calculated by drawing different sets of parameter values from the joint posterior distribution. The red square represents the number of WR stars predicted by the best-fit model, compared to the observed number (black star). A good agreement is found between the observed IMF and the cluster model. On the other hand, the blue points represent the IMF constructed using stellar masses derived from a cluster identical to the best fit but with a Milky Way IMF (α = 2.3), with the intrinsic cluster IMF shown by the blue dashed line. The Milky Way IMF is a poor fit to the data, as it significantly underestimates the number of high-mass stars and overestimates the number of low-mass stars.

Standard image High-resolution image

A catalog of the observed stars with membership probabilities and mass estimates is provided as a machine-readable table with this paper. A sample of the catalog is shown in Table 6.

Table 6.  Stellar Parameters

Namea F127Mb F153Mb Xc σx Yc σy μαcosδ ${\sigma }_{{\mu }_{\alpha {\cos }\delta }}$ μδ ${\sigma }_{{\mu }_{\delta }}$ M σM AKs Ppm Pclust
  mag mag '' '' '' '' mas yr−1 mas yr−1 mas yr−1 mas yr−1 M M mag    
Ae035_001 15.99 14.12 27.5032 9.50E-04 21.3643 1.15E-03 −0.03 0.025 −0.19 0.023 26.3 3.9 2.24 0.84 0.91
Aw061_002 16.38 14.47 −55.0359 1.63E-03 26.2556 1.46E-03 −0.04 0.054 −0.02 0.064 22.7 3.4 2.35 0.90 0.89
Aw048_004 16.39 14.49 −43.1954 1.12E-03 21.9127 1.22E-03 −0.22 0.013 −0.15 0.031 22.7 5.5 2.13 0.74 0.89
As017_001 16.95 14.71 7.6862 1.03E-03 −15.6326 1.15E-03 0.08 0.019 0.15 0.064 26.3 3.9 2.05 0.82 0.94
An022_002 17.27 14.72 −10.0457 1.03E-03 19.4646 1.15E-03 −0.06 0.087 −0.15 0.027 35.4 8.5 2.28 0.86 0.38
Aw006_001 17.28 14.86 −5.6225 1.03E-03 −0.0122 1.15E-03 −0.27 0.014 0.06 0.017 30.5 7.4 2.34 0.69 1.00
Ae010_001 17.04 14.92 7.3893 1.22E-03 6.8467 1.22E-03 0.15 0.040 −0.04 0.027 26.3 3.9 2.46 0.85 0.98

Notes.

aNaming convention is as follows: The first letter is always "A", followed by "n/s/e/w" to designate whether the star is to the north, south, east or west quadrant relative to the central reference star, as determined using 45° and −45° line boundaries that intersect at the reference star position. The number immediately following gives the radius of the given star relative to the reference in arcseconds, while the second number (following the "_" separator) is a unique index for the star relative to all others at that same radius. bObserved magnitudes not corrected for differential extinction. cPositions are reported as offsets relative to a central reference star, chosen to be star 8 in the Clarkson et al. (2012) catalog. Note that this star is in the cluster core, which is excluded from this catalog. The x-direction is ${\rm{\Delta }}\alpha \cos {\rm{\Delta }}\delta $ while the y-direction is Δδ, where Δα and Δδ are the offsets in RA and DEC, respectively. Reported positional uncertainties are the uncertainties in the star position and the reference star position added in quadrature.

Only a portion of this table is shown here to demonstrate its form and content. A machine-readable version of the full table is available.

Download table as:  DataTypeset image

5.2. One-segment versus Two-segment IMF Model

The best-fit two-segment cluster model is also significantly different than the local IMF, but in a different manner than the one-segment IMF model. While the high-mass IMF slope is perhaps slightly shallow $({\alpha }_{1}={2.04}_{-0.19}^{+0.14}\pm 0.04)$, the real discrepancy is in the detection of a significant mbreak at ${5.8}_{-1.2}^{+3.2}\pm 0.02\,{M}_{\odot }$, which is an order of magnitude larger than the local IMF (mbreak = 0.5 M). The power-law slope below mbreak is ${\alpha }_{2}={1.10}_{-0.31}^{+0.39}\pm 0.08$, which is consistent with the local IMF values of 1.3 ± 0.3 for 0.08 M ≤ M < 0.5 M (Kroupa 2002). As a result of the high mbreak, the two-segment IMF solution could be characterized as "bottom-light," with a deficit of low-mass stars relative to the local IMF. Figure 15 shows the two-segment model compared to the observed luminosity function and the derived Φobs.

Figure 15.

Figure 15. Comparison of the best-fit two-segment IMF model with the observed luminosity function (left) and Φobs (right). The features of the plots are the same as described for Figures 10 and 14. The two-segment IMF solution cannot be ruled out based on our data. Additional studies are required to distinguish between the one- and two-segment IMF models.

Standard image High-resolution image

One of the advantages of the Bayesian framework is that we can distinguish between one- and two-segment IMF models by comparing the likelihoods of the best-fit solutions. We use the BIC (Schwarz 1978) for this comparison,

Equation (20)

where n is the total number of stars in the sample (980 in this case), k is the number of free parameters in the model (i.e., six for the one-segment model and eight for the two-segment model), and ${ \mathcal L }$ is the best-fit likelihood of the model. When comparing the two models, the model with the lowest BIC is preferred, and the absolute value of the difference between the BIC values (ΔBIC) indicates if the preference is statistically significant. Table 5 contains the likelihoods and BIC values for the one- and two-segment IMF fits.

The one-segment IMF model is slightly preferred over the two-segment IMF model in both the Pisa/Geneva and MIST cases, with ΔBIC = 10.4 and 8.2, respectively. To assess the significance of these values, we generate artificial clusters with one- and two-segment IMFs as described in Appendix D (adopting the best-fit values in the Arches solutions), fit them in both the one- and two-segment cases, and calculate the corresponding ΔBIC values. In our simulations, the ΔBIC values between the one- and two-segment models are consistently a factor of 1.5–7.5 times greater the actual Arches data. Thus, the real data are substantially more agnostic on the distinction between one- and two-component IMF models than the synthetic data sets. We conclude that we cannot rule out the two-segment IMF model, though we adopt the one-segment IMF model as the overall best fit. In either case, our results show that the Arches cluster IMF is significantly different from the local IMF.

5.3. The Impact of Stellar Evolution Models and Stellar Multiplicity

Table 4 reveals that the best-fit model parameters are only weakly dependent on the choice of stellar evolution model. Similar to Section 5.2, we use the BIC test to determine whether our analysis prefers one set of evolution models over the other. For the one-segment IMF model, the Pisa/Geneva model solution is slightly favored with ΔBIC = 0.8. However, artificial-cluster tests show that this ΔBIC is not significant, as the average difference between evolution model fits is ΔBIC = 2.1 ± 1.2. Thus, we conclude that we cannot distinguish between the two solutions, and we adopt the Pisa/Geneva solution as the result and use the MIST solution to estimate the systematic error. In the two-segment IMF case, the MIST solution is favored with a ΔBIC = 5.6. The simulations show that this discrepancy is indeed significant, with an average difference of ΔBIC = 5.39 ± 2.56 between two-segment IMF fits using different evolution models. As a result, we adopt the MIST solution for the two-segment IMF case and use the Pisa/Geneva solution to estimate the systematic error.

Whether stellar multiplicity is accounted for in the cluster model is found to significantly impact the quality of the fit. The BIC analysis strongly favors the models that include stellar multiplicity, with ΔBIC values of 27.6 and 8.6 for the one- and two-segment IMF model cases, respectively. As seen in Table 5, this difference is primarily driven by the CMD likelihood component. Artificial-cluster tests show that the observed ΔBIC values are significant; for artificial clusters that have intrinsic multiplicity, ΔBIC = 12.2 ± 0.5 in favor of the fit with multiplicity in the one-segment IMF case and ΔBIC = 8.8 ± 0.5 in the two-segment IMF case. Thus, we adopt the model fits with multiplicity included over those without.

6. Discussion

6.1. Past IMF Measurements of the Arches Cluster

Our result that the high-mass slope of the Arches IMF is significantly top-heavy differs from previous photometric studies of the cluster that have found the IMF to be largely consistent with the local IMF (Kim et al. 2006; Espinoza et al. 2009; Habibi et al. 2013; Shin & Kim 2015). However, a key advantage of this study is the use of proper motions to calculate cluster membership probabilities, which produces a significantly more accurate sample of cluster members than is possible through photometry alone. For example, Figure 16 shows a comparison between cluster samples obtained using proper motions versus a photometric color cut similar to Habibi et al. (2013). Even when limited to r < 1.5 pc and M > 10 M (the range of PDMF was measured by Habibi et al. 2013), the photometric sample is systematically larger than the proper-motion selection due to field contamination. On the other hand, adopting stricter color cuts on photometric samples can be problematic as well, as Espinoza et al. (2009) noted that the color cuts they adopted forced them to eliminate stars that could be high-mass (M > 16 M) cluster members.

Figure 16.

Figure 16. Comparison between Arches cluster members selected via proper motion vs. a photometric color cut. The proper-motion sample, shown as the red solid and dashed lines, contains all stars with Ppm > 0.3, where each star is weighted by its membership probability for radius ranges of 0.25 pc < r < 3.0 pc and 0.25 pc < r < 1.5 pc, respectively. The photometric sample is selected as all stars with differentially dereddened F127M–F153M colors within ±0.3 mag of the average color on the main sequence, similar to Habibi et al. (2013). The photometric sample is larger than the proper motion due to field contamination, even at high masses (the blue dashed line represents M = 10 M).

Standard image High-resolution image

An alternative approach is to statistically subtract the field from the cluster using the field population observed in nearby control fields (e.g., Kim et al. 2006; Shin & Kim 2015). However, differential extinction can alter both the average extinction and the distribution of extinction values between two fields (e.g., note the detailed extinction structures in Figure 5). As a result, it is challenging to obtain a sufficiently accurate model of the field stars in the cluster observations. In addition, care must be taken that the control fields are beyond the extent of the cluster, which H15 showed extends to a radius of at least 75'' (∼3 pc).

It is interesting to note that several previous studies have reported evidence of an enhancement in the PDMF at ∼6 M, whether it be evidence of a turnover (Stolte et al. 2005) or a localized "bump" in the mass function (Kim et al. 2006). The presence of such a feature may be driving the two-segment IMF model solution. Future studies are needed to extend the proper-motion-selected sample to lower masses in order to definitively distinguish between the one- and two-segment IMF models and determine whether an enhancement at 5–6 M truly exists.

6.2. A Top-heavy IMF Near the GC?

The top-heavy IMF we obtain for the Arches cluster (α = 1.80 ± 0.05 ± 0.06) is in good agreement with the YNC (α = 1.7 ± 0.2 for M > 10 M; Lu et al. 2013). This suggests that this unusual IMF extends beyond the central parsec of the Galaxy and into the CMZ, which spans a galactocentric radius of ∼200 pc (Morris & Serabyn 1996). Unfortunately, the exact birth location of the Arches is not well constrained due to the range of possible orbits allowed by the three-dimensional motion of the cluster (Stolte et al. 2008; Kruijssen et al. 2015). Further, the proper motion of the cluster in the galactocentric reference frame is not yet well determined, as current estimates are based on the relative proper motion between the cluster and a single-Gaussian kinematic model for the field (e.g., Clarkson et al. 2012). In reality, the field exhibits a more complex kinematic structure (see H15 and Appendix A), so the cluster motion may need to be revised. This is left to a future paper.

However, this result raises the question of whether the top-heavy IMF is truly due to the GC environment or if it is a general property of YMCs (see review by Portegies Zwart et al. 2010). In Figure 17, we compare IMF measurements of YMCs in the Milky Way disk to the YNC and Arches cluster at the GC. The YMC sample includes Westerlund 1 (Wd1; Gennaro et al. 2011; Lim et al. 2013; Andersen et al. 2017), Westerlund 2 (Wd2; Zeidler et al. 2017), NGC 3603 (Harayama et al. 2008; Pang et al. 2013), Trumpler 14 and 16 (Hur et al. 2012), and h and χ Persei (Slesnick et al. 2002).

Figure 17.

Figure 17. Plot of IMF slope α vs. mass for YMCs in the Galactic disk (blue points: Wd2, Trumpler 14 and 16, h and χ Persei; purple squares: Wd1; turquoise triangles: NGC 3603) and the GC (red circle: YNC; red star: Arches cluster, with statistical and systematic errors added in quadrature). The dotted error bars in the X direction show the mass range over which the measurement was made, while the solid error bars in the Y direction show the measurement uncertainty. The references are provided in the text; Wd1 and NGC 3603 have their own symbols in order to represent the multiple values reported in the literature. Also shown is the local IMF (black dashed line) and the IMF measured for the young cluster in M31 from Weisz et al. (2015; cyan box).

Standard image High-resolution image

Figure 17 shows that the YMCs in the Galactic disk are generally consistent with the local IMF, though potential discrepancies exist. In particular, NGC 3603 has been found to be potentially top-heavy ($\alpha ={1.74}_{-0.47}^{+0.62}$ and 1.88 ± 0.15; Harayama et al. 2008 and Pang et al. 2013, respectively). However, these results may be biased due to mass segregation, which both studies find to be significant in the cluster. Indeed, the uncertainty in the Harayama et al. (2008) measurement is quite large in order to account for this (as well as other) systematic uncertainties, while Pang et al. (2013) acknowledged that their IMF measurement is restricted to the inner 60'' of the cluster. The IMF of Wd1 is potentially discrepant as well, with reported high-mass IMF slopes that are both near-standard ($\alpha ={2.44}_{-0.20}^{+0.08};$ Gennaro et al. 2011, via near-infrared photometry) and top-heavy (α = 1.8 ± 0.1; Lim et al. 2013, via optical photometry). The inconsistency between these studies makes it difficult to draw conclusions about the IMF of Wd1, though the low-mass stellar content of the cluster has been found to be consistent with the local IMF (Andersen et al. 2017). These cases highlight the difficulty of these measurements, as differences in cluster membership selection, stellar models, and methodology may significantly impact results.

Given the uncertainties surrounding the NGC 3603 and Wd1 measurements, the fact that other YMCs in the Galactic disk have been found to be consistent with the local IMF while the Arches and YNC are top-heavy provides tentative evidence that the top-heavy IMF is indeed caused by the extreme GC environment. We discuss the implications of a top-heavy IMF at the GC in Section 6.3 and the caveats of our Arches IMF measurement in Section 6.4.

The Quintuplet cluster is a third YMC in the CMZ that provides an additional probe of the IMF at the GC. A proper-motion-based analysis of the Quintuplet mass function was carried out by Hußmann et al. (2012), who found a top-heavy PDMF ($\alpha ={1.68}_{-0.09}^{+0.13}$) for the inner 0.5 pc of the cluster. However, it is uncertain whether this is due to mass segregation or a top-heavy IMF. A study of the Quintuplet IMF using a similar approach as this work is in progress.

6.3. Implications for Star Formation

At first, a top-heavy IMF at the GC appears to favor star formation models where the increased thermal Jeans mass leads to the formation of more high-mass stars (e.g., Larson 2005; Bonnell et al. 2006; Klessen et al. 2007; Bonnell & Rice 2008; Papadopoulos et al. 2011; Narayanan & Davé 2013). However, the main prediction of these models is that the break mass of the IMF should increase, leading to a deficit of low-mass stars, rather than an overabundance of high-mass stars and a shallow high-mass slope. This behavior is similar to the "bottom-light" two-segment IMF solution, but we do not yet have enough evidence to conclude that this is preferred over the top-heavy one-segment IMF solution.

However, our results are generally inconsistent with models where the IMF is set by the CMF (e.g., Padoan & Nordlund 2002; Hopkins 2012). Though the combination of turbulence and gravity naturally produces a CMF with a shape similar to the local IMF, these models predict a steeper mass slope and a bottom-heavy IMF near the GC (Hopkins 2013; Chabrier et al. 2014). This suggests that the CMF cannot be directly mapped to the IMF and that additional physical processes are involved. On the other hand, it has been suggested that the gravoturbulent fragmentation of a molecular cloud may lead to a top-heavy IMF (and possibly a bump around 5–6 M) if the coalescence of prestellar cores is highly efficient, as might be expected in Arches-like environments (Dib et al. 2007). In addition, recent simulations have shown that a top-heavy IMF can be produced in turbulence-dominated environments if the turbulent probability density distribution can be described as a power law at high densities (Lee & Hennebelle 2018).

Previous studies have shown that radiative feedback (e.g., Bate 2009; Offner et al. 2009; Krumholz 2011), protostellar outflows (e.g., Krumholz et al. 2012; Federrath et al. 2014), and magnetic fields (e.g., Hennebelle et al. 2011; Myers et al. 2013) can impact the IMF as well. Only recently have simulations begun to incorporate all of these processes simultaneously (Myers et al. 2014; Krumholz et al. 2016; Cunningham et al. 2018; Li et al. 2018). However, these simulations have been limited to molecular clouds with initial masses ≤1000 M and are only applicable to low-mass stars in environmental conditions similar to local star-forming regions. Future simulations of higher-mass molecular clouds in starburst-like environments are needed in order to determine what physics is behind a shallow high-mass IMF slope in the GC.

6.4. Caveats

A caveat of our IMF measurement is that we do not take the potential effects of tidal stripping into account. Tidal stripping might play a significant role in the evolution of the Arches cluster given the strong Galactic tidal field near the GC. Since tidal stripping preferentially removes low-mass stars from a cluster (e.g., Kruijssen 2009; Lamers et al. 2013), it could bias the mass function to appear top-heavy. However, it is unclear from current dynamical models of the Arches whether tidal stripping would significantly impact the mass range examined in this study (M ≳ 1.8 M). The N-body simulations by Habibi et al. (2014) predict the formation of tidal tails out to 20 pc from the cluster core that potentially contribute to the population of isolated massive stars observed near the GC. Additional simulations by Park et al. (2018) also predict the formation of tidal tails but find that ∼98% of the tidally stripped stars have masses less than 2.5 M and that the impact on the mass function above this limit is minor. This is consistent with the observations of H15 that find no evidence of tidal tails down to ∼2.5 M and a cluster radius of 3 pc. Thus, we assume that the effects of tidal stripping can be ignored for the mass range in our sample.

It should be noted that the dynamical models discussed above require assumptions regarding the initial conditions and orbit of the Arches cluster, both of which are quite uncertain. In addition, only stars are considered in the simulations, though the expulsion of excess gas after cluster formation is expected to have a significant impact on the dynamical evolution of the cluster as well (e.g., Bastian & Goodwin 2006; Goodwin & Bastian 2006; Farias et al. 2015).

Another caveat is that this analysis does not contain data for r < 0.25 pc, where the observational completeness is low due to stellar crowding. We adopt the radial profile of Espinoza et al. (2009) for this region when modeling the cluster (Section 4), but their profile was derived only for stars with M > 10 M. So, while magnitude-dependent radial profiles are used to account for mass segregation in the range 0.25 pc < r < 3.0 pc, the profile for all stars within the cluster core is assumed to be the same. Combining the HST data set from this study with higher-resolution ground-based observations of the cluster core would remove the need for this assumption.

Finally, we note that changing the extinction law does not have a significant impact on the IMF results. To demonstrate this, we repeat the analysis using the Nishiyama et al. (2009) and original H18 extinction laws, which are shallower (i.e., lower Aλ/AKs values) and steeper (i.e., higher Aλ/AKs values) than the law we ultimately adopt, respectively (Appendix B). In both cases, the only parameter that significantly changes is the overall extinction, which decreases for the H18 law (AKs = 2.07 ± 0.01 mag) and increases for the Nishiyama et al. (2009) law (AKs = 2.47 ± 0.01 mag). The high-mass IMF slope only changes by $| {\rm{\Delta }}\alpha | =0.01$ in the one-segment case and $| {\rm{\Delta }}\alpha | =0.04$ in the two-segment case, well within the 1σ uncertainties. Additionally, $| {\rm{\Delta }}{\alpha }_{2}| =0.03$ and $| {\rm{\Delta }}{m}_{\mathrm{break}}| =0.49\,{M}_{\odot }$ for the two-segment case, again well within the uncertainties. Therefore, the extinction law is not a significant source of uncertainty in this analysis.

7. Conclusions

We use multi-epoch HST WFC3-IR observations and Keck OSIRIS K-band spectroscopy to measure the IMF of the Arches cluster. Critically, we use proper motions to calculate cluster membership probabilities for stars down to ∼1.8 M over a radius range of 0.25 pc ≤ rcl ≤ 3.0 pc, obtaining a sample with just ∼6% field contamination. This is a significant improvement over purely photometric studies, which are compromised by the severe differential extinction across the field. Our proper-motion sample contains $\sum {P}_{\mathrm{pm}}=638.0$ cluster members, which we combine with K-band spectra of five O-type giants and supergiants in order to measure the IMF.

We forward model the Arches cluster to simultaneously constrain its IMF with the cluster distance, total mass, average extinction, and residual differential extinction (after a spatially dependent extinction correction). This approach allows us to account for observational uncertainties, completeness, mass segregation, and stellar multiplicity. We generate synthetic clusters and compare them to the observations using a likelihood equation with four components: the distribution of stars in the color–magnitude diagram, the total number of observed stars, the total number of Wolf–Rayet stars with rcl < 0.75 pc (taken from spectroscopic surveys in the literature), and the measured Teff of the spectroscopic stars versus those predicted by the cluster model.

We find that the Arches IMF is best described by a one-segment power law with a slope of α = 1.80 ± 0.05 (stat) ± 0.06 (sys), which is significantly more shallow than the local IMF and thus "top-heavy." However, we cannot discount a two-segment power-law model that has a high-mass slope only slightly less than the local IMF slope $({\alpha }_{1}={2.04}_{-0.19}^{+0.14}\pm 0.04)$ but exhibits a break at ${5.8}_{-1.2}^{+3.2}\pm 0.02\,{M}_{\odot }$. This would make the Arches IMF deficient in low-mass stars and thus "bottom-light." In either case, the Arches IMF is significantly different than the local IMF common throughout the Milky Way and nearby galaxies.

The unusual nature of the Arches IMF, combined with the top-heavy IMF observed for the YNC (α = 1.7 ± 0.2; Lu et al. 2013), suggests that the starburst-like environment at the GC induces variations in the IMF. Other YMCs in the Galactic disk have been found to be generally consistent with the local IMF, suggesting that these variations are truly due to the GC environment rather than an intrinsic property of YMCs. However, several disk YMCs (NGC 3603, Wd1) have been found to be potentially discrepant with the local IMF, so future studies must clarify the nature of their IMFs in order to strengthen this conclusion.

We note that the potential impact of tidal stripping is not included in our analysis. Measurements of the stellar radial density profile (Hosek et al. 2015) and the N-body simulations of the Arches (Park et al. 2018) suggest that tidal stripping has not significantly impacted the mass function over the mass range examined in this study. However, better constraints on the cluster orbit (e.g., Stolte et al. 2008) and full dynamical modeling of the stars and primordial gas are needed to fully explore the effects of tidal stripping. This is beyond the scope of the current study.

New observational capabilities will offer exciting opportunities for future study of the Arches cluster IMF. In particular, the James Webb Space Telescope (JWST) will provide the increased sensitivity and spatial resolution required to extend the IMF significantly beyond the current completeness limits (e.g., Kalirai 2018), allowing us to distinguish between the one- and two-segment IMF solutions. In addition, the larger FOV offered by JWST will facilitate the detection of tidal tails, which will yield critical new insight into the cluster's dynamical evolution.

The authors thank Kelly Lockhart and Tuan Do for help with OSIRIS data reduction, as well as the anonymous referee for feedback. M.W.H. and J.R.L. acknowledge support from NSF AAG (AST-1518273) and HST GO-13809. F.N. acknowledges financial support through Spanish grants ESP2015-65597-C4-1-R and ESP2017-86582-C4-1-R (MINECO/FEDER). This work is based on observations made with the NASA/ESA Hubble Space Telescope, obtained at the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-26555. The observations are associated with programs 11671, 12318, 12667, and 14613. Additional observations were obtained at the W.M. Keck Observatory, which is operated as a scientific partnership among the California Institute of Technology, the University of California, and the National Aeronautics and Space Administration. The Observatory was made possible by the generous financial support of the W.M. Keck Foundation. The authors wish to recognize and acknowledge the very significant cultural role and reverence that the summit of Maunakea has always had within the indigenous Hawaiian community. We are most fortunate to have the opportunity to conduct observations from this mountain. This research has made extensive use of the NASA Astrophysical Data System.

Facilities: HST (WFC3-IR) - , Keck:II (OSIRIS) - KECK II Telescope.

AstroPy (Astropy Collaboration et al. 2013), extlaw_H18.py (Hosek et al. 2017), KS2 (Anderson et al. 2008), Matplotlib (Hunter 2007), Molecfit (Kausch et al. 2015; Smette et al. 2015), SciPy (Jones et al. 2001), Skycorr (Noll et al. 2014).

Appendix A: Gaussian Mixture Model

The Gaussian mixture model used to describe the cluster and field kinematics is described in Table 7. Cluster membership probabilities are calculated using this model as discussed in Section 3.1.

Table 7.  Cluster and Field Population Model: Free Parameters, Priors, and Results

  Cluster Gaussian Field Gaussian 1 Field Gaussian 2 Field Gaussian 3 Field Gaussian 4
Parametera Priorb Result Prior Result Prior Result Prior Result Prior Result
πk U(0, 1) 0.047 ± 0.003 U(0, 1) 0.182 ± 0.019 U(0, 1) 0.467 ± 0.026 U(0, 1) 0.296 ± 0.023 U(0, 1) 0.008 ± 0.001
μα,k (mas yr−1) G(0, 0.2) −0.01 ± 0.014 U(−7, 4) −0.69 ± 0.05 U(−7, 4) −1.75 ± 0.07 U(−7, 4) −1.90 ± 0.08 U(−7, 4) −0.76 ± 1.36
μδ,k (mas yr−1) G(0, 0.2) −0.34 ± 0.014 U(−7, 4) −1.01 ± 0.06 U(−7, 4) −2.65 ± 0.10 U(−7, 4) −2.89 ± 0.10 U(−7, 4) −0.20 ± 1.44
σa,k (mas yr−1) U(0, 4) 0.15 ± 0.01 U(0, 20) 1.27 ± 0.08 U(0, 20) 2.68 ± 0.05 U(0, 20) 3.46 ± 0.09 U(0, 20) 14.41 ± 1.24
σb, k (mas yr−1) σb = σa 0.15 ± 0.01 U(0, σa,k) 0.66 ± 0.05 U(0, σa,k) 1.39 ± 0.06 U(0, σa,k) 3.21 ± 0.09 U(0, σa,k) 11.24 ± 1.01
θk (rad) 0 U(0, π) 0.93 ± 0.04 U(0, π) 0.99 ± 0.02 U(0, π) 1.01 ± 0.14 U(0, π) 0.79 ± 0.21

Notes.

aDescription of parameters: πk = fraction of stars in Gaussian; μα,k = R.A.-velocity centroid of Gaussian; μδ,k = decl.-velocity centroid of Gaussian; σa,k = semimajor axis of Gaussian; σb, k = semiminor axis of Gaussian; θk = angle between σa,k and the R.A. axis. bUniform distributions: U(min, max), where min and max are bounds of the distribution; Gaussian distributions: G(μ, σ), where μ is the mean and σ is the standard deviation.

Download table as:  ASCIITypeset image

Appendix B: Revised Extinction Law

The extinction law used in this analysis is a revised version of the one presented in H18, which is derived from HST observations of RC stars in the Arches field and proper-motion-selected main-sequence stars in Wd1. These samples represent highly reddened stellar populations located in the Galactic plane. The revision is necessary because an error was discovered in the application of the photometric zero points to the F160W and F814W photometry in the H18 analysis, resulting in magnitudes that were systematically too faint by 0.072 and 0.134 mag, respectively. No other filters were affected, and since the error was restricted to how the zero points were applied, the zero points presented in Figure 3 of H18 are still correct. The revised extinction law is derived using the same methodology and model (free parameters, priors, etc.) described in H18 but with the corrected F160W and F814W photometry.

A comparison between the revised extinction law and other laws in the literature is shown in Figure 18. The revised law is shallower (i.e., lower Aλ/AKs values) than the original H18 law and the Nogueras-Lara et al. (2018) law derived at the GC, but it remains steeper than the Nishiyama et al. (2009) law often used for stars in the inner bulge. Longward of 1.25 μm, the revised law is consistent with a power law (Aλ/AKsλβ) with β = 2.14, though there is some evidence that the law deviates from this function shortward of 1.25 μm (Figure 19). However, additional studies of the extinction law in this wavelength range are required to investigate further. The revised Aλ/AKs values and their corresponding errors are shown in Table 8.

Figure 18.

Figure 18. Revised extinction law used in this analysis (red points) compared to other laws in the literature. The 1σ uncertainty in the revised law is shown by the red shaded region. The other laws shown are from Cardelli et al. (1989; cyan triangles), Nishiyama et al. (2009; green squares), Damineli et al. (2016; magenta diamonds), Schlafly et al. (2016; yellow pentagons), Nogueras-Lara et al. (2018; blue triangles), and Hosek et al. (2018; black circles).

Standard image High-resolution image
Figure 19.

Figure 19. Residuals between the best-fit power law (exponent: β = 2.14) and the revised extinction law as a function of wavelength. The 1σ uncertainty in the law is shown by the red shaded region. The law is consistent with a power law for λ > 1.25 μm but possibly deviates from a power law for λ < 1.25 μm.

Standard image High-resolution image

Table 8.  Revised Extinction Law

Parameter λa (μm) Priorb Result
${{\rm{A}}}_{{\rm{F}}814{\rm{W}}}/{{\rm{A}}}_{{K}_{s}}$ 0.806 U(4, 14) 7.94 ± 0.21
${{\rm{A}}}_{y}/{{\rm{A}}}_{{K}_{s}}$ 0.962 U(4, 14) 5.72 ± 0.16
${{\rm{A}}}_{{\rm{F}}125{\rm{W}}}/{{\rm{A}}}_{{K}_{s}}$ 1.25 U(1, 6) 3.14 ± 0.07
${{\rm{A}}}_{{\rm{F}}160{\rm{W}}}/{{\rm{A}}}_{{K}_{s}}$ 1.53 U(1, 6) 2.04 ± 0.04
${{\rm{A}}}_{[3.6]}/{{\rm{A}}}_{{K}_{s}}$ 3.545 G(0.5, 0.05) 0.50 ± 0.04

Notes.

aHST + PanSTARRS filters: pivot wavelengths of filter; IRAC [3.6] filter: isophotal wavelength from Nishiyama et al. (2009). bUniform distributions: U(min, max), where min and max are bounds of the distribution; Gaussian distributions: G(μ, σ), where μ is the mean and σ is the standard deviation.

Download table as:  ASCIITypeset image

As discussed in Section 6.4, the IMF results are found to be insensitive to variations in the extinction law, so the choice of extinction law does not significantly impact the results in this paper.

Appendix C: Arches Cluster: Model Posteriors

In this appendix, we show the posterior probability distributions for the one- and two-segment IMF analyses. For the one-segment IMF fit, we show the joint posterior distribution for α1 and Mcl in Figure 20 and the 1D posteriors for each model parameter in Figure 21. The corresponding posteriors for the two-segment IMF fit posteriors are shown in Figures 22 and 23.

Figure 20.

Figure 20. 2D posterior probability distribution for −α1 and Mcl for the one-segment IMF analysis for the Arches cluster.

Standard image High-resolution image
Figure 21.

Figure 21. 1D posterior probability distributions for the one-segment IMF model for the Arches cluster.

Standard image High-resolution image
Figure 22.

Figure 22. 2D posterior probability distribution for −α1 and Mcl and −α2 and mbreak for the two-segment IMF analysis for the Arches cluster.

Standard image High-resolution image
Figure 23.

Figure 23. 1D posterior probability distributions for the two-segment IMF model for the Arches cluster.

Standard image High-resolution image

Appendix D: Testing the IMF Analysis with Synthetic Clusters

To verify the accuracy of the IMF analysis, we apply it to simulated observations of a synthetic cluster and compare the output best-fit parameters with the input ones. The synthetic cluster is created as described in Section 4, and observational completeness is applied as a function of position in the CMD and cluster radius. To simulate observational errors, the synthetic photometry for each star is perturbed by a random amount drawn from a normal distribution with a width equal to the median photometric error of the observed stars at the synthetic star's magnitude. These stars are assigned Ppm = 1. To simulate field stars, a number of stars are drawn from the observed field star population used to calculate $p{(k| {\rm{\Theta }})}_{f,\mathrm{obs}}$ in Equation (6) and assigned Ppm = 0. The number of field stars drawn is chosen such that the combined sample contains 80% cluster stars and 20% field stars. The spectroscopic sample is simulated by selecting six random stars with 14.5 mag ≤ F153M ≤ 15.0 mag and assigning them Teff uncertainties similar to those found in Section 3.5.

The combined synthetic catalog is run through the Bayesian analysis in Section 4.1 in the same way as the real observed catalog, with two exceptions: no differential dereddening correction is applied, since the cluster is already generated with a realistic value of ΔAKs, and no minimum Ppm value is enforced. The number of WR stars within rcl < 0.75 pc is calculated and input to the fitter, mimicking the information gained from the real spectroscopic surveys of the Arches. The priors are the same as the real analysis, as described in Table 3.

The results of the tests are shown in Table 9 and found the output values to match the input values to within 1σ. The joint posterior probability distributions for α1 and Mcl in the one-segment IMF fit are shown in Figure 24, while the joint posterior probability distributions for α1 and Mcl and α2 and mbreak in the two-segment IMF fit are shown in Figure 25.

Figure 24.

Figure 24. 2D posterior probability distribution for −α1 and Mcl for the one-segment IMF simulated cluster analysis. The input values are represented by the red dotted lines.

Standard image High-resolution image
Figure 25.

Figure 25. Joint posterior probability distribution for −α1 and Mcl (left) and −α2 and mbreak (right) for the two-segment IMF simulated cluster analysis. The input values are represented by the red dotted lines.

Standard image High-resolution image

Table 9.  Simulated Cluster Analysesa

  One-segment IMF Two-segment IMF
Parameter Input Value Recovered Value Input Value Recovered Value
α1 1.7 1.7 ± 0.06 2.1 1.99 ± 0.13
α2 0.7 0.74 ± 0.27
mbreak 5.0 4.43 ± 0.91
Mcl 20,000 21,400 ± 1900 20,000 20,400 ± 2300
log t 6.40 6.41 ± 0.03 6.40 6.39 ± 0.01
d 8000 7865 ± 146 8000 8101 ± 139
AKs 2.07 2.07 ± 0.01 2.07 2.06 ± 0.01
ΔAKs 0.15 0.13 ± 0.01 0.15 0.14 ± 0.01

Note.

aParameter priors and units are the same as in Table 3.

Download table as:  ASCIITypeset image

Footnotes

Please wait… references are loading.
10.3847/1538-4357/aaef90