[go: up one dir, main page]

A publishing partnership

The following article is Open access

The Poor Old Heart of the Milky Way

, , , , , , , , , , , and

Published 2022 December 12 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Hans-Walter Rix et al 2022 ApJ 941 45 DOI 10.3847/1538-4357/ac9e01

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/941/1/45

Abstract

Our Milky Way should host an ancient, metal-poor, and centrally concentrated stellar population, which reflects the star formation and enrichment in the few most massive progenitors that coalesced at high redshift to form the proto-Galaxy. While metal-poor stars are known to reside in the inner few kiloparsecs of our Galaxy, current data do not yet provide a comprehensive picture of such a metal-poor "heart" of the Milky Way. We use information from Gaia Data Release 3, especially the XP spectra, to construct a sample of 2 million bright (GBP < 15.5 mag) giant stars within 30° of the Galactic center (GC) with robust [M/H] estimates, δ[M/H] ≲ 0.1. For ∼1.25 million stars we calculate orbits from Gaia Radial Velocity Spectrometer velocities and astrometry. This sample reveals an extensive, ancient, and metal-poor population that includes ∼18,000 stars with −2.7 < [M/H] < −1.5, representing a stellar mass of ≳5 × 107 M. The spatial distribution of these [M/H] < −1.5 stars has a Gaussian extent of only ${\sigma }_{{{R}}_{{\rm{GC}}}}\sim 2.7\,\mathrm{kpc}$ around the GC, with most orbits confined to the inner Galaxy. At high orbital eccentricities, there is clear evidence for accreted halo stars in their pericentral orbit phase. Most stars show [α/Fe] enhancement and [Al/Fe]–[Mn/Fe] abundances expected for an origin in the more massive portions of the proto-Galaxy. Stars with [M/H] < −2 show no net rotation, whereas those with [M/H] ∼ −1 are rotation dominated. These central, metal-poor stars most likely predate the oldest disk population (τage ≈ 12.5 Gyr), which implies that they formed at z ≳ 5, forging the proto-Milky Way.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Understanding the formation history of our own Galaxy, especially its earliest phases, has been a central goal of Galactic archeology for decades (Freeman & Bland-Hawthorn 2002), with dramatic progress enabled by a suite of spectroscopic surveys (e.g., Yanny et al. 2009; De Silva et al. 2015; Majewski et al. 2017; Conroy et al. 2019) and ESA's Gaia mission (Perryman et al. 2001; Gaia Collaboration et al. 2022). The earliest phases of the Milky Way's star formation and enrichment history are reflected at the present epoch in the orbit and abundance distributions of old and metal-poor stars. In the context of the hierarchical formation of massive disk galaxies like the Milky Way, we should expect the oldest and most metal-poor stars (say, [M/H] < −1.5) to be a mix of stars that (a) formed within one of the main overdensities which coalesced early to form the proto-Galaxy, or (b) formed early in distinct satellite galaxies that eventually merged with the main body (Zolotov et al. 2009). The first channel is commonly referred to as in situ formation, the second accreted (Tumlinson 2010; Pillepich et al. 2015; El-Badry et al. 2018; Renaud et al. 2021b). If the in situ stars formed in a considerably more massive potential well than the accreted stars, this difference in origin should also be reflected in their stars' abundance patterns, such as [α/Fe] versus [Fe/H] or [Mg/Mn] versus [Al/Fe] (Zolotov et al. 2010; Hawkins et al. 2015; Das et al. 2020; Horta et al. 2021; Belokurov & Kravtsov 2022; Conroy et al. 2022; and others).

However, at very early epochs, this distinction between in situ and accreted may become rather blurry. High-resolution formation simulations of galaxies like the Milky Way show (e.g., Renaud et al. 2021a, 2021b) that pieces of comparable mass may rapidly coalesce early on (say, z > 4) in a sequence of "major mergers." One terminological choice is to label only the one piece that was somewhat more massive as the in situ component, and to label all other pieces as accreted (which put together may constitute the majority of mass). However, simulations show that it may be operationally impossible later on to differentiate what was in situ and what accreted at very early epochs, whether one uses orbit or abundance information (e.g., Renaud et al. 2021b; Brauer et al. 2022; Orkney et al. 2022). Alternatively, one could label collectively all the major pieces that coalesced very early on, say, at z ≳ 5, as the proto-Galaxy (see Conroy et al. 2022), 10 and then differentiate subsequent additions at slightly later epochs (say, z < 3) as either in situ, if the material was brought in as gas, or as accreted, if the stars formed in a distinct potential well that then subsequently merged. Here, we opt for the second terminology, in part in light of the results we find. Consequently, we refer to the oldest parts of the Milky Way as the proto-Galaxy, without trying to single out one of the contributing pieces as in situ.

The nature of the accreted component of our Galaxy at RGC ≳ 5 kpc has come into focus due to the combination of Gaia and large ground-based spectroscopic surveys. Tidal debris from the disrupted Sagittarius satellite (Ibata et al. 1994) dominates the halo from 2050 kpc with stars mostly on highly inclined orbits with substantial angular momentum (Majewski et al. 2003; Naidu et al. 2020); at 5–25 kpc the debris from the disrupted Gaia-Sausage-Enceladus (GSE) satellite (Belokurov et al. 2018b; Helmi et al. 2018; Naidu et al. 2020) dominates, with stars on highly eccentric orbits (e ≳ 0.7) with little (slightly retrograde) angular momentum. Beyond these dominant components, a growing number of additional, distinct accreted components of the Galaxy have been identified (e.g., Newberg et al. 2009; Kruijssen et al. 2019; Myeong et al. 2019; Naidu et al. 2020; Yuan et al. 2020; Malhan et al. 2022). Beyond their orbits, the abundance patterns of these stars also point toward an accreted origin: their distribution in the [α/M]–[M/H] plane (lower [α/M] at a given [M/H]) indicates that their birth material was enriched in a potential well of lower mass, i.e., a satellite galaxy (e.g., Frebel & Norris 2015; Hawkins et al. 2015; Lee et al. 2015). In addition to presumed satellite galaxy debris, the stellar halo also appears to encompass a large number of disrupted star clusters (e.g., Malhan et al. 2018; Shipp et al. 2018; Bonaca et al. 2021).

There has also been progress in understanding the old proto-Galactic or in situ component of the Milky Way. The old and metal-poor Galactic in situ component that has been mapped best is the old, α-enhanced disk (e.g., Hayden et al. 2015; Belokurov et al. 2020; Bonaca et al. 2020; Xiang & Rix 2022): its stars seem to date back to ≥12.5 Gyr (or z ≳ 5), are centrally concentrated, and form a thick disk. However, the old α-enhanced disk metallicity distribution function (MDF) is truncated, or at least drops sharply, below [M/H] = −1. It seems inevitable that there must have been a substantive proto-Galactic stellar population responsible for enriching the old disk's birth material to [M/H] = −1. Recent surveys have provided clear evidence for a so-called in situ halo component in the Milky Way (e.g., Bonaca et al. 2017; Haywood et al. 2018; Di Matteo et al. 2019; Belokurov et al. 2020; Bonaca et al. 2020; Naidu et al. 2020; Horta et al. 2021). Many of these studies drew on samples at the solar radius and beyond, and those in situ halo stars may reflect very early disk stars that were kicked up or splashed to hotter orbits.

While the chemodynamical structure of the Galaxy at RGC ≳ 5 kpc has come into focus over the past decade, the nature of the inner Galaxy—especially at low metallicity—has proven more elusive. Fortunately, recent work has begun to piece together a picture of the metal-poor inner Galaxy with limited sample sizes (e.g., García Pérez et al. 2013; Ness et al. 2013; Schlaufman & Casey 2014; Casey & Schlaufman 2015; Ness et al. 2015; Koch et al. 2016; García Pérez et al. 2018; Arentsen et al. 2020a; Reggiani et al. 2020; Lucey et al. 2021). In particular, narrowband photometric surveys have spectacularly enabled a more efficient selection of metal-poor stars toward the inner Galaxy (e.g., Arentsen et al. 2020a, 2020b), providing metal-poor samples in the inner Galaxy of several thousand objects. Spectroscopic follow-up has shown that these stars are indeed metal-poor and show very little rotation at the lowest metallicities (Arentsen et al. 2020b). Kruijssen et al. (2019, 2020) and Forbes (2020) recently used the properties of inner Galaxy globular clusters as tracers of a very old Milky Way component, which they dubbed Kraken and Koala. They considered these possible instances of early accretion.

Most recently, Belokurov & Kravtsov (2022) and Conroy et al. (2022) have been able to shed first light on the question of how the old disk emerged from an earlier, more metal-poor population of stars on dynamically hot orbits. Belokurov & Kravtsov (2022) combined Apache Point Observatory Galactic Evolution Experiment (APOGEE) and Gaia astrometry to study the kinematics of stars in the inner Galaxy down to metallicities of [M/H] ∼ −1.5. They found that metal-poor stars in the inner Galaxy with abundance patterns they identified with in situformation (dubbed "Aurora") show rather little net rotation, and reasoned that these stars preceded the Milky Way's ancient disk (see also Arentsen et al. 2020a). Conroy et al. (2022) used chemistry, kinematics, and ages from the H3 Survey and Gaia data to identify likely proto-Galactic stars down to [M/H] ∼ −2.5 with ages ≳13 Gyr. This recent work clearly implies that our Milky Way has some form of a proto-Galactic population. Whether or not there exists an operationally separable accreted population buried deep in the potential well of our Galaxy remains an open question (see Kruijssen et al. 2020; Horta et al. 2021; Myeong et al. 2022).

It is on this background that we aim to flesh out a more comprehensive picture of the abundance–orbit distribution of metal-poor stars in the inner Galaxy, RGC ≲ 5 kpc. Here, sensible definitions of metal-poor could be (a) more metal-poor than the oldest disk stars, i.e., [M/H] < −1, or (b) more metal-poor than large published samples of stars with abundances and orbits in the inner Galaxy, i.e., [M/H] < −1.5.

Specifically, we carry out a comprehensive search for metal-poor stars toward the Galactic center (GC), drawing on the immense wealth of new information that Gaia Data Release 3 (DR3) now affords. Our work relies mainly on newly published low-resolution BP/RP spectra (hereafter collectively referred to as "XP" spectra; Carrasco et al. 2021; De Angeli et al. 2022) and photometry synthesized from these spectra (Montegriffo et al. 2022). In principle, various metallicity estimates exist for millions of stars in DR3. Yet, the XP-based estimates from Andrae et al. (2022b) suffer from strong biases, 11 while the Radial Velocity Spectrometer (RVS)-based metallicity estimates from Recio-Blanco et al. (2022) are currently limited to bright stars with high-quality Gaia RVS spectra.

For our analysis, we select giant stars within 30° of the GC, derive data-driven [M/H] estimates from their XP spectra, and combine them with RVS velocities to obtain orbits. This provides a sample of 1.5 million stars in the inner Galaxy, for a chemodynamical study.

The rest of the paper is organized as follows. In Section 2 we describe the initial selection of the sample and the subsequent data-driven derivation of [M/H] estimates, using their XP spectra and AllWISE photometry (Cutri et al. 2021). In Section 3 we present the spatial, [M/H], and orbit distribution of these stars, along with the abundance patterns for a small subset. In Section 4 we summarize these results, put them in the context of Galactic archeology, and sketch a few avenues for follow-up work.

2. Data Sets and Derived Abundance and Orbit Quantities

We aim to devise a well-defined and large sample of stars in the "inner Galaxy" that consists of stellar tracers that cover the full age and [M/H] range with comparable selection effects. For that we need tracers that are luminous enough to reach the distance of the GC and beyond, even in the presence of some dust extinction. For chemodynamical mapping, we need a robust and precise estimate of [M/H] and an estimate of the 6D phase-space coordinates (x, v) to estimate the orbits. Ideally, we would also like to have precise age estimates and individual abundances, [X/H], or at least an estimate of the α enhancement, [α/M]. Using age estimates and individual abundances for these giant stars is beyond the scope of this work, as is a full accounting of the sample selection function.

To build such a sample, we first identify all likely red giant stars (red giant branch and red clump, RC) in the direction of the GC that (a) are bright enough that we can expect a robust and precise estimate of [M/H] from Gaia XP data (Montegriffo et al. 2022; De Angeli et al. 2022), and (b) have radial velocities from Gaia RVS (Katz et al. 2022) to estimate the stars' orbits in conjunction with their parallaxes, ϖ, and proper motions, $\vec{\mu }$. Then we rederive [M/H] estimates for these stars from their XP spectra with a data-driven approach that draws on Data Release 17 (DR17) of the Sloan Digital Sky Survey's (SDSS) APOGEE survey (Abdurro'uf et al. 2022) to address the documented shortcomings of the GSP-Phot abundance estimates in Gaia DR3 (Andrae et al. 2022b). Finally, we calculate the orbits of the sample members from their (x, v), which requires a model for the Galactic potential, Φ(x) (here from Price-Whelan 2017).

2.1. Initial Gaia Query

The initial inner Galaxy sample was devised via the following ADQL query:

  • selectfrom gaiadr3.gaia_source
  • where
  • parallax < 100.power(10.,0.2(0.9 - (phot_g_mean_mag---1.5(bp_rp-1.))))
  • and parallax < 1.
  • and abs(b)<30 and (l<30 or l > 330.)
  • and bp_rp between 1.0 and 3.5
  • and phot_bp_mean_mag < 15.5
  • and has_xp_continuous='true'

The first condition on the parallax selects stars whose absolute magnitude is more luminous than MG = 0.9, aimed at selecting giant stars at luminosities that just include the RC; the -1.5*(bp_rp-1.) term reflects an approximately dereddened magnitude. The query is articulated in such a way that zero or negative parallaxes are accommodated. The second parallax condition only excludes the immediate "foreground" at D < 1 kpc. The conditions on l and b select a rectangular region of 30° around the GC. The lower limit on bp_rp eliminates most luminous hot stars for which [M/H] estimates are not feasible (but not all in the presence of reddening). The condition on phot_bp_mean_mag is designed to select objects bright enough in the blue part (BP) of the XP spectra for robust [M/H] estimates; it still allows one to select RC stars at the distance of the GC in the presence of moderate reddening. The final line in the selection ensures that continuous XP spectra are available. This query yields 2.1 million objects.

2.2. Deriving Robust [M/H] Estimates

While about 75% of this sample has RVS velocities in Gaia DR3, only a small fraction of them have metallicities or abundances derived from the RVS spectra (Recio-Blanco et al. 2022). This leaves the route of estimating [M/H] from the low-resolution XP spectra of these objects. Consequently, our objective is to train a machine-learning algorithm that can precisely, accurately, and robustly predict [M/H] from Gaia DR3 data.

Estimating [M/H] based on machine learning involves four aspects: (1) it involves the choice of the training sample, for which we adopt SDSS DR17 (APOGEE; Abdurro'uf et al. 2022) since its abundances are well validated, it covers the inner Galaxy, and contains mainly giants, many with high extinctions; (2) it involves the choice of the data features on which to train the prediction; these can be derived from the XP spectra but may also entail, as we discuss below, external data available across the sample, such as near-infrared photometry from AllWISE (Cutri et al. 2021); (3) it involves the choice of the machine-learning algorithm, for which we adopt an extreme gradient boosting algorithm (Chen & Guestrin 2016, hereafter XGBoost); XGBoost is one of the best algorithms currently available, being straightforward and computationally inexpensive to train, and it can outperform other methods such as deep learning (e.g., Grinsztajn et al. 2022); and (4) finally we need to validate our [M/H] prediction, which then speaks to the quality of the data, the training sample, and the algorithm.

The choice of the correct data features to train and predict [M/H] from the XP spectra is not trivial. One obvious option is to directly use the 2 × 55 modified Gauss–Hermite coefficients that describe XP spectra. But our numerical experimentation implies that other or additional data features may yield more precise and robust results with a machine-learning approach, 12 for several pragmatic reasons. Much of the information about narrow metal line features is contained in high-order coefficients, which are often noisy. We have prior information on which parts of the spectrum are highly diagnostic in [M/H] estimates and which ones are not; we need not ask an algorithm to learn this. Any estimate of [M/H] requires an implicit estimate of the effective temperature, Teff, which is covariant with [M/H]; and Teff is also highly covariant with reddening. Therefore, providing additional information that helps break that degeneracy is precious. The near-infrared photometry from AllWISE (Cutri et al. 2021) exists across the sky and proves powerful in this respect.

In practice, we follow the approach of Montegriffo et al. (2022) to calculate synthetic photometry from the XP spectra in a wide range of mostly narrowband filters, in particular those filters with established utility in identifying metal-poor stars: Strömgren filters (Strömgren 1966), JPAS+ filters (Marín-Franch et al. 2012), and Pristine H&K filters (Starkenburg et al.2017). On this basis, we use XGBoost to train the prediction of [M/H] using the entire SDSS DR17 APOGEE of giants with GBP < 15.5 (∼230,000 stars). Details are given in the Appendix.

Training on a particular data set and using a model that is discriminative, rather than generative, raises several issues. First, we (obviously) tie our results to the metallicity scale of SDSS DR17. Second, estimates of [M/H] may be drawn into the support of the training set. In particular, XGBoost will not extrapolate [M/H] significantly beyond the APOGEE DR17 range, as it is a tree-based method that segments the feature space and assigns a mean label to each such segment. Finally, generative data-driven spectral models, such as the Cannon (Ness et al. 2015), will presumably degrade more gracefully toward low signal-to-noise (S/N). Here we have addressed the last point implicitly by restricting the sample to GBP < 15.5.

This results in [M/H] estimates for about 2 million giants toward the inner Galaxy, 1.58 million of which have RVS velocities, and 1.25 million have both RVS velocities and ϖ/σϖ ≥ 5, which is our minimal condition for an orbit estimate. For subsequent analysis, we have eliminated 2% of the sample as reddened hot stars, which can be readily recognized by their position in the m1bp_rp color plane, where m1v − 2b + y is the metallicity-sensitive Strömgren filter combination (see Strömgren 1966, for details). Specifically, we eliminate sources with m1 + 0.16 (bp_rp − 1) < 0 and bp_rp < 2.7.

2.3. [M/H] Validation

We can explore and validate the precision and accuracy of these [M/H] estimates through various comparisons with analogous estimates from (higher-resolution) spectroscopic surveys. For this validation, we consider SDSS APOGEE (Abdurro'uf et al. 2022), the Large Sky Area Multi-Object Fibre Spectroscopic Telescope (LAMOST; here Xiang et al. 2019), Data Release 3 (DR3) of GALactic Archaeology with HERMES (GALAH; Buder et al. 2021), and Gaia's GSP-Spec (Recio-Blanco et al. 2022).

We start by considering the ∼17,000 stars in common between APOGEE and our sample, shown in Figure 1 (top panel). This panel of Figure 1 illustrates that, for this sample, XGBoost provides a remarkably precise (≲ 0.1 in the median), accurate, and robust [M/H] estimate across −2.5 < [M/H] < 0.5. It should be noted that XGBoost was trained on the full all-sky APOGEE sample, so this is a validation on a small subset of the full training set, not a pure cross-validation. The accuracy of the [M/H] prediction (i.e., lack of systematic offset) is therefore by construction. It is particularly remarkable that basically all stars predicted to be metal-poor from [M/H]XP are indeed metal-poor according to [M/H]APOGEE: a selection of, for example, metal-poor objects by [M/H]XP appears to be nearly pure. The bottom panel of Figure 1 shows that these [M/H] estimates remain unbiased, at a level of a few percent, even in the presence of substantial dust extinction, AK = 0.3, corresponding to about three magnitudes of AV .

Figure 1.

Figure 1. Top: Validation of our [M/H] estimates from Gaia XP data as function of actual [M/H] from APOGEE. Bottom: (In)sensitivity of [M/H] estimates to extinction, AK . The Y-axis shows the difference between [M/H] estimates based on XP spectra and APOGEE as a function of APOGEE's AK . There is no evidence for any systematic trend of Δ[M/H] with AK to an extinction level that corresponds to AV ≈ 3.

Standard image High-resolution image

Appendix A.2 details a true cross-validation of our [M/H]XP estimates against three external data sets: LAMOST, GALAH, and GSP-Spec. Broadly speaking, the cross-validation affirms that the [M/H]XP estimates are precise and robust (enabling pure [M/H]-based sample selection). Our estimates have a rms difference of ≈0.1 dex compared to these data sets, with minimal bias. Most importantly, the performance remains unbiased toward the metal-poor end, affirming our confidence in selecting metal-poor stars based on XP metallicities. We refer to Appendix A.2 for more detailed validation of the [M/H] estimates.

2.4. Orbits of the Sample Members

For sample members with suitable 6D phase-space information, we calculate their orbits. The sky positions and proper motions are available for all of them, radial velocities and good parallax measurements only for a good fraction of the sample. In practice, we take all stars that have RVS velocities with δ vRVS < 5 km s−1 and ϖ/σϖ > 5, about 1.25 million objects across all [M/H].

To compute orbits and compute orbital properties, we use a four-component Milky Way mass model consisting of spherical Hernquist nucleus and bulge components, an (approximate) exponential disk component (Smith et al. 2015), and a spherical Navarro–Frenk–White halo component. We adopt a radial scale length hR = 2.6 kpc and scale height hz = 300 pc (Bland-Hawthorn & Gerhard 2016), and fit for the masses and scale radii of the nucleus, bulge, and halo components using the same compilation of Milky Way enclosed mass measurements as used to define the MilkyWayPotential in gala (Price-Whelan 2017; Price-Whelan et al. 2020), with the additional constraint of having a circular velocity at the solar position vc (R0) = 229 km s−1 (Eilers et al. 2019). We compute actions using the "Stäckel Fudge" (Binney 2012; Sanders 2012) as implemented in galpy (Bovy 2015). To compute the orbital eccentricity, pericenter, and apocenter values, we numerically integrate the orbits with a time step of 0.5 Myr across four times the radial orbit period (estimated by the computed orbital frequencies from the action solver).

The [M/H] estimates for 1.5 million stars toward the GC, along with orbits for the RVS subsample of 1.25 million stars, are available as supplementary material to this article and are hosted online 13 (Andrae et al. 2022a).

3. Results

We now present the properties of the low-[M/H] stellar population contained within this sample. We start with the spatial distribution, then move to the [M/H] distribution, the orbit distribution, and finally to a first exploration of the abundance patterns of the metal-poor stars, especially the level of α enhancement. This is all done with an eye toward what we can learn about a central metal-poor in situ halo or bulge population.

3.1. Spatial Distribution of Metal-poor Stars

Figure 2 compares the on-sky density distribution of metal-rich stars with [M/H] > −0.4 (left panel) to metal-poor ones with [M/H] < −1.5 (middle panel). The metal-poor stars exhibit a striking central concentration relative to the metal-rich sample, suggesting a spheroidal population in the inner Galaxy. The projected sky density of metal-poor sample members in Figure 2 (middle panel) is dramatically altered by the high dust extinction toward the GC (right panel). The fine-scale structure in the projected [M/H] < −1.5 stellar distribution can mostly be explained by dust extinction, as the right-hand panel of Figure 2 shows. This is expected as we require GBP < 15.5 for sample selection.

Figure 2.

Figure 2. On-sky logarithmic density distribution of stars in our sample for metal-rich (left panel) and metal-poor (middle panel) selections. The distribution of metal-poor stars appears very centrally concentrated, but the morphology is dramatically modulated by the foreground dust extinction, as illustrated in the right panel. The right panel demonstrates that the dearth of stars is highly correlated with the distribution of dust.

Standard image High-resolution image

We do not have precise parallaxes for the entire sample, which complicates the analysis of the spatial structure. For the overall sample, the median parallax precision is ϖ/σϖ ∼ 12, with 10% of the sample having ϖ/σϖ ≲ 4, and 1% of the sample with negative parallaxes. For the metal-poor sample members with [M/H] < −1.5, which turn out to be more distant than the sample mean; the median precision is only ϖ/σϖ ∼ 5. Therefore, the most robust way to present the distance distribution of objects in a directional cone may be to consider their parallaxes. Given that the objects of particular interest are at ∼8 kpc distance, it matters that we apply a Gaia parallax zero-point correction, for which we adopt 0.02 mas (Lindegren et al. 2021).

Figure 3 shows ${n}_{* }\left(\varpi \,| \,[{\rm{M}}/{\rm{H}}]\right)$ in five [M/H] bins, with the minimal distance (maximal parallax) of 1 kpc at the right edge and the parallax expected for the GC indicated by the dotted line (1/8.2 kpc, or 0.12 mas; Gravity Collaboration et al. 2021). This figure shows that the distance distribution of giants toward the GC depends dramatically on [M/H]. Metal-rich stars ([M/H] > −0.4) are distributed throughout the disk with heliocentric distances D = 1−5 kpc (or 0.2–1.0 mas). In contrast, metal-poor populations become increasingly more concentrated toward the GC. Indeed, the most metal-poor stars are predominantly clustered around the GC's parallax, revealing that there is a centrally concentrated very metal-poor population in the Milky Way.

Figure 3.

Figure 3. Parallax distribution of the different subsamples, selected by [M/H], taking ϖ as a proxy for distance. It is apparent that increasingly more metal-poor subsamples are increasingly more tightly centered around the expected parallax of the Galactic center (dotted black line; Gravity Collaboration et al. 2021). The most metal-poor stars are entirely confined to the inner ≈2.7 kpc of the Galaxy (see Section 3.1).

Standard image High-resolution image

Figures 2 and 3 qualitatively confirm the picture of a metal-poor population that is very concentrated toward the GC. The fact that there are almost no [M/H] < −1.5 stars at very low latitudes shows that almost all lie behind the strong dust extinction 2–3 kpc inward of the Sun. Rigorous modeling of these stars' spatial distribution of these stars density distribution (e.g., Rix et al. 2021) is beyond the scope of this paper, with the severe dust extinction and crowding effects presenting formidable challenges. However, we can model the parallax distribution (Figure 3) in the central regions less affected by dust to quantify the radial extent of this metal-poor population.

We assume that the 3D distribution is a spherical Gaussian as a function of Galactocentric radius R ${\rho }_{\mathrm{Gauss}}(R,{\sigma }_{{R}_{\mathrm{GC}}})$, with a width of ${\sigma }_{{R}_{\mathrm{GC}}}$. As long as we can see to the far side of the GC (see Figure 3), we should expect the parallax (ϖ) distribution in a cone of dΩ in the direction (l,b) to be

Equation (1)

Phrasing this model in terms of the ϖ allows the model to deal with vanishing (or even negative) observed parallaxes. We find that the maximum likelihood of the observed parallaxes (and their uncertainties) for the [M/H] < −1.5 stars implies an extent of ${\sigma }_{{R}_{\mathrm{GC}}}\sim 2.7\,\mathrm{kpc}$. Interestingly, this radius corresponds to an angle of 18° at the distance of the GC, which seems very plausible in light of the projected density distribution (Figure 2; middle panel).

3.2. The [M/H] Distribution in the Inner Galaxy

We now turn our attention to the [M/H] distribution, n*([M/H]), in the inner Galaxy, which is illustrated in Figure 4. The first remarkable feature of the distribution is the sheer number of objects, most apparent from the cumulative distribution in Figure 4: there are >4000 stars with an estimated [M/H] < −2, ∼18,000 with [M/H] < −1.5, and almost 100,000 stars with [M/H] < −1, i.e., below the abundance floor of the old, α-enhanced disk. This is an order of magnitude more objects than previously published metal-poor samples ([M/H] < −1) of the inner Galaxy (e.g., Arentsen et al. 2020a, 2020b).

Figure 4.

Figure 4. Top: Cumulative distribution of metallicities [M/H] for the entire sample of ∼1.7 million objects, estimated from XP spectra and broadband photometry via XGBoost prediction trained on the APOGEE DR17: there are ∼5000 stars with [M/H] < −2, ∼20,000 stars with [M/H] < −1.5, and nearly 100,000 stars with [M/H] < −1. Bottom: Distribution of [M/H] for our sample, split into sources with presumed Galactocentric radii RGC < 4 kpc (in green), which are the focus of our analysis, and those at larger RGC. The (green) distribution shows a steep rise to [M/H] ∼ −2, then a power law of slope $d(\mathrm{log}{n}_{* })/d[{\rm{M}}/{\rm{H}}]\approx 1$ to [M/H] ∼ −1, and finally a yet again steeper rise to [M/H] ∼ −0.6. A possible interpretation of these slopes in the context of a simple chemical evolution model is given in Section 3.2.

Standard image High-resolution image

We have already presented two arguments that the low-[M/H] portion of n*([M/H]) is not severely contaminated by spurious [M/H] estimates from some of the far more numerous stars of higher (true) metallicity: the validation with APOGEE and the parallax, or distance, distribution, which is most peaked for the most metal-poor objects. The [M/H] distribution itself provides additional evidence. It rises very steeply from [M/H] = −2.5 to −2.2. The distribution then follows a power law of slope $d(\mathrm{log}{n}_{* })/d[{\rm{M}}/{\rm{H}}]\approx 1$ over a wide range of metallicities to [M/H] ≈ −0.9, where it steepens toward a peak at [M/H] = −0.6, beyond which n*([M/H]) starts dropping toward the highest metallicities present, [M/H] ≈ 0.5.

The bottom panel of Figure 4 shows the metallicity distribution with stars of an estimated Galactocentric distance ${\hat{R}}_{\mathrm{GC}}\lt 4\,\mathrm{kpc}$ in green, and the stars of the intervening disk in gray. As giants, in particular red clump stars, are of comparable luminosity and color over a wide range of metallicities, the differential selection effects across different [M/H] should be modest. Hence, the [M/H] distribution of stars RGC < 4 kpc in Figure 4 should be relatively unbiased. The [M/H] distribution of course varies with position in the Galaxy (both ∣z∣ and RGC), as Figure 4 illustrates.

The decrease in n*([M/H]) for stars at the metal-rich end ([M/H] > 0) with RGC < 4 kpc (green) may seem surprising, because we know from APOGEE (e.g., Eilers et al. 2022) that the inner Galaxy is teeming with high-metallicity stars. This decrease in n*([M/H]) is straightforwardly explained by Figure 2, which shows that in the inner Galaxy we are almost completely lacking sample members with ∣Z∣ < 700 pc; any thinner population will inevitably be absent from our sample due to dust extinction. The APOGEE sample has a different MDF in part because its Two Micron All Sky Survey–selected stars better sample the low-∣Z∣, high-[M/H] population.

3.2.1. The Mass of the Central Proto-galactic Stellar Population

We also can estimate roughly the implied mass of the proto-Galactic population we have uncovered here. Taking the nearest globular cluster, M4, as a template for an old, [M/H] < −1 population, we find that there are about 35 giants with MG < 0.5 in the Gaia DR3 catalog beyond M4's half-mass–radius of ${6}^{{\prime} }$ (Richer et al. 2004, where the Gaia catalog should be approximately complete). Given M4's total mass of 7 × 104 M (Marks & Kroupa 2010), this implies one giant (at MG < 0.5) per 800 M of total stellar population mass. At face value, this implies ${M}_{\mathrm{tot}}\left([{\rm{M}}/{\rm{H}}]\lt -1.5\right)\sim 2\,\times \,{10}^{7}$ M. But this does not account for the incompleteness of XP spectra in very crowded fields, nor for the far-reaching effects of dust extinction. A conservative estimate of these effects is a factor of 2.5, leading us to a lower limit on the mass of ${M}_{\mathrm{tot}}\left([{\rm{M}}/{\rm{H}}]\lt -1.5\right)\gtrsim 5\times {10}^{7}$ M.

3.3. A Simple Chemical Model for the Proto-galaxy

We now proceed to see what the observed metallicity distribution might tell us about the early central enrichment history.

The bulk of the metal-poor population, below the minimal [M/H] of the old disk at [M/H] ∼ −1, follows $d(\mathrm{log}{n}_{* })/d[{\rm{M}}/{\rm{H}}]\approx 1$: the cumulative number of stars N*(<Z) below a (linear) metallicity Z grows linearly with Z. This seems to be a natural slope for n*([M/H]) in the early phases of a self-enriching, in situ system, as shown by the following argument, which builds on the models of Weinberg et al. (2017). Under fairly general conditions, the evolution of the interstellar medium's (ISM's) metallicity in a simple, fully mixed system can be described by

Equation (2)

where MZ is the mass in metals, yZ is the initial-mass-function-averaged yield, ${\dot{M}}_{* }$ is the star formation rate, ZMZ /Mg is the metal mass fraction of the star-forming gas, $\eta \equiv {\dot{M}}_{\mathrm{out}}/{\dot{M}}_{* }$ is the mass loading factor of a gaseous outflow, and r ≈ 0.4 represents the recycling of metals that are formed into stars but quickly returned when the stars die. Equation (2) treats enrichment as instantaneous, which should be a good approximation at low metallicities where core-collapse supernovae and massive star winds are the dominant sources. It also assumes that outflows have the same metallicity as the ambient ISM; if outflows instead consist of a fraction, f, of the supernova ejecta plus entrained ISM, then the effect is to reduce yZ by a factor (1 − f) so that it reflects only metals retained by the ISM. Importantly for the case at hand, if Z is well below the equilibrium metallicity ZeqyZ /(1 + ηr) then the entire second term of Equation (2) can be neglected.

In this low-metallicity limit, Equation (2) is simply ${\dot{M}}_{Z}={y}_{Z}{\dot{M}}_{* }$, and for a metallicity-independent yield, its time integral implies MZ = yZ M* and thus Z = yZ M*/Mg . The log-slope of the MDF is

Equation (3)

where the last equality introduces the star formation efficiency (SFE) timescale ${\tau }_{* }\equiv {M}_{g}/{\dot{M}}_{* }$. Using

Equation (4)

gives the end result:

Equation (5)

The combination $({\tau }_{* }{\dot{M}}_{g}/{M}_{g})$ corresponds to the fractional change of the gas reservoir mass over one SFE timescale.

Equation (5) shows that an MDF log-slope ≈ 1 is a generic result in the low-metallicity regime (ZZeq), arising whenever the gas mass is constant (${\dot{M}}_{g}=0$) or more generally when the star-to-gas ratio, M*/Mg , is small enough that the second term in Equation (4) can be neglected. Figure 4 exhibits this generic slope over the range −2 < [M/H] < −1. The steeper slope at [M/H] < −2 could plausibly arise because the gas reservoir is small but rapidly growing, with ${\tau }_{* }{\dot{M}}_{g}/{M}_{g}\gt 1$. The steeper slope at −1 < [M/H] < −0.6 may reflect rapid gas accretion (high ${\dot{M}}_{g}/{M}_{g}$) coinciding with the onset of the old α-enhanced disk (e.g., Belokurov & Kravtsov 2022; Conroy et al.2022; Xiang & Rix 2022). At still higher metallicities our neglect of sink terms in Equation (2) becomes a poor approximation.

3.4. The Orbit Distribution of Low-[M/H] Stars in the Inner Galaxy

We now turn to the orbit distribution of the sample at hand (see Section 2.4), which can tell us which of the metal-poor stars selected in the inner Galaxy remain confined to the central regions of the Milky Way. The orbit distribution can tell us which stars are just passers-through near their pericenter but on orbits that take them into the outer halo. In particular, we might expect to see the pericenter members of accreted halo components like GSE, with stars on highly eccentric orbits that take them to Rapo > 10 kpc. We can also delineate how early—or how metal-poor, if we use abundances to estimate relative stellar ages— the population develops net rotation and how rapidly the kinematics change with [M/H] toward near-circular motion in the plane of the Galaxy (see Arentsen et al. 2020a; Belokurov & Kravtsov 2022; Conroy et al. 2022).

We start by considering the orbit distribution, n*(e, Rapo), in terms of the orbital eccentricity, e, and the apocenter, Rapo. We initially focus on the [M/H] range −1.9 < [M/H] < −1.2 since this encompasses the bulk of known GSE stars. This distribution of 28,000 stars is shown in Figure 5. The density in the (e, Rapo) plane is affected and limited by two experimental aspects. First, our initial Gaia query selects almost exclusively stars with RGC < 8 kpc, and hence all stars whose pericenter distance exceeds 8 kpc are excluded; this boundary is indicated by the thick gray line. Except at e < 0.2 the n*(e, Rapo) distribution falls off toward large Rapo well before this limit. There may be two reasons for the seeming dearth of stars with Rapo < 2 kpc near the center. First, severe dust extinction simply obscures much of the central kiloparsec (Figure 2). Second, any uncertainties in the 6D phase-space coordinates, especially uncertainties in the distance, are far more likely to increase the inferred Rapo than decrease it, as it is a positive definite quantity. These aspects deserve careful modeling, which is beyond the scope of this paper.

Figure 5.

Figure 5. Top: Rapo vs. eccentricity space colored by the median XP metallicity. We overlay contours of linear density using a Gaussian kernel density estimate. Bottom: Distribution of the −1.9 < [M/H] < −1.2 subsample in Rapo vs. eccentricity space. This figure shows that most of the low-metallicity sample members have orbits that remain confined to the inner Galaxy (RGC < 5 kpc), with a broad eccentricity distribution. At high eccentricities, there are many stars with large apocenters, presumably representing a population of accreted stars on highly radial orbits. This population may extend inwards to the (rather tentative) excess of stars near (e, Rapo) ≈ (0.7, 4 kpc).

Standard image High-resolution image

The distribution in Figure 5 can be characterized by two regimes: most stars form a nearly flat distribution in eccentricity from e = 0.1 to e = 0.8, but with a narrow range in Rapo, with 〈Rapo〉 ≈ 4 kpc. This implies that most stars form an approximately isotropic distribution that stays confined to the inner Galaxy and hence will not appear in surveys of halo stars that focused on larger RGC and off the Galactic plane. The second regime is that of e ≥ 0.75 and Rapo ≳ 10 kpc. The orbits of these stars fully match the expectations of the pericenter members of the GSE stars, the accreted component that dominates the halo population between 10 and 30 kpc (Belokurov et al. 2018a; Helmi et al. 2018; Naidu et al. 2021). Indeed, recent work by Belokurov et al. (2022) has shown quite clearly that there are likely GSE members with pericenters within 3 kpc. There is tantalizing but inconclusive evidence from Figure 5 that the distribution of GSE stars might extend to small Rapo and lower e. Figure 5 indicates an excess of stars near (e, Rapo) ≈ (0.7, 4 kpc) that appears to continue down from the accreted GSE sequence. Here it would become important to disentangle GSE from other proposed accreted structures in the inner Galaxy (e.g., Kruijssen et al. 2019; Horta et al. 2021; see Section 3.5).

We now ask to what extent different metal-poor stars in the inner Galaxy have net angular momentum. In general, more metal-rich stars are generally expected and observed to have a more coherent sense of angular momentum. We explore this in the two panels of Figure 6, which show how close, or far, stars at a given [M/H] and Rapo are from being an ensemble of stars on circular, in-plane orbits. This is quantified via the ratio of the angular momentum, which is the azimuthal action Jϕ , to the total action ${J}_{\mathrm{tot}}=\sqrt{{J}_{\phi }^{2}+{J}_{R}^{2}+{J}_{z}^{2}}$. The top panel shows that the increase of $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}$ with [M/H] depends somewhat on the orbit size: At a given [M/H] orbits with smaller Rapo are farther away from the coplanar quasi-circular orbits than at Rapo ∼ 5 kpc.

Figure 6.

Figure 6. Dependence of the orbits' level of rotation support, as a function of [M/H] and Rapo. The rotation support is defined as 〈Jϕ /Jtot〉, where 1 is an ensemble of circular, in-plane orbits, covering both an increase in net rotation and the approach toward circular, coplanar orbits. The top panel shows 〈Jϕ /Jtot〉 as a function of [M/H] and Rapo. The bottom panel shows $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$, marginalized over 3 kpc < Rapo < 7 kpc: starting already at [M/H] < −2 there is some net $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$, which increases to ∼0.3 at [M/H] = −1.3, then steeply toward [M/H] ∼ −0.7, after which it levels off in the cold rotation-dominated regime.

Standard image High-resolution image

The bottom panel of Figure 6 shows $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$, marginalized over 3 < Rapo/kpc < 7: starting already at [M/H] < −2 there is some, albeit small, net $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$, which increases gradually to ∼0.3 at [M/H] = −1.3. Among more metal-rich populations, $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$ steeply rise with rising [M/H] to [M/H] ∼ −0.7, after which it levels off in the regime dominated by cold rotation. This analysis implies that some modest net rotation is present in the population well below [M/H] < −1.5, affirming and extending the findings of Arentsen et al. (2020a), Belokurov & Kravtsov (2022), and Conroy et al.(2022).

3.5. Abundance Patterns in the Inner Galaxy

We now enlist element abundances for our sample stars that overlap with APOGEE to explore which parts of the sample appear accreted from a chemical enrichment perspective (as opposed to proto-Galactic, or in situ). Such abundance-based arguments for the origin of stars are based on the idea that enrichment works differently in (sub)halos of different mass (e.g., Hawkins et al. 2015; Das et al. 2020; Horta et al. 2021; Belokurov & Kravtsov 2022): the production of α elements and aluminum aided by high star formation intensities in the deeper potential wells of the proto-Galaxy, compared to the shallower potential of lower-mass satellite galaxies. However, these abundance diagnostics of the stars' formation environment may be less discriminating at lower metallicities (e.g., Conroy et al. 2022).

We remove stars belonging to globular clusters and apply a few basic quality cuts to the APOGEE abundance data based on their uncertainties and flags. We plot the resulting chemical distributions, [X/Fe] versus [Fe/H], with ${\rm{X}}=\left\{\alpha ,\mathrm{Al},\mathrm{Mn}\right\}$, as a function of eccentricity in Figure 7, by median apocenter radius. The presumed boundaries between in situ and accreted are indicated by the yellow dashed lines.

Figure 7.

Figure 7. Abundance diagnostics for stars in the inner Galaxy, and their relation to these stars' orbital apocenters. We show [Fe/H], [α/Fe] (top row), and [Mg/Mn] and [Al/Fe] (bottom row) from SDSS/APOGEE (DR17) for stars with ϖ < 0.25 mas. The three columns of panels show slices of increasing orbital eccentricity from left to right, with stars colored by their median Rapo. The color map transitions from blue to red for apocenters at the solar Galactocentric radius; linear density contours of the underlying sample are overlaid in black. The gold dashed lines indicate boundaries that have been used for selecting "accreted" stars on the basis of their abundance patterns (e.g., Hawkins et al. 2015; Das et al. 2020; Horta et al. 2021; Belokurov & Kravtsov 2022; Conroy et al. 2022). Note, however, that the [Mg/Mn]–[Al/Fe] diagnostic has been found to be more ambiguous at low metallicity (see Belokurov & Kravtsov 2022; Conroy et al. 2022). Overall, the figure shows that there is a high degree of correlation between the orbital and the abundance properties. At eccentricities e < 0.8 most stars with [M/H] < −1 have both RapoRGC(Sun) and abundance patterns attributed to in situ formation. Only at the highest eccentricities, e ≥ 0.8, is there a good fraction of stars that have large apocenters and at the same time abundance patterns characteristic of accreted stars. Both of these latter properties are expected for members of GSE near pericenter.

Standard image High-resolution image

Figure 7 shows clearly that at modest eccentricities (left and middle columns), most stars belong to the α-enhanced sequence, implying a proto-Galactic origin. At high eccentricities (right columns), a prominent low-α sequence with large apocenters appears. These stars on eccentric orbits with large apocenters appear to be remnants of the accreted GSE merger based on their chemistry and kinematics (e.g., Mackereth et al. 2019; Bonaca et al. 2020; Naidu et al. 2020; Hasselquist et al. 2021; Horta et al. 2022). Note that among the stars of highly eccentric orbits there is a portion whose abundances point toward proto-Galactic origin (right column). Remarkably, they almost exclusively have apocenters ≲5 kpc, very much like their cousins on less eccentric orbits, but very unlike the chemically identified GSE debris. The population of eccentric stars attributable to the proto-Galaxy has been seen before. It has been dubbed "Aurora" by Belokurov & Kravtsov (2022) and has been explored at even lower metallicities by Conroy et al. (2022). Our sample probes to considerably lower RGC than either of these works, and reveals the bulk and full extent of the metal-poor proto-Galaxy at the heart of the Milky Way.

It is apparent from Figures 5 and 7 that both orbits and abundances can help to differentiate between presumed proto-Galactic stars and stars accreted later from a lower-mass satellite. A differentiation by chemistry alone appears increasingly difficult at low [M/H] (Conroy et al. 2022). But the combination of both aspects, summarized in Figure 7, can make such a differentiation more convincing.

Horta et al. (2021) selected "accreted" stars in these APOGEE chemical spaces as evidence of a past "Heracles" merger buried deep within the Milky Way, distinguished from GSE on the basis of a bimodal distribution in total orbital energy. This claimed galaxy bears resemblance to the previously proposed Kraken/Koala mergers (Kruijssen et al. 2019; Forbes 2020; Kruijssen et al. 2020), and these names may well refer to the same event. However, Lane et al. (2022) pointed out that the apparent bimodality in orbital energy used by Horta et al. (2021) to select Heracles/Kraken/Koala might be an artifact of the spatial selection function of APOGEE. Furthermore, Heracles/Kraken/Koala and the proto-Galactic Aurora are virtually indistinguishable in all chemical spaces (Horta et al. 2022; Myeong et al. 2022; Naidu et al. 2022).

Taken together, these lines of evidence suggest that the stars attributed to Heracles, Kraken, Koala, and Aurora, along with the numerous metal-poor stars identified here, are different orbit and metallicity regimes of one proto-Galactic stellar population, now forming the metal-poor, ancient heart of the Milky Way. Whereas past studies have mainly identified the eccentric tail of this population that traverses the solar neighborhood or off the Galactic plane (Belokurov & Kravtsov 2022; Conroy et al. 2022; Myeong et al. 2022), we now show more directly that the bulk of this population resides in the bulge itself, mostly within a few kiloparsecs of the GC. Indeed, this matches theoretical predictions from cosmological zoom-in simulations, which predict a spheroidal distribution of proto-Galaxy stars predominantly concentrated around the GC (e.g., El-Badry et al. 2018; Wetzel et al. 2022; Figure 13 in Belokurov & Kravtsov 2022).

The picture presented here appears to obviate the need for an ex situ merger like Heracles/Kraken/Koala, at least based on the orbits and chemistry of field stars. This does not necessarily imply that these mergers did not occur; rather, it emphasizes the—conceptual and practical—ambiguity between in situ and accreted stars during the earliest phases of the Milky Way, which led us to use the collective term proto-Galaxy. This view is supported by cosmological zoom-in simulations, which affirm the difficulty in distinguishing accreted galaxies from the Milky Way proto-Galaxy at early times (e.g., Renaud et al. 2021a, 2021b; Orkney et al. 2022).

4. Summary, Discussion, and Outlook

We have presented an exploration of the metal-poor stellar population in the inner Galaxy (RGC ≲ 5 kpc), drawing on the newly available low-resolution XP spectroscopy from Gaia's DR3. The primary goal was to see whether there is an extensive ancient and metal-poor stellar population at the "heart" of the Milky Way. We wanted to learn whether the bulk of these stars are a tightly bound proto-Galactic population, rather than stars accreted from more distant satellites whose location in the inner Galaxy just reflects the pericenter phase of orbits that take them out to the "classical" halo at RGCR. As simulations imply that the earliest heart of galaxies often arises from the high-redshift coalescence of clumps with comparable mass, we prefer the term proto-Galactic for the resulting population, rather than in situ, a term that singles out one of these clumps even if it has only modestly larger mass than other nearby clumps.

The practical foundations of our analysis were estimates of [M/H] of ∼2 million giant stars within 30° of the GC, at least 1 kpc from us and sufficiently bright for good S/N in the blue part of the XP spectra, GBP ≤ 15.5. We converted the XP spectral information into a set of narrowband fluxes in filters that are known to be highly diagnostic of [M/H] in cool stars. We combined this synthetic photometry with AllWISE photometry and then used them—along with the XP spectral coefficients—for a data-driven [M/H] prediction, trained on the APOGEE DR17 data using XGBoost. We found these [M/H] estimates to be precise to ∼0.1 dex. We also found them robust and pure enough to identify metal-poor samples of stars in the inner Galaxy ([M/H] < −1), despite the presence of a vastly dominant population of more metal-rich stars in the inner Galaxy. For most of these stars, radial velocities exist from Gaia RVS, allowing the estimates of their orbits.

On this basis, we identified metal-poor samples of stars toward the inner Galaxy that are about two orders of magnitude larger than published ones: > 4000 stars with [M/H] < −2, ∼20,000 stars with [M/H] < −1.5, and ∼70,000 stars with [M/H] < −1. The most metal-poor tail of the metallicity distribution extends to [M/H] ≈ −2.5. Whether this constitutes a genuine metallicity floor or reflects that the APOGEE training set for XGBoost only extended to [M/H] = −2.5 requires follow-up data.

These samples reveal a large metal-poor population ([M/H] < −1.5) in the inner few kiloparsecs of our Galaxy. Three lines of argument imply that this population is centrally concentrated. First, the parallax distribution implies that most stars are within 5 kpc of the GC (Figure 3). If we model the spatial density distribution of stars with [M/H] < −1.5 as a Gaussian centered on the GC, with an extent of ${\sigma }_{{R}_{\mathrm{GC}}}$, this parallax distribution implies ${\sigma }_{{R}_{\mathrm{GC}}}\sim 2.7\,\mathrm{kpc}$. Second, the number density projected onto the sky shows a centrally concentrated distribution (albeit severely modulated by intervening dust; see Figure 2), which is qualitatively consistent with ${\sigma }_{{R}_{\mathrm{GC}}}\sim 2.7\,\mathrm{kpc}$. Third, their orbits show that most stars have apocenters of less than 5 kpc (Figure 5). The latter property implies that most of this population could not have been found in many of the previous surveys that focused on RGC > 5 kpc. At the same time, our analysis showed that a minority (but still a large number) of stars with [M/H] < −1.2 currently in the inner Galaxy are on highly eccentric orbits that can take them to >10 kpc. These are presumably members of the accreted halo, found here near the pericenter of their orbits.

Using the members of this sample that have detailed abundances from SDSS (DR17), in particular [α/Fe], [Al/Fe], and [Mn/Fe], we explored which of these stars have abundance patterns attributable to a proto-Galactic (or in situ) versus accreted origin. We found that almost all stars that remain tightly bound within the inner Galaxy (Rapo < RGC(Sun)) have abundances that point toward a proto-Galactic origin. Stars on highly eccentric orbits with Rapo > 10 kpc show abundances typical of accreted stars and are presumed pericenter members of GSE.

We quantified how much net rotation the various monoabundance populations show and how much they resemble disk-like orbits. We did this by considering the mean ratio of the angular momentum to the total action, $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$. We found that only the most metal-poor tail [M/H] < −2 of this population shows no rotation, $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}]\lt -2)\approx 0$. Populations with higher [M/H] also have higher $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$: at [M/H] = −1.5, $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}$ has reached 0.25, then continuously increasing to $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}\approx 0.9$ in [M/H] = −0.8.

All of this information fits a picture in which this metal-poor heart of the Milky Way constitutes the most ancient proto-Galactic component of our Galaxy.

  • 1.  
    Much of this population is confined to RGC < R, or tightly bound at the bottom of our Galaxy's potential well, arguing against an origin through accretion from a once-distant satellite.
  • 2.  
    The vast majority of stars with [M/H] < −1.5 are distinctly α enhanced, arguing for rapid enrichment in a deep potential well.
  • 3.  
    The stellar mass we see at [M/H] < −1.5 within 5 kpc corresponds to ∼5 × 107 M, but manifest dust obscuration implies a large correction that suggests that the stellar mass of this metal-poor heart of the Milky Way is high, M* ≳ 108 M. Such high masses of strongly α-enhanced stars at such low [M/H] are most likely to occur at the center of a (eventually) quite massive halo, not at a low-mass satellite.
  • 4.  
    We have mapped the degree of (prograde) ordered azimuthal motion, $\overline{{J}_{\phi }/{J}_{\mathrm{tot}}}([{\rm{M}}/{\rm{H}}])$, as a function of [M/H]. At [M/H] ∼ −2 it is approximately zero, and increases continuously when considering sets of stars with increasing [M/H]. Such a relation is expected if star formation occurs—as time goes on—in gas that has increasingly settled into a disk, as recent simulations of disk galaxy formation show (Gurvich et al. 2022).
  • 5.  
    Much of this central component has [M/H] well below that of the centrally concentrated, α-enhanced, thick disk, whose oldest members (at [M/H] ∼ −1) appear to be ≈12.5 Gyr old (Xiang & Rix 2022). If much of this metal-poor heart of the Milky Way chemically, and therefore temporally, predates the oldest α-enhanced disk, it must have formed at τ ≳ 12.5 Gyr, corresponding to z ≳ 5. This stellar population at the heart of the Milky Way should be almost exclusively ancient.

The results presented here are by no means a new, distinct stellar component of the Milky Way. The stars in our sample seem to constitute the tightly bound part—and bulk—of a proto-Galactic spheroid. The distribution tail of stars on somewhat more extended orbits has already been recognized in recent work as an in situ halo (Belokurov & Kravtsov 2022; Conroy et al. 2022). But our results significantly flesh out the existing picture by showing that there is indeed a tightly bound in situ "iceberg," whose tips have been recognized before. The fact that there are many metal-poor stars in the inner Galaxy has also already been recognized by the recent work of Arentsen et al. (2020a, 2020b).

Our analysis here is in many ways preliminary and suggests various avenues of follow-up. The spatial distribution of this population deserves to be modeled quantitatively, including the effects of dust extinction, source crowding, parallax uncertainties, and the giant luminosity function. The abundances, and perhaps more importantly, the abundance patterns, deserve full spectroscopic follow-up to understand: How pure are the XP selected samples in this regime? Is there a floor in the [M/H] distribution that might reflect the [M/H] of the proto-Milky Way's circumgalactic medium? If these are indeed among the most ancient stars in the Milky Way, do they have remarkable abundance ratios? Was the formation of this component associated with the formation of an extensive set of globular clusters? What can the degree of central concentration of this component tell us about the violence of subsequent merging events, which would scatter stars to larger radii?

On a more technical side, this analysis reflects the astounding information content of the Gaia DR3 data, particularly the XP spectra. Our data-driven approach to estimate [M/H] seems to work well for the current analysis, at the price of restricting [M/H] estimates to bright objects. Presumably, we are far from exploiting the information of the XP spectra, which should be unlocked by forward modeling of the data.

V.C. gratefully acknowledges a Peirce Fellowship from Harvard University. The authors thank Vasily Belokurov and Andrey Kravtsov for helpful comments on the manuscript.

This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.

Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions.

SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss.org.

SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, Center for Astrophysics ∣ Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU)/University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional/MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University.

This work made use of the Third Data Release of the GALAH Survey (Buder et al. 2021). The GALAH Survey is based on data acquired through the Australian Astronomical Observatory, under programs: A/2013B/13 (The GALAH pilot survey); A/2014A/25, A/2015A/19, A2017A/18 (The GALAH survey phase 1); A2018A/18 (Open clusters with HERMES); A2019A/1 (Hierarchical star formation in Ori OB1); A2019A/15 (The GALAH survey phase 2); A/2015B/19, A/2016A/22, A/2016B/10, A/2017B/16, A/2018B/15 (The HERMES-TESS program); and A/2015A/3, A/2015B/1, A/2015B/19, A/2016A/22, A/2016B/12, A/2017A/14 (The HERMES K2-follow-up program). We acknowledge the traditional owners of the land on which the AAT stands, the Gamilaraay people, and pay our respects to elders past and present. This paper includes data that has been provided by AAO Data Central (datacentral.org.au).

The Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

This work has made use of the Python package GaiaXPy, developed and maintained by members of the Gaia Data Processing and Analysis Consortium (DPAC), and in particular, Coordination Unit 5 (CU5), and the Data Processing Centre located at the Institute of Astronomy, Cambridge, UK (DPCI).

Appendix: [M/H] Estimates via XGBoost Trained on SDSS APOGEE Abundances

Here we expand on a few specifics of the [M/H] estimate that is summarized in Section 3.2.

A.1. Training of XGBoost Model of [M/H]

We want to derive [M/H] from XP information for objects that still have significant flux in the blue (GBP < 15.5) yet may be reddened by several magnitudes. To help break any degeneracies between extinction and Teff, we include near-infrared photometry from AllWISE (Cutri et al. 2021). We adopt SDSS DR17 APOGEE abundances (Abdurro'uf et al. 2022) as a training sample, since it covers the inner disk and contains many giants stars with high extinctions.

For the machine-learning algorithm, we choose the extreme gradient boosting algorithm (Chen & Guestrin 2016, hereafter XGBoost), as it is computationally inexpensive to train and can outperform other algorithms, e.g., deep learning (e.g., Grinsztajn et al. 2022). To improve the [M/H] predictions for our sample (consisting of red giant stars by construction), we also restrict the training sample from SDSS DR17 sample to only giants, via $\mathrm{log}g\lt 3.5$.

This leaves us with a suitable choice of data features, for which we explored a number of options. Our starting point was the set of XP coefficients (Carrasco et al. 2021; De Angeli et al.2022), which we normalized by the Gaia apparent G flux. In Table 1, we quote the test errors for using the different numbers of XP coefficients. The performance is overall very good but depends only weakly on how many coefficients we use. This suggests that also the high-order coefficients contain useful information, and, at least for this application and the APOGEE DR17 sample, a truncation of XP coefficients is neither required nor helpful in our case.

Table 1. Test Errors of [M/H] from XGBoost from 20-fold Cross-validation Using Various Numbers of XP Coefficients

Input FeaturesMedian AERMSE
All 55 coefficients0.0610.145
First 40 coefficients only0.0590.138
First 30 coefficients only0.0600.137
First 20 coefficients only0.0620.139
First 10 coefficients only0.0660.144
XP colors only0.0780.154
All coefficients+XP colors0.0640.138
All coefficients+WISE0.0880.186
XP colors+WISE0.0670.136
All coefficients+XP colors+WISE0.0490.107

Note. We quote the median absolute error and the rms error.

Download table as:  ASCIITypeset image

Given the results in Montegriffo et al. (2022), we then attempted to only use photometry synthesized from XP spectra using GaiaXPy 14 (Ruz-Mieres 2022) and combine this with AllWISE near-infrared photometry. Specifically, we synthesize photometry from the systems of Pristine, StromgrenStd, Jpas, Jplus, SkyMapper, and HstAcswfc. As we can see from Table 1, using only such colors has a similar performance as only using the XP coefficients, despite including AllWISE photometry. However, a substantial improvement is obtained when combining all XP coefficients with these colors.

In the end, we used GaiaXPy to compute the following synthetic photometry and combined it with all 110 XP coefficients in the XGBoost training on SDSS DR17 (APOGEE):

We emphasize that the GBPW2 color has the longest wavelength leverage and therefore is highly affected by extinction, whereas the W1W2 color is comparatively insensitive to extinction.

As is evident from Table 1, this final configuration has slightly worse performance than the best configuration. However, the median absolute errors are similar, that is, most sources get similar results. The only noteworthy difference is in the rms error, suggesting that the final configuration may have a few more outliers. Still, the results are better than what we can achieve from coefficients alone.

A.2. Further [M/H] Validation

Figure 8 illustrates a cross-validation comparison of our results to other spectroscopic surveys. Such cross-validation is easy to interpret if one can assume that the external data set represents "ground truth." We applied a SNR_G > 25 to the LAMOST sample and required that there were no quality (concern) flags in any of the GALAH spectra. The left panel shows that the [M/H]XP estimates are accurate for LAMOST, which itself is tied to the APOGEE [M/H] scale, with a slight increase in scatter toward low [M/H]. The comparison with GALAH (middle panel) affirms the precision and purity of the low-[M/H]XP estimates. Note that GALAH has a different scaling between [M/H] and [Fe/H] than APOGEE and LAMOST, which may explain the offset of the estimates for low-[M/H] stars from the 1-to-1 line. In the parent sample without stringent GALAH quality cuts, there are many stars for which GALAH implies that they are [M/H] < −1, but the [M/H]XP estimate disagrees. The position of those stars in the dereddened color–magnitude plane suggests that they indeed have [M/H] > −1. In other words, for these cases [M/H]XP appears to be the more robust estimate. The comparison with GSP-Spec in the right panel of Figure 8 confirms the quality of GSP-Spec's [M/H] estimates. But it also shows that the GSP-Spec values do not cover low [M/H] (by design; see Recio-Blanco et al. 2022). It is possible to get GSP-Spec values for a wider range of metallicities by relaxing quality cuts, but this results in a more contaminated sample with systematic grid effects.

Figure 8.

Figure 8. Cross-validation of our [M/H]XP estimates against three external spectroscopic data sets, LAMOST (here Xiang et al. 2019), GALAH (Buder et al. 2021), and GSP-Spec (Recio-Blanco et al. 2022). This cross-validation supports the precision and robustness of our [M/H]XP estimates.

Standard image High-resolution image

Because a star's effective temperature (Teff) can be very helpful in sample selection and characterization, we also estimate Teff using the same XP features and XGBoost trained on SDSS DR17 APOGEE Teff, as illustrated in Figure 9. The XGBoost predictions based on XP spectra and WISE photometry match the APOGEE values within a median ΔTeff = 32 K, and with a mean difference of only 3 K. Figure 9 compares our [M/H] estimates to APOGEE as a function of Gaia GBP magnitude, showing the slight degradation toward the faintest sources in the sample.

Figure 9.

Figure 9. Top: Validation of our Teff(XP) estimates against the Teff from SDSS DR17 APOGEE. The XP predictions match the APOGEE values within a median ΔTeff = 32 K, and with a mean difference of only 3 K. Bottom: Validation of our [M/H] estimates as a function of GBP magnitude, with ${\rm{\Delta }}{[{\rm{M}}/{\rm{H}}]\equiv {[{\rm{M}}/{\rm{H}}]}_{\mathrm{XP}}-[{\rm{M}}/{\rm{H}}]}_{\mathrm{APOGEE}}$, showing that the quality of the estimate depends somewhat on GBP, as expected, but is overall unbiased.

Standard image High-resolution image

We also compared our [M/H] estimates to the extensive set of photometric estimates for metal-poor stars from Chiti et al. (2021) based on SkyMapper Data Release 2 (DR2/SM2) photometry. We found a mean [M/H] offset of 0.28 dex, and for stars with [M/H]SM2 ∼ −1.5 a central 68% interval of $p\left(\,{[{\rm{M}}/{\rm{H}}]}_{\mathrm{SM}2}-{[{\rm{M}}/{\rm{H}}]}_{\mathrm{XP}}\,\right)$ of 1 dex. Clearly, the cross-validation of our XP-based [M/H] estimates and SkyMapper DR2 shows far more scatter than our cross-validation with external spectroscopic data sets. This illustrates that for this sample XP-based [M/H] estimates outperform those of SM2.

Footnotes

  • 10  

    Belokurov & Kravtsov (2022) use the term "young Galaxy".

  • 11  

    The metallicity biases reported in Andrae et al. (2022b) presumably originate from systematics in the ab initio spectral energy distribution (SED) models and from our imperfect understanding of the XP instrument used to transfer those model SEDs into XP spectra.

  • 12  

    If stellar features are to be estimated from XP spectra in a full forward-modeling approach, the situation may be different.

  • 13  
  • 14  
Please wait… references are loading.
10.3847/1538-4357/ac9e01