1 Introduction

Atmospheric muon neutrinos are produced in weak decays of mesons that are produced in cosmic-ray-induced air showers in Earth’s atmosphere [1]. These mesons either decay and produce neutrinos and muons or interact with air. The probability of interaction decreases proportional to the local density of air, which is inversely proportional to the atmospheric temperature [2]. When the probability for mesons to interact decreases, the probability of decay and the subsequent production of atmospheric neutrinos and muons increases. Therefore, the relative change in flux of atmospheric neutrinos and muons is expected to correlate with the relative change in the temperature of Earth’s atmosphere.

Locally, the atmospheric temperature changes continuously resulting in daily variations. Additionally, these small variations are modulated by a larger yearly variation that is related to the seasonal change of the global atmosphere. For atmospheric muons, which arise from weak decays of mesons as well, similar variations are a well-established observation, see e.g. [3,4,5,6,7,8,9,10,11,12,13,14,15]. For atmospheric \(\nu _{\mu }\) a similar measurement is challenging because of the large number of events required for a significant observation. Additionally, the flux of atmospheric \(\nu _{\mu }\) is observed from all zenith directions due to these neutrinos penetrating Earth. This requires the measurement of temperatures globally unlike the atmospheric muon flux which requires the observation of temperatures at the experiment’s location.

The production of high-energy atmospheric neutrinos and muons on average occurs at high altitudes of  20–40 km (corresponding to a range from 3 to \({60}\,\textrm{hPa}\)) where the atmospheric density is sufficiently large for the production of the parent mesons, but still sufficiently small for these mesons to decay. Therefore, the measurement of the correlation of atmospheric \(\nu _{\mu }\) with the stratospheric temperature probes the meson production in the early development of cosmic-ray air showers at high altitude. Unlike the flux of atmospheric muons, which is generally dominated by decays of charged pions, the flux of high-energy atmospheric \(\nu _{\mu }\) has a substantial contribution from decays of charged kaons [1, 16]. This is caused by the different energy fraction dependencies in meson decays for neutrinos (\(1 - m_{\mu }^2/m_{K/\pi }^2\)) and muons (\(m_{\mu }^2/m_{K/\pi }^2\)). Due to the higher mass, energy fractions of muons and neutrinos in kaon decays are much closer compared to pion decays. This contribution amounts to roughly 30 % at lower neutrino energies (\(\lesssim {10}\,{\textrm{GeV}}\)) but increases with neutrino energy reaching about 80 % at high neutrino energy (\(\gtrsim {1}\,{\textrm{TeV}} \)) (see [17]). The production of kaons at high energy is still a major uncertainty in the modeling of air showers and atmospheric \(\nu _{\mu }\) fluxes [18, 19] and the analysis of seasonal variations provides a new opportunity for constraining such uncertainties.

The IceCube Neutrino Observatory [20] is measuring high-energy neutrinos of astrophysical origin. While IceCube was designed to measure astrophysical neutrinos, it also detects a unique data sample of high-energy atmospheric \(\nu _{\mu }\) with unprecedented statistics of several hundred thousand events in 10 years at energies above \({100}\,{\textrm{GeV}}\) [21]. Preliminary studies of seasonal variations of the \(\nu _{\mu }\) flux in IceCube have been reported earlier [22, 23].

Fig. 1
figure 1

Sketches of the experimental setup. The top sketch shows in the right half the production, propagation and measurement of an atmospheric \(\nu _{\mu }\) with IceCube. It also shows in the top left the measurement of the temperature by the AIRS instrument. The observation zone (IceCube zenith from \({90}^\circ \) to \({115}^\circ \)) can be seen in the bottom sketch. This sketch also contains the definition of geometrical quantities such as h, l and \(\theta ^*\). These quantities are explained further in Appendix A.2. The sketch is not to scale

A sketch of the analysis concept is shown in Fig. 1. For each detected neutrino in IceCube, the reconstructed direction is used to determine the atmospheric temperature profile at the production site. The Atmospheric Infrared Sounder (AIRS) on NASA’s Aqua satellite [24] provides a daily coverage of the global distribution of atmospheric temperatures. This atmospheric temperature dataset allows for a description of the atmospheric conditions for the respective time and direction of each neutrino event. The data is analyzed two ways: using an un-binned likelihood approach as well as a \(\chi ^2\) fit with bins of daily temperature and neutrino rates. The results are investigated for various systematic effects and compared to predictions using the numerical cascade equation solver MCEq [17] with cosmic-ray primary particle fluxes, and different state-of-the-art hadronic interaction models.

2 Data sets

2.1 The IceCube Neutrino Observatory

The IceCube Neutrino Observatory [20] is a cubic-kilometer Cherenkov detector located at the geographic South Pole. It detects neutrino interactions by measuring Cherenkov light from secondary charged particles using 5160 digital optical modules (DOMs) buried in the antarctic ice at depths of 1450–2450 m below the ice’s surface. Each DOM consists of a hemispherical photomultiplier tube which is embedded in a glass pressure housing together with electronics for digitization, control and communication to the surface. These DOMs are placed along 86 vertical strings arranged in a hexagonal structure. This analysis uses a sample of through-going muon tracks induced by up-going and horizontal muon-neutrinos which has been recorded between April 2012 and December 2018 , corresponding to 2443 days of data-taking and an uptime of about 97%. This uptime includes further quality requirements compared to the IceCube duty-cycle.

Fig. 2
figure 2

Plot showing the data used in the seasonal variations analysis. The neutrino data is shown in daily bins (blue points/error bars), monthly bins (orange points/error bars). The red line depicts the effective temperature as defined in Eq. (1) based on the data measured by AIRS. The temperature values are on the right hand y-axis of the plot

The data selection is identical to the analysis in [21] which investigates the diffuse astrophysical neutrino spectrum. The sample uses the Earth as a natural shield (zenith range from \({85}^\circ \) to \({180}^\circ \)) to suppress atmospheric muons and additional cuts are applied on the goodness of the events reconstructions and energy deposited in the detector. The selection reaches a purity of neutrino-induced muons of  99.85%, estimated using simulations. In addition to the criteria described in [21], we narrow the field of view to neutrinos reconstructed within zenith directions from \({90}^\circ \) to \({115}^\circ \). Neutrinos in this range originate dominantly from air showers in Earth’s Southern hemisphere with geographic latitudes between \({-90}^\circ \) and \({-40}^\circ \), as shown in Fig. 1. With this choice, we exclude neutrinos from geographic regions of opposite seasons (Northern hemisphere) and regions with small seasonal temperature variations close to the Earth’s equator. The resulting sample consists of a total of 262,846 events. This corresponds to an average of 110 detected neutrinos per day. The measured daily rates of neutrinos are shown in Fig. 2. Based on the fit of the measured energy spectrum in [21], we expect a median neutrino energy of \({900}\,\mathrm{{GeV}}\), and 90 % of neutrinos to have energies between 200 and \({7700}\,{\textrm{GeV}}\).

Fig. 3
figure 3

Example of altitude-profiles of atmospheric temperatures for an IceCube zenith of 95\(^{\circ }\) throughout the years 2012 and 2013. The location on Earth corresponds to \(-80^{\circ }\) in latitude and 0\(^{\circ }\) in longitude. Also shown is the average effective production-profile of atmospheric \(\nu _{\mu }\) in dashed black. These profiles are based on calculations with MCEq and are integrated in energy with the IceCube effective detection area. The small rise towards lower altitude is not related to meson decays but to muon decays in flight

2.2 The Atmospheric Infrared Sounder AIRS

The measured neutrino rate is correlated with the atmospheric temperature at the geographic location of the parent air shower. The temperature data required by this analysis is obtained by the Atmospheric Infrared Sounder (AIRS) [24]. The satellite orbits Earth on a Sun-synchronous orbit, crossing the equator at about 13:30 h and 1:30 h local time. AIRS observes a swath of \({1650}\,{\textrm{km}}\) and gives an angular resolution of \({1}^\circ \times {1}^\circ \) in latitude and longitude. This results in observational gaps around the equator (see Fig. 10) but a good overlapping coverage of the atmosphere in the considered geographic range with at least two observations per day for every location. The AIRS data product is publicly available [24]. For each measurement the data include temperatures on 24 isobar levels ranging from 0.1 to \({1000}\,{\textrm{hPa}}\), and additionally include the altitude of each of these pressure levels. The combined accuracy is given as \({1}\,{\textrm{K}}\) per \({1}\,{\textrm{km}}\) of vertical depth [24]. As an example, Fig. 3 shows for one location the yearly variation of measured temperatures and the predicted effective neutrino production yield that is weighted with the detection efficiency of IceCube. The figure highlights regions in the atmosphere with temperature variations that are relevant for atmospheric \(\nu _{\mu }\) detected by IceCube.

2.3 The Effective Temperature

The geographic location of the parent air shower of an observed atmospheric \(\nu _{\mu }\) is given by the observed direction of the neutrino (see Fig. 1) that is expressed by the zenith angle \(\theta \) and the azimuth angle \(\varphi \). The atmospheric altitude of production cannot be measured by IceCube, and thus the atmospheric temperature measured by AIRS has to be averaged along the line of sight l as shown in Fig.  1. The altitude h of production is related to the atmospheric depth \(X(h,\theta ,\varphi ) = \int _{l}^{\infty }dl' \rho (l'(h,\theta ,\varphi )) \) which integrates the density from the upper edge of the atmosphere along the line of sight towards the observer. The calculated average is referred to as effective temperature:

$$\begin{aligned} T_{\textrm{eff}}(\theta ,\varphi ,t) = \frac{\int dX\, T(X,\theta ,\varphi ,t) R_X(\theta ,\varphi , X, T) }{\int dX R_X(\theta , \varphi , X,T)}. \end{aligned}$$
(1)

Here, t is the neutrino arrival time, \(\theta \) and \(\varphi \) are the local IceCube zenith and azimuth direction. These are, because of the location of IceCube at the geographic South pole, closely related to Earth’s latitude and longitude. The depth dependent neutrino rate \(R_X\) is defined by:

$$\begin{aligned} R_X(\theta , \varphi , X, T) {=} \int dE \; P(X,E,\theta ,T(X, \theta , \varphi , t) ) \; A_{\textrm{eff}}(E,\theta ). \end{aligned}$$
(2)

Here E is the neutrino energy and \(P(X,E,\theta ,T)\) is the atmospheric \(\nu _{\mu }\) production yield, defined by \(\int dX \; P = \frac{d\varPhi }{dE}\) in [2], with \(\varPhi \) being the the atmospheric \(\nu _{\mu }\) flux. \(A_{\textrm{eff}}(E,\theta )\) is IceCube’s effective area, estimated by Monte-Carlo simulations. The temperature T depends on the direction \(\theta \) and \(\varphi \) and the time t. The integral is approximated by the sum over the pressure levels given by AIRS, with the atmospheric depth being calculated for each level using the pressure, temperature and altitude. Both the neutrino production yield \(P(X,E,\theta ,T)\) in the atmosphere and the effective detection area \(A_{\textrm{eff}}(E,\theta ) \) of IceCube depend on energy and zenith, and have to be integrated when the atmospheric temperature is weighted with these quantities. The temperature data is discussed in  Appendix A.1. A detailed description of the implementation is given in  Appendix A.2.

In this analysis, we neglect the angular dependency of effective temperatures for the selected field of view and only consider time information for the correlation. Therefore, the angular dependence is averaged over the entire considered zenith range from \({90}^\circ \) to \({115}^\circ \).

$$\begin{aligned} T_{\textrm{eff}}(t) = \frac{\int d\varOmega \; T_{\textrm{eff}}(\theta ,\varphi ,t) \cdot \int dX R_X(\theta , \varphi , X,T) }{\int d\varOmega \; \int dX R_X(\theta , \varphi , X,T) }. \end{aligned}$$
(3)

Here, \(d\varOmega = d\varphi \cdot d\cos (\theta )\) is the solid angle as seen from IceCube. The zenith angle \(\theta \) can be approximately related with the geographic latitude l by \(\theta \approx l / 2 + {135}^\circ \) for \(\theta >{90}^\circ \). IceCube’s azimuth \(\varphi \) and Earth’s longitude \(\lambda \) are related by \(\varphi = -\lambda + {90}^\circ \) if \(\lambda \le {90}^\circ \) and \(\varphi = -\lambda + {90}^\circ + {360}^\circ \) if \(\lambda > {90}^\circ \). To correctly account for the differences of local measurement times with respect to the IceCube time zone (UTC), these respective local times are converted into UTC. Then, for averaging the global \(T_{\textrm{eff}} \), the temperatures are interpolated for all directions considered in the analysis with respect to the UTC times of the temperature measurements and then averaged. More details are found in [25]. The resulting temperatures for each UTC day are represented as the red line in Fig.  2. The temperature data clearly follow the annual seasons and also the daily neutrino rates, shown in blue, indicate a seasonal variation. The monthly averaged neutrino rates in orange highlight the correlation, which is analyzed quantitatively in the next section.

3 Analysis methods and results

This section introduces two methods to analyze the data: A binned \(\chi ^2\) fit, and an un-binned likelihood. The \(\chi ^2\)-fit is used in similar atmospheric muon analyses, and gives a result which is comparable to [4]. The \(\chi ^2\)-fit assumes gaussian distribution of event counts, which holds true for large event counts without introducing a reconstruction bias. To avoid dependencies on the statistical distribution of events an un-binned likelihood method is used as a complementary measurement. The un-binned likelihood is a new method, which is optimized for the smaller neutrino rates, and can be extended to account for an angular dependence in future analyses.

3.1 Binned \(\chi ^2\)-fit

To estimate the correlation between the atmospheric temperatures and the measured atmospheric \(\nu _{\mu }\) rates, a linear relation between the relative neutrino rate change and relative effective temperature change is approximated by [2, 23]

$$\begin{aligned} \frac{R(t)-{\bar{R}}}{{\bar{R}}} = \alpha \frac{T_{\textrm{eff}}(t)- {\bar{T}}_{\textrm{eff}} }{{\bar{T}}_{\textrm{eff}}}, \end{aligned}$$
(4)

where \(\alpha \) is the slope parameter, \({\bar{R}}\) the average measured neutrino rate of the whole observation time, and \( {\bar{T}}_{\textrm{eff}}\) the corresponding average effective temperature. The parameter \(\alpha \) measures the strength of the correlation. The neutrino rate \(R_i\) for each day i is calculated by dividing the neutrino count \(N_i\) by the effective detector livetime \(\tau _{i}\) of that day. The effective temperature is evaluated at noon (GMT) of each day through interpolation. Large uncertainties that stem from the primary cosmic-ray flux and systematic detector effects cancel out by focusing on the relative change of rate and temperature. To estimate \(\alpha \), similar to muon seasonal variations analyses [4, 8], the \(\chi ^2\) is minimized:

$$\begin{aligned} \chi ^2 = \sum _i^{N_{\textrm{tot}}} \frac{\Big (\frac{R_i-{\bar{R}}}{{\bar{R}}} - \alpha \frac{T_{\textrm{eff}; i}- {\bar{T}}_{\textrm{eff}} }{{\bar{T}}_{\textrm{eff}}} - b \Big )^2 }{\sigma ^2_{i}}. \end{aligned}$$
(5)

In this \(\chi ^2\)-definition an additional bias parameter b is added. This ensures robustness against systematic shifts in the average temperature and neutrino rate. The uncertainty is approximated by \(\sigma _i = \sigma _{R_i}/{\bar{R}}\) and \(\sigma _{R_i} = \sqrt{N_i}/\tau _{i}\). As the uncertainty is dominated by the limited amount of statistics of neutrino events, we neglect the uncertainty of the effective temperature and systematic uncertainties in the fit. The correlation of the quantities in Eq. (4) is plotted in Fig. 4, together with the result of the \(\chi ^2\) fit, with the best fit parameter \(\alpha =0.357\pm 0.030\). The p-value corresponding to a \(\chi ^2/ndof = 2519/2436\) is 0.117, which indicates agreement of the linear model with the data. The uncertainty of \(\alpha \) is dominated by the amount of neutrinos measured per day, but also by the range of temperature variations which is limited to \(\pm {8}\,\%\) (x-axis of Fig. 4). The observed bias is compatible with zero. The significance using the \(\chi ^2\) difference between the observed \(\alpha \) and the null hypothesis of \(\alpha = 0\) is more than 10 standard deviations. As discussed below in Sect. 3.3, the resulting \(\alpha \) has been tested for its robustness with respect to the used bin-size in time and it is found to be approximately unchanged, even if time-bins up to a month are used (see Fig. 6).

Fig. 4
figure 4

Plot showing the correlation of the measured relative rates of atmospheric \(\nu _{\mu }\) and the relative effective temperature change. The orange line depicts the result of the \(\chi ^2\) fit of the shown data, with the values written in the legend. “gof“ refers to the goodness of fit p-value, calculated from the \(\chi ^2\)-value

3.2 Un-binned likelihood

To exploit the full available time information of measured neutrinos, an un-binned likelihood technique is used. Compared to the \(\chi ^2\) fit this approach is also independent of the underlying statistical distribution of the events. This minimizes potential biases which could occur at small neutrino rates. In a linear approximation, the probability density distribution of neutrinos in time is given by

$$\begin{aligned} f(t) = \frac{1}{\tau _{\textrm{tot}}} \Bigg (1 + \alpha \frac{T_{\textrm{eff}}(t)- {\bar{T}}_{\textrm{eff}} }{{\bar{T}}_{\textrm{eff}}}\Bigg ). \end{aligned}$$
(6)

Here, \(\tau _{\textrm{tot}}\) is the total effective livetime. The total livetime is given by,

$$\begin{aligned} \tau _{\textrm{tot}} = \int _{\textrm{obs}} dt \; \epsilon (t) \end{aligned}$$
(7)

with the measurement efficiency \(\epsilon (t)\) defined as 1 if the detector is running and 0 if it is not taking data. In the same way the average effective temperature is calculated

$$\begin{aligned} {\bar{T}}_{\textrm{eff}} = \frac{1}{\tau _{\textrm{tot}}} \int _{\textrm{obs}} dt \; \epsilon (t)\cdot T_{\textrm{eff}}(t) ~. \end{aligned}$$
(8)

The log-likelihood is given by evaluating the probability density at the time of each measured neutrino event i and then summing up:

$$\begin{aligned} \begin{aligned} LLH&= \sum _i^{N_{\textrm{tot}}} \log (f(t_i)) \\&= \sum _i^{N_{\textrm{tot}}} \Bigg ( \log \Bigg (1 + \alpha \frac{T_{\textrm{eff}}(t_i)- {\bar{T}}_{\textrm{eff}} }{{\bar{T}}_{\textrm{eff}}})-\log (\tau _{\textrm{tot}}\Bigg )\Bigg ). \end{aligned} \nonumber \\ \end{aligned}$$
(9)

Minimization of the negative LLH with respect to \(\alpha \) results in the best estimate for the temperature coefficient. The resulting likelihood-profile in dependence of \(\alpha \) can be seen in Fig. 5, together with the result of the minimization and the uncertainty estimated by the width of the parabola at \(-2\cdot \varDelta LLH = 1\): \(\alpha =0.347\pm 0.029\). The uncertainty on \(\alpha \) is comparable to the \(\chi ^2\) result. There is also a small shift in \(\alpha \), which is consistent within the uncertainties of the two analysis methods. The gain from the additional time information in the likelihood-method is found to be small. However, in a future analysis this un-binned likelihood approach can be extended to include also directional information without the need to average temperatures over the observation zone.

Fig. 5
figure 5

Figure showing the negative log-likelihood profile for the un-binned likelihood defined in Eq. (9). In the inlay figure a zoom of the two sigma region is shown

3.3 Systematic uncertainties

For the discussion of systematic uncertainties, the contributions of the relative neutrino rate and the relative effective temperature including the effect of averaging of data over the Southern hemisphere have to be considered.

Uncertainties of the neutrino rates may arise from the estimation of the effective livetime and the possible contamination by background from wrongly reconstructed atmospheric muons, both of which are found negligible. The atmospheric muon rate is correlated with the same seasonal phase as the neutrinos in the Southern hemisphere but with a correlation factor about twice as large [4]. The contamination of the data sample with atmospheric muons has been estimated in [21] to about 0.15 %. This is further reduced by restricting the zenith angle to \(\theta > {90}^\circ \), leading to a rate of less than \({0.16}\,\hbox {d}^{-1}\). Assuming a maximum change of relative temperatures at the level of 8\(\%\), the variation of the rate is negligible and amounts to less than \(\varDelta R \approx {0.009}\,\hbox {d}^{-1}\). As mentioned in Sect. 2.1, IceCube is operating with a duty cycle close to 97 %. The uncertainty of the 1 % downtime is further reduced by excluding specifically days of low duty cycle from the analysis. The remaining loss in livetime that could occur, for example during run transitions, can be calculated to an accuracy of a few microseconds.

Estimating the uncertainty of \(T_{\textrm{eff}}\) is more challenging. The accuracy of the measured temperatures is estimated depending on the height [24] as \({1}\,\hbox {K}\,\,\hbox {km}^{-1}\) (see Sect. 2.2). Assuming additionally a height uncertainty of 1% in the integration of the atmospheric depth, this implies an uncertainty of \(T_{\textrm{eff}}\) of 0.25% or about \({0.7}\,{\textrm{K}}\). This estimation is supported by the observed difference of the estimated \(T_{\textrm{eff}}\) between the ascending and descending measurements of each day for which a standard deviation of \({0.45}\,{\textrm{K}}\) is found.

The sufficient coverage of the atmosphere in time by the satellite can be tested with the binned analysis by increasing the bins from days to longer periods and using only the temperature at the mean time of the bin. The result is shown in Fig.  6. Even if increased to the scale of a month, the obtained \(\alpha \) value remains almost unchanged. Only on time-scales of 3 months or more does the correlation decrease significantly due to the averaging of the seasonal effects. This, in turn, is a strong indication that an even better coverage in time and correspondingly smaller time bins would result in an unchanged \(\alpha \) and that the time coverage of the temperature data is sufficient.

As discussed above, fractional ground-coverage and topographic changes (\(< {0.1}\,\%\)) can be neglected for the chosen Southern hemisphere region. It turns out that the largest uncertainty arises from the numerical integration of the slant depth. Depending on the choice of bin-boundaries in the atmospheric depth integration, the relative temperatures can change up to 1 %. Another concern is the absolute accuracy of the temperature data. For estimating this uncertainty we have estimated the \(T_{\textrm{eff}}\) values with a partly independent data set of atmospheric temperature profiles that also include higher altitudes [26], see also  Appendix A.1. The difference of the daily determined \(T_{\textrm{eff}} \) values has a standard deviation of 0.8 % which supports a robust estimation of \(T_{\textrm{eff}} \).

The median uncertainty of the directional reconstruction of neutrino events is roughly \({1}^\circ \) at \({1}\,{\textrm{TeV}}\) neutrino energy and improves to \({0.3}^\circ \) at \({1}\,{\textrm{PeV}}\) [27]. This results in a small mismatch of the assumed location of the parent air shower from its true location. By averaging the \(T_{\textrm{eff}} \) over zenith and azimuth in the observation zone, this effect becomes negligible. Also other systematic detector effects, like the photo detection efficiency of the DOMs or optical properties of the ice, can be largely excluded. The detector is being operated in an almost unchanged configuration during the observation time and small changes in the effective area do cancel in the relative rates. Related to the detection of neutrinos with IceCube, additional uncertainties arise. This is tested by varying systematic effects of the detector, e.g. the absorption length in the ice or the quantum efficiency of the PMTs, in the MC simulation. The largest deviation of \(\alpha \) was found to be \(\delta \alpha = 0.005\), so the systematic detector effects can be neglected compared to other uncertainties. Effects from the uncertainty of the primary cosmic-ray spectrum on the prediction are tested in two ways: The first method varies the spectral index of the primary spectrum by multiplying the flux with a factor \((E/E_0)^{\varDelta \gamma _{CR}}\). A change of \(\varDelta \gamma _{CR}\) of  0.05 results in a change \(\delta \alpha = 0.005\). The second method changes the composition of the primary flux by fixing it to either pure proton (\(\ln (A) = 0\)) or pure iron (\(\ln (A)=4\)). The difference in \(\alpha \) between these extreme scenarios is  0.02.

Fig. 6
figure 6

Plot showing the dependence of \(\alpha \) on the bin-size obtained from the \(\chi ^2\)-fit. Only statistical uncertainties on \(\alpha \) are given

4 Comparison with model predictions

In this section the experimental measurement is compared to model expectations that are based on the physics of cosmic-ray air showers.

4.1 Atmospheric \(\nu _{\mu }\) flux modeling

The model expectation of the seasonal variation of the neutrino flux is based on a full numerical calculation of the cascade equation of atmospheric air showers using the numerical cascade equation solver MCEq [17]. The calculation of the neutrino flux is done for each day at each location. Here we explicitly use the daily-measured local temperature profiles measured by AIRS, as they are used in the calculation of effective temperatures. Therefore, the specific properties of the local and time dependent atmosphere is accounted for as precisely as possible. This approach thus includes the full seasonal variation of temperature profiles of the global atmosphere during the observation time. We also compare the results of the analysis to the expectation of an analytic approximation of the cascade equation [2], which is also used to estimate the production probability in Eq. (1). The analytic approximation embeds the temperature dependence in the critical energies \(\epsilon _{\pi ,K}\). The critical energy is the energy scale above which re-interactions dominate decay processes of the parent mesons. Both of these approaches are compared in [28]. For the calculations with MCEq, hadronic interactions are modeled by SIBYLL 2.3c [29] and the cosmic-ray primary particle flux by the H4a flux model [30]. After integrating the modeled neutrino fluxes \(\varPhi (E,\theta ,\varphi ,t)\) with the effective area of IceCube \(A_{\textrm{eff}}(E,\theta )\), this method gives the expected neutrino rates R(t) for each day during the observation time:

$$\begin{aligned} R(t) = \int dE \; d\varOmega \; A_{\textrm{eff}}(E,\theta ) \frac{d\varPhi (E,\theta ,\varphi ,t)}{dE}. \end{aligned}$$
(10)

From this time dependent rate expectation, we generate an ensemble of pseudo-experiments of neutrino events for the considered observation time that are then analyzed using the same methods and the same effective temperature data as the experimentally measured neutrino events.

A comparison of the expected correlations with the experimental results for the two analysis methods, are shown in Table 1. In addition to the full MCEq calculation and the analytic approximation, the expectation from the extreme cases of only pions or kaons as parent particles are given. The agreement between the analytic approximation and the MCEq based result is high, underlining the result in [28]. The experimental measurement finds a correlation that is smaller than the theoretical prediction by about 2–3 standard deviations. This tension is investigated further in the next sections.

Table 1 Table of the experimental results and predictions for \(\alpha \) using the binned \(\chi ^2\) and un-binned LLH analysis methods. The prediction for MCEq [17] used the SIBYLL 2.3c [29] hadronic interaction model. The kaon and pion results rely on the flux fraction of MCEq to estimate their neutrino production rate. The uncertainty given for the predicted values corresponds to the expected statistical uncertainty for each measurement

4.2 Uncertainties of the atmospheric \(\nu _{\mu }\) flux

Uncertainties in the prediction of the flux of atmospheric muon neutrinos are related to the flux of primary cosmic-rays, the composition of the primary particles, and the production and re-interaction probability of parent mesons (kaons and pions) in the atmosphere. In particular, a substantially larger number of parent kaons (see Table 1) would reduce the observed tension. Changes in the fraction of parent mesons can be related to uncertainties in the respective production and re-interaction cross sections of these mesons in particle interactions in the air. Such uncertainties are described in [18], by introducing parameters in different regions of primary nucleon energy and secondary meson lab energy fraction \(x_L\). As shown in [21], only high-energy (\(>{100}\;{\textrm{GeV}}\)) meson production is relevant for the atmospheric \(\nu _{\mu }\) flux. For an estimation how these uncertainties propagate to the expected seasonal variation of atmospheric \(\nu _{\mu }\) fluxes, we have varied these parameters assuming independent Gaussian uncertainties for each of these and calculated the corresponding value of \(\alpha \) for each instance. This procedure is repeated, in addition to the SIBYLL 2.3c model [29], also for different hadronic interaction models: EPOS-LHC [31], QGSJet-II 04 [32], and DPMJet-III 19.1 [33]. The resulting distributions of \(\alpha \) values are displayed in Fig. 7. It can be seen that all interaction models predict rather similar \(\alpha \) values with also a similar variance when including the variation of the parameters in [18]. The observed tension thus persists also under variation of the atmospheric and hadronic model parameters.

Fig. 7
figure 7

Plot comparing the predictions (error-bar and violins) of different hadronic interaction models with the experimental result (horizontal red line) of this analysis (un-binned LLH). The systematic uncertainties of the predictions are estimated by varying several Barr parameters [18] inside their prior distributions, with the resulting likelihood distribution of expected \(\alpha \) values being represented by the violins (larger width corresponds to a higher likelihood). The p-values of each hadronic interaction model are estimated in Table 2

For each of the hadronic interaction models, a p-value for the agreement with the experimental result based on the statistical uncertainty of the experiment and the systematic uncertainty from the atmospheric modeling is calculated and shown in Table 2. The best agreement is found for the DPMJet-III 19.1 model, with a tension of 1.8 \(\sigma \), the largest tension is found for the QGSJet-II 04 model with a tension of 2.3 \(\sigma \).

Table 2 Summary of the expected \(\alpha \)-values for different hadronic interaction models with the uncertainty estimates based on the method in [18]. The given uncertainties reflect the 68% region around the central value. The p-value giving the comparability of prediction and experimental result includes the statistical uncertainty of the experiment as well as the uncertainty estimated by varying the Barr-parameters. All predictions used MCEq with the H4a primary cosmic ray model

5 Systematic effects of the observed tension

In order to investigate the origin of the observed tension between model predictions and our measurement, Fig. 8 shows the measured and expected rate variation as a function of the respective temperature difference. The figure indicates an inverted sigmoidal trend of the data for small values of \(\varDelta T_{\textrm{eff}}\). While the extreme values of \(\varDelta T_{\textrm{eff}} \) would be consistent with the expectation, the overall range results in a smaller overall slope for the experimental data. The apparent region close to small absolute \(\varDelta T_{\textrm{eff}}\) corresponds to the spring and fall seasons at the South Pole. These times are dominated by rapid temperature changes with large fluctuation particularly in spring as seen in Fig. 2.

Fig. 8
figure 8

Plot depicting the predicted and measured relative neutrino rate against the relative temperature variations bins. The slope corresponds to \(\alpha \). The prediction is made using MCEq with the SIBYLL 2.3c hadronic interaction model

In the following sections, we investigate the effects of splitting the data into smaller, systematically split sets. The results of the different splits are summarized in Fig. 9.

Fig. 9
figure 9

Plot summarizing the main results for the slope parameter \(\alpha \), after applying either the binned \(\chi ^2\) fit (dots) or the un-binned likelihood analysis (squares). This is done for the full data set (red, first column), for each individual year starting in May (red, second column) and three different splits of the data: By energy (third column), by rising and falling temperatures (fourth column) and by splitting into caps and flanks (last column). For comparison, predictions of the respective data are shown as gray bands, with uncertainties being estimated using the approach described in [18]. The dashed red line is an extension of the overall result for \(\alpha \), to compare it to the individual results

5.1 Yearly splits

When splitting the observation periods into single years and repeating the analysis, the obtained \(\alpha \) values show larger fluctuations. These are, however, consistent with the larger statistical uncertainties. For the years 2012, 2015, and 2016 the obtained values agree with the expectation, while for the year 2014 the smallest \(\alpha \) value is measured. Detailed investigations of specific peculiarities of the IceCube detector operation as well as the used AIRS temperature data have revealed no indications of any specific difference for this period. Therefore, the observed fluctuations away from the all year mean are considered of statistical origin.

5.2 Energy dependence

The correlation between the atmospheric \(\nu _{\mu }\) rate and the atmospheric temperature exhibits a strong energy dependence, and up to this point the correlation analysis effectively averages the effect over the energy distribution of measured neutrinos. In order to verify the expected energy dependence and the implicit averaging of the spectrum, the full data is split into two samples with respect to low and high reconstructed energy [34]. The measured energy distribution is shown in Fig. 13, and the data is split at the median reconstructed energy value of \({700}\,\textrm{GeV}\) which thus yields two samples of equal statistical power. The analysis is repeated for each sample separately and results are shown in Fig.  9, together with the theoretically predicted values for the same energy split. As expected, in the low-energy bin the value of \(\alpha \) becomes smaller in contrast to the larger value at high energies. The same effect is observed for the prediction as well. This reflects that the decay probability decreases compared to the interaction probability of the parent mesons with higher energy, leading to a larger seasonal dependence.

The tension between model and experiment is similar for both energy regions. This observation disfavors effects related to the energy dependence of the detector response or the prediction as origin of the observed tension. One can conclude that the tension persists independent of the selected energy range.

5.3 Hysteresis due to seasonally dependent atmospheric layering

During seasons of generally rising or generally falling temperatures, the layering of the temperatures can strongly differ (see Fig. 3). Due to the marginalization of the altitude information in the calculation of \(T_{\textrm{eff}} \), different temperature profiles in spring and fall can result in a small difference of the atmospheric \(\nu _{\mu }\) rates for the same value of \(T_{\textrm{eff}} \). This effect has been observed for atmospheric muons [4], and is called hysteresis effect because of the difference in rate for different seasons at the same \(T_{\textrm{eff}} \) value. The non-linearity in the relation between rate and temperature is expected on the level of less than \({1}\,\%\) difference of the measured rates between spring and fall which is much smaller than the effect observed here. Furthermore, our calculation of the expectation that includes each measured temperature profile does in fact include this effect and results in a small expected difference between spring and fall of less than \({0.4}\,\%\).

As an experimental verification, the data is split at the maximum and minimum points of temperature into a falling (fall) and rising (spring) samples (see Fig. 13). The results of the split is shown in Fig. 9. A difference between the two split samples is seen with a larger value of \(\alpha \) in the spring season, which however both are statistically compatible with the average value. Unlike the experimental observation this large seasonal difference is not predicted by the theoretical calculation, despite having included the seasonal hysteresis effect into the calculation.

5.4 Extreme temperature bins

Following up the observed deviations in the spring and fall seasons, as seen in Fig. 8 in the mid-high and mid-low relative temperature variations, we further investigate systematic effects in the observed data. We separate the data into “caps” and “flanks” (see Fig. 13). Here, caps correspond to seasons of extreme temperatures, i.e. winter and summer, and flanks to the data in the transition seasons. To the right in Fig. 9, 25 % is included in each cap, i.e. 50 % of all the data, and thus 50 % of all the data in the flanks data set. As the lever arm of the extreme points in the fit remains the same compared to the complete data set, the uncertainty of the “caps” is smaller compared to the flanks. Again, the prediction has been calculated separately for the subsets of data being analyzed. When limiting the analyses to the caps, the value of \(\alpha \) is consistent with the theoretical expectation. On the other hand, the observed value of \(\alpha \) of the flanks strongly disagrees with the predictions which do not depend on the chosen selection. The systematic shift observed in this split is similar to the shape in Fig. 8. Further systematic studies based on data splits can be found in [35]. This also includes a zenith-dependent analysis of \(\alpha \), which was excluded as it did not contain additional insights.

6 Conclusion and outlook

In this paper we present the analysis of the correlation of the rate of high-energy atmospheric \(\nu _{\mu }\) measured by IceCube with the effective atmospheric temperature based on atmospheric profiles measured by the AIRS instrument on the Aqua satellite. For an observed 10% variation of the effective atmospheric temperature we observe a 3.5(3)% highly significant (\(> 10 \; \sigma \)) seasonal variation of the rate of high-energy atmospheric neutrinos.

In the comparison with the expectation from cosmic-ray air shower models predicting a larger seasonal variation of 4.3%, a tension of about two to three standard deviations is observed. This tension is marginally consistent with a statistical fluctuation, and cannot be explained by known systematic uncertainties: neither the experimental measurement, nor the used satellite data, nor the modeling of air showers. In comparison to the muon seasonal variation analysis [5, 36, 37], we observe a smaller value for \(\alpha \), which is expected due to the larger kaon contribution to the neutrino flux. The re-analysis of systematically selected subsets of the data shows deviations from the average model, with some being predicted (reconstructed energy) and others not being predicted (rising/falling temperatures, extreme temperatures). This is interpreted as a hint that the production of atmospheric \(\nu _{\mu }\) during rapidly changing atmospheric conditions may not yet be fully understood.

The observed tension demonstrates that in the future measurements of seasonal variations of atmospheric \(\nu _{\mu }\) may provide a new and complementary test of our understanding of the physics of atmospheric air showers. As the present analysis is still limited by statistical uncertainties, a future analysis needs to include a larger statistics data set. Five additional years of IceCube data taking should become available for a follow-up analysis soon. In addition, the analysis itself can be expanded by taking into account the atmospheric profile in the specific direction of the observed neutrino events. A substantially larger detector, IceCube-Gen2, [38] will allow observing the atmospheric temperature correlation with much increased statistics and thus allow for testing the correlation during shorter periods of time.