[go: up one dir, main page]

A publishing partnership

The following article is Open access

Searching for Broadband Pulsed Beacons from 1883 Stars Using Neural Networks

, , , , , , , , , , , , , , , , , , , , , and

Published 2022 June 17 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Vishal Gajjar et al 2022 ApJ 932 81 DOI 10.3847/1538-4357/ac6dd5

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/932/2/81

Abstract

The search for extraterrestrial intelligence at radio frequencies has largely been focused on continuous-wave narrowband signals. We demonstrate that broadband pulsed beacons are energetically efficient compared to narrowband beacons over longer operational timescales. Here, we report the first extensive survey searching for such broadband pulsed beacons toward 1883 stars as a part of the Breakthrough Listen's search for advanced intelligent life. We conducted 233 hr of deep observations across 4–8 GHz using the Robert C. Byrd Green Bank Telescope and searched for three different classes of signals with artificial (or negative) dispersion. We report a detailed search—leveraging a convolutional neural network classifier on high-performance GPUs—deployed for the very first time in a large-scale search for signals from extraterrestrial intelligence. Due to the absence of any signal-of-interest from our survey, we place a constraint on the existence of broadband pulsed beacons in our solar neighborhood: ≲1 in 1000 stars have transmitter power densities ≳105 W Hz−1 repeating ≤500 s at these frequencies.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

1.1. Searching for Broadband Signals from ETI

The search for extraterrestrial intelligence (SETI) is one of the most profound ventures to understand humanity's place in the cosmos. Tarter (2003) argued that electromagnetic waves from technology built by extraterrestrial intelligences (ETIs), especially at radio frequencies, still serve as one of the best possible ways to detect evidence of extraterrestrial life via their "technosignatures." Following suggestions from Cocconi & Morrison (1959), a major fraction of SETI efforts have been focused on locating extremely narrowband signals across a limited fraction of the radio spectrum (Drake 1961; Verschuur 1973; Tarter et al. 1980; Valdes & Freitas 1986; Horowitz & Sagan 1993; Siemion et al. 2013; Harp et al. 2016; Tingay et al. 2016; Enriquez et al. 2017; Harp et al. 2018; Tingay et al. 2018; Pinchuk et al. 2019; Price et al. 2020; Sheikh et al. 2020; Gajjar et al. 2021).

The time–frequency formulation of the uncertainty principle suggests that ETI signals (intentional or unintentional) could also occupy the corner of parameter space corresponding to temporally limited broadband signals (i.e., transients; see Cole & Ekers 1979). Clancy (1980) was the first to discuss the advantage of a broadband beacon rather than a CW narrowband beacon, from the perspective of the ETI transmitter. Frank Drake, in Swift (1990), stated that, "The most rational ET signal would be a series of pulses that would be evidence of intelligent design." Project Cyclops (Oliver 1973), one of the most ambitious and detailed design studies for technosignature searches, also suggested broadband pulses as one class of likely ETI beacons.

The primary limitation of broadband signals is their susceptibility to propagation effects such as dispersion, scattering, and scintillation due to the intervening interstellar medium (ISM; Shostak 1995; Blair et al. 2010). However, by studying astrophysical broadband pulsed emitters such as pulsars (Hewish et al. 1968), rotating radio transients (McLaughlin et al. 2006), and fast radio bursts (FRBs; Lorimer et al. 2007), we can characterize these effects from the ISM and actually use them as a feature to identify true ETI beacons. For example, dispersion effects from the ISM cause broadband signals from astrophysical sources to exhibit a predictable early arrival at higher frequencies compared to lower frequencies. Demorest et al. (2004) and Siemion et al. (2010) suggested that ETI might send broadband pulsed signals with negative dispersion as a means of adding artificiality: broadband signals that appear to arrive earlier at lower frequencies compared to higher frequencies, contrary to natural phenomena. 13

Despite these advantages, to the best of our knowledge, no detailed targeted searches have been performed for such signals, likely due to limitations in computing resources. Distributed computing is one potential solution: ASTROPULSE (von Korff 2010) conducted blind searches for 0.4 μs long pulses with the help of thousands of volunteers to overcome these limitations. In the last decade, radio SETI has entered a new era with the advancement in computing power enabled by graphics processing units (GPUs) and widely available machine-learning (ML) algorithms. The Breakthrough Listen (BL) Initiative is a US $100 million 10 yr project to conduct the most sensitive, comprehensive, and intensive search for technosignatures on other worlds (Isaacson et al. 2017; Worden et al. 2017; Gajjar et al. 2019). The BL program aims to utilize these advances in computing power and algorithms to explore a range of possible ETI beacon types that have never been investigated before.

1.2. AI in the Search for ETI

Radio SETI searches must contend with large data volumes and a complex background of radio frequency interference (RFI) of anthropogenic origin, which could make it difficult to identify a real signal from ETI. A typical radio SETI survey defines a target signal type, and then aims to search for that signal class by designing a "filter"—for example, turboSETI looks for CW narrowband drifting signals of artificial origin (Enriquez et al. 2017). Such filter-based searches often produce millions of hits—for example, Enriquez et al. (2017) and Price et al. (2020) reported around 29 and 51.7 million initial hits from their filter-based searches toward 692 and 1138 stars, respectively. These signals almost entirely originate from human technology such as mobile phones, wireless communication technologies, and satellites. This prevalence of anthropogenic interference has tightly constrained the filters that can be employed, and consequently, the ETI signal morphologies that can be investigated with this method.

Advances in deep neural networks and in particular, convolutional neural networks (CNNs) for image classification, can help assess the validity of these large numbers of hits and reduce the quantity to a level more suited for manual inspection. Brzycki et al. (2020) designed a CNN classifier that can be trained to identify relatively weak injected ETI signals "drowning" in strong RFI using synthetic data sets. Similarly, Harp et al. (2018) demonstrated the usefulness of various neural network classifiers in identifying seven different kinds of likely ETI beacons. Recently, Pinchuk & Margot (2021) also demonstrated the usefulness of CNN-based classifiers for discriminating RFI from true ETI signals. Zhang et al. (2018) used a hybrid approach where a CNN-based classifier identified FRB candidates, and then a filter-based dispersion check reduced the number of false positives. Here, similar to Zhang et al. (2018), we have used a hybrid approach. We have developed a state-of-the-art GPU-accelerated pulse detection pipeline named SPANDAK (discussed in detail in Section 5) to carry out quick filter-based searches across a large number of observations. We have also developed and deployed a CNN-based classifier to eliminate a large fraction of false positives, presented in detail in the Appendix. With the help of the CNN classifier, we were able to reduce the large number of false positives generated from the SPANDAK pipeline by over 97%.

Here, we report the first-ever targeted search for broadband pulsed ETI beacons toward 1883 stars. In Section 2, we introduce three types of artificially dispersed broadband signals, and we compare their power budget with CW narrowband signals in Section 3. Details of our observations are presented in Section 4. We report our results in Section 6 with the implication of our findings discussed in Section 7, and Section 8 lists our final conclusions.

2. Artificially Dispersed Signals

Broadband signals passing through the ISM experience dispersion due to cold ionized plasma. As mentioned in Section 1, the frequency-dependent refractive index of this plasma will cause signals at higher radio frequencies to arrive earlier than signals at lower frequencies (see Figure 1). A typical broadband signal can be presented in a 2D array of frequency against time—[${{ \mathcal V }}_{N}$, T]—with N frequency channels and T time samples. The dispersion delay for a broadband signal can be expressed for this 2D array as,

Equation (1)

Here, κ is a constant, DM is the dispersion measure, and tH i is the arrival time bin of the signal in channel i—which is at a frequency of νi Hz—compared to the highest reference frequency of νH Hz. Figure 1 shows an example of such a naturally dispersed broadband signal represented in a 2D array of 64 × 4096 bins. We will refer to these naturally occurring signals as positively dispersed (pDM) signals.

Figure 1. Refer to the following caption and surrounding text.

Figure 1. A simulated positively dispersed (pDM) signal, with a dispersion measure of 6000 pc cm−3, showing an example of natural dispersion across our observed frequencies. The signal is stored as a 2D array $[{ \mathcal V }$, T] of 64 × 4096 bins.

Standard image High-resolution image

It is possible that ETIs would be aware of the existence of astrophysical pDM signals, but choose to use pDM signals as beacons by adding obviously artificial features (Demorest et al. 2004; Siemion et al. 2010). One method of modifying pDM signals would be the use of extremely short pulses: the ASTROPULSE project searched for pulses of the order of 1 μs (Korpela et al. 2009; von Korff 2010). However, as von Korff (2010) has suggested, other astrophysical sources are also known to emit such short pulses; FRBs have recently been shown to have submicrosecond structures (Majid et al. 2021; Nimmo et al. 2022). Another form of artificiality could be added by producing pDM signals exhibiting a repeating nonphysical sequence, for example, a Fibonacci series or fundamental frequency of any well-known element (Sullivan 1991).

Here, we postulate instead three simple variants of the pDM signal that are not yet known to occur in nature (see Gajjar et al. 2021 for details). As stated previously, broadband signals can be represented as 2D arrays (pDM ∈ $[{ \mathcal V }$, T]). Transposing either or both of the axes of these arrays adds artificiality to the pDM signals. By transposing the time axis, one can artificially produce a signal that arrives at lower frequencies first and then gradually drifts to higher frequencies. Similarly, the frequency axis can be transposed such that the shape of the pDM signal appears to be reversed. Thus, there are three different classes of axis-transposed signals that one can search for: negative time (nT: $[{ \mathcal V }$,TT ]), negative frequency (nF: $[{{ \mathcal V }}^{T}$,T]), and both negative time and frequency (nTnF: $[{{ \mathcal V }}^{T}$,TT ]). We will refer to these signals as artificially dispersed signals (aDM), as shown in Figure 2.

Figure 2. Refer to the following caption and surrounding text.

Figure 2. Simulated examples of the three different kinds of artificially dispersed transients (aDM) included in this project. We performed a search for signals from these three classes with the SPANDAK pipeline across 4–8 GHz from data collected with the Breakthrough Listen program toward 1883 stars. These signals are negative time aDMs, negative frequency aDMs, and negative time and frequency aDMs from left to right, respectively.

Standard image High-resolution image

3. Power Budget of a Broadband Pulsed Beacon

As mentioned in Section 1, almost all previous SETI surveys have searched for CW narrowband signals. In this section we compare the total energy spent on a transmitter broadcasting (a) a CW narrowband beacon against, and (b) a broadband transient aDM beacon, as we described in Section 2. The output power of any transmitter, also known as effective isotropic radiated power (EIRPout; Enriquez et al. 2017), can be expressed as

Equation (2)

Here, Pa is the power provided to an antenna in Watts and GET is the gain of the ET antenna. For transmission occurring across a bandwidth νET, we define power spectral density (PSD) in units of Watts per hertz:

Equation (3)

The transmitting antenna can be of any form: a single giant dish or multiple antennas spread across a large area operating as a phased-array. As an illustration, let us consider that the transmitting antenna is similar to the Green Bank Telescope (GBT) with antenna gain ${{\rm{G}}}_{\mathrm{ET}}\,=\,4\pi {{\rm{A}}}_{\mathrm{ET}}^{\mathrm{eff}}/{\lambda }^{2}$, where ${{\rm{A}}}_{\mathrm{ET}}^{\mathrm{eff}}$ is the effective aperture, and λ is the wavelength. For a GBT-sized telescope operating at 6 GHz with an aperture efficiency of 70%, we expect a gain of ∼3 × 107. To set a fiducial power, we will assume that our example ETI aims to send signals that can be detected at a distance D of 1000 pc by a receiver with similar gain to the transmitter antenna. The minimum required power density for an ETI transmission (${\mathrm{PSD}}_{\mathrm{ET}},\min $) to be detected depends on its directionality and other characteristics of the signal; however, we shall assume a perfect alignment of transmitter and receiver for simplicity. For a broadband signal with bandwidth similar to the receiver bandwidth (i.e., νETνr ), this can be expressed as

Equation (4)

Here, ${({\rm{S}}/{\rm{N}})}_{\min }$ is the minimum required signal-to-noise ratio for detection (assumed to be 10), SEFDr is the system-equivalent-flux-density of the receiver, which is 10 Jy at 6 GHz for GBT with receiver bandwidth νr of 4 GHz, Δτ is the temporal width of the broadband signal (assumed to be 0.3 ms). For CW narrowband signals, we expect ETIs to concentrate all of the output power into a narrow frequency, ideally νET ≤1 Hz. For such signals, the minimum detectable power density can be given as

Equation (5)

Here we have assumed a channel bandwidth Δνr similar to the transmitter bandwidth of 1 Hz, and we have set τobs, the length of observation, to be 5 minutes for our receiver. For this example, we find that the power density required to send a detectable 0.3 ms broadband beacon is lower than the power density required for a detectable continuous narrowband signal lasting 5 minutes.

The other disadvantage of sending a narrowband beacon is that it requires the sender to choose a transmission frequency. This limitation does not exist for broadband beacons, as their signals are likely to exist across several gigahertz, increasing the signal's chance of detection. However, one of the best advantages of sending a narrowband signal is the ability for the receiver to integrate the incoming signal, which allows beacons with significantly lower power levels to be received. For example, integrating for 1 hr allows us to detect power densities on the order of 6 × 106 W Hz−1. Furthermore, it is likely that an ETI might send a signal with νET ≤1 Hz, which would increase their peak power density requirement by several orders of magnitude. Similarly, an ETI might send a "comb" of narrowband signals separated by a few megahertz, which would increase their chance of detection, as suggested by Shostak (1995).

We can calculate the total minimum required power budget (or total power consumed) and compare them between these two classes of signals for operating such transmitters as

Equation (6)

Here, L is the operating time of the transmitter. Figure 3 shows that on shorter operating timescales, narrowband transmitters have an advantage; however, over longer operating timescales, the narrowband transmitter power budget will likely be similar or higher than that of a broadband transmitter. The Drake Equation (Shklovskii & Sagan 1966) implies that the chances of success in a SETI search will largely depend on the lifetime of the transmitter. In other words, longer-lasting transmissions have a better chance of being detected. Figure 3 also shows the power budget difference between the two methods, indicating that the power cost of operating a narrowband transmitter versus a broadband pulsed transmitter progressively widens with transmitter operating time. Thus, we speculate that for any sufficiently advanced ETI wishing to transmit over a timescale of several hundred years, sending broadband signals is more desirable than sending CW narrowband signals. Benford et al. (2010) carried out a detailed estimates of the capital and operation costs for pulsed ETI beacon transmitters. They concluded that short (microsecond) pulses repeating around 1000 times per second serve as the most cost effective transmission strategy.

Figure 3. Refer to the following caption and surrounding text.

Figure 3. Power budget comparison for two types of ETI beacons. For the narrowband beacon (red), we have assumed a "comb" of narrowband signals separated by 40 MHz. For the broadband beacon (blue), we have assumed a periodicity of 300 s. The vertical dashed line shows the transition point where the total power consumed for the narrowband beacon equals that for the broadband beacon, which, for our assumed example, occurs around 200 s. The absolute difference between the broadband pulsed beacon and the narrowband beacon is also shown explicitly with the green dashed line. It is apparent that the longer a transmitter operates (past the transition point), the better the broadband pulsed strategy fares vs. the CW narrowband strategy.

Standard image High-resolution image

Recently, Gajjar et al. (2021) conducted one of the most comprehensive blind surveys toward the Galactic Center. Along with CW narrowband signals, Gajjar et al. (2021) also searched for broadband artificially dispersed signals and constrained their existence near the GC with PSDET of 107 W Hz−1 among half a million stars (assuming transmitter GET ∼ 107).

4. Observations

We performed our signal search on 233 hr of BL observations conducted between 2017 July and 2018 June at the GBT from 4 to 8 GHz. These observations collectively provide the largest sample of SETI observations at these frequencies, which are higher than those chosen for the majority of prior searches. The observations employ a position-switching RFI mitigation method, observing a cadence of three "on-target" scans interspersed with three "off-target" scans. The "on" targets are drawn from the BL primary target database, which consists of around 1200 nearby stars (see Isaacson et al. 2017 for details on target selection) for the GBT. In order to improve the efficiency of our observations, the "off" targets are selected from a secondary list of nearby stars not included in the primary catalog. Here, we report observations from 2795 independent observations of 1883 stars, which includes 595 stars from the primary target database and 1288 secondary target stars. For our search, we are not bound to use the similar "on" and "off" strategy; hence, we treated 1883 targets as independent observations. Table 1 provides a truncated list of these targets, which is also shown in Figure 4.

Figure 4. Refer to the following caption and surrounding text.

Figure 4. Sky distribution of 1883 observed target stars with the Breakthrough Listen program at the Robert C. Byrd Green Bank Telescope across 4–8 GHz. These sources include a total of 1883 target stars, which includes 595 targets from the primary target database and 1288 secondary off-source targets.

Standard image High-resolution image

Table 1. Truncated List of Targets Analyzed in This Work's Search for Broadband Pulsed ETI Beacons

NameR.A. (J2000)Decl. (J200)Spectral TypeDistance (pc) ${R}_{\max }^{50}$ PSD${}_{\mathrm{ET}}^{a\mathrm{DM}}$ (W Hz−1)
GJ69917.9632224.739167M3.5V1.831701.09.23e–01
GJ820A21.11652838.763889K5.0V3.49501.03.36e+00
GJ820B21.11691938.755833K7.0V3.49501.03.36e+00
GJ2807.6548065.220278F5IV3.50501.03.38e+00
GJ15A0.30752844.024722M1.5V3.57501.03.51e+00
...
HIP10772721.82305834.064861F8840.34501.01.95e+05
HIP151593.25637847.278278A4V869.57501.02.08e+05
HIP244675.2516254.905222B8892.86501.02.20e+05
HIP10075320.42767243.967583B8900.90501.02.24e+05
HIP9685219.68691112.062444B0Ib:n980.39901.02.65e+05

Note. The Columns, from left to right, are: the name of the target star, its R.A. and decl. in J2000 coordinates, its stellar spectral type, its distance from Earth, the measured repetitiveness with ≥ 50% probability, and the estimated putative transmitter output power. The final two columns are defined in Sections 3 and 7.2.

Only a portion of this table is shown here to demonstrate its form and content. A machine-readable version of the full table is available.

Download table as:  DataTypeset image

We observed these targets with the BL Digital Recorder (MacMahon et al. 2018), which is a state-of-the-art, 64 node, GPU-equipped compute cluster at the GBT. The cluster is divided into eight banks, with each bank hosting eight computing nodes. Each computing node records a 187.5 MHz segment of incoming bandwidth, with each bank recording 1500 MHz of intermediate-frequency bandwidth. Our observations used four banks with overlapping frequency coverage, providing a total observing bandwidth of ∼4875 MHz (3563–8438 MHz), which covers the C-band receiver band of 3950–8000 MHz. We initially record data as raw baseband voltages in the GUPPI raw format (Lebofsky et al. 2019), and then convert these baseband data to total intensity SIGPROC-formatted filterbank files with 364 kHz spectral and 349 μs temporal resolution. For this study, we further binned these filterbank data sets from 13,312 to 6656 frequency channels before searching for aDM beacons.

5.  SPANDAK Pipeline

We developed several tools to search for the three different types of aDM signals shown in Figure 2. The tools comprise an entire pipeline, which we refer to as SPANDAK, as shown in Figure 5. To search for transient signals exhibiting nT–aDM type dispersion, the pipeline reversed the order of received samples to counter $[{ \mathcal V }$,TT ]. Similarly, SPANDAK reversed the order of frequency channels to search for nF–aDM signals ([${{ \mathcal V }}^{T}$,T]). For nTnF–aDM signals ([${{ \mathcal V }}^{T}$,TT ]), the orders of both time samples and frequency channels were reversed. These reversals allow any embedded aDM signal to be detected as a natural pDM signal, enabling use of the large suite of publicly available tools built to search for single pulses of astrophysical origin. For each of the observed 5 minute long SIGPROC-filterbank 14 files, we produced three reversed filterbank files corresponding to the intended nT, nF, and nTnF signals searches. We used a GPU-accelerated tool—named HEIMDALL (Barsdell et al. 2012)—as the main kernel to search for dispersed signals in these reversed filterbank files. We searched all three sets of files in parallel across three NVIDIA GTX Titan XP GPUs to expedite processing.

Figure 5. Refer to the following caption and surrounding text.

Figure 5. A schematic view of the broadband pulse detection survey pipeline SPANDAK. The input SIGPROC-filterbank files were time and/or frequency reversed to search for the corresponding class of aDM signals. The pipeline uses HEIMDALL (Barsdell et al. 2012), which is a GPU-accelerated tool to search for dispersed pulses. The lightly shaded gray block at the bottom right of the diagram represents the newly developed ML classifier that prioritizes candidate selection. The criteria used to select top candidates are listed in the darker gray block at the top-right of the ML region.

Standard image High-resolution image

For each of the order-reversed files, the pipeline ran HEIMDALL across a DM range of 10–5000 pc cm−3 with the DM steps selected such that the maximum signal-to-noise ratio (S/N) loss due to incorrect DM was always under 15%. In principle, it is possible to search for higher aDM signals; however, due to our temporal resolution of 0.3 ms, interchannel dispersion smearing greatly reduces our sensitivity for larger aDMs. We searched across a range of pulse widths from 0.3 to 42 ms. The pipeline accumulates all transient candidates reported with HEIMDALL from a single order-reversed input file and cross-references various candidate parameters (proximity of arrival times, DMs, S/Ns, widths, etc.). We removed candidates that appear across a large range of DMs within a short interval, allowing us to remove a significant number of false positives due to RFI. A short list of selected candidates was extracted from each corresponding filterbank file for further validation. We time-scrunched the extracted data such that the detected pulse would fit within two to four time bins. We also frequency-scrunched the pulse to 16–512 frequency channels based on the detected S/N. An example output plot is shown in the inset in Figure 5 (see Figure 9 of Gajjar et al. 2021 for details).

For candidate validation, the pipeline produces dedispersed dynamic spectra, where an ideal broadband transient pulse should show up across all observed frequencies. The pipeline then selects an on-pulse window based on the width and arrival time reported from HEIMDALL and extracted on-pulse and off-pulse spectra. We flagged as RFI all channels that were four times the standard deviation in the off-pulse spectra. From the remaining channels, we compared the on-pulse and off-pulse spectral energy distribution using a t-test. For a true broadband pulse, the t-test should show a significant difference between these spectra. Moreover, a true broadband dispersed signal should show both a peak at the correct DM and a gradual decline in the S/N around nearby DMs. Cordes & McLaughlin (2003) outlined a relation where a candidate with a width of Wms at the frequency of νGHz across a band of ΔνMHz shows a 50% decline in the S/N across ΔDM with respect to the S/N at a true DM. This approximation can be expressed as:

Equation (7)

We dedispersed each of the extracted candidates across a DM range of 3 × ΔDM (with 48 DM steps) to compare the S/N variations to those likely to exist for a true broadband signal. These DM-vs-time plots of S/N were also produced for each candidate for visual inspection.

Across all observations, we found 133,393 candidates that were initially detected with the SPANDAK pipeline. Visually inspecting this many candidates to identify a potential ETI signal is daunting, and would require significant personnel investment. Thus, as mentioned in Section 1.2, we have developed a fully automated CNN-based classifier to vet all detected candidates. This classifier returns probabilities for each candidate being a real broadband transient signal, as opposed to spurious interference. This ML classifier is one of the main components of the SPANDAK pipeline and analysis presented here. Full details about the ML-assisted candidate prioritization are presented in the Appendix.

6. Results

We searched ∼233 hr of total observations, which were divided into 2795 scans, each 5 minutes long. This corresponds to approximately 1883 unique stars (see Table 1) comprising 595 primary targets likely to be observed more than once and 1288 secondary stars. Our search for three different classes of aDM signals, using the SPANDAK pipeline, found 133,393 raw hits. Figure 6 shows the distribution of these hits as a function of aDM and S/N detected with the SPANDAK pipeline. We received an overabundance of hits near zero DM, which is an indication that a large number of the hits are due to local interference.

Figure 6. Refer to the following caption and surrounding text.

Figure 6. Distribution of hits detected from the search of broadband pulsed aDM beacons toward 1883 stars. (a) Histogram of measured aDM. A sharp peak near zero DM indicates a large number of these candidates originating due to RFI. (b) Histogram of measured S/N with the SPANDAK pipeline for all detected hits. For both histograms, the lighter blue color shows raw hits, while the darker blue color shows shortlisted final hits.

Standard image High-resolution image

We further shortlisted these candidates using various selection criteria mentioned in Section 5. We only selected candidates that showed a Student t-test value larger than 3.0, analogous to seeing a difference in the on-pulse and off-pulse energy distributions with a p-value ≥ 0.99. We then considered ML probabilities utilizing the CNN model described in the Appendix. We only selected candidates for which our ML probabilities were larger than 50%, given that the ML model reported an accuracy of around 98%. This helped us significantly reduce the number of likely false positives, resulting in 2948 final hits that were visually inspected. Distributions of these shortlisted hits are also shown in Figure 6. The ML-assisted shortlisting resulted in a significant reduction in the number of hits across all aDMs.

Figure 7 shows examples of our top candidates in each of the three aDM classes. Each of these candidates appears to show a dispersion relation loosely matching various aDM beacon criteria. However, we have found similar signals across a large number of targets. This indicates that such candidates arise due to coincidental alignment of spurious temporal interference, which is apparent once the data are compared with the best-fit dispersion curves in the bottom panel of Figure 7. It is likely that the fully automated time and frequency binning in the candidate plots might not be optimum. We have also developed a special interactive tool 15 by which we can iterate over a combination of frequency and time bins for any given candidate plot to improve S/N to better aid with its identification. We visually vetted all 2948 candidates but did not find any signals of interest that we could not rule out as spurious interference, and thus, no signals required further inspection through our interactive tool. Through our survey, we are therefore able to place probabilistic limits on the presence of ETI beacons toward 1883 stars.

Figure 7. Refer to the following caption and surrounding text.

Figure 7. The top candidates in the three different aDM categories found in our search for broadband ETI beacons toward 1883 stars. For each candidate plot, the top panel shows a dedispersed single pulse with an on-pulse region marked by two dotted lines along with the necessary diagnostic information to its right. The second panel from the top shows dedispersed dynamic spectra containing a broadband pulse across 4–8 GHz, with on-pulse and off-pulse spectra shown on its right in red and gray lines, respectively. The third panel from the top shows the DM-vs-time plot, with on-pulse DM vs. S/N shown in the adjacent panel on the right. The bottom panel shows the original data that triggered the hit, containing the detected dispersed pulse and two blue lines indicating the best-fit DM for visual guidance. The color in all three panels shows normalized intensities in arbitrary units.

Standard image High-resolution image

7. Discussion

7.1. Survey Sensitivity

We can estimate our survey's sensitivity for all of our targets using Equation (4), solving for the transmitter's power density (PSDET). Figure 9 shows a histogram of constrained PSDET for broadband aDM class signals. The histogram shows a bimodal distribution, reflecting the fact that our survey included two sets of targets: (1) primary targets that were within 50 pc from Earth, and (2) secondary targets that extended all the way up to 1000 pc. The median PSDET from all observed targets is around 103 W Hz−1, but the lowest expected PSDET is on the order of 1 W Hz−1 for our closest source, GJ 699. By comparison, powerful aircraft radar, 16 which also emit powerful broadband pulses (200 MHz wide), have power densities on the order of 10−3 W Hz−1.

7.2. Repetitiveness of Broadband Beacons

As mentioned in Section 3, broadband pulsed beacons are expected to repeat pulses in order to increase chances of detection. Through our observations, we can constrain the repetitiveness of these broadband aDM signals. For example, from a single 5 minute long observation, we can reject with high probability the notion that the repetitiveness of aDM broadband signals is under 300 s. For many of the targets included in this analysis, we are likely to have observed them for multiple 5 minute scans, interspaced by 5 minute scans of different targets. To measure the probability of rejection for such a set of observations, we simulated and arranged broadband pulses on the time axis and then simulated a pulse train with a range of repetition periods.

We also adjusted the phase (or offset) of these bursts within the corresponding period under consideration. These pulse trains of different offsets were overlapped with observations for a given target on the same time axis. We counted the number of instances that would have allowed us to detect at least one pulse for a given repetition period across all offsets. For periods where we were able to detect at least one pulse for all phase offsets, we can reject such repetitions with near 100% accuracy. For periods where we were able to detect one pulse for half of the offsets, we can only reject repetitions with 50% accuracy. Figure 8(a) shows the probability of rejection of repetitiveness from this above-mentioned exercise across all observed sources.

Figure 8. Refer to the following caption and surrounding text.

Figure 8. Repetitiveness distribution from the survey of broadband ETI beacons. Left: probability of rejection for a range of repetitiveness of broadband beacons for 1883 targets shown with gray solid lines. The red solid line shows a rejection probability for a typical set of three on-source and three interspaced off-source observations, each 5 minutes in length. The dashed blue and dashed–dotted red lines show 50% rejection probability at the corresponding Rmax, respectively. Right: histogram of measured ${R}_{\max }^{50}$ from all 1883 targets. Peaks around 500, 1100, and 1800 s in the measured ${R}_{\max }^{50}$ are apparent corresponding to sources observed for a total of 5, 10, and ≥15 minutes, respectively. A small number of targets were observed more than three times, populating the tail of the histogram.

Standard image High-resolution image

We measure ${R}_{\max }^{50}$, as shown in Figure 8(a), as the maximum period for which repetitiveness can be rejected with more than 50% probability for each target. Figure 8(b) shows a histogram of measured ${R}_{\max }^{50}$ for all targets. As mentioned in Section 4, our observations toward primary targets were conducted with multiple 5 minute on-source scans. As shown in Figure 8(a), for observations of primary targets with three 5 minute long on-source scans, the ${R}_{\max }^{50}$ is approximately 1800 s. For most of the secondary targets, which were observed only once, a ${R}_{\max }^{50}$ peak exists at around 500 s. There were also some stars (primary and secondary) that were observed two times, which leads to an ${R}_{\max }^{50}$ peak at around 1100 s. A few primary targets were observed more than three times on different days, which leads to a constraint on longer repetitiveness (see Figure 8(a)) and an extended tail in the ${R}_{\max }^{50}$ histogram shown in Figure 8(b).

7.3. Broadband Transmitter Occurrence Rate and Drake Equation

In recent years, radio SETI has been quickly expanding the search, eliminating regions of parameter space that are unlikely to host ETI beacons. Enriquez et al. (2017) introduced a metric known as the transmitter rate for narrowband ETI beacons, which represents the number of transmitters per star as a function of EIRP (see Figure 7 in Enriquez et al. 2017). Gajjar et al. (2021) carried out a search for broadband ETI beacons in a blind survey toward the Galactic Center and placed a limit of ≲1 on approximately half a million stars with PSDET ≳ 107 W Hz−1. It should be noted that the average required PSDET from that study was several orders of magnitude larger than that of this survey (see Section 7.1). Here, we have placed some of the very first constraints on the fraction of stars with broadband ETI beacons in the solar neighborhood across PSDET ranging from 1 to 105 W Hz−1. Figure 9 shows histograms of constrained PSDET (i.e., transmitted power densities) subgrouping them into different ${R}_{\max }^{50}$ across all targets. We then measured the cumulative distribution of these counts, represented by the shaded areas in Figure 9. This figure shows the region of the underlying parameter space our survey was able to reject for repeating broadband ETI beacons. For example, we can reject an occurrence rate of more than 1 in 1000 stars in the solar neighborhood transmitting a broadband ETI beacon with PSDET ≳ 105 W Hz−1 with a repetition rate of less than 500 s.

Figure 9. Refer to the following caption and surrounding text.

Figure 9. Transmitter rate and power density histograms for broadband pulsed ETI beacons of all types. The bottom panel shows a histogram of power density limits obtained toward all of our targets, subdivided into three subgroups of ${R}_{\max }^{50}$ (i.e., repetitiveness with ≥50% probability). The top panel shows cumulative distributions of these subgroups, shown with the shaded region, representing the fraction of stars hosting a transmitter across a range of power densities. With our survey, we can reject an occurrence rate of more than 1 in 1000 stars in the solar neighborhood transmitting a broadband ETI beacon with PSDET ≳ 105 W Hz−1 with a repetition rate of less than 500 s.

Standard image High-resolution image
Figure 10. Refer to the following caption and surrounding text.

Figure 10. Fraction of stars producing broadband signals with artificial dispersion (fc b ) as a function of PSD for the entire Milky Way. The hashed region in red represents constraints from 1883 nearby stars from our current survey and around half a million stars located at the Galactic Center from a previous survey by Gajjar et al. (2021).

Standard image High-resolution image

The Drake Equation (Shklovskii & Sagan 1966) provides a simple metric to estimate a speculative abundance of communicative ETI civilizations in the Milky Way we are likely to detect. For broadband signals with artificial dispersion, this estimation (N) can be approximated as

Equation (8)

Here, RIP is the emergence rate (yr−1) of technologically advanced intelligent life in the Milky Way, ${f}_{{\rm{c}}}^{{\rm{b}}}$ represents the fraction of these advanced civilizations producing broadband signals with artificial dispersion, and L is the lifetime of such a civilization. The RIP is a combination of the average rate of star formation and the fraction of stars providing suitable conditions and time for life to emerge and evolve. This rate is similar but not strictly related to the formation rate of planets inside the conventional habitable zones around different spectral-type stars. Moreover, due to the differential distribution of metallicity and the history of star formation in the Milky Way, RIP can be speculated to be widely different for different parts of the Milky Way. For example, Lineweaver et al. (2004) suggested that there exists a Galactic Habitable Zone (GHZ), which is an annulus extending from 7 to 9 kpc from the Galactic Center providing an ideal location for advanced life to emerge and evolve. Thus, the solar neighborhood and more broadly the spiral arms of the Milky Way are expected to have a relatively high RIP. Contrary to the Lineweaver et al. (2004) model, Morrison & Gowanlock (2015) and Gajjar et al. (2021) argued that the Galactic Center is expected to have a higher RIP due to the sheer number of stars. Although it is hard to estimate RIP and L, conducting surveys like ours can help us to jointly constrain them from the inferred limits on the fraction of stars producing signals of our interest, i.e., ${{\rm{R}}}_{\mathrm{IP}}\times {\rm{L}}\leqslant 1/{{\rm{f}}}_{c}^{b}$. Combining constraints from our current survey with those from Gajjar et al. (2021), which searched for similar classes of signals at the center of the Milky Way, enables us to estimate ${f}_{{\rm{c}}}^{{\rm{b}}}$. In our current survey, which extends up to 1 kpc from the Sun, we can roughly constrain ${f}_{{\rm{c}}}^{{\rm{b}}}$ in the solar neighborhood, or more generally, in the GHZ speculated by Lineweaver et al. (2004). Similarly, from Gajjar et al. (2021), we can get similar constraints on ${f}_{{\rm{c}}}^{{\rm{b}}}$ at the center of the Milky Way. Figure 10 shows a combined constraint ${f}_{{\rm{c}}}^{{\rm{b}}}$ from these two surveys, which can provide one of the most stringent limits on the fraction of stars producing broadband beacons in the spiral arms and center of the Milky Way.

7.4. Future Work

In the analysis presented here, we searched for dispersed signals that followed a frequency and time relation corresponding with a dispersion index of 2.0. It is plausible that a true signal from ETI might exhibit a dispersion index that is not exactly 2.0. We are continuing to explore that parameter space by searching for artificially dispersed signals exhibiting other dispersion indices using an expansion of the ML techniques presented in this paper. Moreover, in this work, we performed a search for broadband beacons reliant on bright individual pulses (i.e., with a flat period prior). Sullivan (1991) gave a list of potentially unique periods for broadband pulsed beacons that could indicate artificiality, including the lifetime of a neutron (∼896 s) or the lifetime of the most luminous optical line O ii (46.7 s). Thus, in the future, we plan to carry out a full periodicity search for these aDM signals. This will also allow us to detect even weaker broadband signals, as we can fold the underlying time series to improve S/N.

8. Conclusion

Radio SETI has so far been largely focused on searches for narrowband CW signals. We demonstrate that broadband pulsed beacons are energetically efficient compared to CW signals given longer operational timescales. We carry out one of the first comprehensive surveys for this newly suggested class of ETI beacon toward 1883 stars by searching for broadband pulsed beacons with artificial or negative dispersion. This search used 233 hours of data taken across 4–8 GHz with the Robert C. Byrd GBT. We used a GPU-accelerated pipeline named SPANDAK to search for three different classes of broadband signals: nT–aDM, nF–aDM, and nTnF–aDM. We found ∼105 initial hits from our filter-based search approach. To reduce the number of false positives, we locally designed and deployed a fully automated CNN-based classifier. We trained this classifier to identify dedispersed broadband beacons, and used it to prioritize the hits from all three classes of dedispersed aDM beacons. To the best of our knowledge, this is one of the first uses of an ML-based approach for radio SETI across such a large number of targets. With the assistance of the ML classifier, we were able to reduce the number of false positives by 97%. We did not detect an aDM beacon of artificial nature in our data sets. Hence, we place a constraint on the existence of broadband pulsed beacons in our solar neighborhood with ≲1 in 1000 stars exhibiting a PSDET ≳ 105 W Hz−1 and repeating ≤500 s.

Breakthrough Listen is managed by the Breakthrough Initiatives, sponsored by the Breakthrough Prize Foundation. We thank the staff at the Green Bank Observatory for their operational support. We would also like to thank the anonymous referee for all of the suggestions, which helped us improve our draft. S.Z.S. acknowledges that this material is based upon work supported by the National Science Foundation MPS-Ascend Postdoctoral Research Fellowship under grant No. 2138147.

Appendix: Machine-learning-assisted Candidate Prioritization

As illustrated in Figure 5, we build a single CNN classifier 17 to characterize all three types of aDM signals. We achieve this by training the ML classifier to classify dedispersed dynamic spectra and frequency-averaged pulses, rather than building three different classifiers for three different classes of aDM signals. Moreover, the same classifier can also be used to search for FRBs, which will be discussed in future publications (V. Gajjar et al. 2022, in preparation). This concept is partially similar to that presented by Agarwal et al. (2020). In the following subsections, we outline our simulation to train the classifier, model architecture, and recovery tests.

A.1. Simulating Dedispersed Broadband Signals

Modern supervised learning methods require large amounts of training data, so we created a synthetic data set of a wide variety of mock broadband signals embedded in the background of real telescope noise and spurious RFI. We randomly selected 40 observations (each 5 minutes long) from our set of 2795 observations. This randomly selected sample of observations is likely to represent the necessary backgrounds for most of our observations. To simulate a training set of broadband transient signals, we followed a similar procedure to Connor & van Leeuwen (2018) with a few modifications. We first randomly selected start times from one of the 40 observations and extracted the appropriate number of samples corresponding to a randomly selected dispersion delay (DM). Before injecting a simulated broadband signal on top of a real observation, we dedispersed the empty background to this randomly selected DM. This allows our network to see the telescope background after "dedispersion," since it is not at zero DM and thus more likely to represent the backgrounds of real dedispersed signals. This dedispersed background was time-scrunched to 256 bins and frequency-scrunched to 16 channels, providing a good compromise between data resolution and memory requirements during training.

For our simulation, we first produced a broadband dedispersed signal of constant intensity across all observed channels, akin to a frequency-averaged Gaussian pulse. The S/N of this signal was sampled from a log-normal distribution ranging from 6 to 20. The spectrum of the simulated pulse was convolved with a cosine function of random phase to mimic any frequency-dependent intensity variations. To simulate the effect of frequency channels being flagged and removed due to excessive RFI, we randomly removed 10%–50% of channels. The resulting broadband pulse was then added to the "dedispersed" background. Finally, each signal and background is also over-dispersed or under-dispersed by a small percentage, sampled from a normal distribution with a standard deviation of 0.005. This helped us imitate inaccuracies in our search pipeline that could report slightly wrong pDM values. A total of around 200,000 data points were simulated, in which 100,000 of these synthetic data contained mock broadband signals embedded in real telescope backgrounds, and the remaining 100,000 contained empty backgrounds. Figure A1 visualizes nine randomly selected broadband pulses, showing the dedispersed waterfall plot along with its frequency-averaged time series. We shuffled our simulated data set and used 50% for training and 50% for validation. We discuss the network architecture in the following sections.

Figure A1. Refer to the following caption and surrounding text.

Figure A1. Six examples of dedispersed simulated broadband ETI beacons injected into real observations. In each plot, the top panel shows a dedispersed time series containing the pulse while the bottom panel shows the dedispersed dynamic spectrum. We simulated ∼100,000 such examples, using real observations as the background, in order to train our ML classifier.

Standard image High-resolution image

A.2. Model Architecture

Compared to state-of-the-art CNNs today, our model is very simple, as the task at hand is not complex and can be likened to detecting a vertical/near vertical line in an image. For every example, the model takes two inputs, a 2D dynamic spectrum, and its corresponding 1D frequency-averaged time series (similar to examples shown in Figure A1). In Figure 5, these inputs are shown in the middle and top inset plots, respectively. These inputs are fed into two separate branches of the network—what we call the spectrogram branch and the time series branch—which extract features from the inputs and are eventually concatenated together to produce one softmax prediction. Figure A2 contains a visualization of the network architecture, with the spectrogram and time series branches on the left and right, respectively.

Figure A2. Refer to the following caption and surrounding text.

Figure A2. The neural network model architecture of the ML classifier built to identify dedispersed broadband ETI beacons. The model was trained on two different characteristics of the broadband pulse, corresponding to the two branches: the left branch is the spectrogram branch, which takes an input of a 2D dynamic spectrum, while the right branch takes an input of a 1D time series. Both branches were concatenated to assess final probability (see Appendix A.2 for details).

Standard image High-resolution image

We found that an architecture with two convolutional layers worked best for our application, retaining the ability to recognize signals without being unnecessarily complex. A 3 × 3 kernel with a stride length of 1 ensures that the convolution operation does not "skip over" broken broadband signals. In the time series branch, the number of filters in a convolutional layer is roughly half the number of filters in the parallel convolutional layer in the spectrogram branch to prevent overfitting, since detecting whether a peak is present in a 1D signal is not a hugely complex task. Each convolutional layer is followed by a BatchNormalization layer and an ReLU activation. We use MaxPooling layers at the end of every convolutional block in the spectrogram branch to reduce the dimensionality, but found that MaxPooling layers within the time series branch led to losing the peaks in the 1D signal, and thereby decided to remove them. After all convolutional blocks for both branches, we use global max pooling to transform all convolutional feature maps into a 1D tensor for each example. Subsequently, the two branches of the network are fused by concatenating the outputs of the GlobalMaxPooling layers, after which they are fed into two fully connected layers before the prediction layer. We also use Dropout layers between the fully connected layers, which have been shown to be a simple method of reducing overfitting (Srivastava et al. 2014).

A.3. Training Parameters

Preprocessing of the data is done on a per-array basis. For each array, we subtract the median from each row (the spectrum) and divide the entire array by its standard deviation. As stated above, data were split evenly between training and validation sets, such that 100,000 data points went to training and the other 100,000 went to validation.

We implement our classifier in Keras 2.0.8 and TensorFlow 1.4.1, compiling the model with a binary cross-entropy loss and the Adam optimizer (Kingma & Ba 2017). Because we value recall over precision, we weight the positive class 10 times as much as the negative class, thereby penalizing the network more for missing signals than for false positives.

We employ several Keras callbacks to supplement model training:

  • 1.  
    ModelCheckpoint: save the model only when the validation loss decreases from its last known minimum, thus only saving the model that produces the lowest validation loss.
  • 2.  
    ReduceLROnPlateau: halve the learning rate if validation loss does not improve after 15 epochs.
  • 3.  
    EarlyStopping: stop training if validation loss does not improve after 30 epochs.

With a batch size of 32, we trained our models using a single Nvidia Titan XP GPU. Though we allowed the model to run for a maximum of 500 epochs, EarlyStopping halted training after 134 epochs after not seeing a decrease in validation loss in the designated number of epochs. For the model presented in this paper, training completed the 134 epochs in about 5.5 hr. Figure A3 displays the recall and loss curves for the validation set over the course of training 134 epochs. Our model converges with a validation recall of 0.9897.

Figure A3. Refer to the following caption and surrounding text.

Figure A3. Recall and loss for the validation set over 134 epochs from the broadband transient detection ML pipeline. The left abscissa represents the recall rate from the simulated data sets, while the right abscissa represents the corresponding loss function. The ordinate represents epoch number, which appears to produce constant recall and loss beyond 110 epochs. With the help of this supervised ML classifier, we were able to reject 97% of false positives.

Standard image High-resolution image

Footnotes

Please wait… references are loading.
10.3847/1538-4357/ac6dd5