Pseudo-Bayesian Approach for Robust Mode Detection and Extraction Based on the STFT
<p>Character of the variational objective obtained by varying the ABD hyperparameters <math display="inline"><semantics> <mi>α</mi> </semantics></math> and <math display="inline"><semantics> <mi>β</mi> </semantics></math>.</p> "> Figure 2
<p>RQF difference between the Gaussian and binary filter. Each pixel is associated with a distinct SNR and reconstruction filter bandwidth. A positive values informs on the denoising performance of the Gaussian filter over that of the binary one. Conversely, a negative value means the binary filter reaches higher RQF. The results are averaged over 100 realizations of noise. The black line delimits the positive and negative regions.</p> "> Figure 3
<p>RQFs obtained by using the Gaussian filter approach for various SNR and std for <math display="inline"><semantics> <mrow> <mi>L</mi> <mo>=</mo> <mn>20</mn> </mrow> </semantics></math>. The results are averaged over 100 realizations of noise.</p> "> Figure 4
<p>Spectrogram of the analyzed multicomponent signal.</p> "> Figure 5
<p>RMSE (in dB) of the IF (averaged over 100 realizations of noise) obtained with the different competing methods for the component <math display="inline"><semantics> <mrow> <mi>C</mi> <mn>1</mn> </mrow> </semantics></math> [<a href="#B23-sensors-23-00085" class="html-bibr">23</a>,<a href="#B24-sensors-23-00085" class="html-bibr">24</a>].</p> "> Figure 6
<p>Estimation example of a single component signal in the time-frequency plane using the competing methods respectively at SNR = −5 dB (<b>a</b>) and SNR = −10 dB (<b>b</b>).</p> "> Figure 7
<p>Amplitudes associated with components C1 (<b>left</b>), C2 (<b>middle</b>), and C3 (<b>right</b>).</p> "> Figure 8
<p><b>Left</b>: Spectrogram of a noiseless monocomponent signal whose frequency is not null in time indices <math display="inline"><semantics> <mrow> <mo>[</mo> <mn>200</mn> <mo>,</mo> <mn>400</mn> <mo>]</mo> </mrow> </semantics></math>. <b>Right</b>: Ground truth binary detection indicating where the signal amplitude is non-null.</p> "> Figure 9
<p>MAE between estimated detection array and ground truth for a linear chirp (truncated middle component in <a href="#sensors-23-00085-f004" class="html-fig">Figure 4</a>) by using <math display="inline"><semantics> <mrow> <mi>L</mi> <mo>=</mo> <mn>20</mn> </mrow> </semantics></math> (obtained with 100 realizations of noise).</p> "> Figure 10
<p>RQF obtained for each component in <a href="#sensors-23-00085-f004" class="html-fig">Figure 4</a> using compared methods for various SNRs with <math display="inline"><semantics> <mrow> <mi>L</mi> <mo>=</mo> <mn>20</mn> </mrow> </semantics></math>. The results are averaged over 100 realizations of noise [<a href="#B5-sensors-23-00085" class="html-bibr">5</a>].</p> "> Figure 11
<p>Estimation of the first <math display="inline"><semantics> <mrow> <mi>K</mi> <mo>=</mo> <mn>5</mn> </mrow> </semantics></math> signal components of the piping data using the proposed ABD method.</p> "> Figure 12
<p>Estimation of <math display="inline"><semantics> <mrow> <mi>K</mi> <mo>=</mo> <mn>3</mn> </mrow> </semantics></math> signal components of the bat record signal by using the proposed ABD method with <math display="inline"><semantics> <mrow> <mi>α</mi> <mo>=</mo> <mn>0.4</mn> <mo>,</mo> <mi>β</mi> <mo>=</mo> <mn>0.7</mn> </mrow> </semantics></math> with (<b>right</b>) and without (<b>left</b>) performing the detection.</p> ">
Abstract
:1. Introduction
- A novel pseudo-Bayesian estimation algorithm for ridge extraction based on an alternative variational objective allowing for efficient regularization is discussed.
- A new fast and reliable detection algorithm for determining the time support of each of the MCS frequency component is discussed.
- A new denoising strategy for signal reconstruction, mitigating the noise content present in frequency bands used for signal synthesis is discussed.
2. Problem Statement
3. Pseudo-Bayesian Analysis
3.1. Observation Model
3.2. Variational Objective
3.3. Estimation Strategy
Algorithm 1: Overall ridge extraction procedure using backward correction. |
Input: TFR , GRW mean and variance , Number of components K, g. Ouput: for each component. for do for do Compute by matching moments from Equtaion (10). Compute the pseudo-posterior from Equtaion (8). Perform MMSE estimation of . end for Repeat steps 5 to 7 iterating from Update the TFR by subtracting the kth ridge (TFR support set to 0). end for |
3.4. Alpha–Beta Divergence
3.5. Amplitude and Noise Estimation
4. Detection Algorithm
4.1. Alternative Model
4.2. Prior Models
4.3. Hypothesis Test
4.4. Derivation
4.5. Application to Multicomponent Signals
5. Robust Reconstruction
6. Numerical Experiments
6.1. Synthetic Data Analysis
6.2. Real-World Data
7. Conclusions
- 1.
- A new, robust, instantaneous frequency estimator has been proposed to perform estimation of the ridge position in the time-frequency plane accounting for the presence of spurious additional noise. The simple postulated observation model allows us to quickly infer estimates by sequentially extracting the instantaneous frequency associated with each component of the signal. The new variational objective that is proposed in this work controls together the balance robustness/efficiency and mode seeking/mass covering of the estimator.
- 2.
- An algorithm to perform signal detection in order to postprocess the instantaneous frequency estimation is based on a hypothesis test requiring amplitude and frequency noise-expectation estimates. We showed the ability of the proposed detection method to efficiently estimate the time instants when the signal is active, as well as the importance of the amplitude estimation performance for achieving a satisfying signal reconstruction.
- 3.
- We alleviate issues encountered when performing signal reconstruction from noisy frequency bands of the STFT. We finally present in this work two different denoising reconstruction approaches, involving, respectively, a simple extension of the proposed algorithm to apply on the signal SST, and the use of a nonbinary mask.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
AB | Alpha-Beta |
ABD | Alpha-Beta Divergence |
AM-FM | Amplitude-and Frequency-Modulated |
CE | Cross Entropy |
CWT | Continuous Wavelet Transform |
ELBO | Evidence Lower-Bound |
EMD | Empirical Mode Decomposition |
FFT | Fast Fourier Transform |
GRW | Gaussian Random Walk |
IF | Instantaneous Frequency |
KLD | Kullback-Leibler Divergence |
MAE | Mean Absolute Error |
MCS | MultiComponent Signal |
MLE | Maximum Likelihood Estimation |
MMSE | Minimum Mean Squared Error |
NB | Non-Binary |
PB | Pseudo-Bayesian |
RD | Ridge Detector |
RE | Rényi Entropy |
RQF | Reconstruction Quality Factor |
SNR | Signal-to-Noise Ratio |
std | standard deviation |
SSA | Singular Spectrum Analysis |
SST | Synchrosqueezing transform |
STFT | Short-Time Fourier Transform |
TFR | Time-Frequency Representation |
-D | -Divergence |
-D | -Divergence |
Appendix A. Short-Time Fourier Transform
Appendix B. Synchrosqueezing Transform
Appendix C. Derivation of the Presence Joint Posterior Probability
References
- Song, K.; Fang, J.; Zhang, L.; Chen, F.; Wan, J.; Xiong, N. An Intelligent Epileptic Prediction System Based on Synchrosqueezed Wavelet Transform and Multi-Level Feature CNN for Smart Healthcare IoT. Sensors 2022, 22, 6458. [Google Scholar] [CrossRef] [PubMed]
- Hou, H.; Yang, Z.; Pang, C. Rotor UAV’s Micro-Doppler Signal Detection and Parameter Estimation Based on FRFT-FSST. Sensors 2021, 21, 7314. [Google Scholar] [CrossRef] [PubMed]
- Flandrin, P. Explorations in Time-Frequency Analysis; Cambridge University Press: Cambridge, UK, 2018. [Google Scholar]
- Lin, L.; Xiaorui, Y.; Qingtang, J.; Bo, Z.; Li, J. Synchrosqueezing transform meets α-stable distribution: An adaptive fractional lower-order SST for instantaneous frequency estimation and non-stationary signal recovery. Signal Process. 2022, 201, 108683. [Google Scholar]
- Pirhonen, M.; Peltokangas, M.; Vehkaoja, A. Acquiring respiration rate from photoplethysmographic signal by recursive bayesian tracking of intrinsic modes in time-frequency spectra. Sensors 2018, 18, 1693. [Google Scholar] [CrossRef] [Green Version]
- Bayer, F.M.; Kozakevicius, A.J.; Cintra, R.J. An iterative Wavelet threshold for signal denoising. Signal Process. 2019, 162, 10–20. [Google Scholar] [CrossRef]
- Fourer, D.; Auger, F.; Peeters, G. Local AM/FM parameters estimation: Application to sinusoidal modeling and blind audio source separation. IEEE Signal Process. Lett. 2018, 25, 1600–1604. [Google Scholar] [CrossRef]
- Rilling, G.; Flandrin, P. One or two frequencies? The empirical mode decomposition answers. IEEE Trans. Signal Process. 2007, 56, 85–95. [Google Scholar] [CrossRef] [Green Version]
- Hassani, H. Singular spectrum analysis: Methodology and comparison. J. Data Sci. 2007, 5, 239–257. [Google Scholar] [CrossRef]
- Fourer, D.; Harmouche, J.; Schmitt, J.; Oberlin, T.; Meignen, S.; Auger, F.; Flandrin, P. The ASTRES Toolbox for Mode Extraction of Non-Stationary Multicomponent Signals. In Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, 28 August–2 September 2017; pp. 1170–1174. [Google Scholar]
- Laurent, N.; Meignen, S. A Novel Ridge Detector for Nonstationary Multicomponent Signals: Development and Application to Robust Mode Retrieval. IEEE Trans. Signal Process. 2021, 69, 3325–3336. [Google Scholar] [CrossRef]
- Lagrange, M.; Marchand, S.; Rault, J.B. Enhancing the tracking of partials for the sinusoidal modeling of polyphonic sounds. IEEE Trans. Audio Speech Lang. Process. 2007, 15, 1625–1634. [Google Scholar] [CrossRef]
- Chen, S.; Dong, X.; Peng, Z.; Zhang, W.; Meng, G. Nonlinear Chirp Mode Decomposition: A Variational Method. IEEE Trans. Signal Process. 2017, 65, 6024–6037. [Google Scholar] [CrossRef]
- Daubechies, I.; Maes, S. A nonlinear squeezing of the continuous wavelet transform. In Wavelets in Medicine and Biology; Routledge: London, UK, 1996; pp. 527–546. [Google Scholar]
- Meignen, S.; Pham, D.H.; Colominas, M.A. On the use of short-time Fourier transform and synchrosqueezing-based demodulation for the retrieval of the modes of multicomponent signals. Signal Process. 2021, 178, 107760. [Google Scholar] [CrossRef]
- Meignen, S.; Pham, D.H.; McLaughlin, S. On demodulation, ridge detection, and synchrosqueezing for multicomponent signals. IEEE Trans. Signal Process. 2017, 65, 2093–2103. [Google Scholar] [CrossRef] [Green Version]
- Flandrin, P. Time–frequency filtering based on spectrogram zeros. IEEE Signal Process. Lett. 2015, 22, 2137–2141. [Google Scholar] [CrossRef]
- Li, Z.; Martin, N. A Time-Frequency Based Method for the Detection and Tracking of Multiple Non-Linearly Modulated Components With Births and Deaths. IEEE Trans. Signal Process. 2016, 64, 1132–1146. [Google Scholar] [CrossRef] [Green Version]
- Zhou, P.; Dong, X.; Chen, S.; Peng, Z.; Zhang, W. Parameterized model based Short-time chirp component decomposition. Signal Process. 2018, 145, 146–154. [Google Scholar] [CrossRef]
- Legros, Q.; Fourer, D. A Novel Pseudo-Bayesian Approach for Robust Multi-Ridge Detection and Mode Retrieval. In Proceedings of the 2021 29th European Signal Processing Conference (EUSIPCO), Dublin, Ireland, 23–27 August 2021; pp. 1925–1929. [Google Scholar]
- Regli, J.B.; Silva, R. Alpha-beta divergence for variational inference. arXiv 2018, arXiv:1805.01045. [Google Scholar]
- Cichocki, A.; Cruces, S.; Amari, S.I. Generalized alpha-beta divergences and their application to robust nonnegative matrix factorization. Entropy 2011, 13, 134–170. [Google Scholar] [CrossRef]
- Oberlin, T.; Meignen, S.; Perrier, V. Second-order synchrosqueezing transform or invertible reassignment? Towards ideal time-frequency representations. IEEE Trans. Signal Process. 2015, 63, 1335–1344. [Google Scholar] [CrossRef] [Green Version]
- Millioz, F.; Martin, N. Circularity of the STFT and spectral kurtosis for time-frequency segmentation in Gaussian environment. IEEE Trans. Signal Process. 2010, 59, 515–524. [Google Scholar] [CrossRef]
- Bardenet, R.; Flamant, J.; Chainais, P. On the zeros of the spectrogram of white noise. Appl. Comput. Harmon. Anal. 2020, 48, 682–705. [Google Scholar] [CrossRef] [Green Version]
- Tzikas, D.G.; Likas, A.C.; Galatsanos, N.P. The variational approximation for Bayesian inference. IEEE Signal Process. Mag. 2008, 25, 131–146. [Google Scholar] [CrossRef]
- Li, Y.; Turner, R.E. Rényi Divergence Variational Inference. arXiv 2016, arXiv:1602.02311. [Google Scholar]
- Futami, F.; Sato, I.; Sugiyama, M. Variational Inference based on Robust Divergences. In International Conference on Artificial Intelligence and Statistics; Storkey, A.J., Pérez-Cruz, F., Eds.; PMLR: Canary Islands, Spain, 2018; Volume 84, pp. 813–822. [Google Scholar]
- Fujisawa, H.; Eguchi, S. Robust parameter estimation with a small bias against heavy contamination. J. Multivar. Anal. 2008, 99, 2053–2081. [Google Scholar] [CrossRef] [Green Version]
- Millioz, F.; Martin, N. Estimation of a white Gaussian noise in the Short Time Fourier Transform based on the spectral kurtosis of the minimal statistics: Application to underwater noise. In Proceedings of the 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, TX, USA, 14–19 March 2010; pp. 5638–5641. [Google Scholar]
- Malsiner-Walli, G.; Wagner, H. Comparing Spike and Slab Priors for Bayesian Variable Selection. Austrian J. Stat. 2016, 40, 241–264. [Google Scholar] [CrossRef]
- Tachella, J.; Altmann, Y.; McLaughlin, S.; Tourneret, J.Y. Fast surface detection in single-photon Lidar waveforms. In Proceedings of the 2019 27th European Signal Processing Conference (EUSIPCO), A Coruna, Spain, 2–6 September 2019; pp. 1–5. [Google Scholar]
- Brevdo, E.; Fuckar, N.S.; Thakur, G.; Wu, H.T. The Synchrosqueezing algorithm: A robust analysis tool for signals with time-varying spectrum. arXiv 2011, arXiv:1105.0010. [Google Scholar]
- Colominas, M.; Meignen, S.; Pham, D.H. Time-Frequency Filtering Based on Model Fitting in the Time-Frequency Plane. IEEE Signal Process. Lett. 2019, 26, 660–664. [Google Scholar] [CrossRef]
- Auger, F.; Flandrin, P.; Lin, Y.T.; McLaughlin, S.; Meignen, S.; Oberlin, T.; Wu, H.T. Time-frequency reassignment and synchrosqueezing: An overview. IEEE Signal Process. Mag. 2013, 30, 32–41. [Google Scholar] [CrossRef] [Green Version]
- Fourer, D.; Orlowska, A. Detection and Identification of Beehive Piping Audio Signals. In Proceedings of the 2022, Detection and Classification of Acoustic Scenes and Events Workshop (DCASE), Nancy, France, 3–4 November 2022; pp. 41–45. [Google Scholar]
- Fourer, D.; Auger, F.; Czarnecki, K.; Meignen, S.; Flandrin, P. Chirp Rate and Instantaneous Frequency Estimation: Application to Recursive Vertical Synchrosqueezing. IEEE Signal Process. Lett. 2017, 24, 1724–1728. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Legros, Q.; Fourer, D. Pseudo-Bayesian Approach for Robust Mode Detection and Extraction Based on the STFT. Sensors 2023, 23, 85. https://doi.org/10.3390/s23010085
Legros Q, Fourer D. Pseudo-Bayesian Approach for Robust Mode Detection and Extraction Based on the STFT. Sensors. 2023; 23(1):85. https://doi.org/10.3390/s23010085
Chicago/Turabian StyleLegros, Quentin, and Dominique Fourer. 2023. "Pseudo-Bayesian Approach for Robust Mode Detection and Extraction Based on the STFT" Sensors 23, no. 1: 85. https://doi.org/10.3390/s23010085