Computer Science > Sound

arXiv:2203.12188 (cs)

[Submitted on 23 Mar 2022 (v1), last revised 26 Mar 2022 (this version, v2)]

Title:FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Authors:Jun Chen, Zilin Wang, Deyi Tuo, Zhiyong Wu, Shiyin Kang, Helen Meng

View PDF

Abstract:Previously proposed FullSubNet has achieved outstanding performance in Deep Noise Suppression (DNS) Challenge and attracted much attention. However, it still encounters issues such as input-output mismatch and coarse processing for frequency bands. In this paper, we propose an extended single-channel real-time speech enhancement framework called FullSubNet+ with following significant improvements. First, we design a lightweight multi-scale time sensitive channel attention (MulCA) module which adopts multi-scale convolution and channel attention mechanism to help the network focus on more discriminative frequency bands for noise reduction. Then, to make full use of the phase information in noisy speech, our model takes all the magnitude, real and imaginary spectrograms as inputs. Moreover, by replacing the long short-term memory (LSTM) layers in original full-band model with stacked temporal convolutional network (TCN) blocks, we design a more efficient full-band module called full-band extractor. The experimental results in DNS Challenge dataset show the superior performance of our FullSubNet+, which reaches the state-of-the-art (SOTA) performance and outperforms other existing speech enhancement approaches.

Comments:	Accepted by ICASSP 2022
Subjects:	Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2203.12188 [cs.SD]
	(or arXiv:2203.12188v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2203.12188

Submission history

From: Jun Chen [view email]
[v1] Wed, 23 Mar 2022 04:33:09 UTC (315 KB)
[v2] Sat, 26 Mar 2022 19:20:53 UTC (313 KB)

Computer Science > Sound

Title:FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators