TW201532035A - Prediction-based FM stereo radio noise reduction - Google Patents
Prediction-based FM stereo radio noise reduction Download PDFInfo
- Publication number
- TW201532035A TW201532035A TW103103758A TW103103758A TW201532035A TW 201532035 A TW201532035 A TW 201532035A TW 103103758 A TW103103758 A TW 103103758A TW 103103758 A TW103103758 A TW 103103758A TW 201532035 A TW201532035 A TW 201532035A
- Authority
- TW
- Taiwan
- Prior art keywords
- signal
- received
- noise reduction
- parameter
- side signal
- Prior art date
Links
Landscapes
- Noise Elimination (AREA)
Abstract
Description
本文件係關於音頻訊號處理,特別是關於用於改善FM立體聲無線電接收器之音頻訊號的設備與對應的方法。特別是,本文件關於用於降低已接收的FM立體聲無線電訊號之雜訊的方法與系統。 This document relates to audio signal processing, and more particularly to apparatus and corresponding methods for improving the audio signal of an FM stereo radio receiver. In particular, this document relates to methods and systems for reducing noise in received FM stereo radio signals.
在類比FM(頻率調變)立體聲無線電系統中,無線電訊號之左聲道(L)與右聲道(R)以中側(M/S)代表來傳達,亦即為中聲道(M)與側聲道(S)。中聲道M對應L與R之和(sum)訊號,例如M=(L+R)/2,並且側聲道S對應L與R之差(difference)訊號,例如S=(L-R)/2。對於傳送,側聲道S調變上至38kHz抑制載波且相加至基帶中訊號M以形成向後相容(backwards-compatible)立體聲多工訊號(multiplex signal)。此多工基帶訊號接著用以調變FM傳送器之HF(高頻)載波,通常在87.5到108MHz之間的範圍中操作。 In an analog FM (frequency modulation) stereo radio system, the left channel (L) and the right channel (R) of the radio signal are conveyed in the middle (M/S) representation, that is, the center channel (M). With side channel (S). The middle channel M corresponds to the sum (sum) signal of L and R, for example, M=(L+R)/2, and the side channel S corresponds to a difference signal of L and R, for example, S=(LR)/2 . For transmission, the side channel S is modulated up to a 38 kHz rejection carrier and added to the baseband signal M to form a backwards-compatible stereo multiplex signal. This multiplexed baseband signal is then used to modulate the HF (high frequency) carrier of the FM transmitter, typically operating in the range between 87.5 and 108 MHz.
當接收品質減低時(亦即,在無線電頻道之 上的訊號對雜訊比(signal-to-noise)無線電減低),在傳送期間S聲道通常比M聲道變得更糟。在許多FM接收器實行中,當接收條件變得太嘈雜/太多雜訊(too noisy)時,S聲道則被遮音(mute)。此意味接收器從立體聲落回到單聲道(mono),以防低劣HF無線電訊號的形情(通常參照為單聲道降退(mono dropout))。 When the reception quality is reduced (ie, on the radio channel) The signal on the signal-to-noise radio is reduced. The S channel is usually worse than the M channel during transmission. In many FM receiver implementations, the S channel is muted when the reception conditions become too noisy/too noisy. This means that the receiver falls back from stereo to mono (mono) in case of poor HF radio signals (usually referred to as mono dropout).
即使假使中訊號M可接受品質的,側訊號S可為有雜訊的,因而當在輸出訊號之左及右聲道中進行混音(mix)(其例如依據L=M+S且R=M-S而導出)時,能嚴苛的降等全體的音街品質。當側訊號S僅具有對中間品質之不良時,有兩個選項:不是接收器選擇接收與側訊號S有關的雜訊且輸出包含有嘈雜的/有雜訊的左及右訊號之真實立體聲訊號,就是接收器將側訊號S下降且落回至單聲道。 Even if the medium signal M can accept quality, the side signal S can be noisy, so when mixing in the left and right channels of the output signal (for example, according to L = M + S and R = When MS is derived), it is possible to severely reduce the quality of the entire street. When the side signal S only has a bad quality for the intermediate quality, there are two options: the receiver does not choose to receive the noise related to the side signal S and outputs the real stereo signal containing the noisy/noisy left and right signals. That is, the receiver drops the side signal S and falls back to mono.
參數立體聲(PS:Parametric Stereo)寫碼(coding)為來自非常低位元率音頻寫碼之領域的技術。PS允許將2-聲道立體聲音頻訊號編碼(encoding)為與額外的PS側資訊,亦即PS參數,結合的單聲道縮混(downmix)訊號。獲得單聲道縮混訊號做為立體聲訊號之雙聲道的結合。PS參數致能PS解碼器從單聲道縮混訊號與PS側訊號重構立體聲訊號。通常來說,PS參數為時間-與頻率-變量,且在PS解碼器中之PD處理通常在混合型濾波帶域加上複數個正交鏡像濾波器(QMF;Quadrature Mirror Filter)排中實現。 Parametric Stereo (PS) is a technique from the field of very low bit rate audio writing. The PS allows the 2-channel stereo audio signal to be encoded as a mono downmix signal combined with additional PS side information, ie PS parameters. Obtain a mono downmix signal as a combination of two channels of stereo signals. The PS parameter enables the PS decoder to reconstruct the stereo signal from the mono downmix signal and the PS side signal. In general, the PS parameters are time-and-frequency-variables, and PD processing in the PS decoder is typically implemented in a hybrid filter band plus a plurality of Quadrature Mirror Filter (QMF) rows.
已在WO2011/029570,PCT/EP2011/064077及PCT/EP2011/064084中提出使用接收的FM立體聲訊號之PS編碼以為了降低包含在接收的FM立體聲訊號內的雜訊。參數立體聲(PS)為基的(Parametric Stereo based)FM立體聲無線電雜訊降低技術之一般原理係為使用從接收的FM立體聲訊號導出的參數立體聲參數,以為了降低包含在接收的左及右訊號中的雜訊。上述專利文件之揭露藉參考而併入。 The PS encoding using the received FM stereo signal has been proposed in WO 2011/029570, PCT/EP2011/064077 and PCT/EP2011/064084 in order to reduce the noise contained in the received FM stereo signal. The general principle of Parametric Stereo based FM stereo radio noise reduction technology is to use the parametric stereo parameters derived from the received FM stereo signal in order to reduce the left and right signals included in the reception. The noise. The disclosure of the above patent documents is incorporated by reference.
在本文件中,說明了使用預測式(prediction-based)框架的FM立體聲無線電雜訊降低之方法與系統。此預測式框架對如上所指之參數立體聲(PS)為基的框架係為替代的方法。如同將在本文件中說明的,預測式框架提供較低的計算複雜度(computational complexity)。進一步而言,已觀察到的是,同時預測式FM立體聲無線電雜訊降低方案達成相較於PS為基的FM立體聲無線電雜訊降低方案之改善的音頻品質。 In this document, a method and system for FM stereo radio noise reduction using a prediction-based framework is illustrated. This predictive framework is an alternative to the parametric stereo (PS) based framework as indicated above. As will be explained in this document, predictive frameworks provide lower computational complexity. Further, it has been observed that the simultaneous predictive FM stereo radio noise reduction scheme achieves improved audio quality compared to the PS-based FM stereo radio noise reduction scheme.
依據一態樣,說明組態成降低接收的多聲道FM無線電訊號之雜訊的設備或系統。多聲道FM無線電訊號可為兩個聲道立體聲訊號。特別是,接收的多聲道FM無線電訊號可能可表示為或可呈現為或表示中訊號與側訊號。進一步而言,側訊號可表示在立體聲訊號之左訊號與右訊號之間的差。 According to one aspect, a device or system configured to reduce noise of a received multi-channel FM radio signal is illustrated. The multi-channel FM radio signal can be a two-channel stereo signal. In particular, the received multi-channel FM radio signal may be represented or may be presented as or representing a medium signal and a side signal. Further, the side signal can indicate the difference between the left signal and the right signal of the stereo signal.
在一實施例中,設備包含參數決定單元,組態成決定表示接收的中訊號與接收的側訊號之間的相關(correlation)及/或解相關(decorrelation)的一或更多參數。一或更多參數可為預測參數a,用以從接收的中訊號決定雜訊降低側訊號之相關成分,及/或可為解相關參數b,用以從中訊號之解相關形式決定雜訊降低側訊號之解相關成分。進一步而言,設備包含雜訊降低單元,組態成使用一或更多參數從接收的中訊號產生雜訊降低側訊號。為了此目的,雜訊降低單元不將接收的側訊號(例如,接收的側訊號之取樣)計入考量。換句話說,接收的側訊號並非在用於雜訊降低側訊號之決定的訊號路徑中。特別是,雜訊降低單元可組態成僅從接收的中訊號(例如,接收的中訊號之取樣)及一或更多參數決定雜訊降低側訊號。 In an embodiment, the device includes a parameter decision unit configured to determine one or more parameters indicative of correlation and/or decorrelation between the received intermediate signal and the received side signal. The one or more parameters may be the prediction parameter a for determining the correlation component of the noise reduction side signal from the received medium signal, and/or may be the correlation parameter b for determining the noise reduction from the decoupling form of the medium signal. The relevant components of the side signal. Further, the device includes a noise reduction unit configured to generate a noise reduction side signal from the received medium signal using one or more parameters. For this purpose, the noise reduction unit does not take into account the received side signals (eg, samples of the received side signals). In other words, the received side signal is not in the signal path used for the decision of the noise reduction side signal. In particular, the noise reduction unit can be configured to determine the noise reduction side signal only from the received medium signal (eg, the received medium signal sample) and one or more parameters.
如上所指,參數決定單元可組態成決定預測參數a。預測參數a可表示在接收的中訊號與接收的側訊號之間的交叉相關(cross-correlation)。特別是,參數決定單元可組態成基於接收的中訊號與接收的側訊號之對應的取樣之乘積的期望值來決定預測參數a。甚至更特別的,參數決定單元可組態成使用公式a=E[S*M]/E[M*M]來決定預測參數a,其中E[.]意指期望運算元,S意指接收的側訊號以及M意指接收的中訊號。 As indicated above, the parameter decision unit can be configured to determine the prediction parameter a. The prediction parameter a may indicate a cross-correlation between the received intermediate signal and the received side signal. In particular, the parameter decision unit can be configured to determine the prediction parameter a based on the expected value of the product of the received medium signal and the corresponding sample of the received side signal. Even more particularly, the parameter decision unit can be configured to determine the prediction parameter a using the formula a=E[S*M]/E[M*M], where E[. ] means the desired operand, S means the received side signal and M means the received medium signal.
在參數決定單元提供預測參數a之情形中,雜訊降低單元可組態成使用預測參數a從接收的中訊號產 生雜訊降低側訊號(或雜訊降低側訊號之相關成分)。雜訊附低側訊號對相關成分可決定為預測參數a與接收的中訊號之乘積,亦即a*M。此意味雜訊降低側訊號之相關成分可為接收的中訊號之加權形式。有鑒於預測參數a可為時間變量及/或頻率變量的事實,對接收的中訊號之加權因子(weighting factor)可為時間變量及/或頻率變量。 In the case where the parameter decision unit provides the prediction parameter a, the noise reduction unit can be configured to use the prediction parameter a to generate from the received signal. The noise is reduced by the side signal (or the noise is reduced by the side signal). The noise side low signal pair correlation component can be determined as the product of the prediction parameter a and the received medium signal, that is, a*M. This means that the relevant component of the noise reduction side signal can be a weighted form of the received medium signal. In view of the fact that the prediction parameter a can be a time variable and/or a frequency variable, the weighting factor for the received medium signal can be a time variable and/or a frequency variable.
參數決定單元可組態成決定表示接收的中訊號與接收的側訊號之間之解相關的解相關參數b。特別是,參數決定單元可組態成基於接收的側訊號與使用預測參數a從中訊號決定的訊號之差訊號之能量來決定解相關參數b。甚至更特別的是,參數決定單元可組態成使用公式b=sqrt(E[D*D]/E[M*M]),具有D=S-a*M做為差訊號,以決定解相關參數b。運算元「sqrt()」指示平方根運算。 The parameter decision unit can be configured to determine a decorrelation parameter b indicative of a decorrelation between the received intermediate signal and the received side signal. In particular, the parameter decision unit can be configured to determine the decorrelation parameter b based on the received side signal and the energy of the difference signal of the signal determined from the medium signal using the prediction parameter a. Even more particularly, the parameter decision unit can be configured to use the formula b=sqrt(E[D*D]/E[M*M]) with D=Sa*M as the difference signal to determine the decorrelation parameters b. The operand "sqrt()" indicates the square root operation.
在此情形下,雜訊降低單元可組態成使用解相關參數b從接收的中訊號之解相關形式產生雜訊降低側訊號(或雜訊降低側訊號之解相關成分)。特別是,雜訊降低側訊號之解相關成分可決定為b*decorr(M),具有decorr(M)做為接收的中訊號之解相關形式。接收的中訊號之解相關形式可藉使用全通濾波器(all-pass filter)將接收的中訊號濾波來決定。 In this case, the noise reduction unit can be configured to use the decorrelation parameter b to generate a noise reduction side signal (or a decorrelated component of the noise reduction side signal) from the decorrelated form of the received medium signal. In particular, the decorrelation component of the noise reduction side signal can be determined as b*decorr(M), with decorr(M) as the decoupling form of the received medium signal. The de-correlated form of the received medium signal can be determined by filtering the received medium signal by using an all-pass filter.
若接收的側訊號包含大量的雜訊,其可有利的降低在雜訊降低側訊號上雜訊降低側訊號之解相關成分的影響。為此目的,參數決定單元可組態成決定接收的側 訊號之頻譜平坦度(spectral flatness)(或表示接收的側訊號之頻譜平坦度)的影響因子(impact factor)特性。高頻譜平坦度通常指示包含在側訊號內的高程度的雜訊。如此,解相關參數b可取決於影響因子。特別是,當影響因子指示接收的訊號之頻譜平坦度的增加程度時,解相關參數b可減少。藉範例的方式來說,影響因子為在本文件中說明的SMF_impact_factor且修改的解相關參數b_new則決定為b_new=(1-SMF_impact_factor)*b,藉此迫使雜訊低側訊號之解相關成分(亦即,b_new*decorr(M))到零,若SMF-impact_factor趨於朝向「1」的話。 If the received side signal contains a large amount of noise, it can advantageously reduce the influence of the decorrelated component of the noise reduction side signal on the noise reduction side signal. For this purpose, the parameter decision unit can be configured to determine the receiving side The impact factor characteristic of the spectral flatness of the signal (or the spectral flatness of the received side signal). High spectral flatness typically indicates a high degree of noise contained within the side signal. As such, the decorrelation parameter b can depend on the impact factor. In particular, when the influence factor indicates the degree of increase in the spectral flatness of the received signal, the decorrelation parameter b can be reduced. By way of example, the impact factor is the SMF_impact_factor described in this document and the modified decorrelation parameter b_new is determined to be b_new=(1-SMF_impact_factor)*b, thereby forcing the decorrelation component of the low-side signal of the noise ( That is, b_new*decorr(M)) to zero, if the SMF-impact_factor tends to face "1".
如上所指,參數決定單元可組態成時間變量方式決定一或多個參數(例如,預測參數a及/或解相關參數b)。如此一來,對於一或更多參數之各者,可決定對於時間間隔之對應序列的分別參數之序列。藉範例的方式來說,對於第一參數(例如,預測參數a或解相關參數b),決定對於時間間隔之序列的第一參數之序列。時間間隔之序列可為訊號時框(其包含例如2048個訊號取樣)之序列。通常來說,使用接收的中訊號及/或接收的側訊號(其在於特定的時間間隔內)之取樣來決定對於時間間隔之序列中特定時間間隔的第一參數之序列中特定的第一參數。在一或更多參數為時間變量的情形中,雜訊降低單元可組態成使用一或更多時間變量參數產生雜訊降低側訊號。 As indicated above, the parameter decision unit can be configured to determine one or more parameters (eg, prediction parameter a and/or decorrelation parameter b) in a time variable manner. As such, for each of one or more parameters, a sequence of separate parameters for the corresponding sequence of time intervals can be determined. By way of example, for a first parameter (eg, prediction parameter a or decorrelated parameter b), a sequence of first parameters for a sequence of time intervals is determined. The sequence of time intervals can be a sequence of signal time frames (which include, for example, 2048 signal samples). In general, the sampling of the received intermediate signal and/or the received side signal (which is within a particular time interval) is used to determine a particular first parameter in the sequence of first parameters for a particular time interval in the sequence of time intervals. . In the case where one or more of the parameters are time variables, the noise reduction unit can be configured to generate a noise reduction side signal using one or more time variable parameters.
為了確保在相鄰時間間隔之間的連續性,並且為了避免在相鄰時間間隔之邊界處聽覺得不連續,可能有利的是,藉由從第一參數之序列內插相鄰的第一參數來決定內插的第一參數之序列。 In order to ensure continuity between adjacent time intervals, and to avoid hearing discontinuities at the boundaries of adjacent time intervals, it may be advantageous to interpolate adjacent first parameters by sequence from the first parameter To determine the sequence of the first parameter of the interpolation.
在高度惡化的接收條件之情形中,FM接收器可迫使接收的FM無線電訊號到單聲道,亦即FM接收器可抑制接收的側訊號。設備可組態成偵測這類單聲道降退(dropout),亦即設備可組態成偵測到接收的多聲道FM無線電訊號為強迫的單聲道訊號。這可藉偵測從高能量到低能量之接收的側訊號之快速轉移(transition)來達成。特別是,可決定在時間間隔之序列中的第一時間間隔內接收的側訊號之能量,且可判定出此能量高於高臨界。進一步而言,可決定許多追隨連續的時間間隔之轉移周期,於其期間側訊號之能量從高於高臨界降落到低於低臨界之值。基於此資訊,若轉移周期之連續時間間隔之數目低於間隔臨界,可判定追隨第一時間間隔之接收的多聲道FM無線電訊號為強迫的單聲道訊號。此間隔臨界可為追隨第一時間間隔之1、2、3或4個時間間隔。 In the case of highly degraded reception conditions, the FM receiver can force the received FM radio signal to mono, ie the FM receiver can suppress the received side signal. The device can be configured to detect such mono dropouts, ie the device can be configured to detect that the received multi-channel FM radio signal is a forced mono signal. This can be achieved by detecting a fast transition of the side signal from high energy to low energy reception. In particular, the energy of the side signal received during the first time interval in the sequence of time intervals can be determined and can be determined to be higher than the high threshold. Further, a number of transition periods following a continuous time interval may be determined during which the energy of the side signal falls from above the high threshold to below the low threshold. Based on this information, if the number of consecutive time intervals of the transition period is lower than the interval threshold, it can be determined that the received multi-channel FM radio signal following the first time interval is a forced mono signal. This interval threshold can be 1, 2, 3 or 4 time intervals following the first time interval.
若偵測出的是在(直接)追隨第一時間間隔的時間間隔中接收的多聲道FM無線電訊號為強迫的單聲道訊號,參數決定單元可組態成從用於第一時間間隔之一或更多參數決定用於(直接)追隨第一時間間隔之時間間隔的一或更多參數。換句話說,參數決定單元可組態成藉使用在單聲道降退前決定的一或更多參數隱蔽在單聲道降 退期間參數之缺乏。 If it is detected that the multi-channel FM radio signal received in the time interval (directly) following the first time interval is a forced mono signal, the parameter decision unit can be configured to be used for the first time interval. One or more parameters determine one or more parameters for (directly) following the time interval of the first time interval. In other words, the parameter decision unit can be configured to be concealed in mono down by using one or more parameters determined before the mono drop The lack of parameters during the withdrawal period.
如上所述,參數決定單元可組態成頻率變量方式決定一或更多參數(例如,預測參數a及/或解相關參數b)。此意味決定不同的參數以用於接收的中及/或側訊號之不同的子帶。為此目的,設備可包含中變換單元,其組態成從接收的中訊號產生涵蓋對應複數個頻率範圍的複數個中子帶訊號(mid subband signal)。進一步而言,設備可包含側變換單元,其組態成從接收的側訊號產生涵蓋對應複數個頻率範圍的複數個側子帶訊號(side subband signal)。在這類情形中,參數決定單元可組態成決定用於複數個頻率範圍之各者的一或更多參數。特別是,對於一或更多參數之第二者(例如,預測參數a及/或解相關參數b),複數個第二子帶參數可從對應複數個中子帶訊號及對應複數個側子帶訊號來決定。此可藉將上述用於決定一或更多參數(例如,預測參數a或解相關參數b)應用到複數個頻率範圍之各者的公式來完成。 As described above, the parameter decision unit can be configured to determine one or more parameters (eg, prediction parameter a and/or decorrelation parameter b) in a frequency variable manner. This means that different parameters are determined for the different sub-bands of the received medium and/or side signals. To this end, the device may include a mid-conversion unit configured to generate a plurality of mid subband signals covering the corresponding plurality of frequency ranges from the received intermediate signal. Further, the device can include a side transform unit configured to generate a plurality of side subband signals covering the plurality of frequency ranges from the received side signals. In such cases, the parameter decision unit can be configured to determine one or more parameters for each of the plurality of frequency ranges. In particular, for a second one or more parameters (eg, prediction parameter a and/or decorrelation parameter b), the plurality of second sub-band parameters may correspond to a plurality of neutron band signals and corresponding plurality of side parameters Take the signal to decide. This can be done by applying the above formula for determining one or more parameters (eg, prediction parameter a or decorrelation parameter b) to each of a plurality of frequency ranges.
雜訊降低單元可組態成使用一或更多頻率變量參數產生雜訊降低側訊號。特別是,雜訊降低單元可組態成(僅)從對應複數個中子帶訊號及對應複數個子帶參數來產生複數個雜訊降低的側子帶訊號。使用反變換(inverse transformation)單元,可從複數個雜訊降低側子帶訊號產生雜訊降低的側訊號。 The noise reduction unit can be configured to generate noise reduction side signals using one or more frequency variable parameters. In particular, the noise reduction unit can be configured to generate (a) only a plurality of noise reduction side subband signals from the corresponding plurality of neutron band signals and corresponding plurality of subband parameters. Using the inverse transformation unit, the side signal of the noise reduction can be generated by reducing the side subband signals from a plurality of noises.
中變換單元及/或側變換單元可為QMF濾波器排(filter bank)且反變換單元可為反QMF濾波器排。 有鑒於接收的中訊號係在訊號路徑中(且接收的側訊號未在訊號路徑中)的事實,側變換單元可滿足比中變換單元更低的需求在對於至少下列其中之一的項目而言:頻率選擇性;頻率解析度(frequency resolution);時間解析度(time resolution);以及數值精度(numerical accuracy)。 The medium transform unit and/or the side transform unit may be a QMF filter bank and the inverse transform unit may be an inverse QMF filter bank. In view of the fact that the received medium signal is in the signal path (and the received side signal is not in the signal path), the side transform unit can meet lower requirements than the medium transform unit in terms of at least one of the following items. : frequency selectivity; frequency resolution; time resolution; and numerical accuracy.
接收的FM無線電訊號可由嘈雜的接收的側訊號所支配,其具有比接收的中訊號更高的能階。當使用一或更多參數產生來自接收的中訊號之雜訊降低側訊號時,這類的情況可導致感知上惱人的人為因素(artifact)。為了應付這類情況,參數決定單元可組態成藉將限制因子c應用到一或更多參數a來限制一或更多參數。特別是,一或更多參數可除以限制因子c。在實施例中,對於c>1,限制因子c與一或更多平方的參數(squared parameter)之和成比例。在另一實施例中,對於c>1,限制因子c與一或更多平方的參數之和的方根成比例。通常來說,選擇限制因子c使得限制因子c之應用並不增加一或更多參數。 The received FM radio signal can be dominated by the noisy received side signal, which has a higher energy level than the received medium signal. Such situations can lead to perceptually annoying artifacts when one or more parameters are used to generate noise reduction side signals from the received intermediate signal. To cope with such situations, the parameter decision unit can be configured to limit one or more parameters by applying a limit factor c to one or more parameters a. In particular, one or more parameters can be divided by a limiting factor c. In an embodiment, for c > 1, the limiting factor c is proportional to the sum of one or more squared parameters. In another embodiment, for c > 1, the limiting factor c is proportional to the square root of the sum of one or more squared parameters. In general, the selection of the limit factor c is such that the application of the limit factor c does not increase one or more parameters.
應注意,設備可包含延遲單元,其組態成將該接收的中訊號(之取樣)延遲了對應到需要產生雜訊降低側訊號(之對應的取樣)的計算時間之時間量。 It should be noted that the device may include a delay unit configured to delay the received intermediate signal (sampling) by an amount of time corresponding to the computation time required to generate the noise reduction side signal (the corresponding sample).
在當接收得側訊號包含幾乎沒有雜訊時之良好接收條件中,使用接收的側訊號以用於產生立體聲訊號是有利的。為此目的,設備可包含結合單元,組態成使用 表示接收的多聲道FM無線電訊號之品質的品質指示器自雜訊降低立體聲訊號以及接收的側訊號決定修改的雜訊降低側訊號。取決於接收的側訊號之品質,修改的雜訊降低側訊號可在雜訊降低側訊號與接收的側訊號之間(或選自雜訊降低側訊號與接收的側訊號或插入在雜訊降低側訊號與接收的側訊號之間)混合。為此目的,結合單元可包含雜訊降低增益單元,組態成使用雜訊降低增益將雜訊降低側訊號加權;旁通增益單元,組態成使用旁通增益將接收的側訊號加權;以及合併單元,組態成合併(例如,相加)加權的雜訊降低側訊號和加權的接收的側訊號;其中雜訊降低增益與旁通增益係取決於品質指示器。應注意,結合單元可組態成在頻率選擇方式決定修改的雜訊降低側訊號。 In a good reception condition when the received side signal contains almost no noise, it is advantageous to use the received side signal for generating a stereo signal. For this purpose, the device can comprise a combined unit configured for use A quality indicator indicating the quality of the received multi-channel FM radio signal from the noise reduction stereo signal and the received side signal determines the modified noise reduction side signal. Depending on the quality of the received side signal, the modified noise reduction side signal can be between the noise reduction side signal and the received side signal (or selected from the noise reduction side signal and the received side signal or inserted in the noise reduction). The side signal is mixed with the received side signal. To this end, the combining unit may include a noise reduction gain unit configured to use noise reduction gain to reduce noise side signal weighting; a bypass gain unit configured to weight the received side signal using a bypass gain; The merging unit is configured to combine (eg, add) the weighted noise reduction side signal and the weighted received side signal; wherein the noise reduction gain and the bypass gain are dependent on the quality indicator. It should be noted that the combining unit can be configured to determine the modified noise reduction side signal in a frequency selective manner.
設備可包含品質決定單元,組態成決定指示接收的側訊號之品質的品質指示器。此可藉判定接收的中訊號之功率(參照為中功率),以及接收的側訊號之功率(參照為側功率)。可決定中功率和側功率之比率,亦即中對側比率(mid-to-side ratio),且基於至少中對側比率可決定接收的FM無線電訊號之品質指示器。本文件說明用於以可信賴的方式決定指示接收的側訊號品質之品質指示器α HQ 的各種實施例。 The device may include a quality decision unit configured to determine a quality indicator indicative of the quality of the received side signal. This can be determined by the power of the received medium signal (referred to as the medium power) and the power of the received side signal (referred to as the side power). The ratio of the medium power to the side power, that is, the mid-to-side ratio, can be determined, and the quality indicator of the received FM radio signal can be determined based on at least the mid-to-side ratio. This document describes various embodiments for determining the quality indicator α HQ indicative of the received side signal quality in a trusted manner.
設備可更包含MS-to-LR轉換器,組態成從接收的中訊號與雜訊降低側訊號(或修改的雜訊降低側訊號)決定雜訊降低左訊號和雜訊降低右訊號。特別是, MS-to-LR轉換器可組態成從接收的中訊號與(修改的)雜訊降低側訊號之和決定雜訊降低左訊號;以及從接收的中訊號與(修改的)雜訊降低側訊號之差決定雜訊降低右訊號。 The device may further include an MS-to-LR converter configured to determine noise from the received middle signal and noise reduction side signal (or modified noise reduction side signal) to reduce the left signal and the noise to reduce the right signal. especially, The MS-to-LR converter can be configured to reduce the left signal from the sum of the received medium signal and the (modified) noise reduction side signal; and from the received medium signal and (modified) noise reduction side The difference between the signals determines the noise to reduce the right signal.
依據另一態樣,說明用於降低接收的多聲道FM無線電訊號之雜訊的方法。接收的多聲道FM無線電訊號可能可呈現為接收的中訊號與接收的側訊號。方法可包含在接收的中訊號與接收的側訊號之間決定表示相關及/或解相關之一或更多參數;以及使用此一或更多參數從接收的中訊號而非從接收的側訊號產生雜訊降低側訊號。 According to another aspect, a method for reducing noise of a received multi-channel FM radio signal is illustrated. The received multi-channel FM radio signal may be presented as a received medium signal and a received side signal. The method can include determining, between the received intermediate signal and the received side signal, one or more parameters indicative of correlation and/or decorrelation; and using the one or more parameters from the received intermediate signal instead of the received side signal Generate noise to reduce the side signal.
依據另一態樣,係說明軟體程式。軟體程式可適於在處理器上執行且當在計算裝置上實現時用於施行在本文件中所概述的方法步驟。 According to another aspect, the software program is described. The software program can be adapted to execute on a processor and, when implemented on a computing device, to perform the method steps outlined in this document.
依據另一態樣,係說明儲存媒體。儲存媒體可包含軟體程式,適於在處理器上執行且當在計算裝置上實現時用於施行在本文件中所概述的方法步驟。 According to another aspect, the storage medium is illustrated. The storage medium may include a software program adapted to be executed on the processor and, when implemented on a computing device, for performing the method steps outlined in this document.
依據另一態樣,係說明電腦程式產品。電腦程式可包含可執行指令,適於在處理器上執行且當在電腦上實現時用於施行在本文件中所概述的方法步驟。 According to another aspect, a computer program product is described. The computer program can include executable instructions adapted to be executed on the processor and, when implemented on a computer, for performing the method steps outlined in this document.
應注意,如在本專利申請案中所概述包括他們較佳的實施例之方法和系統可獨立使用或與在本文件中揭示的其它方法和系統結合。進一步而言,在本專利申請案中概述的方法和系統之所有態樣可隨意結合。特別是,申請專利範圍之特徵可以隨意方式與彼此結合。 It should be noted that the methods and systems including their preferred embodiments as outlined in this patent application can be used independently or in combination with other methods and systems disclosed in this document. Further, all aspects of the methods and systems outlined in this patent application can be combined at will. In particular, the features of the scope of the patent application can be combined with each other in an arbitrary manner.
1‧‧‧接收器 1‧‧‧ Receiver
2‧‧‧設備 2‧‧‧ Equipment
3‧‧‧PS參數估計單元 3‧‧‧PS parameter estimation unit
4‧‧‧升混單元 4‧‧‧Upmixing unit
5‧‧‧PS參數 5‧‧‧PS parameters
7‧‧‧PS編碼器 7‧‧‧PS encoder
8‧‧‧部分的PS解碼器 8‧‧‧ part of the PS decoder
9‧‧‧縮混產生單元 9‧‧‧Shrink mixing unit
10‧‧‧解相關器 10‧‧ ‧Resolver
20‧‧‧偵測單元 20‧‧‧Detection unit
30‧‧‧旁通增益單元 30‧‧‧Bypass gain unit
31‧‧‧雜訊降低增益單元 31‧‧‧ Noise reduction gain unit
32‧‧‧合併單元 32‧‧‧Merge unit
71‧‧‧濾波器排 71‧‧‧Filter row
72‧‧‧濾波器排 72‧‧‧ Filter row
73‧‧‧濾波器排 73‧‧‧Filter row
74‧‧‧延遲 74‧‧‧Delay
75‧‧‧LR-to-MS轉換器 75‧‧‧LR-to-MS converter
76‧‧‧MS-to-LR轉換器 76‧‧‧MS-to-LR converter
77‧‧‧參數決定單元 77‧‧‧Parameter decision unit
78‧‧‧解相關器 78‧‧‧Resolver
79‧‧‧雜訊降低單元 79‧‧‧ Noise Reduction Unit
本發明藉由闡述的範例參照所附圖式之方式解釋於下,其中圖1闡述對於用於改善FM立體聲無線電接收器之立體聲輸出的系統的示意範例;圖2闡述基於參數立體聲之概念的音頻處理設備之範例;圖3闡述基於預測之概念之音頻處理設備之範例;圖4繪示用於嘈雜FM無線電語音訊號之中及側訊號的範例功率譜;圖5闡述使用接收的FM無線電訊號之品質指示器處理接收的FM無線電訊號的方法整流程圖;以及圖6繪示用於隱蔽預測和解相關參數的範例狀態機。 The invention is explained by way of example with reference to the accompanying drawings, in which FIG. 1 illustrates a schematic example of a system for improving the stereo output of an FM stereo radio receiver; FIG. 2 illustrates an audio based on the concept of parametric stereo Example of a processing device; Figure 3 illustrates an example of an audio processing device based on the concept of prediction; Figure 4 illustrates an example power spectrum for a noisy FM radio voice signal and side signals; Figure 5 illustrates the use of a received FM radio signal A method flow diagram for processing a received FM radio signal; and FIG. 6 illustrates an example state machine for concealing prediction and decorrelation parameters.
圖1繪示用於改善FM立體聲無線電接收器1之立體聲輸出的示意範例系統。如在本案之先前技術段落所討論的,在FM無線電中立體聲訊號藉作為中訊號M和側訊號S之設計而傳送。在FM接收器1中,側訊號用以在FM接收器1之輸出處建立左訊號L與右訊號R之間的立體聲差(至少當在接收足夠良好且側訊號資訊未被遮音時)。換句話說,側訊號用以從中訊號建立左及右音頻訊號。左及右訊號L、R可為數位或類比訊號。 FIG. 1 illustrates a schematic example system for improving the stereo output of an FM stereo radio receiver 1. As discussed in the prior art paragraph of this case, the stereo signal is transmitted in the FM radio as the design of the medium signal M and the side signal S. In the FM receiver 1, the side signal is used to establish a stereo difference between the left signal L and the right signal R at the output of the FM receiver 1 (at least when the reception is good enough and the side signal information is unmasked). In other words, the side signal is used to establish left and right audio signals from the middle signal. The left and right signals L and R can be digital or analog signals.
為了改善FM接收器之左及右音頻訊號L、R,可使用在其輸出產生立體聲音頻信號L’及R’的音頻處理設備2。使用參數立體聲可致能音頻處理設備2來施行接收的FM無線能訊號之雜訊降低。或者,使用如在本文件所說明使用預測式參數化可致能音頻處理設備2來施行接收的FM無線電訊號之雜訊降低。 In order to improve the left and right audio signals L, R of the FM receiver, an audio processing device 2 that produces stereo audio signals L' and R' at its output can be used. The parametric stereo can be used to enable the audio processing device 2 to perform the noise reduction of the received FM wireless energy signal. Alternatively, the noise reduction of the received FM radio signal can be performed using the predictive parametric enablement audio processing device 2 as described in this document.
在設備2中的音頻處理係較佳的在數位域中施行;因此,在FM接收器1與音頻處理設備2之間的類比界面的情形中,於在設備2中的數位音頻處理之前則使用類比對數位轉換器(analog-to-digital converter)。FM接收器1和音頻處理設備2可在相同的半導體晶片上整合或可為兩個半導體晶片之部分。FM接收器1和音頻處理設備2能為無線通訊裝置之部分,諸如行動電話(cellular telephone)、個人數位助理(PDA;personal digital assistant)及智慧型電話。在此情形中,FM接收器1可為具有額外FM無線電接收器功能的基帶(baseband)晶片之部分。在另一應用中,FM接收器1和音頻處理設備2能為交通工具音頻系統(vehicle audio system)之部分以對於移動的交通工具之變化接收條件補償。 The audio processing in the device 2 is preferably implemented in the digital domain; therefore, in the case of an analog interface between the FM receiver 1 and the audio processing device 2, it is used before the digital audio processing in the device 2 Analog-to-digital converter. The FM receiver 1 and the audio processing device 2 may be integrated on the same semiconductor wafer or may be part of two semiconductor wafers. The FM receiver 1 and the audio processing device 2 can be part of a wireless communication device, such as a cellular telephone, a personal digital assistant (PDA), and a smart phone. In this case, the FM receiver 1 can be part of a baseband chip with additional FM radio receiver functionality. In another application, the FM receiver 1 and the audio processing device 2 can be part of a vehicle audio system to compensate for variations in receiving conditions for the moving vehicle.
取代在FM接收器1之輸出和設備2之輸入處使用左/右代表,可在FM接收器1與設備2之間的介面處使用中/側代表(請看在圖1中之M、S用於中/側代表以及L、R用於左/右代表)。在FM接收器1與設備2之 間的介面處這類中/側代表可造成降低的處理負載,其係由於FM接收器1已接收中/側信號且音頻處理設備2可直接處理中/側信號而不用縮混或不用L/R對M/S轉換。假若FM接收器1緊密的與音頻處理設備2整合的話,特別是假若FM接收器1和音頻處理設備2在相同的裝置(例如,相同的半導體晶片)上整合的話,中/側代表可為有利的。 Instead of using the left/right representation at the output of the FM receiver 1 and the input of the device 2, a mid/side representation can be used at the interface between the FM receiver 1 and the device 2 (see M, S in Figure 1). Used for the mid/side representation and L, R for the left/right representation). At FM receiver 1 and device 2 Such a mid/side representation at the interface may result in a reduced processing load since the FM receiver 1 has received the mid/side signal and the audio processing device 2 can directly process the mid/side signal without downmixing or L/ R to M/S conversion. If the FM receiver 1 is tightly integrated with the audio processing device 2, especially if the FM receiver 1 and the audio processing device 2 are integrated on the same device (for example, the same semiconductor wafer), the mid/side representation may be advantageous. of.
可選擇的是,無線電訊號強度訊號6指示無線電接收條件可用於在音頻處理設備2中適應音頻處理。 Alternatively, the radio signal strength signal 6 indicates that the radio reception condition is available for adaptation to audio processing in the audio processing device 2.
FM無線電接收器1與音頻處理設備2之結合對應具有整合雜訊降低系統的FM無線電接收器。 The combination of the FM radio receiver 1 and the audio processing device 2 corresponds to an FM radio receiver with an integrated noise reduction system.
圖2繪示基於參數立體聲的音頻處理設備2之實施例。設備2包含PS參數估計單元3。參數估計單元3組態成基於要改善的輸入音頻訊號決定PS參數5(其可為不是以左/右就是中/側代表)。PS參數5可包括(還有別的參數)指示聲道間強度差(IID或亦所謂CLD-聲道級數差(channel level differences))的參數及/或指示聲道間交叉相關(ICC;inter-channel cross-correlation)的參數。較佳的,PS參數5為時間和頻率變量。在參數估計單元3之輸入處之M/S代表的情形中,然而藉應用L/R聲道之適當的轉換,參數估計單元3可決定關於L/R聲道的PS參數5。 2 illustrates an embodiment of an audio processing device 2 based on parametric stereo. The device 2 comprises a PS parameter estimation unit 3. The parameter estimation unit 3 is configured to determine the PS parameter 5 (which may or may not be left/right or medium/side representative) based on the input audio signal to be improved. PS parameter 5 may include (and other parameters) parameters indicating inter-channel intensity differences (IID or also CLD-channel level differences) and/or indicating inter-channel cross-correlation (ICC; Inter-channel cross-correlation) parameters. Preferably, PS parameter 5 is a time and frequency variable. In the case of the M/S representation at the input of the parameter estimation unit 3, however, by applying an appropriate conversion of the L/R channel, the parameter estimation unit 3 can decide the PS parameter 5 for the L/R channel.
縮混音頻訊號DM係得自輸入訊號。萬一輸入音頻訊號已使用中/側代表,縮混音頻信號DM可直接 對應中訊號。萬一輸入音頻訊號具有左/右代表,音頻訊號可藉在縮混產生單元9中縮混音頻訊號產生。較佳的,在縮混之後之結果訊號滿足中訊號M且可由下列方程式產生:DM=(L+R)/d,例如具有d=2,亦即縮混訊號DM滿足L與R之平均。對於不同的比例因子(scaling factor)d之值,L與R訊號之平均則被放大或衰減。縮混產生單元9與參數估計單元3為PS編碼器7之部分。 The downmix audio signal DM is derived from the input signal. In case the input audio signal has been used in the middle/side representation, the downmix audio signal DM can be directly Corresponding to the middle signal. In case the input audio signal has a left/right representation, the audio signal can be generated by downmixing the audio signal in the downmix generating unit 9. Preferably, the result signal after the downmixing satisfies the middle signal M and can be generated by the following equation: DM=(L+R)/d, for example, having d=2, that is, the downmix signal DM satisfies the average of L and R. For different values of the scaling factor d, the average of the L and R signals is amplified or attenuated. The downmix generating unit 9 and the parameter estimating unit 3 are part of the PS encoder 7.
設備更包含升混(upmix)單元4,其亦稱為立體聲混音模組或立體聲升混器(upmixer)。升混單元4係組態成基於音頻訊號DM及PS參數5產生立體聲訊號L’、R’。較佳的,升混單元4不僅使用DM訊號亦使用側訊號So(其通常對應到原始接收的側訊號S)或使用解相關器10從縮混訊號DM產生的假側訊號(pseudo side signal)S*。解相關器10接收單聲道縮混DM且產生解相關的訊號S*,其被使用為假側訊號。解相關器10係由如在文件「『Low Complexity Parametric Stereo Coding in MPEG-4』,Heiko Purnhagen,Proc.Digital Audio Effects Workshop(DAFx),pp.163-168,Naples,IT,Oct.2004」之章節4中所討論之合適的全通濾波器所實現。其參數立體聲之討論,特別是關於參數立體聲參數之決定,且特別是在區段4中,係特此藉參考併入。立體聲混音矩陣4可為2x2升混矩陣,其從訊號DM及So或S*產生立體聲訊號 L’、R’。升混單元4及解相關器10為部分的PS解碼器8。 The device further includes an upmix unit 4, which is also referred to as a stereo mixing module or a stereo upmixer. The upmixing unit 4 is configured to generate stereo signals L', R' based on the audio signal DM and the PS parameter 5. Preferably, the upmixing unit 4 uses not only the DM signal but also the side signal S o (which usually corresponds to the original received side signal S) or the pseudo side signal generated by the decorrelator 10 from the downmix signal DM (pseudo side signal). )S*. The decorrelator 10 receives the mono downmix DM and generates a decorrelated signal S*, which is used as a false side signal. The decorrelator 10 is as described in the document "Low Complexity Parametric Stereo Coding in MPEG-4", Heiko Purnhagen, Proc. Digital Audio Effects Workshop (DAFx), pp. 163-168, Naples, IT, Oct. 2004. Implemented by a suitable all-pass filter as discussed in Section 4. The discussion of its parametric parameters, in particular regarding the decision of the parametric stereo parameters, and in particular in section 4, is hereby incorporated by reference. The stereo mixing matrix 4 can be a 2x2 liter mixing matrix that produces stereo signals L', R' from the signals DM and S o or S*. The upmixing unit 4 and the decorrelator 10 are part of the PS decoder 8.
設備2係基於接收的側訊號可能對於藉簡單的結合接收的中及側訊號來重建立體聲訊號來說太嘈雜的想法;然而,在此情形中,在接收的L/R訊號中接收的側訊號或側訊號的成分可仍然對於在PS參數估計單元3中之立體聲參數分析來說係為足夠良好的。能接著使用造成的PS參數5以用於產生立體聲訊號L’、R’,其相較於直接在FM接收器1之輸出處的音頻訊號來說具有降低的雜訊之級數。 Device 2 is based on the idea that the received side signal may be too noisy to reconstruct the stereo signal by simply combining the received side and side signals; however, in this case, the side signal received in the received L/R signal The component of the side signal or the signal may still be sufficiently good for stereo parameter analysis in the PS parameter estimation unit 3. The resulting PS parameter 5 can then be used to generate stereo signals L', R' which have a reduced number of levels of noise compared to the audio signal directly at the output of the FM receiver 1.
因此,嘈雜的無線電信號能藉使用參數立體聲概念而被「清理」。失真與在FM無線電信號中之雜訊的主要部分係位在側頻道中,其通常非在PS縮混中使用。然而,即使在嘈雜的接收之情形,接收的側頻道S通常有用於PS參數萃取之充分的品質。 Therefore, noisy radio signals can be "cleaned up" by using the parametric stereo concept. Distortion and the main part of the noise in the FM radio signal are tied in the side channel, which is usually not used in PS downmixing. However, even in the case of noisy reception, the received side channel S typically has sufficient quality for PS parameter extraction.
在本文件中所繪示之圖式中,到音頻處理設備2之輸入訊號為左/右立體聲訊號。具有對在音頻處理設備2內一些模組輕微的修改,音頻處理設備2亦能處理在中/側代表中之輸入訊號。因此,能使用於此討論的概念以及連同在中/側代表中之輸入訊號。 In the drawing shown in this document, the input signal to the audio processing device 2 is a left/right stereo signal. With minor modifications to some of the modules within the audio processing device 2, the audio processing device 2 can also process input signals in the mid/side representation. Thus, the concepts discussed herein, as well as the input signals in the mid/side representation, can be used.
在圖2中所闡述以PS為基的(PS-based)FM立體聲雜訊降低方法良好的施行以用於在其中接收的FM無線電訊號之側訊號包含起源自無線電傳送頻道的高或中間的雜訊之級數的情形。然而,以PS為基的FM立體聲 雜訊降低方法具有幾個缺點。PS為基的FM立體聲雜訊降低方法相當計算複雜,當其需要2個QMF分析排(用於PS參數之計算)以及2個QMF合成排(用於雜訊降低立體聲訊號L’、R’之產生)時。進一步而言,PS為基的FM立體聲雜訊降低方法通常利用混合(亦即,QMF加額外的尼奎斯特(Nyquist))濾波器排(filter bank)手法以用於在較低頻率處增加的頻率解析度(frequency resolution)。此意味PS參數之決定通常需要高量的濾波器排運算。此外,PS為基的雜訊降低方法需要超越計算(transcendental computation),像是sin()及atan()運算,其牽涉高計算複雜度。PS為基的FM立體聲雜訊降低方法之另一個缺點是其並不完全單聲道相容,由於其不僅修改側訊號也修改中訊號以為了決定雜訊降低立體聲訊號L’、R’。換句話說,PS為基的FM立體聲雜訊降低系統之輸出的單聲道縮混M’=(L’+R’)/2通常與原始中訊號M不同。特別是,若接收的立體聲訊號具有寬立體聲像(stereo image)(亦即,若接收的立體聲訊號具有顯著的搖擺(pan)及/或解相關訊號成分),單聲道縮混訊號M’通常被衰減(亦即,較低級數的)。相較於此,對於預測式FM立體聲雜訊降低系統,輸出之單聲道縮混係為原始中訊號(由於僅修改/處理側訊號)。 The PS-based FM stereo noise reduction method illustrated in FIG. 2 is well implemented for the side signals of the FM radio signals received therein to contain high or intermediate impurities originating from the radio transmission channel. The situation of the number of levels of the news. However, PS-based FM stereo The noise reduction method has several drawbacks. The PS-based FM stereo noise reduction method is quite computationally complex, when it requires two QMF analysis rows (for the calculation of PS parameters) and two QMF synthesis rows (for noise reduction stereo signals L', R' When produced). Further, PS-based FM stereo noise reduction methods typically utilize hybrid (ie, QMF plus additional Nyquist) filter bank techniques for adding at lower frequencies. Frequency resolution. This means that the decision of the PS parameter usually requires a high amount of filter bank operations. In addition, PS-based noise reduction methods require transcendental computations, such as sin() and atan() operations, which involve high computational complexity. Another disadvantage of the PS-based FM stereo noise reduction method is that it is not completely mono compatible, since it not only modifies the side signals but also modifies the medium signals to reduce the stereo signals L', R' in order to determine the noise. In other words, the mono downmix M' = (L' + R')/2 of the output of the PS-based FM stereo noise reduction system is typically different from the original signal M. In particular, if the received stereo signal has a wide stereo image (ie, if the received stereo signal has significant pan and/or decorrelated signal components), the mono downmix signal M' is typically Attenuated (ie, lower order). In contrast, for a predictive FM stereo noise reduction system, the mono downmix of the output is the original medium signal (since only the side signal is modified/processed).
由於PS為基的FM立體聲雜訊降低方法之計算複雜度之量在許多建置中是一種顧慮,此文件說明替代的框架以用於利用預測式手法的FM立體聲雜訊降低。相 較於參數立體聲(PS)為基的框架,預測式框架需要較低的計算複雜度。特別是,預測式FM立體聲雜訊降低方法使用降低的濾波器排之數目並且避免使用超越計算。同時,其已顯示當使用預測式FM立體聲雜訊降低方法時能達成改善的音頻品質。 The computational complexity of the PS-based FM stereo noise reduction method is a concern in many implementations, and this document illustrates an alternative framework for FM stereo noise reduction using predictive techniques. phase Predictive frameworks require lower computational complexity than parametric stereo (PS)-based frameworks. In particular, the predictive FM stereo noise reduction method uses a reduced number of filter banks and avoids using overrun calculations. At the same time, it has been shown that improved audio quality can be achieved when using the predictive FM stereo noise reduction method.
如上所概述的,在圖2中所繪示PS為基的FM無線電雜訊降低系統需要2個QMF分析濾波器排及2個QMF合成濾波器排。這些排運算之全部者係在訊號路徑中因而需要高精確度。2個QMF分析濾波器排在PS編碼器7之輸出處於訊號L及R上運算,並且2個QMF合成濾波器排在PS解碼器8之輸出處產生訊號L’及R’。進一步而言,PS為基的系統使用立體聲參數IID及ICC,並且需要像sin()及atan()的超越函數以從這些立體聲參數來計算立體聲升混矩陣4之元素。 As outlined above, the PS-based FM radio noise reduction system illustrated in Figure 2 requires two QMF analysis filter banks and two QMF synthesis filter banks. All of these rank operations are in the signal path and therefore require high precision. The two QMF analysis filters are arranged at the outputs of the PS encoder 7 on signals L and R, and the two QMF synthesis filters are arranged at the output of the PS decoder 8 to produce signals L' and R'. Further, the PS-based system uses stereo parameters IID and ICC, and requires a transcendental function like sin() and atan() to calculate the elements of the stereo upmix matrix 4 from these stereo parameters.
要提出的是,藉使用預測式框架降低FM立體聲雜訊降低系統之計算複雜度,代替圖2中所描述PS為基的系統之縮混/升混框架。藉使用LR-to-MS轉換器75及MS-to-LR轉換器76切換到中/側訊號代表,結合預測式手法,降低需要的QMF排之數目是可能的。LR-to-RS轉換器75產生中訊號M=(L+R)/2及側訊號S=(L-R)/2,且其可被省略,若來自FM接收器1的中/側訊號係直接的饋送進入圖3之音頻處理設備2的話。MS-to-LR轉換器76施行對LR-to-MS 75的反向運算。 It is proposed to reduce the computational complexity of the FM stereo noise reduction system by using a predictive framework instead of the downmix/upmix frame of the PS-based system described in FIG. By using the LR-to-MS converter 75 and the MS-to-LR converter 76 to switch to the mid/side signal representative, it is possible to reduce the number of required QMF rows in combination with the predictive method. The LR-to-RS converter 75 generates the middle signal M=(L+R)/2 and the side signal S=(LR)/2, and it can be omitted if the middle/side signal from the FM receiver 1 is directly The feed enters the audio processing device 2 of FIG. The MS-to-LR converter 76 performs an inverse operation on the LR-to-MS 75.
圖3繪示範例預測式FM無線電雜訊降低系 統之概觀,在其中薄線80意指時間域訊號,厚線81意指QMF域訊號,且點線82意指參數。預測式框架僅使用QMF分析濾波器排71以及在訊號路徑中的一個QMF合成濾波器排72,以及僅用於參數估計(且其通常具有降低的準確度需求)的第二QMF分析排73。 Figure 3 depicts an exemplary predictive FM radio noise reduction system An overview of the system, in which the thin line 80 means the time domain signal, the thick line 81 means the QMF domain signal, and the dotted line 82 means the parameter. The predictive framework uses only the QMF analysis filter bank 71 and a QMF synthesis filter bank 72 in the signal path, and a second QMF analysis bank 73 that is only used for parameter estimation (and which typically has reduced accuracy requirements).
如上所概述的,PS為基的FM無線電雜訊降低系統通常使用混合型濾波器排(亦即,具有使用尼奎斯特濾波器排對於最低QFM帶額外帶分列的QMF排)以為了對於直到大約1kHz之最低頻率而達成較高頻率解析度。對於預測式FM無線電雜訊降低系統,已發現可以達成良好的音頻品質,即使沒有額外由混合型濾波器排提供帶分離(band-splitting)。從此,預測式FM無線電雜訊降低系統可僅使用QMF排(亦即,沒有混合型濾波器排),其進一步降低計算複雜度且亦降低FM無線電訊號處理之演算的延遲(或潛時)74。 As outlined above, PS-based FM radio noise reduction systems typically use a hybrid filter bank (i.e., a QMF row with an additional band split for the lowest QFM band using a Nyquist filter bank) for Higher frequency resolution is achieved up to a minimum frequency of approximately 1 kHz. For predictive FM radio noise reduction systems, it has been found that good audio quality can be achieved even without additional band-splitting by the hybrid filter bank. From then on, the predictive FM radio noise reduction system can use only the QMF row (ie, no hybrid filter bank), which further reduces computational complexity and also reduces the delay (or latency) of the calculation of FM radio signal processing. .
圖3之預測式FM雜訊降低系統旨在使用2個參數a及b從接收的中訊號M產生雜訊降低側訊號S’。接收的中訊號M保持未改變(撇開用以對於需要決定雜訊降低側訊號S’的計算時間而補償的延遲74)。此是不同於PS為基的FM雜訊降低系統,在其中2個訊號,雜訊降低左及右訊號L’、R’係決定為PS參數之函數。 The predictive FM noise reduction system of Figure 3 is intended to generate a noise reduction side signal S' from the received intermediate signal M using two parameters a and b. The received medium signal M remains unchanged (the delay 74 is compensated for the calculation time required to determine the noise reduction side signal S'). This is a PS-based FM noise reduction system in which two signals, noise reduction left and right signals L', R' are determined as a function of the PS parameters.
界定接收的中及側訊號M及S為M=(L+R)/2和S=(L-R)/2,側訊號能使用預測系數a及殘留訊號 D代表為S=a*M+D。這意味預測參數a用以從中訊號預測側訊號。最佳預測系數a(其最小化D之能量)能計算為a=E[S*M]/E[M*M],其中E[.]意指為期望運算元。字面上,預測系數a可決定為在接收的側及接收的中訊號之間的交叉相關,與中訊號之能量之比率。通常來說,系數a(及b)為時間及/或頻率變量。此意味決定不同的系數a(及b)用於不同的時間間隔及/或不同的頻率範圍。如此一來,期望值E[.]可針對特定時間間隔(例如,64ms)及/或在特定頻率範圍內(例如QMF子帶(subband)或成群的QMF子帶之數目)而決定。 The medium and side signals M and S defining the reception are M=(L+R)/2 and S=(LR)/2, and the side signal can use the prediction coefficient a and the residual signal D to represent S= a *M+D. This means that the prediction parameter a is used to predict the side signal from the medium signal. The best prediction coefficient a (which minimizes the energy of D) can be calculated as a = E[S*M]/E[M*M], where E[. ] means the expected operand. Literally, the prediction coefficient a can be determined as the ratio of the cross-correlation between the receiving side and the received medium signal to the energy of the medium signal. In general, the coefficients a (and b) are time and / or frequency variables. This means that different coefficients a (and b) are used for different time intervals and/or different frequency ranges. As a result, the expected value E[. ] may be determined for a particular time interval (eg, 64 ms) and/or within a particular frequency range (eg, a QMF subband or a number of groups of QMF subbands).
一旦已決定預測系數,才可從接收的中及側訊號M、S來決定殘留訊號D。殘留訊號D可由接收的中訊號M之解相關形式decorr(M)來近似。如此一來側訊號之雜訊降低形式S’可決定為S'=a*M+b*decorr(M)其中b為控制解相關訊號之能量的增益因子,亦參照為解相關參數b。解相關的中訊號decorr(M)可使用像是圖2之解相關器10的解相關器78來決定。解相關參數b能計算為b=sqrt(E[D*D]/E[M*M])以為了將具有與原始殘留訊號D相同的能量之能量控制的解相關訊號(b*decorr(M))取代殘留訊號D。結果是,預 測模型之參數a、b可在參數決定單元77內從接收的中訊號及接收的側訊號來決定。 Once the prediction coefficients have been determined, the residual signal D can be determined from the received middle and side signals M, S. The residual signal D can be approximated by the decorrelation form decorr(M) of the received medium signal M. Thus, the noise reduction form S' of the side signal can be determined as S'= a *M+ b *decorr(M), where b is the gain factor for controlling the energy of the decorrelated signal, and is also referred to as the decorrelation parameter b. The decorrelated medium signal decorr (M) can be determined using a decorrelator 78 like the decorrelator 10 of FIG. The de-correlation parameter b can be calculated as b = sqrt(E[D*D]/E[M*M]) in order to control the energy of the same energy as the original residual signal D ( b *decorr(M) )) Replace the residual signal D. As a result, the parameters a, b of the prediction model can be determined in the parameter decision unit 77 from the received medium signal and the received side signal.
結果是,在預測式FM雜訊降低系統之輸出處立體聲訊號L’及R’係從接收的中訊號M和2個參數a及b以雜訊降低單元79來計算。由於參數a及b通常估計及應用在複數值的QMF域代表(例如64帶的)中,處理能以時間及頻率變化方式來實現。通常來說,使用感知刺激時間(perceptually motivated time)及頻率舖貼(frequency tiling)。例如,64 QMF帶可依據感知頻率標度(例如,巴克標度(Bark scale))成群為總計15個頻率帶。感知頻率標度可藉將在較高頻率處鄰近的QMF帶成群來形成以形成較寬的頻率帶,其通常參照為「參數帶(parameter band)」。此組參數a和b(對於各個參數帶以一者)通常以定期的時間間隔(時框)來計算,例如使用大約64ms長度之時間性分析窗以近似E[.]運算。為了確保參數值從一個時間間隔(例如,時框)到下一個的平滑的轉移,可運用時間性內插(temporal interpolation)(例如,沿著時間線之線性內插)以產出a及b之內插的參數值。內插的參數值a和b接著以他們要應用於其上之對應的QMF帶訊號相乘。 As a result, the stereo signals L' and R' are output from the received intermediate signal M and the two parameters a and b at the output of the predictive FM noise reduction system by the noise reduction unit 79. Since parameters a and b are typically estimated and applied in complex-valued QMF domain representations (eg, 64-band), processing can be implemented in a time and frequency manner. Generally, perceptually motivated time and frequency tiling are used. For example, 64 QMF bands can be grouped into a total of 15 frequency bands in accordance with a perceptual frequency scale (eg, a Bark scale). The perceptual frequency scale can be formed by grouping adjacent QMFs at higher frequencies to form a wider frequency band, which is generally referred to as a "parameter band." This set of parameters a and b (for each parameter band) is usually calculated at regular time intervals (time frames), for example using a temporal analysis window of approximately 64 ms length to approximate E[. ] Operation. To ensure a smooth transition of parameter values from one time interval (eg, time frame) to the next, temporal interpolation (eg, linear interpolation along the timeline) can be used to produce a and b. Interpolated parameter values. The interpolated parameter values a and b are then multiplied by the corresponding QMF band signals to which they are applied.
如上所指,第二QMF分析排73僅用於在參數決定單元77內的參數估計。如從上面提供的公式能見到的是,第二QMF分析排73提供在接收的側訊號S上的子帶資訊,其係用以決定在每參數帶基礎上的交叉相關 E[S*M]。換句話說,第二QMF分析排73僅用以決定在參數帶之級數上的期望值(與QMF頻率帶相比)。又換句話說,第二QMF分析排73用以決定在相對粗糙的時間及頻率網格上的預測參數。結果是,在選擇性上的需要(例如,原型窗之長度),時間/頻率解析度及/或第二QMF分析排73之計算精度明顯的低於用於位於訊號路徑內之QMF分析帶71的需求。 As indicated above, the second QMF analysis row 73 is only used for parameter estimation within the parameter decision unit 77. As can be seen from the formula provided above, the second QMF analysis row 73 provides subband information on the received side signal S, which is used to determine the cross correlation on a per parameter band basis. E[S*M]. In other words, the second QMF analysis row 73 is only used to determine the expected value (in comparison to the QMF frequency band) over the number of stages of the parameter band. In other words, the second QMF analysis row 73 is used to determine the prediction parameters on a relatively coarse time and frequency grid. As a result, the selectivity requirements (e.g., the length of the prototype window), the time/frequency resolution, and/or the second QMF analysis row 73 are significantly less accurate than the QMF analysis band 71 used in the signal path. Demand.
如此一來,音頻處理設備2已說明依據圖2何者允許在相較於PS為基的FM雜訊降低系統之降低的計算複雜度處決定雜訊降低側訊號S’。使用MS-to-LR轉換器76,側訊號S’及(延遲的)接收的中訊號M’可轉換成雜訊降低左及右立體聲訊號L’、R’。感知試驗已顯示除了降低計算複雜度之外,當使用在本文所概述的預測式FM雜訊降低系統時(例如在圖3中),能改善雜訊降低FM訊號之感知品質。 As such, the audio processing device 2 has been described in accordance with FIG. 2 which allows the noise reduction side signal S' to be determined at a reduced computational complexity of the PS-based FM noise reduction system. Using the MS-to-LR converter 76, the side signal S' and the (delayed) received intermediate signal M' can be converted into noise to reduce the left and right stereo signals L', R'. Perceptual testing has shown that in addition to reducing computational complexity, when using the predictive FM noise reduction system outlined herein (e.g., in Figure 3), noise can be improved to reduce the perceived quality of the FM signal.
另一方面,已觀察的是,當使用用於FM立體聲雜訊降低的預測式手法時,接收的訊號由強照且嘈雜的側訊號所支配的情況(亦即,具有比中訊號更高的級數)能造成感知上惱人的人為因素。這類情況能發生在例如當傳送的立體聲訊號相對的安靜(例如,在兩段音樂之間的短暫停期間)同時接收器正面臨中間到不良的接收條件時。這類情況可由E[S*S]>>E[M*M]來特徵化,亦即接收的側訊號S之能量為(明顯的)高於接收的中訊號M之能量。鑒於參數a及b取決於中訊號E[M*M]之能量並 且部分的取決於側訊號E[S*S]之能量的事實,在上面所提的情況中,參數a和b通常具有大的絕對值(顯然的大於1)。此意味中訊號M明顯的增升以為了決定雜訊降低側訊號S’,藉此導入人為因素。進一步而言,參數a和b可沿著時間及頻率強烈的波動,其通常聽覺上感知為不欲的不穩定。 On the other hand, it has been observed that when a predictive method for FM stereo noise reduction is used, the received signal is dominated by a strong and noisy side signal (ie, having a higher frequency than the middle signal) The number of levels can cause a perceptually annoying human factor. Such a situation can occur, for example, when the transmitted stereo signal is relatively quiet (eg, during a short pause between two pieces of music) while the receiver is facing intermediate to poor reception conditions. Such a situation can be characterized by E[S*S]>>E[M*M], that is, the energy of the received side signal S is (apparently) higher than the energy of the received medium signal M. Since the parameters a and b depend on the energy of the medium signal E[M*M] And partly depending on the fact that the energy of the side signal E[S*S], in the case mentioned above, the parameters a and b usually have large absolute values (obviously greater than 1). This means that the signal M is significantly increased in order to determine the noise reduction side signal S', thereby introducing a human factor. Further, the parameters a and b can fluctuate strongly along time and frequency, which are generally audibly perceived as undesired instability.
為了減輕這個問題,能將後處理(post-processing)步驟應用到參數a和b。換句話說,修改的參數a’和b’之組能以a’=fa(a,b)和b’=fb(a,b)來決定。可能的後處理手法係為應用衰減或限制因子c以獲得後處理的參數a’=a/c和b’=b/c,其中c=1導致未修改的參數a和b。c>1之值造成雜訊降低側訊號S’乘以1/c,亦即由因子c來衰減。應注意,其它用於a’、b’及a、b之間的關係的公式是可能的。 To alleviate this problem, a post-processing step can be applied to parameters a and b . In other words, the modified set of parameters a' and b' can be determined by a '=f a ( a , b ) and b '= f b ( a , b ). A possible post-processing approach is to apply the attenuation or limiting factor c to obtain post-processing parameters a '= a / c and b '= b / c , where c =1 results in unmodified parameters a and b . The value of c >1 causes the noise reduction side signal S' to be multiplied by 1/ c , which is attenuated by the factor c . It should be noted that other formulas for the relationship between a ', b ' and a , b are possible.
不同手法用以從a和b計算限制因子c(亦即,c=f(a,b))是可能的。兩個不同的手法為:c=max(1,(a 2+b 2)),或 (1)
使用公式(2)的手法確保雜訊降低側訊號S’之能量不會超過中訊號M之能量,同時使用公式(1)之手法將甚至更強烈的衰減應用到在上述情況中的S’(相較於公式(2)),其中E[S*S]>E[M*M]。已發現使用公式(2)的手法趨向於提供稍微較好的音頻品質以用於在良好接收條件之情形中的寬音頻訊號,同時使用公式 (1)之手法趨向於在中間及不良接收條件之情形中於上述防止知覺惱人的人為因素上為更可靠的。 Use the formula (2) to ensure that the energy of the noise reduction side signal S' does not exceed the energy of the medium signal M, and use the formula (1) to apply even stronger attenuation to the S' in the above case ( Compared to formula (2)), where E[S*S]>E[M*M]. It has been found that the technique using equation (2) tends to provide a slightly better audio quality for wide audio signals in the case of good reception conditions while using the formula The technique of (1) tends to be more reliable in the above-mentioned situation of preventing annoying human factors in the case of intermediate and poor reception conditions.
應注意,在通常的接收情況中,側訊號之能量E[S*S]小於中訊號之能量E[M*M]。在此情形中,參數a和b通常小於1。在公式(1)和(2)中「最大(max)」運算確保在這類情形中限制因子為c=1,亦即沒有應用限制。 It should be noted that in the normal reception case, the energy E[S*S] of the side signal is smaller than the energy E[M*M] of the medium signal. In this case, the parameters a and b are usually less than one. The "max" operation in equations (1) and (2) ensures that in such cases the limiting factor is c =1, ie no application restrictions.
如在圖3中所闡述的,參數p可用以於通過或旁通模式中在雜訊降低側訊號S’與原始接收的(延遲的)側訊號S之間平滑的交叉衰弱。通過模式能對於以最佳方式處理具有良好接收條件的情況是有益的。為此目的,接收的FM立體聲訊號之品質應以可靠的方式估計,以為了下決定S’、S或S’及S之結合之使用以用於產生雜訊降低立體聲訊號L’、R’。在更一般的術語中,雜訊降低側訊號S’可通過雜訊降低增益單元31且旁通的側訊號S可通過旁通增益單元30。增益單元30、31從在他們輸入處的側訊號產生放大的及/或衰減的側訊號於他們的輸出處。放大的及/或衰減的側訊號在合併單元(例如,相加單元)32中合併,藉此提供結合的側訊號,其用以產生雜訊降低立體聲訊號L’、R’。 As illustrated in FIG. 3, the parameter p can be used to smooth the cross-fading between the noise reduction side signal S' and the originally received (delayed) side signal S in the pass or bypass mode. The pass mode can be beneficial for situations where there are good reception conditions in an optimal manner. For this purpose, the quality of the received FM stereo signal should be estimated in a reliable manner in order to determine the use of a combination of S', S or S' and S for generating noise reduction stereo signals L', R'. In more general terms, the noise reduction side signal S' can be reduced by the noise reduction unit 31 and the bypassed side signal S can pass through the bypass gain unit 30. Gain units 30, 31 generate amplified and/or attenuated side signals from their side signals at their inputs. The amplified and/or attenuated side signals are combined in a merging unit (e.g., adding unit) 32, thereby providing a combined side signal for generating noise reducing stereo signals L', R'.
預測式FM雜訊降低系統可進一步包含HQ(高品質)偵測單元20,其組態成決定或估計在接收的FM立體聲訊號L、R(或M、S)內可聽的雜訊之級數。在HQ偵測單元20內決定的雜訊級數估計可用以在雜訊 降低側訊號S’與原始(旁通的)側訊號S之間混合。為了混合側訊號,HQ偵測單元20可組態成設定雜訊降低增益單元31和旁通增益單元30之增益值。替代的或除此之外,混合側訊號可藉內插(線性的或非線性的)側訊號來達成。或者,可基於在HQ偵測單元20內所決定可聽的雜訊之級數的估計來選擇側訊號之其中之一。 The predictive FM noise reduction system can further include an HQ (High Quality) detection unit 20 configured to determine or estimate the level of audible noise within the received FM stereo signals L, R (or M, S). number. The estimated number of noise levels determined in the HQ detection unit 20 can be used in the noise The side signal S' is reduced and mixed with the original (bypass) side signal S. In order to mix the side signals, the HQ detecting unit 20 can be configured to set the gain values of the noise reduction gain unit 31 and the bypass gain unit 30. Alternatively or in addition, the mixed side signal can be achieved by interpolating (linear or non-linear) side signals. Alternatively, one of the side signals may be selected based on an estimate of the number of levels of audible noise determined within the HQ detection unit 20.
在下文中,方法係說明HQ偵測單元20如何可估計在接收的FM立體聲訊號內雜訊之實際的級數,且以藉此下決定是否更著重於雜訊降低側訊號S’或更著重於旁通側訊號S。 In the following, the method illustrates how the HQ detecting unit 20 can estimate the actual number of levels of noise in the received FM stereo signal, and thereby decide whether to focus more on the noise reduction side signal S' or more. Bypass side signal S.
為了在雜訊與實際酬載訊號之間進行區別,假定若側訊號S明顯的比接收的中訊號M強的話,接收的側訊號S主要的包含雜訊。換句話說,假定若側訊號S之功率以預定的臨界超過中訊號M之功率的話,側訊號S之功率係主要由於雜訊。因此,接收的立體聲訊號M、S之訊號對雜訊比(SNR;Signal-to-Noise Ratio)能近似為中對側比(MSR;Mid-to-Side Ratio)以用於低MSR值:
k=1,...,K頻率帶能自例如QMF帶分析層級(stage)71、73導出,其中QMF音頻資料之K=64頻道可用於處理。如上所概述的,QMF或混合QMF帶可有益的成群成為降低數目的頻率帶,其對應至例如非均勻感知動機的(perceptibly motivated)標度,例如巴克標度。如此一來,能決定MSR以用於複數個頻率(參數)帶,其中複數個頻率帶之解析度係為感知動機的。藉範例的方式,QMF濾波器排可包含64個QMF帶或混合QMF濾波器排可包含71個帶。這些濾波器排之解析度通常在高頻率範圍中過度的高。如此一來,可有利的是將帶中之一些者以感知動機的方式成群。如上所概述的,在預測式FM雜訊降低系統中的參數對應到這類成群的(感知動機的)頻率帶。藉範例的方式,預測式FM雜訊降低系統之參數a及b可使用在對應到單一時框(包含例如2048個取樣)的時間窗內總計15到20個成群的QMF頻率帶來決定。用於決定參數a及b的相同的頻率或參數帶,亦可用於決定每頻率/參數帶的MSR值,藉此降低全體計算複雜度。 k =1,..., K frequency bands can be derived, for example, from QMF band analysis stages 71, 73, where K = 64 channels of QMF audio data are available for processing. As outlined above, a QMF or hybrid QMF band can be beneficially clustered into a reduced number of frequency bands that correspond to, for example, a perceptibly motivated scale, such as a Barker scale. In this way, the MSR can be determined for a plurality of frequency (parameter) bands, wherein the resolution of the plurality of frequency bands is perceptually motivated. By way of example, a QMF filter bank can contain 64 QMF bands or a hybrid QMF filter bank can contain 71 bands. The resolution of these filter banks is typically too high in the high frequency range. As such, it may be advantageous to group some of the bands in a perceptually motivated manner. As outlined above, the parameters in the predictive FM noise reduction system correspond to such clustered (perceived) frequency bands. By way of example, the parameters a and b of the predictive FM noise reduction system can be determined using a total of 15 to 20 groups of QMF frequencies within a time window corresponding to a single time frame (including, for example, 2048 samples). The same frequency or parameter band used to determine parameters a and b can also be used to determine the MSR value per frequency/parameter band, thereby reducing overall computational complexity.
用於中訊號M及用於在時間n上某給定點的參數帶k之功率能計算為期望值:
當側訊號S並未強於中訊號M(或未藉由因子MSR_THRESHOLD而更強)時,SNR估計使用MSR通常是不可用的。換句話說,當側訊號S並不強於中訊號M(或未藉由MSR_THRESHOLD而更強)時,MSR通常並非SNR之良好估計。在此情形,SNR可基於一或更多在前的SNR之估計來決定。此可藉如在圖5之步驟104的內文中所說明之應用平滑或衰減函數來建置。 When the side signal S is not stronger than the medium signal M (or is not stronger by the factor MSR_THRESHOLD ), the SNR estimation using the MSR is usually not available. In other words, when the side signal S is not stronger than the medium signal M (or is not stronger by MSR_THRESHOLD ), the MSR is usually not a good estimate of the SNR. In this case, the SNR can be determined based on an estimate of one or more previous SNRs. This can be accomplished by applying a smoothing or attenuation function as illustrated in the context of step 104 of FIG.
圖4繪示用於中訊號60之功率譜以及在嘈雜的FM無線電接收條件中用於側訊號61之功率譜。對於具有強烈支配的中訊號M的頻率帶,側訊號S是否為雜訊是模稜兩可的。側訊號S可例如為部分的環境訊號或部分搖擺的訊號。結果是,這些頻率帶通常未提供在接收的FM立體聲訊號L、R(或M、S)內雜訊之功率的可靠指示。然而,注視其中側訊號S為明顯強於中訊號M(例如以至少6dB或以幾乎10dB)的頻率帶,此可看作非常有可能在由無線電傳送造成的側訊號S內本質上純雜訊之指示。這類情況(其中)能見於圖4在大約2kHz和5kHz處。如此一來,橫跨頻率帶k=1,...,K的MSR之最小值可視為接收的FM無線電訊號之SNR(亦即全體接收的FM無線電立體聲訊號之品質)的可靠的指示器。 4 illustrates the power spectrum for the middle signal 60 and the power spectrum for the side signal 61 in the noisy FM radio reception conditions. For the frequency band of the medium signal M with strong control, it is ambiguous whether the side signal S is noise. The side signal S can be, for example, a partial environmental signal or a partial sway signal. As a result, these frequency bands typically do not provide a reliable indication of the power of the noise within the received FM stereo signal L, R (or M, S). However, looking at the side signal S is a frequency band that is significantly stronger than the medium signal M (for example, at least 6 dB or almost 10 dB), which is considered to be very likely to be essentially pure noise in the side signal S caused by radio transmission. Instructions. Such a situation (where Can be seen in Figure 4 at approximately 2 kHz and 5 kHz. In this way, the minimum value of the MSR across the frequency bands k =1, . . . , K can be regarded as a reliable indicator of the SNR of the received FM radio signal (ie, the quality of the entire received FM radio stereo signal).
像是音樂或語音的音頻內容在高頻率範圍中通常具有比在低頻率範圍中較少的酬載能量。進一步而 言,在高頻率範圍中的酬載能量可比在低頻率範圍中的較少連續。如此一來,接收的FM訊號之雜訊的能量在高頻率範圍內比在低頻率範圍中更容易偵測。有鑑於此,可為有利的是限制MSR之分析到全體K頻率帶之選定的子範圍。特別是,可為有利的是限制MSR之分析到全體K頻率帶之上面的子範圍,例如到K頻率帶之上面一半。如此一來,用於偵測接收的FM訊號之品質的方法可更強健的做成。 Audio content, such as music or speech, typically has less payload energy in the high frequency range than in the lower frequency range. Further In other words, the payload energy in the high frequency range can be less continuous than in the low frequency range. As a result, the energy of the received FM signal noise is easier to detect in the high frequency range than in the low frequency range. In view of this, it may be advantageous to limit the analysis of the MSR to the selected sub-range of the entire K-frequency band. In particular, it may be advantageous to limit the analysis of the MSR to sub-ranges above the entire K frequency band, for example to the upper half of the K frequency band. In this way, the method for detecting the quality of the received FM signal can be made more robust.
鑒於上述,可界定高品質因子α HQ ,其取決於橫跨一些或所有的頻率帶k=1,...,K(例如,橫跨高頻率帶)的MRS之分析。高品質因子α HQ 可使用為在接收的FM無線電立體聲訊號內可聽的雜訊之指示器。不具有雜訊的高品質訊號可由α HQ =1指示且具有高雜訊之低品質訊號可由α HQ =0指示。中間品質狀態可由0<α HQ <1指示。高品質因子α HQ 能自MSR值導出,其依據:
在上面公式中,q為從一或更多MSR值導出的值。如上所指,q可從橫跨頻率帶之子集的最小的MSR值導出。進一步而言,q可以設定為反向的最小MSR值之 峰值衰減(peak-decay)值。替代的或此外,任何其它平滑的方法可以使用以使隨時間推移的品質指示器參數q之進化平滑。 In the above formula, q is a value derived from one or more MSR values. As indicated above, q can be derived from the smallest MSR value across a subset of the frequency bands. Further, q can be set to a peak-decay value of the reverse minimum MSR value. Alternatively or in addition, any other smoothing method can be used to smooth the evolution of the quality indicator parameter q over time.
高品質因子α HQ 能用於在雜訊降低側訊號S’與原始未處理側訊號S之間切換或衰退或內插。此意味高品質因子α HQ =p可使用為用於旁通增益單元30之增益,然而因子(1-α HQ )=1-p可使用為用於雜訊降低增益單元31之增益。 The high quality factor α HQ can be used to switch or fade or interpolate between the noise reduction side signal S' and the original unprocessed side signal S. This means that the high quality factor α HQ = p can be used as the gain for the bypass gain unit 30, however the factor (1- α HQ ) = 1 - p can be used as the gain for the noise reduction gain unit 31.
HQ偵測演算法100之實施例能藉由下列在圖5中所繪示的步驟所說明: An embodiment of the HQ detection algorithm 100 can be illustrated by the steps illustrated in Figure 5 below:
˙在步驟101中,計算中及側訊號功率,亦即針對一些或所有的頻率或參數帶k,例如K low <k K high ,決定中訊號之能量及側訊號之能量。在一範例中,K high =K且K low =K/2(亦即,僅考量頻率帶之上面一半)。中及側功率及在時間點n處決定,例如使用上面提供的用於期望值的平均公式。 ̇ In step 101, the mid- and side-signal powers are calculated, that is, for some or all of the frequencies or parameter bands k , such as K low < k K high , the energy of the signal And the energy of the side signal . In one example, K high = K and K low = K /2 (ie, only the upper half of the frequency band is considered). Medium and side power and It is decided at time point n , for example using the average formula provided above for the expected value.
˙在步驟102中,用於一些或所有的頻率帶k的中 對側比(MSR)值係決定為例如 In step 102, the mid-to-side ratio (MSR) value for some or all of the frequency bands k is determined to be, for example,
˙在步驟103中,決定用於某頻率範圍的MSR值,其中頻率範圍係為例如K low <k K high 。 ̇ In step 103, determining the MSR value for a certain frequency range , where the frequency range is, for example, K low < k K high .
˙在步驟104中,例如藉由將MSR峰值決定為下列式子,而使最小MSR值隨時間推移而平滑γ peak (n)=min(κγ peak (n-1),γ min),具有衰減因子κ=exp(-1/(F s τ))具有例如τ=2秒的時間常數,且具有F s 作為框率(frame rate),亦 即實現步驟104頻率多高的比率。此建置反向的峰值衰減,其隨時間推移使最小MSR值平滑。 步骤 In step 104, the minimum MSR value is smoothed by time γ peak ( n )=min( κγ peak ( n −1), γ min ), for example, by determining the MSR peak as the following expression, with attenuation The factor κ = exp(-1/( F s τ )) has a time constant of, for example, τ = 2 seconds, and has F s as a frame rate, that is, a ratio of how high the frequency of step 104 is. This establishes a reverse peak attenuation that smoothes the minimum MSR value over time.
˙在步驟105中,在時間點n處的高品質因子α HQ 係藉由使用在時間點n處的MSR峰值γ peak (n)決定,亦即,藉由使用在時間點n處的平滑的最小MSR值γ peak (n),具有q=γ peak (n)而成為
˙在步驟107中,在時間點n處的高品質因子α HQ 可應用到在圖3中闡述的側訊號混合處理。 In step 107, the high quality factor α HQ at time point n can be applied to the side signal mixing process illustrated in FIG.
上面所提的HQ偵測演算法100可重複以用於隨後的時間點(由從步驟107回到步驟101之箭頭所闡述)。 The HQ detection algorithm 100 mentioned above can be repeated for subsequent time points (as illustrated by the arrow from step 107 back to step 101).
用於決定接收的FM無線電立體聲訊號之高品質的方法或系統可進一步藉使高品質因子α HQ 取決於一或更多另外的指示器(除了一或更多MSR值之外)。特別是,高品質因子可取決於接收的FM無線電立體聲訊號之頻譜平坦度測量(FSM;Spectral Flatness Measure)而做成。如在WO PCT/EP2011/064077中所概述的,可決定在0與1之間被正規化的所謂SFM_impact_factor。SFM_impact_factor=0可對應到指示側訊號S之功率譜的低 SFM值,對於其譜功率集中在訊率帶之相對小的數目中。亦即,SFM衝擊因子「0」指示低級數的雜訊。另一方面,SFM衝擊因子「1」對應到高SFM值,其指示在所有頻譜帶中頻譜具有相似的功率之量。結果是,SFM衝擊因子「1」指示高級數的雜訊。 The method or system for determining the high quality of the received FM radio stereo signal may further dictate that the high quality factor α HQ depends on one or more additional indicators (in addition to one or more MSR values). In particular, the high quality factor can be made dependent on the Spectral Flatness Measurement (FSM) of the received FM radio stereo signal. The so-called SFM_impact_factor , which is normalized between 0 and 1, can be determined as outlined in WO PCT/EP2011/ 064077 . SFM_impact_factor =0 may correspond to a low SFM value indicating the power spectrum of the side signal S for which the spectral power is concentrated in a relatively small number of signal bands. That is, the SFM impact factor "0" indicates low-level noise. On the other hand, the SFM impact factor "1" corresponds to a high SFM value indicating that the spectrum has a similar amount of power in all spectral bands. As a result, the SFM impact factor "1" indicates the noise of the advanced number.
修改的高品質因子α' HQ 可依據下式來決定:
在隨後,係說明對於增強用於HQ偵測的方法及系統的另一個選擇。修改的高品質因子可藉將高品質因子α HQ 影響了總側級數S sum 成為柔和雜訊閘(soft noise gate)來決定,亦即側訊號之總級數(亦即能量或功率),其可決定為側訊號之能量(橫跨所有頻率帶)。如此一來,修改的高品質因子α' HQ 可依據下式來決定:
用於提供增強的HQ偵測演算法的另一個選擇係使高品質因子α HQ 受如例如在WO PCT/EP2011/064084中說明的隱蔽偵測器(concealment detector)之輸出所影響。修改的高品質因子α' HQ 可藉考慮在預測式FM無線電降低系統內的隱蔽是否為主動的來決定,以為了隱蔽不欲的FM接收器之單聲道降退情況。修改的高品質因子α' HQ 可依據α' HQ =(1-δ conceal )α HQ 來決定,其中若隱蔽為主動時δ conceal =1,且其中另外δ conceal =0。此意味若在預測式FM無線電雜訊降低系統內隱蔽為主動的話,接收的FM無線電訊號當然視為低品質的(α' HQ =0),否則接收的FM無線電訊號之品質係基於高品質因子α HQ 之計算值來估計。為了避免(可聽的)當從隱蔽狀態(亦即δ conceal =1)復原時的不連續,亦即為了確保將修改的高品質因子α' HQ 從0平滑的轉移到非零值,每當δ conceal =1時最小MSR值γ min可強制為γ min=MSR_LOW,使得平滑的轉移係藉圖5之步驟104之平滑方法來確保。由於使高品質因子取決於隱蔽狀態δ conceal , 能建置使用預測式FM無線電雜訊降低之對模式的快速切換(亦即,對用於不良接收條件之突然發生的FM無線電雜訊降低處理快速轉移),以及慢混合回至旁通模式(當接收條件已改善時)。 Another selection system for providing enhanced detection algorithm of the HQ of the high quality factor [alpha] HQ detector concealed by, for example, as described in WO PCT / EP2011 / 064084 in (concealment detector) output of the impact. The modified high quality factor α ' HQ can be determined by considering whether the concealment within the predictive FM radio reduction system is active in order to conceal the mono drop condition of the unwanted FM receiver. The modified high quality factor α ' HQ can be determined according to α ' HQ = (1 - δ conceal ) α HQ , where δ conceal =1 if concealment is active, and δ conceal =0 otherwise. This means that the received FM radio signal is of course considered low quality ( α ' HQ =0) if it is concealed as active in the predictive FM radio noise reduction system, otherwise the quality of the received FM radio signal is based on a high quality factor. The calculated value of α HQ is estimated. In order to avoid (audible) discontinuities when restoring from the concealed state (ie δ conceal =1), ie to ensure that the modified high quality factor α ' HQ is smoothed from 0 to a non-zero value, whenever When δ conceal =1, the minimum MSR value γ min can be forced to γ min = MSR_LOW , so that the smooth transition is ensured by the smoothing method of step 104 of FIG. 5 . Since the high quality factor depends on the concealed state δ conceal , it is possible to establish a fast switching of the mode using the predicted FM radio noise reduction (that is, the FM radio noise reduction processing for sudden occurrence of bad reception conditions is fast) Transfer), and slow mix back to bypass mode (when the receiving conditions have improved).
在下列,說明用於增強HQ偵測方法的另一個選擇。可調整MSR值γ k 以用於大的搖擺訊號,其係依據:
應注意,上面所提用於決定修改的高品質因子α HQ 的選擇能單獨的或是彼此任意的結合來使用。 It should be noted that the selection of the high quality factor α HQ proposed above for determining the modification can be used singly or in any combination with each other.
進一步而言,應注意,高品質因子α HQ 能用以在預測式FM立體聲無線電雜訊降低系統中調整參數a和b。特別是,限制因子c可受品質指示器α HQ 影響。此能例如依據下式而完成: 其中ε為選擇的調整值(小數目)防止當品質指示器α HQ =1時a和b為無限大(或不合理的大數目),亦即當接收的FM訊號包含低程度的雜訊時。 Further, it should be noted that the high quality factor α HQ can be used to adjust parameters a and b in a predictive FM stereo radio noise reduction system. In particular, the limit factor c can be affected by the quality indicator α HQ . This can be done, for example, according to the following formula: Where ε is the selected adjustment value (small number) to prevent a and b from being infinite (or unreasonably large) when the quality indicator α HQ =1, that is, when the received FM signal contains a low degree of noise .
取決於品質指示器α HQ 的限制函數c=f(a,b,α HQ )之目的係為限制a和b以用於低品質FM訊號(α HQ 接近0),同時不(或僅輕微的)限制a和b以用於高品質FM訊號(α HQ 接近1)。應注意,上面所提用於取決於品質指示器α HQ 修改限制因子的函數近似對於α HQ =0的c之第一函數(1)、對於α HQ =0.5的第二函數(2)以及對於α HQ =1施行“未限制”參數a和b。進一步而言,應注意,上面所提的公式僅為建置考量接收的FM訊號之品質的修改的限制函數之一個範例。 The purpose of the limit function c = f( a , b , α HQ ) depending on the quality indicator α HQ is to limit a and b for low quality FM signals ( α HQ close to 0), while not (or only slight) ) Limit a and b for high quality FM signals ( α HQ is close to 1). It should be noted that the above-mentioned function for modifying the limiting factor depending on the quality indicator α HQ approximates the first function (1) for c with α HQ =0, the second function (2) for α HQ = 0.5, and α HQ =1 applies the “unrestricted” parameters a and b . Further, it should be noted that the above-mentioned formula is only an example of a modified limit function for constructing consideration of the quality of the received FM signal.
在圖3中闡述的雜訊降低側訊號S’和旁通的側訊號S之選擇和合併可以頻率選擇方式來施行。可能的建置會包含下列對圖3之方塊示意圖的修改。可以修改圖3中的方塊示意圖使得增益單元30、31及合併單元32會在側訊號合成濾波器排「QMF-1」72之前於QMF域中施行。進一步而言,對旁通增益單元30的輸入可以為「QMFs」分析濾波器排73之輸出。此會意味濾波器排73 係在通過之情形之中於訊號路徑中,且因此具有與「QMF」分析濾波器排71相同的準確度需求。QMF合成濾波器排72可以用以將合併的側訊號(合併單元32之下行(downstream))轉換成時間域。 The selection and combining of the noise reduction side signal S' and the bypass side signal S illustrated in FIG. 3 can be performed in a frequency selective manner. Possible builds will include the following modifications to the block diagram of Figure 3. The block diagram in Figure 3 can be modified such that gain units 30, 31 and merging unit 32 are implemented in the QMF domain prior to side signal synthesis filter bank "QMF -1 " 72. Further, the input to the bypass gain unit 30 may be the output of the "QMF s " analysis filter bank 73. This would mean that the filter bank 73 is in the signal path in the case of passing, and therefore has the same accuracy requirements as the "QMF" analysis filter bank 71. The QMF synthesis filter bank 72 can be used to convert the combined side signals (downstream of the merging unit 32) into a time domain.
在替代的實施例中,頻率可選擇性的限定在兩個頻率帶,亦即高頻率帶和低頻率帶。特別是,低頻率帶可固定到旁通路徑,亦即重建構的側訊號可對應到用於低頻率範圍之接收的側訊號S,然而在高頻率範圍中,可使用雜訊降低側訊號S’(或是依據品質指示器p的混合的側訊號)。 In an alternative embodiment, the frequency is selectively limited to two frequency bands, namely a high frequency band and a low frequency band. In particular, the low frequency band can be fixed to the bypass path, that is, the reconstructed side signal can correspond to the side signal S for low frequency range reception, but in the high frequency range, the side signal S can be reduced by using noise. '(Or according to the mixed side signal of the quality indicator p).
WO PCT/EP2011/064077說明用以透過使用頻譜平坦度測量(spectral flatness measure)降低或移除在雜訊降低立體聲訊號中不欲的解相關成分之量的技術。這些技術亦能應用到在本文件中說明的預測式FM無線電雜訊降低系統。特別是光譜平坦度測量能藉修改參數b來應用如下:b_new=(1-SMF_impact_factor)*b.此意味SFM_impact_factor=1會強制b_new=0。對於SFM_impact_factor=0,b會維持不變。如此一來,在具有帶有SFM_impact_factor=1的高頻譜平坦度的側訊號(表示嘈雜的側訊號),無解相關相加到雜訊降低側訊號S’,使得雜訊降低側訊號S’對應到接收的中訊號之標度版本(scaled version),亦即a*M。 WO PCT/EP2011/064077 describes techniques for reducing or removing the amount of uncorrelated components that are undesirable in noise reduction stereo signals by using a spectral flatness measure. These techniques can also be applied to the predictive FM radio noise reduction system described in this document. In particular, the spectral flatness measurement can be applied by modifying the parameter b as follows: b _new=(1-SMF_impact_factor)* b . This means that SFM_impact_factor=1 forces b _new=0. For SFM_impact_factor=0, b will remain unchanged. In this way, in the side signal (with a noisy side signal) with high spectral flatness with SFM_impact_factor=1, no de-correlation phase is added to the noise reduction side signal S', so that the noise reduction side signal S' corresponds to To the scaled version of the received signal, that is, a *M.
在下列,概述用於決定SFM_impact_factor的例 子。在通常接收的FM無線電立體聲訊號中,中訊號M之功率譜在較低頻率範圍中相對的陡峭而具有高級數的能量。在另一方面,訊號S通常具有總體低程度的能量以及相對平坦的功率譜。 In the following, an overview is given for the example of determining SFM_impact_factor . In the normally received FM radio stereo signal, the power spectrum of the medium signal M is relatively steep in the lower frequency range and has the energy of the advanced number. On the other hand, the signal S typically has an overall low degree of energy and a relatively flat power spectrum.
由於側訊號雜訊之功率譜頗為平坦且具有特性斜率,連同斜率補償的SFM可用以估計在接收的FM訊號內雜訊級數。可使用不同類型的SFM值。亦即,SFM值可以各種方式計算。特別是,可使用瞬時SFM值以及SFM之經平滑的版本。瞬時SFM值通常對應到側訊號之訊號時框的SFM,然而瞬時SFM值之平滑版本亦取決於側訊號之先前訊號時框之SFM。 Since the power spectrum of the side signal noise is quite flat and has a characteristic slope, the slope compensated SFM can be used to estimate the number of noise levels within the received FM signal. Different types of SFM values can be used. That is, the SFM value can be calculated in various ways. In particular, instantaneous SFM values and smoothed versions of SFM can be used. The instantaneous SFM value usually corresponds to the SFM of the signal frame of the side signal, however the smoothed version of the instantaneous SFM value also depends on the SFM of the previous signal frame of the side signal.
用於從側訊號決定衝擊因子的方法包含決定側訊號之功率譜的步驟。通常來說,這是使用側訊號之一定數目的取樣(例如,訊號時框之取樣)。功率譜可決定為側訊號之能量值,用於複數個頻率帶k,例如k=1,...,K。功率譜之決定周期可與用於決定參數a及b之周期對準。如此一來,側訊號之功率頻譜可針對於對應的參數a及b之有效周期來決定。 The method for determining the impact factor from the side signal includes the step of determining the power spectrum of the side signal. Typically, this is a certain number of samples using the side signal (for example, the sampling of the signal frame). The power spectrum can be determined as the energy value of the side signal , for a plurality of frequency bands k , such as k =1,..., K . The decision period of the power spectrum can be aligned with the period used to determine parameters a and b . In this way, the power spectrum of the side signal can be determined according to the effective period of the corresponding parameters a and b .
在後續的步驟中,可補償側訊號雜訊之功率譜的特性斜率。可例如藉由決定一組單聲道訊號之側訊號的平均功率譜來實驗性的決定特性斜率(在設計/調諧階段)。替代的或此外,可適應性的從目前側訊號決定特性斜率,例如在目前側訊號之功率譜上使用線性迴歸(linear regression)。特性斜率之補償可由反向雜訊斜率 濾波器(inverse noise slope filter)施行。結果,應獲得斜率補償的、可能平坦的功率譜,其未顯出單聲道語音音頻訊號之側訊號之功率譜的特性斜率。 In the subsequent steps, the characteristic slope of the power spectrum of the side signal noise can be compensated. The characteristic slope (in the design/tuning phase) can be experimentally determined, for example, by determining the average power spectrum of the side signals of a set of mono signals. Alternatively or in addition, adaptability determines the slope of the characteristic from the current side signal, for example using linear regression on the power spectrum of the current side signal. Compensation of characteristic slope can be reversed noise slope The inverse noise slope filter is implemented. As a result, a slope-compensated, possibly flat power spectrum should be obtained which does not show the characteristic slope of the power spectrum of the side signal of the monophonic speech audio signal.
使用(斜率補償的)功率譜,可決定SFM值。SFM可依據下式計算
或者,SFM可在頻譜之子集上計算,僅包括範圍從K low 到K high 的頻率帶。能排除例如一或少許頻率帶的該方式以為了移除不想要的DC(例如低頻)偏移。當調整帶邊界時,上面所提用於計算SFM之公式應據以修改。 Alternatively, the SFM can be computed over a subset of the spectrum, including only frequency bands ranging from K low to K high . This approach, for example one or a few frequency bands, can be excluded in order to remove unwanted DC (eg low frequency) offsets. When adjusting the band boundary, the formula proposed above for calculating the SFM should be modified accordingly.
為了限制計算複雜度的原因,SFM公式可基於泰勒展開(Taylor expansion)、查找表(look-up table)或在軟體建置之領域的熟練者普遍知道的類似技術而替代的由其之數值近似(numerical approximation)取代。進一步而言,亦有其它測量頻譜平坦度之方法,例如像是標準差或頻率功率直條(bin)之最小和最大之間的差等。在本文件中,術語「SFM」意指這些測量中的任一 者。 In order to limit the computational complexity, the SFM formula can be approximated by a numerical expansion based on Taylor expansion, a look-up table, or a similar technique generally known to those skilled in the art of software construction. (numerical approximation) replaced. Further, there are other methods of measuring spectral flatness, such as, for example, the difference between the minimum and maximum of a standard deviation or a frequency power bin. In this document, the term "SFM" means any of these measurements. By.
使用用於特定時間周期或側訊號之時框的SFM值,能決定衝擊因子。為此目的,將SFM映射上至例如0到1之標度。SFM衝擊因子之映射和決定可依據下式決定
WO PCT/EP2011/064084說明藉由可靠的單聲道偵測器(mono-detector)結合使用先前估計的立體聲參數的機制而隱蔽FM接收器1之單聲道接收的短間隔的技術以於這類單聲道時間間隔期間產生雜訊降低FM立體聲訊號。在WO PCT/EP2011/064084中概述的技術亦能應用到在本文件中說明的預測式FM無線電雜訊降低系統。 WO PCT/EP2011/064084 describes a technique for concealing the short interval of the mono reception of the FM receiver 1 by means of a reliable mono-detector in combination with the previously estimated stereo parameters. Noise is generated during the class-like time interval to reduce the FM stereo signal. The technique outlined in WO PCT/EP2011/064084 can also be applied to the predictive FM radio noise reduction system described in this document.
如上所指,由於時變/時間變量(time-variant)不良接收條件(例如「衰退(fading)」),FM 接收器1可在立體聲及單聲道之間轉態(toggle)。為在單聲道/立體聲切換期間維持立體聲的聲像(sound image),錯誤隱蔽技術可用以隱蔽短的單聲道降退。在預測式FM無線電雜訊中隱蔽的手法係為使用預測和解相關參數a和b,其係基於先前估計的參數,以防因為FM接收器1之音頻輸出已降下到單聲道而不能計算新的參數a和b的情形。因此,當FM立體聲接收器1切換到單聲道音頻輸出時,圖3之預測式FM無線電雜訊降低系統持續使用先前估計的參數a和b(個別的針對各頻率帶)。若在立體聲輸出中的降退周期夠短使得在降退周期期間FM無線電訊號之立體聲的聲像保持相似的話,在設備2之音頻輸出中降退是聽不見的或是僅僅是幾乎聽不見的。另一個手法可為從先前估計的參數內插及/或外插參數a和b。假使,FM接收並不夠快速的回到立體聲,參數a和b能緩慢的衰減以在少許時間之後接近0,其意味就單聲道訊號(亦即中訊號)被輸出。 As indicated above, the FM receiver 1 can toggle between stereo and mono due to time-variant poor reception conditions (eg, "fading"). To maintain stereo sound image during mono/stereo switching, error concealment techniques can be used to conceal short mono downsizing. The hidden method in predictive FM radio noise is to use the prediction and decorrelation parameters a and b , which are based on previously estimated parameters, in case the audio output of the FM receiver 1 has been lowered to mono and cannot be calculated new. The case of parameters a and b . Thus, when the FM stereo receiver 1 switches to the mono audio output, the predictive FM radio noise reduction system of Figure 3 continues to use the previously estimated parameters a and b (individual for each frequency band). If the fallback period in the stereo output is short enough that the stereo image of the FM radio signal remains similar during the fallback period, the fallback in the audio output of device 2 is inaudible or only barely audible. . Another approach may be to interpolate and/or extrapolate parameters a and b from previously estimated parameters. If the FM reception is not fast enough to return to stereo, the parameters a and b can be slowly attenuated to approach 0 after a few times, which means that the mono signal (i.e., the medium signal) is output.
替代的或此外,預測式FM立體聲雜訊降低系統可參數對a及/或b使用內定值(default value)產生「假立體聲」訊號,以防接收條件如此不良使僅接收單聲道訊號的情形。內定值可取決於中訊號之語音/音樂分類。換句話說,預測式FM立體聲雜訊降低系統可包含分類器以用於基於接收的中訊號而分類接收的FM無線電訊號之類型。藉範例的方式,分類器可組態成將接收的FM無線電訊號分類為語音訊號或音樂訊號(例如基於接收的 中訊號之頻率分析)。預測式FM立體聲雜訊降低系統可接著基於接收的FM無線電訊號之決定的類型來選擇用於a及/或b的適當的值。如此一來,接收的FM無線電訊號之單聲道降退可使用(類型相關)內定參數值而隱蔽。 Alternatively or in addition, the predictive FM stereo noise reduction system may parameterize a and/or b to generate a "false stereo" signal using a default value to prevent the reception condition from being so bad that only mono signals are received. situation. The default value can depend on the voice/music classification of the medium signal. In other words, the predictive FM stereo noise reduction system can include a classifier for classifying the type of received FM radio signal based on the received medium signal. By way of example, the classifier can be configured to classify the received FM radio signal into a voice signal or a music signal (eg, based on the frequency analysis of the received medium signal). The predictive FM stereo noise reduction system can then select an appropriate value for a and/or b based on the type of decision of the received FM radio signal. As a result, the mono drop of the received FM radio signal can be concealed using (type dependent) default parameter values.
在預測式FM無線電雜訊降低系統內使用隱蔽需要單聲道降退之可靠的偵測,以為了觸發隱蔽,亦即為了從0到1設定隱蔽狀態δ conceal 。可能的單聲道/立體聲偵測器能夠基於偵測訊號之單聲道區段,其滿足條件為左訊號=右訊號(或左訊號-右訊號=0)。然而,這類單聲道/立體聲偵測器會導致對於隱蔽處理之不穩定的行為,其係由於左訊號和右訊號能量以及側訊號能量能變動很大(即使在健全的接收條件下)的事實。 The use of concealment in predictive FM radio noise reduction systems requires reliable detection of mono-destination in order to trigger concealment, ie to set the concealed state δ conceal from 0 to 1. A possible mono/stereo detector can be based on a mono segment of the detected signal, which satisfies the condition of left signal = right signal (or left signal - right signal = 0). However, such mono/stereo detectors can cause unstable behavior for concealed processing because the left and right signal energies and side signal energies can vary widely (even under sound reception conditions). fact.
為了避免這類隱蔽之不穩定行為,能夠將單聲道/立體聲偵測和隱蔽機制建置為狀態機(state machine)。範例的狀態機係在圖6中闡述。圖6之狀態機利用側訊號S之絕對能量的兩個參考級數,亦即ES(或如上所界定的PS)。用以計算Es的側訊號S可能以典型250Hz之截止頻率而被高通濾波。這些參考級數係為上部參考級數ref_high和下部參考級數ref_low。在上部參考級數(ref_high)之上訊號視為立體聲,而在下部參考級數(ref_low)之下其則視為單聲道。 In order to avoid such hidden and unstable behavior, the mono/stereo detection and concealment mechanism can be built as a state machine. An exemplary state machine is illustrated in Figure 6. The state machine of Figure 6 utilizes two reference levels of the absolute energy of the side signal S, namely E S (or P S as defined above). The side signal S used to calculate E s may be high pass filtered with a cutoff frequency of typically 250 Hz. These reference series are the upper reference level ref_high and the lower reference level ref_low. The signal is considered stereo above the upper reference level (ref_high) and mono in the lower reference level (ref_low).
側訊號能量ES係計算為狀態機之控制參數。ES可經一段時間窗而計算,時間窗可以例如對應到參數a及b之有效性的時間周期。換句話說,決定側訊號能量之 頻率可與決定參數a和b之頻率對準。在本文件中,用於決定側訊號能量ES之時間周期(以及可能地決定參數a和b)參照為訊號時框。圖6之狀態機包含5個條件(其每次計算新的時框之能量ES進行驗證): The side signal energy E S is calculated as the control parameter of the state machine. E S can be calculated over a period of time window, which can for example correspond to a time period of validity of parameters a and b . In other words, the frequency at which the side signal energy is determined can be aligned with the frequency at which parameters a and b are determined. In this document, the time period for determining the side signal energy E S (and possibly the parameters a and b ) is referred to as the signal time frame. The state machine of Figure 6 contains five conditions (which are verified each time a new time frame energy E S is calculated):
- 條件A指示側訊號能量ES超過上部參考級數ref_high。上部參考級數可參照為較高臨界。 - Condition A indicates that the side signal energy E S exceeds the upper reference level ref_high. The upper reference series can be referred to as a higher criticality.
- 條件B指示側訊號能量ES低於或等於上部參考級數ref_high且高於或等於下部參考級數ref_low。下部參考級數可參照為較低臨界。 - Condition B indicates that the side signal energy E S is lower than or equal to the upper reference level ref_high and higher than or equal to the lower reference level ref_low. The lower reference series can be referred to as a lower critical.
- 條件B1對應到條件B,但添加額外的時間條件。時間條件規定條件B滿足小於時框之臨界數目或小於臨界時間。此臨界可參照為時框臨界。 - Condition B1 corresponds to condition B, but additional time conditions are added. The time condition specifies that condition B satisfies a critical number less than the time frame or less than the critical time. This criticality can be referred to as the time frame critical.
- 條件B2對應到條件B,具有規定條件B滿足大於或等於時框之臨界數目或者大於或等於臨界時間的額外時間條件。 - Condition B2 corresponds to condition B, with an additional time condition that specifies that condition B satisfies the critical number of boxes greater than or equal to or greater than or equal to the critical time.
- 條件C指示側訊號能量ES低於下部參考級數ref_low。 - Condition C indicates that the side signal energy E S is lower than the lower reference level ref_low.
進一步而言,圖6之範例狀態機利用5個狀態。不同的狀態受到上面所提的條件且受到在圖6中闡述的狀態示意圖而到達。以下作動通常於預測式FM無線電立體聲雜訊降低系統內在不同的狀態中施行: Further, the example state machine of Figure 6 utilizes five states. The different states are subject to the conditions mentioned above and are reached by the state diagrams illustrated in Figure 6. The following actions are typically performed in different states within the predictive FM radio stereo noise reduction system:
- 在狀態1中,施行常態立體聲操作,例如基於從目前音頻訊號決定的參數a及b。隱蔽狀態δ conceal 保持在0。 - In state 1, a normal stereo operation is performed, for example based on parameters a and b determined from the current audio signal. The concealed state δ conceal remains at zero.
- 在狀態2中,常態立體聲操作係基於在目前音頻訊號上決定的參數a及b而施行。此狀態僅為過渡的,鑒於在時框之數目大於或等於時框臨界,或者時間大於或等於時間臨界(亦即,條件B2)或者在此時框之數目的經過或時間的經過之前之任一段時間滿足狀態B的事實,則滿足條件A或C。隱蔽狀態δ conceal 保持在0。 - In State 2, normal stereo operation is performed based on parameters a and b determined on the current audio signal. This state is only transitional, since the frame is critical when the number of time frames is greater than or equal to, or the time is greater than or equal to the time threshold (ie, condition B2) or before the elapse of time or the number of frames at this time. If the fact that the state B is satisfied for a while, the condition A or C is satisfied. The concealed state δ conceal remains at zero.
- 在狀態3中,立體聲操作係基於在目前音頻訊號上決定的參數a及b而施行。能看見的是,能在從狀態1經由狀態2去到狀態3的路徑上到達狀態3。鑒於條件B2需要最小數目的時框或最小數目的時間用於轉移的事實,路徑「狀態1、狀態2、狀態3」代表從常態立體聲操作(例如,音樂)到常態單聲道操作(例如,語音)的緩慢的(亦即平滑的)轉移。隱蔽狀態δ conceal 係設定在或保持在0。 - In state 3, stereo operation is performed based on parameters a and b determined on the current audio signal. It can be seen that state 3 can be reached on the path from state 1 to state 3 via state 2. In view of the fact that condition B2 requires a minimum number of time frames or a minimum number of times for transfer, the path "State 1, State 2, State 3" represents from normal stereo operation (eg, music) to normal mono operation (eg, Slow (ie smooth) transfer of speech). The concealed state δ conceal is set at or maintained at zero.
- 在狀態4中,單聲道降退隱蔽係使用先前決定的參數a和b開始,例如曾在狀態1中決定的最新的參數a和b。能看到的是,若滿足條件C的話,亦即若側訊號能量ES陡峭的從ref_high以上下降到ref_low以下的話,能從狀態1直接到達狀態4。或者,然而僅若僅於少許數目的時框或僅於短暫的周期時間而滿足條件B的話,能從狀態1經由狀態2到達狀態4。如此一來,路徑「狀態1、狀態4」以及「狀態1、狀態2、狀態4」代表從常態立體聲操作(例如,音樂) 快速的,亦即突然的轉移到強制單聲道操作。若在立體聲多工訊號中19kHz導頻音(pilot tone)之級數或強度落到預定級數以下的話,強制單聲道操作通常係由於FM接收器其例如突然的截止側訊號,因此從接收的立體聲多工訊號做成可靠的解調側訊號是不可能的。隱蔽狀態δ conceal 係設定到1,以為了指示在預測式FM無線電雜訊降低系統內使用隱蔽。 - In state 4, the mono-reduction concealment begins with the previously determined parameters a and b , such as the most recent parameters a and b that were determined in state 1. It can be seen that if the condition C is satisfied, that is, if the side signal energy E S falls steeply from ref_high or more to less than ref_low, the state 4 can be directly reached from the state 1. Alternatively, however, state 4 can be reached from state 1 via state 2 only if condition B is satisfied only for a small number of time frames or only for a short cycle time. As a result, the paths "state 1, state 4" and "state 1, state 2, state 4" represent a fast, ie abrupt transition from, a normal stereo operation (eg, music) to a forced mono operation. If the series or intensity of the 19 kHz pilot tone falls below a predetermined number of levels in the stereo multiplex signal, the forced mono operation is usually received by the FM receiver, for example, by a sudden cut-off side signal. It is impossible to make a stereo demodulation signal to make a reliable demodulation side signal. The concealed state δ conceal is set to 1 to indicate the use of concealment within the predictive FM radio noise reduction system.
- 在狀態5中,例如基於已在狀態4中建立的參數a和b,則持續單聲道降退隱蔽。在闡述的實施例中,若滿足狀態C的話,僅能從狀態4到達狀態5,亦即狀態5代表穩定的單聲道降退隱蔽狀態,在其中使用先前決定的參數a和b以為了從接收的中訊號產生雜訊降低側訊號。參數a和b可以少許秒之時間常數衰減到0,造成緩慢的從立體聲轉移到單聲道的輸出訊號。隱蔽狀態δ conceal 通常保持在1。 - In state 5, for example based on the parameters a and b that have been established in state 4, the mono drop concealment is continued. In the illustrated embodiment, if state C is satisfied, state 5 can only be reached from state 4, that is, state 5 represents a stable mono drop concealed state in which previously determined parameters a and b are used in order to The received medium signal generates a noise reduction side signal. The parameters a and b can decay to zero with a time constant of a few seconds, resulting in a slow transition from stereo to mono output. The hidden state δ conceal is usually kept at 1.
如已指示者,闡述的狀態示意圖確保僅若由FM接收器接收的音頻訊號在少許時間窗/時框內從立體聲去到單聲道,亦即若從立體聲到單聲道之轉移為突然的話,則觸發隱蔽。在另一方面,於在具有低於立體聲級數(ref_high),但高於單聲道級數(ref_low)的能量ES之側訊號中有雜訊的情形中,亦即於在側訊號內仍有充足的資訊以產生適當的a及b的情形中,則防止觸發隱蔽。同時,即使當訊號從立體聲改變到單聲道時,例如當訊號從音樂轉變到語音時,將不會觸發隱蔽偵測,藉此確保原始 單聲道訊號不會由於錯誤的應用隱蔽而被呈現成為人為的立體聲訊號。基於從ref_high之上到ref_low之下側訊號能量ES之平滑的轉移,能偵測從立體聲到單聲到的真實轉移。 As indicated, the state diagram illustrated ensures that only the audio signal received by the FM receiver goes from stereo to mono in a small time window/time frame, ie if the transition from stereo to mono is sudden. , triggers concealment. On the other hand, in the case where there is noise in the side signal having the energy level E S lower than the stereo level (ref_high) but higher than the mono level (ref_low), that is, in the side signal In the case where there is still sufficient information to generate the appropriate a and b , it is prevented from triggering concealment. At the same time, even when the signal changes from stereo to mono, such as when the signal transitions from music to speech, it will not trigger covert detection, thereby ensuring that the original mono signal is not presented due to erroneous application concealment. Become an artificial stereo signal. Based on the smooth transition from the ref_high to the lower signal energy E S below the ref_low, the true transition from stereo to mono can be detected.
在本文件中,已說明用於改善FM無線電接收器之感知效能的方法與系統。特別是,已說明用於使用預測式手法決定雜訊降低FM立體聲訊號的方法及系統。藉使用預測式FM無線電雜訊降低系統,相較於PS為基的FM無線電雜訊降低系統,能降低用於雜訊降低的計算複雜度。進一步而言,已說明用於改善預測式FM無線電雜訊降低系統之效能的各種方法。特別是,已說明使用品質指示器以在雜訊降低側訊號與原始側訊號之間混合。進一步而言,已說明用於將預測式FM無線電雜訊降低系統之參數適用到接收的側訊號之頻譜特性的方法,藉此可靠的在嘈雜的與良好的接收條件之間進行區別。此外,已說明隱蔽方法以為了將預測式FM無線電雜訊降低系統適用到單聲道降退情況。 In this document, methods and systems for improving the perceived performance of an FM radio receiver have been described. In particular, methods and systems for determining noise reduction FM stereo signals using predictive techniques have been described. By using a predictive FM radio noise reduction system, the computational complexity for noise reduction can be reduced compared to a PS-based FM radio noise reduction system. Further, various methods for improving the performance of predictive FM radio noise reduction systems have been described. In particular, the use of a quality indicator has been described to mix between the noise reduction side signal and the original side signal. Further, a method for applying the parameters of the predictive FM radio noise reduction system to the spectral characteristics of the received side signals has been described, thereby reliably distinguishing between noisy and good reception conditions. In addition, the concealment method has been described in order to apply the predictive FM radio noise reduction system to the mono drop condition.
在本文件中說明的方法與系統可建置為軟體、韌體及/或硬體。某些組件可例如建置為在數位訊號處理器或微處理器上運行的軟體。其它組件可例如建置為硬體及或特定應用積體電路(application specific integrated circuits)。在說明的方法及系統中遭遇的訊號可儲存在媒體上,像是隨機存取記憶體或光學儲存媒體。他們可經由網路傳送,像是無線電網路、衛星網路、無線 網路或有線網路(例如網際網路)。利用在本文件中說明的方法與系統之典型的裝置係為可攜式電子裝置或其它用以儲存及/或呈現音頻訊號的消費設備。 The methods and systems described in this document can be implemented as software, firmware, and/or hardware. Some components may be implemented, for example, as software running on a digital signal processor or microprocessor. Other components may be implemented, for example, as hardware and or application specific integrated circuits. The signals encountered in the described methods and systems can be stored on the media, such as random access memory or optical storage media. They can be transmitted via the Internet, such as radio networks, satellite networks, wireless Network or wired network (such as the Internet). A typical device utilizing the methods and systems described in this document is a portable electronic device or other consumer device for storing and/or presenting audio signals.
2‧‧‧設備 2‧‧‧ Equipment
20‧‧‧偵測單元 20‧‧‧Detection unit
30‧‧‧旁通增益單元 30‧‧‧Bypass gain unit
31‧‧‧雜訊降低增益單元 31‧‧‧ Noise reduction gain unit
32‧‧‧合併單元 32‧‧‧Merge unit
71‧‧‧濾波器排 71‧‧‧Filter row
72‧‧‧濾波器排 72‧‧‧ Filter row
73‧‧‧濾波器排 73‧‧‧Filter row
74‧‧‧延遲 74‧‧‧Delay
75‧‧‧LR-to-MS轉換器 75‧‧‧LR-to-MS converter
76‧‧‧MS-to-LR轉換器 76‧‧‧MS-to-LR converter
77‧‧‧參數決定單元 77‧‧‧Parameter decision unit
78‧‧‧解相關器 78‧‧‧Resolver
79‧‧‧雜訊降低單元 79‧‧‧ Noise Reduction Unit
80‧‧‧薄線 80‧‧‧ thin line
81‧‧‧厚線 81‧‧‧ thick line
82‧‧‧點線 82‧‧‧ dotted line
Claims (34)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW103103758A TW201532035A (en) | 2014-02-05 | 2014-02-05 | Prediction-based FM stereo radio noise reduction |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW103103758A TW201532035A (en) | 2014-02-05 | 2014-02-05 | Prediction-based FM stereo radio noise reduction |
Publications (1)
Publication Number | Publication Date |
---|---|
TW201532035A true TW201532035A (en) | 2015-08-16 |
Family
ID=54343165
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW103103758A TW201532035A (en) | 2014-02-05 | 2014-02-05 | Prediction-based FM stereo radio noise reduction |
Country Status (1)
Country | Link |
---|---|
TW (1) | TW201532035A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI620171B (en) * | 2016-01-19 | 2018-04-01 | 博姆雲360公司 | Method for performing crosstalk simulation on an input audio signal, audio processing system, and non-transitory computer readable medium |
US10225657B2 (en) | 2016-01-18 | 2019-03-05 | Boomcloud 360, Inc. | Subband spatial and crosstalk cancellation for audio reproduction |
US10313820B2 (en) | 2017-07-11 | 2019-06-04 | Boomcloud 360, Inc. | Sub-band spatial audio enhancement |
US10764704B2 (en) | 2018-03-22 | 2020-09-01 | Boomcloud 360, Inc. | Multi-channel subband spatial processing for loudspeakers |
US10841728B1 (en) | 2019-10-10 | 2020-11-17 | Boomcloud 360, Inc. | Multi-channel crosstalk processing |
TWI854496B (en) * | 2023-02-21 | 2024-09-01 | 瑞昱半導體股份有限公司 | Watermark-based audio processing method and audio player |
-
2014
- 2014-02-05 TW TW103103758A patent/TW201532035A/en unknown
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10225657B2 (en) | 2016-01-18 | 2019-03-05 | Boomcloud 360, Inc. | Subband spatial and crosstalk cancellation for audio reproduction |
US10721564B2 (en) | 2016-01-18 | 2020-07-21 | Boomcloud 360, Inc. | Subband spatial and crosstalk cancellation for audio reporoduction |
TWI620171B (en) * | 2016-01-19 | 2018-04-01 | 博姆雲360公司 | Method for performing crosstalk simulation on an input audio signal, audio processing system, and non-transitory computer readable medium |
US10009705B2 (en) | 2016-01-19 | 2018-06-26 | Boomcloud 360, Inc. | Audio enhancement for head-mounted speakers |
US10313820B2 (en) | 2017-07-11 | 2019-06-04 | Boomcloud 360, Inc. | Sub-band spatial audio enhancement |
US10764704B2 (en) | 2018-03-22 | 2020-09-01 | Boomcloud 360, Inc. | Multi-channel subband spatial processing for loudspeakers |
US10841728B1 (en) | 2019-10-10 | 2020-11-17 | Boomcloud 360, Inc. | Multi-channel crosstalk processing |
US11284213B2 (en) | 2019-10-10 | 2022-03-22 | Boomcloud 360 Inc. | Multi-channel crosstalk processing |
TWI854496B (en) * | 2023-02-21 | 2024-09-01 | 瑞昱半導體股份有限公司 | Watermark-based audio processing method and audio player |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2845202C (en) | Prediction-based fm stereo radio noise reduction | |
JP5809754B2 (en) | High quality detection in FM stereo radio signal | |
CN103069721B (en) | Reduce the pseudo-irrelevance in FM radio broadcast noise | |
CN103098131B (en) | Concealment of intermittent mono reception of fm stereo radio receivers | |
EP3035712B1 (en) | Improvement of an audio signal of an fm stereo radio receiver by using parametric stereo | |
TW201532035A (en) | Prediction-based FM stereo radio noise reduction | |
TWM527596U (en) | An apparartus for prediction-based FM stereo radio noise reduction |