CN108053829A - A kind of cochlear implant coding method based on cochlea sense of hearing Nonlinear Dynamics - Google Patents
A kind of cochlear implant coding method based on cochlea sense of hearing Nonlinear Dynamics Download PDFInfo
- Publication number
- CN108053829A CN108053829A CN201711486488.3A CN201711486488A CN108053829A CN 108053829 A CN108053829 A CN 108053829A CN 201711486488 A CN201711486488 A CN 201711486488A CN 108053829 A CN108053829 A CN 108053829A
- Authority
- CN
- China
- Prior art keywords
- cochlea
- cochlear
- signal
- nonlinear
- active
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/75—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 for modelling vocal tract parameters
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Prostheses (AREA)
Abstract
本发明公开了一种基于耳蜗听觉非线性动力学机理的电子耳蜗编码方法,包括(1)建立耳蜗非线性动力学模型;(2)构建非线性耳蜗阵列;非线性耳蜗阵列为一组包含n个不同固有频率的主动仿真模块,每一个主动仿真模块将接收的输入语音信号进行相应的运算后,获得每一个主动仿真模块的实时响应输出信号;(3)将各个主动仿真模块的实时响应输出信号进行整流和低通滤波后获得振幅包络;并将频率和振幅包络转换为脉冲信号传递给其对应的电极。本发明利用耳蜗非线性动力学模型使得经过处理后的声信号能够展现出与耳蜗处理结果类似的非线性动态范围压缩效应,并产生与音调相关的结合音,从而解决传统电子耳蜗动态范围小,音调信息传递困难的问题。
The invention discloses an electronic cochlear encoding method based on the nonlinear dynamic mechanism of cochlear hearing, which comprises (1) establishing a nonlinear dynamic model of the cochlea; (2) constructing a nonlinear cochlear array; the nonlinear cochlear array is a group comprising n Active simulation modules with different natural frequencies, each active simulation module performs corresponding calculations on the received input voice signal, and obtains the real-time response output signal of each active simulation module; (3) outputs the real-time response of each active simulation module After the signal is rectified and low-pass filtered, the amplitude envelope is obtained; and the frequency and amplitude envelopes are converted into pulse signals and transmitted to the corresponding electrodes. The present invention utilizes the cochlear nonlinear dynamic model to enable the processed acoustic signal to exhibit a nonlinear dynamic range compression effect similar to that of the cochlear processing result, and to generate combined tones related to the pitch, thereby solving the problem of the small dynamic range of the traditional electronic cochlea. Difficult problems with tonal information transfer.
Description
技术领域technical field
本发明属于生物医学工程领域,更具体地,涉及一种基于耳蜗听觉非线性动力学机理的电子耳蜗编码方法。The invention belongs to the field of biomedical engineering, and more specifically relates to an electronic cochlear encoding method based on the nonlinear dynamic mechanism of cochlear hearing.
背景技术Background technique
听觉作为人类与外界交流的重要信息通道,是人类生存发展、感知世界的重要途径,然而世界上却有很大一部分人,由于各种原因生活在无声的世界。据世界卫生组织统计,全球目前有3.6亿人患有耳聋或听力障碍,占全球总人数的5%,绝大多数的深度耳聋患者无法将声信号正常转换为传向听神经电信号使其感受声音信息。电子耳蜗是一种通过电刺激,将声音信号转换为听神经信号,从而使完全丧失听力的全聋患者重获听觉的人体植入听力康复装置,主要包括语音处理器和植入电极两部分。语音处理器首先将声信号转换为植入耳蜗中不同部位的电极上的电刺激信号,由这些电极上的电刺激信号激发、诱导听神经产生传向大脑的神经电信号,从而使患者听到声音。目前,国内外已有多个品牌的电子耳蜗在临床上使用,它们为耳聋患者带来了福音,使其重回有声世界。现有电子耳蜗能使患者获得一定程度的言语感知能力,但是其性能仍有很大改进的空间。例如,电子耳蜗只能提供20dB的动态感知范围,而正常人能感受120dB动态范围的声音。另一个显著的问题是电子耳蜗还不能使患者有效感受声音的音调信息,使其在音调、语调以及音乐感受上存在困难。As an important information channel for human beings to communicate with the outside world, hearing is an important way for human beings to survive and develop and perceive the world. However, a large number of people in the world live in a silent world due to various reasons. According to the statistics of the World Health Organization, there are currently 360 million people in the world who suffer from deafness or hearing impairment, accounting for 5% of the total number of people in the world. The vast majority of patients with profound deafness cannot convert the sound signal into an electrical signal that is transmitted to the auditory nerve so that they can feel the sound information. Cochlear Electronics is a human implanted hearing rehabilitation device that converts sound signals into auditory nerve signals through electrical stimulation, thereby enabling total deaf patients with complete hearing loss to regain hearing. It mainly includes two parts: a voice processor and implanted electrodes. The speech processor first converts the acoustic signal into an electrical stimulation signal on the electrodes implanted in different parts of the cochlea, and the electrical stimulation signal on these electrodes excites and induces the auditory nerve to generate a nerve electrical signal that is transmitted to the brain, so that the patient can hear the sound . At present, many brands of cochlear implants have been used clinically at home and abroad, and they have brought good news to deaf patients, allowing them to return to the world of sound. Existing cochlear implants can enable patients to obtain a certain degree of speech perception, but there is still much room for improvement in their performance. For example, cochlear implants can only provide a dynamic perception range of 20dB, while normal people can perceive sounds with a dynamic range of 120dB. Another notable problem is that cochlear implants still cannot enable patients to effectively perceive the pitch information of sounds, making them difficult in pitch, intonation, and musical perception.
电子耳蜗的听力康复效果取决于电刺激引起的听神经信号能否像正常人的听神经信号那样正确传递声音信息。研究发现,耳蜗具有频率分析功能,它将不同频率的声信号通过不同部位的神经通道传入大脑,而信号的大小则通过听神经中单位是时间的脉冲个数,也即发放率来表达。这简称为频率-部位、强度-发放率编码机理。现有的电子耳蜗均以此为基础制定相应的声音信号处理与编码方案。目前,电子耳蜗的语音处理方案大致可以分为两大类:一类是提取语音信号的基频和共振峰等特征信息,称为SPEAK方案。例如Nucleus耳蜗植入装置所使用的F0/F1/F2解决方法;另一类通过对语音信号进行分频段滤波处理,然后根据耳蜗频率分布,将对应频率信号通过电极传送到耳蜗的不同区域。例如目前被广泛使用的,连续间隔采样语音信号处理器(Continuous Interleaved Sampling,CIS)。上述编码方案,无论是运用共振峰的SPEAK方案,还是CIS方案,对于耳蜗这个重要的部位的信号处理,采用的是频谱分析或者是带通滤波来模拟。它们能够在一定程度上反映在反应其频率-部位、强度-发放率编码,但不能反映耳蜗的非线性。The hearing rehabilitation effect of cochlear implants depends on whether the auditory nerve signals caused by electrical stimulation can transmit sound information correctly like the auditory nerve signals of normal people. Studies have found that the cochlea has a frequency analysis function, which transmits acoustic signals of different frequencies to the brain through nerve channels in different parts, and the magnitude of the signal is expressed by the number of pulses in the auditory nerve with the unit of time, that is, the firing rate. This is referred to as the frequency-site, intensity-firing rate coding mechanism for short. The existing cochlear electrons are based on this to formulate corresponding sound signal processing and coding schemes. At present, the voice processing schemes of cochlear implants can be roughly divided into two categories: one is to extract characteristic information such as the fundamental frequency and formant of the voice signal, which is called the SPEAK scheme. For example, the F0/F1/F2 solution used by the Nucleus cochlear implant device; the other type filters the speech signal in frequency bands, and then transmits the corresponding frequency signal to different regions of the cochlea through electrodes according to the cochlear frequency distribution. For example, a continuous interval sampling speech signal processor (Continuous Interleaved Sampling, CIS) is currently widely used. For the above coding schemes, whether it is the SPEAK scheme using the formant or the CIS scheme, the signal processing of the important part of the cochlea is simulated by spectrum analysis or band-pass filtering. They can reflect its frequency-site, intensity-firing rate coding to a certain extent, but cannot reflect the nonlinearity of the cochlea.
我们长期从事听觉外周的信号处理机制研究,在国家自然科学基金支持下开展的《耳蜗音调信息感知的激光干涉研究》的最新结果表明,耳蜗还通过非线性对音调的感知发挥作用,使其能对基频缺失信号以及准周期信号的音调通过非线性得以表达。这意味着现有电子耳蜗编码方案采用的频谱分析、带通滤波等线性计算在反映耳蜗信号处理上存在缺陷,需要有更好的方法来替代。国际上也注意到耳蜗的非线性在听觉信号处理中的重要性,认为耳蜗可以看成是一个停留在Hopf分叉点的振子,但是只是一个数学上的抽像方程,用以反映耳蜗具有的部分典型非线性特征,实际行为与真实耳蜗还存差距。We have long been engaged in the research on the signal processing mechanism of the auditory periphery. The latest results of "Laser Interference Research on Cochlear Pitch Information Perception" supported by the National Natural Science Foundation of China show that the cochlea also plays a role in the perception of pitch through nonlinearity, enabling it to The pitch for fundamental-frequency-absent signals and quasi-periodic signals is expressed by nonlinearity. This means that the linear calculations such as spectrum analysis and band-pass filtering used in existing cochlear coding schemes have defects in reflecting cochlear signal processing, and better methods are needed to replace them. The international community has also noticed the importance of the nonlinearity of the cochlea in auditory signal processing. It is believed that the cochlea can be regarded as an oscillator that stays at the Hopf bifurcation point, but it is only a mathematical abstract equation to reflect the cochlea's uniqueness. Some typical non-linear features, there is still a gap between the actual behavior and the real cochlea.
发明内容Contents of the invention
针对现有技术的缺陷,本发明的目的在于提供一种基于耳蜗听觉非线性动力学机理的电子耳蜗编码方法,旨在解决采用传统耳蜗模拟计算方法的电子耳蜗动态范围小,音调信息传递困难的问题。Aiming at the defects of the prior art, the purpose of the present invention is to provide a cochlear electronic coding method based on the nonlinear dynamic mechanism of cochlear hearing, aiming at solving the problems of small dynamic range and difficult tone information transmission using the traditional cochlear simulation calculation method. question.
本发明提供了一种基于耳蜗听觉非线性动力学机理的电子耳蜗编码方法,包括下述步骤:The invention provides a cochlear coding method based on the nonlinear dynamic mechanism of cochlear hearing, which comprises the following steps:
(1)建立耳蜗非线性动力学模型;(1) Establish a nonlinear dynamic model of the cochlea;
(2)根据耳蜗非线性动力学模型构建非线性耳蜗阵列;(2) Construct a nonlinear cochlear array according to the nonlinear dynamic model of the cochlea;
所述非线性耳蜗阵列为一组包含n个不同固有频率的主动仿真模块,每一个主动仿真模块将接收的输入语音信号按照所述耳蜗非线性动力学模型进行相应的运算后,获得每一个主动仿真模块的实时响应输出信号;The nonlinear cochlear array is a group of active simulation modules containing n different natural frequencies, and each active simulation module performs corresponding calculations on the received input voice signals according to the cochlear nonlinear dynamics model to obtain each active simulation module. The real-time response output signal of the simulation module;
(3)将各个主动仿真模块的实时响应输出信号进行整流和低通滤波后获得所述实时响应输出信号的振幅包络;并将所述实时响应输出信号的频率和振幅包络转换为脉冲信号传递给不同部位的电极;(3) Rectify and low-pass filter the real-time response output signal of each active simulation module to obtain the amplitude envelope of the real-time response output signal; and convert the frequency and amplitude envelope of the real-time response output signal into a pulse signal Electrodes delivered to different parts;
其中,n为主动仿真模块的数目,n取大于等于1的整数。Wherein, n is the number of active simulation modules, and n is an integer greater than or equal to 1.
更进一步地,所述耳蜗非线性动力学模型为:其中,x为基底膜偏离平衡位置位移,t为时间,γ为阻尼系数,γα为自适力系数,B为外毛细胞电致伸缩系数,x0为外毛细胞原长,ωi为耳蜗该部位的固有圆频率,S(t)为输入语音信号,xi(t)为第i个主动仿真模块的实时响应输出信号,i为主动仿真模块的序号,i=1,2,3......n。Furthermore, the nonlinear dynamic model of the cochlea is: Among them, x is the displacement of the basement membrane from the equilibrium position, t is the time, γ is the damping coefficient, γ α is the adaptive force coefficient, B is the electrostriction coefficient of the outer hair cell, x0 is the original length of the outer hair cell, and ω i is the cochlea The natural circular frequency of this part, S(t) is the input voice signal, x i (t) is the real-time response output signal of the i-th active simulation module, i is the serial number of the active simulation module, i=1,2,3. … n.
更进一步地,耳蜗非线性动力学模型中自适力系数γα应满足如下范围:0<γα≤γ,在此范围内γα值越大,主动仿真模块对其固有频率附近的语音信号的放大作用越大。Furthermore, the self-adaptive force coefficient γ α in the nonlinear dynamic model of the cochlea should satisfy the following range: 0<γ α ≤ γ. The greater the magnification effect.
更进一步地,n个主动仿真模块的固有频率可按照如下方式设定:对于固有频率范围为a~a*eε(n-1)Hz(ε<1)的非线性耳蜗阵列,其中第i个主动仿真模块的固有频率为fi=a*eε(i-1)Hz;i为主动仿真模块的序号,i=1,2,3......n。Furthermore, the natural frequencies of n active simulation modules can be set as follows: For a nonlinear cochlear array with a natural frequency range of a~a*e ε(n-1) Hz (ε<1), the i-th The natural frequency of each active emulation module is f i =a*e ε(i-1) Hz; i is the serial number of the active emulation module, i=1,2,3...n.
更进一步地,20Hz≤a≤200Hz。Furthermore, 20Hz≤a≤200Hz.
通过本发明所构思的以上技术方案,与现有技术相比,由于引入了利用耳蜗非线性动力学模型的非线性耳蜗阵列代替传统的被动滤波器组以及压缩模块对声信号进行处理,使得经过处理后的声信号能够展现出与耳蜗处理结果类似的非线性动态范围压缩效应,对小信号进行放大,对大信号进行压缩,从而解决传统电子耳蜗动态范围小的问题。同时经过处理后的声信号能够展现与音调相关的结合音信息,从而解决传统电子耳蜗动态范围小,音调信息传递困难的问题。Through the above technical scheme conceived by the present invention, compared with the prior art, due to the introduction of the nonlinear cochlear array using the cochlear nonlinear dynamics model instead of the traditional passive filter bank and the compression module to process the acoustic signal, so that after The processed acoustic signal can exhibit a nonlinear dynamic range compression effect similar to the cochlear processing result, amplifying small signals and compressing large signals, thereby solving the problem of small dynamic range of traditional cochlear electronics. At the same time, the processed acoustic signal can display combined sound information related to the pitch, thereby solving the problems of the traditional cochlear electronic cochlear with a small dynamic range and difficult transfer of pitch information.
附图说明Description of drawings
图1为本发明运用耳蜗非线性动力学模型实现新型电子耳蜗语音处理策略的技术框图。Fig. 1 is a technical block diagram of implementing a novel cochlear electronic speech processing strategy using a cochlear nonlinear dynamic model in the present invention.
图2为主动仿真模块在325Hz+425Hz的激励信号下,系统的振动响应波形图。Figure 2 is a waveform diagram of the vibration response of the system under the excitation signal of 325Hz+425Hz of the active simulation module.
图3为主动仿真模块的调谐曲线。Fig. 3 is the tuning curve of the active simulation module.
图4为主动仿真模块响应幅度随声音强度的变化图像,与耳蜗生理实验响应图像。其中4(a)为仿真模块的响应图像与被动系统的对比,4(b)为生理实验结果。Figure 4 is the image of the response amplitude of the active simulation module changing with the sound intensity, and the response image of the cochlear physiological experiment. Among them, 4(a) is the comparison between the response image of the simulation module and the passive system, and 4(b) is the result of the physiological experiment.
图5为不同参数情况下,非线性动态压缩结果;其中图5(a)为固定参数B改变参数x0,图5(b)为固定参数x0改变参数B。Figure 5 shows the results of nonlinear dynamic compression under different parameters; where Figure 5(a) changes parameter x0 for fixed parameter B, and Figure 5(b) changes parameter B for fixed parameter x0.
图6为主动仿真模块中的两音抑制效应。Figure 6 shows the two-tone suppression effect in the active emulation module.
图7为主动仿真模块的响应频谱,与生理实验中的耳蜗响应频谱。其中图7(a)为动仿真模块的响应频谱,7(b)为生理实验中的耳蜗响应频谱。Figure 7 shows the response spectrum of the active simulation module and the cochlea response spectrum in the physiological experiment. Figure 7(a) is the response spectrum of the dynamic simulation module, and Figure 7(b) is the response spectrum of the cochlea in the physiological experiment.
具体实施方式Detailed ways
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
本发明提供的基于耳蜗听觉非线性动力学机理的电子耳蜗编码方法主要用于听力康复的电子耳蜗技术。心理声学的研究表明,听觉不是线性感知。例如,当两个不同频率的单频声音激励时,听者除了能听到这两个激励信号频率成分外,还能听到有它们的差频等线性组合的频率成分(称为“结合音Combination tone”)存在。当两个音同时出现时,会出现两个音均比单独激励时听起来小一些的两音抑制效应等,这些非线性效应对听觉感知起了非常重要的作用。实验研究表明,结合音、两音抑制、非线性动态范围压缩等非线性效应发生在耳蜗的信号处理过程中。The cochlear coding method based on the nonlinear dynamic mechanism of cochlear hearing provided by the present invention is mainly used in the cochlear technology for hearing rehabilitation. Psychoacoustic research has shown that hearing is not a linear perception. For example, when two single-frequency sounds of different frequencies are excited, the listener can not only hear the frequency components of the two excitation signals, but also the frequency components of linear combinations such as their difference frequency (called "combined tone"). Combination tone") exists. When two tones appear at the same time, there will be a two-tone suppression effect in which the two tones sound smaller than when they are separately excited. These nonlinear effects play a very important role in auditory perception. Experimental studies have shown that nonlinear effects such as combined tones, two-tone suppression, and nonlinear dynamic range compression occur during signal processing in the cochlea.
我们通过对耳蜗声信号处理的生理过程的研究,在耳蜗中毛细胞主动力产生机制认识的基础上,建立了耳蜗信号处理的主动动力学模型。这个模型在能够定性反映耳蜗非线性本质的同时,还能很好的反映其非线性动态压缩等效应,提供与生理实验观察一致的结果。本发明正是从这一基础研究的结果出发,建立了一套基于听觉声感知主动机制的电子耳蜗编码方法。通过融入耳蜗的非线性,解决现有电子耳蜗性能的不足,尤其是听觉动态范围不足、音调感知缺失等问题。Based on the study of the physiological process of cochlear acoustic signal processing and the understanding of the mechanism of active force generation in hair cells in the cochlea, we established an active dynamics model of cochlear signal processing. This model can not only reflect the nonlinear nature of the cochlea qualitatively, but also reflect its nonlinear dynamic compression and other effects well, and provide results consistent with physiological experimental observations. Based on the results of this basic research, the present invention establishes a cochlear coding method based on the active mechanism of auditory sound perception. By integrating the nonlinearity of the cochlea, it solves the shortcomings of the existing cochlear electronic performance, especially the insufficient auditory dynamic range and the lack of pitch perception.
本发明采用的具体技术方案如下:The concrete technical scheme that the present invention adopts is as follows:
(1)耳蜗非线性动力学模型的建立(1) Establishment of the nonlinear dynamic model of the cochlea
我们以耳蜗基底膜动力学特性为基础,以耳蜗局部为例进行受力分析。在声传导过程中,耳蜗基底膜局部将受到外界声刺激引发的外力Fs(t),基底膜自身弹力FT=-kx,淋巴液以及自身产生的阻力以及受外毛细胞电致伸缩,以及纤毛运动调控的非线性自适力Fa,其简化表达形式如下:根据牛顿力学定律建立的耳蜗非线性动力学模型如下:其中x为基底膜偏离平衡位置位移,γ为阻尼系数,γα为自适力系数,B为外毛细胞电致伸缩系数,x0为外毛细胞原长,ωi为耳蜗该部位的固有圆频率,输入信号为S(t)。求解上述非线性方程即可得到声信号S(t)激励下的耳蜗基底膜实时响应输出xi(t)。Based on the dynamic characteristics of the cochlear basilar membrane, we take the local cochlea as an example to analyze the force. During the process of sound conduction, the cochlear basilar membrane will be subjected to external force F s (t) caused by external sound stimulation, the basilar membrane's own elasticity F T =-kx, lymph fluid and its own resistance And the nonlinear self-adaptive force F a regulated by the electrostriction of outer hair cells and the movement of cilia, its simplified expression is as follows: The nonlinear dynamic model of the cochlea established according to Newton's laws of mechanics is as follows: Where x is the displacement of the basement membrane from the equilibrium position, γ is the damping coefficient, γ α is the adaptive force coefficient, B is the electrostrictive coefficient of the outer hair cell, x0 is the original length of the outer hair cell, and ω i is the intrinsic circle of the cochlea frequency, the input signal is S(t). The real-time response output x i (t) of the cochlear basilar membrane excited by the acoustic signal S(t) can be obtained by solving the above nonlinear equation.
(2)采用耳蜗非线性动力学模型的电子耳蜗编码方法(2) Cochlear coding method using the nonlinear dynamic model of the cochlea
如图1所示,新的电子耳蜗编码方法需要根据耳蜗非线性动力学模型构建非线性耳蜗阵列,以模拟耳蜗对声音的处理机制。非线性耳蜗阵列为一组由n个根据(1)中方程设计的,拥有不同固有频率的主动仿真模块构成的非线性仿真阵列。输入声信号为S(t),数值求解上述方程即可得到处理过后不同通道的实时响应输出xi(t)。之后经整流、低通滤波等检测出其振幅包络,最后将声音的频率信息,振幅信息转换为脉冲信号传递给不同部位的电极。需要注意的是,仿真模块设计时可以通过调节参数γα改变仿真模块调谐曲线的尖锐程度,γα值越大,调谐曲线越尖锐。一般情况下应使自适力系数γα设定在:0<γα≤γ的范围,当γα=0时,自适力为零,系统变为被动系统;当γα=γ时,系统最终将自持振荡。同时,设计时可以通过调节参数B或x0,改变仿真模块的动态范围,和压缩范围,如图5所示。耳蜗非线性动力学模型能够很好的模拟出耳蜗良好的频率选择特性,动态压缩特性,以及多音畸变特性,故而替代传统电子耳蜗声信号处理方案中的滤波器组与压缩模块对声音信号进行处理,得到大的动态响应范围,以及更好的音调信息传递结果。As shown in Figure 1, the new cochlear coding method needs to construct a nonlinear cochlear array according to the cochlear nonlinear dynamic model to simulate the cochlea's sound processing mechanism. The nonlinear cochlear array is a group of n designed according to the equation in (1), with different natural frequencies A nonlinear simulation array composed of active simulation modules. The input acoustic signal is S(t), and the real-time response output x i (t) of different channels after processing can be obtained by numerically solving the above equation. Afterwards, the amplitude envelope is detected by rectification and low-pass filtering, and finally the frequency information and amplitude information of the sound are converted into pulse signals and transmitted to electrodes in different parts. It should be noted that the sharpness of the tuning curve of the simulation module can be changed by adjusting the parameter γ α during the design of the simulation module. The larger the value of γ α , the sharper the tuning curve. In general, the adaptive force coefficient γ α should be set in the range of 0<γ α ≤γ, when γ α =0, the adaptive force is zero, and the system becomes a passive system; when γ α =γ, The system will eventually oscillate on its own. At the same time, the dynamic range and compression range of the simulation module can be changed by adjusting the parameter B or x0 during design, as shown in Figure 5. The cochlear nonlinear dynamic model can well simulate the cochlea's good frequency selection characteristics, dynamic compression characteristics, and multi-tone distortion characteristics, so it replaces the filter bank and compression module in the traditional electronic cochlear sound signal processing scheme to process the sound signal. Processing, a large dynamic response range, and better tonal information transfer results.
下面结合附图和具体实例对本发明作进一步详细说明。The present invention will be described in further detail below in conjunction with the accompanying drawings and specific examples.
图1为运用耳蜗非线性动力学模型实现新型电子耳蜗编码的技术框图。具体策略如下:根据频率高低需设计n个频带通道,每个频带通道中包含不同固有频率的主动仿真模块,包络探测模块,脉冲调制模块以及电极。通过麦克风记录下来的声信号经过预处理后得到S(t),通过不同的仿真模块处理输出为xi(t),之后xi(t)之后经整流、低通滤波等检测出其振幅包络,最后将声音的频率信息,振幅信息转换为脉冲信号传递给电极。Fig. 1 is a technical block diagram of implementing the new cochlear coding by using the nonlinear dynamic model of the cochlea. The specific strategy is as follows: According to the frequency, n frequency band channels need to be designed, and each frequency band channel contains active simulation modules with different natural frequencies, envelope detection modules, pulse modulation modules and electrodes. The acoustic signal recorded by the microphone is preprocessed to obtain S(t), and the output is xi (t) through different simulation modules, and then xi (t) is rectified and low-pass filtered to detect its amplitude package Finally, the frequency information and amplitude information of the sound are converted into pulse signals and transmitted to the electrodes.
图2为主动仿真模块在325Hz+425Hz的激励信号下,系统的振动响应波形图;从图的波形的谱分析得出其频率成份如图7,展示了与耳蜗生理实验观察结果类似的结合音。Figure 2 is the vibration response waveform diagram of the system under the excitation signal of 325Hz+425Hz of the active simulation module; from the spectrum analysis of the waveform in the figure, its frequency components are shown in Figure 7, which shows the combined sound similar to the cochlear physiological experiment observation results .
图3为仿真模块调频曲线展示。如图所示,仿真模块处理结果与实际耳蜗响应结果相似,出现了非线性调谐,具有很好的频率选择特性。因此可以代替以往方案中的滤波器组来提取不同频段的声音信息。Figure 3 shows the frequency modulation curve of the simulation module. As shown in the figure, the processing results of the simulation module are similar to the actual cochlear response results, with nonlinear tuning and good frequency selection characteristics. Therefore, it can replace the filter bank in the previous scheme to extract the sound information of different frequency bands.
图4(a)为主动系统与被动系统响应随声音强度变化图,虚线为被动系统响应,实线为主动系统响应。从图中可以看出被动系统的响应是线性的,而主动系统的响应出现了明显的非线性特性。由于被动系统响应的线性,以往的方案中必须使用额外的压缩模块对声音进行处理,以获得更大的动态范围。传统方案中使用的压缩函数为指数或者对数函数。然而,生理实验结果显示(图4(b)),听觉系统的响应特性是一种非线性动态压缩特性,即:听觉系统不仅能够实现大信号的压缩,也能实现小信号的放大,这是传统的指数或者对数压缩函数所不能实现的。而主动仿真模块却能很好的模拟出这一听觉特性(图4(a)实线),同时其动态范围与正常耳蜗动态范围接近。因此主动仿真模块能够代替传统方案中的压缩模块,使听觉的动态范围得到更好的提高。Figure 4(a) is a graph showing the response of the active system and the passive system as a function of the sound intensity. The dotted line is the response of the passive system, and the solid line is the response of the active system. It can be seen from the figure that the response of the passive system is linear, while the response of the active system has obvious nonlinear characteristics. Due to the linear nature of the passive system response, previous solutions had to use additional compression modules to process the sound in order to obtain greater dynamic range. The compression functions used in traditional schemes are exponential or logarithmic functions. However, the results of physiological experiments (Fig. 4(b)) show that the response characteristic of the auditory system is a nonlinear dynamic compression characteristic, that is, the auditory system can not only realize the compression of large signals, but also the amplification of small signals, which is What cannot be achieved with traditional exponential or logarithmic compression functions. However, the active simulation module can simulate this auditory characteristic very well (Fig. 4(a) solid line), and its dynamic range is close to that of the normal cochlea. Therefore, the active simulation module can replace the compression module in the traditional solution, so that the dynamic range of hearing can be better improved.
图5为不同参数情况下,主动仿真模块中非线性动态压缩效果展示。从图5(a)中可以看到,随着x0的逐渐增大,曲线逐渐向x轴的正方向移动,但是曲线压缩区域的范围并不会随着x0的变化而发生变化。x0的取值越小,曲线越早达到压缩状态。因此如果想要更早的达到压缩状态,我们可以通过减小x0的取值而实现。从图5(b)中可以看出,随着参数B的增大,曲线逐渐向x轴的负向移动,同时曲线压缩区域的范围变得越来越宽。因此若要得到更大的压缩区域可以调节参数B。Figure 5 shows the effect of nonlinear dynamic compression in the active simulation module under different parameters. It can be seen from Figure 5(a) that as x0 increases gradually, the curve gradually moves to the positive direction of the x-axis, but the range of the curve compression area does not change with the change of x0 . The smaller the value of x0 , the earlier the curve reaches the compressed state. Therefore, if we want to reach the compressed state earlier, we can achieve it by reducing the value of x 0 . It can be seen from Figure 5(b) that as the parameter B increases, the curve gradually moves to the negative direction of the x-axis, and the range of the curve compression region becomes wider and wider. Therefore, parameter B can be adjusted to obtain a larger compression area.
图6显示主动仿真模块能够模拟耳蜗中的两音抑制效应。Figure 6 shows that the active simulation module is able to simulate two-tone suppression in the cochlea.
图7为多音畸变效应展示,从图7(a)中能够看出,仿真模块的响应频谱中出现了被动系统中不可能出现的畸变产物,即:结合音。图7(b)为实际生理实验耳蜗基底膜上的响应结果。对比可知,主动仿真模块能够很好的模拟出耳蜗中的多音畸变效应,这也是本策略提升电子耳蜗音调解析能力的基础。Figure 7 shows the multi-tone distortion effect. From Figure 7(a), it can be seen that the response spectrum of the simulation module has distortion products that cannot appear in passive systems, namely: combined tones. Fig. 7(b) is the response result on the cochlear basilar membrane in the actual physiological experiment. It can be seen from the comparison that the active simulation module can well simulate the polyphonic distortion effect in the cochlea, which is also the basis for this strategy to improve the ability of the electronic cochlear tone analysis.
本发明提供的基于耳蜗非线性动力学机理的电子耳蜗编码方法,包括下述步骤:The electronic cochlear coding method based on the cochlear nonlinear dynamics mechanism provided by the present invention comprises the following steps:
(1)建立耳蜗非线性动力学模型 (1) Establish a nonlinear dynamic model of the cochlea
(2)根据耳蜗非线性动力学模型构建非线性耳蜗阵列,非线性耳蜗阵列为一组包含n个不同固有频率的主动仿真模块的仿真阵列,其作用是按照上述非线性动力学模型进行相应的数学运算。主动仿真模块的固有频率可按照如下模式设定:例如对于固有频率范围为a~a*eε(n-1)Hz(ε<1)(一般情况下取20Hz≤a≤200Hz)的非线性耳蜗阵列,其中第i个主动仿真模块的固有频率为fi=a*eε(i-1)Hz(i=1,2,3......n)。主动仿真模块设计时应使自适力系数γα设定在:0<γα≤γ的范围,在此范围内γα值越大,主动仿真模块对其固有频率附近的语音信号的放大作用越大;主动仿真模块设计时可以通过调节参数B或x0,改变仿真模块的动态范围和压缩范围。参数B的越大,主动仿真模块压缩区域的范围越宽。x0的取值越小,曲线越早达到压缩状态。(2) Construct a nonlinear cochlear array according to the nonlinear dynamic model of the cochlea. The nonlinear cochlear array is a group of simulation arrays containing n active simulation modules with different natural frequencies. computation. The natural frequency of the active simulation module can be set according to the following mode: For example , for the nonlinear A cochlear array, wherein the natural frequency of the ith active simulation module is f i =a*e ε(i-1) Hz (i=1,2,3...n). When designing the active simulation module, the self-adaptive force coefficient γ α should be set in the range of 0<γ α ≤ γ, the larger the value of γ α within this range, the greater the amplification effect of the active simulation module on the voice signal near its natural frequency The larger is; the dynamic range and compression range of the simulation module can be changed by adjusting the parameter B or x0 during the design of the active simulation module. The larger the parameter B is, the wider the range of the compression area of the active simulation module will be. The smaller the value of x0, the earlier the curve reaches the compressed state.
输入语音信号为S(t),求解上述方程即可得到处理过后每一个主动仿真模块的实时响应输出xi(t);The input voice signal is S(t), and the real-time response output x i (t) of each active simulation module after processing can be obtained by solving the above equation;
(3)将各主动仿真模块的输出信号xi(t)进行整流,低通滤波检测出其振幅包络。之后将声音的频率信息,振幅信息转换为脉冲信号传递给与之对应的电极。(3) The output signal x i (t) of each active simulation module is rectified, and its amplitude envelope is detected by low-pass filtering. Afterwards, the frequency information and amplitude information of the sound are converted into pulse signals and transmitted to the corresponding electrodes.
由于主动仿真模块能够实现对声音信号的动态压缩,即对小信号放大,大信号压缩,因此代替传统的指数或对数压缩函数对声音信号进行压缩,从而提升电子耳蜗动态范围。同时,主动仿真模块能够很好的模拟出耳蜗的多音畸变效应,因此经其处理后的信号能够展现出与音调相关的结合音,从而使电子耳蜗更好的传递音调信息。Since the active simulation module can realize the dynamic compression of the sound signal, that is, to amplify the small signal and compress the large signal, it replaces the traditional exponential or logarithmic compression function to compress the sound signal, thereby improving the dynamic range of the cochlea. At the same time, the active simulation module can well simulate the polyphonic distortion effect of the cochlea, so the processed signal can show the combined tone related to the tone, so that the electronic cochlear can better transmit the tone information.
本领域的技术人员容易理解,以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。It is easy for those skilled in the art to understand that the above descriptions are only preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present invention, All should be included within the protection scope of the present invention.
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711486488.3A CN108053829B (en) | 2017-12-29 | 2017-12-29 | An Electronic Cochlear Coding Method Based on Cochlear Auditory Nonlinear Dynamics |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711486488.3A CN108053829B (en) | 2017-12-29 | 2017-12-29 | An Electronic Cochlear Coding Method Based on Cochlear Auditory Nonlinear Dynamics |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108053829A true CN108053829A (en) | 2018-05-18 |
CN108053829B CN108053829B (en) | 2020-06-02 |
Family
ID=62129725
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711486488.3A Expired - Fee Related CN108053829B (en) | 2017-12-29 | 2017-12-29 | An Electronic Cochlear Coding Method Based on Cochlear Auditory Nonlinear Dynamics |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108053829B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110931034A (en) * | 2019-11-27 | 2020-03-27 | 深圳市悦尔声学有限公司 | Pickup noise reduction method for built-in earphone of microphone |
CN110957040A (en) * | 2019-08-15 | 2020-04-03 | 任鹏宇 | Construction method for vestibular neuron information conversion model |
CN111510837A (en) * | 2020-04-23 | 2020-08-07 | 武汉声尼听觉科技有限公司 | Hearing aid hearing recovery method, recovery system, storage medium and hearing aid |
CN113924786A (en) * | 2019-06-09 | 2022-01-11 | 根特大学 | Neural network model for cochlear mechanics and processing |
CN120148530A (en) * | 2025-05-15 | 2025-06-13 | 浙江大学 | Sound pulse coding and identifying method and system for biological hearing elicitation |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5388182A (en) * | 1993-02-16 | 1995-02-07 | Prometheus, Inc. | Nonlinear method and apparatus for coding and decoding acoustic signals with data compression and noise suppression using cochlear filters, wavelet analysis, and irregular sampling reconstruction |
CN101645267A (en) * | 2009-04-03 | 2010-02-10 | 中国科学院声学研究所 | Voice processing method applied in electronic ear |
CN103778920A (en) * | 2014-02-12 | 2014-05-07 | 北京工业大学 | Speech enhancing and frequency response compensation fusion method in digital hearing-aid |
CN105741849A (en) * | 2016-03-06 | 2016-07-06 | 北京工业大学 | Voice enhancement method for fusing phase estimation and human ear hearing characteristics in digital hearing aid |
CN106098077A (en) * | 2016-07-28 | 2016-11-09 | 浙江诺尔康神经电子科技股份有限公司 | Artificial cochlea's speech processing system of a kind of band noise reduction and method |
-
2017
- 2017-12-29 CN CN201711486488.3A patent/CN108053829B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5388182A (en) * | 1993-02-16 | 1995-02-07 | Prometheus, Inc. | Nonlinear method and apparatus for coding and decoding acoustic signals with data compression and noise suppression using cochlear filters, wavelet analysis, and irregular sampling reconstruction |
CN101645267A (en) * | 2009-04-03 | 2010-02-10 | 中国科学院声学研究所 | Voice processing method applied in electronic ear |
CN103778920A (en) * | 2014-02-12 | 2014-05-07 | 北京工业大学 | Speech enhancing and frequency response compensation fusion method in digital hearing-aid |
CN105741849A (en) * | 2016-03-06 | 2016-07-06 | 北京工业大学 | Voice enhancement method for fusing phase estimation and human ear hearing characteristics in digital hearing aid |
CN106098077A (en) * | 2016-07-28 | 2016-11-09 | 浙江诺尔康神经电子科技股份有限公司 | Artificial cochlea's speech processing system of a kind of band noise reduction and method |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113924786A (en) * | 2019-06-09 | 2022-01-11 | 根特大学 | Neural network model for cochlear mechanics and processing |
US11800301B2 (en) | 2019-06-09 | 2023-10-24 | Universiteit Gent | Neural network model for cochlear mechanics and processing |
CN113924786B (en) * | 2019-06-09 | 2024-03-29 | 根特大学 | Neural network models for cochlear mechanics and processing |
CN110957040A (en) * | 2019-08-15 | 2020-04-03 | 任鹏宇 | Construction method for vestibular neuron information conversion model |
CN110957040B (en) * | 2019-08-15 | 2023-05-02 | 任鹏宇 | Construction method for vestibular neuron information conversion model |
CN110931034A (en) * | 2019-11-27 | 2020-03-27 | 深圳市悦尔声学有限公司 | Pickup noise reduction method for built-in earphone of microphone |
CN110931034B (en) * | 2019-11-27 | 2022-05-24 | 深圳市悦尔声学有限公司 | Pickup noise reduction method for built-in earphone of microphone |
CN111510837A (en) * | 2020-04-23 | 2020-08-07 | 武汉声尼听觉科技有限公司 | Hearing aid hearing recovery method, recovery system, storage medium and hearing aid |
CN120148530A (en) * | 2025-05-15 | 2025-06-13 | 浙江大学 | Sound pulse coding and identifying method and system for biological hearing elicitation |
Also Published As
Publication number | Publication date |
---|---|
CN108053829B (en) | 2020-06-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108053829A (en) | A kind of cochlear implant coding method based on cochlea sense of hearing Nonlinear Dynamics | |
CN102958561B (en) | The timing of envelope particular stimulation | |
CN104661700B (en) | The reduction of the transient sound in hearing implant | |
CN103140260B (en) | Cochlear implant stimulation with low frequency channel privilege | |
JP2791031B2 (en) | Artificial hearing organ and signal processing device | |
CN109545233A (en) | System and method for handling audio signal to be reset on audio devices | |
Yao et al. | The application of bionic wavelet transform to speech signal processing in cochlear implants using neural network simulations | |
CN102579159B (en) | Electrical cochlea speech processor and processing method with signal compression in wide dynamic range | |
CN105050014A (en) | Hearing-aid device and method based on smart phone | |
US20220068289A1 (en) | Speech Processing Method and System in A Cochlear Implant | |
Hansen et al. | CCi-MOBILE: Design and evaluation of a cochlear implant and hearing aid research platform for speech scientists and engineers | |
CN105555354A (en) | Auditory prosthesis stimulation rate as a multiple of intrinsic oscillation | |
JP4295765B2 (en) | Voice conversion method for cochlear implant | |
CN104307100B (en) | A kind of method and system improving artificial cochlea's pitch perception | |
CN100502819C (en) | Artificial cochlea manufacture method suitable for Chinese voice coding strategy | |
CN101848410A (en) | N band FM demodulation to aid cochlear hearing impaired persons | |
Huang et al. | Combination and comparison of sound coding strategies using cochlear implant simulation with mandarin speech | |
CN103892939A (en) | Artificial cochlea speech processor and method for improving Chinese tone recognition rate | |
Vicen-Bueno et al. | A hearing aid simulator to test adaptive signal processing algorithms | |
US9020172B2 (en) | Methods, systems, and devices for detecting feedback | |
JP5046233B2 (en) | Speech enhancement processor | |
CN108198546A (en) | A kind of speech signal pre-processing method based on cochlea Nonlinear Dynamics | |
Drgas et al. | Dynamic processing neural network architecture for hearing loss compensation | |
DE102013207080A1 (en) | Binaural microphone adaptation using your own voice | |
CN111510837B (en) | Hearing recovery method, recovery system, storage medium and hearing aid |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20200602 Termination date: 20201229 |
|
CF01 | Termination of patent right due to non-payment of annual fee |