CN114040308B - Skin hearing aid device based on emotion gain - Google Patents
Skin hearing aid device based on emotion gain Download PDFInfo
- Publication number
- CN114040308B CN114040308B CN202111358689.1A CN202111358689A CN114040308B CN 114040308 B CN114040308 B CN 114040308B CN 202111358689 A CN202111358689 A CN 202111358689A CN 114040308 B CN114040308 B CN 114040308B
- Authority
- CN
- China
- Prior art keywords
- module
- emotional
- sound
- hearing aid
- skin
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
- H04R25/50—Customised settings for obtaining desired overall acoustical characteristics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61F—FILTERS IMPLANTABLE INTO BLOOD VESSELS; PROSTHESES; DEVICES PROVIDING PATENCY TO, OR PREVENTING COLLAPSING OF, TUBULAR STRUCTURES OF THE BODY, e.g. STENTS; ORTHOPAEDIC, NURSING OR CONTRACEPTIVE DEVICES; FOMENTATION; TREATMENT OR PROTECTION OF EYES OR EARS; BANDAGES, DRESSINGS OR ABSORBENT PADS; FIRST-AID KITS
- A61F11/00—Methods or devices for treatment of the ears or hearing sense; Non-electric hearing aids; Methods or devices for enabling ear patients to achieve auditory perception through physiological senses other than hearing sense; Protective devices for the ears, carried on the body or in the hand
- A61F11/04—Methods or devices for enabling ear patients to achieve auditory perception through physiological senses other than hearing sense, e.g. through the touch sense
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/63—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Probability & Statistics with Applications (AREA)
- Neurosurgery (AREA)
- Quality & Reliability (AREA)
- Child & Adolescent Psychology (AREA)
- Hospice & Palliative Care (AREA)
- Psychiatry (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Neurology (AREA)
- Physiology (AREA)
- Artificial Intelligence (AREA)
- Biophysics (AREA)
- Psychology (AREA)
- Heart & Thoracic Surgery (AREA)
- Vascular Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
一种基于情感增益的皮肤听声助听装置,属于助听技术领域,包括麦克风、助听主机和电极贴,其中,麦克风可独立设置,通过导线与助听主机连接,也可设置在助听主机上,电极贴采用复合平面电极,包括一张或并联设置的两张,张贴在人耳后的皮肤表层,通过导线与助听主机连接,助听主机内置电源。本发明结构新颖,构思巧妙,通过计算机识别的情感特征来增益皮肤听声效果,使其助听效果更好,对声音频率的接收能力强,同时有效的降低听障人士学习的难度和门槛,效果显著。
A skin-acoustic hearing aid device based on emotional gain, belonging to the field of hearing aid technology, including a microphone, a hearing aid host and electrode stickers, wherein the microphone can be set independently and connected to the hearing aid host through wires, or can be set on the hearing aid On the host, the electrode stickers use composite planar electrodes, including one or two in parallel, pasted on the skin surface behind the human ear, connected to the hearing aid host through wires, and the hearing aid host has a built-in power supply. The present invention has a novel structure and ingenious conception, and uses the emotional characteristics recognized by the computer to increase the skin hearing effect, so that the hearing aid effect is better, the ability to receive sound frequencies is strong, and at the same time, it effectively reduces the learning difficulty and threshold for hearing-impaired people. The effect is remarkable.
Description
技术领域technical field
本发明涉及助听技术领域,尤其涉及皮肤听声,具体是一种基于情感增益的皮肤听声助听技术及应用。The invention relates to the technical field of hearing aids, in particular to skin hearing aids, in particular to a skin hearing aid technology and application based on emotional gain.
背景技术Background technique
据世界卫生组织2018年报告显示,全球有3.6亿听力障碍人士,占全球总人口的5.3%。在我国,目前,听力障碍人士高达2780万之多,占全国人口的1.679%,而且,每年因各种原因而新增的新生残障人士约有3万人。According to the 2018 report of the World Health Organization, there are 360 million hearing-impaired people in the world, accounting for 5.3% of the total global population. In my country, at present, there are as many as 27.8 million hearing-impaired people, accounting for 1.679% of the national population, and there are about 30,000 newly born disabled people due to various reasons every year.
听力障碍,是指听觉系统中的传音、感音以及对声音的综合分析的各级神经中枢发生器质性或功能性异常,而导致听力出现不同程度的减退,惯称为耳聋。其实只有听力严重减退才称之为聋,其表现为患者双耳均不能听到任何言语,而听力损失未达到此严重程度者被称之为听力减退。由于人体在生长发育完成后,听力开始不可逆的逐年损失,在这个过程中,可能因为一些外界因素的影响而加快听力损失的速度,如药物、噪音环境、外伤等,大部分成年人在工作生活过程中无法避免的因为较长时间的接听电话、较长时间的佩戴耳机/耳塞、较长时间处于嘈杂的环境中,而发生较快速度的听力损失,而目前,针对听障尚无有效的恢复和改善方法,这使得我国的轻度及中度的听障人士的人群以极快的速度增长,且远远超出统计数量。Hearing impairment refers to the organic or functional abnormalities of the nerve centers at all levels of sound transmission, sensory sound and comprehensive analysis of sound in the auditory system, resulting in varying degrees of hearing loss, commonly known as deafness. In fact, only severe hearing loss is called deafness, which is manifested by the patient's inability to hear any speech in both ears, and those whose hearing loss has not reached this level of severity are called hearing loss. After the human body grows and develops, the hearing begins to lose irreversibly year by year. In this process, the speed of hearing loss may be accelerated due to the influence of some external factors, such as drugs, noise environment, trauma, etc. Most adults have Unavoidable in the process is the rapid hearing loss due to answering calls for a long time, wearing earphones/earplugs for a long time, and being in a noisy environment for a long time. At present, there is no effective treatment for hearing impairment. Restoration and improvement methods, which make the population of mild and moderate hearing-impaired people in our country grow at an extremely fast rate, and far exceed the statistical quantity.
为了改善这种情况,一方面积极的培养人们良好的听力使用习惯,延缓听力损失的速度,另一方面通过助听装置来补偿重度及以上听障人士的听力损失。对于后一种,现有的助听装置有助听器和人工耳蜗两种办法,助听器对重度及以下听障人士的听力损失能够起到一定的补偿效果,价格相对便宜,但现有的这类助听器往往存在难以消除的噪音,在补偿使用者听力损失的同时也会进一步的造成使用者的听力损失,同时,耳聋患者使用完全无效;人工耳蜗是通过手术实现的,能够将患者听力提高到完全正常的状态,但价格昂贵,手术成功率不高,手术会彻底破坏替换掉患者原有的听力器官,不管成功与否,患者的听力器官都无法修复,即使手术成功,以后使用损坏同样会造成患者听力的无法修复。因此,研究更优秀的助听方案势在必行,而皮肤听声技术就是其中一种表现优秀的思路。In order to improve this situation, on the one hand, actively cultivate people's good listening habits and slow down the speed of hearing loss, and on the other hand, use hearing aids to compensate for the hearing loss of people with severe or above hearing impairment. For the latter, the existing hearing aid devices include hearing aids and cochlear implants. Hearing aids can compensate for the hearing loss of people with severe hearing impairment and below, and are relatively cheap. There are often noises that are difficult to eliminate, which will further cause the user's hearing loss while compensating for the user's hearing loss. At the same time, the use of deaf patients is completely ineffective; cochlear implants are achieved through surgery, which can improve the patient's hearing to completely normal state, but the price is expensive, and the success rate of the operation is not high. The operation will completely destroy and replace the patient's original hearing organ. No matter whether it is successful or not, the patient's hearing organ cannot be repaired. Hearing is irreparable. Therefore, it is imperative to study better hearing aid solutions, and skin hearing technology is one of the excellent ideas.
通过研究表明,人的皮肤能够敏感的感受到各种刺激,如压力、振动、电等,神经系统会将这些来自皮肤的刺激送给大脑进行识别和处理。由于人体耳部的听觉本身就是经过一系列的震动放大—震动识别并转化为电信号—神经系统传递给大脑—大脑识别,因此通过皮肤也可以完全“听到”外界的声音,这对于部分听觉器官破坏而完全丧失听力的听障人士而言,是个重大的利好消息,在现有专利《变压式皮肤听声器》(申请号200410026265.5)中已经公开了一种利用皮肤听声技术而制成的一种助听装置,全聋人也能通过该装置而获得对于语音及常遇到的声音的听觉。Studies have shown that human skin can sensitively feel various stimuli, such as pressure, vibration, electricity, etc., and the nervous system will send these stimuli from the skin to the brain for identification and processing. Since the hearing of the human ear itself is amplified through a series of vibrations—the vibrations are recognized and converted into electrical signals—the nervous system transmits them to the brain—the brain recognizes them, so the external sound can also be completely “heard” through the skin, which is important for part of the sense of hearing. It is great news for the hearing-impaired persons whose organs are destroyed and completely lose their hearing. In the existing patent "Variable Pressure Skin Hearing Device" (Application No. 200410026265.5), a device made by using skin listening technology has been disclosed. It is a kind of hearing aid device, through which the totally deaf can also obtain the hearing of speech and frequently encountered sounds.
然而,我们生存的环境中充斥着各种各样的声音,除了人们交谈的声音外,还有各种背景噪音,拥有听力的我们通过长时间的训练和学习,大脑学会了分析各种声音,将各种声音进行归类,哪些是人声,哪些是各类环境噪音,然后大脑将注意力集中到想要听的一类声音上,并忽略其它类别的声音,这使得我们耳中的声音世界充满了层次感。而对于一种先天性完全丧失听力的人而言,这是一个完全陌生的领域,直接面对了一涌而来的各类声音,在缺失了“幼年—少年”段的高效学习能力之后,大脑更是需要很长时间去训练和学习,因此对于成年人而言,该专利提出的技术方案并不友善,使用的难度和门槛都很高。However, the environment in which we live is full of various sounds. In addition to the sound of people talking, there are also various background noises. We who have hearing have learned to analyze various sounds through long-term training and learning. Classify various sounds, which are human voices, and which are various environmental noises, and then the brain will focus on the type of sound it wants to hear, and ignore other types of sounds, which makes the sound in our ears The world is full of layers. For a person with a congenital complete loss of hearing, this is a completely unfamiliar field, directly facing all kinds of voices, after losing the efficient learning ability of the "childhood-juvenile" stage, The brain needs a long time to train and learn, so for adults, the technical solution proposed by this patent is not friendly, and the difficulty and threshold of use are very high.
人机情感交互的基础源于计算机应用,通过算法和大量的学习来模拟出计算机“人工心理”和“人工情感”的能力,通过分析各种情感的模型特征来实现情感建模,进一步进行计算机的情感识别,来完成进一步的情感交互,经过国内外专家的研究表明,这一切都是切实可行的,在这个基础上,中科院更进一步的将上述理论应用到情感语音的理解和合成,这将人及情感交互的进程推进了一大步,一方面能够使得计算机通过识别语音来准确判定人的情绪,另一方面也能够将文字转化输出为带有情绪的语音或其它信号。The basis of human-computer emotional interaction comes from computer applications, through algorithms and a lot of learning to simulate the computer's "artificial psychology" and "artificial emotion" capabilities, by analyzing the model characteristics of various emotions to achieve emotional modeling, and further computer Based on the research of domestic and foreign experts, all of these are feasible. On this basis, the Chinese Academy of Sciences will further apply the above theory to the understanding and synthesis of emotional speech, which will The process of human-emotional interaction has taken a big step forward. On the one hand, it can enable computers to accurately determine human emotions by recognizing speech, and on the other hand, it can also convert text into speech or other signals with emotions.
将上述思路应用到针对听障人士的助听装置上,能够极大的提高听障人士的听力效果,对其听力的补偿效果非常明显。Applying the above ideas to hearing aids for the hearing-impaired can greatly improve the hearing effect of the hearing-impaired, and the effect of compensating their hearing is very obvious.
发明内容Contents of the invention
为了解决现有技术的不足,本发明提出一种基于情感增益的皮肤听声助听装置,来为听障人士提供听力补偿。In order to solve the deficiencies of the prior art, the present invention proposes a skin-acoustic hearing aid device based on emotional gain to provide hearing compensation for hearing-impaired persons.
本发明要解决的技术问题是通过以下技术方案实现的:The technical problem to be solved in the present invention is achieved through the following technical solutions:
一种基于情感增益的皮肤听声助听装置,包括麦克风、电极贴,以及连接麦克风和电极贴的助听主机,在本发明中,所述助听主机包括声音预处理模块、情感分析模块、情感附加模块、信息处理模块和声音信息输出模块,其中声音预处理模块与麦克风电连接,包括数字化模块,数字化模块将麦克风输出的模拟信号转化为数字信号,情感分析模块分别与声音预处理模块和情感附加模块相连接,信息处理模块分别连接声音预处理模块、情感附加模块和声音信息输出模块,信息处理模块对来自声音预处理模块的数字信号进行处理,使其方便残障人士学习和训练,并送给声音信息输出模块,声音信息输出模块分别连接信息处理模块和电极贴,声音信息输出模块将数字信号转化为模拟信号后,通过电极贴传递到人的皮肤。A skin-acoustic hearing aid device based on emotional gain, including a microphone, electrode stickers, and a hearing aid host connected to the microphone and the electrode stickers. In the present invention, the hearing aid host includes a sound preprocessing module, an emotion analysis module, Emotional additional module, information processing module and sound information output module, wherein the sound preprocessing module is electrically connected with the microphone, including a digital module, the digital module converts the analog signal output by the microphone into a digital signal, and the emotion analysis module is respectively connected with the sound preprocessing module and the The emotional additional module is connected, and the information processing module is respectively connected to the sound preprocessing module, the emotional additional module and the sound information output module, and the information processing module processes the digital signal from the sound preprocessing module to make it convenient for disabled people to learn and train, and Send it to the sound information output module, the sound information output module is respectively connected to the information processing module and the electrode patch, and the sound information output module converts the digital signal into an analog signal, and transmits it to the human skin through the electrode patch.
在本发明中,所述信息处理模块将经过预处理的数字信号分离,将人声和背景声音分别处理,人声经过情感附加模块的调整后与背景声音合并,送到声音信息输出模块。其中,信息处理模块与情感附加模块相连接,将人声的数字信号编译成为容易识别的文字的数字信号后,经过情感附加模块调整音节和语速,输出为能够表达为模拟语音的数字信号,经过重读,清理出人声语言中的不和谐成分——比如方言等——转化为更方便接受和识别的模拟人声,一般为普通话,方便理解,降低学习的难度和门槛。In the present invention, the information processing module separates the preprocessed digital signal, processes the human voice and the background sound separately, and the human voice is combined with the background sound after being adjusted by the emotional additional module, and sent to the sound information output module. Among them, the information processing module is connected with the emotional additional module, and after compiling the digital signal of the human voice into a digital signal of an easily recognizable text, the syllable and speech rate are adjusted by the emotional additional module, and the output is a digital signal that can be expressed as an analog voice. After re-reading, the dissonant components in the human voice language are cleaned up—such as dialects—and transformed into simulated human voices that are easier to accept and recognize, generally in Mandarin, which is easy to understand and reduces the difficulty and threshold of learning.
进一步的,所述情感分析模块连接有情感特征数据库,提取数字化模块得到的数字信号中的情感特征,与情感特征数据相比对,确定数字信号表达出的情感特征所代表的的语言中蕴含的情感,发送到情感附加模块,通过情感附加模块将相对应的情感特征叠加到信息处理模块处理的数字信号中。Further, the emotional analysis module is connected with an emotional feature database, extracts the emotional features in the digital signal obtained by the digitization module, compares them with the emotional feature data, and determines the emotional features contained in the language represented by the emotional features expressed by the digital signal. The emotion is sent to the emotion additional module, and the corresponding emotion feature is superimposed into the digital signal processed by the information processing module through the emotion additional module.
在本发明中,所述声音信息输出模块包括滤波器和升压器,其中,所述信息处理模块、滤波器、升压器和电极贴依次连接,所述滤波器和升压器有若干组并联设置。In the present invention, the sound information output module includes a filter and a booster, wherein the information processing module, the filter, the booster and the electrode stickers are connected in sequence, and there are several sets of filters and boosters Parallel setting.
进一步的,所述滤波器为多通道带通滤波器,各组滤波器分别选择不同的中心频率,选择范围为15Hz~15kHz。Further, the filter is a multi-channel band-pass filter, each set of filters selects a different center frequency, and the selection range is 15Hz~15kHz.
进一步的,所述滤波器和升压器优选48组,通过不同的频率,采用48通道输出到电极贴上。Further, there are preferably 48 groups of filters and boosters, and 48 channels of different frequencies are used to output to the electrode stickers.
在本发明中,所述电极贴与助听主机通过导线连接,电极贴张贴在人神经分布密集的皮肤上。In the present invention, the electrode stickers are connected to the hearing aid host through wires, and the electrode stickers are pasted on the skin with dense distribution of human nerves.
进一步的,所述电极贴张贴的位置在人耳后的皮肤表层。Further, the position where the electrode paste is pasted is on the surface layer of the skin behind the human ear.
在本发明中,所述声音预处理模块包括去噪模块和音节校验模块。In the present invention, the sound preprocessing module includes a denoising module and a syllable checking module.
在本发明中,所述助听主机内置有电源。In the present invention, the hearing aid host has a built-in power supply.
在本发明中,所述电极贴采用复合平面电极。In the present invention, the electrode paste adopts composite planar electrodes.
在本发明中,所述情感特征的提取和情感特征数据库的建立基于汉语语音,我国包括北京航空航天大学在内的部分专家学者创建了结合情感特征的汉语语音情感提取和建模方法,以及由此建立起来的数据库,该汉语语音情感特征的提取方法为:指定情感特征数据库规范,包括发音人规范、录音脚本设计规范、音频文件命名规范等;收集情感特征数据:情感特征愉悦度、激活度、优势度(PAD)评测,即由区别与说话者的至少十名评测者对情感特征数据进行PAD主观听取评测试验。该汉语语音情感特征建模方法为:首先根据Fisher比率选择语音特征训练性别识别支持向量机模型(SVM);其次为男声和女声分别建立情感特征隐马尔科夫模型(HMM),并根据SVM性别识别结果选择相应的HMM进行情感特征分类。In the present invention, the extraction of described emotional features and the establishment of emotional feature database are based on Chinese speech, and some experts and scholars including Beijing University of Aeronautics and Astronautics in my country have created Chinese speech emotion extraction and modeling methods combined with emotional features, and by Based on the established database, the extraction method of the emotional features of Chinese speech is: specifying the emotional feature database specification, including speaker specification, recording script design specification, audio file naming specification, etc.; collecting emotional feature data: emotional feature pleasure, activation degree , Predominance degree (PAD) evaluation, that is, the PAD subjective listening evaluation test is conducted on the emotional feature data by at least ten evaluators who are different from the speaker. The modeling method of Chinese speech emotional features is as follows: firstly, according to the Fisher ratio, select the speech features to train the gender recognition support vector machine model (SVM); secondly, establish emotional feature hidden Markov models (HMM) for male voice and female voice respectively, and use the SVM gender The recognition result selects the corresponding HMM for emotional feature classification.
与现有技术相比,本发明具有以下优点:Compared with the prior art, the present invention has the following advantages:
1)以皮肤听声的原理为基础,保证听障人士能够听到声音,通过情感增益来调整,使得听障人士听到的声音更加真实;1) Based on the principle of skin hearing, it ensures that the hearing-impaired can hear the sound, and adjusts it through emotional gain to make the sound heard by the hearing-impaired more real;
2)整个处理过程将麦克风输出的模拟信号转化为数字信号,处理完成后转化为模拟信号由电极贴输出,使得信号更加方便进行处理,有效的实现了情感增益对声音信号的编辑和强化;2) During the whole processing process, the analog signal output by the microphone is converted into a digital signal. After the processing is completed, the analog signal is converted into an analog signal and output by the electrode sticker, which makes the signal more convenient to process, and effectively realizes the editing and strengthening of the emotional gain to the sound signal;
3)在声音信号的处理过程中,分离成为人声和背景声音,并对人声进行进一步的增益处理,方便听障人士对人声的识别,又不影响其对环境声音的感知;3) During the processing of the sound signal, it is separated into human voice and background sound, and further gain processing is performed on the human voice, which is convenient for the hearing-impaired to recognize human voice without affecting their perception of environmental sound;
4)在声音信号的处理过程中,对人声进行了识别和转化两道程序,重读能够将带有方言等不和谐成分的人声语言转化为更方便接受和识别的普通话,方便理解,降低学习的难度和门槛;4) In the process of sound signal processing, two procedures of recognition and conversion are carried out for the human voice. Rereading can convert the human voice language with discordant components such as dialects into Mandarin which is easier to accept and recognize, which is convenient for understanding and reduces the The difficulty and threshold of learning;
5)声音信息输出模块通过48通道滤波,全面覆盖各种频谱的声音信号,使得听障人士感知到的声音频率范围与正常人耳感知到的一致,避免其受到额外的歧视或排斥。5) The sound information output module fully covers sound signals of various spectrums through 48-channel filtering, so that the sound frequency range perceived by the hearing-impaired is consistent with that perceived by normal ears, avoiding additional discrimination or rejection.
因此,本发明结构新颖,构思巧妙,通过计算机识别的情感特征来增益皮肤听声效果,使其助听效果更好,对声音频率的接收能力强,同时有效的降低听障人士学习的难度和门槛,效果显著。Therefore, the present invention has a novel structure and ingenious conception. The emotional characteristics recognized by the computer are used to enhance the skin hearing effect, so that the hearing aid effect is better, the ability to receive sound frequencies is strong, and at the same time, it effectively reduces the difficulty and difficulty of learning for the hearing impaired. Threshold, the effect is remarkable.
附图说明Description of drawings
图1为本发明的整体结构示意图;Fig. 1 is the overall structure schematic diagram of the present invention;
图2为图1的声音预处理模块示意图;Fig. 2 is the schematic diagram of the sound preprocessing module of Fig. 1;
图3为图1的信息处理模块原理图;Fig. 3 is a schematic diagram of the information processing module of Fig. 1;
图4 为图1的声音信息输出模块示意图。FIG. 4 is a schematic diagram of the sound information output module in FIG. 1 .
具体实施方式Detailed ways
以下结合说明书附图和具体优选的实施例对本发明作进一步描述,但并不因此而限制本发明的保护范围。The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments, but the protection scope of the present invention is not limited thereby.
一种基于情感增益的皮肤听声助听装置,如图1-4所示,包括麦克风、助听主机和电极贴,其中,麦克风可独立设置,通过导线与助听主机连接,也可设置在助听主机上,电极贴采用复合平面电极,包括一张或并联设置的两张,张贴在人耳后的皮肤表层,通过导线与助听主机连接,助听主机内置电源。A skin hearing aid device based on emotional gain, as shown in Figure 1-4, includes a microphone, a hearing aid host and electrode stickers, wherein the microphone can be set independently and connected to the hearing aid host through wires, or it can be set on the On the hearing-aid host, the electrode stickers use composite planar electrodes, including one or two in parallel, pasted on the skin surface behind the human ear, connected to the hearing-aid host through wires, and the hearing-aid host has a built-in power supply.
助听主机包括声音预处理模块、情感分析模块、情感附加模块、信息处理模块和声音信息输出模块,其中声音预处理模块与麦克风电连接,包括数字化模块、去噪模块和声音校正模块,数字化模块将麦克风输出的模拟信号转化为数字信号,去噪模块清除部分不谐杂波避免影响后续处理,声音校正模块调整波形;情感分析模块分别与声音预处理模块和情感附加模块相连接,情感分析模块连接有情感特征数据库;信息处理模块分别连接声音预处理模块、情感附加模块和声音信息输出模块,信息处理模块对来自声音预处理模块的数字信号进行处理,使其方便残障人士学习和训练,并送给声音信息输出模块,声音信息输出模块分别连接信息处理模块和电极贴,声音信息输出模块将数字信号转化为模拟信号后,通过电极贴传递到人的皮肤;声音信息输出模块包括滤波器和升压器,信息处理模块、滤波器、升压器和电极贴依次连接,所述滤波器和升压器有若干组并联设置,滤波器为多通道带通滤波器,各组滤波器分别选择不同的中心频率,选择范围为15Hz~15kHz,滤波器和升压器优选48组,通过不同的频率,采用48通道输出到电极贴上。The hearing aid host includes a sound preprocessing module, an emotional analysis module, an emotional additional module, an information processing module and a sound information output module, wherein the sound preprocessing module is electrically connected to the microphone, including a digital module, a noise removal module and a sound correction module, and the digital module The analog signal output by the microphone is converted into a digital signal, the denoising module removes some inharmonic clutter to avoid affecting subsequent processing, the sound correction module adjusts the waveform; the emotion analysis module is connected to the sound preprocessing module and the emotion additional module, and the emotion analysis module It is connected to an emotional feature database; the information processing module is respectively connected to the sound preprocessing module, the emotional additional module and the sound information output module, and the information processing module processes the digital signal from the sound preprocessing module to make it convenient for disabled people to learn and train, and Send to the sound information output module, the sound information output module is respectively connected to the information processing module and the electrode sticker, the sound information output module converts the digital signal into an analog signal, and transmits it to the human skin through the electrode sticker; the sound information output module includes a filter and The booster, the information processing module, the filter, the booster and the electrode stickers are connected in sequence, the filter and the booster have several sets of parallel settings, the filter is a multi-channel bandpass filter, and each set of filters is selected separately Different center frequencies, the selection range is 15Hz~15kHz, 48 groups of filters and boosters are preferred, through different frequencies, 48 channels are used to output to the electrode stickers.
在本发明中,信息处理模块对来自声音预处理模块的数字信号进行处理包括依次进行的分离、识别、附加、转化、合并几个环节,其中,分离环节是将声音预处理模块的输入声音数字信号分离,将人声与背景声音分开,识别环节是将人声数字信号识别转化为文字数字信号,附加环节是将文字数字信号接受情感附加模块发送的特定情感特征,转化环节是将文字数字信号进行重读,转化为模拟人声的数字信号,一般为普通话,结合特定情感特征,调整音节和语速,输出为具有特定情感的模拟人声数字信号,合并环节是将上述具有特定情感的模拟人声数字信号与背景声音数字信号合并,输出为经过处理的声音数字信号。In the present invention, the processing of the digital signal from the sound preprocessing module by the information processing module includes successive steps of separation, recognition, addition, conversion, and merging. Signal separation, which separates the human voice from the background sound. The recognition link is to convert the human voice digital signal into an alphanumeric signal. The additional link is to receive the alphanumeric signal from the specific emotional characteristics sent by the emotional additional module. The conversion link is to convert the alphanumeric signal Perform rereading, convert it into a digital signal of simulated human voice, generally in Mandarin, combine specific emotional characteristics, adjust syllables and speech speed, and output a digital signal of simulated human voice with specific emotion. The sound digital signal is combined with the background sound digital signal, and the processed sound digital signal is output.
本发明应用于残障人士的日常助听时,麦克风接收到声音后,将其转化为模拟信号输入到助听主机,在数字化模块的处理下,声音模拟信号转化为声音数字信号,经过去噪和音节校正,声音数字信号传输向信息处理模块,同时,情感分析模块提取声音数字信号中的情感特征,与情感特征数据库对比,确定为指定的情感,并将所有该情感对应的情感特征传递给情感附加模块;信息处理模块得到声音数字信号后,将人声与背景声音分开,人声附加来自情感附加模块的情感特征后,重读为模拟人声的数字信号,与背景声音数字信号合并,将经过处理的声音数字信号输出为声音信息输出模块;声音信息输出模块经过48路不同频率滤波器的滤波和升压器的放大后,输出到电极贴;电极通过刺激人体皮肤,电击传递到人耳神经,转变成人体神经脉冲信号,通过神经系统的传递,投射到大脑的听觉皮层,由此产生听觉。When the present invention is applied to the daily hearing aid of the disabled, after the microphone receives the sound, it converts it into an analog signal and inputs it to the hearing aid mainframe. Syllable correction, the sound digital signal is transmitted to the information processing module, at the same time, the emotion analysis module extracts the emotional features in the sound digital signal, compares it with the emotional feature database, determines it as the specified emotion, and transfers all the emotional features corresponding to the emotion to the emotion Additional module; after the information processing module obtains the sound digital signal, it separates the human voice from the background sound, and after adding the emotional features from the emotional additional module to the human voice, it rereads the digital signal of the simulated human voice, merges it with the background sound digital signal, and passes through The processed sound digital signal output is the sound information output module; the sound information output module is filtered by 48 channels of different frequency filters and amplified by the booster, and then output to the electrode sticker; the electrode stimulates the human skin, and the electric shock is transmitted to the human ear nerve , converted into human nerve impulse signals, transmitted through the nervous system, and projected to the auditory cortex of the brain, thereby generating hearing.
在本发明中,残障人士听到的并非是原声,而是经过AI处理的电子声,可以轻易的清除人声语言中的不和谐成分,比如方言等,转化为更方便接受和识别的普通话,因此对于先天性完全丧失听力的听障人士而言,能够有效的降低学习的难度和门槛,但对于后天完全丧失听力的听障人士来说,由于已经学习过语言,见识过原声,需要经过较短时间的适应。但总而言之,能够有效的解决完全丧失听力的听障人士的难题,使其能够通过助听装置来准确的听到来自世界的声音。In the present invention, what people with disabilities hear is not the original sound, but the electronic sound processed by AI, which can easily remove the dissonant components in the human voice language, such as dialects, etc., and convert it into Mandarin, which is easier to accept and recognize. Therefore, for the hearing-impaired persons with congenital complete hearing loss, it can effectively reduce the difficulty and threshold of learning, but for the hearing-impaired persons with complete loss of hearing the day after tomorrow, since they have already learned the language and seen the original sound, they need to go through a relatively long process. short-term adaptation. But all in all, it can effectively solve the problem of hearing-impaired people who have completely lost their hearing, so that they can accurately hear the sounds from the world through hearing aids.
因此,结合上述构造和步骤可以发现,本发明所述的基于情感增益的皮肤听声装置通过计算机识别的情感特征来增益皮肤听声效果,使其助听效果更好,对声音频率的接收能力强,同时有效的降低听障人士学习的难度和门槛,效果显著。Therefore, in combination with the above structure and steps, it can be found that the skin hearing device based on emotion gain according to the present invention can gain the skin hearing effect through the emotional characteristics recognized by the computer, so that the hearing aid effect is better, and the receiving ability of the sound frequency is improved. At the same time, it effectively reduces the difficulty and threshold of learning for the hearing-impaired, and the effect is remarkable.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111358689.1A CN114040308B (en) | 2021-11-17 | 2021-11-17 | Skin hearing aid device based on emotion gain |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111358689.1A CN114040308B (en) | 2021-11-17 | 2021-11-17 | Skin hearing aid device based on emotion gain |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114040308A CN114040308A (en) | 2022-02-11 |
CN114040308B true CN114040308B (en) | 2023-06-30 |
Family
ID=80144656
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111358689.1A Active CN114040308B (en) | 2021-11-17 | 2021-11-17 | Skin hearing aid device based on emotion gain |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114040308B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6985594B1 (en) * | 1999-06-15 | 2006-01-10 | Hearing Enhancement Co., Llc. | Voice-to-remaining audio (VRA) interactive hearing aid and auxiliary equipment |
CN1748250A (en) * | 2002-12-11 | 2006-03-15 | 索夫塔马克斯公司 | System and method for speech processing using independent component analysis under stability restraints |
JP2008122729A (en) * | 2006-11-14 | 2008-05-29 | Sony Corp | Noise reducing device, noise reducing method, noise reducing program, and noise reducing audio outputting device |
KR20180125393A (en) * | 2017-05-15 | 2018-11-23 | 한국전기연구원 | Environment feature extract method and hearing aid operation method using thereof |
CN212381404U (en) * | 2020-07-07 | 2021-01-19 | 昆山快乐岛运动电子科技有限公司 | Glasses with hearing aid function |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5256119B2 (en) * | 2008-05-27 | 2013-08-07 | パナソニック株式会社 | Hearing aid, hearing aid processing method and integrated circuit used for hearing aid |
US7843337B2 (en) * | 2009-03-09 | 2010-11-30 | Panasonic Corporation | Hearing aid |
CN102222500A (en) * | 2011-05-11 | 2011-10-19 | 北京航空航天大学 | Extracting method and modeling method for Chinese speech emotion combining emotion points |
CN104053107B (en) * | 2014-06-06 | 2018-06-05 | 重庆大学 | One kind is for Sound seperation and localization method under noise circumstance |
CN105310826B (en) * | 2015-03-12 | 2017-10-24 | 汪勇 | A kind of skin listens acoustic device and its listens method for acoustic |
CN104902423A (en) * | 2015-05-04 | 2015-09-09 | 上海交通大学 | Implantable hearing aid device and implementation method thereof |
CN110337314A (en) * | 2016-10-12 | 2019-10-15 | 易科利迪有限公司 | The multifactor control of ear stimulation |
EP3373603B1 (en) * | 2017-03-09 | 2020-07-08 | Oticon A/s | A hearing device comprising a wireless receiver of sound |
DE102017207581A1 (en) * | 2017-05-05 | 2018-11-08 | Sivantos Pte. Ltd. | Hearing system and hearing device |
CN110798789A (en) * | 2018-08-03 | 2020-02-14 | 张伟明 | Hearing aid and method of use |
EP3641345B1 (en) * | 2018-10-16 | 2024-03-20 | Sivantos Pte. Ltd. | A method for operating a hearing instrument and a hearing system comprising a hearing instrument |
EP3641344B1 (en) * | 2018-10-16 | 2023-12-06 | Sivantos Pte. Ltd. | A method for operating a hearing instrument and a hearing system comprising a hearing instrument |
CN110008481B (en) * | 2019-04-10 | 2023-04-28 | 南京魔盒信息科技有限公司 | Translated voice generating method, device, computer equipment and storage medium |
CN112714390B (en) * | 2019-11-17 | 2021-12-14 | 江苏欧百家居用品有限公司 | Hearing aid based on electronic skin technology |
WO2021127228A1 (en) * | 2019-12-17 | 2021-06-24 | Starkey Laboratories, Inc. | Hearing assistance systems and methods for monitoring emotional state |
-
2021
- 2021-11-17 CN CN202111358689.1A patent/CN114040308B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6985594B1 (en) * | 1999-06-15 | 2006-01-10 | Hearing Enhancement Co., Llc. | Voice-to-remaining audio (VRA) interactive hearing aid and auxiliary equipment |
CN1748250A (en) * | 2002-12-11 | 2006-03-15 | 索夫塔马克斯公司 | System and method for speech processing using independent component analysis under stability restraints |
JP2008122729A (en) * | 2006-11-14 | 2008-05-29 | Sony Corp | Noise reducing device, noise reducing method, noise reducing program, and noise reducing audio outputting device |
KR20180125393A (en) * | 2017-05-15 | 2018-11-23 | 한국전기연구원 | Environment feature extract method and hearing aid operation method using thereof |
CN212381404U (en) * | 2020-07-07 | 2021-01-19 | 昆山快乐岛运动电子科技有限公司 | Glasses with hearing aid function |
Non-Patent Citations (3)
Title |
---|
research on communication app for deaf and mute people based on face emotion recongnition technology;Y Tao;《2020 IEEE 2rd ICCASIT》;全文 * |
学龄前不同助听方式听障儿童和健听儿童情感语调识别能力比较;张芳;《听力学及言语疾病杂志》;全文 * |
智能数字助听器中声场景分类的研究;丁一坤;《中国优秀硕士论文全文数据库工程科技II辑》;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN114040308A (en) | 2022-02-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11043210B2 (en) | Sound processing apparatus utilizing an electroencephalography (EEG) signal | |
Luo et al. | Enhancing Chinese tone recognition by manipulating amplitude envelope: Implications for cochlear implants | |
Vongphoe et al. | Speaker recognition with temporal cues in acoustic and electric hearing | |
CN102973277B (en) | Frequency following response signal test system | |
Lan et al. | A novel speech-processing strategy incorporating tonal information for cochlear implants | |
CN102265335B (en) | Hearing aid adjustment device and method | |
Yao et al. | The application of bionic wavelet transform to speech signal processing in cochlear implants using neural network simulations | |
Fletcher | Can haptic stimulation enhance music perception in hearing-impaired listeners? | |
Zhu et al. | Contributions of temporal cue on the perception of speaker individuality and vocal emotion for noise-vocoded speech | |
CN110368005A (en) | A kind of intelligent earphone and mood and physiological health monitoring method based on intelligent earphone | |
CN113178195B (en) | Speaker identification method based on sound-induced electroencephalogram signals | |
CN108320625A (en) | Vibrational feedback system towards speech rehabilitation and device | |
Fletcher et al. | Electro-haptic stimulation: A new approach for improving cochlear-implant listening | |
CN104307100B (en) | A kind of method and system improving artificial cochlea's pitch perception | |
Ifukube | Sound-based assistive technology | |
CN114040308B (en) | Skin hearing aid device based on emotion gain | |
CN118173117A (en) | A silent speech recognition method and system | |
CN111150934B (en) | Evaluation system of Chinese tone coding strategy of cochlear implant | |
CN102426839B (en) | A Speech Recognition Method for Hearing Impaired People | |
CN208540163U (en) | A kind of auditory prosthesis | |
Ahad | An EEG-Based Comparative Analysis of Natural Speech Perception by Native Speakers of American English vs. Bilingual Individuals | |
Zhu et al. | Important role of temporal cues in speaker identification for simulated cochlear implants | |
Saxena et al. | Refinement of input speech by suppressing the unwanted amplitudes for blue hearing system | |
Barda et al. | CODING AND ANALYSIS OF SPEECH IN COCHLEAR IMPLANT: A REVIEW. | |
Summers et al. | Choice of speech features for tactile presentation to the profoundly deaf |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |