ATE347161T1 - Rauschrobuste mustererkennung - Google Patents
Rauschrobuste mustererkennungInfo
- Publication number
- ATE347161T1 ATE347161T1 AT01124141T AT01124141T ATE347161T1 AT E347161 T1 ATE347161 T1 AT E347161T1 AT 01124141 T AT01124141 T AT 01124141T AT 01124141 T AT01124141 T AT 01124141T AT E347161 T1 ATE347161 T1 AT E347161T1
- Authority
- AT
- Austria
- Prior art keywords
- noise
- pattern recognition
- training
- signal
- recognition model
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Filters That Use Time-Delay Elements (AREA)
- Noise Elimination (AREA)
- Circuit For Audible Band Transducer (AREA)
- Holo Graphy (AREA)
- Inspection Of Paper Currency And Valuable Securities (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/688,950 US6876966B1 (en) | 2000-10-16 | 2000-10-16 | Pattern recognition training method and apparatus using inserted noise followed by noise reduction |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE347161T1 true ATE347161T1 (de) | 2006-12-15 |
Family
ID=24766456
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT01124141T ATE347161T1 (de) | 2000-10-16 | 2001-10-10 | Rauschrobuste mustererkennung |
Country Status (5)
Country | Link |
---|---|
US (1) | US6876966B1 (de) |
EP (1) | EP1199708B1 (de) |
JP (1) | JP4195211B2 (de) |
AT (1) | ATE347161T1 (de) |
DE (1) | DE60124842T2 (de) |
Families Citing this family (71)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6778994B2 (en) | 2001-05-02 | 2004-08-17 | Victor Gogolak | Pharmacovigilance database |
US7542961B2 (en) * | 2001-05-02 | 2009-06-02 | Victor Gogolak | Method and system for analyzing drug adverse effects |
US7925612B2 (en) * | 2001-05-02 | 2011-04-12 | Victor Gogolak | Method for graphically depicting drug adverse effect risks |
US7461006B2 (en) * | 2001-08-29 | 2008-12-02 | Victor Gogolak | Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data |
US7165028B2 (en) * | 2001-12-12 | 2007-01-16 | Texas Instruments Incorporated | Method of speech recognition resistant to convolutive distortion and additive distortion |
US7209881B2 (en) * | 2001-12-20 | 2007-04-24 | Matsushita Electric Industrial Co., Ltd. | Preparing acoustic models by sufficient statistics and noise-superimposed speech data |
US7130776B2 (en) * | 2002-03-25 | 2006-10-31 | Lockheed Martin Corporation | Method and computer program product for producing a pattern recognition training set |
US7117148B2 (en) | 2002-04-05 | 2006-10-03 | Microsoft Corporation | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization |
US7174292B2 (en) | 2002-05-20 | 2007-02-06 | Microsoft Corporation | Method of determining uncertainty associated with acoustic distortion-based noise reduction |
US7107210B2 (en) * | 2002-05-20 | 2006-09-12 | Microsoft Corporation | Method of noise reduction based on dynamic aspects of speech |
US7103540B2 (en) * | 2002-05-20 | 2006-09-05 | Microsoft Corporation | Method of pattern recognition using noise reduction uncertainty |
JP4352790B2 (ja) * | 2002-10-31 | 2009-10-28 | セイコーエプソン株式会社 | 音響モデル作成方法および音声認識装置ならびに音声認識装置を有する乗り物 |
US7370057B2 (en) * | 2002-12-03 | 2008-05-06 | Lockheed Martin Corporation | Framework for evaluating data cleansing applications |
WO2004104908A1 (en) * | 2003-05-21 | 2004-12-02 | Koninklijke Philips Electronics N.V. | Method and device for verifying the identity of an object |
US8041026B1 (en) | 2006-02-07 | 2011-10-18 | Avaya Inc. | Event driven noise cancellation |
US20070239444A1 (en) * | 2006-03-29 | 2007-10-11 | Motorola, Inc. | Voice signal perturbation for speech recognition |
JP4245617B2 (ja) * | 2006-04-06 | 2009-03-25 | 株式会社東芝 | 特徴量補正装置、特徴量補正方法および特徴量補正プログラム |
JP4316583B2 (ja) | 2006-04-07 | 2009-08-19 | 株式会社東芝 | 特徴量補正装置、特徴量補正方法および特徴量補正プログラム |
US7840287B2 (en) * | 2006-04-13 | 2010-11-23 | Fisher-Rosemount Systems, Inc. | Robust process model identification in model based control techniques |
US8407160B2 (en) * | 2006-11-15 | 2013-03-26 | The Trustees Of Columbia University In The City Of New York | Systems, methods, and media for generating sanitized data, sanitizing anomaly detection models, and/or generating sanitized anomaly detection models |
US8195453B2 (en) * | 2007-09-13 | 2012-06-05 | Qnx Software Systems Limited | Distributed intelligibility testing system |
EP2210427B1 (de) | 2007-09-26 | 2015-05-06 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung, Verfahren und Computerprogramm zum Extrahieren eines Umgebungssignal |
US8615397B2 (en) * | 2008-04-04 | 2013-12-24 | Intuit Inc. | Identifying audio content using distorted target patterns |
NO328622B1 (no) | 2008-06-30 | 2010-04-06 | Tandberg Telecom As | Anordning og fremgangsmate for reduksjon av tastaturstoy i konferanseutstyr |
JP5150542B2 (ja) * | 2009-03-26 | 2013-02-20 | 株式会社東芝 | パターン認識装置、パターン認識方法、及び、プログラム |
US11416214B2 (en) | 2009-12-23 | 2022-08-16 | Google Llc | Multi-modal input on an electronic device |
EP4318463A3 (de) | 2009-12-23 | 2024-02-28 | Google LLC | Multimodale eingabe in eine elektronische vorrichtung |
US8660842B2 (en) * | 2010-03-09 | 2014-02-25 | Honda Motor Co., Ltd. | Enhancing speech recognition using visual information |
US8265928B2 (en) * | 2010-04-14 | 2012-09-11 | Google Inc. | Geotagged environmental audio for enhanced speech recognition accuracy |
US8468012B2 (en) | 2010-05-26 | 2013-06-18 | Google Inc. | Acoustic model adaptation using geographic information |
US8484023B2 (en) * | 2010-09-24 | 2013-07-09 | Nuance Communications, Inc. | Sparse representation features for speech recognition |
US8352245B1 (en) | 2010-12-30 | 2013-01-08 | Google Inc. | Adjusting language models |
US8296142B2 (en) | 2011-01-21 | 2012-10-23 | Google Inc. | Speech recognition using dock context |
HUP1200018A2 (en) | 2012-01-11 | 2013-07-29 | 77 Elektronika Mueszeripari Kft | Method of training a neural network, as well as a neural network |
US8484017B1 (en) | 2012-09-10 | 2013-07-09 | Google Inc. | Identifying media content |
US20140074466A1 (en) | 2012-09-10 | 2014-03-13 | Google Inc. | Answering questions using environmental context |
US9734819B2 (en) | 2013-02-21 | 2017-08-15 | Google Technology Holdings LLC | Recognizing accented speech |
US20140270249A1 (en) | 2013-03-12 | 2014-09-18 | Motorola Mobility Llc | Method and Apparatus for Estimating Variability of Background Noise for Noise Suppression |
US20140278393A1 (en) | 2013-03-12 | 2014-09-18 | Motorola Mobility Llc | Apparatus and Method for Power Efficient Signal Conditioning for a Voice Recognition System |
US9237225B2 (en) | 2013-03-12 | 2016-01-12 | Google Technology Holdings LLC | Apparatus with dynamic audio signal pre-conditioning and methods therefor |
US9275638B2 (en) | 2013-03-12 | 2016-03-01 | Google Technology Holdings LLC | Method and apparatus for training a voice recognition model database |
CN105580071B (zh) * | 2013-05-06 | 2020-08-21 | 谷歌技术控股有限责任公司 | 用于训练声音识别模型数据库的方法和装置 |
CN103310789B (zh) * | 2013-05-08 | 2016-04-06 | 北京大学深圳研究生院 | 一种基于改进的并行模型组合的声音事件识别方法 |
US9842592B2 (en) | 2014-02-12 | 2017-12-12 | Google Inc. | Language models using non-linguistic context |
US9412365B2 (en) | 2014-03-24 | 2016-08-09 | Google Inc. | Enhanced maximum entropy models |
US9858922B2 (en) | 2014-06-23 | 2018-01-02 | Google Inc. | Caching speech recognition scores |
US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
US9299347B1 (en) | 2014-10-22 | 2016-03-29 | Google Inc. | Speech recognition using associative mapping |
KR102167719B1 (ko) | 2014-12-08 | 2020-10-19 | 삼성전자주식회사 | 언어 모델 학습 방법 및 장치, 음성 인식 방법 및 장치 |
US9535905B2 (en) * | 2014-12-12 | 2017-01-03 | International Business Machines Corporation | Statistical process control and analytics for translation supply chain operational management |
KR101988222B1 (ko) * | 2015-02-12 | 2019-06-13 | 한국전자통신연구원 | 대어휘 연속 음성 인식 장치 및 방법 |
US10134394B2 (en) | 2015-03-20 | 2018-11-20 | Google Llc | Speech recognition using log-linear model |
US9786270B2 (en) | 2015-07-09 | 2017-10-10 | Google Inc. | Generating acoustic models |
KR102494139B1 (ko) * | 2015-11-06 | 2023-01-31 | 삼성전자주식회사 | 뉴럴 네트워크 학습 장치 및 방법과, 음성 인식 장치 및 방법 |
US20170148466A1 (en) * | 2015-11-25 | 2017-05-25 | Tim Jackson | Method and system for reducing background sounds in a noisy environment |
CN105448303B (zh) * | 2015-11-27 | 2020-02-04 | 百度在线网络技术(北京)有限公司 | 语音信号的处理方法和装置 |
US10229672B1 (en) | 2015-12-31 | 2019-03-12 | Google Llc | Training acoustic models using connectionist temporal classification |
US9978367B2 (en) | 2016-03-16 | 2018-05-22 | Google Llc | Determining dialog states for language models |
US20180018973A1 (en) | 2016-07-15 | 2018-01-18 | Google Inc. | Speaker verification |
US10832664B2 (en) | 2016-08-19 | 2020-11-10 | Google Llc | Automated speech recognition using language models that selectively use domain-specific model components |
US10311860B2 (en) | 2017-02-14 | 2019-06-04 | Google Llc | Language model biasing system |
US10706840B2 (en) | 2017-08-18 | 2020-07-07 | Google Llc | Encoder-decoder models for sequence to sequence mapping |
CN112639968B (zh) | 2018-08-30 | 2024-10-01 | 杜比国际公司 | 用于控制对经低比特率编码的音频的增强的方法和装置 |
CN110505332A (zh) * | 2019-09-05 | 2019-11-26 | 深圳传音控股股份有限公司 | 一种降噪方法、装置、移动终端及存储介质 |
CN111210810A (zh) * | 2019-12-17 | 2020-05-29 | 秒针信息技术有限公司 | 模型训练方法和装置 |
EP3862782A1 (de) * | 2020-02-04 | 2021-08-11 | Infineon Technologies AG | Vorrichtung und verfahren zur korrektur eines eingangssignals |
CN111429930B (zh) * | 2020-03-16 | 2023-02-28 | 云知声智能科技股份有限公司 | 一种基于自适应采样率的降噪模型处理方法及系统 |
CN111863008A (zh) * | 2020-07-07 | 2020-10-30 | 北京达佳互联信息技术有限公司 | 一种音频降噪方法、装置及存储介质 |
CN112614484B (zh) * | 2020-11-23 | 2022-05-20 | 北京百度网讯科技有限公司 | 特征信息挖掘方法、装置及电子设备 |
CN113515556A (zh) * | 2021-04-15 | 2021-10-19 | 阿里巴巴新加坡控股有限公司 | 数据处理方法、客户端及电子设备 |
CN114190953B (zh) * | 2021-12-09 | 2024-07-23 | 四川新源生物电子科技有限公司 | 针对脑电采集设备的脑电信号降噪模型的训练方法和系统 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE4309985A1 (de) * | 1993-03-29 | 1994-10-06 | Sel Alcatel Ag | Geräuschreduktion zur Spracherkennung |
DE4322372A1 (de) * | 1993-07-06 | 1995-01-12 | Sel Alcatel Ag | Verfahren und Vorrichtung zur Spracherkennung |
US6067517A (en) * | 1996-02-02 | 2000-05-23 | International Business Machines Corporation | Transcription of speech data with segments from acoustically dissimilar environments |
US6026359A (en) * | 1996-09-20 | 2000-02-15 | Nippon Telegraph And Telephone Corporation | Scheme for model adaptation in pattern recognition based on Taylor expansion |
US5950157A (en) * | 1997-02-28 | 1999-09-07 | Sri International | Method for establishing handset-dependent normalizing models for speaker recognition |
US6529872B1 (en) * | 2000-04-18 | 2003-03-04 | Matsushita Electric Industrial Co., Ltd. | Method for noise adaptation in automatic speech recognition using transformed matrices |
-
2000
- 2000-10-16 US US09/688,950 patent/US6876966B1/en not_active Expired - Lifetime
-
2001
- 2001-10-10 DE DE60124842T patent/DE60124842T2/de not_active Expired - Lifetime
- 2001-10-10 EP EP01124141A patent/EP1199708B1/de not_active Expired - Lifetime
- 2001-10-10 AT AT01124141T patent/ATE347161T1/de not_active IP Right Cessation
- 2001-10-16 JP JP2001317824A patent/JP4195211B2/ja not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
JP4195211B2 (ja) | 2008-12-10 |
US6876966B1 (en) | 2005-04-05 |
EP1199708B1 (de) | 2006-11-29 |
EP1199708A3 (de) | 2003-10-15 |
DE60124842T2 (de) | 2007-04-12 |
EP1199708A2 (de) | 2002-04-24 |
DE60124842D1 (de) | 2007-01-11 |
JP2002140089A (ja) | 2002-05-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE347161T1 (de) | Rauschrobuste mustererkennung | |
DE69823947D1 (de) | Verfahren, Vorrichtung und Aufzeichnungsmedium zur Erzeugung von Tondaten | |
DE60042588D1 (de) | Signalverarbeitungsvorrichtung und verfahren und aufzeichnungsmedium | |
DE60139877D1 (de) | Teileerkennungsdatenerzeugungsverfahren und vorrichtung, anbringvorrichtung für elektronische teile und aufzeichnungsmedium | |
TW356548B (en) | Sound identifying device method of sound identification and the game machine using the said device | |
TW200705387A (en) | Systems, methods, and apparatus for highband time warping | |
DE69619054D1 (de) | Verfahren und Vorrichtung zur Sprachkodierung | |
DE60023517D1 (de) | Klassifizierung von schallquellen | |
DE60222739D1 (de) | Gerät und Verfahren zur Erzeugung von digitalen Signalen, die jeweils einen analogen Signalwert kodieren | |
ATE224124T1 (de) | Verfahren und vorrichtung zur übertragung von inhaltsinformation und darauf bezogener zusatzinformation | |
DE60128270D1 (de) | Verfahren und System zur Erzeugung von Sprechererkennungsdaten, und Verfahren und System zur Sprechererkennung | |
ATE412941T1 (de) | Speicherschnittstellenprotokoll zur unterscheidung von statusinformationen von lesedaten | |
EP0683595A3 (de) | Vorrichtung zur Erzeugung eines Bildes. | |
DE3781393D1 (de) | Verfahren und einrichtung zur komprimierung von sprachsignaldaten. | |
DE69800320D1 (de) | Verfahren und Vorrichtung zur Sprechererkennung durch Prüfung von mündlicher Information mittels Zwangsdekodierung | |
DE60138696D1 (de) | Verfahren und system zum speichern eines codierungsmusters | |
DE50103752D1 (de) | Verfahren und sendeschaltung zur erzeugung eines sendesignals | |
ATE319160T1 (de) | Verfahren zur rauschrobusten klassifikation in der sprachkodierung | |
ATE264584T1 (de) | Verfahren zur signalspitzenskalierung und entsprechender sender | |
DE60227308D1 (de) | System, Verfahren und Vorrichtung zur Bestimmung der Grenze eines Informationselements | |
ATE450033T1 (de) | Verfahren zur geräuschunterdrückung | |
ATE381915T1 (de) | Audioinformationsübertragungsvorrichtung und zugehöriges verfahren | |
ATE286334T1 (de) | Vorrichtung zur klassifikation von komplexen signalen mit linearer digitaler modulation | |
DE60325736D1 (de) | Verfahren und Vorrichtung zur Rauschverminderung in einem Schallsignal | |
EP1220199A3 (de) | Verfahren zur Detektion und Wiedergabe des Untertones einer Stimme und Vorrichtung dafür |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |