CN1623186A - Voice activity detector and validator for noisy environments - Google Patents
- Publication number: CN1623186A (application CN03802682A)
- Authority: CN (China)
- Prior art keywords: frame, speech, communication unit, voice, signal
- Prior art date
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Links
- 230000000694 effects Effects 0.000 title claims abstract description 49
- 230000001133 acceleration Effects 0.000 claims abstract description 63
- 238000000034 method Methods 0.000 claims abstract description 50
- 238000005259 measurement Methods 0.000 claims abstract description 40
- 238000001514 detection method Methods 0.000 claims abstract description 35
- 238000004891 communication Methods 0.000 claims abstract description 34
- 230000007246 mechanism Effects 0.000 claims abstract description 16
- 238000001228 spectrum Methods 0.000 claims description 27
- 230000008569 process Effects 0.000 claims description 19
- 238000000605 extraction Methods 0.000 claims description 4
- 230000008859 change Effects 0.000 claims description 2
- 230000002596 correlated effect Effects 0.000 claims 2
- 238000012545 processing Methods 0.000 abstract description 25
- 230000008901 benefit Effects 0.000 abstract description 4
- 230000004044 response Effects 0.000 abstract description 3
- 230000005540 biological transmission Effects 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 238000005096 rolling process Methods 0.000 description 9
- 230000003595 spectral effect Effects 0.000 description 8
- 238000013459 approach Methods 0.000 description 6
- 230000003044 adaptive effect Effects 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 3
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000006837 decompression Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000004378 air conditioning Methods 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000006735 deficit Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000012854 evaluation process Methods 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000003825 pressing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
- G10L2025/786—Adaptive threshold
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Mobile Radio Communication Systems (AREA)
- Telephone Function (AREA)
Claims (15)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0201585.7 | 2002-01-24 | | |
GB0201585A (GB2384670B, en) | 2002-01-24 | 2002-01-24 | Voice activity detector and validator for noisy environments |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1623186A (zh) | 2005-06-01 |
CN1307613C (zh) | 2007-03-28 |
Family
ID=9929648
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB038026821A (CN1307613C (zh), Expired - Lifetime) | Voice activity detector and validator for noisy environments | 2002-01-24 | 2003-01-10 |
Country Status (6)
Country | Link |
---|---|
JP (2) | JP2005516247A (zh) |
KR (2) | KR20040075959A (zh) |
CN (1) | CN1307613C (zh) |
FI (1) | FI124869B (zh) |
GB (1) | GB2384670B (zh) |
WO (1) | WO2003063138A1 (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
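KR100657912B1 (ko) * | 2004-11-18 | 2006-12-14 | 삼성전자주식회사 | Noise removal method and apparatus |
JP4758879B2 (ja) * | 2006-12-14 | 2011-08-31 | 日本電信電話株式会社 | Provisional speech interval determination device, method, program, and recording medium therefor; speech interval determination device and method |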
GB2450886B (en) | 2007-07-10 | 2009-12-16 | Motorola Inc | Voice activity detector and a method of operation |
US8407044B2 (en) | 2008-10-30 | 2013-03-26 | Telefonaktiebolaget Lm Ericsson (Publ) | Telephony content signal discrimination |
KR101196518B1 (ko) | 2011-04-05 | 2012-11-01 | 한국과학기술연구원 | Real-time voice activity detection apparatus and detection method |
RU2544293C1 (ru) * | 2013-10-11 | 2015-03-20 | Сергей Александрович Косарев | Method for measuring a physical quantity using a mobile electronic device and an external unit |
US9953661B2 (en) * | 2014-09-26 | 2018-04-24 | Cirrus Logic Inc. | Neural network voice activity detection employing running range normalization |
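JP2016167678A (ja) * | 2015-03-09 | 2016-09-15 | 株式会社リコー | Communication device, communication system, log data storage method, and program |
CN112820324B (zh) * | 2020-12-31 | 2024-06-25 | 平安科技(深圳)有限公司 | Multi-label voice activity detection method, device, and storage medium |
KR102453919B1 (ko) | 2022-05-09 | 2022-10-12 | (주)피플리 | Method, device, and system for verifying guide audio sources related to cultural content, based on artificial intelligence |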
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IT1209561B (it) * | 1983-07-14 | 1989-08-30 | Gte Laboratories Inc | Complementary speech detection |
JP2559475B2 (ja) * | 1988-09-22 | 1996-12-04 | 積水化学工業株式会社 | Voice detection system |
JPH03114100A (ja) * | 1989-09-28 | 1991-05-15 | Matsushita Electric Ind Co Ltd | Voice section detection device |
JP3024447B2 (ja) * | 1993-07-13 | 2000-03-21 | 日本電気株式会社 | Audio compression device |
JP3109978B2 (ja) * | 1995-04-28 | 2000-11-20 | 松下電器産業株式会社 | Voice section detection device |
US5774849A (en) | 1996-01-22 | 1998-06-30 | Rockwell International Corporation | Method and apparatus for generating frame voicing decisions of an incoming speech signal |
JPH10171497A (ja) * | 1996-12-12 | 1998-06-26 | Oki Electric Ind Co Ltd | Background noise removal device |
US5946649A (en) * | 1997-04-16 | 1999-08-31 | Technology Research Association Of Medical Welfare Apparatus | Esophageal speech injection noise detection and rejection |
JP3297346B2 (ja) * | 1997-04-30 | 2002-07-02 | 沖電気工業株式会社 | Voice detection device |
JPH10327089A (ja) * | 1997-05-23 | 1998-12-08 | Matsushita Electric Ind Co Ltd | Mobile telephone device |
JPH113091A (ja) * | 1997-06-13 | 1999-01-06 | Matsushita Electric Ind Co Ltd | Voice signal onset detection device |
US6032116A (en) * | 1997-06-27 | 2000-02-29 | Advanced Micro Devices, Inc. | Distance measure in a speech recognition system for speech recognition using frequency shifting factors to compensate for input signal frequency shifts |
FR2768544B1 (fr) * | 1997-09-18 | 1999-11-19 | Matra Communication | Voice activity detection method |
JP4221537B2 (ja) * | 2000-06-02 | 2009-02-12 | 日本電気株式会社 | Voice detection method and apparatus, and recording medium therefor |
- 2002
  - 2002-01-24 GB GB0201585A patent/GB2384670B/en not_active Expired - Lifetime
- 2003
  - 2003-01-10 CN CNB038026821A patent/CN1307613C/zh not_active Expired - Lifetime
  - 2003-01-10 KR KR10-2004-7011459A patent/KR20040075959A/ko not_active Ceased
  - 2003-01-10 KR KR1020097022615A patent/KR100976082B1/ko not_active Expired - Lifetime
  - 2003-01-10 JP JP2003562919A patent/JP2005516247A/ja active Pending
  - 2003-01-10 WO PCT/EP2003/000271 patent/WO2003063138A1/en active Application Filing
- 2004
  - 2004-07-22 FI FI20041013A patent/FI124869B/fi active IP Right Grant
- 2009
  - 2009-11-02 JP JP2009251650A patent/JP2010061151A/ja active Pending
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
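CN100543841C (zh) * | 2005-10-21 | 2009-09-23 | 神基科技股份有限公司 | Sound source processing circuit structure and processing method thereof |
WO2011044853A1 (zh) * | 2009-10-15 | 2011-04-21 | 华为技术有限公司 | Method and device for tracking background noise in a communication system |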
US8095361B2 (en) | 2009-10-15 | 2012-01-10 | Huawei Technologies Co., Ltd. | Method and device for tracking background noise in communication system |
US8447601B2 (en) | 2009-10-15 | 2013-05-21 | Huawei Technologies Co., Ltd. | Method and device for tracking background noise in communication system |
US9165567B2 (en) | 2010-04-22 | 2015-10-20 | Qualcomm Incorporated | Systems, methods, and apparatus for speech feature detection |
CN102884575A (zh) * | 2010-04-22 | 2013-01-16 | 高通股份有限公司 | Voice activity detection |
US8898058B2 (en) | 2010-10-25 | 2014-11-25 | Qualcomm Incorporated | Systems, methods, and apparatus for voice activity detection |
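CN104575498A (zh) * | 2015-01-30 | 2015-04-29 | 深圳市云之讯网络技术有限公司 | Effective speech recognition method and system |
CN104575498B (zh) * | 2015-01-30 | 2018-08-17 | 深圳市云之讯网络技术有限公司 | Effective speech recognition method and system |
CN109841223A (zh) * | 2019-03-06 | 2019-06-04 | 深圳大学 | Audio signal processing method, intelligent terminal, and storage medium |
CN109841223B (zh) * | 2019-03-06 | 2020-11-24 | 深圳大学 | Audio signal processing method, intelligent terminal, and storage medium |
CN113614829A (zh) * | 2019-11-18 | 2021-11-05 | 谷歌有限责任公司 | Adaptive energy limiting for transient noise suppression |
CN113614829B (zh) * | 2019-11-18 | 2024-12-24 | 谷歌有限责任公司 | Adaptive energy limiting for transient noise suppression |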
Also Published As
Publication number | Publication date |
---|---|
KR20040075959A (ko) | 2004-08-30 |
CN1307613C (zh) | 2007-03-28 |
GB2384670B (en) | 2004-02-18 |
KR20090127182A (ko) | 2009-12-09 |
WO2003063138A1 (en) | 2003-07-31 |
GB2384670A (en) | 2003-07-30 |
JP2005516247A (ja) | 2005-06-02 |
FI124869B (fi) | 2015-02-27 |
KR100976082B1 (ko) | 2010-08-16 |
JP2010061151A (ja) | 2010-03-18 |
GB0201585D0 (en) | 2002-03-13 |
FI20041013L (fi) | 2004-09-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1623186A (zh) | Voice activity detector and validator for noisy environments | |
KR100944252B1 (ko) | Detection of voice activity in an audio signal | |
US8301440B2 (en) | Bit error concealment for audio coding systems | |
US9305567B2 (en) | Systems and methods for audio signal processing | |
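CN1257486C (zh) | Method and apparatus for retaining perceptually relevant information in an audio signal | |
CN106464683B (zh) | Selecting a packet loss concealment procedure | |
CN1220179C (zh) | Apparatus and method for rate determination in a communication system | |
JP6377862B2 (ja) | Encoder selection | |
JP6031041B2 (ja) | Device having a plurality of audio sensors and method of operating the same | |
CN1223109C (zh) | Enhancement of near-end voice signals in an echo suppression system | |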
WO2008121436A1 (en) | Method and apparatus for quickly detecting a presence of abrupt noise and updating a noise estimate | |
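JP2003514473A (ja) | Noise suppression | |
KR20110025667A (ko) | Systems, methods, apparatus, and computer program products for spectral contrast enhancement | |
JP6098149B2 (ja) | Speech processing device, speech processing method, and speech processing program | |
CN1046366C (zh) | Discrimination between stationary and non-stationary signals | |
CN1763844A (zh) | Sliding-window-based endpoint detection method, device, and speech recognition system | |
CN1949364A (zh) | System and method for detecting the recognizability of an input speech signal | |
JP2002258899A (ja) | Noise suppression method and noise suppression device | |
CN1787079A (zh) | Noise detection device and method |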
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
ASS | Succession or assignment of patent right | Owner name: MOTOROLA MOBILE CO., LTD.; Free format text: FORMER OWNER: MOTOROLA INC.; Effective date: 20110113 |
C41 | Transfer of patent application or patent right or utility model | |
TR01 | Transfer of patent right | Effective date of registration: 20110113; Address after: Illinois State; Patentee after: MOTOROLA MOBILITY, Inc.; Address before: Illinois, USA; Patentee before: Motorola, Inc. |
C41 | Transfer of patent application or patent right or utility model | |
C56 | Change in the name or address of the patentee | |
CP01 | Change in the name or title of a patent holder | Address after: Illinois State; Patentee after: MOTOROLA MOBILITY LLC; Address before: Illinois State; Patentee before: MOTOROLA MOBILITY, Inc. |
TR01 | Transfer of patent right | Effective date of registration: 20160516; Address after: California, USA; Patentee after: Google Technology Holdings LLC; Address before: Illinois State; Patentee before: MOTOROLA MOBILITY LLC |
CX01 | Expiry of patent term | |
CX01 | Expiry of patent term | Granted publication date: 20070328 |