HK1056427A1 - Method and apparatus for constructing voice templates for a speaker-independent voice recognition system. - Google Patents

Method and apparatus for constructing voice templates for a speaker-independent voice recognition system.

Info

Publication number: HK1056427A1
Authority: HK; Hong Kong
Prior art keywords: utterances; generate; speaker; recognition system; training
Prior art date: 2000-07-13

Application number

HK03108617A

Other languages

English (en)

Inventor

Ning Bi

Original Assignee

Qualcomm Inc

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2000-07-13

Filing date

2003-11-26

Publication date

2004-02-13

2003-11-26 Application filed by Qualcomm Inc filed Critical Qualcomm Inc

2004-02-13 Publication of HK1056427A1 publication Critical patent/HK1056427A1/xx

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training

Landscapes

Engineering & Computer Science (AREA)
Multimedia (AREA)
Artificial Intelligence (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Computational Linguistics (AREA)
Acoustics & Sound (AREA)
Telephonic Communication Services (AREA)
Image Analysis (AREA)
Image Processing (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Electrically Operated Instructional Devices (AREA)
Machine Translation (AREA)
Audible-Bandwidth Dynamoelectric Transducers Other Than Pickups (AREA)

HK03108617A 2000-07-13 2003-11-26 Method and apparatus for constructing voice templates for a speaker-independent voice recognition system. HK1056427A1 (en)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US09/615,572 US6735563B1 (en)	2000-07-13	2000-07-13	Method and apparatus for constructing voice templates for a speaker-independent voice recognition system
PCT/US2001/022009 WO2002007145A2 (en)	2000-07-13	2001-07-11	Method and apparatus for constructing voice templates for a speaker-independent voice recognition system

Publications (1)

Publication Number	Publication Date
HK1056427A1 true HK1056427A1 (en)	2004-02-13

Family

ID=24465970

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
HK03108617A HK1056427A1 (en)	2000-07-13	2003-11-26	Method and apparatus for constructing voice templates for a speaker-independent voice recognition system.

Country Status (13)

Country	Link
US (1)	US6735563B1 (xx)
EP (1)	EP1301919B1 (xx)
JP (1)	JP4202124B2 (xx)
KR (1)	KR100766761B1 (xx)
CN (1)	CN1205601C (xx)
AT (1)	ATE345562T1 (xx)
AU (1)	AU2001273410A1 (xx)
BR (1)	BR0112405A (xx)
DE (1)	DE60124551T2 (xx)
ES (1)	ES2275700T3 (xx)
HK (1)	HK1056427A1 (xx)
TW (1)	TW514867B (xx)
WO (1)	WO2002007145A2 (xx)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US6990446B1 (en) *	2000-10-10	2006-01-24	Microsoft Corporation	Method and apparatus using spectral addition for speaker recognition
DE10127559A1 (de)	2001-06-06	2002-12-12	Philips Corp Intellectual Pty	Benutzergruppenspezifisches Musterverarbeitungssystem
TW541517B (en) *	2001-12-25	2003-07-11	Univ Nat Cheng Kung	Speech recognition system
EP1363271A1 (de)	2002-05-08	2003-11-19	Sap Ag	Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs
DE10220524B4 (de)	2002-05-08	2006-08-10	Sap Ag	Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache
KR100533601B1 (ko) *	2002-12-05	2005-12-06	베스티안파트너스(주)	휴대전화의 화자독립형 음성인식을 위한 성별 구분방법
US7509257B2 (en) *	2002-12-24	2009-03-24	Marvell International Ltd.	Method and apparatus for adapting reference templates
WO2005026043A2 (en)	2003-07-29	2005-03-24	Intelligent Energy, Inc.	Methods for providing thin hydrogen separation membranes and associated uses
US7389233B1 (en) *	2003-09-02	2008-06-17	Verizon Corporate Services Group Inc.	Self-organizing speech recognition for information extraction
KR100827074B1 (ko) *	2004-04-06	2008-05-02	삼성전자주식회사	이동 통신 단말기의 자동 다이얼링 장치 및 방법
WO2006033104A1 (en) *	2004-09-22	2006-03-30	Shalon Ventures Research, Llc	Systems and methods for monitoring and modifying behavior
US8219391B2 (en) *	2005-02-15	2012-07-10	Raytheon Bbn Technologies Corp.	Speech analyzing system with speech codebook
CN1963918A (zh) *	2005-11-11	2007-05-16	株式会社东芝	说话人模板的压缩、合并装置和方法，以及说话人认证
US8612229B2 (en)	2005-12-15	2013-12-17	Nuance Communications, Inc.	Method and system for conveying an example in a natural language understanding application
JP4745094B2 (ja) *	2006-03-20	2011-08-10	富士通株式会社	クラスタリングシステム、クラスタリング方法、クラスタリングプログラムおよびクラスタリングシステムを用いた属性推定システム
US20070276668A1 (en) *	2006-05-23	2007-11-29	Creative Technology Ltd	Method and apparatus for accessing an audio file from a collection of audio files using tonal matching
US8532984B2 (en)	2006-07-31	2013-09-10	Qualcomm Incorporated	Systems, methods, and apparatus for wideband encoding and decoding of active frames
US8239190B2 (en)	2006-08-22	2012-08-07	Qualcomm Incorporated	Time-warping frames of wideband vocoder
TWI349266B (en) *	2007-04-13	2011-09-21	Qisda Corp	Voice recognition system and method
CN101465123B (zh) *	2007-12-20	2011-07-06	株式会社东芝	说话人认证的验证方法和装置以及说话人认证系统
US20120168331A1 (en) *	2010-12-30	2012-07-05	Safecode Drug Technologies Corp.	Voice template protector for administering medicine
CN102623008A (zh) *	2011-06-21	2012-08-01	中国科学院苏州纳米技术与纳米仿生研究所	声纹识别方法
CN105989849B (zh) *	2015-06-03	2019-12-03	乐融致新电子科技(天津)有限公司	一种语音增强方法、语音识别方法、聚类方法及装置
US10134425B1 (en) *	2015-06-29	2018-11-20	Amazon Technologies, Inc.	Direction-based speech endpointing
KR101901965B1 (ko) *	2017-01-12	2018-09-28	엘에스산전 주식회사	프로젝트 화면 작성장치
KR102509821B1 (ko) *	2017-09-18	2023-03-14	삼성전자주식회사	Oos 문장을 생성하는 방법 및 이를 수행하는 장치
CN110706710A (zh) *	2018-06-25	2020-01-17	普天信息技术有限公司	一种语音识别方法、装置、电子设备及存储介质
CN109801622B (zh) *	2019-01-31	2020-12-22	嘉楠明芯(北京)科技有限公司	一种语音识别模板训练方法、语音识别方法及装置
CN111063348B (zh) *	2019-12-13	2022-06-07	腾讯科技（深圳）有限公司	一种信息处理方法、装置、设备及计算机存储介质

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US4415767A (en) *	1981-10-19	1983-11-15	Votan	Method and apparatus for speech recognition and reproduction
CA1261472A (en)	1985-09-26	1989-09-26	Yoshinao Shiraki	Reference speech pattern generating method
US4797929A (en) *	1986-01-03	1989-01-10	Motorola, Inc.	Word recognition in a speech recognition system using data reduced word templates
CA1299750C (en) *	1986-01-03	1992-04-28	Ira Alan Gerson	Optimal method of data reduction in a speech recognition system
US4855910A (en) *	1986-10-22	1989-08-08	North American Philips Corporation	Time-clustered cardio-respiratory encoder and method for clustering cardio-respiratory signals
US5226084A (en) *	1990-12-05	1993-07-06	Digital Voice Systems, Inc.	Methods for speech quantization and error correction
AU671952B2 (en)	1991-06-11	1996-09-19	Qualcomm Incorporated	Variable rate vocoder
US5337394A (en) *	1992-06-09	1994-08-09	Kurzweil Applied Intelligence, Inc.	Speech recognizer
US5682464A (en) *	1992-06-29	1997-10-28	Kurzweil Applied Intelligence, Inc.	Word model candidate preselection for speech recognition using precomputed matrix of thresholded distance values
JP3336754B2 (ja) *	1994-08-19	2002-10-21	ソニー株式会社	デジタルビデオ信号の記録方法及び記録装置
US5839103A (en) *	1995-06-07	1998-11-17	Rutgers, The State University Of New Jersey	Speaker verification system using decision fusion logic
JP3180655B2 (ja) *	1995-06-19	2001-06-25	日本電信電話株式会社	パターンマッチングによる単語音声認識方法及びその方法を実施する装置
KR0169414B1 (ko) *	1995-07-01	1999-01-15	김광호	복수채널 직렬 접속 제어회로
CN1302427A (zh) *	1997-11-03	2001-07-04	T－内提克斯公司	用于说话者认证的模型自适应系统和方法
US6278972B1 (en) *	1999-01-04	2001-08-21	Qualcomm Incorporated	System and method for segmentation and recognition of speech signals
US6266643B1 (en) *	1999-03-03	2001-07-24	Kenneth Canfield	Speeding up audio without changing pitch by comparing dominant frequencies
US6510534B1 (en) *	2000-06-29	2003-01-21	Logicvision, Inc.	Method and apparatus for testing high performance circuits

2000
- 2000-07-13 US US09/615,572 patent/US6735563B1/en not_active Expired - Lifetime
2001
- 2001-07-11 CN CNB018127711A patent/CN1205601C/zh not_active Expired - Fee Related
- 2001-07-11 AU AU2001273410A patent/AU2001273410A1/en not_active Abandoned
- 2001-07-11 EP EP01952681A patent/EP1301919B1/en not_active Expired - Lifetime
- 2001-07-11 BR BR0112405-6A patent/BR0112405A/pt not_active IP Right Cessation
- 2001-07-11 AT AT01952681T patent/ATE345562T1/de not_active IP Right Cessation
- 2001-07-11 JP JP2002512966A patent/JP4202124B2/ja not_active Expired - Fee Related
- 2001-07-11 WO PCT/US2001/022009 patent/WO2002007145A2/en active IP Right Grant
- 2001-07-11 ES ES01952681T patent/ES2275700T3/es not_active Expired - Lifetime
- 2001-07-11 DE DE60124551T patent/DE60124551T2/de not_active Expired - Lifetime
- 2001-07-11 KR KR1020037000496A patent/KR100766761B1/ko not_active IP Right Cessation
- 2001-07-13 TW TW090117207A patent/TW514867B/zh not_active IP Right Cessation
2003
- 2003-11-26 HK HK03108617A patent/HK1056427A1/xx not_active IP Right Cessation

Also Published As

Publication number	Publication date
JP4202124B2 (ja)	2008-12-24
TW514867B (en)	2002-12-21
EP1301919B1 (en)	2006-11-15
WO2002007145A3 (en)	2002-05-23
DE60124551D1 (de)	2006-12-28
CN1205601C (zh)	2005-06-08
ATE345562T1 (de)	2006-12-15
ES2275700T3 (es)	2007-06-16
CN1441947A (zh)	2003-09-10
BR0112405A (pt)	2003-12-30
US6735563B1 (en)	2004-05-11
WO2002007145A2 (en)	2002-01-24
DE60124551T2 (de)	2007-09-06
JP2004504641A (ja)	2004-02-12
KR20030014332A (ko)	2003-02-15
EP1301919A2 (en)	2003-04-16
KR100766761B1 (ko)	2007-10-17
AU2001273410A1 (en)	2002-01-30

Publication	Publication Date	Title
HK1056427A1 (en)	2004-02-13	Method and apparatus for constructing voice templates for a speaker-independent voice recognition system.
Gupta et al.	2014	I-vector-based speaker adaptation of deep neural networks for french broadcast audio transcription
US4918732A (en)	1990-04-17	Frame comparison method for word recognition in high noise environments
EP0413361B1 (en)	1999-05-06	Speech-recognition circuitry employing nonlinear processing, speech element modelling and phoneme estimation
Li et al.	2005	Large margin HMMs for speech recognition
Sugamura et al.	1983	Isolated word recognition using phoneme-like templates
Paliwal	1990	Lexicon-building methods for an acoustic sub-word based speech recognizer
Hon et al.	1994	Towards large vocabulary Mandarin Chinese speech recognition
Yokoya et al.	1992	Recovery of superquadric primitives from a range image using simulated annealing
CN106297769A (zh)	2017-01-04	一种应用于语种识别的鉴别性特征提取方法
Euler et al.	1990	Statistical segmentation and word modeling techniques in isolated word recognition
Tian et al.	2004	Tone recognition with fractionized models and outlined features
CA1301338C (en)	1992-05-19	Frame comparison method for word recognition in high noise environments
Makino et al.	2014	Utilizing state-level distance vector representation for improved spoken term detection by text and spoken queries.
JPH01202798A (ja)	1989-08-15	音声認識方法
Kockmann et al.	2008	Contour modeling of prosodic and acoustic features for speaker recognition
Ma et al.	2009	Acoustic segment modeling for speaker recognition
Sakai et al.	1976	A classification method of spoken words in continuous speech for many speakers
Singh et al.	2013	Effect of MFCC based features for speech signal alignments
Aktas et al.	1986	Large-vocabulary isolated word recognition with fast coarse time alignment
Omar	2007	Regularized feature-based maximum likelihood linear regression for speech recognition.
Zhou et al.	1995	Multisegment multiple VQ codebooks-based speaker independent isolated-word recognition using unbiased mel cepstrum
Radfar et al.	2006	A joint identification-separation technique for single channel speech separation
Diwakar et al.	2017	Repetition detection in dysarthric speech
Soldi et al.	2015	Phone adaptive training for short-duration speaker verification

Legal Events

Date	Code	Title	Description
2012-02-24	PC	Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee)	Effective date: 20110711