ATE237176T1 - Context dependent phoneme networks for encoding speech information

Context dependent phoneme networks for encoding speech information

Info

Publication number: ATE237176T1
Authority: AT; Austria
Prior art keywords: context; speech information; dependent phoneme; encoding speech; phoneme networks
Prior art date: 1997-12-01

Application number

AT98958652T

Inventor

Sreeram Balakrishnan

Stephen Austin

Original Assignee

Motorola Inc

Priority date

1997-12-01

Filing date

1998-11-19

Publication date

2003-04-15

1998-11-19 Application filed by Motorola Inc

2003-04-15 Application granted

2003-04-15 Publication of ATE237176T1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Computational Linguistics (AREA)
Multimedia (AREA)
Theoretical Computer Science (AREA)
Human Computer Interaction (AREA)
Health & Medical Sciences (AREA)
Acoustics & Sound (AREA)
Artificial Intelligence (AREA)
Audiology, Speech & Language Pathology (AREA)
Databases & Information Systems (AREA)
General Engineering & Computer Science (AREA)
General Physics & Mathematics (AREA)
Data Mining & Analysis (AREA)
Machine Translation (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Telephone Function (AREA)

AT98958652T 1997-12-01 1998-11-19 Context dependent phoneme networks for encoding speech information ATE237176T1 (de)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
US08/980,954 US6182038B1 (en)	1997-12-01	1997-12-01	Context dependent phoneme networks for encoding speech information
PCT/US1998/024727 WO1999028899A1 (en)	1997-12-01	1998-11-19	Context dependent phoneme networks for encoding speech information

Publications (1)

Publication Number	Publication Date
ATE237176T1 (de)	2003-04-15

Family

ID=25527992

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
AT98958652T ATE237176T1 (de)	1997-12-01	1998-11-19	Context dependent phoneme networks for encoding speech information

Country Status (9)

Country	Link
US (1)	US6182038B1 (de)
EP (1)	EP0954856B1 (de)
AT (1)	ATE237176T1 (de)
AU (1)	AU1465099A (de)
DE (1)	DE69813180T2 (de)
FR (1)	FR2773413B1 (de)
GB (1)	GB2331826B (de)
TW (1)	TW462037B (de)
WO (1)	WO1999028899A1 (de)

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
CA2321299A1 (en) *	1998-03-09	1999-09-16	Lernout & Hauspie Speech Products N.V.	Apparatus and method for simultaneous multimode dictation
US6408272B1 (en) *	1999-04-12	2002-06-18	General Magic, Inc.	Distributed voice user interface
US20050261907A1 (en)	1999-04-12	2005-11-24	Ben Franklin Patent Holding Llc	Voice integration platform
US6484136B1 (en) *	1999-10-21	2002-11-19	International Business Machines Corporation	Language model adaptation via network of similar users
US6442519B1 (en) *	1999-11-10	2002-08-27	International Business Machines Corp.	Speaker model adaptation via network of similar users
US9076448B2 (en)	1999-11-12	2015-07-07	Nuance Communications, Inc.	Distributed real time speech recognition system
US6633846B1 (en)	1999-11-12	2003-10-14	Phoenix Solutions, Inc.	Distributed realtime speech recognition system
US7050977B1 (en) *	1999-11-12	2006-05-23	Phoenix Solutions, Inc.	Speech-enabled server for internet website and method
US7725307B2 (en)	1999-11-12	2010-05-25	Phoenix Solutions, Inc.	Query engine for processing voice based queries including semantic decoding
US6615172B1 (en)	1999-11-12	2003-09-02	Phoenix Solutions, Inc.	Intelligent query engine for processing voice based queries
US7392185B2 (en)	1999-11-12	2008-06-24	Phoenix Solutions, Inc.	Speech based learning/training system using semantic decoding
US6687689B1 (en)	2000-06-16	2004-02-03	Nusuara Technologies Sdn. Bhd.	System and methods for document retrieval using natural language-based queries
US7451085B2 (en) *	2000-10-13	2008-11-11	At&T Intellectual Property Ii, L.P.	System and method for providing a compensated speech recognition model for speech recognition
US20020087313A1 (en) *	2000-12-29	2002-07-04	Lee Victor Wai Leung	Computer-implemented intelligent speech model partitioning method and system
US7609829B2 (en) *	2001-07-03	2009-10-27	Apptera, Inc.	Multi-platform capable inference engine and universal grammar language adapter for intelligent voice application execution
US20030007609A1 (en) *	2001-07-03	2003-01-09	Yuen Michael S.	Method and apparatus for development, deployment, and maintenance of a voice software application for distribution to one or more consumers
US7013275B2 (en) *	2001-12-28	2006-03-14	Sri International	Method and apparatus for providing a dynamic speech-driven control and remote service access system
US7016849B2 (en) *	2002-03-25	2006-03-21	Sri International	Method and apparatus for providing speech-driven routing between spoken language applications
US7697673B2 (en) *	2003-11-17	2010-04-13	Apptera Inc.	System for advertisement selection, placement and delivery within a multiple-tenant voice interaction service system
US20050163136A1 (en) *	2003-11-17	2005-07-28	Leo Chiu	Multi-tenant self-service VXML portal
US7949533B2 (en) *	2005-02-04	2011-05-24	Vocollect, Inc.	Methods and systems for assessing and improving the performance of a speech recognition system
US8200495B2 (en)	2005-02-04	2012-06-12	Vocollect, Inc.	Methods and systems for considering information about an expected response when performing speech recognition
US7827032B2 (en)	2005-02-04	2010-11-02	Vocollect, Inc.	Methods and systems for adapting a model for a speech recognition system
US7895039B2 (en) *	2005-02-04	2011-02-22	Vocollect, Inc.	Methods and systems for optimizing model adaptation for a speech recognition system
US7865362B2 (en) *	2005-02-04	2011-01-04	Vocollect, Inc.	Method and system for considering information about an expected response when performing speech recognition
KR100901640B1 (ko) *	2006-05-10	2009-06-09	KT Corporation	Method for selecting training data based on non-uniform samples in speech feature vector quantization for speech recognition
US11416214B2 (en)	2009-12-23	2022-08-16	Google Llc	Multi-modal input on an electronic device
EP4318463A3 (de)	2009-12-23	2024-02-28	Google LLC	Multi-modal input on an electronic device
US8352245B1 (en)	2010-12-30	2013-01-08	Google Inc.	Adjusting language models
US8296142B2 (en)	2011-01-21	2012-10-23	Google Inc.	Speech recognition using dock context
US8914290B2 (en)	2011-05-20	2014-12-16	Vocollect, Inc.	Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
EP2721606A4 (de) *	2011-06-19	2015-04-01	Mmodal Ip Llc	Document extension in dictation-based document generation workflow
JP6131249B2 (ja)	2011-06-19	2017-05-17	Mmodal Ip Llc	Speech recognition using context-aware recognition models
HK1158011A2 (en) *	2012-02-03	2012-06-22	Gilkron Ltd	An online procurement system for the provision of intellectually oriented services
CA2881564A1 (en)	2012-08-13	2014-02-20	Mmodal Ip Llc	Maintaining a discrete data representation that corresponds to information contained in free-form text
US9978395B2 (en)	2013-03-15	2018-05-22	Vocollect, Inc.	Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US9842592B2 (en)	2014-02-12	2017-12-12	Google Inc.	Language models using non-linguistic context
US9412365B2 (en)	2014-03-24	2016-08-09	Google Inc.	Enhanced maximum entropy models
JP6658515B2 (ja) *	2014-05-15	2020-03-04	NEC Corporation	Search device, method, and program
KR102281178B1 (ko) *	2014-07-09	2021-07-23	Samsung Electronics Co., Ltd.	Method and apparatus for multi-level speech recognition
US9721564B2 (en)	2014-07-31	2017-08-01	Rovi Guides, Inc.	Systems and methods for performing ASR in the presence of heterographs
US9830321B2 (en)	2014-09-30	2017-11-28	Rovi Guides, Inc.	Systems and methods for searching for a media asset
US10428914B2 (en) *	2014-11-26	2019-10-01	GM Global Technology Operations LLC	Continuously variable transmission
US10134394B2 (en)	2015-03-20	2018-11-20	Google Llc	Speech recognition using log-linear model
US9978367B2 (en)	2016-03-16	2018-05-22	Google Llc	Determining dialog states for language models
US10714121B2 (en)	2016-07-27	2020-07-14	Vocollect, Inc.	Distinguishing user speech from background speech in speech-dense environments
US10832664B2 (en)	2016-08-19	2020-11-10	Google Llc	Automated speech recognition using language models that selectively use domain-specific model components
US10311860B2 (en)	2017-02-14	2019-06-04	Google Llc	Language model biasing system
CN111312253A (zh) *	2018-12-11	2020-06-19	Qingdao Haier Washing Machine Co., Ltd.	Voice control method, cloud server and terminal device

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
GB224023A (en)	1923-08-23	1924-11-06	William Forber	Improvements in surface finishing tools for white metal or the like
GB8908205D0 (en)	1989-04-12	1989-05-24	Smiths Industries Plc	Speech recognition apparatus and methods
GB2240203A (en) *	1990-01-18	1991-07-24	Apple Computer	Automated speech recognition system
US5497319A (en)	1990-12-31	1996-03-05	Trans-Link International Corp.	Machine translation and telecommunications system
DE4131387A1 (de) *	1991-09-20	1993-03-25	Siemens Ag	Method for recognizing patterns in time-variant measurement signals
US5502790A (en) *	1991-12-24	1996-03-26	Oki Electric Industry Co., Ltd.	Speech recognition method and system using triphones, diphones, and phonemes
US5293584A (en) *	1992-05-21	1994-03-08	International Business Machines Corporation	Speech recognition system for natural language translation
JP2524472B2 (ja) *	1992-09-21	1996-08-14	International Business Machines Corporation	Method for training a speech recognition system that uses telephone lines
US5515475A (en) *	1993-06-24	1996-05-07	Northern Telecom Limited	Speech recognition method using a two-pass search
US5615296A (en) *	1993-11-12	1997-03-25	International Business Machines Corporation	Continuous speech recognition and voice response system and method to enable conversational dialogues with microprocessors
US5621859A (en) *	1994-01-19	1997-04-15	Bbn Corporation	Single tree method for grammar directed, very large vocabulary speech recognizer
US5745649A (en) *	1994-07-07	1998-04-28	Nynex Science & Technology Corporation	Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories
US5715367A (en) *	1995-01-23	1998-02-03	Dragon Systems, Inc.	Apparatuses and methods for developing and using models for speech recognition
US5651096A (en)	1995-03-14	1997-07-22	Apple Computer, Inc.	Merging of language models from two or more application programs for a speech recognition system
US5754671A (en)	1995-04-12	1998-05-19	Lockheed Martin Corporation	Method for improving cursive address recognition in mail pieces using adaptive data base management
NZ331430A (en) *	1996-05-03	2000-07-28	British Telecomm	Automatic speech recognition
US5867817A (en)	1996-08-19	1999-02-02	Virtual Vision, Inc.	Speech recognition manager
US5915001A (en) *	1996-11-14	1999-06-22	Vois Corporation	System and method for providing and using universally accessible voice and speech data files
US5960399A (en) *	1996-12-24	1999-09-28	Gte Internetworking Incorporated	Client/server speech processor/recognizer

1997
- 1997-12-01 US US08/980,954, patent US6182038B1 (en): not active, Expired - Lifetime
1998
- 1998-11-19 EP EP98958652A, patent EP0954856B1 (de): not active, Expired - Lifetime
- 1998-11-19 AT AT98958652T, patent ATE237176T1 (de): not active, IP Right Cessation
- 1998-11-19 WO PCT/US1998/024727, patent WO1999028899A1 (en): active, IP Right Grant
- 1998-11-19 DE DE69813180T, patent DE69813180T2 (de): not active, Expired - Lifetime
- 1998-11-19 AU AU14650/99A, patent AU1465099A (en): not active, Abandoned
- 1998-12-01 FR FR9815131A, patent FR2773413B1 (fr): not active, Expired - Fee Related
- 1998-12-01 TW TW087119918A, patent TW462037B (zh): not active, IP Right Cessation
- 1998-12-01 GB GB9826231A, patent GB2331826B (en): not active, Expired - Fee Related

Also Published As

Publication number	Publication date
US6182038B1 (en)	2001-01-30
GB9826231D0 (en)	1999-01-20
DE69813180D1 (de)	2003-05-15
TW462037B (en)	2001-11-01
FR2773413B1 (fr)	2000-05-19
WO1999028899A1 (en)	1999-06-10
FR2773413A1 (fr)	1999-07-09
GB2331826B (en)	2001-12-19
DE69813180T2 (de)	2003-10-23
EP0954856B1 (de)	2003-04-09
AU1465099A (en)	1999-06-16
GB2331826A (en)	1999-06-02
EP0954856A1 (de)	1999-11-10

Similar Documents

Publication	Publication Date	Title
ATE237176T1 (de)	2003-04-15	Context dependent phoneme networks for encoding speech information
DE59803850D1 (de)	2002-05-23	Method and system for providing and transmitting individualized traffic information
MXPA98008052A (es)	2004-10-14	Method and apparatus for generating semantically consistent information inputs in a dialog manager
AU1191899A (en)	1999-05-10	System and method for representing complex information auditorially
DE69607142D1 (de)	2000-04-20	Use of multipoint connection services to establish call-tapping points in a switched network
ATE205344T1 (de)	2001-09-15	Method and system for communicating control information from a control generator to one or more computer installations
ATE223383T1 (de)	2002-09-15	Process for the preparation of 3-haloalkyl-1H-pyrazoles
WO1998044643A3 (en)	1999-01-21	Audio interface for document based information resource navigation and method therefor
SE9301596D0 (sv)	1993-05-10	Device for increasing speech comprehension when translating speech from a first language to a second language
ATE262702T1 (de)	2004-04-15	System and method for establishing a real-time agent pool between computer systems
AU1067900A (en)	2000-06-13	Network and language models for use in a speech recognition system
DE60118712D1 (de)	2006-05-24	Method and system for providing a customized media list
SG107089A1 (en)	2004-11-29	Music system, tone generator and musical tone-synthesizing method
SE0004838D0 (sv)	2000-12-22	Method and communication apparatus in a communication system
DK0852867T3 (da)	2001-10-01	Method and system for rapidly generating and transmitting a character sequence using voice frequencies
DE69622985D1 (de)	2002-09-19	Very low bit rate voice messaging system using asymmetric voice compression
SE9600959D0 (sv)	1996-03-13	Method and device for speech-to-speech translation
WO1998025260A3 (en)	1998-08-06	Speech synthesis using dual neural networks
JPS5720061A (en)	1982-02-02	Automatic guiding method by telephone
Chambers et al.	1995	Comparison of computer codes for propagation of sonic booms through the atmosphere
Ghitza	1995	Session lpSC
DE69329375D1 (de)	2000-10-12	Method for realizing pitch contours for voice messages, and speech synthesis method and device for applying it
TW365728B (en)	1999-08-01	Method and system for enabling access to a multimedia document
KR20040052110A (ko)	2004-06-19	Method for implementing chorus and a cappella using TTS
JPS6485467A (en)	1989-03-30	Videotex information terminal set incorporated with voice synthesizer

Legal Events

Date	Code	Title	Description
2003-10-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties