ES2143079T3 - Reconocimiento de voz. - Google Patents
Reconocimiento de voz.Info
- Publication number
- ES2143079T3 ES2143079T3 ES95935526T ES95935526T ES2143079T3 ES 2143079 T3 ES2143079 T3 ES 2143079T3 ES 95935526 T ES95935526 T ES 95935526T ES 95935526 T ES95935526 T ES 95935526T ES 2143079 T3 ES2143079 T3 ES 2143079T3
- Authority
- ES
- Spain
- Prior art keywords
- user
- transcripts
- speech recognition
- generated
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
- Selective Calling Equipment (AREA)
- Document Processing Apparatus (AREA)
- Navigation (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Abstract
UN DISPOSITIVO DE RECONOCIMIENTO DEL HABLA EN EL QUE SE GENERA UN VOCABULARIO DE RECONOCIMIENTO A PARTIR DEL HABLA PROPIA DE UN USUARIO POR MEDIO DE LA REALIZACION DE TRANSCRIPCIONES FONETICAS DE LAS ARTICULACIONES DEL USUARIO Y LA UTILIZACION DE ESTAS TRANSCRIPCIONES CON OBJETO DE LLEVAR A CABO UN RECONOCIMIENTO FUTURO. LAS TRANSCRIPCIONES FONETICAS SE GENERAN UTILIZANDO UNA RED LIGERAMENTE RESTRINGIDA, PREFERIBLEMENTE UNA RED RESTRINGIDA SOLO POR EL RUIDO. LAS TRANSCRIPCIONES RESULTANTES PRODUCEN, DE ESTA FORMA, UN PARECIDO CERCANO AL HABLA INTRODUCIDA POR USUARIO PERO REQUIERE UNOS REQUISITOS DE ALMACENAJE SIGNIFICATIVAMENTE REDUCIDOS SI SE COMPARA CON LAS REPRESENTACIONES DE LAS PALABRAS DE UN HABLANTE CONOCIDO.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP94308023 | 1994-11-01 |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2143079T3 true ES2143079T3 (es) | 2000-05-01 |
Family
ID=8217896
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ES95935526T Expired - Lifetime ES2143079T3 (es) | 1994-11-01 | 1995-11-01 | Reconocimiento de voz. |
Country Status (17)
Country | Link |
---|---|
US (1) | US6389395B1 (es) |
EP (1) | EP0789901B1 (es) |
JP (1) | JPH10507536A (es) |
KR (1) | KR100383353B1 (es) |
CN (1) | CN1121680C (es) |
AU (1) | AU707355B2 (es) |
CA (1) | CA2202656C (es) |
DE (1) | DE69514382T2 (es) |
DK (1) | DK0789901T3 (es) |
ES (1) | ES2143079T3 (es) |
FI (1) | FI971822A0 (es) |
HK (1) | HK1002787A1 (es) |
MX (1) | MX9703138A (es) |
NO (1) | NO309750B1 (es) |
NZ (1) | NZ294659A (es) |
PT (1) | PT789901E (es) |
WO (1) | WO1996013827A1 (es) |
Families Citing this family (45)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000022609A1 (en) * | 1998-10-13 | 2000-04-20 | Telefonaktiebolaget Lm Ericsson (Publ) | Speech recognition and control system and telephone |
JP2000187435A (ja) * | 1998-12-24 | 2000-07-04 | Sony Corp | 情報処理装置、携帯機器、電子ペット装置、情報処理手順を記録した記録媒体及び情報処理方法 |
CN1343337B (zh) | 1999-03-05 | 2013-03-20 | 佳能株式会社 | 用于产生包括音素数据和解码的字的注释数据的方法和设备 |
US7310600B1 (en) | 1999-10-28 | 2007-12-18 | Canon Kabushiki Kaisha | Language recognition using a similarity measure |
US6882970B1 (en) | 1999-10-28 | 2005-04-19 | Canon Kabushiki Kaisha | Language recognition using sequence frequency |
US7212968B1 (en) | 1999-10-28 | 2007-05-01 | Canon Kabushiki Kaisha | Pattern matching method and apparatus |
GB0011798D0 (en) * | 2000-05-16 | 2000-07-05 | Canon Kk | Database annotation and retrieval |
GB0015233D0 (en) | 2000-06-21 | 2000-08-16 | Canon Kk | Indexing method and apparatus |
GB0023930D0 (en) | 2000-09-29 | 2000-11-15 | Canon Kk | Database annotation and retrieval |
GB0027178D0 (en) * | 2000-11-07 | 2000-12-27 | Canon Kk | Speech processing system |
GB0028277D0 (en) | 2000-11-20 | 2001-01-03 | Canon Kk | Speech processing system |
US20030009331A1 (en) * | 2001-07-05 | 2003-01-09 | Johan Schalkwyk | Grammars for speech recognition |
US6990445B2 (en) * | 2001-12-17 | 2006-01-24 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
US20030115169A1 (en) * | 2001-12-17 | 2003-06-19 | Hongzhuan Ye | System and method for management of transcribed documents |
US7181398B2 (en) * | 2002-03-27 | 2007-02-20 | Hewlett-Packard Development Company, L.P. | Vocabulary independent speech recognition system and method using subword units |
US20030200094A1 (en) * | 2002-04-23 | 2003-10-23 | Gupta Narendra K. | System and method of using existing knowledge to rapidly train automatic speech recognizers |
US7206738B2 (en) * | 2002-08-14 | 2007-04-17 | International Business Machines Corporation | Hybrid baseform generation |
DE10244169A1 (de) * | 2002-09-23 | 2004-04-01 | Infineon Technologies Ag | Spracherkennungseinrichtung, Steuereinrichtung und Verfahren zum rechnergestützten Ergänzen eines elektronischen Wörterbuches für eine Spracherkennungseinrichtung |
WO2004036939A1 (fr) * | 2002-10-18 | 2004-04-29 | Institute Of Acoustics Chinese Academy Of Sciences | Appareil de communication mobile numerique portable, procede de commande vocale et systeme |
US7149688B2 (en) * | 2002-11-04 | 2006-12-12 | Speechworks International, Inc. | Multi-lingual speech recognition with cross-language context modeling |
JP4072718B2 (ja) * | 2002-11-21 | 2008-04-09 | ソニー株式会社 | 音声処理装置および方法、記録媒体並びにプログラム |
US7302389B2 (en) * | 2003-05-14 | 2007-11-27 | Lucent Technologies Inc. | Automatic assessment of phonological processes |
US20040230431A1 (en) * | 2003-05-14 | 2004-11-18 | Gupta Sunil K. | Automatic assessment of phonological processes for speech therapy and language instruction |
US7373294B2 (en) * | 2003-05-15 | 2008-05-13 | Lucent Technologies Inc. | Intonation transformation for speech therapy and the like |
US20040243412A1 (en) * | 2003-05-29 | 2004-12-02 | Gupta Sunil K. | Adaptation of speech models in speech recognition |
JP4943335B2 (ja) * | 2004-09-23 | 2012-05-30 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | 話者に依存しない堅牢な音声認識システム |
US20090291419A1 (en) * | 2005-08-01 | 2009-11-26 | Kazuaki Uekawa | System of sound representaion and pronunciation techniques for english and other european languages |
US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
US7774202B2 (en) * | 2006-06-12 | 2010-08-10 | Lockheed Martin Corporation | Speech activated control system and related methods |
US8386248B2 (en) * | 2006-09-22 | 2013-02-26 | Nuance Communications, Inc. | Tuning reusable software components in a speech application |
US7881932B2 (en) * | 2006-10-02 | 2011-02-01 | Nuance Communications, Inc. | VoiceXML language extension for natively supporting voice enrolled grammars |
EP2308042B1 (en) * | 2008-06-27 | 2011-11-02 | Koninklijke Philips Electronics N.V. | Method and device for generating vocabulary entries from acoustic data |
US20110184736A1 (en) * | 2010-01-26 | 2011-07-28 | Benjamin Slotznick | Automated method of recognizing inputted information items and selecting information items |
US20110224982A1 (en) * | 2010-03-12 | 2011-09-15 | c/o Microsoft Corporation | Automatic speech recognition based upon information retrieval methods |
US20120116764A1 (en) * | 2010-11-09 | 2012-05-10 | Tze Fen Li | Speech recognition method on sentences in all languages |
GB2513821A (en) * | 2011-06-28 | 2014-11-12 | Andrew Levine | Speech-to-text conversion |
US8781825B2 (en) * | 2011-08-24 | 2014-07-15 | Sensory, Incorporated | Reducing false positives in speech recognition systems |
US9135912B1 (en) * | 2012-08-15 | 2015-09-15 | Google Inc. | Updating phonetic dictionaries |
TWI536366B (zh) | 2014-03-18 | 2016-06-01 | 財團法人工業技術研究院 | 新增口說語彙的語音辨識系統與方法及電腦可讀取媒體 |
US9607618B2 (en) * | 2014-12-16 | 2017-03-28 | Nice-Systems Ltd | Out of vocabulary pattern learning |
US10719115B2 (en) * | 2014-12-30 | 2020-07-21 | Avago Technologies International Sales Pte. Limited | Isolated word training and detection using generated phoneme concatenation models of audio inputs |
KR102509821B1 (ko) * | 2017-09-18 | 2023-03-14 | 삼성전자주식회사 | Oos 문장을 생성하는 방법 및 이를 수행하는 장치 |
WO2020014890A1 (zh) * | 2018-07-18 | 2020-01-23 | 深圳魔耳智能声学科技有限公司 | 基于口音的语音识别处理方法、电子设备和存储介质 |
CN109074808B (zh) * | 2018-07-18 | 2023-05-09 | 深圳魔耳智能声学科技有限公司 | 语音控制方法、中控设备和存储介质 |
CN112951270B (zh) * | 2019-11-26 | 2024-04-19 | 新东方教育科技集团有限公司 | 语音流利度检测的方法、装置和电子设备 |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4489434A (en) | 1981-10-05 | 1984-12-18 | Exxon Corporation | Speech recognition method and apparatus |
US5129000A (en) * | 1986-04-05 | 1992-07-07 | Sharp Kabushiki Kaisha | Voice recognition method by analyzing syllables |
US4903305A (en) * | 1986-05-12 | 1990-02-20 | Dragon Systems, Inc. | Method for representing word models for use in speech recognition |
US4866778A (en) * | 1986-08-11 | 1989-09-12 | Dragon Systems, Inc. | Interactive speech recognition apparatus |
US4837831A (en) * | 1986-10-15 | 1989-06-06 | Dragon Systems, Inc. | Method for creating and using multiple-word sound models in speech recognition |
US5129001A (en) * | 1990-04-25 | 1992-07-07 | International Business Machines Corporation | Method and apparatus for modeling words with multi-arc markov models |
US5181237A (en) | 1990-10-12 | 1993-01-19 | At&T Bell Laboratories | Automation of telephone operator assistance calls |
US5465318A (en) * | 1991-03-28 | 1995-11-07 | Kurzweil Applied Intelligence, Inc. | Method for generating a speech recognition model for a non-vocabulary utterance |
DE4111781A1 (de) * | 1991-04-11 | 1992-10-22 | Ibm | Computersystem zur spracherkennung |
US5502790A (en) * | 1991-12-24 | 1996-03-26 | Oki Electric Industry Co., Ltd. | Speech recognition method and system using triphones, diphones, and phonemes |
CA2088080C (en) * | 1992-04-02 | 1997-10-07 | Enrico Luigi Bocchieri | Automatic speech recognizer |
US5297183A (en) * | 1992-04-13 | 1994-03-22 | Vcs Industries, Inc. | Speech recognition system for electronic switches in a cellular telephone or personal communication network |
EP0590173A1 (de) * | 1992-09-28 | 1994-04-06 | International Business Machines Corporation | Computersystem zur Spracherkennung |
AU5803394A (en) * | 1992-12-17 | 1994-07-04 | Bell Atlantic Network Services, Inc. | Mechanized directory assistance |
US5390279A (en) * | 1992-12-31 | 1995-02-14 | Apple Computer, Inc. | Partitioning speech rules by context for speech recognition |
US5384892A (en) * | 1992-12-31 | 1995-01-24 | Apple Computer, Inc. | Dynamic language model for speech recognition |
US5488652A (en) * | 1994-04-14 | 1996-01-30 | Northern Telecom Limited | Method and apparatus for training speech recognition algorithms for directory assistance applications |
US5710864A (en) * | 1994-12-29 | 1998-01-20 | Lucent Technologies Inc. | Systems, methods and articles of manufacture for improving recognition confidence in hypothesized keywords |
US5717826A (en) * | 1995-08-11 | 1998-02-10 | Lucent Technologies Inc. | Utterance verification using word based minimum verification error training for recognizing a keyboard string |
-
1995
- 1995-11-01 DE DE69514382T patent/DE69514382T2/de not_active Expired - Lifetime
- 1995-11-01 JP JP8513513A patent/JPH10507536A/ja active Pending
- 1995-11-01 CN CN95195955A patent/CN1121680C/zh not_active Expired - Lifetime
- 1995-11-01 KR KR1019970702853A patent/KR100383353B1/ko not_active Expired - Lifetime
- 1995-11-01 DK DK95935526T patent/DK0789901T3/da active
- 1995-11-01 WO PCT/GB1995/002563 patent/WO1996013827A1/en active IP Right Grant
- 1995-11-01 AU AU37516/95A patent/AU707355B2/en not_active Expired
- 1995-11-01 US US08/817,072 patent/US6389395B1/en not_active Expired - Lifetime
- 1995-11-01 ES ES95935526T patent/ES2143079T3/es not_active Expired - Lifetime
- 1995-11-01 NZ NZ294659A patent/NZ294659A/xx not_active IP Right Cessation
- 1995-11-01 EP EP95935526A patent/EP0789901B1/en not_active Expired - Lifetime
- 1995-11-01 CA CA002202656A patent/CA2202656C/en not_active Expired - Lifetime
- 1995-11-01 PT PT95935526T patent/PT789901E/pt unknown
- 1995-11-01 MX MX9703138A patent/MX9703138A/es unknown
-
1997
- 1997-04-29 FI FI971822A patent/FI971822A0/fi unknown
- 1997-04-30 NO NO972026A patent/NO309750B1/no not_active IP Right Cessation
-
1998
- 1998-02-20 HK HK98101344A patent/HK1002787A1/xx not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
KR970707529A (ko) | 1997-12-01 |
US6389395B1 (en) | 2002-05-14 |
HK1002787A1 (en) | 1998-09-18 |
CN1121680C (zh) | 2003-09-17 |
JPH10507536A (ja) | 1998-07-21 |
NO309750B1 (no) | 2001-03-19 |
FI971822L (fi) | 1997-04-29 |
CA2202656A1 (en) | 1996-05-09 |
DE69514382D1 (de) | 2000-02-10 |
EP0789901A1 (en) | 1997-08-20 |
PT789901E (pt) | 2000-04-28 |
NO972026L (no) | 1997-04-30 |
NZ294659A (en) | 1999-01-28 |
AU3751695A (en) | 1996-05-23 |
DK0789901T3 (da) | 2000-06-19 |
CN1162365A (zh) | 1997-10-15 |
AU707355B2 (en) | 1999-07-08 |
EP0789901B1 (en) | 2000-01-05 |
KR100383353B1 (ko) | 2003-10-17 |
NO972026D0 (no) | 1997-04-30 |
WO1996013827A1 (en) | 1996-05-09 |
CA2202656C (en) | 2002-01-01 |
DE69514382T2 (de) | 2001-08-23 |
MX9703138A (es) | 1997-06-28 |
FI971822A0 (fi) | 1997-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2143079T3 (es) | Reconocimiento de voz. | |
BR0113725A (pt) | Combinação de dtw e hmm nos modos de reconhecimento de fala dependente e independente do falante | |
WO2002097590A3 (en) | Language independent and voice operated information management system | |
DE60000138D1 (de) | Erzeugung von mehreren Aussprachen eines Eigennames für die Spracherkennung | |
ATE372573T1 (de) | Spracherkennungsystem mittels impliziter sprecheradaption | |
CA2275774A1 (en) | Selection of superwords based on criteria relevant to both speech recognition and understanding | |
EP1291848A3 (en) | Multilingual pronunciations for speech recognition | |
EP1901282A3 (en) | Speech communications system for a vehicle and method of operating a speech communications system for a vehicle | |
ATE421136T1 (de) | Audiovisuelle sprachaktivitätsdetektion für ein spracherkennungssystem | |
DE60207784D1 (de) | Sprecheranpassung für die Spracherkennung | |
CA2419112A1 (en) | Voice activated language translation | |
IL146985A0 (en) | Automatic dynamic speech recognition vocabulary based on external sources of information | |
EP1207518A3 (en) | Speech recognition with dynamic programming | |
ATE316678T1 (de) | Spracherkennung | |
FI20010792A0 (fi) | Käyttäjäriippumattoman puheentunnistuksen järjestäminen | |
DE69413912D1 (de) | Sprachumsetzungsverfahren | |
DE60008893D1 (de) | Sprachgesteuertes tragbares Endgerät | |
EP1189204A3 (en) | HMM-based noisy speech recognition | |
HK1043233A1 (en) | Method and apparatus for testing user interface integrity of speech-enabled devices. | |
DE60109650D1 (de) | Taktiles kommunikationssystem | |
DE60315544D1 (de) | Telekommunikationsendgerät zur Veränderung eines übertragenen Sprachsignals bei einer bestehenden Fernsprechverbindung | |
WO2002054715A3 (en) | Programming of a ringing tone in a telephone apparatus | |
JP3068370B2 (ja) | 携帯用音声認識出力補助装置 | |
WO2000026901A3 (en) | Performing spoken recorded actions | |
Elenius | Techniques and devices for automatic speech recognition: Acoustic front-end processing and selected linguistic aspects. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG2A | Definitive protection |
Ref document number: 789901 Country of ref document: ES |