SG11201912053XA - Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface - Google Patents

Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface

Info

Publication number: SG11201912053XA
Authority: SG; Singapore
Prior art keywords: speech recognition; received via; automatically determining; automated assistant; spoken utterance
Prior art date: 2018-04-16

Application number

SG11201912053XA

Other languages

English (en)

Inventor

Pu-Sen Chao

Diego Melendo Casado

Ignacio Lopez Moreno

Original Assignee

Google Llc

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2018-04-16

Filing date

2018-04-16

Publication date

2020-01-30

2018-04-16 Application filed by Google Llc filed Critical Google Llc

2020-01-30 Publication of SG11201912053XA publication Critical patent/SG11201912053XA/en

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Multimedia (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Computational Linguistics (AREA)
Acoustics & Sound (AREA)
Artificial Intelligence (AREA)
Theoretical Computer Science (AREA)
Probability & Statistics with Applications (AREA)
Computer Vision & Pattern Recognition (AREA)
General Health & Medical Sciences (AREA)
General Engineering & Computer Science (AREA)
General Physics & Mathematics (AREA)
Machine Translation (AREA)
User Interface Of Digital Computer (AREA)

SG11201912053XA 2018-04-16 2018-04-16 Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface SG11201912053XA (en)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
PCT/US2018/027812 WO2019203795A1 (en)	2018-04-16	2018-04-16	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface

Publications (1)

Publication Number	Publication Date
SG11201912053XA true SG11201912053XA (en)	2020-01-30

Family

ID=62111243

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
SG11201912053XA SG11201912053XA (en)	2018-04-16	2018-04-16	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface

Country Status (5)

Country	Link
US (5)	US10896672B2 (de)
EP (3)	EP3723082B1 (de)
CN (2)	CN111052229B (de)
SG (1)	SG11201912053XA (de)
WO (1)	WO2019203795A1 (de)

Families Citing this family (72)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US9318108B2 (en)	2010-01-18	2016-04-19	Apple Inc.	Intelligent automated assistant
US8977255B2 (en)	2007-04-03	2015-03-10	Apple Inc.	Method and system for operating a multi-function portable electronic device using voice-activation
US8676904B2 (en)	2008-10-02	2014-03-18	Apple Inc.	Electronic devices with voice command and contextual data processing capabilities
JP2016508007A (ja)	2013-02-07	2016-03-10	アップルインコーポレイテッド	デジタルアシスタントのためのボイストリガ
US9715875B2 (en)	2014-05-30	2017-07-25	Apple Inc.	Reducing the need for manual start/end-pointing and trigger phrases
US10170123B2 (en)	2014-05-30	2019-01-01	Apple Inc.	Intelligent assistant for home automation
US9338493B2 (en)	2014-06-30	2016-05-10	Apple Inc.	Intelligent automated assistant for TV user interactions
US9886953B2 (en)	2015-03-08	2018-02-06	Apple Inc.	Virtual assistant activation
US10460227B2 (en)	2015-05-15	2019-10-29	Apple Inc.	Virtual assistant in a communication session
US10747498B2 (en)	2015-09-08	2020-08-18	Apple Inc.	Zero latency digital assistant
US10671428B2 (en)	2015-09-08	2020-06-02	Apple Inc.	Distributed personal assistant
US11587559B2 (en)	2015-09-30	2023-02-21	Apple Inc.	Intelligent device identification
US10691473B2 (en)	2015-11-06	2020-06-23	Apple Inc.	Intelligent automated assistant in a messaging environment
US10586535B2 (en)	2016-06-10	2020-03-10	Apple Inc.	Intelligent digital assistant in a multi-tasking environment
DK201670540A1 (en)	2016-06-11	2018-01-08	Apple Inc	Application integration with a digital assistant
US12197817B2 (en)	2016-06-11	2025-01-14	Apple Inc.	Intelligent device arbitration and control
US11204787B2 (en)	2017-01-09	2021-12-21	Apple Inc.	Application integration with a digital assistant
DK180048B1 (en)	2017-05-11	2020-02-04	Apple Inc.	MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION
DK179496B1 (en)	2017-05-12	2019-01-15	Apple Inc.	USER-SPECIFIC Acoustic Models
DK201770428A1 (en)	2017-05-12	2019-02-18	Apple Inc.	LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT
DK201770411A1 (en)	2017-05-15	2018-12-20	Apple Inc.	Multi-modal interfaces
US10303715B2 (en)	2017-05-16	2019-05-28	Apple Inc.	Intelligent automated assistant for media exploration
DK179549B1 (en)	2017-05-16	2019-02-12	Apple Inc.	FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES
GB2569335B (en) *	2017-12-13	2022-07-27	Sage Global Services Ltd	Chatbot system
US10818288B2 (en)	2018-03-26	2020-10-27	Apple Inc.	Natural assistant interaction
WO2019203795A1 (en) *	2018-04-16	2019-10-24	Google Llc	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
WO2019203794A1 (en)	2018-04-16	2019-10-24	Google Llc	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
US10928918B2 (en)	2018-05-07	2021-02-23	Apple Inc.	Raise to speak
DK201870355A1 (en)	2018-06-01	2019-12-16	Apple Inc.	VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS
DK180639B1 (en)	2018-06-01	2021-11-04	Apple Inc	DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT
WO2019235100A1 (ja) *	2018-06-08	2019-12-12	株式会社Ｎｔｔドコモ	対話装置
KR102637339B1 (ko) *	2018-08-31	2024-02-16	삼성전자주식회사	음성 인식 모델을 개인화하는 방법 및 장치
KR102225984B1 (ko) *	2018-09-03	2021-03-10	엘지전자 주식회사	음성 인식 서비스를 제공하는 서버
US11308939B1 (en) *	2018-09-25	2022-04-19	Amazon Technologies, Inc.	Wakeword detection using multi-word model
US11119725B2 (en) *	2018-09-27	2021-09-14	Abl Ip Holding Llc	Customizable embedded vocal command sets for a lighting and/or other environmental controller
US11462215B2 (en)	2018-09-28	2022-10-04	Apple Inc.	Multi-modal inputs for voice commands
US11194973B1 (en) *	2018-11-12	2021-12-07	Amazon Technologies, Inc.	Dialog response generation
US11043214B1 (en) *	2018-11-29	2021-06-22	Amazon Technologies, Inc.	Speech recognition using dialog history
US10878805B2 (en) *	2018-12-06	2020-12-29	Microsoft Technology Licensing, Llc	Expediting interaction with a digital assistant by predicting user responses
US11393478B2 (en) *	2018-12-12	2022-07-19	Sonos, Inc.	User specific context switching
US20200192984A1 (en) *	2018-12-18	2020-06-18	Attendant.Ai, Inc	System and Method for Interactive Table Top Ordering in Multiple Languages and Restaurant Management
US12014740B2 (en)	2019-01-08	2024-06-18	Fidelity Information Services, Llc	Systems and methods for contactless authentication using voice recognition
US12021864B2 (en) *	2019-01-08	2024-06-25	Fidelity Information Services, Llc.	Systems and methods for contactless authentication using voice recognition
US11348573B2 (en)	2019-03-18	2022-05-31	Apple Inc.	Multimodality in digital assistant systems
US11307752B2 (en)	2019-05-06	2022-04-19	Apple Inc.	User configurable task triggers
DK201970509A1 (en)	2019-05-06	2021-01-15	Apple Inc	Spoken notifications
US11475884B2 (en) *	2019-05-06	2022-10-18	Apple Inc.	Reducing digital assistant latency when a language is incorrectly determined
US11468890B2 (en)	2019-06-01	2022-10-11	Apple Inc.	Methods and user interfaces for voice-based control of electronic devices
EP3970139B1 (de)	2019-10-15	2025-04-09	Google LLC	Erkennung und/oder registrierung von hot commands zur auslösung einer reaktion durch einen automatischen assistenten
CN110718223B (zh) *	2019-10-28	2021-02-12	百度在线网络技术（北京）有限公司	用于语音交互控制的方法、装置、设备和介质
US20210158803A1 (en) *	2019-11-21	2021-05-27	Lenovo (Singapore) Pte. Ltd.	Determining wake word strength
CN111581362A (zh) *	2020-04-29	2020-08-25	联想(北京)有限公司	一种处理方法及装置
US12301635B2 (en)	2020-05-11	2025-05-13	Apple Inc.	Digital assistant hardware abstraction
US11061543B1 (en)	2020-05-11	2021-07-13	Apple Inc.	Providing relevant data items based on context
US20230215438A1 (en) *	2020-05-27	2023-07-06	Google Llc	Compensating for hardware disparities when determining whether to offload assistant-related processing tasks from certain client devices
US11490204B2 (en)	2020-07-20	2022-11-01	Apple Inc.	Multi-device audio adjustment coordination
US11438683B2 (en)	2020-07-21	2022-09-06	Apple Inc.	User identification using headphones
JP7584942B2 (ja) *	2020-08-07	2024-11-18	株式会社東芝	入力支援システム、入力支援方法およびプログラム
US11823684B2 (en) *	2020-11-19	2023-11-21	Google Llc	Generating and/or utilizing voice authentication biasing parameters for assistant devices
US11558546B2 (en)	2020-11-24	2023-01-17	Google Llc	Conditional camera control via automated assistant commands
US11676594B2 (en) *	2020-12-03	2023-06-13	Google Llc	Decaying automated speech recognition processing results
US20220189475A1 (en) *	2020-12-10	2022-06-16	International Business Machines Corporation	Dynamic virtual assistant speech modulation
KR20220086342A (ko) *	2020-12-16	2022-06-23	삼성전자주식회사	음성 입력의 응답 제공 방법 및 이를 지원하는 전자 장치
EP4040433B1 (de) *	2021-02-04	2025-01-01	Deutsche Telekom AG	Dynamische generierung einer kette von funktionsmodulen eines virtuellen assistenten
CN113064561A (zh) *	2021-03-26	2021-07-02	珠海奔图电子有限公司	语音打印控制方法、装置及系统
US11776542B1 (en) *	2021-03-30	2023-10-03	Amazon Technologies, Inc.	Selecting dialog acts using controlled randomness and offline optimization
US11978445B1 (en) *	2021-03-30	2024-05-07	Amazon Technologies, Inc.	Confidence scoring for selecting tones and text of voice browsing conversations
US12100385B2 (en) *	2021-04-22	2024-09-24	Microsoft Technology Licensing, Llc	Systems, methods and interfaces for multilingual processing
US11656819B2 (en) *	2021-06-18	2023-05-23	Fujifilm Business Innovation Corp.	Information processing apparatus and printing request for designating documents based on a spoken voice
US11908463B1 (en) *	2021-06-29	2024-02-20	Amazon Technologies, Inc.	Multi-session context
CN114446279A (zh) *	2022-02-18	2022-05-06	青岛海尔科技有限公司	语音识别方法、装置、存储介质及电子设备
US12254279B1 (en) *	2024-05-23	2025-03-18	Honesty Innovations Holdings, Llc	Dynamic resource allocation of large language model deployments for conversational interface

Family Cites Families (73)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5515475A (en) *	1993-06-24	1996-05-07	Northern Telecom Limited	Speech recognition method using a two-pass search
US6594629B1 (en) *	1999-08-06	2003-07-15	International Business Machines Corporation	Methods and apparatus for audio-visual speech detection and recognition
US7620547B2 (en) *	2002-07-25	2009-11-17	Sony Deutschland Gmbh	Spoken man-machine interface with speaker identification
US7752031B2 (en) *	2006-03-23	2010-07-06	International Business Machines Corporation	Cadence management of translated multi-speaker conversations using pause marker relationship models
US7756708B2 (en) *	2006-04-03	2010-07-13	Google Inc.	Automatic language model update
US20090124272A1 (en) *	2006-04-05	2009-05-14	Marc White	Filtering transcriptions of utterances
US8015014B2 (en) *	2006-06-16	2011-09-06	Storz Endoskop Produktions Gmbh	Speech recognition system with user profiles management component
US7873517B2 (en) *	2006-11-09	2011-01-18	Volkswagen Of America, Inc.	Motor vehicle with a speech interface
US7818176B2 (en)	2007-02-06	2010-10-19	Voicebox Technologies, Inc.	System and method for selecting and presenting advertisements based on natural language processing of voice-based input
US8949266B2 (en) *	2007-03-07	2015-02-03	Vlingo Corporation	Multiple web-based content category searching in mobile search application
US8909528B2 (en)	2007-05-09	2014-12-09	Nuance Communications, Inc.	Method and system for prompt construction for selection from a list of acoustically confusable items in spoken dialog systems
US8935147B2 (en)	2007-12-31	2015-01-13	Sap Se	Runtime data language selection in object instance
CN201332158Y (zh)	2008-12-29	2009-10-21	凡甲电子(苏州)有限公司	电力连接器
US8498857B2 (en) *	2009-05-19	2013-07-30	Tata Consultancy Services Limited	System and method for rapid prototyping of existing speech recognition solutions in different languages
US8468012B2 (en)	2010-05-26	2013-06-18	Google Inc.	Acoustic model adaptation using geographic information
WO2012177646A2 (en)	2011-06-19	2012-12-27	Mmodal Ip Llc	Speech recognition using context-aware recognition models
US8972263B2 (en)	2011-11-18	2015-03-03	Soundhound, Inc.	System and method for performing dual mode speech recognition
US9129591B2 (en) *	2012-03-08	2015-09-08	Google Inc.	Recognizing speech in multiple languages
US9489940B2 (en)	2012-06-11	2016-11-08	Nvoq Incorporated	Apparatus and methods to update a language model in a speech recognition system
US9606767B2 (en) *	2012-06-13	2017-03-28	Nvoq Incorporated	Apparatus and methods for managing resources for a system using voice recognition
US9043205B2 (en)	2012-06-21	2015-05-26	Google Inc.	Dynamic language model
JP6131537B2 (ja)	2012-07-04	2017-05-24	セイコーエプソン株式会社	音声認識システム、音声認識プログラム、記録媒体及び音声認識方法
US9786281B1 (en)	2012-08-02	2017-10-10	Amazon Technologies, Inc.	Household agent learning
US9035884B2 (en) *	2012-10-17	2015-05-19	Nuance Communications, Inc.	Subscription updates in multiple device language models
US9569421B2 (en)	2012-10-31	2017-02-14	Excalibur Ip, Llc	Method and system for improved language identification using language tags
US9031829B2 (en) *	2013-02-08	2015-05-12	Machine Zone, Inc.	Systems and methods for multi-user multi-lingual communications
US9223837B2 (en)	2013-03-14	2015-12-29	Toyota Motor Engineering & Manufacturing North America, Inc.	Computer-based method and system for providing active and automatic personal assistance using an automobile or a portable electronic device
AU2014227586C1 (en) *	2013-03-15	2020-01-30	Apple Inc.	User training by intelligent digital assistant
US20150006147A1 (en) *	2013-07-01	2015-01-01	Toyota Motor Engineering & Manufacturing North America, Inc.	Speech Recognition Systems Having Diverse Language Support
US9666188B2 (en)	2013-10-29	2017-05-30	Nuance Communications, Inc.	System and method of performing automatic speech recognition using local private data
US9189742B2 (en) *	2013-11-20	2015-11-17	Justin London	Adaptive virtual intelligent agent
US9953634B1 (en) *	2013-12-17	2018-04-24	Knowles Electronics, Llc	Passive training for automatic speech recognition
WO2015105994A1 (en) *	2014-01-08	2015-07-16	Callminer, Inc.	Real-time conversational analytics facility
EP3097553B1 (de) *	2014-01-23	2022-06-01	Nuance Communications, Inc.	Verfahren und vorrichtung zur verwendung von sprachfähigkeitsinformationen in einer automatischen spracherkennung
US9812130B1 (en) *	2014-03-11	2017-11-07	Nvoq Incorporated	Apparatus and methods for dynamically changing a language model based on recognized text
CN104978015B (zh) *	2014-04-14	2018-09-18	博世汽车部件(苏州)有限公司	具有语种自适用功能的导航系统及其控制方法
US10770075B2 (en)	2014-04-21	2020-09-08	Qualcomm Incorporated	Method and apparatus for activating application by speech input
US9418567B1 (en)	2014-04-23	2016-08-16	Google Inc.	Selecting questions for a challenge-response test
US20150364129A1 (en) *	2014-06-17	2015-12-17	Google Inc.	Language Identification
US10410630B2 (en)	2014-06-19	2019-09-10	Robert Bosch Gmbh	System and method for speech-enabled personalized operation of devices and services in multiple operating environments
US9620106B2 (en)	2014-07-30	2017-04-11	At&T Intellectual Property I, L.P.	System and method for personalization in speech recogniton
CN104282307A (zh)	2014-09-05	2015-01-14	中兴通讯股份有限公司	唤醒语音控制系统的方法、装置及终端
US9318107B1 (en) *	2014-10-09	2016-04-19	Google Inc.	Hotword detection on multiple devices
US20160162469A1 (en)	2014-10-23	2016-06-09	Audience, Inc.	Dynamic Local ASR Vocabulary
US20160174195A1 (en) *	2014-12-11	2016-06-16	Qualcomm Incorporated	Embms audio packets protection in dual-sim dual-standby or srlte mobile device
CN104505091B (zh) *	2014-12-26	2018-08-21	湖南华凯文化创意股份有限公司	人机语音交互方法及系统
US10114817B2 (en) *	2015-06-01	2018-10-30	Microsoft Technology Licensing, Llc	Data mining multilingual and contextual cognates from user profiles
US9875081B2 (en) *	2015-09-21	2018-01-23	Amazon Technologies, Inc.	Device selection for providing a response
US20170092278A1 (en)	2015-09-30	2017-03-30	Apple Inc.	Speaker recognition
US9928840B2 (en)	2015-10-16	2018-03-27	Google Llc	Hotword recognition
US9747926B2 (en)	2015-10-16	2017-08-29	Google Inc.	Hotword recognition
US10691898B2 (en) *	2015-10-29	2020-06-23	Hitachi, Ltd.	Synchronization method for visual information and auditory information and information processing device
US10691473B2 (en)	2015-11-06	2020-06-23	Apple Inc.	Intelligent automated assistant in a messaging environment
US10373612B2 (en) *	2016-03-21	2019-08-06	Amazon Technologies, Inc.	Anchored speech detection and speech recognition
TWI595478B (zh) *	2016-04-21	2017-08-11	國立臺北大學	可學習不同語言及模仿不同語者說話方式之韻律參數語速正規化器、語速相依韻律模型建立器、可控語速之韻律訊息產生裝置及韻律訊息產生方法
US10192552B2 (en)	2016-06-10	2019-01-29	Apple Inc.	Digital assistant providing whispered speech
CN105957516B (zh) *	2016-06-16	2019-03-08	百度在线网络技术（北京）有限公司	多语音识别模型切换方法及装置
US10418026B2 (en) *	2016-07-15	2019-09-17	Comcast Cable Communications, Llc	Dynamic language and command recognition
US10403268B2 (en)	2016-09-08	2019-09-03	Intel IP Corporation	Method and system of automatic speech recognition using posterior confidence scores
US9786271B1 (en)	2016-09-28	2017-10-10	International Business Machines Corporation	Voice pattern coding sequence and cataloging voice matching system
CN106710586B (zh) *	2016-12-27	2020-06-30	北京儒博科技有限公司	一种语音识别引擎自动切换方法和装置
US10741174B2 (en) *	2017-01-24	2020-08-11	Lenovo (Singapore) Pte. Ltd.	Automatic language identification for speech
CN106997762A (zh) *	2017-03-08	2017-08-01	广东美的制冷设备有限公司	家用电器的语音控制方法以及装置
US10540983B2 (en) *	2017-06-01	2020-01-21	Sorenson Ip Holdings, Llc	Detecting and reducing feedback
CN107623614B (zh)	2017-09-19	2020-12-08	百度在线网络技术（北京）有限公司	用于推送信息的方法和装置
US10747817B2 (en)	2017-09-29	2020-08-18	Rovi Guides, Inc.	Recommending language models for search queries based on user profile
CN110770779B (zh) *	2017-10-13	2022-05-24	美的集团股份有限公司	用于提供个性化现场信息交换的方法和系统
CN107895578B (zh)	2017-11-15	2021-07-20	百度在线网络技术（北京）有限公司	语音交互方法和装置
US10679615B2 (en)	2018-04-16	2020-06-09	Google Llc	Adaptive interface in a voice-based networked system
WO2019203795A1 (en)	2018-04-16	2019-10-24	Google Llc	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
WO2019203794A1 (en)	2018-04-16	2019-10-24	Google Llc	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
US11119725B2 (en) *	2018-09-27	2021-09-14	Abl Ip Holding Llc	Customizable embedded vocal command sets for a lighting and/or other environmental controller
US11520561B1 (en) *	2018-11-28	2022-12-06	Amazon Technologies, Inc.	Neural network accelerator with compact instruct set

2018
- 2018-04-16 WO PCT/US2018/027812 patent/WO2019203795A1/en unknown
- 2018-04-16 EP EP20177711.7A patent/EP3723082B1/de active Active
- 2018-04-16 EP EP23191531.5A patent/EP4270385B1/de active Active
- 2018-04-16 EP EP18722336.7A patent/EP3580751B8/de active Active
- 2018-04-16 SG SG11201912053XA patent/SG11201912053XA/en unknown
- 2018-04-16 US US15/769,023 patent/US10896672B2/en active Active
- 2018-04-16 CN CN201880039579.9A patent/CN111052229B/zh active Active
- 2018-04-16 CN CN202311023420.7A patent/CN116959420A/zh active Pending
- 2018-05-07 US US15/973,466 patent/US10679611B2/en active Active
2020
- 2020-05-21 US US16/880,647 patent/US11817084B2/en active Active
- 2020-12-14 US US17/120,906 patent/US11817085B2/en active Active
2023
- 2023-11-13 US US18/389,033 patent/US12249319B2/en active Active

Also Published As

Publication number	Publication date
EP4270385A2 (de)	2023-11-01
US20190318724A1 (en)	2019-10-17
US20200135187A1 (en)	2020-04-30
EP3723082B1 (de)	2023-09-06
US10679611B2 (en)	2020-06-09
US12249319B2 (en)	2025-03-11
US20210097981A1 (en)	2021-04-01
US20200286467A1 (en)	2020-09-10
CN111052229B (zh)	2023-09-01
CN116959420A (zh)	2023-10-27
EP3580751A1 (de)	2019-12-18
US11817084B2 (en)	2023-11-14
US10896672B2 (en)	2021-01-19
EP3580751B8 (de)	2021-02-24
US11817085B2 (en)	2023-11-14
CN111052229A (zh)	2020-04-21
EP4270385A3 (de)	2023-12-13
EP4270385B1 (de)	2024-12-18
EP3580751B1 (de)	2020-06-03
EP3723082A1 (de)	2020-10-14
WO2019203795A1 (en)	2019-10-24
US20240194191A1 (en)	2024-06-13

Publication	Publication Date	Title
SG11201912053XA (en)	2020-01-30	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
SG11201912061WA (en)	2020-01-30	Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface
EP3752957A4 (de)	2021-11-17	System und verfahren für sprachverständnis über integrierte audio- und videobasierte spracherkennung
EP3888084A4 (de)	2022-01-05	Verfahren und vorrichtung zur bereitstellung eines spracherkennungsdienstes
GB202306034D0 (en)	2023-06-07	Improving speech recognition transcriptions
GB2572020B (en)	2022-02-02	A speech processing system and a method of processing a speech signal
EP3806089A4 (de)	2021-07-21	Verfahren und vorrichtung für gemischte spracherkennung und computerlesbares speichermedium
EP3754650A4 (de)	2021-10-06	Ortsabhängiges spracherkennungssystem durch sprachbefehl
EP3353766A4 (de)	2019-03-20	Verfahren zur automatischen erzeugung von scores für die produktion von sprachprobenassets für benutzer eines verteilten sprachenlernsystems, automatische akzenterkennung und quantifizierung sowie verbesserte spracherkennung
EP3504703A4 (de)	2019-08-21	Spracherkennungsverfahren und -vorrichtung
EP3193328A4 (de)	2017-12-06	Verfahren und vorrichtung zur durchführung von spracherkennung mit einem grammatikmodell
GB202219165D0 (en)	2023-02-01	Phrase alternatives representation for automatic speech recognition and methods of use
GB2560174B (en)	2020-09-23	Training an automatic speech recognition system
GB2604675B (en)	2023-10-25	Improving speech recognition transcriptions
EP3652732A4 (de)	2021-03-17	Auf silben basierende automatische spracherkennung
EP4053837A4 (de)	2023-11-08	Automatischer spracherkenner und spracherkennungsverfahren mit tastaturmakrofunktion
GB2596350B (en)	2023-10-04	A system and method for understanding and explaining spoken interactions using speech acoustic and linguistic markers
GB202117611D0 (en)	2022-01-19	Systems and methods for speech recognition
GB202405371D0 (en)	2024-05-29	Automatic measurement of semantic similarity of conversations
ATE400047T1 (de)	2008-07-15	Verfahren und system zum automatischen bereitstellen linguistischer formulierungen, die ausserhalb einer erkennungsdomäne eines automatischen spracherkennungssystems liegen
NZ753616A (en)	2020-05-29	System and method for parameterization of speech recognition grammar specification (srgs) grammars
EP3712886A4 (de)	2021-08-18	Vorrichtung und verfahren zur automatischen spracherkennung
EP4281965A4 (de)	2024-08-21	Qualitätsschätzung für automatische spracherkennung
GB2590277B (en)	2022-07-13	Speech recognition
EP3544001B8 (de)	2022-01-12	Verarbeitung von sprach-zu-text-transkriptionen