BR112016024086A2 - geração de modelo de palavra-chave para detecção de palavra-chave definida por usuário - Google Patents
geração de modelo de palavra-chave para detecção de palavra-chave definida por usuárioInfo
- Publication number
- BR112016024086A2 BR112016024086A2 BR112016024086A BR112016024086A BR112016024086A2 BR 112016024086 A2 BR112016024086 A2 BR 112016024086A2 BR 112016024086 A BR112016024086 A BR 112016024086A BR 112016024086 A BR112016024086 A BR 112016024086A BR 112016024086 A2 BR112016024086 A2 BR 112016024086A2
- Authority
- BR
- Brazil
- Prior art keywords
- keyword
- user
- model
- subword
- template generation
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/027—Syllables being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- User Interface Of Digital Computer (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
de acordo com um aspecto da presente revelação, um método para gerar um modelo de palavra-chave de uma palavra-chave definida por usuário em um dispositivo eletrônico é revelado. o método inclui receber pelo menos uma entrada indicativa da palavra-chave definida pelo usuário, determinar uma sequência de palavras-chave a partir de pelo menos uma entrada, gerar o modelo de palavra-chave associado à palavra-chave definida pelo usuário com base na sequência de subpalavras e um modelo de subpalavra das subpalavras, em que o modelo de palavra-chave é configurado para modelar uma pluralidade de características acústicas das subpalavras com base em um banco de dados de fala e fornecer o modelo de palavra-chave associado à palavra-chave definida pelo usuário a uma unidade de ativação de voz configurada com um modelo de palavra-chave associado a uma palavra-chave predeterminada.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201461980911P | 2014-04-17 | 2014-04-17 | |
US14/466,644 US9953632B2 (en) | 2014-04-17 | 2014-08-22 | Keyword model generation for detecting user-defined keyword |
PCT/US2015/024873 WO2015160586A1 (en) | 2014-04-17 | 2015-04-08 | Keyword model generation for detecting user-defined keyword |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112016024086A2 true BR112016024086A2 (pt) | 2017-08-15 |
Family
ID=54322537
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR112016024086A BR112016024086A2 (pt) | 2014-04-17 | 2015-04-08 | geração de modelo de palavra-chave para detecção de palavra-chave definida por usuário |
Country Status (7)
Country | Link |
---|---|
US (1) | US9953632B2 (pt) |
EP (1) | EP3132442B1 (pt) |
JP (1) | JP2017515147A (pt) |
KR (1) | KR20160145634A (pt) |
CN (1) | CN106233374B (pt) |
BR (1) | BR112016024086A2 (pt) |
WO (1) | WO2015160586A1 (pt) |
Families Citing this family (59)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US10019983B2 (en) * | 2012-08-30 | 2018-07-10 | Aravind Ganapathiraju | Method and system for predicting speech recognition performance using accuracy scores |
DE212014000045U1 (de) | 2013-02-07 | 2015-09-24 | Apple Inc. | Sprach-Trigger für einen digitalen Assistenten |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9866741B2 (en) * | 2015-04-20 | 2018-01-09 | Jesse L. Wobrock | Speaker-dependent voice-activated camera system |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10304440B1 (en) * | 2015-07-10 | 2019-05-28 | Amazon Technologies, Inc. | Keyword spotting using multi-task configuration |
US9792907B2 (en) | 2015-11-24 | 2017-10-17 | Intel IP Corporation | Low resource key phrase detection for wake on voice |
US9972313B2 (en) * | 2016-03-01 | 2018-05-15 | Intel Corporation | Intermediate scoring and rejection loopback for improved key phrase detection |
CN105868182B (zh) * | 2016-04-21 | 2019-08-30 | 深圳市中兴移动软件有限公司 | 一种文本信息处理方法及装置 |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US12197817B2 (en) | 2016-06-11 | 2025-01-14 | Apple Inc. | Intelligent device arbitration and control |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US10043521B2 (en) | 2016-07-01 | 2018-08-07 | Intel IP Corporation | User defined key phrase detection by user dependent sequence modeling |
US10083689B2 (en) * | 2016-12-23 | 2018-09-25 | Intel Corporation | Linear scoring for low power wake on voice |
US10276161B2 (en) * | 2016-12-27 | 2019-04-30 | Google Llc | Contextual hotwords |
JP6599914B2 (ja) * | 2017-03-09 | 2019-10-30 | 株式会社東芝 | 音声認識装置、音声認識方法およびプログラム |
CN107146611B (zh) * | 2017-04-10 | 2020-04-17 | 北京猎户星空科技有限公司 | 一种语音响应方法、装置及智能设备 |
US20180336275A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Intelligent automated assistant for media exploration |
US10313845B2 (en) * | 2017-06-06 | 2019-06-04 | Microsoft Technology Licensing, Llc | Proactive speech detection and alerting |
WO2018228515A1 (en) * | 2017-06-15 | 2018-12-20 | Beijing Didi Infinity Technology And Development Co., Ltd. | Systems and methods for speech recognition |
CN107564517A (zh) * | 2017-07-05 | 2018-01-09 | 百度在线网络技术(北京)有限公司 | 语音唤醒方法、设备及系统、云端服务器与可读介质 |
CN109903751B (zh) * | 2017-12-08 | 2023-07-07 | 阿里巴巴集团控股有限公司 | 关键词确认方法和装置 |
EP3692522B1 (en) * | 2017-12-31 | 2025-06-18 | Midea Group Co., Ltd. | Method and system for controlling home assistant devices |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
CN108665900B (zh) | 2018-04-23 | 2020-03-03 | 百度在线网络技术(北京)有限公司 | 云端唤醒方法及系统、终端以及计算机可读存储介质 |
JP2019191490A (ja) * | 2018-04-27 | 2019-10-31 | 東芝映像ソリューション株式会社 | 音声対話端末、および音声対話端末制御方法 |
CN110797021B (zh) | 2018-05-24 | 2022-06-07 | 腾讯科技(深圳)有限公司 | 混合语音识别网络训练方法、混合语音识别方法、装置及存储介质 |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
US10714122B2 (en) | 2018-06-06 | 2020-07-14 | Intel Corporation | Speech classification of audio for wake on voice |
US10269376B1 (en) * | 2018-06-28 | 2019-04-23 | Invoca, Inc. | Desired signal spotting in noisy, flawed environments |
US10650807B2 (en) | 2018-09-18 | 2020-05-12 | Intel Corporation | Method and system of neural network keyphrase detection |
US11100923B2 (en) * | 2018-09-28 | 2021-08-24 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
CN109635273B (zh) * | 2018-10-25 | 2023-04-25 | 平安科技(深圳)有限公司 | 文本关键词提取方法、装置、设备及存储介质 |
CN109473123B (zh) * | 2018-12-05 | 2022-05-31 | 百度在线网络技术(北京)有限公司 | 语音活动检测方法及装置 |
CN109767763B (zh) * | 2018-12-25 | 2021-01-26 | 苏州思必驰信息科技有限公司 | 自定义唤醒词的确定方法和用于确定自定义唤醒词的装置 |
TW202029181A (zh) * | 2019-01-28 | 2020-08-01 | 正崴精密工業股份有限公司 | 語音識別用於特定目標喚醒的方法及裝置 |
CN109979440B (zh) * | 2019-03-13 | 2021-05-11 | 广州市网星信息技术有限公司 | 关键词样本确定方法、语音识别方法、装置、设备和介质 |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11127394B2 (en) | 2019-03-29 | 2021-09-21 | Intel Corporation | Method and system of high accuracy keyphrase detection for low resource devices |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
CN110349566B (zh) * | 2019-07-11 | 2020-11-24 | 龙马智芯(珠海横琴)科技有限公司 | 语音唤醒方法、电子设备及存储介质 |
WO2021030918A1 (en) * | 2019-08-22 | 2021-02-25 | Fluent.Ai Inc. | User-defined keyword spotting |
JP7098587B2 (ja) * | 2019-08-29 | 2022-07-11 | 株式会社東芝 | 情報処理装置、キーワード検出装置、情報処理方法およびプログラム |
CN110634468B (zh) * | 2019-09-11 | 2022-04-15 | 中国联合网络通信集团有限公司 | 语音唤醒方法、装置、设备及计算机可读存储介质 |
US11295741B2 (en) | 2019-12-05 | 2022-04-05 | Soundhound, Inc. | Dynamic wakewords for speech-enabled devices |
CN111128138A (zh) * | 2020-03-30 | 2020-05-08 | 深圳市友杰智新科技有限公司 | 语音唤醒方法、装置、计算机设备和存储介质 |
CN111540363B (zh) * | 2020-04-20 | 2023-10-24 | 合肥讯飞数码科技有限公司 | 关键词模型及解码网络构建方法、检测方法及相关设备 |
US12301635B2 (en) | 2020-05-11 | 2025-05-13 | Apple Inc. | Digital assistant hardware abstraction |
CN111798840B (zh) * | 2020-07-16 | 2023-08-08 | 中移在线服务有限公司 | 语音关键词识别方法和装置 |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
KR20220099003A (ko) | 2021-01-05 | 2022-07-12 | 삼성전자주식회사 | 전자 장치 및 이의 제어 방법 |
KR20220111574A (ko) | 2021-02-02 | 2022-08-09 | 삼성전자주식회사 | 전자 장치 및 그 제어 방법 |
WO2023150132A1 (en) * | 2022-02-01 | 2023-08-10 | Apple Inc. | Keyword detection using motion sensing |
US20230245657A1 (en) * | 2022-02-01 | 2023-08-03 | Apple Inc. | Keyword detection using motion sensing |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5199077A (en) * | 1991-09-19 | 1993-03-30 | Xerox Corporation | Wordspotting for voice editing and indexing |
CA2088080C (en) * | 1992-04-02 | 1997-10-07 | Enrico Luigi Bocchieri | Automatic speech recognizer |
US5623578A (en) * | 1993-10-28 | 1997-04-22 | Lucent Technologies Inc. | Speech recognition system allows new vocabulary words to be added without requiring spoken samples of the words |
US5768474A (en) * | 1995-12-29 | 1998-06-16 | International Business Machines Corporation | Method and system for noise-robust speech processing with cochlea filters in an auditory model |
US5960395A (en) | 1996-02-09 | 1999-09-28 | Canon Kabushiki Kaisha | Pattern matching method, apparatus and computer readable memory medium for speech recognition using dynamic programming |
CN100397384C (zh) | 1997-02-07 | 2008-06-25 | 卡西欧计算机株式会社 | 车载终端装置 |
JP3790038B2 (ja) * | 1998-03-31 | 2006-06-28 | 株式会社東芝 | サブワード型不特定話者音声認識装置 |
US6292778B1 (en) * | 1998-10-30 | 2001-09-18 | Lucent Technologies Inc. | Task-independent utterance verification with subword-based minimum verification error training |
JP2001042891A (ja) * | 1999-07-27 | 2001-02-16 | Suzuki Motor Corp | 音声認識装置、音声認識搭載装置、音声認識搭載システム、音声認識方法、及び記憶媒体 |
US20060074664A1 (en) | 2000-01-10 | 2006-04-06 | Lam Kwok L | System and method for utterance verification of chinese long and short keywords |
GB0028277D0 (en) * | 2000-11-20 | 2001-01-03 | Canon Kk | Speech processing system |
EP1215661A1 (en) * | 2000-12-14 | 2002-06-19 | TELEFONAKTIEBOLAGET L M ERICSSON (publ) | Mobile terminal controllable by spoken utterances |
US7027987B1 (en) * | 2001-02-07 | 2006-04-11 | Google Inc. | Voice interface for a search engine |
JP4655184B2 (ja) * | 2001-08-01 | 2011-03-23 | ソニー株式会社 | 音声認識装置および方法、記録媒体、並びにプログラム |
CN100349206C (zh) * | 2005-09-12 | 2007-11-14 | 周运南 | 文字语音互转装置 |
KR100679051B1 (ko) | 2005-12-14 | 2007-02-05 | 삼성전자주식회사 | 복수의 신뢰도 측정 알고리즘을 이용한 음성 인식 장치 및방법 |
CN101320561A (zh) * | 2007-06-05 | 2008-12-10 | 赛微科技股份有限公司 | 提升个人语音识别率的方法及模块 |
JP5467043B2 (ja) * | 2008-06-06 | 2014-04-09 | 株式会社レイトロン | 音声認識装置、音声認識方法および電子機器 |
JP5375423B2 (ja) * | 2009-08-10 | 2013-12-25 | 日本電気株式会社 | 音声認識システム、音声認識方法および音声認識プログラム |
US8438028B2 (en) * | 2010-05-18 | 2013-05-07 | General Motors Llc | Nametag confusability determination |
US9117449B2 (en) | 2012-04-26 | 2015-08-25 | Nuance Communications, Inc. | Embedded system for construction of small footprint speech recognition with user-definable constraints |
US9672815B2 (en) | 2012-07-20 | 2017-06-06 | Interactive Intelligence Group, Inc. | Method and system for real-time keyword spotting for speech analytics |
US10019983B2 (en) | 2012-08-30 | 2018-07-10 | Aravind Ganapathiraju | Method and system for predicting speech recognition performance using accuracy scores |
CN104700832B (zh) * | 2013-12-09 | 2018-05-25 | 联发科技股份有限公司 | 语音关键字检测系统及方法 |
-
2014
- 2014-08-22 US US14/466,644 patent/US9953632B2/en active Active
-
2015
- 2015-04-08 EP EP15717387.3A patent/EP3132442B1/en active Active
- 2015-04-08 WO PCT/US2015/024873 patent/WO2015160586A1/en active Application Filing
- 2015-04-08 JP JP2016562023A patent/JP2017515147A/ja not_active Ceased
- 2015-04-08 CN CN201580020007.2A patent/CN106233374B/zh active Active
- 2015-04-08 BR BR112016024086A patent/BR112016024086A2/pt not_active IP Right Cessation
- 2015-04-08 KR KR1020167030186A patent/KR20160145634A/ko not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
EP3132442A1 (en) | 2017-02-22 |
JP2017515147A (ja) | 2017-06-08 |
US9953632B2 (en) | 2018-04-24 |
WO2015160586A1 (en) | 2015-10-22 |
CN106233374A (zh) | 2016-12-14 |
EP3132442B1 (en) | 2018-07-04 |
KR20160145634A (ko) | 2016-12-20 |
CN106233374B (zh) | 2020-01-10 |
US20150302847A1 (en) | 2015-10-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112016024086A2 (pt) | geração de modelo de palavra-chave para detecção de palavra-chave definida por usuário | |
CO2017007032A2 (es) | Actualización de modelos de clasificador de entendimiento de lenguaje para un asistente digital personal basándose en externalización masiva | |
MX342073B (es) | Modelo de gramatica para consultas de busqueda estructuradas. | |
BR112017010222A2 (pt) | discriminando expressões ambíguas para aprimorar experiência do usuário | |
WO2014197334A3 (en) | System and method for user-specified pronunciation of words for speech synthesis and recognition | |
CO2017007037A2 (es) | Métodos para el entendimiento de consulta de lenguaje natural incompleta | |
BR112018010876A2 (pt) | dispositivo eletrônico que gera notificação com base nos dados de contexto em resposta à frase da fala de usuário | |
MX361584B (es) | Presentacion de lenguaje natural de consultas de busqueda estructuradas. | |
BR112016028797A2 (pt) | modelagem de contexto de sessão para sistemas de entendimento de conversação | |
TW201612773A (en) | Multi-command single utterance input method | |
BR112016022268A2 (pt) | Treinamento, reconhecimento e geração em uma rede de extrema convicção de pico (dbn) | |
BR112016002229A2 (pt) | sistema de reconhecimento de comportamento neurolinguístico cognitivo para fusão de dados de multissensor | |
BR112016007121A2 (pt) | método para a análise de acústica de uma máquina, sistema para a análise de acústica de uma máquina e produto | |
BR112019024679A2 (pt) | sistema e método para gerar automaticamente saída musical | |
WO2013009578A3 (en) | Systems and methods for speech command processing | |
BR112017002834A2 (pt) | transformações de fluxos de eventos | |
MX2019001216A (es) | Sistemas y metodos para ejecutar una funcion suplementaria para una consulta de lenguaje natural. | |
MX2016016289A (es) | Aprendizaje y uso de reglas de recuperacion de contenido contextual para desambiguacion de consulta. | |
CL2015002728A1 (es) | Un método para proporcionar un entorno de aprendizaje de idiomas | |
BR112013017175A2 (pt) | dispositivo e método de provisão de informação, e, meio de gravação legível por computador | |
RU2013156495A (ru) | Разрешение семантической неоднозначности при помощи семантического классификатора | |
MX393839B (es) | Mejoramiento de las operaciones del yacimiento petrolifero con computacion cognitiva. | |
CL2014002859A1 (es) | Metodo y sistema para utilizar ejemplos negativos de palabras en un sistema de reconocimiento de voz, en que el metodo comprende definir un conjunto de palabras, identificar un conjunto de ejemplos negativos de dichas palabras, realizar un reconocimiento de palabra clave en dichos conjuntos, determinar valores de confianza de palabras en dichos conjuntos, identificar al menos una palabra candidata de dicho conjunto de palabras, comparar valores de confianza, aceptar la palabra candidata. | |
BR112019000188A2 (pt) | método implementado por computador, meio não transitório, legível por computador e sistema implementado por computador | |
BR112017018726A2 (pt) | ?método implementado por computador para associar um usuário a um dispositivo vestível, e, sistema para associar um usuário a um dispositivo vestível?. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B08F | Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette] |
Free format text: REFERENTE A 5A ANUIDADE. |
|
B08K | Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette] |
Free format text: REFERENTE AO DESPACHO 8.6 PUBLICADO NA RPI 2561 DE 04/02/2020. |
|
B350 | Update of information on the portal [chapter 15.35 patent gazette] |