KR100632400B1 - 음성 인식을 이용한 입출력 장치 및 그 방법 - Google Patents
음성 인식을 이용한 입출력 장치 및 그 방법 Download PDFInfo
- Publication number
- KR100632400B1 KR100632400B1 KR1020050107944A KR20050107944A KR100632400B1 KR 100632400 B1 KR100632400 B1 KR 100632400B1 KR 1020050107944 A KR1020050107944 A KR 1020050107944A KR 20050107944 A KR20050107944 A KR 20050107944A KR 100632400 B1 KR100632400 B1 KR 100632400B1
- Authority
- KR
- South Korea
- Prior art keywords
- input
- speech recognition
- output device
- pointing
- command
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 37
- 238000012545 processing Methods 0.000 claims abstract description 15
- 238000005192 partition Methods 0.000 claims description 2
- 238000005516 engineering process Methods 0.000 abstract description 9
- 238000010586 diagram Methods 0.000 description 8
- 238000004891 communication Methods 0.000 description 5
- 238000010295 mobile communication Methods 0.000 description 5
- 238000013507 mapping Methods 0.000 description 3
- 210000001747 pupil Anatomy 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000004424 eye movement Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04842—Selection of displayed objects or displayed text elements
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (10)
- 입출력 장치에 있어서,외부의 음성 명령을 인식하기 위한 음성 인식 수단;상기 음성 인식 수단으로부터 전달받은 음성 인식 결과에 해당되는 화면상의 포인팅 위치를 계산하기 위한 포인팅 제어 수단;화면을 디스플레이하기 위한 화면 표시 수단; 및현재 포인팅 위치와 관련된 각종 명령어를 처리하기 위한 명령어 제어 수단을 포함하는 음성 인식을 이용한 입출력 장치.
- 제 1 항에 있어서,상기 명령어 제어 수단은,클릭, 더블클릭, 스크롤링을 위한 이벤트 명령어를 처리하는 것을 특징으로 하는 음성 인식을 이용한 입출력 장치.
- 제 1 항에 있어서,상기 명령어 제어 수단은,응용 프로그램 제어, 시스템 설정을 위한 실행 명령어를 처리하는 것을 특징 으로 하는 음성 인식을 이용한 입출력 장치.
- 제 1 항 내지 제 3 항 중 어느 한 항에 있어서,상기 포인팅 제어 수단은,화면을 일정 크기로 분할하여 포인팅 위치를 계산하는 것을 특징으로 하는 음성 인식을 이용한 입출력 장치.
- 제 4 항에 있어서,상기 포인팅 제어 수단은,세부 포인팅을 위해, 각 분할 영역을 다단계로 재분할하여 포인팅 위치를 계산하는 것을 특징으로 하는 음성 인식을 이용한 입출력 장치.
- 입출력 장치에서의 음성 인식을 이용한 입출력 방법에 있어서,외부의 음성 명령을 인식하는 음성 명령 인식 단계;상기 음성 인식된 명령어에 해당되는 화면상의 포인팅 위치를 계산하는 포인팅 위치 계산 단계;상기 계산한 포인팅 위치를 식별 가능하도록 디스플레이하는 화면 표시 단 계; 및상기 포인팅 위치와 관련된 각종 명령어를 실행하는 명령어 처리 단계를 포함하는 입출력 장치에서의 음성 인식을 이용한 입출력 방법.
- 제 6 항에 있어서,상기 명령어 처리 단계는,클릭, 더블클릭, 스크롤링을 위한 이벤트 명령어를 처리하는 것을 특징으로 하는 입출력 장치에서의 음성 인식을 이용한 입출력 방법.
- 제 6 항에 있어서,상기 명령어 처리 단계는,응용 프로그램 제어, 시스템 설정을 위한 실행 명령어를 처리하는 것을 특징으로 하는 입출력 장치에서의 음성 인식을 이용한 입출력 방법.
- 제 6 항 내지 제 8 항 중 어느 한 항에 있어서,상기 포인팅 위치 계산 단계는,화면을 일정 크기로 분할하여 포인팅 위치를 계산하는 것을 특징으로 하는 입출력 장치에서의 음성 인식을 이용한 입출력 방법.
- 제 9 항에 있어서,상기 포인팅 위치 계산 단계는,상기 분할 영역을 다단계로 재분할하여, 인식되는 음성 명령 차례에 따라 세부 포인팅 위치를 계산하는 것을 특징으로 하는 입출력 장치에서의 음성 인식을 이용한 입출력 방법.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020050107944A KR100632400B1 (ko) | 2005-11-11 | 2005-11-11 | 음성 인식을 이용한 입출력 장치 및 그 방법 |
US12/093,091 US8478600B2 (en) | 2005-11-11 | 2006-09-11 | Input/output apparatus based on voice recognition, and method thereof |
PCT/KR2006/003605 WO2007055470A1 (en) | 2005-11-11 | 2006-09-11 | Input/output apparatus based on voice recognition, and method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020050107944A KR100632400B1 (ko) | 2005-11-11 | 2005-11-11 | 음성 인식을 이용한 입출력 장치 및 그 방법 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR100632400B1 true KR100632400B1 (ko) | 2006-10-11 |
Family
ID=37635488
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020050107944A Active KR100632400B1 (ko) | 2005-11-11 | 2005-11-11 | 음성 인식을 이용한 입출력 장치 및 그 방법 |
Country Status (3)
Country | Link |
---|---|
US (1) | US8478600B2 (ko) |
KR (1) | KR100632400B1 (ko) |
WO (1) | WO2007055470A1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10629196B2 (en) | 2013-05-21 | 2020-04-21 | Samsung Electronics Co., Ltd. | Apparatus, system, and method for generating voice recognition guide by transmitting voice signal data to a voice recognition server which contains voice recognition guide information to send back to the voice recognition apparatus |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9794348B2 (en) * | 2007-06-04 | 2017-10-17 | Todd R. Smith | Using voice commands from a mobile device to remotely access and control a computer |
US8434153B2 (en) * | 2009-08-24 | 2013-04-30 | Microsoft Corporation | Application display on a locked device |
KR101474856B1 (ko) * | 2013-09-24 | 2014-12-30 | 주식회사 디오텍 | 음성인식을 통해 이벤트를 발생시키기 위한 장치 및 방법 |
JP2015207181A (ja) * | 2014-04-22 | 2015-11-19 | ソニー株式会社 | 情報処理装置、情報処理方法及びコンピュータプログラム |
US20170047065A1 (en) * | 2014-05-13 | 2017-02-16 | Nam Tae Park | Voice-controllable image display device and voice control method for image display device |
WO2016017978A1 (en) * | 2014-07-31 | 2016-02-04 | Samsung Electronics Co., Ltd. | Device and method for performing functions |
CN105100460A (zh) * | 2015-07-09 | 2015-11-25 | 上海斐讯数据通信技术有限公司 | 一种声音操控智能终端的方法及系统 |
CN105653164B (zh) * | 2015-07-31 | 2019-02-01 | 宇龙计算机通信科技(深圳)有限公司 | 一种语音输入用户事件的方法及终端 |
CN105677152A (zh) * | 2015-12-31 | 2016-06-15 | 宇龙计算机通信科技(深圳)有限公司 | 一种语音触屏操作处理的方法、装置以及终端 |
CN105955602B (zh) * | 2016-04-19 | 2019-07-30 | 深圳市全智达科技有限公司 | 一种移动终端操作方法及装置 |
CN106371801A (zh) * | 2016-09-23 | 2017-02-01 | 安徽声讯信息技术有限公司 | 一种基于语音识别技术的语音鼠标系统 |
AU2018226844B2 (en) | 2017-03-03 | 2021-11-18 | Pindrop Security, Inc. | Method and apparatus for detecting spoofing conditions |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5386494A (en) * | 1991-12-06 | 1995-01-31 | Apple Computer, Inc. | Method and apparatus for controlling a speech recognition function using a cursor control device |
JP2973726B2 (ja) * | 1992-08-31 | 1999-11-08 | 株式会社日立製作所 | 情報処理装置 |
CA2115210C (en) * | 1993-04-21 | 1997-09-23 | Joseph C. Andreshak | Interactive computer system recognizing spoken commands |
US5748974A (en) * | 1994-12-13 | 1998-05-05 | International Business Machines Corporation | Multimodal natural language interface for cross-application tasks |
JP3363283B2 (ja) * | 1995-03-23 | 2003-01-08 | 株式会社日立製作所 | 入力装置、入力方法、情報処理システムおよび入力情報の管理方法 |
DE69619592T2 (de) * | 1995-04-11 | 2002-11-07 | Dragon Systems Inc | Bewegung eines auf dem Bildschirm gezeigten Zeigers |
US5677990A (en) * | 1995-05-05 | 1997-10-14 | Panasonic Technologies, Inc. | System and method using N-best strategy for real time recognition of continuously spelled names |
KR19990041133A (ko) * | 1997-11-21 | 1999-06-15 | 윤종용 | 음성을 이용한 화면제어방법 |
KR20010009476A (ko) | 1999-07-09 | 2001-02-05 | 이주섭 | 적외선 무선 헤드 마우스 |
US6542866B1 (en) * | 1999-09-22 | 2003-04-01 | Microsoft Corporation | Speech recognition method and apparatus utilizing multiple feature streams |
KR100367590B1 (ko) | 2000-04-28 | 2003-01-10 | 엘지전자 주식회사 | 정보 표시 장치 및 방법 |
KR20020030156A (ko) | 2000-10-16 | 2002-04-24 | 박기범 | 음성인식을 이용한 컴퓨터 프로그램의 제어방법 |
KR100677294B1 (ko) | 2001-04-23 | 2007-02-05 | 엘지전자 주식회사 | 이동통신 단말기의 메뉴 선택 인터페이스 장치 |
KR20030010279A (ko) | 2001-07-26 | 2003-02-05 | 삼성전자주식회사 | 음성인식이 가능한 컴퓨터시스템 및 그 제어방법 |
US20020158827A1 (en) * | 2001-09-06 | 2002-10-31 | Zimmerman Dennis A. | Method for utilization of a gyroscopic or inertial device as a user interface mechanism for headmounted displays and body worn computers |
US7036080B1 (en) * | 2001-11-30 | 2006-04-25 | Sap Labs, Inc. | Method and apparatus for implementing a speech interface for a GUI |
-
2005
- 2005-11-11 KR KR1020050107944A patent/KR100632400B1/ko active Active
-
2006
- 2006-09-11 WO PCT/KR2006/003605 patent/WO2007055470A1/en active Application Filing
- 2006-09-11 US US12/093,091 patent/US8478600B2/en not_active Expired - Fee Related
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10629196B2 (en) | 2013-05-21 | 2020-04-21 | Samsung Electronics Co., Ltd. | Apparatus, system, and method for generating voice recognition guide by transmitting voice signal data to a voice recognition server which contains voice recognition guide information to send back to the voice recognition apparatus |
US11024312B2 (en) | 2013-05-21 | 2021-06-01 | Samsung Electronics Co., Ltd. | Apparatus, system, and method for generating voice recognition guide by transmitting voice signal data to a voice recognition server which contains voice recognition guide information to send back to the voice recognition apparatus |
US11869500B2 (en) | 2013-05-21 | 2024-01-09 | Samsung Electronics Co., Ltd. | Apparatus, system, and method for generating voice recognition guide by transmitting voice signal data to a voice recognition server which contains voice recognition guide information to send back to the voice recognition apparatus |
Also Published As
Publication number | Publication date |
---|---|
US20080288260A1 (en) | 2008-11-20 |
US8478600B2 (en) | 2013-07-02 |
WO2007055470A1 (en) | 2007-05-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8478600B2 (en) | Input/output apparatus based on voice recognition, and method thereof | |
US11532306B2 (en) | Detecting a trigger of a digital assistant | |
EP4057279B1 (en) | Natural assistant interaction | |
US10395659B2 (en) | Providing an auditory-based interface of a digital assistant | |
US10230841B2 (en) | Intelligent digital assistant for declining an incoming call | |
US10789945B2 (en) | Low-latency intelligent automated assistant | |
EP3320459B1 (en) | Distributed personal assistant | |
US11010550B2 (en) | Unified language modeling framework for word prediction, auto-completion and auto-correction | |
US10592601B2 (en) | Multilingual word prediction | |
US10366158B2 (en) | Efficient word encoding for recurrent neural network language models | |
EP3120344B1 (en) | Visual indication of a recognized voice-initiated action | |
US20190122666A1 (en) | Digital assistant providing whispered speech | |
EP2426598B1 (en) | Apparatus and method for user intention inference using multimodal information | |
US7548859B2 (en) | Method and system for assisting users in interacting with multi-modal dialog systems | |
EP4060659B1 (en) | Low-latency intelligent automated assistant | |
EP3593350B1 (en) | User interface for correcting recognition errors | |
DK179558B1 (en) | DETECTING A TRIGGER OF A DIGITAL ASSISTANT | |
EP4298501B1 (en) | Predictive input interface having improved robustness for processing low precision inputs | |
KR20090022465A (ko) | 단말기 메뉴 선택 방법 및 이를 구비한 단말기 | |
Stern et al. | State-machine based approach for improving robustness in multimodal control |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
PA0109 | Patent application |
Patent event code: PA01091R01D Comment text: Patent Application Patent event date: 20051111 |
|
PA0201 | Request for examination | ||
E701 | Decision to grant or registration of patent right | ||
PE0701 | Decision of registration |
Patent event code: PE07011S01D Comment text: Decision to Grant Registration Patent event date: 20060922 |
|
GRNT | Written decision to grant | ||
PR0701 | Registration of establishment |
Comment text: Registration of Establishment Patent event date: 20060928 Patent event code: PR07011E01D |
|
PR1002 | Payment of registration fee |
Payment date: 20060929 End annual number: 3 Start annual number: 1 |
|
PG1601 | Publication of registration | ||
PR1001 | Payment of annual fee |
Payment date: 20090901 Start annual number: 4 End annual number: 4 |
|
PR1001 | Payment of annual fee |
Payment date: 20100901 Start annual number: 5 End annual number: 5 |
|
PR1001 | Payment of annual fee |
Payment date: 20110831 Start annual number: 6 End annual number: 6 |
|
FPAY | Annual fee payment |
Payment date: 20120910 Year of fee payment: 7 |
|
PR1001 | Payment of annual fee |
Payment date: 20120910 Start annual number: 7 End annual number: 7 |
|
FPAY | Annual fee payment |
Payment date: 20130829 Year of fee payment: 8 |
|
PR1001 | Payment of annual fee |
Payment date: 20130829 Start annual number: 8 End annual number: 8 |
|
LAPS | Lapse due to unpaid annual fee |