KR100202425B1 - Speech recognition system for recognizing remote control commands of home appliances - Google Patents
Speech recognition system for recognizing remote control commands of home appliances
- Publication number
- KR100202425B1 (application KR1019920015484A / KR920015484A)
- Authority
- KR
- South Korea
- Prior art keywords
- output
- value
- binarization
- voice
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
- Selective Calling Equipment (AREA)
Abstract
Description
Claims (3)
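- A speech recognition system for recognizing remote control commands of a home appliance, comprising: a microphone for receiving speech input by a user; speech analysis means for dividing the speech signal input through the microphone into a plurality of data within a predetermined time period; binarization means having a plurality of filters to which the plurality of data from the speech analysis means are respectively applied, the binarization means performing binarization according to values obtained by sequentially comparing the outputs of two adjacent filters among the plurality of filters, and outputting binarized data; and a multilayer neural network comprising neural networks in a plurality of layers, each neural network having a sub-neural network of at least one layer, the multilayer neural network learning the binarized data output from the binarization means through the neural network of each layer and integrating and outputting the results.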
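- The speech recognition system of claim 1, wherein the binarization means comprises: first means, having a plurality of filters to which the plurality of data are respectively applied, for sequentially comparing the outputs of two adjacent filters among the plurality of filters and assigning a value of a first state if the magnitude increases and a value of a second state otherwise; second means for assigning the value of the first state when both filters adjacent to a center filter among the plurality of filters have values smaller than the value of the center filter, and assigning the value of the second state otherwise; and third means for normalizing the filter outputs by a fixed ratio and then assigning the value of the first state when the result is greater than a predetermined threshold, and assigning the value of the second state otherwise.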
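- The speech recognition system of claim 1, wherein the multilayer neural network: initializes the weights between all nodes; presents pairs of an input and its corresponding output at the input and output; computes the weighted sum of the inputs at each node and generates an output through a hard-limit nonlinear function; compares the output at the output node with the desired output value to calculate an error; stores the weight change corresponding to the error value; performs the above process for all inputs and ends learning if all output values equal the desired values, and otherwise adds the sum of the weight changes to each weight; and, when the desired result is not obtained after repeating the above process a predetermined number of times, adds a layer and repeats the above process using the output of the preceding layer together with the original input as the new input.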
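The three binarization rules of the second claim above can be illustrated with a short sketch. This is only an illustration under stated assumptions, not the patented implementation: the claim does not say how the two states are encoded (here 1 and 0), how the three results are combined (here simply concatenated), or what the ratio and threshold are, so `FIRST_STATE`, `SECOND_STATE`, `ratio`, `threshold`, and all function names are hypothetical.

```python
from typing import List, Sequence

FIRST_STATE = 1   # assumed encoding of the claim's "first state"
SECOND_STATE = 0  # assumed encoding of the claim's "second state"


def rising_bits(outputs: Sequence[float]) -> List[int]:
    """First means: compare adjacent filter outputs in order.

    Emits FIRST_STATE when the next filter output is larger than the
    current one, SECOND_STATE otherwise (len(outputs) - 1 values).
    """
    return [FIRST_STATE if nxt > cur else SECOND_STATE
            for cur, nxt in zip(outputs, outputs[1:])]


def peak_bits(outputs: Sequence[float]) -> List[int]:
    """Second means: mark filters whose two neighbours are both smaller."""
    return [FIRST_STATE
            if outputs[i - 1] < outputs[i] and outputs[i + 1] < outputs[i]
            else SECOND_STATE
            for i in range(1, len(outputs) - 1)]


def threshold_bits(outputs: Sequence[float], ratio: float,
                   threshold: float) -> List[int]:
    """Third means: scale by a fixed ratio, then compare to a threshold."""
    return [FIRST_STATE if value * ratio > threshold else SECOND_STATE
            for value in outputs]


def binarize_frame(outputs: Sequence[float], ratio: float = 1.0,
                   threshold: float = 0.5) -> List[int]:
    """Concatenate the three binary codes for one frame of filter outputs.

    Concatenation is an assumption; the claim only lists the three means.
    """
    return (rising_bits(outputs)
            + peak_bits(outputs)
            + threshold_bits(outputs, ratio, threshold))


# Example frame with four hypothetical filter outputs.
frame_code = binarize_frame([0.2, 0.8, 0.5, 0.9])
```

For N filter outputs this sketch yields (N - 1) + (N - 2) + N binary values per frame, which would then serve as the input vector of the neural network.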
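The learning procedure of the third claim above (hard-limit nodes, weight changes accumulated over all inputs, and a layer added when learning does not succeed) can be sketched roughly as follows. Several details are assumptions that the claim does not specify: a single output node per layer, zero initial weights, an added bias term, a perceptron-style weight change of `rate * error * input`, and the names `train_layer`, `train_growing`, `predict`, `max_epochs`, and `max_layers`. Convergence is not guaranteed for arbitrary data.

```python
from typing import List, Sequence, Tuple


def hard_limit(total: float) -> int:
    """Hard-limit nonlinearity: 1 for a positive weighted sum, else 0."""
    return 1 if total > 0 else 0


def node_output(weights: Sequence[float], x: Sequence[float]) -> int:
    """Weighted sum of the inputs plus a bias (assumed), then hard limit."""
    extended = list(x) + [1.0]  # last weight acts as the bias
    return hard_limit(sum(w * v for w, v in zip(weights, extended)))


def train_layer(inputs: Sequence[Sequence[float]], targets: Sequence[int],
                max_epochs: int = 100,
                rate: float = 1.0) -> Tuple[List[float], bool]:
    """Train one node; weight changes are summed over all inputs per pass."""
    weights = [0.0] * (len(inputs[0]) + 1)  # assumed zero initialization
    for _ in range(max_epochs):
        deltas = [0.0] * len(weights)
        all_correct = True
        for x, target in zip(inputs, targets):
            error = target - node_output(weights, x)
            if error != 0:
                all_correct = False
                for i, v in enumerate(list(x) + [1.0]):
                    deltas[i] += rate * error * v  # store the weight change
        if all_correct:
            return weights, True  # every output equals the desired value
        weights = [w + d for w, d in zip(weights, deltas)]
    return weights, False


def train_growing(inputs: Sequence[Sequence[float]], targets: Sequence[int],
                  max_layers: int = 3,
                  max_epochs: int = 100) -> Tuple[List[List[float]], bool]:
    """Add a layer whenever the current one fails within max_epochs passes."""
    layers: List[List[float]] = []
    current = [list(x) for x in inputs]
    for _ in range(max_layers):
        weights, converged = train_layer(current, targets, max_epochs)
        layers.append(weights)
        if converged:
            return layers, True
        # New input = output of the preceding layer + the original input.
        current = [[node_output(weights, c)] + list(x)
                   for c, x in zip(current, inputs)]
    return layers, False


def predict(layers: Sequence[Sequence[float]], x: Sequence[float]) -> int:
    """Run an input through the stacked layers built by train_growing."""
    current = list(x)
    for weights in layers[:-1]:
        current = [node_output(weights, current)] + list(x)
    return node_output(layers[-1], current)


# Usage on a small, linearly separable toy problem (logical AND).
and_inputs = [[0, 0], [0, 1], [1, 0], [1, 1]]
and_targets = [0, 0, 0, 1]
and_layers, and_converged = train_growing(and_inputs, and_targets)
```

In this sketch each added layer sees both the previous layer's decision and the raw binarized input, which mirrors the claim's wording that the output of the preceding layer and the original input together form the new input.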
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019920015484A KR100202425B1 (ko) | 1992-08-27 | 1992-08-27 | Speech recognition system for recognizing remote control commands of home appliances |
JP5209702A JPH06161496A (ja) | 1992-08-27 | 1993-08-24 | Speech recognition system for recognizing remote control commands of home appliances |
DE4328752A DE4328752B4 (de) | 1992-08-27 | 1993-08-26 | Speech recognition system |
US08/112,037 US5471557A (en) | 1992-08-27 | 1993-08-26 | Speech recognition system utilizing a neural network |
FR9310270A FR2695246B1 (fr) | 1992-08-27 | 1993-08-26 | Speech recognition system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019920015484A KR100202425B1 (ko) | 1992-08-27 | 1992-08-27 | Speech recognition system for recognizing remote control commands of home appliances |
Publications (1)
Publication Number | Publication Date |
---|---|
KR100202425B1 (ko) | 1999-06-15 |
Family
ID=19338592
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1019920015484A Expired - Fee Related KR100202425B1 (ko) | Speech recognition system for recognizing remote control commands of home appliances | 1992-08-27 | 1992-08-27 |
Country Status (5)
Country | Link |
---|---|
US (1) | US5471557A (ko) |
JP (1) | JPH06161496A (ko) |
KR (1) | KR100202425B1 (ko) |
DE (1) | DE4328752B4 (ko) |
FR (1) | FR2695246B1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11449307B2 (en) | 2017-07-10 | 2022-09-20 | Samsung Electronics Co., Ltd. | Remote controller for controlling an external device using voice recognition and method thereof |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5904697A (en) * | 1995-02-24 | 1999-05-18 | Heartport, Inc. | Devices and methods for performing a vascular anastomosis |
DE19705471C2 (de) * | 1997-02-13 | 1998-04-09 | Sican F & E Gmbh Sibet | Method and circuit arrangement for speech recognition and voice control of devices |
KR100644173B1 (ko) | 1997-07-03 | 2006-11-10 | Toshiba Corporation | Wireless receiving apparatus |
DE19754382A1 (de) * | 1997-12-08 | 1999-06-10 | Siemens Nixdorf Inf Syst | Device combination of television and computer parts with access to a communication network, and remote control therefor |
US7266498B1 (en) * | 1998-12-18 | 2007-09-04 | Intel Corporation | Method and apparatus for reducing conflicts between speech-enabled applications sharing speech menu |
JP3979556B2 (ja) * | 1998-12-22 | 2007-09-19 | Pioneer Corporation | Program selection apparatus and program selection method |
US6397186B1 (en) | 1999-12-22 | 2002-05-28 | Ambush Interactive, Inc. | Hands-free, voice-operated remote control transmitter |
ES2273870T3 (es) * | 2000-07-28 | 2007-05-16 | Koninklijke Philips Electronics N.V. | System for controlling an apparatus with voice instructions |
US7006969B2 (en) * | 2000-11-02 | 2006-02-28 | At&T Corp. | System and method of pattern recognition in very high-dimensional space |
US7369993B1 (en) | 2000-11-02 | 2008-05-06 | At&T Corp. | System and method of pattern recognition in very high-dimensional space |
US6845357B2 (en) * | 2001-07-24 | 2005-01-18 | Honeywell International Inc. | Pattern recognition using an observable operator model |
EP1417678A1 (de) * | 2001-08-13 | 2004-05-12 | Hans Geiger | Method and device for recognizing a phonetic sound sequence or character sequence |
KR20030034443A (ko) * | 2001-10-23 | 2003-05-09 | Samsung Electronics Co., Ltd. | Apparatus and method for controlling a voice recognition user interface |
KR20030047153A (ko) * | 2001-12-08 | 2003-06-18 | 임소영 | New-style user interface system and method for electronic devices applying voice recognition |
US20080147579A1 (en) * | 2006-12-14 | 2008-06-19 | Microsoft Corporation | Discriminative training using boosted lasso |
CN103679185B (zh) * | 2012-08-31 | 2017-06-16 | Fujitsu Limited | Convolutional neural network classifier system, training method therefor, classification method, and use |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2558682B2 (ja) * | 1987-03-13 | 1996-11-27 | Toshiba Corporation | Intelligent workstation |
DE3819178A1 (de) * | 1987-06-04 | 1988-12-22 | Ricoh Kk | Speech recognition method and device |
US5136653A (en) * | 1988-01-11 | 1992-08-04 | Ezel, Inc. | Acoustic recognition system using accumulate power series |
US5214745A (en) * | 1988-08-25 | 1993-05-25 | Sutherland John G | Artificial neural device utilizing phase orientation in the complex number domain to encode and decode stimulus response patterns |
GB8908205D0 (en) * | 1989-04-12 | 1989-05-24 | Smiths Industries Plc | Speech recognition apparatus and methods |
GB8911461D0 (en) * | 1989-05-18 | 1989-07-05 | Smiths Industries Plc | Temperature adaptors |
US5086479A (en) * | 1989-06-30 | 1992-02-04 | Hitachi, Ltd. | Information processing system using neural network learning function |
DE4031421C2 (de) * | 1989-10-05 | 1995-08-24 | Ricoh Kk | Pattern matching system for a speech recognition device |
JPH03123399A (ja) * | 1989-10-06 | 1991-05-27 | Ricoh Co Ltd | Speech recognition device |
- 1992-08-27: KR application KR1019920015484A, patent KR100202425B1 (ko), not active (Expired - Fee Related)
- 1993-08-24: JP application JP5209702A, patent JPH06161496A (ja), active (Pending)
- 1993-08-26: FR application FR9310270A, patent FR2695246B1 (fr), not active (Expired - Fee Related)
- 1993-08-26: DE application DE4328752A, patent DE4328752B4 (de), not active (Expired - Fee Related)
- 1993-08-26: US application US08/112,037, patent US5471557A (en), not active (Expired - Fee Related)
Also Published As
Publication number | Publication date |
---|---|
DE4328752A1 (de) | 1994-03-03 |
US5471557A (en) | 1995-11-28 |
DE4328752B4 (de) | 2004-08-05 |
FR2695246A1 (fr) | 1994-03-04 |
JPH06161496A (ja) | 1994-06-07 |
FR2695246B1 (fr) | 1996-06-21 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
KR100202425B1 (ko) | Speech recognition system for recognizing remote control commands of home appliances | |
Liu et al. | Speech emotion recognition with local-global aware deep representation learning | |
CN111627458B (zh) | Sound source separation method and device | |
US5150323A (en) | Adaptive network for in-band signal separation | |
US6038338A (en) | Hybrid neural network for pattern recognition | |
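KR0173923B1 (ko) | Phoneme segmentation method using a multilayer neural network | |
CN111160163B (zh) | Facial expression recognition method based on region relation modeling and information fusion modeling | |
Sunny et al. | Performance of different classifiers in speech recognition | |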
US5787408A (en) | System and method for determining node functionality in artificial neural networks | |
Singh | Speaker emotion Recognition System using Artificial neural network classification method for brain-inspired application | |
CN111371611A (zh) | Deep learning-based weighted network community discovery method and device | |
Midenet et al. | Learning associations by self-organization: the LASSO model | |
Lusquino Filho et al. | A weightless regression system for predicting multi-modal empathy | |
Mouawad et al. | On modeling affect in audio with non-linear symbolic dynamics | |
Sunny et al. | Feature extraction methods based on linear predictive coding and wavelet packet decomposition for recognizing spoken words in malayalam | |
Helmi et al. | Speech recognition with fuzzy neural network for discrete words | |
Sanchiz et al. | A neural network-based algorithm to detect dominant points from the chain-code of a contour | |
CN117763464B (zh) | Modeling method for a learning model coupling public-private topology patterns | |
Aggarwal et al. | Urban sound classification using neural networks | |
Webber | Generalisation and discrimination emerge from a self-organising componential network: a speech example | |
Ashurov et al. | Classification of Environmental Sounds Through Spectrogram-Like Images Using Dilation-Based CNN | |
Asikainen | ACOUSTIC SCENE CLASSIFICATION WITH INTERPRETABLE DEEP NEURAL NETWORKS | |
Dörfler | Learning how to listen: time-frequency analysis meets convolutional neural networks | |
Ehgartner et al. | Identification with CNNs by Fusing Face, Fingerprint, and Voice Recognition | |
JPH05143094A (ja) | Speaker recognition system |
Legal Events
Code | Title | Description |
---|---|---|
PA0109 | Patent application | St.27 status event code: A-0-1-A10-A12-nap-PA0109 |
R17-X000 | Change to representative recorded | St.27 status event code: A-3-3-R10-R17-oth-X000 |
PG1501 | Laying open of application | St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
A201 | Request for examination | |
PA0201 | Request for examination | St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
R17-X000 | Change to representative recorded | St.27 status event code: A-3-3-R10-R17-oth-X000 |
E902 | Notification of reason for refusal | |
PE0902 | Notice of grounds for rejection | St.27 status event code: A-1-2-D10-D21-exm-PE0902 |
E701 | Decision to grant or registration of patent right | |
PE0701 | Decision of registration | St.27 status event code: A-1-2-D10-D22-exm-PE0701 |
GRNT | Written decision to grant | |
PR0701 | Registration of establishment | St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
PR1002 | Payment of registration fee | St.27 status event code: A-2-2-U10-U11-oth-PR1002; Fee payment year number: 1 |
R18-X000 | Changes to party contact information recorded | St.27 status event code: A-5-5-R10-R18-oth-X000 |
PG1601 | Publication of registration | St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 4 |
PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 5 |
PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 6 |
FPAY | Annual fee payment | Payment date: 20050221; Year of fee payment: 7 |
PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 7 |
LAPS | Lapse due to unpaid annual fee | |
PC1903 | Unpaid annual fee | St.27 status event code: A-4-4-U10-U13-oth-PC1903; Not in force date: 20060320; Payment event data comment text: Termination Category: DEFAULT_OF_REGISTRATION_FEE |
PC1903 | Unpaid annual fee | St.27 status event code: N-4-6-H10-H13-oth-PC1903; Ip right cessation event data comment text: Termination Category: DEFAULT_OF_REGISTRATION_FEE; Not in force date: 20060320 |
R18-X000 | Changes to party contact information recorded | St.27 status event code: A-5-5-R10-R18-oth-X000 |
P22-X000 | Classification modified | St.27 status event code: A-4-4-P10-P22-nap-X000 |