EP0319078A3 - Method and apparatus for the determination of the begin and end points of isolated words in a speech signal - Google Patents
Method and apparatus for the determination of the begin and end points of isolated words in a speech signal Download PDFInfo
- Publication number
- EP0319078A3 EP0319078A3 EP88202629A EP88202629A EP0319078A3 EP 0319078 A3 EP0319078 A3 EP 0319078A3 EP 88202629 A EP88202629 A EP 88202629A EP 88202629 A EP88202629 A EP 88202629A EP 0319078 A3 EP0319078 A3 EP 0319078A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- speech signal
- determination
- begin
- end points
- windows
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/87—Detection of discrete points within a voice signal
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mobile Radio Communication Systems (AREA)
- Analogue/Digital Conversion (AREA)
- Traffic Control Systems (AREA)
Abstract
Zur Ermittlung von Anfangs- und Endpunkt eines Wortsignals innerhalb eines Sprachsignals aus isoliert gesprochenen Wörtern werden bei jedem neuen Digitalwert drei benach barte Fenster für die letzten bisher eingetroffenen gespeicherten Digitalwerte bestimmt, von denen das mittlere Fenster das eigentliche Wortsignal enthalten soll. Die Länge dieses mittleren Fensters wird für jeden Digitalwert zwischen einem minimalen und einem maximalen Wert variiert, und von der darin enthaltenen Energie wird jeweils ein Schwellwert subtrahiert, der aus den beiden benachbarten Fenstern bestimmt wird. Auf diese Weise berücksichtigt das erfindungsgemäße Verfahren jeweils das gesamte Sprachsignal anstatt einzelner isolierter Bereiche, wodurch eine zuverlässigere Endpunktbestimmung möglich ist. To determine the start and end point of a word signal within a speech signal from isolated spoken Words are three adjacent to each new digital value bearded windows for the last ones so far stored digital values, of which the middle window contain the actual word signal should. The length of this middle window is for everyone Digital value between a minimum and a maximum Value varies, and is based on the energy contained therein each subtracts a threshold value from the two neighboring windows is determined. In this way the method according to the invention takes that into account entire speech signal instead of individual isolated Areas, creating a more reliable endpoint determination is possible.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE3739681 | 1987-11-24 | ||
DE19873739681 DE3739681A1 (en) | 1987-11-24 | 1987-11-24 | METHOD FOR DETERMINING START AND END POINT ISOLATED SPOKEN WORDS IN A VOICE SIGNAL AND ARRANGEMENT FOR IMPLEMENTING THE METHOD |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0319078A2 EP0319078A2 (en) | 1989-06-07 |
EP0319078A3 true EP0319078A3 (en) | 1990-01-10 |
Family
ID=6341078
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP88202629A Withdrawn EP0319078A3 (en) | 1987-11-24 | 1988-11-23 | Method and apparatus for the determination of the begin and end points of isolated words in a speech signal |
Country Status (4)
Country | Link |
---|---|
US (1) | US4945566A (en) |
EP (1) | EP0319078A3 (en) |
JP (1) | JPH01167799A (en) |
DE (1) | DE3739681A1 (en) |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5148429A (en) * | 1988-10-27 | 1992-09-15 | Kabushiki Kaisha Toshiba | Voice data transmission system and method |
WO1993021588A1 (en) * | 1992-04-10 | 1993-10-28 | Avid Technology, Inc. | Digital audio workstation providing digital storage and display of video information |
US5634020A (en) * | 1992-12-31 | 1997-05-27 | Avid Technology, Inc. | Apparatus and method for displaying audio data as a discrete waveform |
US5596680A (en) * | 1992-12-31 | 1997-01-21 | Apple Computer, Inc. | Method and apparatus for detecting speech activity using cepstrum vectors |
US5692104A (en) * | 1992-12-31 | 1997-11-25 | Apple Computer, Inc. | Method and apparatus for detecting end points of speech activity |
US5675778A (en) * | 1993-10-04 | 1997-10-07 | Fostex Corporation Of America | Method and apparatus for audio editing incorporating visual comparison |
DE4422545A1 (en) * | 1994-06-28 | 1996-01-04 | Sel Alcatel Ag | Start / end point detection for word recognition |
US5596679A (en) * | 1994-10-26 | 1997-01-21 | Motorola, Inc. | Method and system for identifying spoken sounds in continuous speech by comparing classifier outputs |
US5638486A (en) * | 1994-10-26 | 1997-06-10 | Motorola, Inc. | Method and system for continuous speech recognition using voting techniques |
US5638487A (en) * | 1994-12-30 | 1997-06-10 | Purespeech, Inc. | Automatic speech recognition |
US5819217A (en) * | 1995-12-21 | 1998-10-06 | Nynex Science & Technology, Inc. | Method and system for differentiating between speech and noise |
US6418431B1 (en) * | 1998-03-30 | 2002-07-09 | Microsoft Corporation | Information retrieval and speech recognition based on language models |
US6321197B1 (en) * | 1999-01-22 | 2001-11-20 | Motorola, Inc. | Communication device and method for endpointing speech utterances |
US6324509B1 (en) * | 1999-02-08 | 2001-11-27 | Qualcomm Incorporated | Method and apparatus for accurate endpointing of speech in the presence of noise |
US6865528B1 (en) * | 2000-06-01 | 2005-03-08 | Microsoft Corporation | Use of a unified language model |
US7031908B1 (en) | 2000-06-01 | 2006-04-18 | Microsoft Corporation | Creating a language model for a language processing system |
US8229753B2 (en) * | 2001-10-21 | 2012-07-24 | Microsoft Corporation | Web server controls for web enabled recognition and/or audible prompting |
US7711570B2 (en) * | 2001-10-21 | 2010-05-04 | Microsoft Corporation | Application abstraction with dialog purpose |
US8301436B2 (en) * | 2003-05-29 | 2012-10-30 | Microsoft Corporation | Semantic object synchronous understanding for highly interactive interface |
US7200559B2 (en) * | 2003-05-29 | 2007-04-03 | Microsoft Corporation | Semantic object synchronous understanding implemented with speech application language tags |
US8160883B2 (en) * | 2004-01-10 | 2012-04-17 | Microsoft Corporation | Focus tracking in dialogs |
US8311819B2 (en) * | 2005-06-15 | 2012-11-13 | Qnx Software Systems Limited | System for detecting speech with background voice estimates and noise estimates |
US8170875B2 (en) | 2005-06-15 | 2012-05-01 | Qnx Software Systems Limited | Speech end-pointer |
US7568758B2 (en) * | 2007-01-03 | 2009-08-04 | Kolcraft Enterprises | High chairs and methods to use high chairs |
US9099098B2 (en) * | 2012-01-20 | 2015-08-04 | Qualcomm Incorporated | Voice activity detection in presence of background noise |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1986003047A1 (en) * | 1984-11-08 | 1986-05-22 | American Telephone & Telegraph | Endpoint detector |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3243231A1 (en) * | 1982-11-23 | 1984-05-24 | Philips Kommunikations Industrie AG, 8500 Nürnberg | METHOD FOR DETECTING VOICE BREAKS |
JPS59115625A (en) * | 1982-12-22 | 1984-07-04 | Nec Corp | Voice detector |
-
1987
- 1987-11-24 DE DE19873739681 patent/DE3739681A1/en not_active Withdrawn
-
1988
- 1988-11-18 US US07/274,093 patent/US4945566A/en not_active Expired - Fee Related
- 1988-11-22 JP JP63293724A patent/JPH01167799A/en active Pending
- 1988-11-23 EP EP88202629A patent/EP0319078A3/en not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1986003047A1 (en) * | 1984-11-08 | 1986-05-22 | American Telephone & Telegraph | Endpoint detector |
Non-Patent Citations (3)
Title |
---|
PATENT ABSTRACTS OF JAPAN, unexamined applications, Sektion E, Band 1, Nr. 156, 13. Dezember 1977 THE PATENT OFFICE JAPANESE GOVERNMENT Seite 8422 E 77 * |
PATENT ABSTRACTS OF JAPAN, unexamined applications, Sektion E, Band 3, Nr. 15, 9. Februar 1979 THE PATENT OFFICE JAPANESE GOVERNMENT Seite 97 E 89 * |
PATENT ABSTRACTS OF JAPAN, unexamined applications, Sektion E, Band 4, Nr. 5, 16. JÛnner 1980 THE PATENT OFFICE JAPANESE GOVERNMENT Seite 10 E 165 * |
Also Published As
Publication number | Publication date |
---|---|
DE3739681A1 (en) | 1989-06-08 |
JPH01167799A (en) | 1989-07-03 |
EP0319078A2 (en) | 1989-06-07 |
US4945566A (en) | 1990-07-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0319078A3 (en) | Method and apparatus for the determination of the begin and end points of isolated words in a speech signal | |
EP0391396A3 (en) | Drainage and instruments canal for arthroscopy | |
EP1209658A3 (en) | Method and device for the treatment of speech information | |
DE2357067A1 (en) | SPEECH ANALYSIS DEVICE | |
EP0300214A3 (en) | Fish filleting device | |
DE3708002C2 (en) | ||
DE3574739D1 (en) | METHOD FOR PRODUCING HEAVY PROFILES FROM A WELDABLE STAINLESS STEEL AUSTENITIC STEEL. | |
DE1905680A1 (en) | Signal processing system | |
EP0314035A3 (en) | Method of making the superstructure of a permanent way | |
DE4441906C2 (en) | Arrangement and method for speech synthesis | |
DE2357949A1 (en) | PROCEDURE FOR DETERMINING THE INTERVAL CORRESPONDING TO THE PERIOD OF THE EXCITATION FREQUENCY OF THE VOICE RANGES | |
EP1236539A3 (en) | Honing method | |
DE2307441C1 (en) | Method for obfuscating speech signals | |
DE69318223T2 (en) | METHOD FOR VOICE ANALYSIS | |
DE1910135A1 (en) | Non-linear encoder | |
EP0834861A3 (en) | Process for the calculation of a threshold value for the speech recognition of a key-word | |
DE2104012C3 (en) | Electrical device for recognizing speech sounds | |
EP0203434A3 (en) | Reinforcing steel, particularly for injection concrete | |
DE2315398C1 (en) | Method for obfuscating speech signals | |
DE2807198A1 (en) | Cup-shaped boring tool mfr. - in which cutters are welded onto rim of cup and end of central drill | |
EP0829803A3 (en) | Digital signal processor and method for performing a multiplication in a digital signal processor | |
DE2950066C2 (en) | Method for storing and reproducing an analog signal | |
EP0133256A3 (en) | Method of manufacturing a drum, for example for a cable | |
DE1487540B2 (en) | Process for the analysis and synthesis of electrical acoustic signals | |
DE897082C (en) | Device for extracting and loading hard coal or the like. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE FR GB |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): DE FR GB |
|
17P | Request for examination filed |
Effective date: 19900626 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 19920603 |