EP0319078A3 - Method and apparatus for the determination of the begin and end points of isolated words in a speech signal - Google Patents

Method and apparatus for the determination of the begin and end points of isolated words in a speech signal

Info

Publication number
EP0319078A3
Authority
EP
European Patent Office
Prior art keywords
speech signal
determination
begin
end points
windows
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP88202629A
Other languages
German (de)
French (fr)
Other versions
EP0319078A2 (en)
Inventor
Dieter Dr. Mergel
Hermann Dr. Ney
Horst Tomaschewski
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Philips Intellectual Property and Standards GmbH
Koninklijke Philips NV
Original Assignee
Philips Patentverwaltung GmbH
Philips Gloeilampenfabrieken NV
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Philips Patentverwaltung GmbH, Philips Gloeilampenfabrieken NV, Koninklijke Philips Electronics NV
Publication of EP0319078A2
Publication of EP0319078A3
Legal status: Withdrawn

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/87 Detection of discrete points within a voice signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Analogue/Digital Conversion (AREA)
  • Traffic Control Systems (AREA)

Abstract

To determine the start and end points of a word signal within a speech signal consisting of words spoken in isolation, three adjacent windows are determined at each new digital value over the most recently received, stored digital values. The middle window is intended to contain the actual word signal. For each digital value, the length of this middle window is varied between a minimum and a maximum value, and a threshold value determined from the two neighboring windows is subtracted from the energy contained in the middle window. In this way the method according to the invention takes the entire speech signal into account rather than individual isolated regions, which makes a more reliable endpoint determination possible.
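
The three-window search described in the abstract can be sketched roughly as follows in Python. This is only an illustration under stated assumptions, not the patented implementation. The abstract does not say how the threshold is derived from the two neighboring windows or how the final decision is taken, so the sketch assumes that (a) both side windows have a fixed length side_len, (b) the threshold is the mean per-sample energy of the two side windows, multiplied by a margin and by the middle-window length, and (c) the candidate with the largest remaining energy is kept. The names find_word, side_len and margin are hypothetical.

```python
from itertools import accumulate


def find_word(samples, side_len, min_len, max_len, margin=2.0):
    """Return (score, begin, end) of the best middle window, or None."""
    # Prefix sums of squared samples give any window energy in O(1).
    prefix = [0.0] + list(accumulate(x * x for x in samples))

    def energy(lo, hi):
        # Energy of samples[lo:hi].
        return prefix[hi] - prefix[lo]

    best = None
    # t counts the samples received so far; the right window ends at t.
    for t in range(2 * side_len + min_len, len(samples) + 1):
        end = t - side_len  # the middle window ends here (exclusive)
        # Vary the middle-window length between its minimum and maximum.
        for mid_len in range(min_len, max_len + 1):
            begin = end - mid_len
            if begin - side_len < 0:
                break  # the left window would start before the signal
            # Assumed threshold: mean per-sample energy of both neighbors,
            # scaled by the margin and by the middle-window length.
            noise = (energy(begin - side_len, begin) + energy(end, t)) / (2 * side_len)
            score = energy(begin, end) - margin * noise * mid_len
            if best is None or score > best[0]:
                best = (score, begin, end)
    return best


# Ten loud samples embedded in low-level background.
samples = [0.1] * 20 + [1.0] * 10 + [0.1] * 20
result = find_word(samples, side_len=5, min_len=5, max_len=15)
print(result[1:])  # (20, 30)
```

Because the threshold is recomputed from the neighbors of every candidate window, a candidate that cuts into the word is penalized by the word energy leaking into its side windows, which is one way to read the abstract's remark that the whole signal is taken into account rather than isolated regions.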

EP88202629A 1987-11-24 1988-11-23 Method and apparatus for the determination of the begin and end points of isolated words in a speech signal Withdrawn EP0319078A3 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE3739681 1987-11-24
DE19873739681 DE3739681A1 (en) 1987-11-24 1987-11-24 METHOD FOR DETERMINING THE START AND END POINTS OF ISOLATED SPOKEN WORDS IN A SPEECH SIGNAL AND ARRANGEMENT FOR IMPLEMENTING THE METHOD

Publications (2)

Publication Number Publication Date
EP0319078A2 (en) 1989-06-07
EP0319078A3 (en) 1990-01-10

Family

ID=6341078

Family Applications (1)

Application Number Title Priority Date Filing Date
EP88202629A Withdrawn EP0319078A3 (en) 1987-11-24 1988-11-23 Method and apparatus for the determination of the begin and end points of isolated words in a speech signal

Country Status (4)

Country Link
US (1) US4945566A (en)
EP (1) EP0319078A3 (en)
JP (1) JPH01167799A (en)
DE (1) DE3739681A1 (en)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5148429A (en) * 1988-10-27 1992-09-15 Kabushiki Kaisha Toshiba Voice data transmission system and method
WO1993021588A1 (en) * 1992-04-10 1993-10-28 Avid Technology, Inc. Digital audio workstation providing digital storage and display of video information
US5634020A (en) * 1992-12-31 1997-05-27 Avid Technology, Inc. Apparatus and method for displaying audio data as a discrete waveform
US5596680A (en) * 1992-12-31 1997-01-21 Apple Computer, Inc. Method and apparatus for detecting speech activity using cepstrum vectors
US5692104A (en) * 1992-12-31 1997-11-25 Apple Computer, Inc. Method and apparatus for detecting end points of speech activity
US5675778A (en) * 1993-10-04 1997-10-07 Fostex Corporation Of America Method and apparatus for audio editing incorporating visual comparison
DE4422545A1 (en) * 1994-06-28 1996-01-04 Sel Alcatel Ag Start / end point detection for word recognition
US5596679A (en) * 1994-10-26 1997-01-21 Motorola, Inc. Method and system for identifying spoken sounds in continuous speech by comparing classifier outputs
US5638486A (en) * 1994-10-26 1997-06-10 Motorola, Inc. Method and system for continuous speech recognition using voting techniques
US5638487A (en) * 1994-12-30 1997-06-10 Purespeech, Inc. Automatic speech recognition
US5819217A (en) * 1995-12-21 1998-10-06 Nynex Science & Technology, Inc. Method and system for differentiating between speech and noise
US6418431B1 (en) * 1998-03-30 2002-07-09 Microsoft Corporation Information retrieval and speech recognition based on language models
US6321197B1 (en) * 1999-01-22 2001-11-20 Motorola, Inc. Communication device and method for endpointing speech utterances
US6324509B1 (en) * 1999-02-08 2001-11-27 Qualcomm Incorporated Method and apparatus for accurate endpointing of speech in the presence of noise
US6865528B1 (en) * 2000-06-01 2005-03-08 Microsoft Corporation Use of a unified language model
US7031908B1 (en) 2000-06-01 2006-04-18 Microsoft Corporation Creating a language model for a language processing system
US8229753B2 (en) * 2001-10-21 2012-07-24 Microsoft Corporation Web server controls for web enabled recognition and/or audible prompting
US7711570B2 (en) * 2001-10-21 2010-05-04 Microsoft Corporation Application abstraction with dialog purpose
US8301436B2 (en) * 2003-05-29 2012-10-30 Microsoft Corporation Semantic object synchronous understanding for highly interactive interface
US7200559B2 (en) * 2003-05-29 2007-04-03 Microsoft Corporation Semantic object synchronous understanding implemented with speech application language tags
US8160883B2 (en) * 2004-01-10 2012-04-17 Microsoft Corporation Focus tracking in dialogs
US8311819B2 (en) * 2005-06-15 2012-11-13 Qnx Software Systems Limited System for detecting speech with background voice estimates and noise estimates
US8170875B2 (en) 2005-06-15 2012-05-01 Qnx Software Systems Limited Speech end-pointer
US7568758B2 (en) * 2007-01-03 2009-08-04 Kolcraft Enterprises High chairs and methods to use high chairs
US9099098B2 (en) * 2012-01-20 2015-08-04 Qualcomm Incorporated Voice activity detection in presence of background noise

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1986003047A1 (en) * 1984-11-08 1986-05-22 American Telephone & Telegraph Endpoint detector

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3243231A1 (en) * 1982-11-23 1984-05-24 Philips Kommunikations Industrie AG, 8500 Nürnberg METHOD FOR DETECTING VOICE BREAKS
JPS59115625A (en) * 1982-12-22 1984-07-04 Nec Corp Voice detector

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
PATENT ABSTRACTS OF JAPAN, unexamined applications, Section E, Vol. 1, No. 156, 13 December 1977, THE PATENT OFFICE JAPANESE GOVERNMENT, page 8422 E 77 *
PATENT ABSTRACTS OF JAPAN, unexamined applications, Section E, Vol. 3, No. 15, 9 February 1979, THE PATENT OFFICE JAPANESE GOVERNMENT, page 97 E 89 *
PATENT ABSTRACTS OF JAPAN, unexamined applications, Section E, Vol. 4, No. 5, 16 January 1980, THE PATENT OFFICE JAPANESE GOVERNMENT, page 10 E 165 *

Also Published As

Publication number Publication date
DE3739681A1 (en) 1989-06-08
JPH01167799A (en) 1989-07-03
EP0319078A2 (en) 1989-06-07
US4945566A (en) 1990-07-31

Similar Documents

Publication Publication Date Title
EP0319078A3 (en) Method and apparatus for the determination of the begin and end points of isolated words in a speech signal
EP0391396A3 (en) Drainage and instruments canal for arthroscopy
EP1209658A3 (en) Method and device for the treatment of speech information
DE2357067A1 (en) SPEECH ANALYSIS DEVICE
EP0300214A3 (en) Fish filleting device
DE3708002C2 (en)
DE3574739D1 (en) METHOD FOR PRODUCING HEAVY PROFILES FROM A WELDABLE STAINLESS STEEL AUSTENITIC STEEL.
DE1905680A1 (en) Signal processing system
EP0314035A3 (en) Method of making the superstructure of a permanent way
DE4441906C2 (en) Arrangement and method for speech synthesis
DE2357949A1 (en) PROCEDURE FOR DETERMINING THE INTERVAL CORRESPONDING TO THE PERIOD OF THE EXCITATION FREQUENCY OF THE VOICE RANGES
EP1236539A3 (en) Honing method
DE2307441C1 (en) Method for obfuscating speech signals
DE69318223T2 (en) METHOD FOR VOICE ANALYSIS
DE1910135A1 (en) Non-linear encoder
EP0834861A3 (en) Process for the calculation of a threshold value for the speech recognition of a key-word
DE2104012C3 (en) Electrical device for recognizing speech sounds
EP0203434A3 (en) Reinforcing steel, particularly for injection concrete
DE2315398C1 (en) Method for obfuscating speech signals
DE2807198A1 (en) Cup-shaped boring tool mfr. - in which cutters are welded onto rim of cup and end of central drill
EP0829803A3 (en) Digital signal processor and method for performing a multiplication in a digital signal processor
DE2950066C2 (en) Method for storing and reproducing an analog signal
EP0133256A3 (en) Method of manufacturing a drum, for example for a cable
DE1487540B2 (en) Process for the analysis and synthesis of electrical acoustic signals
DE897082C (en) Device for extracting and loading hard coal or the like.

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FR GB

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FR GB

17P Request for examination filed

Effective date: 19900626

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 19920603