SE9601811L - Speech-to-speech conversion method and system with extraction of prosody information

Speech-to-speech conversion method and system with extraction of prosody information

Info

Publication number
SE9601811L
Authority
SE
Sweden
Prior art keywords
speech
information
prosody information
extraction
inputs
Prior art date
Application number
SE9601811A
Other languages
Swedish (sv)
Other versions
SE9601811D0 (en)
SE506003C2 (en)
Inventor
Bertil Lyberg
Original Assignee
Telia Ab
Priority date
1996-05-13
Filing date
1996-05-13
Publication date
1997-11-03
Application filed by Telia Ab
Priority to SE9601811A (SE506003C2)
Publication of SE9601811D0
Priority to EP97919840A (EP0919052B1)
Priority to DE69723449T (DE69723449T2)
Priority to DK97919840T (DK0919052T3)
Priority to PCT/SE1997/000583 (WO1997043756A1)
Publication of SE9601811L
Publication of SE506003C2
Priority to NO19985179A (NO318557B1)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033 Voice editing, e.g. manipulating the voice of the synthesiser
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/08 Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G10L13/10 Prosody rules derived from text; Stress or intonation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Use Of Switch Circuits For Exchanges And Methods Of Control Of Multiplex Exchanges (AREA)

Abstract

The invention provides a speech-to-speech conversion system and method in which prosody information is extracted from speech applied to the input of the system, or handled by the method. The prosody information takes the form of the fundamental tone curve of the input speech. The fundamental tone curve is used to obtain dialectal and sentence accent information for the input speech. The sentence accent information is used in interpreting the speech inputs, and the result of the interpretation is used to obtain speech information data from a database, which is used to formulate voice responses to the speech inputs. The dialectal information is used to ensure that each voice response has a dialect matching that of the respective speech input.
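
The abstract does not say how the fundamental tone curve is computed. As an illustration only, the sketch below estimates a fundamental frequency (F0) contour with frame-wise autocorrelation, a common pitch-determination technique that is swapped in here and is not taken from the patent. All function names, the frame and hop sizes, the 60-400 Hz search range, and the 0.3 voicing threshold are assumptions. Using the highest-F0 frame as a stand-in for the sentence accent position is likewise a crude, hypothetical proxy rather than the claimed method.

```python
import math


def estimate_f0(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Estimate F0 (Hz) of one frame by autocorrelation; None if unvoiced."""
    n = len(frame)
    if n == 0:
        return None
    mean = sum(frame) / n
    x = [s - mean for s in frame]          # remove DC offset
    energy = sum(s * s for s in x)
    if energy == 0.0:
        return None                        # silent frame
    lag_min = int(sample_rate / f0_max)    # shortest period searched
    lag_max = min(int(sample_rate / f0_min), n - 1)
    best_lag, best_corr = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Normalised (biased) autocorrelation at this lag.
        corr = sum(x[i] * x[i + lag] for i in range(n - lag)) / energy
        if corr > best_corr:
            best_lag, best_corr = lag, corr
    # Assumed voicing threshold: weak periodicity is treated as unvoiced.
    if best_lag is None or best_corr < 0.3:
        return None
    return sample_rate / best_lag


def f0_contour(samples, sample_rate, frame_ms=40, hop_ms=20):
    """Return one F0 estimate (or None) per analysis frame."""
    frame_len = int(sample_rate * frame_ms / 1000)
    hop = int(sample_rate * hop_ms / 1000)
    return [
        estimate_f0(samples[start:start + frame_len], sample_rate)
        for start in range(0, len(samples) - frame_len + 1, hop)
    ]


def peak_frame(contour):
    """Index of the voiced frame with the highest F0 (hypothetical accent proxy)."""
    voiced = [(f0, i) for i, f0 in enumerate(contour) if f0 is not None]
    return max(voiced)[1] if voiced else None


def synth_tone(freq_hz, duration_s, sample_rate):
    """Generate a pure sine tone, used here only as synthetic test input."""
    count = round(duration_s * sample_rate)
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(count)]


if __name__ == "__main__":
    rate = 8000
    signal = synth_tone(120.0, 0.2, rate) + synth_tone(180.0, 0.2, rate)
    contour = f0_contour(signal, rate)
    print(contour)
    print("highest-F0 frame:", peak_frame(contour))
```

A real system would work on recorded speech rather than sine tones, and would need further stages (dialect classification, interpretation, database lookup, response synthesis) that the abstract names but does not detail.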

Priority Applications (6)

Application Number | Publication | Priority Date | Filing Date | Title
SE9601811A | SE506003C2 (en) | 1996-05-13 | 1996-05-13 | Speech-to-speech conversion method and system with extraction of prosody information
EP97919840A | EP0919052B1 (en) | 1996-05-13 | 1997-04-08 | A method and a system for speech-to-speech conversion
DE69723449T | DE69723449T2 (en) | 1996-05-13 | 1997-04-08 | METHOD AND SYSTEM FOR LANGUAGE-TO-LANGUAGE IMPLEMENTATION
DK97919840T | DK0919052T3 (en) | 1996-05-13 | 1997-04-08 | A speech-to-speech conversion method and system
PCT/SE1997/000583 | WO1997043756A1 (en) | 1996-05-13 | 1997-04-08 | A method and a system for speech-to-speech conversion
NO19985179A | NO318557B1 (en) | 1996-05-13 | 1998-11-06 | Speech-to-speech conversion method and system

Applications Claiming Priority (1)

Application Number | Publication | Priority Date | Filing Date | Title
SE9601811A | SE506003C2 (en) | 1996-05-13 | 1996-05-13 | Speech-to-speech conversion method and system with extraction of prosody information

Publications (3)

Publication Number | Publication Date
SE9601811D0 (en) | 1996-05-13
SE9601811L (en) | 1997-11-03
SE506003C2 (en) | 1997-11-03

Family

ID=20402543

Family Applications (1)

Application Number | Publication | Priority Date | Filing Date | Title
SE9601811A | SE506003C2 (en) | 1996-05-13 | 1996-05-13 | Speech-to-speech conversion method and system with extraction of prosody information

Country Status (6)

Country Link
EP (1) EP0919052B1 (en)
DE (1) DE69723449T2 (en)
DK (1) DK0919052T3 (en)
NO (1) NO318557B1 (en)
SE (1) SE506003C2 (en)
WO (1) WO1997043756A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN1159702C (en) * | 2001-04-11 | 2004-07-28 | 国际商业机器公司 | Speech-to-speech translation system and method with emotion
US7181397B2 (en) * | 2005-04-29 | 2007-02-20 | Motorola, Inc. | Speech dialog method and system
DE102007011039B4 (en) * | 2007-03-07 | 2019-08-29 | Man Truck & Bus Ag | Hands-free device in a motor vehicle
US8150020B1 (en) | 2007-04-04 | 2012-04-03 | AT&T Intellectual Property II, L.P. | System and method for prompt modification based on caller hang ups in IVRs
US8024179B2 (en) * | 2007-10-30 | 2011-09-20 | AT&T Intellectual Property II, L.P. | System and method for improving interaction with a user through a dynamically alterable spoken dialog system
JP5282469B2 (en) | 2008-07-25 | 2013-09-04 | ヤマハ株式会社 | Voice processing apparatus and program
EP3389043A4 (en) | 2015-12-07 | 2019-05-15 | Yamaha Corporation | Speech interacting device and speech interacting method
CN113470670B (en) * | 2021-06-30 | 2024-06-07 | 广州资云科技有限公司 | Method and system for rapidly switching electric tone basic tone

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
GB2165969B (en) * | 1984-10-19 | 1988-07-06 | British Telecomm | Dialogue system
JPH0772840B2 (en) * | 1992-09-29 | 1995-08-02 | 日本アイ・ビー・エム株式会社 | Speech model configuration method, speech recognition method, speech recognition device, and speech model training method
SE9301596L (en) * | 1993-05-10 | 1994-05-24 | Televerket | Device for increasing speech comprehension when translating speech from a first language to a second language
SE504177C2 (en) * | 1994-06-29 | 1996-12-02 | Telia Ab | Method and apparatus for adapting a speech recognition equipment for dialectal variations in a language

Also Published As

Publication number Publication date
DK0919052T3 (en) 2003-11-03
NO985179D0 (en) 1998-11-06
NO985179L (en) 1998-11-11
SE9601811D0 (en) 1996-05-13
DE69723449T2 (en) 2004-04-22
WO1997043756A1 (en) 1997-11-20
SE506003C2 (en) 1997-11-03
NO318557B1 (en) 2005-04-11
DE69723449D1 (en) 2003-08-14
EP0919052A1 (en) 1999-06-02
EP0919052B1 (en) 2003-07-09

Similar Documents

Publication Publication Date Title
BR9815258A (en) System and method for auditing sgml data pages
Gårding Intonation in Swedish
EP0831460A3 (en) Speech synthesis method utilizing auxiliary information
Fitzpatrick-Cole The alpine intonation of Bern Swiss German
DE69811921D1 (en) DEVICE AND METHOD FOR DISTINGUISHING SIMILAR-SOUNDING WORDS IN VOICE RECOGNITION
DE68913669D1 (en) Pronunciation of names by a synthesizer.
EP0749109A3 (en) Speech recognition for tonal languages
SE9601811L (en) Speech-to-speech conversion method and system with extraction of prosody information
Schmidt et al. Phonetic transcription standards for european names (onomastica).
SE9600959D0 (en) Speech-to-speech translation method and apparatus
Bonafonte Cávez et al. A bilingual text-to-speech system in Spanish and Catalan
Olaszy et al. Prosody generation for German CTS/TTS systems (from theoretical intonation patterns to practical realisation)
SE9601812D0 (en) Improvements in, or Relating to, Speech-To-Speech Conversion
DE69908106D1 (en) EXTENSION OF A VOICE RECOGNITION Vocabulary Using Derived Words
Gustafson ONOMASTICA-Creating a multi-lingual dictionary of European names
SE9303902D0 (en) Device and method of speech synthesis
Thatphithakkul et al. The development of LOTUS-TRD: A Thai regional dialect speech corpus
Nebbia et al. A specialised speech synthesis technique for application to automatic reverse directory service
Jose et al. Malayalam Text-to-Speech
JP2658476B2 (en) Document Braille device
Adinlewa et al. Linguistics variation: A case study of Oka-Akoko, Ondo State, Nigeria
Van Dong et al. COMPUTATIONAL LINGUISTIC MATERIAL FOR VIETNAMESE SPEECH PROCESSING: APPLYING IN VIETNAMESE TEXT-TO-SPEECH.
JPH01224797A (en) Systematic voice synthesizing device
Carlson et al. Vowel dynamics in a text-to-speech system some considerations.
Hirschberg et al. Voice response systems: Technologies and applications