[go: up one dir, main page]

WO1998025260A3 - Speech synthesis using dual neural networks - Google Patents

Speech synthesis using dual neural networks Download PDF

Info

Publication number
WO1998025260A3
WO1998025260A3 PCT/US1997/018815 US9718815W WO9825260A3 WO 1998025260 A3 WO1998025260 A3 WO 1998025260A3 US 9718815 W US9718815 W US 9718815W WO 9825260 A3 WO9825260 A3 WO 9825260A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
neural networks
speech synthesis
dual neural
speech parameters
Prior art date
Application number
PCT/US1997/018815
Other languages
French (fr)
Other versions
WO1998025260A2 (en
Inventor
Orhan Karaali
Noel Massey
Gerald Corrigan
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc filed Critical Motorola Inc
Priority to EP97946261A priority Critical patent/EP0932896A2/en
Publication of WO1998025260A2 publication Critical patent/WO1998025260A2/en
Publication of WO1998025260A3 publication Critical patent/WO1998025260A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A method (500, 600), device (201 and 206) and system (203) provide, in response to text/linguistic information, efficient generation of a parametric representation of speech. A coder parameter generating system provides a principal set and a supplementary set of speech parameters, the principal set of speech parameters being the parametric representation of speech. Then feedback is provided to the coder parameter generating system using the supplementary set of speech parameters to modify the principal set of speech parameters.
PCT/US1997/018815 1996-12-05 1997-10-15 Speech synthesis using dual neural networks WO1998025260A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP97946261A EP0932896A2 (en) 1996-12-05 1997-10-15 Method, device and system for supplementary speech parameter feedback for coder parameter generating systems used in speech synthesis

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US76162796A 1996-12-05 1996-12-05
US08/761,627 1996-12-05

Publications (2)

Publication Number Publication Date
WO1998025260A2 WO1998025260A2 (en) 1998-06-11
WO1998025260A3 true WO1998025260A3 (en) 1998-08-06

Family

ID=25062802

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1997/018815 WO1998025260A2 (en) 1996-12-05 1997-10-15 Speech synthesis using dual neural networks

Country Status (2)

Country Link
EP (1) EP0932896A2 (en)
WO (1) WO1998025260A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930754A (en) * 1997-06-13 1999-07-27 Motorola, Inc. Method, device and article of manufacture for neural-network based orthography-phonetics transformation

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5165008A (en) * 1991-09-18 1992-11-17 U S West Advanced Technologies, Inc. Speech synthesis using perceptual linear prediction parameters

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0710378A4 (en) * 1994-04-28 1998-04-01 Motorola Inc A method and apparatus for converting text into audible signals using a neural network

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5165008A (en) * 1991-09-18 1992-11-17 U S West Advanced Technologies, Inc. Speech synthesis using perceptual linear prediction parameters

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ARTIFICIAL NEURAL NETWORKS, 1993 THIRD INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, CAWLEY G.C. et al., "LSP Speech Synthesis Using Backpropogation Networks", pages 291-294. *
See also references of EP0932896A4 *

Also Published As

Publication number Publication date
WO1998025260A2 (en) 1998-06-11
EP0932896A2 (en) 1999-08-04
EP0932896A4 (en) 1999-09-08

Similar Documents

Publication Publication Date Title
GB2331826B (en) Context dependent phoneme networks for encoding speech information
AU1191899A (en) System and method for representing complex information auditorially
AU8593998A (en) Method and system for using speech recognition to access the internet, includingaccess via a telephone
AU4705796A (en) System amd method for generating and using context dependent sub-syllable models to recognize a tonal language
CA2317359A1 (en) A method and apparatus for interactive language instruction
AU5594996A (en) Method for producing oxygen and generating power using a solid electrolyte membrane integrated with a gas turbine
EP0683483A3 (en) Method and arrangement for converting speech into text.
AU5266596A (en) Method for signature and session key generation
AU1067900A (en) Network and language models for use in a speech recognition system
AU5170793A (en) Improved membrane computer keyboard and method
CA2161540A1 (en) A Method and Apparatus for Converting Text Into Audible Signals Using a Neural Network
AU3274395A (en) Method and system for continuous speech recognition using voting techniques
EP0750293A3 (en) State transition model design method and voice recognition method and apparatus using same
EP0922279A3 (en) Method and apparatus for executing a human-machine dialogue in the form of two-sided speech as based on a modular dialogue structure
AU5017393A (en) Keyboard and method for producing
AU1028697A (en) Method of operating a gas-turbine-powered generating set using low-calorific-value fuel
FI964304L (en) Method and plant for producing hydrogen peroxide
SG107089A1 (en) Music system, tone generator and musical tone-synthesizing method
GB9824762D0 (en) Self-service terminal
AU680788B2 (en) Method for producing oxygen and hydrogen
EP0646896A3 (en) System and method for generating a solid model.
BR9610837A (en) Reactor composite membrane and method for the synthesis of hydrogen peroxide
AUPO199796A0 (en) Method and device for generating hydrogen and oxygen
WO1998025260A3 (en) Speech synthesis using dual neural networks
GB2313243A (en) System,method and program product for generating a sine or cosine waveform utilizing combined look-up tables

Legal Events

Date Code Title Description
AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1997946261

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1997946261

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1997946261

Country of ref document: EP