EP1344211A1 - Device and method for differentiated speech output - Google Patents
Device and method for differentiated speech outputInfo
- Publication number
- EP1344211A1 EP1344211A1 EP01991746A EP01991746A EP1344211A1 EP 1344211 A1 EP1344211 A1 EP 1344211A1 EP 01991746 A EP01991746 A EP 01991746A EP 01991746 A EP01991746 A EP 01991746A EP 1344211 A1 EP1344211 A1 EP 1344211A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- speech
- output
- parameters
- voice
- parameter set
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
Definitions
- the present invention relates to a device for differentiated speech output or speech generation and an associated method, systems for use with the speech output device and combinations of a speech output device with at least two systems, in particular for use in a vehicle.
- a voice output module is directly assigned to each of these systems.
- PCM pulse code modulation
- MPEG subsequent compression
- Other systems use speech synthesis methods, which form words and sentences mainly by assembling syllable segments (phonemes) (signal manipulation).
- Methods are also known which are based on a full synthesis of the language.
- methods are known which implement the human vocal tract as an electrical equivalent and work with a tone generator and several downstream filters (source-filter model).
- a device that works according to this process is a so-called formant synthesizer (eg KLATTALK).
- KLATTALK formant synthesizer
- Such a formant synthesizer has the advantage that the voice characteristics can be influenced.
- the object of the invention is to provide a device and an associated method with which a differentiated speech output is possible, as well as systems for use with the speech output device and combinations of a speech output device with at least two systems, in particular for use in vehicles.
- the invention has the advantage that speech outputs for different systems are possible with a single speech output device or speech synthesis device, each system being identifiable by voice characteristic differences.
- a parameter set is assigned to each system, which is used by the speech synthesis device in a speech output by this system.
- a first parameter set for an on-board computer a second parameter set for a navigation system, a third parameter set for traffic information, a fourth parameter set for a TTS system (Text to Speach system), such as e-mail, and one or more further parameter sets 'provided for additional systems.
- TTS system Text to Speach system
- the speech synthesis device generates the speech output, for example with a soft female voice, e.g. B. for voice output of a navigation system, or with a hard male bass voice, e.g. B. for voice output of traffic reports.
- a soft female voice e.g. B. for voice output of a navigation system
- a hard male bass voice e.g. B. for voice output of traffic reports.
- a method and a device for a full synthesis of speech is used, preferably a formant synthesizer.
- the control parameters for the synthesizer are divided into classes.
- a class of dynamic parameters controls the articulation, like the movement of the speech tract when speaking.
- a second class of static parameters controls speaker-characteristic features, such as the generator basic frequency and fixed formants, which are used in a child, a woman or a male speaker are formed by the different geometric dimensions of the speech tract.
- the device according to the invention and the method according to the invention can be used in particular in systems of a vehicle.
- Each system has two options for voice output to control the voice output.
- the first way of voice output involves sending an output of control commands for voice articulation, the sequence of control parameters for words, sentences and sentence sequences being stored in the system.
- the second option for controlling the speech output is via a second output, which switches over a parameter set that is decisive for the speaker characteristic.
- the generator and formant parameters are also changed dynamically. This makes it possible to achieve audible differences in the prosody, such as the duration and / or emphasis on syllable segments and / or the sentence melody.
- prosodic modulation depending on e.g. B. from a traffic situation or a traffic situation can be used for the voice output of announcement texts.
- the explosiveness of information can be expressed by modulating the voice.
- the invention has the advantage that, for. B. in a vehicle only a single voice generator with a small parameter memory from multiple information sources can be controlled.
- the information sources can be equipped with different voice characteristics.
- a vocal tract synthesis device shows that the method is speaker-independent and no high-quality studio recordings are required.
- emotional expression in the voice can also be given according to the invention.
- the voice characteristics can be changed very easily using pre-made parameter templates.
- the procedure is also suitable for converting free texts into speech (Text to Speech), e.g. B. reading aloud email.
- FIG. 1 shows a basic illustration of a preferred embodiment of the invention for differentiated speech output with a plurality of systems according to the invention.
- the preferred embodiment of the invention shown in FIG. 1 has a speech output unit 1 with a speech synthesis device 10, which in the example is a vocal tract synthesis module and is based on full speech synthesis.
- a speech synthesis device 10 which in the example is a vocal tract synthesis module and is based on full speech synthesis.
- the speech synthesis device 10 is connected to an amplifier 12, the output 14 of which supplies an audio signal which outputs speech via a loudspeaker (not shown).
- the speech synthesis device 10 is assigned N parameter sets 21, 22 to 2N, which in the example shown are stored in a memory 20 of the speech output unit 1.
- N systems 31, 32 to 3N are shown, each of which is connected to the voice output unit 1 via a data connection, such as individual lines, a bus system or data channels.
- Each system can carry out a voice output via the voice output unit.
- an on-board computer 31 with an associated parameter set for the on-board Computer 21 a navigation system 32 with an associated parameter set for navigation 22, a traffic information system 33 with an associated parameter set for traffic information 23, an e-mail system such as TTS system 34 with an associated parameter set for e-mail 24.
- Additional systems 3N with a respective assigned parameter set 2N can be provided.
- a parameter set 23 can also be provided for traffic reports, for example, with which a hard male bass voice is used in the speech output.
- the sequence of the speech outputs can take place in succession according to the receipt of the order for the speech output from the systems.
- Information with a higher priority e.g. Traffic information in dangerous situations such as wrong-way drivers is first output by voice output.
- Information with the highest priority e.g. Information is immediately output from the on-board computer about malfunctions of the vehicle or the onset of slippery road surfaces, whereby an ongoing voice output can be interrupted. The interrupted speech output can then be completed or repeated.
- the invention has the advantage that systems with an acoustic display provide the driver with information from various systems without distracting him from his task, as is the case with visual displays. Costs can be saved by using a speech synthesis device that can be used by various on-board computers. Compared to previously used language-producing methods in navigation systems, for example, the storage space requirement can be reduced.
- the invention can be used particularly advantageously in motor vehicles.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Navigation (AREA)
- Traffic Control Systems (AREA)
Abstract
Description
Claims
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10063503A DE10063503A1 (en) | 2000-12-20 | 2000-12-20 | Device and method for differentiated speech output |
DE10063503 | 2000-12-20 | ||
PCT/EP2001/013488 WO2002050815A1 (en) | 2000-12-20 | 2001-11-21 | Device and method for differentiated speech output |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1344211A1 true EP1344211A1 (en) | 2003-09-17 |
EP1344211B1 EP1344211B1 (en) | 2011-02-16 |
Family
ID=7667936
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01991746A Expired - Lifetime EP1344211B1 (en) | 2000-12-20 | 2001-11-21 | Device and method for differentiated speech output |
Country Status (6)
Country | Link |
---|---|
US (1) | US7698139B2 (en) |
EP (1) | EP1344211B1 (en) |
JP (1) | JP2004516515A (en) |
DE (2) | DE10063503A1 (en) |
ES (1) | ES2357700T3 (en) |
WO (1) | WO2002050815A1 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2412046A (en) * | 2004-03-11 | 2005-09-14 | Seiko Epson Corp | Semiconductor device having a TTS system to which is applied a voice parameter set |
DE102005063077B4 (en) * | 2005-12-29 | 2011-05-05 | Airbus Operations Gmbh | Record digital cockpit ground communication on an accident-protected voice recorder |
ATE456845T1 (en) * | 2006-06-02 | 2010-02-15 | Koninkl Philips Electronics Nv | LANGUAGE DIFFERENTIATION |
DE102008019071A1 (en) * | 2008-04-15 | 2009-10-29 | Continental Automotive Gmbh | Method for displaying information, particularly in motor vehicle, involves occurring display of acoustic paraverbal information for display of information, particularly base information |
JP7133149B2 (en) * | 2018-11-27 | 2022-09-08 | トヨタ自動車株式会社 | Automatic driving device, car navigation device and driving support system |
JP7336862B2 (en) * | 2019-03-28 | 2023-09-01 | 株式会社ホンダアクセス | Vehicle navigation system |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5667470A (en) * | 1979-11-07 | 1981-06-06 | Canon Inc | Voice desk-top calculator |
US5559927A (en) * | 1992-08-19 | 1996-09-24 | Clynes; Manfred | Computer system producing emotionally-expressive speech messages |
US5561736A (en) * | 1993-06-04 | 1996-10-01 | International Business Machines Corporation | Three dimensional speech synthesis |
JPH08328573A (en) * | 1995-05-29 | 1996-12-13 | Sanyo Electric Co Ltd | Karaoke (sing-along machine) device, audio reproducing device and recording medium used by the above |
US5924068A (en) * | 1997-02-04 | 1999-07-13 | Matsushita Electric Industrial Co. Ltd. | Electronic news reception apparatus that selectively retains sections and searches by keyword or index for text to speech conversion |
JP3287281B2 (en) * | 1997-07-31 | 2002-06-04 | トヨタ自動車株式会社 | Message processing device |
JP3502247B2 (en) * | 1997-10-28 | 2004-03-02 | ヤマハ株式会社 | Voice converter |
DE19908137A1 (en) * | 1998-10-16 | 2000-06-15 | Volkswagen Ag | Method and device for automatic control of at least one device by voice dialog |
US20020087655A1 (en) * | 1999-01-27 | 2002-07-04 | Thomas E. Bridgman | Information system for mobile users |
GB9925297D0 (en) * | 1999-10-27 | 1999-12-29 | Ibm | Voice processing system |
US6181996B1 (en) * | 1999-11-18 | 2001-01-30 | International Business Machines Corporation | System for controlling vehicle information user interfaces |
US6539354B1 (en) * | 2000-03-24 | 2003-03-25 | Fluent Speech Technologies, Inc. | Methods and devices for producing and using synthetic visual speech based on natural coarticulation |
-
2000
- 2000-12-20 DE DE10063503A patent/DE10063503A1/en not_active Ceased
-
2001
- 2001-11-21 WO PCT/EP2001/013488 patent/WO2002050815A1/en active Application Filing
- 2001-11-21 EP EP01991746A patent/EP1344211B1/en not_active Expired - Lifetime
- 2001-11-21 ES ES01991746T patent/ES2357700T3/en not_active Expired - Lifetime
- 2001-11-21 JP JP2002551833A patent/JP2004516515A/en active Pending
- 2001-11-21 DE DE50115798T patent/DE50115798D1/en not_active Expired - Lifetime
-
2003
- 2003-06-20 US US10/465,839 patent/US7698139B2/en not_active Expired - Lifetime
Non-Patent Citations (1)
Title |
---|
See references of WO0250815A1 * |
Also Published As
Publication number | Publication date |
---|---|
EP1344211B1 (en) | 2011-02-16 |
DE50115798D1 (en) | 2011-03-31 |
US20030225575A1 (en) | 2003-12-04 |
DE10063503A1 (en) | 2002-07-04 |
US7698139B2 (en) | 2010-04-13 |
WO2002050815A1 (en) | 2002-06-27 |
ES2357700T3 (en) | 2011-04-28 |
JP2004516515A (en) | 2004-06-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69821673T2 (en) | Method and apparatus for editing synthetic voice messages, and storage means with the method | |
DE69031165T2 (en) | SYSTEM AND METHOD FOR TEXT-LANGUAGE IMPLEMENTATION WITH THE CONTEXT-DEPENDENT VOCALALLOPHONE | |
DE60112512T2 (en) | Coding of expression in speech synthesis | |
EP1892700A1 (en) | Method for speech recognition and speech reproduction | |
EP1105867B1 (en) | Method and device for the concatenation of audiosegments, taking into account coarticulation | |
EP1282897B1 (en) | Method for creating a speech database for a target vocabulary in order to train a speech recognition system | |
EP1121684B1 (en) | Method and device for information and/or messages by means of speech | |
EP1344211B1 (en) | Device and method for differentiated speech output | |
EP0058130B1 (en) | Method for speech synthesizing with unlimited vocabulary, and arrangement for realizing the same | |
EP2380171A2 (en) | Method and device for processing acoustic voice signals | |
EP1110203B1 (en) | Device and method for digital voice processing | |
DE19503419A1 (en) | Method and device for outputting digitally coded traffic reports using synthetically generated speech | |
DE69607928T2 (en) | METHOD AND DEVICE FOR PROVIDING AND USING DIPHONES FOR MULTI-LANGUAGE TEXT-BY-LANGUAGE SYSTEMS | |
WO2008064742A1 (en) | Method for the rendition of text information by speech in a vehicle | |
DE10033104C2 (en) | Methods for generating statistics of phone durations and methods for determining the duration of individual phones for speech synthesis | |
EP2592623B1 (en) | Technique for outputting an acoustic signal by means of a navigation system | |
DE19837661C2 (en) | Method and device for co-articulating concatenation of audio segments | |
DE69329375T2 (en) | Method for realizing tone curves for voice messages and method for speech synthesis and device for its application | |
DE3232835A1 (en) | Method and circuit group arrangement for speech synthesis | |
WO2000031722A1 (en) | Method for controlling duration in speech synthesis | |
EP3144929A1 (en) | Synthetic generation of a naturally-sounding speech signal | |
DE1922170A1 (en) | Speech synthesis system | |
EP1212748A1 (en) | Digital speech synthesis method with intonation reproduction | |
DE102023116308A1 (en) | Method for adapting an audio content to a competency profile of a user of a motor vehicle, computer program and/or computer-readable medium, data processing device, motor vehicle | |
DE102017213246A1 (en) | Method, apparatus and computer program for generating auditory messages |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20030425 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
RBV | Designated contracting states (corrected) |
Designated state(s): DE ES FR GB IT SE |
|
17Q | First examination report despatched |
Effective date: 20070808 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE ES FR GB IT SE |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REF | Corresponds to: |
Ref document number: 50115798 Country of ref document: DE Date of ref document: 20110331 Kind code of ref document: P |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 50115798 Country of ref document: DE Effective date: 20110331 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2357700 Country of ref document: ES Kind code of ref document: T3 Effective date: 20110428 |
|
REG | Reference to a national code |
Ref country code: SE Ref legal event code: TRGR |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20111117 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 50115798 Country of ref document: DE Effective date: 20111117 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 15 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 16 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 17 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20201126 Year of fee payment: 20 Ref country code: GB Payment date: 20201123 Year of fee payment: 20 Ref country code: ES Payment date: 20201214 Year of fee payment: 20 Ref country code: FR Payment date: 20201119 Year of fee payment: 20 Ref country code: IT Payment date: 20201130 Year of fee payment: 20 Ref country code: SE Payment date: 20201123 Year of fee payment: 20 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R071 Ref document number: 50115798 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: PE20 Expiry date: 20211120 |
|
REG | Reference to a national code |
Ref country code: SE Ref legal event code: EUG |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20211120 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FD2A Effective date: 20220228 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: ES Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20211122 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230502 |