KR0173340B1 - Method for generating intonation using intonation pattern normalization and neural network learning in a text-to-speech converter - Google Patents
- Publication number
- KR0173340B1 (application number KR1019950055841A)
- Authority
- KR
- South Korea
- Prior art keywords
- sentence
- pattern
- accent
- word
- intonation
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Electrically Operated Instructional Devices (AREA)
Claims (1)
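- An intonation generation method using intonation pattern normalization and neural network learning, applied to a text-to-speech conversion apparatus, the method comprising: a first step (10 to 15) of reading speech data from a synthesis database (3), normalizing and standardizing the pitch patterns of syllables, learning word (eojeol) pitch patterns from the pitch values obtained by subtracting the mean pitch value of the eojeol from the mean pitch value of each syllable within the eojeol, estimating the reference intonation of the sentence from the mean pitch value of each eojeol in the sentence, and then building the grammatical attribute sequences that follow from the context of the sentence together with the corresponding intonation pattern table; and a second step (16 to 21) of, when a Korean sentence and its grammatical attribute sequence are input, assigning a first-order mean pitch value to each eojeol according to its position in the sentence in the sentence reference intonation generation process, finding the longest matching portion in the intonation pattern table by a left-first search over the input grammatical attribute sequence and assigning a second-order mean pitch value to the corresponding eojeol in the non-uniform-unit intonation pattern generation process, computing the change in the mean pitch value of each syllable with a neural network in the eojeol pitch pattern generation process, computing the pitch pattern of each syllable from the phoneme sequence that makes up the syllable and a standard pitch pattern table in the syllable pitch pattern generation process, and summing the results of these processes to generate and output the intonation of the whole sentence.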
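Two operations named in the claim can be illustrated with a minimal Python sketch: the eojeol-relative syllable pitch values used as the learning target in the first step, and the left-first longest-match lookup of the second step. This is not the patented implementation. The function names, the attribute tags (`N`, `JKS`, `V`, `EF`), the table contents, the pitch values, and the fallback offset of 0.0 for unmatched attributes are all hypothetical. The neural network and the syllable-level pattern table are omitted.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical table type: a run of grammatical attributes maps to one
# pitch offset per eojeol in that run (same length as the key).
PatternTable = Dict[Tuple[str, ...], List[float]]


def eojeol_pitch_offsets(syllable_means: Sequence[float]) -> List[float]:
    """Each syllable's mean pitch minus the mean pitch of its eojeol."""
    eojeol_mean = sum(syllable_means) / len(syllable_means)
    return [m - eojeol_mean for m in syllable_means]


def longest_match_offsets(
    attrs: Sequence[str], table: PatternTable, default: float = 0.0
) -> List[float]:
    """Scan left to right; at each position use the longest run found in the table."""
    max_len = max((len(key) for key in table), default=0)
    out: List[float] = []
    i = 0
    while i < len(attrs):
        # Try the longest candidate first, then shorter ones.
        for n in range(min(max_len, len(attrs) - i), 0, -1):
            key = tuple(attrs[i:i + n])
            if key in table:
                out.extend(table[key])
                i += n
                break
        else:
            # No entry starts here: assumed fallback, not taken from the claim.
            out.append(default)
            i += 1
    return out


def combine(baseline: Sequence[float], offsets: Sequence[float]) -> List[float]:
    """Sum the position-based baseline and the table-based offsets per eojeol."""
    return [b + o for b, o in zip(baseline, offsets)]


# Hypothetical data for illustration only.
table: PatternTable = {
    ("N",): [3.0],
    ("N", "JKS"): [5.0, -2.0],
    ("V", "EF"): [1.0, -8.0],
}
attrs = ["N", "JKS", "V", "EF"]
baseline = [120.0, 118.0, 114.0, 108.0]

offsets = longest_match_offsets(attrs, table)  # [5.0, -2.0, 1.0, -8.0]
print(combine(baseline, offsets))              # [125.0, 116.0, 115.0, 100.0]
print(eojeol_pitch_offsets([100.0, 120.0, 110.0]))  # [-10.0, 10.0, 0.0]
```

In this sketch the two-attribute entry `("N", "JKS")` wins over the shorter `("N",)` entry because longer runs are tried first at each position, which is what makes the matched units non-uniform in length.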
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019950055841A KR0173340B1 (ko) | 1995-12-23 | 1995-12-23 | Method for generating intonation using intonation pattern normalization and neural network learning in a text-to-speech converter |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019950055841A KR0173340B1 (ko) | 1995-12-23 | 1995-12-23 | Method for generating intonation using intonation pattern normalization and neural network learning in a text-to-speech converter |
Publications (2)
Publication Number | Publication Date |
---|---|
KR970050108A (ko) | 1997-07-29 |
KR0173340B1 (ko) | 1999-04-01 |
Family
ID=19444005
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1019950055841A KR0173340B1 (ko) | Method for generating intonation using intonation pattern normalization and neural network learning in a text-to-speech converter | 1995-12-23 | 1995-12-23 |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR0173340B1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11398223B2 (en) | 2018-03-22 | 2022-07-26 | Samsung Electronics Co., Ltd. | Electronic device for modulating user voice using artificial intelligence model and control method thereof |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
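KR102072162B1 (ko) | 2018-01-05 | 2020-01-31 | Seoul National University R&DB Foundation (서울대학교산학협력단) | Method and apparatus for artificial intelligence-based foreign language speech synthesis |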
1995
- 1995-12-23: KR application KR1019950055841A, patent KR0173340B1 (ko), status: not active (IP right cessation)
Also Published As
Publication number | Publication date |
---|---|
KR970050108A (ko) | 1997-07-29 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US6751592B1 (en) | Speech synthesizing apparatus, and recording medium that stores text-to-speech conversion program and can be read mechanically | |
KR100811568B1 (ko) | Method and apparatus for preventing speech comprehension by interactive voice response systems | |
JP5198046B2 (ja) | Speech processing device and program therefor | |
JP2007249212A (ja) | Method, computer program, and processor for text-to-speech synthesis | |
CN101156196A (zh) | Hybrid speech synthesizer, method and use | |
US7069216B2 (en) | Corpus-based prosody translation system | |
Yegnanarayana et al. | Significance of knowledge sources for a text-to-speech system for Indian languages | |
KR0146549B1 (ko) | Korean text-to-speech conversion method | |
Hoffmann et al. | Evaluation of a multilingual TTS system with respect to the prosodic quality | |
KR0173340B1 (ko) | Method for generating intonation using intonation pattern normalization and neural network learning in a text-to-speech converter | |
JPH08335096A (ja) | Text-to-speech synthesis device | |
JPS62138898A (ja) | Speech synthesis-by-rule system | |
Romsdorfer et al. | A mixed-lingual phonological component which drives the statistical prosody control of a polyglot TTS synthesis system | |
JPH037995A (ja) | Device for creating singing voice synthesis data | |
JPH03245192A (ja) | Method for determining the pronunciation of foreign-language words | |
Ouh-Young et al. | A Chinese text-to-speech system based upon a syllable concatenation model | |
Kaur et al. | BUILDING A TEXT-TO-SPEECH SYSTEM FOR PUNJABI LANGUAGE | |
Aparna et al. | Text to speech synthesis of Hindi language using polysyllable units | |
Tatham | Voice output for man-machine interaction | |
IMRAN | ADMAS UNIVERSITY SCHOOL OF POST GRADUATE STUDIES DEPARTMENT OF COMPUTER SCIENCE | |
Morton | PALM: psychoacoustic language modelling | |
JPH04350699A (ja) | Text-to-speech synthesis device | |
Khalil et al. | Optimization of Arabic database and an implementation for Arabic speech synthesis system using HMM: HTS_ARAB_TALK | |
JP2024111781A (ja) | Speech synthesis system and speech synthesis method | |
JPH09146576A (ja) | Prosody synthesis device based on a text-to-speech artificial neural network | |
Legal Events
Code | Title | Description |
---|---|---|
A201 | Request for examination | |
PA0109 | Patent application | Patent event code: PA01091R01D; Comment text: Patent Application; Patent event date: 19951223 |
PA0201 | Request for examination | Patent event code: PA02012R01D; Patent event date: 19951223; Comment text: Request for Examination of Application |
PG1501 | Laying open of application | |
E701 | Decision to grant or registration of patent right | |
PE0701 | Decision of registration | Patent event code: PE07011S01D; Comment text: Decision to Grant Registration; Patent event date: 19980929 |
GRNT | Written decision to grant | |
PR0701 | Registration of establishment | Comment text: Registration of Establishment; Patent event date: 19981029; Patent event code: PR07011E01D |
PR1002 | Payment of registration fee | Payment date: 19981029; End annual number: 3; Start annual number: 1 |
PG1601 | Publication of registration | |
PR1001 | Payment of annual fee | Payment date: 20010927; Start annual number: 4; End annual number: 4 |
PR1001 | Payment of annual fee | Payment date: 20020930; Start annual number: 5; End annual number: 5 |
PR1001 | Payment of annual fee | Payment date: 20031001; Start annual number: 6; End annual number: 6 |
PR1001 | Payment of annual fee | Payment date: 20041001; Start annual number: 7; End annual number: 7 |
PR1001 | Payment of annual fee | Payment date: 20051011; Start annual number: 8; End annual number: 8 |
PR1001 | Payment of annual fee | Payment date: 20061002; Start annual number: 9; End annual number: 9 |
PR1001 | Payment of annual fee | Payment date: 20070919; Start annual number: 10; End annual number: 10 |
FPAY | Annual fee payment | Payment date: 20081001; Year of fee payment: 11 |
PR1001 | Payment of annual fee | Payment date: 20081001; Start annual number: 11; End annual number: 11 |
LAPS | Lapse due to unpaid annual fee | |
PC1903 | Unpaid annual fee | Termination category: Default of registration fee; Termination date: 20100910 |