JPS5982608A - System for controlling reproducing speed of sound - Google Patents
System for controlling reproducing speed of soundInfo
- Publication number
- JPS5982608A JPS5982608A JP57192310A JP19231082A JPS5982608A JP S5982608 A JPS5982608 A JP S5982608A JP 57192310 A JP57192310 A JP 57192310A JP 19231082 A JP19231082 A JP 19231082A JP S5982608 A JPS5982608 A JP S5982608A
- Authority
- JP
- Japan
- Prior art keywords
- section
- frame
- signal
- silent
- steady
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000001514 detection method Methods 0.000 claims description 9
- 238000000034 method Methods 0.000 claims description 8
- 238000000605 extraction Methods 0.000 claims description 6
- 230000005236 sound signal Effects 0.000 abstract description 8
- 238000010586 diagram Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000000593 degrading effect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)
Abstract
Description
【発明の詳細な説明】
発明の技術分野
本発明は、録音した音声を再生する際、再生音の周波数
を変化させることなく、再生速度を増加または減少させ
ることができる音声の再生速度制御方式に関するもので
ある。DETAILED DESCRIPTION OF THE INVENTION Technical Field of the Invention The present invention relates to a sound playback speed control method that can increase or decrease the playback speed without changing the frequency of the playback sound when playing back recorded sound. It is something.
従来技術と問題点
テープレコーダ等に於いて、再生速度を変化させると、
再生音の周波数が再生速度の変化に伴なって変化し、大
変聞きづらいものとなる。このような欠点をなくす為に
、音声を数10ミリ秒毎のフレームに分割し、フレーム
を単位として音声の間引き或は補間を行なうことによシ
、再生音の周波数を変化させることなく、再生速度を増
加或は減少できるようにしたものも提案されているが、
間引き或は補間を行なった後の隣接するフレームの接続
部の不連続点でクリック音が生じる為、再生音の明瞭性
が低下する欠点があった。Conventional technology and problems In tape recorders, etc., when the playback speed is changed,
The frequency of the reproduced sound changes as the reproduction speed changes, making it very difficult to hear. In order to eliminate this drawback, audio is divided into frames every several tens of milliseconds, and the audio is thinned out or interpolated in units of frames, thereby making it possible to reproduce sound without changing the frequency of the reproduced sound. Some proposals have been made that allow the speed to be increased or decreased, but
Since click sounds are generated at discontinuous points between adjacent frames after thinning or interpolation, there is a drawback that the clarity of the reproduced sound deteriorates.
また上述の如き欠点を改善する為に、フレーム接続部の
相互相関係数を計算し、フレームを連続的に接続できる
ように接続時期を微調整する方式も提案されているが、
間引きまたは補間を行なうフレームの長さと位置とを、
音声の性質とは無関係に固定的に決定している為、破裂
音のような短時間に状態が変化する音素の明瞭性が低下
する欠点があった。In order to improve the above-mentioned drawbacks, a method has also been proposed in which the cross-correlation coefficient of the frame connection part is calculated and the connection timing is finely adjusted so that frames can be connected continuously.
The length and position of frames to be thinned out or interpolated,
Because it is fixedly determined regardless of the nature of the voice, it has the disadvantage that the clarity of phonemes that change state in a short period of time, such as plosives, deteriorates.
また、この他にも、無音の部分のみを伸縮する方式も提
案されているが、この方式では実質的々音声部分の速度
制御は不可能である。In addition, a method has also been proposed in which only the silent portion is expanded or contracted, but with this method, it is virtually impossible to control the speed of the audio portion.
発明の目的
本発明は前述の如き欠点を改善したものであり、その目
的は、再生音の周波数を変化させることなく、且つ再生
音の品質を劣化させることなく、再生速度を増加或は減
少できるようにすることにある。以下実施例について詳
細に説明する。Purpose of the Invention The present invention improves the above-mentioned drawbacks, and its purpose is to increase or decrease the reproduction speed without changing the frequency of the reproduced sound and without degrading the quality of the reproduced sound. The purpose is to do so. Examples will be described in detail below.
発明の実施例
先ず、第1図の音声信号波形図を参照して、本発明の詳
細な説明する。同図に於いて、aは無音区間、bは定常
無声子音区間、Cは非定常区間、dは定常母音区間を表
わしている。Embodiments of the Invention First, the present invention will be described in detail with reference to the audio signal waveform diagram of FIG. In the figure, a represents a silent section, b a stationary unvoiced consonant section, C a non-stationary section, and d a stationary vowel section.
無音区間a1定常無声子音区間すに於いては、図示の如
く波形の振幅が小さい為、適当な区間長で間引き或は補
間を行なっても、再生音の明瞭性にはほとんど影響がな
い。また、定常母音区間dに於いては、図示の如く音声
の基本周期T毎に類似の波形が繰返されるので、基本周
期Tで間引き或は補間を行なうことによシ、音声の周波
数成分を変えることなく、シかも波形の不連続点をほと
んど生じることなく、再生音の伸縮が可能である。In the silent section a1, the steady unvoiced consonant section, the amplitude of the waveform is small as shown in the figure, so even if thinning or interpolation is performed with an appropriate section length, the clarity of the reproduced sound will hardly be affected. In addition, in the stationary vowel interval d, similar waveforms are repeated every basic period T of the voice as shown in the figure, so by thinning out or interpolating at the basic period T, the frequency components of the voice can be changed. It is possible to expand and contract the reproduced sound without causing any waveform discontinuities.
しかし、非定常区間Cに於いては、音声波形の性質が急
激に変化するものであるから、間引き或は補間を行なう
と、音素としての特徴が失なわれたシ、短い音素の場合
には、音素そのものが欠落したシして再生者の明瞭性が
低下する。However, in the non-stationary interval C, the properties of the speech waveform change rapidly, so if thinning or interpolation is performed, the characteristics of the phoneme will be lost, and in the case of short phonemes, , since the phoneme itself is missing, the intelligibility for the player deteriorates.
本発明は、上述した理由により、無音区間a1定常無声
子音区間すに於いては、適当な区間長で間引き或は補間
を行ない、また、非定常区間Cに於いては間引きも補間
にも行なわず、定常母音区間dに於いては、音声の基本
周期Tf:単位として間引き或は補間を行なうようにし
、音声の周波数成分を変化させることなく、シかも不連
続点をほとんど生じさせることなく、再生速度を増加或
は減少できるようにしたものである。For the reasons mentioned above, the present invention performs thinning or interpolation with an appropriate interval length in the silent interval a1 and the steady unvoiced consonant interval, and also performs thinning and interpolation in the unsteady interval C. First, in the stationary vowel interval d, the fundamental period Tf of the voice is thinned out or interpolated as a unit, without changing the frequency components of the voice and without causing almost any discontinuity. This allows the playback speed to be increased or decreased.
第2図は本発明の実施例のブロック線図であり、1はア
ナログ音声信号の入力端子、2はAD変換器、3は入力
バッファメモリ、4は無音検出部、5は定常性判定部、
6は基本周期抽出部、7は補間/間引き部、8は出力バ
ッファメモリ、9はDA変換器、IOは出力端子である
。FIG. 2 is a block diagram of an embodiment of the present invention, in which 1 is an analog audio signal input terminal, 2 is an AD converter, 3 is an input buffer memory, 4 is a silence detection section, 5 is a stationarity determination section,
Reference numeral 6 designates a basic period extraction section, 7 an interpolation/decimation section, 8 an output buffer memory, 9 a DA converter, and IO an output terminal.
入力端子1からのアナログ音声信号はAD変換器2でデ
ィジタル符号化された後、一定周期tのクロック信号に
よシ人カバツファメモリ3に蓄積される。入カバツ7ア
3からは、lフレーム毎に音声データが読出され、無音
検出部4に転送される。An analog audio signal from an input terminal 1 is digitally encoded by an AD converter 2 and then stored in a buffer memory 3 in accordance with a clock signal having a constant period t. Audio data is read out from the input cover 7a 3 every l frame and transferred to the silence detection section 4.
尚、1フレームには例えば32ミリ秒分の音声データが
収容されているとする。無音検出部4では、フレーム内
の音声データに基づいて、そのフレームに対応する音声
信号のパワーと零交差数とを算出し、それらが予め定め
た閾値よシ大きい場合は、そのフレームは有音、即ち、
非定常区間C或は定常母音区間dであるとみなし、定常
性判定部5へ音声データを転送する。また、予め定めた
閾値以下の場合は、そのフレームは無音、即ち無音区間
a或は無声子音区間すであるとみなし、フレーム内の音
声データを補間/間引き部7へ転送する。It is assumed that one frame contains, for example, 32 milliseconds worth of audio data. The silence detection unit 4 calculates the power and zero crossing number of the audio signal corresponding to the frame based on the audio data in the frame, and if these are greater than a predetermined threshold, the frame is determined to be sound. , that is,
It is assumed that this is an unsteady section C or a steady vowel section d, and the audio data is transferred to the stationarity determining section 5. If the value is less than or equal to a predetermined threshold, it is assumed that the frame is silent, that is, a silent section a or a silent consonant section, and the audio data in the frame is transferred to the interpolation/decimation unit 7.
定常性判定部5はフレーム内の音声テークに基づいて、
そのフレームに対応する音声信号の自己相関係数を算出
し、その極太値が予め定められている閾値より大きい場
合は、そのフレームは周期性を有する、即ち、定常母音
区間dであるとみなし、フレーム内の音声データを基本
周期抽出部6に転送する。また、自己相関係数の極大値
が予め淀められている閾値以下の場合は、そのフレーム
は非定常区間Cであるとみなし、フレーム内の音声デー
タを出力バッファメモリ9に加える。The stationarity determination unit 5 determines, based on the audio take within the frame,
Calculate the autocorrelation coefficient of the audio signal corresponding to that frame, and if the thickest value is larger than a predetermined threshold, consider that the frame has periodicity, that is, it is a stationary vowel interval d, The audio data within the frame is transferred to the fundamental period extraction section 6. If the maximum value of the autocorrelation coefficient is less than or equal to a predetermined threshold, the frame is considered to be in the non-stationary section C, and the audio data in the frame is added to the output buffer memory 9.
基本周期抽出部6は自己相関係数の極太値を力える係数
から音声信号の基本周期Tを算出し、音声データと共に
算出した基本周期Tを補間/間引き部7へ転送する。The fundamental period extraction section 6 calculates the fundamental period T of the audio signal from the coefficient that inputs the thickest value of the autocorrelation coefficient, and transfers the calculated fundamental period T to the interpolation/decimation section 7 together with the audio data.
補間/間引き部7は基本周期抽出部6よシ転送されてき
た音声データについては、基本周期Tを単位とし、所望
の再生速度に合わせた補間或は間引きを行ない、補間或
は間引きを行なった音声データを出力バッファメモリ8
に転送するものであり、例えば再生速度を1/2にする
場合には、第3図(A)に示すように、1フレームを、
基本周期Tを単位とする区間(1)〜(n−1)と余り
の区間(n)とに分割し、区間(1)〜(n−1)につ
いては、各区間を2回繰返しながら補間して出力し、余
シの区間(n)については、そのまま1回だけ出力する
ものである。また、例えば再生速度を2倍にする場合に
は、同図(B)に示すように、1フレームを、基本周期
Tを単位とする区間(1)〜(n−1)と余りの区間(
n)とに分割し、区間(1)〜(n−1)については、
1つおきに間引きして出力し、余りの区間(n)につい
てはそのまま出力するものである。The interpolation/decimation section 7 performs interpolation or thinning on the audio data transferred from the basic period extraction section 6, using the basic period T as a unit, and performs interpolation or thinning according to the desired playback speed. Output audio data buffer memory 8
For example, if you want to reduce the playback speed to 1/2, as shown in Figure 3 (A), one frame is
Divide into an interval (1) to (n-1) and a remainder interval (n) with the basic period T as a unit, and interpolate each interval by repeating each interval twice. The remaining section (n) is outputted only once as is. In addition, for example, when doubling the playback speed, one frame is divided into sections (1) to (n-1) whose unit is the basic period T and the remainder section (
n), and for interval (1) to (n-1),
Every other section is thinned out and output, and the remaining section (n) is output as is.
また、無音検出部4から直接転送されて来た音声データ
については、補間/間引き部7は、一定の長さく例えば
5ミリ秒)を単位とし、所望の再生速度に合わせた補間
或は間引きを行ない、補間或は間引きを行なった音声デ
ータを出力バッファメモリ8に転送する。出力バッファ
メモリ8に蓄積された音声データは一定周期tのクロッ
ク信号により読出され、DA変換器9を介して出力端子
10よ多出力される。Furthermore, regarding the audio data directly transferred from the silence detection unit 4, the interpolation/decimation unit 7 performs interpolation or thinning in units of a certain length (for example, 5 milliseconds) in accordance with the desired playback speed. The interpolated or thinned audio data is transferred to the output buffer memory 8. The audio data stored in the output buffer memory 8 is read out by a clock signal having a constant period t, and is outputted to an output terminal 10 via a DA converter 9.
発明の詳細
な説明したように、本発明は、録音した音声データを一
定長のフレームに分割するフレーム分割手段(実施例に
於いては入力バッファメモリ3等から成る)と、各フレ
ームが無音区間、無声子音区間、定常母音区間、非定常
区間の何れに対応しているかを検出する検出手段(実施
例に於いては無音検出部4と定常判定部5とから成る)
と、定常母音区間に於ける音声の基本周期を抽出する基
本周期抽出手段(実施例に於いては基本周期抽出部6か
ら成る)とを備え、音声波形が周期的に変化する定常母
音区間に対応したフレームについては、該フレームに収
容されている音声データを、音声の基本周期を単位とし
て補間或は間引きした後に再生するようにしたものであ
るから、不連続部分を減少させることができ、また、音
声波形が急激に変化する非定常区間に対応するフレーム
については、該フレームに収容されている音声データを
そのまま再生するものであるから、再生音の歪みを抑え
ることができ、従って、本発明によれば、再生速度を変
化させた場合に於いても高品質の再生音を得ることがで
きる利点がある。従って、本発明を、高速再生テープレ
コーダ、VTRの高速再生時の音声、留守番電話の高速
読出し、音声メールシステムの音声メツセージ編集、低
速再生を利用した口述筆記マシン、語学練習機など種々
の音声処理装置に適用すれば、非常に有効である。As described in detail, the present invention includes a frame dividing means (in the embodiment, consisting of an input buffer memory 3, etc.) that divides recorded audio data into frames of a fixed length, and a silent period for each frame. , a detection means for detecting whether it corresponds to a voiceless consonant section, a stationary vowel section, or an unsteady section (in the embodiment, it consists of a silence detection section 4 and a stationary determination section 5).
and a fundamental period extracting means (consisting of a fundamental period extracting section 6 in the embodiment) for extracting the fundamental period of speech in a stationary vowel interval, and a fundamental period extracting means (consisting of a fundamental period extracting section 6 in the embodiment), which extracts the fundamental period of speech in a stationary vowel interval in which the speech waveform changes periodically. Since the corresponding frame is played back after the audio data contained in the frame is interpolated or thinned out using the basic period of the audio as a unit, discontinuous parts can be reduced. In addition, for frames corresponding to unsteady sections where the audio waveform changes rapidly, the audio data contained in the frame is played back as is, so distortion of the reproduced sound can be suppressed, and therefore the main According to the invention, there is an advantage that high quality reproduced sound can be obtained even when the reproduction speed is changed. Therefore, the present invention can be applied to various audio processing applications such as high-speed playback tape recorders, high-speed playback of VTRs, high-speed reading of answering machines, voice message editing of voice mail systems, dictation machines using low-speed playback, language practice machines, etc. It is very effective when applied to equipment.
れぞれ音声データの補間、間引き方法の説明図である。
1は入力端子、2はAD変換器、3は入力バッファメモ
リ、4は無音検出部、5は定常性判定部、6は基本周期
検出部、7は補間/間引き部、8は出力バッファメモリ
、9はDA変換器、10は出力端子である。
特許出願人 日本電信電話公社
代理人弁理士 玉 蟲 久 五 部(外3名)′M 1
図
−ITI−
第 3 図FIG. 3 is an explanatory diagram of an interpolation method and a thinning method for audio data, respectively. 1 is an input terminal, 2 is an AD converter, 3 is an input buffer memory, 4 is a silence detection section, 5 is a stationarity determination section, 6 is a fundamental period detection section, 7 is an interpolation/decimation section, 8 is an output buffer memory, 9 is a DA converter, and 10 is an output terminal. Patent Applicant Nippon Telegraph and Telephone Public Corporation Representative Patent Attorney Hisa Gobu Tamamushi (3 others)'M 1
Figure-ITI- Figure 3
Claims (1)
度制御方式に於いて、前記音声データを一定長のフレー
ムに分割するフレーム分割手段と、前記各フレームが無
音区間、無声子音区間、定常母音区間、非定常区間の何
れに対応しているかを検出する検出手段と、前記定常母
音区間に於ける音声の基本周期を抽出する基本周期抽出
手段とを備え、前記無音区間及び無声子音区間に対応す
るフレームについては、該フレームに収容されている音
声データを、一定の長さを単位として、補間或は間引き
した後に再生し、前記定常母音区間に対応するフレーム
については、該フレームに収容されている音声データを
、前記基本周期抽出手段で抽出した基本周期を単位とし
て、補間或は間引きした後に再生し、前記非定常区間に
対応したフレームについては、該フレームに収容されて
いる音声データをそのまま再生することを特徴とする音
声の再生速度制御方式。In an audio playback speed control method for controlling the playback speed of recorded audio data, there is provided a frame dividing means for dividing the audio data into frames of a constant length, and each frame is divided into a silent section, a silent consonant section, and a steady vowel section. , a detection means for detecting which of the non-stationary intervals corresponds to the voice, and a fundamental period extraction means for extracting the fundamental period of the voice in the steady vowel interval, which corresponds to the silent interval and the voiceless consonant interval. As for the frame, the audio data contained in the frame is interpolated or thinned out in units of a certain length, and then played back, and the frame corresponding to the stationary vowel section is played back. The audio data is interpolated or thinned out using the basic period extracted by the basic period extraction means as a unit, and then played back, and for frames corresponding to the non-stationary section, the audio data contained in the frame is played back as is. An audio playback speed control method characterized by:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP57192310A JPS5982608A (en) | 1982-11-01 | 1982-11-01 | System for controlling reproducing speed of sound |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP57192310A JPS5982608A (en) | 1982-11-01 | 1982-11-01 | System for controlling reproducing speed of sound |
Publications (1)
Publication Number | Publication Date |
---|---|
JPS5982608A true JPS5982608A (en) | 1984-05-12 |
Family
ID=16289141
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP57192310A Pending JPS5982608A (en) | 1982-11-01 | 1982-11-01 | System for controlling reproducing speed of sound |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPS5982608A (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS61122700A (en) * | 1984-11-19 | 1986-06-10 | 富士通株式会社 | Synthetic voice pronunciation speed control system |
JPS61290499A (en) * | 1985-06-18 | 1986-12-20 | 日本電気株式会社 | Enunciation speed varying voice synthesizer |
JPS63204300A (en) * | 1987-02-20 | 1988-08-23 | 富士通株式会社 | encoded transmission device |
JPS63234299A (en) * | 1987-03-20 | 1988-09-29 | 株式会社日立製作所 | Speech analysis and synthesis method |
JPS649500A (en) * | 1987-07-01 | 1989-01-12 | Hitachi Ltd | Voice analysis and synchronization method and apparatus |
FR2695750A1 (en) * | 1992-09-17 | 1994-03-18 | Lefevre Frank | Speech signal treatment device for hard of hearing - has speech analyser investigating types of sound-noise, and adjusts signal treatment according to speech type |
WO1995014297A1 (en) * | 1992-09-17 | 1995-05-26 | Frank Lefevre | Device for processing a sound signal and apparatus comprising such a device |
JPH07192391A (en) * | 1993-09-25 | 1995-07-28 | Daewoo Electron Co Ltd | Adjusting device of reproducing speed of laser disk player |
WO1996013826A1 (en) * | 1994-10-28 | 1996-05-09 | Nippon Steel Corporation | Coded data decoding device and video/audio multiplexed data decoding device using it |
WO1997026647A1 (en) * | 1996-01-19 | 1997-07-24 | Matsushita Electric Industrial Co., Ltd. | Reproducing speed changer |
WO2017164216A1 (en) * | 2016-03-24 | 2017-09-28 | ヤマハ株式会社 | Acoustic processing method and acoustic processing device |
-
1982
- 1982-11-01 JP JP57192310A patent/JPS5982608A/en active Pending
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS61122700A (en) * | 1984-11-19 | 1986-06-10 | 富士通株式会社 | Synthetic voice pronunciation speed control system |
JPS61290499A (en) * | 1985-06-18 | 1986-12-20 | 日本電気株式会社 | Enunciation speed varying voice synthesizer |
JPS63204300A (en) * | 1987-02-20 | 1988-08-23 | 富士通株式会社 | encoded transmission device |
JPS63234299A (en) * | 1987-03-20 | 1988-09-29 | 株式会社日立製作所 | Speech analysis and synthesis method |
JPS649500A (en) * | 1987-07-01 | 1989-01-12 | Hitachi Ltd | Voice analysis and synchronization method and apparatus |
WO1995014297A1 (en) * | 1992-09-17 | 1995-05-26 | Frank Lefevre | Device for processing a sound signal and apparatus comprising such a device |
FR2695750A1 (en) * | 1992-09-17 | 1994-03-18 | Lefevre Frank | Speech signal treatment device for hard of hearing - has speech analyser investigating types of sound-noise, and adjusts signal treatment according to speech type |
JPH07192391A (en) * | 1993-09-25 | 1995-07-28 | Daewoo Electron Co Ltd | Adjusting device of reproducing speed of laser disk player |
WO1996013826A1 (en) * | 1994-10-28 | 1996-05-09 | Nippon Steel Corporation | Coded data decoding device and video/audio multiplexed data decoding device using it |
US5765136A (en) * | 1994-10-28 | 1998-06-09 | Nippon Steel Corporation | Encoded data decoding apparatus adapted to be used for expanding compressed data and image audio multiplexed data decoding apparatus using the same |
WO1997026647A1 (en) * | 1996-01-19 | 1997-07-24 | Matsushita Electric Industrial Co., Ltd. | Reproducing speed changer |
US6085157A (en) * | 1996-01-19 | 2000-07-04 | Matsushita Electric Industrial Co., Ltd. | Reproducing velocity converting apparatus with different speech velocity between voiced sound and unvoiced sound |
WO2017164216A1 (en) * | 2016-03-24 | 2017-09-28 | ヤマハ株式会社 | Acoustic processing method and acoustic processing device |
US10891966B2 (en) | 2016-03-24 | 2021-01-12 | Yamaha Corporation | Audio processing method and audio processing device for expanding or compressing audio signals |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6205420B1 (en) | Method and device for instantly changing the speed of a speech | |
Arons | Techniques, perception, and applications of time-compressed speech | |
JP2000511651A (en) | Non-uniform time scaling of recorded audio signals | |
JPS5982608A (en) | System for controlling reproducing speed of sound | |
JPH10187188A (en) | Method and device for speech reproducing | |
JP3619946B2 (en) | Speaking speed conversion device, speaking speed conversion method, and recording medium | |
US6085157A (en) | Reproducing velocity converting apparatus with different speech velocity between voiced sound and unvoiced sound | |
JP2001184100A (en) | Speaking speed converting device | |
JPH09152889A (en) | Speech speed transformer | |
US6070135A (en) | Method and apparatus for discriminating non-sounds and voiceless sounds of speech signals from each other | |
JP2001154684A (en) | Speech speed converter | |
JP3081469B2 (en) | Speech speed converter | |
JP2734028B2 (en) | Audio recording device | |
JP3357742B2 (en) | Speech speed converter | |
JPH0854895A (en) | Reproducing device | |
JP3201327B2 (en) | Recording and playback device | |
KR100359988B1 (en) | real-time speaking rate conversion system | |
JPH09146587A (en) | Speech speed changer | |
JPS62125577A (en) | Voice storing and reproducing device | |
JPS61122700A (en) | Synthetic voice pronunciation speed control system | |
JPH05303400A (en) | Method and device for audio reproduction | |
JPH08292796A (en) | Reproducing device | |
KR100337996B1 (en) | a controlling device for replaying audio signal and a controlling method therefor | |
JPH10214098A (en) | Voice converting toy | |
JP2962777B2 (en) | Audio signal time-base expansion / compression device |