JPH08137492A

JPH08137492A - Speech time base converter

Info

Publication number: JPH08137492A
Application number: JP7234903A
Authority: JP
Inventors: Kazuhiko Sato; 和彦佐藤; Masayuki Misaki; 正之三崎; Takeshi Norimatsu; 武志則松; Kimiharu Watanabe; 公治渡辺; Norikazu Ueno; 憲和上野
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1994-09-14
Filing date: 1995-09-13
Publication date: 1996-05-31
Anticipated expiration: 2015-09-13
Also published as: JP2874607B2

Abstract

(57)【要約】【目的】音響信号を記録時の速度以下で再生する際
に、記録時の音程を保ちながらデータの廃棄を極力抑え
自動的に聴きやすい速度に変換することのできる音声時
間軸変換装置を提供する。【構成】記録再生部１０１からＭ倍速再生された音声
は、Ａ／Ｄ変換器１０２で周期Ｔ／Ｍでデジタル信号に
変換される。この信号は有音無音判定部１０３で有音無
音判定され、書き込み制御部１０４により有音部分のみ
がバッファメモリ１０５に蓄積される。読み出し制御部
１０６は周期Ｔでバッファメモリ１０５からデータを読
み出し、時間軸制御部１０９ではそのデータの時間軸変
更処理を行う。データ残量監視部１０７ではバッファメ
モリ内の現在の書き込み、読み出し位置からデータ残量
を逐次計測し、この結果をもとに適応速度制御部１０８
でデータ残量に応じてあらかじめ定めた規則に基づいて
時間軸変換速度比を決定する。 (57) [Summary] [Purpose] When playing back an audio signal at a speed lower than the speed at which it was recorded, the audio time that can be automatically converted to a speed that is easy to listen to while minimizing the loss of data while maintaining the pitch at the time of recording. An axis conversion device is provided. [Structure] The sound reproduced at M times speed from the recording / reproducing unit 101 is converted into a digital signal at a cycle T / M by an A / D converter 102. This signal is subjected to voice / non-voice determination by the voice / non-voice determination unit 103, and only the voice / voice portion is accumulated in the buffer memory 105 by the write control unit 104. The read control unit 106 reads data from the buffer memory 105 at a cycle T, and the time axis control unit 109 performs a time axis changing process of the data. The data remaining amount monitoring unit 107 sequentially measures the data remaining amount from the current writing / reading position in the buffer memory, and based on this result, the adaptive speed control unit 108
Then, the time base conversion speed ratio is determined based on a predetermined rule according to the remaining amount of data.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ビデオテープレコーダ
ー（ＶＴＲ）等で音声の低速再生を行なう際に必要とな
る、音声の時間軸の長さを任意に伸長を行うことを可能
にする音声時間軸変換装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice capable of arbitrarily extending the length of the time axis of the voice, which is required when the voice is played back at a low speed on a video tape recorder (VTR) or the like. The present invention relates to a time axis conversion device.

【０００２】[0002]

【従来の技術】従来より、音声信号を記録時の速度と異
なる速度で再生する音声時間軸変換装置は存在する。例
えばテープレコーダーではテープの走行スピードを調節
して再生速度を速くしたり遅くしたりすることができ
る。しかし、再生スピードが変化するのに伴って同時に
音程も変化してしまうので、内容が聴きづらくなってし
まう。そこで、音程を変化させずに再生速度を変化させ
ることの可能な音声時間軸変換装置が提案されている。2. Description of the Related Art Conventionally, there has been an audio time base conversion device for reproducing an audio signal at a speed different from that at the time of recording. For example, in a tape recorder, the playback speed can be increased or decreased by adjusting the tape running speed. However, as the playback speed changes, the pitch changes at the same time, making it difficult to hear the content. Therefore, there has been proposed an audio time base conversion device capable of changing the reproduction speed without changing the pitch.

【０００３】以下、従来の音声時間軸変換装置について
図面を参照しながら説明する。図８は従来の音声時間軸
変換装置の構成を表すブロック図である。図８において
１は音響信号の記録および再生を行なう記録再生部、２
は再生されたアナログ信号をデジタル信号に変換するＡ
／Ｄ変換器、３はデジタルデータを蓄積するためのバッ
ファメモリ、４はＤ／Ａ変換器、５はバッファメモリへ
のデータの書き込みを制御する書き込み制御部、６はバ
ッファメモリのデータの読み出しを制御する読み出し制
御部である。A conventional speech time base converter will be described below with reference to the drawings. FIG. 8 is a block diagram showing the configuration of a conventional audio time base converter. In FIG. 8, 1 is a recording / reproducing unit for recording and reproducing an acoustic signal, 2
A converts the reproduced analog signal to a digital signal
/ D converter, 3 is a buffer memory for accumulating digital data, 4 is a D / A converter, 5 is a write controller for controlling writing of data into the buffer memory, and 6 is for reading data from the buffer memory. It is a read control unit for controlling.

【０００４】以上のように構成された音声時間軸変換装
置について以下にその動作を説明する。ここでは記録媒
体への記録速度以下で音声信号を再生する場合に、音程
を記録時の状態に戻して再生する音声時間軸変換装置に
ついて説明する。The operation of the speech time base conversion apparatus configured as described above will be described below. Here, a description will be given of an audio time axis conversion device that restores a pitch to a state at the time of recording and reproduces the audio signal when the audio signal is reproduced at a recording speed or less on a recording medium.

【０００５】まず、記録再生部１は記録時の速度で音響
信号を再生する。ここで記録再生部とは、例えばＶＴ
Ｒ、テープレコーダー等である。次に、記録再生部１か
ら再生された音響信号を、Ａ／Ｄ変換器２により標本化
定理を満足するサンプリング周期Ｔでデジタル信号に変
換させる。Ａ／Ｄ変換されたこれらのデジタル信号を、
書き込み制御部５により周期Ｔで順次バッファメモリ３
に蓄積させる。次に、バッファメモリ３に蓄積された各
デジタル信号を、一定時間に対応する区間ごとにグルー
プ化し、それぞれの区間を少なくとも２回以上繰り返し
連続で読み出す。最後に、読み出し制御部６により読み
出されたデジタル信号を、Ｄ／Ａ変換器４によりサンプ
リング周期Ｔでアナログ信号に変換する。これら一連の
処理により音程を変化させずに音声時間軸変換が実現で
きる。First, the recording / reproducing unit 1 reproduces an acoustic signal at a recording speed. Here, the recording / reproducing unit is, for example, VT.
R, tape recorder, etc. Next, the acoustic signal reproduced from the recording / reproducing unit 1 is converted into a digital signal by the A / D converter 2 at a sampling period T that satisfies the sampling theorem. These digital signals that have been A / D converted,
The write control unit 5 sequentially operates the buffer memory 3 at a cycle T.
Accumulate in. Next, the digital signals accumulated in the buffer memory 3 are grouped into sections corresponding to a fixed time, and the sections are repeatedly read at least twice or more continuously. Finally, the D / A converter 4 converts the digital signal read by the read control unit 6 into an analog signal at a sampling cycle T. By this series of processes, the voice time base conversion can be realized without changing the pitch.

【０００６】図９は時間軸変換速度比０．５の場合の処
理例を示している。（ａ）は記録時のデータを示してお
り、（ｂ）はバッファメモリに蓄えられたデータを示し
ている。（ｂ）の各ブロックを２回ずつ繰り返しながら
サンプリング周期Ｔで再生したものが（ｃ）のデータ列
となり、これは（ａ）のデータ列と音程が同じであり長
さが時間軸方向に２倍になっている。FIG. 9 shows an example of processing when the time-axis conversion speed ratio is 0.5. (A) shows the data at the time of recording, and (b) shows the data stored in the buffer memory. The data string of (c) is reproduced by repeating each block of (b) twice at the sampling cycle T. The data string has the same pitch as the data string of (a) and the length is 2 in the time axis direction. Is doubled.

【０００７】[0007]

【発明が解決しようとする課題】上述した従来例では、
音程は記録時のものと変わらないが、音声の速度が記録
媒体からの再生速度だけに依存してしまい、均一の再生
速度の制御しか行えない。さらに、繰り返されるブロッ
ク間では音声の連続性が損なわれ再生音の品質は低下し
聴きづらいのものになる。また、記録媒体からの再生速
度よりも遅い速度に変換する場合には入力信号を蓄積し
ておくバッファメモリが不足する可能性がある。その場
合、バッファメモリ上のデータを減らすために何らかの
データを破棄しなければならず、その結果、情報の欠落
が生じる。In the above-mentioned conventional example,
Although the pitch is the same as that at the time of recording, the speed of voice depends only on the reproduction speed from the recording medium, and only uniform reproduction speed control can be performed. Furthermore, the continuity of the voice is lost between the repeated blocks, and the quality of the reproduced sound deteriorates, making it difficult to hear. Further, when converting to a speed slower than the speed of reproduction from the recording medium, there is a possibility that the buffer memory for accumulating the input signal becomes insufficient. In that case, some data must be discarded in order to reduce the amount of data on the buffer memory, resulting in the loss of information.

【０００８】本発明は、上記課題を解決するものであ
り、記録媒体から記録速度を含めてそれ以下の再生速度
で読み出した場合に、情報の欠落をできるだけ抑制しな
がら自動的に聴きやすい速度に変更することが可能な音
声時間軸変換装置を提供することを目的とする。The present invention is intended to solve the above-mentioned problems, and when reading from a recording medium at a reproduction speed including a recording speed and lower than that, the information loss is suppressed as much as possible and the speed is automatically made easy to listen to. An object of the present invention is to provide a voice time axis conversion device that can be changed.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決するため
に本発明の請求項１記載の音声時間軸変換装置は、記録
媒体から記録速度以下の再生速度で音響信号を読み出す
記録再生部と、再生されたアナログ信号をデジタル信号
に変換するＡ／Ｄ変換器と、入力信号中の有音部と無音
部の判定を行う有音無音判定部と、入力信号を蓄積する
ためのバッファメモリと、バッファメモリへのデータの
書き込みおよびそのアドレスを制御する書き込み制御部
と、バッファメモリからのデータの読み出しおよびその
アドレスを制御する読み出し制御部と、現在の書き込み
アドレスおよび読み出しアドレスの位置関係からバッフ
ァメモリ中のデータ残量を計測するデータ残量監視部
と、データ残量に応じてあらかじめ定められた規則に基
づき時間軸変換速度比を決定する適応速度制御部と、適
応速度制御部で決定された時間軸変換速度比に従って音
声時間軸の伸長を行う時間軸制御部と、時間軸変換後の
データをアナログ信号に変換するＤ／Ａ変換器とを備え
たものである。In order to solve the above problems, an audio time axis converter according to claim 1 of the present invention comprises a recording / reproducing unit for reading out an audio signal from a recording medium at a reproduction speed equal to or lower than a recording speed, An A / D converter for converting the reproduced analog signal into a digital signal, a sound / silence judgment unit for judging a sound part and a silence part in the input signal, a buffer memory for accumulating the input signal, A write control unit that controls writing of data to the buffer memory and its address, a read control unit that controls reading of data from the buffer memory and its address, and the current write address and read address Data remaining amount monitoring unit that measures the remaining amount of data in the , A time axis control section for expanding the audio time axis according to the time axis conversion speed ratio determined by the adaptive speed control section, and D / for converting the data after time axis conversion into an analog signal. And an A converter.

【００１０】請求項２記載の音声時間軸変換装置は、デ
ータ残量に比例して時間軸変換速度比を決定する適応速
度制御部を備えたものである。An audio time base conversion device according to a second aspect of the present invention comprises an adaptive speed control unit which determines a time base conversion speed ratio in proportion to the remaining amount of data.

【００１１】請求項３記載の音声時間軸変換装置は、デ
ータ残量に非線形に対応した変換規則に基づき時間軸変
換速度比を決定する適応速度制御部を備えたものであ
る。According to another aspect of the present invention, there is provided an audio time base conversion device including an adaptive speed control unit for determining a time base conversion speed ratio based on a conversion rule that nonlinearly corresponds to the remaining amount of data.

【００１２】請求項４記載の音声時間軸変換装置は、連
続する同じ個数のサンプル値を含みＡ、Ｂの順に互いに
隣り合う区間Ａ、Ｂと、単調増加する重み係数を区間Ａ
のサンプル値に掛け合わせた値と、単調減少する重み係
数を区間Ｂのサンプル値に掛け合わせた値とを、加算す
ることによって得られるクロスフェード処理区間Ｘを、
Ａ−Ｘ−Ｂの順につなげた区間Ｄの長さと、データを直
接出力する無処理区間Ｃの長さを調整しながら、区間
Ｄ、Ｃを交互に出力することにより前記適応速度制御部
から出力される時間軸変換速度比にあわせて時間軸を調
整する時間軸制御部を備えたものである。According to a fourth aspect of the present invention, there is provided an audio time axis conversion apparatus in which sections A and B which include the same number of consecutive sample values and are adjacent to each other in the order of A and B and a weighting coefficient which increases monotonically are section A.
The crossfade processing section X obtained by adding the value obtained by multiplying the sample value of
Output from the adaptive speed control unit by alternately outputting the sections D and C while adjusting the length of the section D connected in the order of A-X-B and the length of the unprocessed section C that directly outputs the data. The time axis control unit adjusts the time axis according to the time axis conversion speed ratio.

【００１３】請求項５および請求項６記載の音声時間軸
変換装置は、データ残量が一定値以下の時はすべてのデ
ータをバッファメモリに蓄積し、一定値以上の場合は有
音区間のみをバッファメモリに蓄積する書き込み制御部
を備えたものである。According to the fifth aspect and the sixth aspect of the present invention, when the remaining amount of data is less than a certain value, all the data are accumulated in the buffer memory, and when the remaining amount is more than the certain value, only the voiced section is stored. A write control unit for accumulating in the buffer memory is provided.

【００１４】[0014]

【作用】請求項１記載の構成によれば、有音無音の判定
の結果、有音と判定された区間のみをバッファメモリに
蓄積し、書き込みと読み出しのアドレス位置からデータ
残量を計測し、その残量に応じて残量の多いときはでき
るだけ再生速度に近い速度で、残量が少なくなるにつれ
て徐々に低速になるといったように、無音の量に応じて
自動的に速度を調整でき、可能な限り有音部を遅く再生
することが可能となる。また、記録速度と同じ速度で再
生する場合でも、無音の量に応じて時間軸変換後の速度
を記録速度より低速になるように制御すれば、もともと
早口の音声や外国語ソースの聴き取りを容易にすること
も可能である。According to the structure of claim 1, only the section which is determined to be voiced as a result of the voiced / non-voiced determination is accumulated in the buffer memory, and the remaining amount of data is measured from the write and read address positions. Depending on the remaining amount, the speed can be automatically adjusted according to the amount of silence, such as when the remaining amount is high, the speed is as close as possible to the playback speed, and as the remaining amount decreases, the speed gradually decreases. It is possible to reproduce the sound part as late as possible. Even when playing at the same speed as the recording speed, if you control the speed after time axis conversion to be lower than the recording speed according to the amount of silence, you can hear fast-speaking voices or foreign language sources originally. It can also be made easy.

【００１５】請求項２記載の構成によれば、データ残量
と時間軸変換速度比が比例して変化するので、音声の速
度が急激に変化することが抑えられ、音声が聴き取りや
すくなる。According to the second aspect of the present invention, since the remaining amount of data and the time-axis conversion speed ratio change in proportion to each other, it is possible to prevent a rapid change in the speed of the voice and make the voice easy to hear.

【００１６】請求項３記載の構成によれば、低速の速度
に長くとどまるような非線形の変換規則を設定すれば、
時間軸変換速度比を低く抑えようとする効果により低速
側で安定して音声を再生することが可能であり、非常に
聴き取りやすい音声時間軸変換装置が実現できる。According to the third aspect of the present invention, if a non-linear conversion rule that stays at a low speed for a long time is set,
Due to the effect of suppressing the time-axis conversion speed ratio to be low, it is possible to stably reproduce the sound on the low speed side, and it is possible to realize an audio time-axis conversion device that is very easy to hear.

【００１７】請求項４記載の構成によれば、クロスフェ
ード処理区間長および無処理区間長の長さを調整するこ
とによって、任意の時間軸変換速度比で時間軸変換する
ことができる。さらに、クロスフェード処理を行う区間
長を固定長にすれば、バッファメモリに蓄積されたデー
タを直接出力する無処理区間長を調節するだけで、任意
の時間軸変換速度比を得ることができ、非常に簡単な構
成で任意の速度に変換可能な音声時間軸変換装置が実現
できる。According to the fourth aspect of the invention, the time axis conversion can be performed at an arbitrary time axis conversion speed ratio by adjusting the lengths of the crossfade processing section length and the non-processing section length. Furthermore, if the section length for crossfade processing is set to a fixed length, it is possible to obtain an arbitrary time-axis conversion speed ratio simply by adjusting the unprocessed section length that directly outputs the data accumulated in the buffer memory. It is possible to realize a voice time base conversion device that can convert to an arbitrary speed with a very simple configuration.

【００１８】請求項５および請求項６記載の構成によれ
ば、データ残量が少ない時点では入力信号の有音無音判
定結果にかかわらずすべてのデータをバッファメモリに
蓄積することにより、読み出すべきデータがなくなり途
中で再生音が無音状態となる（ミュート状態となる）こ
とがなく、違和感の少ない自然な再生音を得ることがで
きる。According to the fifth and sixth aspects, at the time when the remaining amount of data is small, all the data is stored in the buffer memory regardless of the sound / non-sound determination result of the input signal, so that the data to be read out. It is possible to obtain a natural reproduced sound with little discomfort, since the reproduced sound does not become silent (mute state) in the middle.

【００１９】[0019]

【実施例】以下、本発明の第１の実施例について図面を
参照しながら説明する。図１は本発明の第１の実施例に
おける音声時間軸変換装置の構成を示すブロック図であ
る。図１において、１０１は音響信号の記録および再生
を行う記録再生部、１０２は記録再生部１で再生された
アナログ信号をデジタル信号に変換するＡ／Ｄ変換器、
１０３は変換されたデジタル信号列が有音部であるか無
音部であるかを判定する有音無音判定部、１０４は有音
部と判定された信号列を書き込むバッファメモリ１０５
上への書き込みおよびそのアドレスを制御する書き込み
制御部、１０６はバッファメモリ１０５に蓄積されたデ
ータの読み出しおよびそのアドレスを制御する読み出し
制御部、１０７は現在のバッファメモリ１０５上への書
き込みアドレスと読み出しアドレスの位置関係からデー
タ残量を計測するデータ残量監視部、１０８はデータ残
量監視部で得られたデータ残量から時間軸変換速度比を
決定する適応速度制御部、１０９は適応速度制御部で決
定された時間軸変換速度比に従って時間軸の変換処理を
行う時間軸制御部、１１０は時間軸変換されたデジタル
データをアナログ信号に変換するＤ／Ａ変換器である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A first embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a speech time base conversion apparatus according to the first embodiment of the present invention. In FIG. 1, 101 is a recording / reproducing unit for recording and reproducing an acoustic signal, 102 is an A / D converter for converting an analog signal reproduced by the recording / reproducing unit 1 into a digital signal,
Reference numeral 103 denotes a sound / sound determination unit that determines whether the converted digital signal sequence is a voiced portion or a silence portion, and 104 is a buffer memory 105 for writing the signal sequence determined to be the voiced portion.
A writing control unit for controlling writing to the upper side and its address, 106 is a reading control unit for controlling reading of data stored in the buffer memory 105 and its address, and 107 is a present writing address and reading on the buffer memory 105. A data remaining amount monitoring unit that measures the data remaining amount from the positional relationship of addresses, 108 is an adaptive speed control unit that determines the time axis conversion speed ratio from the data remaining amount obtained by the data remaining amount monitoring unit, and 109 is an adaptive speed control A time axis control unit that performs time axis conversion processing according to the time axis conversion speed ratio determined by the unit, and 110 is a D / A converter that converts time axis converted digital data into an analog signal.

【００２０】以上のように構成された音声時間軸変換装
置について、以下その動作を図１を参照しながら詳細に
説明する。The operation of the speech time base conversion apparatus configured as described above will be described in detail below with reference to FIG.

【００２１】まず、記録再生部１０１から記録時のＭ
（≦１）倍の速度で音響信号が読み出される。以後、時
間軸変換速度比とは記録速度に対する相対速度を表すこ
ととする。ここで、記録再生部１０１での記録時のサン
プリング周期をＴとすると、記録再生部１０１よりＭ倍
速で再生された音響信号は逐次Ａ／Ｄ変換器１０２によ
りサンプリング周期Ｔ／Ｍでデジタル信号系列に変換さ
れる。有音無音判定部１０３によりこのサンプル値列が
有音部であるか無音部であるかを判定する。First, from the recording / reproducing unit 101, M at the time of recording
The acoustic signal is read at a speed of (≦ 1) times. Hereinafter, the time-axis conversion speed ratio will represent the relative speed to the recording speed. Here, assuming that the sampling period at the time of recording in the recording / reproducing unit 101 is T, the acoustic signal reproduced by the recording / reproducing unit 101 at M times speed is successively digital signal sequence at the sampling period T / M by the A / D converter 102. Is converted to. The voiced / unvoiced determination unit 103 determines whether the sample value sequence is a voiced part or a silent part.

【００２２】有音部か無音部かの判定は例えば次のよう
に行う。デジタル信号のサンプル値列をＳiとすると、
Ｎ個のサンプル値系列に対して（数１）を満たす場合
に、そのサンプル値列は有音、満たさない場合は無音で
あると判定する。ここでＰthはあらかじめ定めた有音無
音判定のためのしきい値である。The determination as to whether there is a sound part or a silent part is made as follows, for example. If the sample value sequence of the digital signal is Si,
When (Numerical formula 1) is satisfied for the N sample value series, it is determined that the sample value sequence is voiced, and when it is not satisfied, it is silent. Here, Pth is a predetermined threshold value for determining whether or not there is sound.

【００２３】[0023]

【数１】 [Equation 1]

【００２４】ここで、バッファメモリ１０５上で次にデ
ータを格納すべきアドレスを指し示すポインタ（以降、
書き込みポインタと呼ぶ）をＰｗとすると、上記（数
１）で有音と判定された場合には、書き込み制御部１０
４によりそのサンプル値系列が書き込みポインタＰｗの
示すバッファメモリ１０５のアドレス位置に順次格納さ
れ、その都度Ｐｗはインクリメントされる。逆に、無音
と判定された場合には、書き込み制御部１０４はバッフ
ァメモリ１０５へのデータの書き込み動作を停止する。
これによりバッファメモリ１０５には有音部のデータの
みが蓄積される。Here, a pointer (hereinafter, referred to as an address indicating an address at which data is to be stored next on the buffer memory 105)
If it is determined that there is sound in the above (Equation 1), the write control unit 10
4, the sample value series is sequentially stored at the address position of the buffer memory 105 indicated by the write pointer Pw, and Pw is incremented each time. On the contrary, when it is determined that there is no sound, the write control unit 104 stops the data writing operation to the buffer memory 105.
As a result, only the sound part data is stored in the buffer memory 105.

【００２５】なお、ここでは（数１）を満たす場合に有
音、満たさない場合に無音と判定したが、（数１）を満
たす区間に対してその前方、あるいは後方に続く無音と
判定された短いサンプル値列を有音区間に含めてもよ
い。It should be noted that here, it is determined that the sound is present when the expression (1) is satisfied, and the sound is not satisfied when the expression (1) is satisfied. However, it is determined that there is silence before or after the section satisfying the expression (1). A short sample value sequence may be included in the voiced section.

【００２６】読み出し制御部１０６では、バッファメモ
リ１０５のデータを周期Ｔで順次読み出し、時間軸制御
部１０９へ送る。ここで次に読み出すべきバッファメモ
リ１０５上のデータのアドレスを示すポインタ（以降、
書き込みポインタと呼ぶ）をＰｒとすると、データ残量
監視部１０７は、前記書き込みポインタＰｗと読み出し
ポインタＰｒとの位置関係から、バッファメモリ１０５
上のまだ処理されていないデータの残量を逐次計測す
る。The read control unit 106 sequentially reads the data in the buffer memory 105 at a cycle T and sends it to the time axis control unit 109. Here, a pointer indicating the address of the data on the buffer memory 105 to be read next (hereinafter,
Let Pr be the write pointer), the data remaining amount monitoring unit 107 determines the buffer memory 105 based on the positional relationship between the write pointer Pw and the read pointer Pr.
The remaining amount of the above unprocessed data is sequentially measured.

【００２７】図２はデータ残量の計測方法を示す説明図
であり、２つのポインタの位置関係により同図（ａ）
（ｂ）の２つの場合が考えられる。図２において、バッ
ファメモリの開始アドレスをa₀、終了アドレスをa
_n-1(ただし、a_n-1>a₀）とすると、まだ読み出されてい
ないデータ残量Ｚは図２の斜線部分で示され、次式で計
算できる。FIG. 2 is an explanatory diagram showing a method of measuring the remaining amount of data, and FIG. 2 (a) shows the positional relationship between two pointers.
Two cases of (b) can be considered. In FIG. 2, the start address of the buffer memory is a ₀ , and the end address is a
_{If n-1} (however, a _n-1 > a ₀ ), the remaining data amount Z that has not been read is shown by the shaded portion in FIG. 2 and can be calculated by the following equation.

【００２８】[0028]

【数２】 [Equation 2]

【００２９】これは、バッファメモリ１０５をいわゆる
巡回メモリとして扱っていることと等価である。ここ
で、メモリからデータを読み出し出力するためには、巡
回メモリ上で書き込みポインタＰｗが読み出しポインタ
Ｐｒより先行していなければならないため、ＰｗとＰｒ
が重なっている場合（Ｐｗ＝Ｐｒ）は、読み出し制御部
１０６が読み出し動作を停止し、読み出しポインタＰｒ
はそのときのアドレス値を維持する。ここで、ＰｗとＰ
ｒが重なった状態として、図２（ａ）の状態からＰｒが
Ｐｗに追いついた場合と、図２（ｂ）の状態からＰｗが
Ｐｒに追いついた場合の２通りの状態が考えられる。前
者は未処理データの量が０になった場合であり、後者は
未処理データの量がバッファメモリ１０５の容量と等し
くなった場合である。前者では、未処理データが完全に
なくなったのであるが、後者では、未処理データがバッ
ファメモリ１０５を埋めてしまい残っている。しかし後
者の場合も、これらの未処理データをすべて破棄しデー
タ残量を０にリセットする。This is equivalent to treating the buffer memory 105 as a so-called cyclic memory. Here, in order to read and output data from the memory, the write pointer Pw must precede the read pointer Pr on the cyclic memory, so Pw and Pr
If they overlap (Pw = Pr), the read control unit 106 stops the read operation and the read pointer Pr
Maintains the address value at that time. Where Pw and P
There are two possible overlapping states of r: one in which Pr catches up with Pw from the state in FIG. 2A, and one in which Pw catches up with Pr from the state in FIG. 2B. The former is a case where the amount of unprocessed data becomes 0, and the latter is a case where the amount of unprocessed data is equal to the capacity of the buffer memory 105. In the former case, unprocessed data is completely lost, but in the latter case, unprocessed data fills the buffer memory 105 and remains. However, also in the latter case, all of these unprocessed data are discarded and the remaining amount of data is reset to zero.

【００３０】適応速度制御部１０８では、データ残量監
視部１０７で得られたデータ残量Ｚの値をもとに、時間
軸変換速度比を設定する。すなわち、データ残量が多い
場合はできるだけ再生速度に近い速度に、データ残量が
少ない場合はできるだけ書き込みポインタＰｗに読み出
しポインタＰｒが追いつかないような遅い速度に、時間
軸変換後の速度がなるように時間軸変換速度比を設定す
る。The adaptive speed control unit 108 sets the time-axis conversion speed ratio based on the value of the data remaining amount Z obtained by the data remaining amount monitoring unit 107. That is, when the remaining amount of data is large, the speed after the time axis conversion is set to a speed as close as possible to the reproduction speed, and when the remaining amount of data is small, the speed is as slow as possible so that the read pointer Pr cannot catch up with the write pointer Pw. Set the time axis conversion speed ratio to.

【００３１】以下、適応速度制御部１０８の動作を、記
録再生部１０１から記録速度と同じ速度で再生した場合
について説明する。ここでは時間軸変換後の速度の最大
値を記録速度と同じ、最小値をＶ_oとする。図３はデー
タ残量とそれに対する時間軸変換速度比の関係を示して
おり、これが時間軸変換速度比を設定するための規則と
なる。図３（ａ）はデータ残量と時間軸変換速度比を線
形に対応づける規則を示している。この場合、時間軸変
換速度比Ｖは次式により算出できる。Hereinafter, the operation of the adaptive speed control unit 108 will be described in the case where the recording / reproducing unit 101 reproduces at the same speed as the recording speed. Here, the maximum value of the speed after the time axis conversion is the same as the recording speed, and the minimum value is V _o . FIG. 3 shows the relationship between the remaining amount of data and the time-axis conversion speed ratio for the remaining amount, which is the rule for setting the time-axis conversion speed ratio. FIG. 3A shows a rule for linearly associating the remaining amount of data with the time-axis conversion speed ratio. In this case, the time-axis conversion speed ratio V can be calculated by the following equation.

【００３２】[0032]

【数３】 (Equation 3)

【００３３】図３（ｂ）はデータ残量と時間軸変換速度
比を非線形に対応づける規則の例を示したものである。
ここで、２次曲線に対応づけるとすると、時間軸変換速
度比Ｖは次式により算出できる。FIG. 3 (b) shows an example of a rule for non-linearly associating the remaining amount of data with the time-axis conversion speed ratio.
Here, if it is made to correspond to a quadratic curve, the time axis conversion speed ratio V can be calculated by the following equation.

【００３４】[0034]

【数４】 [Equation 4]

【００３５】図３（ａ）の場合は、データ残量の増減に
おいて滑らかに時間軸変換速度比を変更することがで
き、一方、図３（ｂ）の場合はある程度の量のデータが
バッファメモリ１０５に蓄積されるまでは、できるだけ
遅い速度で安定させることができるという特徴がある。
図３（ｃ）は非線形の対応付けを階段状に規定した場合
の例であり、時間軸変換速度比Ｖは次式により算出でき
る。In the case of FIG. 3 (a), the time-axis conversion speed ratio can be changed smoothly when the remaining amount of data is increased or decreased, while in the case of FIG. 3 (b), a certain amount of data is stored in the buffer memory. It is characterized in that it can be stabilized at the slowest speed until it is stored in 105.
FIG. 3C is an example of the case where the non-linear correspondence is defined in a stepwise manner, and the time-axis conversion speed ratio V can be calculated by the following equation.

【００３６】[0036]

【数５】 (Equation 5)

【００３７】図３（ｃ）に示した規則は、（ｂ）の規則
とほぼ同等の制御を、より少ない演算量、回路規模で実
現することが可能である。According to the rule shown in FIG. 3C, it is possible to realize almost the same control as the rule of FIG. 3B with a smaller amount of calculation and a circuit scale.

【００３８】このように、図３の対応規則に基づき時間
軸変換速度比を決定することにより、記録速度と同じ速
度で再生された信号でも、無音部分がある程度以上存在
する入力信号に対しては、有音部分の速度を再生速度よ
りも遅い速度に設定することが可能であり、また、無音
部分が含まれない信号が継続する場合は最大の時間軸変
換速度比（ここでは記録速度に等しい速度、つまり、
１）が設定されるので、情報の欠落が発生しない。As described above, by determining the time-axis conversion speed ratio based on the correspondence rule of FIG. 3, even for a signal reproduced at the same speed as the recording speed, for an input signal having a silent portion to a certain extent or more. , It is possible to set the speed of the voiced part to a speed slower than the playback speed, and when the signal without the silent part continues, the maximum time-axis conversion speed ratio (equal to the recording speed here) Speed, that is,
Since 1) is set, there is no loss of information.

【００３９】なお、ここでは時間軸変換速度比の最大値
を１、すなわち記録速度としたが、最大値を１より小さ
な値（例えば０.８）としても同様の規則が設定でき
る。ただし、この場合は有音が継続したときに記録速度
で再生された信号に対しても最大でも０.８未満の時間
軸変換速度比にしか設定されないため、バッファメモリ
が一杯になりデータの書き込みができずにデータの一部
を廃棄する必要が生じる場合がある。これは、例えば図
２（ｂ）において、ＰｗがＰｒに追いついた場合に相当
するが、この場合上で述べたようにデータ残量を０にリ
セットすることにより、今までに蓄積されたバッファメ
モリの容量分のデータを廃棄することになる。Although the maximum value of the time-axis conversion speed ratio is 1, that is, the recording speed, the same rule can be set even if the maximum value is smaller than 1 (for example, 0.8). However, in this case, since the time axis conversion speed ratio of less than 0.8 is set at the maximum even for the signal reproduced at the recording speed when the sound continues, the buffer memory becomes full and the data is written. In some cases, it may be necessary to discard some of the data without being able to. This corresponds to the case where Pw catches up with Pr in FIG. 2B, for example. In this case, by resetting the remaining data amount to 0 as described above, the buffer memory accumulated so far is The data for the capacity will be discarded.

【００４０】例えば、バッファメモリの容量を２５６Ｋ
ビット、サンプリング周波数を１０ｋＨｚ、１サンプル
を８ビットデータとして扱う場合では、３２Ｋポイント
（約３.２秒）の有音データを廃棄することになる。For example, if the capacity of the buffer memory is 256K
If the bit and sampling frequency are 10 kHz and one sample is treated as 8-bit data, 32 K points (about 3.2 seconds) of voiced data is discarded.

【００４１】このように時間軸変換後の速度の最大値を
低く抑えるような設定を行えば、無音の量によってはデ
ータの一部が欠落するが、データ全体をよりゆっくりし
た聴きやすい速度で再生することが可能となる。By thus setting the maximum value of the speed after the time axis conversion to be low, a part of the data is lost depending on the amount of silence, but the entire data is reproduced at a slower and easy-to-listen speed. It becomes possible to do.

【００４２】適応速度制御部１０８により決定された時
間軸変換速度比Ｖの値は、時間軸制御部１０９に送出さ
れ、時間軸制御部１０９により時間軸変換速度比Ｖに応
じた時間軸の変換処理が施される。The value of the time axis conversion speed ratio V determined by the adaptive speed control unit 108 is sent to the time axis control unit 109, and the time axis conversion unit 109 converts the time axis according to the time axis conversion speed ratio V. Processing is performed.

【００４３】図４は時間軸制御部１０９の詳細構成を示
すブロック図である。図４において、４０１は全体の制
御を行う制御回路、４０２は制御回路の指令により重み
づけ加算を行うクロスフェード処理か無処理かを切り替
える切替回路、４０３は一時的にデータを保持するラッ
チ回路、４０４は重みづけ加算処理を行うクロスフェー
ド回路であり、その他のブロックは図１の同名のブロッ
クと同一のものであるので同番号を付してある。以下、
図４に沿って時間軸制御部１０９の動作を説明する。FIG. 4 is a block diagram showing a detailed structure of the time axis control unit 109. In FIG. 4, reference numeral 401 is a control circuit that performs overall control, 402 is a switching circuit that switches between cross-fade processing or non-processing that performs weighted addition according to a command from the control circuit, 403 is a latch circuit that temporarily holds data, Reference numeral 404 is a crossfade circuit that performs weighted addition processing, and the other blocks are the same as the blocks of the same name in FIG. Less than,
The operation of the time axis control unit 109 will be described with reference to FIG.

【００４４】まず、制御回路４０１は時間軸変換速度比
Ｖを実現するために、クロスフェード区間長Ｋと無処理
区間長Ｓを決定する。ここでは、クロスフェード区間長
Ｋを固定値とするが、時間軸変換速度比Ｖに応じてＫの
値を可変にすることももちろん可能である。First, the control circuit 401 determines the crossfade section length K and the unprocessed section length S in order to realize the time-axis conversion speed ratio V. Here, the crossfade section length K is a fixed value, but it is of course possible to change the value of K according to the time-axis conversion speed ratio V.

【００４５】図５は時間軸変換処理を説明するための模
式図であり、図５（ａ）は処理前のデータ、図５（ｂ）
は処理後のデータを表している。また、図５（ｂ）のデ
ータ列Ａとデータ列Ｂに挟まれた長さＫの部分は、デー
タ列Ａとデータ列Ｂをクロスフェード処理して得られた
データ列である。FIG. 5 is a schematic diagram for explaining the time axis conversion processing. FIG. 5 (a) is data before processing, and FIG. 5 (b).
Indicates data after processing. Further, the portion of the length K sandwiched between the data string A and the data string B in FIG. 5B is a data string obtained by crossfading the data string A and the data string B.

【００４６】ここで、時間軸変換速度比Ｖを実現するた
めには、処理前のデータ列Ａ、Ｂ、Ｃの合計の長さ（２
Ｋ＋Ｓ）の１／Ｖが時間軸処理後のデータ列長（３Ｋ＋
Ｓ）になるようにＳを設定すればよい。そこで、無処理
区間長Ｓは次式により算出される。Here, in order to realize the time-axis conversion speed ratio V, the total length of the unprocessed data strings A, B and C (2
1 / V of (K + S) is the data string length (3K +
S may be set so as to be S). Therefore, the unprocessed section length S is calculated by the following equation.

【００４７】[0047]

【数６】 (Equation 6)

【００４８】今、読み出しポインタＰｒが図５（ａ）の
データ列Ａの先頭を指しているとして、クロスフェード
処理について説明する。クロスフェード処理は、３つの
処理からなる。Now, assuming that the read pointer Pr points to the beginning of the data string A in FIG. 5A, the crossfade processing will be described. The crossfade process is composed of three processes.

【００４９】まず第１の処理について説明する。図６は
クロスフェード処理の一部をフローチャートで示したも
のである。まず、時間軸変換速度比Ｖを参照しながら制
御回路４０１は切替回路４０２を無処理側に切り替える
（Ｓ１１）。次に、読み出し制御部１０６にポインタＰ
ｒの示すデータを読み出すように指令する（Ｓ１２）。
そして、読み出されたデータは、無処理で直接Ｄ／Ａ変
換器１１０に入力される（Ｓ１３）。最後に、読み出し
ポインタＰｒがインクリメントされる（Ｓ１４）。そし
て、データ列Ａを全て処理するまで、上記の動作を繰り
返す（Ｓ１５）。First, the first process will be described. FIG. 6 is a flowchart showing a part of the crossfade processing. First, the control circuit 401 switches the switching circuit 402 to the non-processing side while referring to the time-axis conversion speed ratio V (S11). Next, the read control unit 106 is given a pointer P.
An instruction is issued to read the data indicated by r (S12).
Then, the read data is directly input to the D / A converter 110 without any processing (S13). Finally, the read pointer Pr is incremented (S14). Then, the above operation is repeated until all the data strings A are processed (S15).

【００５０】次に第２の処理について説明する。まず、
制御回路４０１は読み出し制御部１０６に読み出しポイ
ンタＰｒがデータ列Ａの先頭を指すように指令する。次
に、制御回路４０１は切替回路４０２をクロスフェード
処理前に切り替えるとともに、読み出し制御部１０６に
読み出しポインタＰｒの示すデータを読み出すように指
令する。読み出されたデータはラッチ回路４０３に入力
され保持される。次に、制御回路４０１は読み出し制御
部１０６に、Ｋサンプル先の（Ｐｒ＋Ｋ）のアドレスの
示すデータを読み出すように指令する。読み出されたデ
ータは直接クロスフェード回路４０４に入力される。こ
のデータと直前にラッチ回路４０３に保持されたデータ
の２つのデータを用いて重みづけ加算を実行する。ここ
で、図５（ａ）のデータ列Ａをd₀、d₁、・・・、d_K-1、デ
ータ列Ｂをd_K、ｄ_K+1、・・・、d_2K-1とする。また、単調
増加する重み関数をｗ₁(t)（ただし、0≦ｗ₁(t)≦１、t
=0,1,・・・,Ｋ-1）、単調減少する関数をｗ₂(t)=1-ｗ₁(t)
とすると、重みづけ加算後の値Cｔは次式により得られ
る。Next, the second processing will be described. First,
The control circuit 401 commands the read control unit 106 so that the read pointer Pr points to the beginning of the data string A. Next, the control circuit 401 switches the switching circuit 402 before the crossfade process, and instructs the read control unit 106 to read the data indicated by the read pointer Pr. The read data is input to and held in the latch circuit 403. Next, the control circuit 401 commands the read control unit 106 to read the data indicated by the (Pr + K) address of the K sample destination. The read data is directly input to the crossfade circuit 404. Weighted addition is performed using this data and the two data held in the latch circuit 403 immediately before. Here, the data sequence A in FIG. 5A is d ₀ , d ₁ , ..., D _K-1 , and the data sequence B is d _K , d _{K + 1} , ..., d _2K-1 . . Also, the weighting function that increases monotonically is w ₁ (t) (where 0 ≦ w ₁ (t) ≦ 1, t
= 0,1, ..., K-1), the function that decreases monotonically is w ₂ (t) = 1-w ₁ (t)
Then, the value Ct after weighted addition is obtained by the following equation.

【００５１】[0051]

【数７】 (Equation 7)

【００５２】この後、読み出しポインタＰｒはインクリ
メントされる。制御回路４０１は上記と同じ処理とＫ回
続けて行う。図５（ａ）のデータ列Ａとデータ列Ｂとの
クロスフェード処理がすべて終了した後、読み出しポイ
ンタはクロスフェード開始時のＰｒの値にＫを加えたＰ
ｒ＋Ｋの値に設定される。After that, the read pointer Pr is incremented. The control circuit 401 continuously performs the same process as the above K times. After all the cross-fading processing of the data sequence A and the data sequence B of FIG. 5A is completed, the read pointer P
It is set to the value of r + K.

【００５３】次に第３の処理について説明する。第２の
処理が終了した時点では、読み出しポインタＰｒはデー
タ列Ｂの先頭を指しているので、このまま第１の処理に
おけるデータ列Ａに対する処理と同様の処理をデータ列
Ｂに対して施す。具体的には、制御回路４０１は、切替
回路４０２を無処理側に切り替え、読み出し制御部１０
６にポインタＰｒの示すデータを読み出すように指令す
る。読み出されたデータは無処理で直接Ｄ／Ａ変換器１
１０に入力される。最後に読み出しポインタＰｒがイン
クリメントされる。データ列Ｂをすべて処理するまで、
上記動作を繰り返す。Next, the third processing will be described. At the time when the second process is completed, the read pointer Pr points to the beginning of the data string B, so that the same process as the process for the data string A in the first process is performed on the data string B as it is. Specifically, the control circuit 401 switches the switching circuit 402 to the non-processing side, and the read control unit 10
6 is instructed to read the data indicated by the pointer Pr. The read data is directly processed by the D / A converter 1 without any processing.
Input to 10. Finally, the read pointer Pr is incremented. Until all data string B is processed,
The above operation is repeated.

【００５４】これらの３つの処理の結果クロスフェード
処理が終了すると、制御回路４０１は切替回路４０２を
無処理側に切り替え、（数６）で決定された長さＳに該
当する個数のデータがバッファメモリ１０５から読み出
され、直接Ｄ／Ａ変換器１１０に入力される。When the cross-fade process is completed as a result of these three processes, the control circuit 401 switches the switching circuit 402 to the non-processing side, and the data of the number corresponding to the length S determined in (Equation 6) is buffered. It is read from the memory 105 and directly input to the D / A converter 110.

【００５５】以降、長さ３Ｋのクロスフェード処理され
たデータと長さＳの無処理データを交互に出力すること
を繰り返すことにより、時間軸変換速度比Ｖでの時間軸
変換が実現できる。ある時点に適応速度制御部１０８で
設定される時間軸変換速度比が変更された場合は、（数
６）により無処理区間長を変更し、以降上記と同様の処
理を続けることにより、随時、時間軸変換速度比を変更
する。After that, the time axis conversion at the time axis conversion speed ratio V can be realized by repeating alternately outputting the cross-faded data of 3K in length and the unprocessed data of length S. When the time-axis conversion speed ratio set by the adaptive speed control unit 108 is changed at a certain time point, the non-processed section length is changed by (Equation 6), and the same process as above is continued thereafter, so that Change the time axis conversion speed ratio.

【００５６】このようにして時間軸変換処理されたデー
タ列は、最終的に周期ＴでＤ／Ａ変換器１１０によりア
ナログ信号に変換される。その結果、音程が記録時と同
一で、速度が再生速度Ｍ以下で適応的に切り替わる音声
信号を得ることができる。The data string thus time-axis converted is finally converted into an analog signal by the D / A converter 110 in the cycle T. As a result, it is possible to obtain an audio signal in which the pitch is the same as that at the time of recording and the speed is adaptively switched at the reproduction speed M or less.

【００５７】以上のように、上記した第１の実施例によ
ると、有音無音判定部１０３と、書き込みポインタと読
み出しポインタの位置関係からデータ残量を計測するデ
ータ残量監視部１０７と、データ残量に応じて時間軸変
換速度比を決定する適応速度制御部１０８とを設け、デ
ータ残量が多い場合は時間軸変換後の速度を徐々に速
く、データ残量が少ない場合には徐々に遅くなるように
制御することにより、低速に再生された音声信号をその
中に含まれる無音の量に応じて再生速度以下のゆっくり
した速度で、しかも情報の欠落がほとんど生じることな
く聴取することが可能となる。As described above, according to the first embodiment described above, the voice / sound determination unit 103, the data remaining amount monitoring unit 107 for measuring the data remaining amount from the positional relationship between the write pointer and the read pointer, and the data An adaptive speed control unit 108 that determines the time-axis conversion speed ratio according to the remaining amount is provided, and the speed after time-axis conversion is gradually increased when the data remaining amount is large, and gradually when the data remaining amount is small. By controlling so as to be slow, it is possible to listen to a low-speed reproduced audio signal at a slow speed equal to or lower than the reproduction speed according to the amount of silence contained therein, and with almost no loss of information. It will be possible.

【００５８】また、クロスフェード区間長と無処理区間
長を調整することによって、希望する速度に時間軸を変
換することのできる時間軸制御部１０９を設けたことに
より、高品質の時間軸変換が実現できる。さらに、クロ
スフェード区間長をあらかじめ設定した値に固定する
と、無処理区間長のみを変更するだけで任意の速度での
時間軸変換が達成され、非常に簡単な構成で音声時間軸
変換装置を実現することができる。Further, by providing the time axis control unit 109 capable of converting the time axis to a desired speed by adjusting the crossfade section length and the unprocessed section length, high quality time axis conversion can be performed. realizable. Furthermore, if the crossfade section length is fixed to a preset value, the time axis conversion can be achieved at any speed by changing only the unprocessed section length, and a voice time axis conversion device is realized with a very simple configuration. can do.

【００５９】特にＶＴＲなどのように画像を伴った記録
再生部では、例えば画像は１倍速再生で、音声のみを１
倍速以下のゆっくりした速度で再生することができ、聴
き易さが向上する効果が大きい。Particularly, in a recording / reproducing section accompanied by an image such as a VTR, for example, the image is reproduced at 1 × speed and only the sound is reproduced as 1
Since it can be played back at a slow speed of double speed or less, it has a great effect of improving the ease of listening.

【００６０】次に、本発明の音声時間軸変換装置の第２
の実施例について説明する。本実施例の音声時間軸変換
装置の基本構成は図１に示したブロック図と同一であ
り、書き込み制御部１０４の動作のみが異なる。Next, the second embodiment of the voice time base conversion device of the present invention
An example will be described. The basic configuration of the audio time axis converter of the present embodiment is the same as the block diagram shown in FIG. 1, and only the operation of the write controller 104 is different.

【００６１】図７は本実施例の書き込み制御部の動作を
示すフローチャートである。以下では、図７を参照しな
がら書き込み制御部の動作についてのみ説明する。書き
込み制御部１０４はデータ残量監視部１０７により計測
されたデータ残量Ｚの値を逐次取り込み（Ｓ１）、あら
かじめ設定したしきい値Ｚthと比較する（Ｓ２）。ここ
でＺがＺthよりも大きい場合はデータ残量が多いので、
有音無音判定部１０３の結果から現在の入力データが有
音か無音かを判断し（Ｓ３）、有音である場合のみその
データをバッファメモリ１０５に書き込み（Ｓ４）、書
き込みポインタＰｗをインクリメントする（Ｓ５）。Ｓ
２においてＺがＺthよりも小さい場合はデータ残量が少
ないので、有音無音の判定結果にかかわらず、データを
バッファメモリ１０５に書き込み、書き込みポインタＰ
ｗをインクリメントする。FIG. 7 is a flow chart showing the operation of the write controller of this embodiment. Hereinafter, only the operation of the write control unit will be described with reference to FIG. 7. The write control unit 104 sequentially takes in the value of the remaining data amount Z measured by the remaining data amount monitoring unit 107 (S1) and compares it with a preset threshold value Zth (S2). If Z is larger than Zth, there is a large amount of remaining data.
Based on the result of the sound / sound determination unit 103, it is determined whether or not the current input data is sound / sound (S3), and only when there is sound, the data is written in the buffer memory 105 (S4) and the write pointer Pw is incremented. (S5). S
If Z is smaller than Zth in 2, the remaining amount of data is small, so that the data is written to the buffer memory 105 regardless of the sound / sound determination result, and the write pointer P
Increment w.

【００６２】上記した一連の処理では、無音が多く含ま
れる信号の場合に、図２（ａ）において読み出しポイン
タＰｒが書き込みポインタＰｗに追いつかないよう、す
なわちデータ残量が０にならないように制御している。In the series of processes described above, in the case of a signal containing a lot of silence, control is performed so that the read pointer Pr does not catch up with the write pointer Pw in FIG. 2A, that is, the remaining amount of data does not become zero. ing.

【００６３】以上のように、上記した第２の実施例によ
ると、データ残量があらかじめ定めた一定量より少ない
ときにはすべてのデータをバッファメモリに蓄積する書
き込み制御部を設けたことにより、有音区間が少ない信
号を再生するときにデータ残量が０にならずに再生音が
途切れる（ミュート状態になる）ことが防止され、違和
感の少ない自然な再生が可能な音声時間軸変換装置を実
現することができる。As described above, according to the above-described second embodiment, when the remaining amount of data is less than the predetermined fixed amount, the write control unit for accumulating all the data in the buffer memory is provided, so that the voice sound is generated. When a signal with few sections is reproduced, it is possible to prevent the reproduction sound from being interrupted (mute state) without the data remaining amount becoming 0, and to realize a sound time axis conversion device capable of natural reproduction with less discomfort. be able to.

【００６４】[0064]

【発明の効果】以上のように、本発明では、記録速度Ｍ
（≦１）倍で再生された音響信号から有音無音判定によ
り有音部分のみをバッファメモリに蓄積し、書き込みと
読み出しのアドレス位置からバッファメモリに蓄積され
ているデータ残量を計測し、その結果に応じて、残量の
少ないときにはできるだけ低速に、残量が多くなるにつ
れて徐々に再生速度に近づくように、無音の量に応じて
自動的に時間軸変換後の速度を調整できるように構成し
たことにより、低速再生時に、音程は記録時と同一で、
しかも再生速度よりも遅く聴きやすい速度で音声を再生
することが可能な音声時間軸変換装置を提供することが
できる。これにより、記録速度で再生した音声をさらに
遅くすることが可能となり、早口の言葉や外国語ソース
の聴き取りを容易にすることができる。As described above, in the present invention, the recording speed M
Only the voiced part is stored in the buffer memory by the voiced / non-voiced determination from the audio signal reproduced at (≦ 1) times, and the remaining amount of data stored in the buffer memory is measured from the write and read address positions. Depending on the result, it is possible to automatically adjust the speed after time axis conversion according to the amount of silence so that it becomes as slow as possible when the remaining amount is low and gradually approaches the playback speed as the remaining amount increases. As a result, during low speed playback, the pitch is the same as during recording,
Moreover, it is possible to provide an audio time axis conversion device capable of reproducing audio at a speed slower than the reproduction speed and easy to listen to. As a result, the sound reproduced at the recording speed can be further slowed down, and it is possible to easily listen to fast-paced words and foreign language sources.

【００６５】また、クロスフェード区間長と無処理区間
長を調整することで、希望する速度に時間軸を変換する
時間軸制御部を設けることにより、任意の速度での時間
軸変換処理が実現でき、さらに、クロスフェード長を一
定にすれば、無処理区間長を調節するだけで任意の速度
での時間軸変換が実現できるので、簡単な回路構成で時
間軸変換された高品質な再生音を提供することができ
る。By adjusting the crossfade section length and the unprocessed section length to provide a time axis control unit for converting the time axis to a desired speed, time axis conversion processing at an arbitrary speed can be realized. Moreover, if the crossfade length is fixed, the time axis conversion can be realized at any speed simply by adjusting the length of the unprocessed section, so the time axis converted high quality playback sound can be achieved with a simple circuit configuration. Can be provided.

【００６６】また、データ残量が少ない時には有音無音
の判定にかかわらずすべてのデータをバッファメモリに
蓄積するように構成することにより、有音区間が少ない
信号を再生するときに出力するデータがなくなり再生途
中でミュート状態になることがなく、違和感の少ない自
然な再生音声を出力する音声時間軸変換装置を提供する
ことができる。Further, when the remaining amount of data is small, all the data is stored in the buffer memory regardless of whether the sound is present or not, so that the data to be output when the signal with a small number of voiced sections is reproduced. It is possible to provide a sound time axis conversion device that outputs a natural reproduced sound with little discomfort, without being muted during reproduction.

[Brief description of drawings]

【図１】本発明の一実施例における音声時間軸変換装置
の構成を示すブロック図FIG. 1 is a block diagram showing a configuration of an audio time base converter according to an embodiment of the present invention.

【図２】本実施例のメモリ残量の計測方法の説明図FIG. 2 is an explanatory diagram of a method for measuring a remaining memory amount according to the present embodiment.

【図３】本実施例に係る適応速度制御部の速度設定方法
の説明図FIG. 3 is an explanatory diagram of a speed setting method of an adaptive speed control unit according to the present embodiment.

【図４】本実施例に係る時間軸制御部の回路図FIG. 4 is a circuit diagram of a time axis control unit according to the present embodiment.

【図５】本実施例に係る時間軸制御部の動作を示す原理
図FIG. 5 is a principle diagram showing an operation of a time axis control unit according to the present embodiment.

【図６】本実施例に係る時間軸制御部の動作の一部を示
すフローチャートFIG. 6 is a flowchart showing a part of the operation of the time axis control unit according to the present embodiment.

【図７】本発明の第２の実施例に係る書き込み制御部の
動作を示すフローチャートFIG. 7 is a flowchart showing the operation of the write control unit according to the second embodiment of the present invention.

【図８】従来例の音声時間軸変換装置のブロック図FIG. 8 is a block diagram of a conventional audio time base converter.

【図９】従来例の音声時間軸変換装置のデータ処理方法
を示す原理図FIG. 9 is a principle diagram showing a data processing method of a conventional audio time base converter.

[Explanation of symbols]

１０１記録再生部１０２Ａ／Ｄ変換器１０３有音無音判定部１０４書き込み制御部１０５バッファメモリ１０６読み出し制御部１０７データ残量監視部１０８適応速度制御部１０９時間軸制御部１１０Ｄ／Ａ変換器 Reference numeral 101 recording / reproducing unit 102 A / D converter 103 sound / silence determination unit 104 writing control unit 105 buffer memory 106 reading control unit 107 remaining data amount monitoring unit 108 adaptive speed control unit 109 time axis control unit 110 D / A converter

───────────────────────────────────────────────────── フロントページの続き (72)発明者渡辺公治大阪府門真市大字門真1006番地松下電器産業株式会社内 (72)発明者上野憲和大阪府門真市大字門真1006番地松下電器産業株式会社内 ─────────────────────────────────────────────────── ─── Continuation of front page (72) Inventor Koji Watanabe 1006 Kadoma, Kadoma City, Osaka Prefecture Matsushita Electric Industrial Co., Ltd. (72) Norikazu Ueno, 1006 Kadoma, Kadoma City, Osaka Matsushita Electric Industrial Co., Ltd.

Claims

[Claims]

1. A recording / reproducing unit for reading an acoustic signal stored in a recording medium at a speed of M (≦ 1) times as high as that at the time of recording, and an analog signal read by the recording / reproducing unit is converted into a digital signal. An A / D converter, a sound / silence judging section for judging a sound part and a silent part of an input signal, a buffer memory for accumulating a digital signal output from the A / D converter, and the sound part. The write control unit that controls the write address so that only the data in the section that is determined to be sound by the silence determination unit is written in the buffer memory, the read control unit that reads the data in the buffer memory, and the last in the write control unit. By monitoring the positional relationship between the address on the buffer memory written in the buffer memory and the address on the buffer memory last read by the read controller. A data remaining amount monitoring unit that measures the amount of unprocessed data in the buffer memory, and a speed after time-axis conversion with respect to the recording speed according to a predetermined rule according to the remaining amount of data obtained from the data remaining amount monitoring unit An adaptive speed control unit that determines the ratio of the time, a time axis control unit that extends the time axis of the voice according to the time axis conversion speed ratio determined by the adaptive speed control unit, and a process obtained by the time axis control unit. An audio time base converter comprising a D / A converter for converting data into an analog signal.

2. The adaptive speed control unit sets the maximum value and the minimum value of the converted speed equal to or less than the reproduction speed M from the recording medium, and determines the time-axis conversion speed ratio in proportion to the remaining amount of data. The audio time base conversion device according to claim 1, wherein

3. The adaptive speed control unit sets the maximum value and the minimum value of the converted speed to be equal to or less than the reproduction speed M from the recording medium, and sets the time based on the conversion rule nonlinearly associated with the remaining amount of data. The audio time axis converter according to claim 1, wherein the axis conversion speed ratio is determined.

4. The time axis control unit includes sections A and B that include the same number of consecutive sample values and are adjacent to each other in the order of A and B.
And a value obtained by multiplying the sample value in the section A by a monotonically increasing weight coefficient and a value obtained by multiplying the sample value in the section B by a monotonically decreasing weight coefficient, a crossfade processing section X By adjusting the length of the section D connected in the order of A-X-B and the length of the unprocessed section C for directly outputting the data, and outputting the sections D and C alternately. The audio time axis converter according to any one of claims 1, 2 and 3, wherein the time axis is adjusted in accordance with the time axis conversion speed ratio output from.

5. When the data remaining amount monitoring unit determines that the remaining amount of data is equal to or less than a certain amount, the writing control unit writes all the data regardless of the determination result of the sound / silence determination unit. When the remaining amount of data is accumulated in the buffer memory and the remaining amount of data in the data remaining amount monitoring unit is equal to or more than a certain amount, the write address is controlled so that only the data which is determined to be voiced by the voice / non-voice determination unit is stored in the buffer memory. The audio time base conversion device according to claim 1, characterized in that.

6. When the data remaining amount monitoring unit determines that the remaining amount of data is equal to or less than a certain amount, the writing control unit writes all the data regardless of the determination result of the sound / silence determination unit. When the remaining amount of data is accumulated in the buffer memory and the remaining amount of data in the data remaining amount monitoring unit is equal to or more than a certain amount, the write address is controlled so that only the data which is determined to be voiced by the voice / non-voice determination unit is stored in the buffer memory. The audio time base conversion device according to claim 4, wherein.