JPS5947903B2

JPS5947903B2 - Digital call audio introduction method

Info

Publication number: JPS5947903B2
Application number: JP10834678A
Authority: JP
Inventors: 陽太郎八塚
Original assignee: Kokusai Denshin Denwa KK
Current assignee: KDDI Corp
Priority date: 1978-09-04
Filing date: 1978-09-04
Publication date: 1984-11-22
Also published as: JPS5535519A

Description

【発明の詳細な説明】本発明はデジタル通話音声挿入方式に関し、音声挿入の
際に発生するフリーズアウトを好適に吸収し得るもので
ある。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a digital call voice insertion method, and is capable of suitably absorbing freeze-outs that occur during voice insertion.

デジタル信号処理技術の進歩と経済化に伴ない、デジタ
ル化された複数の音声信号を伝送する場合、各音声信号
のうち音声部分が存在する音声部分を検出し、検出され
た音声部分だけを伝送するデジタル通話音声挿入方式（
ＤｉｇｉｔａｌＳｐｅｅｃｈＩｎｔｅｒｐｏｌａ−ｔｉ
ｏｎ、略してＤＳＩ）が採用されている。With the advancement and economicization of digital signal processing technology, when transmitting multiple digitized audio signals, it is necessary to detect the audio portion of each audio signal where audio portions exist, and transmit only the detected audio portions. Digital call voice insertion method (
DigitalSpeechInterpola-ti
on (abbreviated as DSI) has been adopted.

この場合、音声信号を所定個数の一連の標本値群である
単位ブロックに区分し、各単位ブロック毎に当該単位ブ
ロック内に音声が存在するか否かを音声検出器で判定し
て、音声が存在する単位ブロックを伝送する。また、帯
域圧縮を図るため、デジタル化された過去の標本値群か
ら現在の標本値を予測器で予測し、この予測値と現在の
標本値との差（予測誤差）を減算器で算出したのち、こ
の予測誤差だけを量子化器で改めて量子化・符号化して
低ビットレートで伝送する予測符号化方式がある。この
予測符号化方式ではｌビットで符号化するデルタ変調方
式と２ビット以上で符号化するＤＰＣＭ（Ｄｉｆｆｅｒ
ｅｎｔｉａｌＰＣＭ）方式とが代表的なものであり、Ｄ
ＰＣＭ方式にも量子化器の量子化レベル間隔あるいは予
測器の予測係数を随時制御する適応型ＤＰＣＭ（ＡＤＰ
ＣＭ）方式がある。そこで前記ＤＳＩ方式と前記ＤＰＣ
Ｍ方式（あるいはＡＤＰＣＭ方式）とを併用した、高能
率ＤＳＩ方式が考えられる。この高能率ＤＳＩ方式では
、音声部分だけを伝送することによる伝送路の有効利用
と、この音声部分を更に低ビットレートで伝送すること
による帯域圧縮とにより、伝送路の飛躍的な活用が可能
となつている。つまり、音声信号全体に対する検出され
た音声部分の占める割合（平均動作率）から決定される
ＤＳＩ利得（Ｂ：）晋）：に、に■■）とし、音声信号
のもとの符号化ビット数に対する予測符号化のビット数
の低減率の逆数を予測符号化利得とすれば、高能・率Ｄ
ＳＩ方式の総合利得はＤＳＩ利得と予測符号化利得との
積で与えられるからである。ところで、通話音声の平均
動作率は約４０％程度であるから理論的には約２．５倍
のＤＳＩ利得が得られるはずである。In this case, the audio signal is divided into a predetermined number of unit blocks that are a series of sample values, and a sound detector determines whether or not audio exists within the unit block for each unit block. Transmit existing unit blocks. In addition, in order to compress the bandwidth, a predictor predicts the current sample value from a group of digitized past sample values, and a subtracter calculates the difference (prediction error) between this predicted value and the current sample value. Later, there is a predictive coding method in which only this prediction error is quantized and coded again using a quantizer and transmitted at a low bit rate. This predictive coding method uses the delta modulation method, which encodes with l bits, and the DPCM (Differential modulation method, which encodes with 2 or more bits).
D
The PCM method also uses adaptive DPCM (ADP
CM) method. Therefore, the DSI method and the DPC
A high-efficiency DSI method can be considered that uses the M method (or ADPCM method) in combination. This high-efficiency DSI method makes it possible to make dramatic use of the transmission path by transmitting only the audio part, thereby effectively utilizing the transmission path, and by transmitting this audio part at an even lower bit rate to compress the bandwidth. It's summery. In other words, the DSI gain (B:) is determined from the ratio of the detected audio portion to the entire audio signal (average operation rate), and the original number of encoded bits of the audio signal is If the predictive coding gain is the reciprocal of the reduction rate of the number of predictive coding bits for
This is because the overall gain of the SI method is given by the product of the DSI gain and the predictive coding gain. By the way, since the average operation rate of voice calls is about 40%, theoretically a DSI gain of about 2.5 times should be obtained.

し力化、音声挿入時に瞬時的には、ＤＳＩ入力トランク
内の音声がある入力トランク即ちアクティブな入力トラ
ンク数がＤＳＩ出力チャネル数を超えてしまい、そのま
までは一部のアクテイブな入力トランクを接続できず、
音声が途中で跡絶える場合が発生する。この状態をフリ
ーズアウトと呼んでいる。したがつて、このフリーズア
ウトを避けるために、従来ではＤＳＩ利得が２倍程度に
抑えられている。一方予測符号化利得に関しては、音声
品質（信号対量子化雑音比、▼）として８〜６ビツトＰ
ＣＭ相当を想定すると４ビツトの予測符号化が用いられ
る。このＫｂ結果、８ビツト・８ＫＨｚサンプリングの
６４τＰＣＭと比較すると総合利得は４倍程度になる。When powering up or inserting audio, the number of input trunks with audio in the DSI input trunks, that is, the number of active input trunks, exceeds the number of DSI output channels, and some active input trunks cannot be connected. figure,
There are cases where the audio is cut off midway through. This state is called freeze-out. Therefore, in order to avoid this freeze-out, conventionally the DSI gain is suppressed to about twice. On the other hand, regarding the predictive coding gain, the voice quality (signal to quantization noise ratio, ▼) is 8 to 6 bits P.
Assuming that it corresponds to a CM, 4-bit predictive coding is used. As a result of this Kb, the total gain is about 4 times that of 64τPCM of 8 bits and 8 KHz sampling.

叙上の如く高能率ＤＳＩ方式においても、従来の総合利
得が４程度と然程高くない理由として次の二点が挙げら
れる。即ち、イフリーズアウトの多発を避けるあまり、
ＤＳＩの利得を２倍として安全設計を抹用していること
、及び口予測符号化のビツト数を一律にし、音声信号の
うち一の低）ＮＴＯ−
１ｓい部分に必要なビツト数（例えば４ビツト）を一Ｎｑの
高い部分にもそのまま適用しているため予測符号化にま
だ冗長性が存在すること、である。As mentioned above, even in the high-efficiency DSI system, the following two points can be cited as reasons why the conventional total gain is not very high at about 4. In other words, in order to avoid frequent if-freeze-outs,
NTO-
The number of bits (for example, 4 bits) required for the low part is applied as is to the high part of 1Nq, so there is still redundancy in predictive coding.

本発明は、上記従来技術の欠点に鑑み、総合利得が著し
く高いデジタル通話音声挿入方式を提供することを目的
とする。斯かる目的を達成する本５発明の構成は、デジ
タル化された複数の音声信号を夫々所定個数の一連の標
本値群である単位プロツクに区分し、音声が存在する単
位プロツクだけを予測符号化して伝送するデジタル通話
音声挿入方式において、前記予測符号化を可変ビツト形
の．予測符号化器で行なうと共に前記音声が存在する単
位プロツク内の音声の性質を判別し、音声挿入の際にフ
リーズアウトが生ずる場合は、信号対量子化雑音比が良
好とみなせる単位プロツクから順に予測符号化のビツト
数を削減することを特徴と．する。以下、本発明を図面
に基づいて詳細に説明する。SUMMARY OF THE INVENTION In view of the above-mentioned drawbacks of the prior art, it is an object of the present invention to provide a digital call voice insertion system with a significantly high overall gain. The configuration of the present invention to achieve such an object divides a plurality of digitized audio signals into unit blocks, each of which is a predetermined number of series of sample values, and predictively encodes only the unit blocks in which audio exists. In the digital voice insertion method for transmission using variable bits, the predictive coding is performed using a variable bit type . At the same time, the characteristics of the voice within the unit block in which the voice exists is determined, and if freeze-out occurs when inserting voice, prediction is performed in order from the unit block where the signal to quantization noise ratio is considered to be good. It is characterized by reducing the number of encoding bits. do. Hereinafter, the present invention will be explained in detail based on the drawings.

第１図は本発明の一実施例を示すプロツク図であり、本
実施例では入力音声信号を８ビツト・８ＫＨｚサンプリ
ングＰＣＭとし、且つその単位プロ・ツクは３２個の一
連の標本値群とする。第１図において、１は伸張器でこ
の場合８ビツトの非直線ＰＣＭ信号ＮＬを１３ビツトの
直線ＰＣＭ信号Ｌに変換する。したがつて音声信号がも
ともと直線ＰＣＭであればこの伸張器１は不要となる。
２はバツフアメモリで、各トランクの音声信号Ｌが単位
プロツク毎に記憶される。FIG. 1 is a block diagram showing one embodiment of the present invention. In this embodiment, the input audio signal is 8-bit, 8-kHz sampling PCM, and the unit program is a series of 32 sample values. . In FIG. 1, reference numeral 1 denotes an expander which in this case converts an 8-bit non-linear PCM signal NL into a 13-bit linear PCM signal L. Therefore, if the audio signal is originally a linear PCM, this expander 1 is unnecessary.
2 is a buffer memory in which the audio signal L of each trunk is stored for each unit block.

３は音声検出器、４はデータ・シグナリング検出器、５
は音声信号用の可変ビツト形ＡＤＰＣＭ符号化器、６は
データ用のＡＤＰＣＭ符号化器、７は割当制御器、８は
マルチプレクサ、９は割当状態信号発生器、１１は符号
化ビツト長表示信号発生器である。3 is a voice detector, 4 is a data signaling detector, 5
is a variable bit type ADPCM encoder for audio signals, 6 is an ADPCM encoder for data, 7 is an allocation controller, 8 is a multiplexer, 9 is an allocation status signal generator, and 11 is a coded bit length display signal generator. It is a vessel.

即ち、本実施例では、音声帯域信号のうちデータやシグ
ナリングを音声信号と分離してＤＳＩチヤネルＣＨから
外して専用のＡＤＰＣＭ符号化器６で符号化して伝送さ
せ、音声信号だけＤＳＩチヤネルＣＨに乗せるようにな
つている。しかし、音声用とデータ用にＡＤＰＣＭを分
けず同一のＡＤＰＣＭで動作させても良い。ここで、音
声信号に関する部分を第２図に抽出して示す。第２図に
おいて、可変ビツト形ＡＤＰＣＭ符号化器５はＡＤＰＣ
Ｍ符号化器１９とビツト長制御器１０とから構成される
。そして１２〜１６はＡＤＰＣＭ符号化器１９を構成す
るもので、１２は減算器、１３は適応形の量子化器、１
４は適応形の予測器、１５は逆量子化器、１６は加算器
である。なお、１７は直線ＰＣＭ信号に変換された音声
信号Ｌの入力端子、１８は本発明方式により音声挿入後
予測符号化された出力信号の出力端子である。ところで
、第２図は一つの独立した系としてプロツク図化されて
いるが、第１図と照合すれば理解できるように、バツフ
アメモリ２と音声検出器３とはバツフアメモリ２の入力
側である各ＤＳＩ入力トランクＴＫに共用され得る一方
、可変ビツト形ＡＤＰＣＭ符号化器５はバツフアメモリ
２の出力側であるＤＳＩ出力チヤネルＣＨの数だけ用意
される。前記音声検出器３は、単位プロツク毎に当該単
位プロツクに音声が存在するか否かを検出すると共に音
声が存在する場合には当該音声の信号対量子化雑音比（
？渭）の良否を予測するものである。That is, in this embodiment, data and signaling in the audio band signal are separated from the audio signal, removed from the DSI channel CH, encoded by a dedicated ADPCM encoder 6, and transmitted, and only the audio signal is placed on the DSI channel CH. It's becoming like that. However, the same ADPCM may be operated without separating the ADPCMs for voice and data. Here, the portion related to the audio signal is extracted and shown in FIG. In FIG. 2, the variable bit type ADPCM encoder 5 is an ADPC
It consists of an M encoder 19 and a bit length controller 10. 12 to 16 constitute an ADPCM encoder 19, 12 is a subtracter, 13 is an adaptive quantizer, 1
4 is an adaptive predictor, 15 is an inverse quantizer, and 16 is an adder. Note that 17 is an input terminal for the audio signal L converted into a linear PCM signal, and 18 is an output terminal for an output signal predictively encoded after audio insertion according to the method of the present invention. Incidentally, although FIG. 2 is diagrammed as one independent system, as can be understood by comparing it with FIG. While the variable bit type ADPCM encoders 5 can be shared by the input trunk TK, the same number of variable bit type ADPCM encoders 5 as the number of DSI output channels CH on the output side of the buffer memory 2 are prepared. The voice detector 3 detects for each unit block whether or not voice exists in the unit block, and when voice exists, calculates the signal to quantization noise ratio (
? It predicts the quality of the product (Wai).

そして、音声検出器３によつて音声が存在すると判定さ
れた単位プロツクは、割当制御器７からの制御信号によ
つでバツフアメモリ２より読出されで割当てられたチヤ
ネルＣＨの可変ビツト形ＡＤＰＣＭ符号化器５に出力さ
れる。同時に、音声検出Ｓ器３からは当該単位プロツク
の一の良否情報がＮｑビツト長制御器１０に出力され、
フリーズアウトの発生がある場合には、ビツト長制御器
１０より予測符号化ビツト長を削減すべき単位プロツク
及びそのビツト長が量子化器１３に指令される。Then, the unit block determined by the voice detector 3 as having voice is read out from the buffer memory 2 by a control signal from the allocation controller 7 and encoded into variable bit type ADPCM of the allocated channel CH. It is output to the device 5. At the same time, the voice detector S unit 3 outputs the quality information of one of the unit blocks to the Nq bit length controller 10.
When a freeze-out occurs, the bit length controller 10 instructs the quantizer 13 about the unit block whose predictive coding bit length is to be reduced and its bit length.

この時、ビツト削限される単位プロツクは−の良好なも
のから指定される。単位プロツクのビツト長情報は符号
化ビツト長表示信号発生器１１より出力される。また、
割当制御器７の制御信号に従つて、入力トランクＴＫと
出力チヤネルＣＨとの対応ずけを行なう割当状態信号が
割当状態信号発生器９から出力される。音声検出器３は
、単位プロツク内の内部電力や零交叉数を利用して当該
単位プロツク内に音声が存在するか否かを検出する音声
検出部と、単位プロツク内の音声の性質、即ち信号対量
子化雑音比の良否を判別する音声性質判定部とを有する
。At this time, the unit blocks whose bits are to be reduced are designated in descending order of the good bits. The bit length information of the unit block is outputted from the encoded bit length display signal generator 11. Also,
According to the control signal from the allocation controller 7, an allocation status signal for associating the input trunk TK with the output channel CH is outputted from the allocation status signal generator 9. The voice detector 3 includes a voice detection section that detects whether or not voice exists in a unit block by using the internal power and zero-crossing number in the unit block, and a voice detector that detects the nature of the voice in the unit block, that is, the signal. and a voice quality determination unit that determines whether the quantization-to-noise ratio is good or bad.

ところで、音声の信号対量子化雑音比（Ｌ）にっいては
、一般に、低域の周波数成分を多く含む部分即ち有声音
の部分は予測をしやすいため予測誤差が小さく依つて−
が良いとされ、高域の周波数成分を多く含む部分あるい
は周波数スペクトラムが平担な部分即ち無声音の部分ま
たは有声音と無声音との過渡部分は予測をし難いため予
測誤差が大きく依つて−が悪くなると言える。したがっ
て、母音等が大部分を占める有声音は−が良く、摩擦音
や破裂音等の無声音並びにその前後の過渡部分は−が悪
いとみなすことができる。以上のことから、本発明にお
いては、信号対量子化雑音比（Ｌ）の検出に代えて有声
音か無声音かを判定し、有声音であれば一が良く、無声
音であれば−が悪いとして判別している。このため、前
記音声性質判定部としては、零交叉数に着目して零交叉
数の特に多い単位プロツクは音声の−が悪いとするもの
、あるいは低域に設定した予測器を用い予測外れの多発
する単位プロツクは音声の一が悪いとするものを用い得
るが、本実施例では内部電力及び周波数成分に着目した
第３図に示すものを用いた。第３図において、２０は電
力計算器、２１は零交叉数計数器、２２はスレシヨルド
選定器、２３は極性ビツトマトリクス処理器、２４は判
定器、２５は音声検出表示端子、２６は有声−無声音表
示端子である。By the way, regarding the signal-to-quantization noise ratio (L) of speech, in general, parts containing many low-frequency components, that is, voiced sounds, are easy to predict, so the prediction error is small.
is said to be good, but it is difficult to predict parts that contain many high-frequency components or parts where the frequency spectrum is flat, that is, unvoiced parts, or transient parts between voiced and unvoiced sounds, and the prediction error is large and - is bad. I can say that it will be. Therefore, it can be considered that - is good for voiced sounds, which are mostly vowels, etc., and that - is bad for unvoiced sounds such as fricatives and plosives, and the transient parts before and after them. From the above, in the present invention, instead of detecting the signal to quantization noise ratio (L), it is determined whether the sound is voiced or unvoiced, and if it is a voiced sound, 1 is good, and if it is unvoiced, - is bad. It is determined. For this reason, the speech quality determining section may focus on the number of zero crossings and determine that a unit block with a particularly large number of zero crossings has poor speech, or may use a predictor set to a low frequency range to detect frequent mispredictions. Although a unit block for determining the quality of audio may be used, in this embodiment, the unit block shown in FIG. 3, which focuses on internal power and frequency components, is used. In FIG. 3, 20 is a power calculator, 21 is a zero-crossing counter, 22 is a threshold selector, 23 is a polarity bit matrix processor, 24 is a judger, 25 is a voice detection display terminal, and 26 is a voiced/unvoiced sound. This is a display terminal.

電力計算器２０は、各単位プロツクの内部電力Ｓｔを計
算するものである。この内部電力Ｓｔは判定器２４に入
力されて、例えばＳＴｈなる基準値と比較され、Ｓｔ＜
ＳＴｈのときは単位プロツクｔの音声は無音声（Ｓｉｌ
ｅｎｃｅ）と判定される。ＳＴｈ≦Ｓｔのときは直ちに
音声があると判定し端子２５から音声検出信号を出力す
る。更に単位プロツクｔ自体の各標本値の極性の系列で
ある主極性系列及び当該単位プロツクｔがフイルタにか
けられたのちの各標本値の極性の系列である副極性系列
に着目し、その周期性を中心に有声音か無声音かを判定
し、端子２６から有声−無声音表示信号を出力する。こ
のため、デジタル化された複数個の周波数の正弦波及び
余弦波夫々の各標本値の極性の列である基準極性系列群
に対する前記主及び副極性系列の類似度を極性ビツトマ
トリクス処理器２３で演算する。前記基準極性系列群は
次式（１），（２）で与えられる±ｌのエレメントから
なる極性ビツト・シーケンスマトリクスＨである。−方
、前記主及び副極性系列は、デジタルフイルタの特性を
第１表に示すように、ｌ（＝ｌ〜５）で指定すればＺ’
（ｔ）で表記される。The power calculator 20 calculates the internal power St of each unit block. This internal power St is input to the determiner 24 and compared with a reference value, for example STh, so that St<
When STh, the audio of unit block t is silent (Sil
ence). When STh≦St, it is immediately determined that there is a voice, and a voice detection signal is output from the terminal 25. Furthermore, we pay attention to the main polarity series, which is the series of polarities of each sample value of unit block t itself, and the subpolarity series, which is the series of polarities of each sample value after the unit block t is filtered, and calculate their periodicity. It is determined whether the center is a voiced sound or an unvoiced sound, and a voiced/unvoiced sound display signal is output from the terminal 26. For this reason, the polarity bit matrix processor 23 calculates the similarity of the main and sub-polarity sequences to a reference polarity sequence group, which is a polarity sequence of each sample value of each of the digitized sine waves and cosine waves of a plurality of frequencies. calculate. The reference polarity sequence group is a polarity bit sequence matrix H consisting of ±1 elements given by the following equations (1) and (2). - On the other hand, if the characteristics of the digital filter are specified by l (=l~5), the main and sub polarity series can be determined by Z'
It is written as (t).

したがつて、主極性系列はＺ２（ｔ）であり、Ｚ１（ｔ
）はローパスされた副極性系列、Ｚ３（ｔ）〜Ｚ５（ｔ
）は夫々ハイパス（プリエンフアシス）された副極性系
列となる。Therefore, the main polarity series is Z2(t) and Z1(t
) is the low-passed sub-polarity series, Z3(t) to Z5(t
) are high-passed (pre-emphasized) secondary polarity sequences.

そこで、主・副極性系列Ｚ８（ｔ）と基準極性系列Ｈと
のパターンマツチング度Ｙを次式（３），（４）で定め
ると、フーリエ変換に対応させた極性パターンマツチン
グパワ一Ｐ５ｋ（ｔ）が次式（５）で与えられる。この
極性パターンマツチングパワーＰｌｋ（ｔ）が前記類似
度に他ならない。但し、ｍ＝３２，ｋ＝ｒ１〜Ｒ２であ
る。以上を要約すれば、極性ビツトマトリクス処理器２
３は、主・副極性系列Ｚ１（ｔ）と基準極性系列群Ｈと
をマトリクス演算して、類似度Ｐｌｋ（ｔ）を出力する
。Therefore, if the pattern matching degree Y between the main and sub polarity series Z8(t) and the reference polarity series H is determined by the following equations (3) and (4), the polarity pattern matching power P5k corresponding to the Fourier transform is (t) is given by the following equation (5). This polar pattern matching power Plk(t) is nothing but the similarity. However, m=32 and k=r1 to R2. To summarize the above, polarity bit matrix processor 2
3 performs a matrix operation on the main/secondary polarity series Z1(t) and the reference polarity series group H, and outputs the similarity Plk(t).

この類似度Ｐ５ｋ（ｔ）はｌで表わされるオリジナル及
びフイルタにかけられた単位プロツクｔが、ｋで表わさ
れる１２５Ｈｚ及びその高調波とど．れくらい類似して
いるかを、ｌ及びｋの組合せ毎に示すものである。この
ように極性ビツトマトリクス処理器２３で演算処理され
た類似度Ｐ５ｋ（ｔ）は判定器２４に入力され、基準値
Ｐｌｈ，との大小比較がなされる。This degree of similarity P5k(t) indicates that the original and filtered unit blocks t, denoted by l, are 125Hz and its harmonics, denoted by k. This shows how similar each combination of l and k is. The degree of similarity P5k(t) thus calculated by the polarity bit matrix processor 23 is input to the determiner 24, where it is compared in magnitude with the reference value Plh.

この基．準値ＰＴｈ，は、例えばフイルタの特性１、周
波数ｋ及び零交叉数Ｚｃｔ，ｚｃｐｔをパラメータとし
て設定される。零交叉数Ｚｃｔはオリジナルの単位プロ
ツクｔ（７）零交叉数であり、零交叉数Ｚｃｐｔは、例
えば第４図に示すデジタルフイルタで、ａ１＝１、Ａ２
＝−０．５、Ａ３＝ｏとした場合のプリエンフアシス後
の単位プロツクｔの零交叉数である。なお、第４図中、
Ｄは遅延器、ａ１〜Ａ３は係数である。これらの零交叉
数Ｚｃｔ，ｚｃｐｔは零交叉数計数器２１で単位プロツ
クｔ毎に計数され、次々とスレシヨルド選定器２２に入
力され、スレシヨルド選定器２２からは零交叉数Ｚｃｔ
，ｚｃｐｔに見合う基準値ＰＴｈ，が判定器２４に出力
される。以上基準値ＰＴｈ，の設定は、第１表にｋ及び
ｉで示すように、Ｚ１（ｔ）は低域に、Ｚ２（ｔ）は中
低域に、Ｚ３（ｔ）は中域に、Ｚ４（ｔ）は中高域に、
またＺ５（ｔ）は高域の周波数に類似しやすいよう各フ
イルタ１が記定されていることに基づいて為される。This base. The quasi-value PTh is set using, for example, the filter characteristic 1, the frequency k, and the zero crossing numbers Zct, zcpt as parameters. The zero crossing number Zct is the original unit block t(7) zero crossing number, and the zero crossing number Zcpt is, for example, in the digital filter shown in FIG. 4, when a1=1, A2
This is the number of zero crossings of the unit block t after pre-emphasis when A3=-0.5 and A3=o. In addition, in Figure 4,
D is a delay device, and a1 to A3 are coefficients. These zero crossing numbers Zct and zcpt are counted by the zero crossing number counter 21 for each unit block t, and are inputted one after another to the threshold selector 22, and from the threshold selector 22, the zero crossing number Zct is counted.
, zcpt, is output to the determiner 24. The settings of the reference value PTh, as shown in Table 1 with k and i, are as follows: Z1(t) is set to the low range, Z2(t) is set to the mid-low range, Z3(t) is set to the mid range, and Z4 is set to the middle range. (t) for mid-high range,
Further, Z5(t) is determined based on the fact that each filter 1 is specified so as to be easily similar to a high frequency.

かくしで、ＳＴｈ≦Ｓｔの場合には判定器２４により後
述するイ若しくは口等の基準に基づいて当該単位プロツ
クｔの音声の性質が判別される。イＰｌｋ（ｔ）≧ＰＴ
ｈｌあるいはＰ２ｋ≧ＰＴｈ２が少なくとも１個のｋに
ついて成立する場合に有声音とし、それ以外を無声音と
する。ＣｌＰ３ｋ（ｔ）≧ＰＴｈ３，Ｐ（ｔ）≧ＰＴｈ
４及びＰ５ｋ（ｔ）≧ＰＴｈ５が夫々において少なくと
も１個のｋについで成立する場合あるいはＰｌｋ（ｔ）
≧ＰＴｈｌ，Ｐ２ｋ（ｔ）≧ＰＴｈ２，Ｐ３ｋ（ｔ）≧
ＰＴｈ３，Ｐ（ｔ）≧Ｐｌｈ４，Ｐ５ｋ（ｔ）≧ＰＴｈ
５のいずれもが成立しない場合に当該単位プロツクｔを
無声音とし、それ以外を有声音とする。したがつて、フ
リーズアウトの発生を割当制御器７にて検出し、フリー
ズアウトが発生した場合には、無声音の単位プロツクを
基本ビツト長で予測符号化する一方、有声音の単位プロ
ツクから順に基本ビツト長より短かいビツト長で予測符
号化することになる。In this case, if STh≦St, the determiner 24 determines the nature of the voice of the unit proc t based on criteria such as A or Mouth, which will be described later. iPlk(t)≧PT
If hl or P2k≧PTh2 holds true for at least one k, it is considered a voiced sound, and the other sounds are considered unvoiced. ClP3k(t)≧PTh3, P(t)≧PTh
4 and P5k(t)≧PTh5 holds for at least one k or Plk(t)
≧PThl, P2k(t)≧PTh2, P3k(t)≧
PTh3, P(t)≧Plh4, P5k(t)≧PTh
If none of the above conditions hold true, the unit block t is considered unvoiced, and the rest are voiced. Therefore, the allocation controller 7 detects the occurrence of a freeze-out, and when a freeze-out occurs, the unit blocks of unvoiced sounds are predictively encoded using the basic bit length, while the unit blocks of voiced sounds are sequentially encoded in the basic bit length. Predictive encoding is performed using a bit length shorter than the bit length.

以上のことにより、フリーズアウトが発生しても削減し
たビツト数を寄せ集めることにより新たなチヤネルを確
保でき、フリーズアウトが回避される。As a result of the above, even if a freeze-out occurs, a new channel can be secured by gathering the reduced number of bits, and the freeze-out can be avoided.

この場合信号対量子化雑音比（？）の良い単位プロツク
がビツト削減を受けているため、全体の音声品質に殆ん
ど影響を与えることはない。ここでＤＳＩ利得とフリー
ズアウトとの関係を説明する。平均動作率が３５〜３８
％におけるＤＳＩ利得とフリーズアウト発生率の関係は
、基本ビツト長（４ビツト）で換算して３倍のＤＳＩ利
得の場合２４０トランクでは約１０％のチヤネルが不足
し、平均動作率３５〜３８％の逆数のＤＳＩ利得の場合
２４０トランクでは約５％のチヤネルが不足する。これ
に対し全て３ビツトに削減すれば、平均動作率３８％に
対しては約５．５×１０〜２％また平均動作率３５％に
対しては３×１０−３％以下のフリーズアウト（チヤネ
ル不足）となりＤＳＩ利得を３倍に設定しても殆んどフ
リーズアウトを吸収できる。以上実施例とともに具体的
に説明したように、本発明によれば、予測符号化器を可
変ビツト形とすると共に、信号対量子化雑音比が良いと
みなされる単位プロツクからビツト長を削減するので、
音声品質に殆んど影響なくフリーズアウトを回避でき、
高能率ＤＳＩの総合利得を大幅に向上できる。In this case, since the unit block with a good signal to quantization noise ratio (?) is subjected to bit reduction, the overall voice quality is hardly affected. Here, the relationship between DSI gain and freeze-out will be explained. Average operating rate is 35-38
%, the relationship between DSI gain and freeze-out occurrence rate is as follows: If the DSI gain is 3 times the basic bit length (4 bits), 240 trunks will have a channel shortage of about 10%, and the average operating rate will be 35 to 38%. For a DSI gain of the reciprocal of 240 trunks, there will be a channel deficit of about 5%. On the other hand, if all the bits are reduced to 3 bits, the freeze-out will be approximately 5.5 x 10-2% for an average operation rate of 38%, and less than 3 x 10-3% for an average operation rate of 35% ( channel shortage), and even if the DSI gain is set to 3 times, most of the freeze-outs can be absorbed. As specifically explained above in conjunction with the embodiments, according to the present invention, the predictive encoder is of variable bit type, and the bit length is reduced from the unit block that is considered to have a good signal to quantization noise ratio. ,
Freeze-outs can be avoided with little impact on audio quality,
The overall gain of high-efficiency DSI can be greatly improved.

[Brief explanation of the drawing]

第１図は本発明の一実施例を示すプロツク図、第２図は
その要部を示すプロツク図、第３図は更にその要部を示
すプロツク図、第４図はデジタルフイルタの一例を示す
プロツク図である。図面中、２はバツフアメモリ、３は音声検出器、５は可
変ビツト形ＡＤＰＣＭ符号化器、７は割当制御器、９は
割当状態信号発生器、１０はビツト長制御器、１１はビ
ツト長表示信号発生器、１９はＡＤＰＣＭ符号化器、２
０は電力計算器、２１は零交叉数計数器、２２はスレシ
ヨルド選定器、２３は極性ビツトマトリクス処理器、２
４は判定器である。Fig. 1 is a block diagram showing an embodiment of the present invention, Fig. 2 is a block diagram showing its essential parts, Fig. 3 is a block diagram further showing its essential parts, and Fig. 4 shows an example of a digital filter. It is a block diagram. In the drawing, 2 is a buffer memory, 3 is a voice detector, 5 is a variable bit type ADPCM encoder, 7 is an allocation controller, 9 is an allocation status signal generator, 10 is a bit length controller, and 11 is a bit length display signal. generator, 19 is an ADPCM encoder, 2
0 is a power calculator, 21 is a zero-crossing counter, 22 is a threshold selector, 23 is a polarity bit matrix processor, 2
4 is a determiner.

Claims

[Claims]

1. In a digital call voice insertion method that divides a plurality of digitized voice signals into unit blocks each of which is a predetermined number of series of sample value groups, and predictively encodes and transmits only the unit blocks in which voice exists, the prediction Encoding is performed using a variable bit type predictive encoder, and the characteristics of the voice within the unit block in which the voice exists is determined, and if freeze-out occurs during voice insertion, the signal to quantization noise ratio is good. A digital call voice insertion method characterized by reducing the number of predictive coding bits in order from unit blocks that can be regarded as .