TWI453733B

TWI453733B - Device and method for audio quantization codec

Info

Publication number: TWI453733B
Application number: TW100150057A
Authority: TW
Inventors: Shih Chieh Huang; Chien Lung Chen
Original assignee: Nyquest Corp Ltd
Priority date: 2011-12-30
Filing date: 2011-12-30
Publication date: 2014-09-21
Also published as: TW201327548A

Description

Audio quantization codec device and method thereof

本發明係關於為一種量化裝置，特別是一種音訊量化編解碼裝置及其方法。The present invention relates to a quantization apparatus, and more particularly to an audio quantization codec apparatus and method thereof.

語音信號原為類比信號，經過數位化及壓縮會產生失真，一般而言壓縮率較高，信號失真較大，但所需傳輸碼率較低。所以在傳輸頻寬不足情況下，在可辨識通話內容的條件下，通常會選擇壓縮率較高的協定。如果沒有傳輸頻寬的問題，一般採用信號失真較小G.711協定是較好的選擇。The speech signal is originally an analog signal. After digitization and compression, distortion is generated. Generally, the compression ratio is high and the signal distortion is large, but the required transmission code rate is low. Therefore, in the case of insufficient transmission bandwidth, a protocol with a high compression ratio is usually selected under the condition that the content of the call can be recognized. If there is no problem with the transmission bandwidth, the G.711 protocol with a small signal distortion is generally a better choice.

請參考第1圖，其為先前技術之語音編碼與解碼系統圖，包含：語音輸入訊號100、語音編碼器200、記憶體300、語音解碼器400、語音輸出訊號500。其中，語音輸入訊號100為一段真實的聲音，其為類比訊號。舉例而言，語音編碼器200若為16 bit的單聲道，若以每秒8KHz的頻率取樣，資料量為每秒128kbit。當語音輸入訊號110輸入至語音編碼器200，語音輸入訊號100即會被取樣為每秒128kbit的單聲道資料，再經過壓縮編碼後，儲存在記憶體300中。語音編碼器200在實際上的應用，即為一種壓縮器。於實際的應用上，有時為了降低記憶體300的使用量，一般會把16 bit的語音資料壓縮為較低的解析度資料(如5bit或4bit)並存在記憶體300內，即可有效降低記憶體300的使用量。最後，語音解碼器400會將記憶體300內所儲存壓縮過後的較低的解析度資料解讀，再轉換成具有16bit的單聲道語音資料，並轉換為語音輸出訊號500。Please refer to FIG. 1 , which is a prior art speech encoding and decoding system diagram, including: a voice input signal 100 , a speech encoder 200 , a memory 300 , a speech decoder 400 , and a voice output signal 500 . The voice input signal 100 is a real sound, which is an analog signal. For example, if the speech encoder 200 is a 16-bit mono, if the sampling is performed at a frequency of 8 KHz per second, the amount of data is 128 kbits per second. When the voice input signal 110 is input to the voice encoder 200, the voice input signal 100 is sampled as a mono-channel data of 128 kbits per second, and then compressed and encoded, and stored in the memory 300. The practical application of the speech encoder 200 is a compressor. In practical applications, in order to reduce the usage of the memory 300, 16-bit voice data is generally compressed into lower resolution data (such as 5 bit or 4 bit) and stored in the memory 300, which can effectively reduce The amount of memory 300 used. Finally, the speech decoder 400 interprets the compressed lower resolution data stored in the memory 300, converts it into 16-bit mono speech data, and converts it into a speech output signal 500.

接著，請參考第2A圖，係為先前技術的語音編碼器200之詳細方塊圖。其中，語音編碼器200包含：類比數位轉換器210、減碼轉換器220、量化器230與資料編碼器240。其中，類比數位轉換器210接收類比的語音輸入訊號而轉換為數位的第一語音資料。減碼轉換器220連接類比數位轉換器210，對第一語音資料進行減碼。量化器230連接減碼轉換器220，接收第一語音資料並進行量化而產生一數位碼，該數位碼包含符號資料與數字資料。資料編碼器240連接量化器230，接收至少一個數位碼以產生一串接語音資料。Next, please refer to FIG. 2A, which is a detailed block diagram of the prior art speech encoder 200. The speech encoder 200 includes an analog-to-digital converter 210, a down-converter 220, a quantizer 230, and a data encoder 240. The analog digital converter 210 receives the analog voice input signal and converts it into a digital first voice material. The down-converter 220 is coupled to the analog-to-digital converter 210 to down-code the first speech material. The quantizer 230 is coupled to the down-converter 220, receives the first speech data and quantizes to generate a digital code comprising symbol data and digital data. The data encoder 240 is coupled to the quantizer 230 and receives at least one digital code to generate a series of speech data.

其中，另一種先前實施方式，請參考第2B圖，在一外部的第一記憶體110已儲存了數位的第一語音資料，其中，減碼轉換器220對第一語音資料進行減碼。量化器230連接減碼轉換器220，接收第一語音資料並進行量化而產生一數位碼，該數位碼包含符號資料與數字資料。資料編碼器240連接量化器230，接收至少一個數位碼以產生一串接語音資料。In another previous embodiment, please refer to FIG. 2B. The first first memory 110 has stored a digital first audio data, and the down converter 220 performs a subtraction on the first voice data. The quantizer 230 is coupled to the down-converter 220, receives the first speech data and quantizes to generate a digital code comprising symbol data and digital data. The data encoder 240 is coupled to the quantizer 230 and receives at least one digital code to generate a series of speech data.

以下列舉一範例：接著，請參考第3圖，由類比數位轉換器210所轉換之16bit語音資料，其中包含了7筆數位的第一語音資料：111110110001000、1111001100001000、1111111100001000、0000000100001000、0000010100001000、0000100000001000、0000111100001000。減碼轉換器220再將這7筆的16bit第一語音資料轉為8 bit的有帶正、負符號的第一語音資料。其中減碼轉換器220直接把16bit的第一語音資料中的第1bit至第8bit的直接去掉，只保留原來的第一語音的第9bit到第16bit的資料，而這留下的資料，即為新的第一語音資料。所以第一語音資料最後只留下8bit的具有正、負符號的資料，且其資料範圍為-128至127。其中，而第1bit至第7bit則代表數字資料(語音訊號的量)，而第8bit則代表符號資料(語音訊號的正、負值)。所以第一語音資料經過減碼轉換器220減碼之後，得到新的7個第一語音資料為11111011、11110011、11111111、00000010、00000101、00001000、00001111(-5、-13、-1、2、5、8、15)。An example is given below: Next, please refer to FIG. 3, the 16-bit voice data converted by the analog-to-digital converter 210, which includes the first voice data of 7 digits: 111110110001000, 1111001100001000, 1111111100001000, 0000000100001000, 0000010100001000, 0000100000001000, 0000111100001000. The down-converter 220 then converts the seventeen 16-bit first voice data into an 8-bit first voice material with positive and negative signs. The down-converter 220 directly removes the first bit to the eighth bit of the first bit of the 16-bit voice data, and retains only the data of the 9th bit to the 16th bit of the original first voice, and the data left by the original voice is New first voice material. Therefore, the first voice data only leaves 8 bits of data with positive and negative signs, and the data range is -128 to 127. Among them, the first bit to the seventh bit represent digital data (the amount of voice signals), and the eighth bit represents symbol data (positive and negative values of voice signals). Therefore, after the first speech data is down-coded by the down-converter 220, the new seven first speech data are obtained as 11111011, 11110011, 11111111, 00000010, 00000101, 00001000, and 00001111 (-5, -13, -1, 2, 5, 8, 15).

接著，量化器230將第一語音資料量化而產生數位碼，而量化的方式可利用查表法來進行。雖然語音資料有正負之別，但為了節省記憶體的使用，通常只會建立正半週的量化表(quantization table)。在進行量化程序之前，先將代表語音資料隸屬於正半週或負半週的符號位元紀錄下來，然後再將語音資料取絕對值，利用正半週的量化表將語音資料量化。以下列舉一個5bit表格，將第一語音資料(-5、-13、-1、2、5、8、15)利用量化表1進行量化程序。例如：紀錄語音資料-5、-13、-1的符號位元為1而2、5、8、15的符號位元為0,再將所有資料取絕對值之後得到(5、13、1、2、5、8、15),其中5根據量化表1得到最佳的索引碼為3，而對應的量化數位碼為00011,13根據量化表1得到最佳的索引碼為7，而對應的量化數位碼為00111,在第五位元加上符號位元之後的數位資料分別是10011,10111。Next, the quantizer 230 quantizes the first speech data to generate a digital code, and the manner of quantization can be performed using a look-up table method. Although the voice data is positive or negative, in order to save memory usage, usually only a semi-circular quantization table is established. Before performing the quantification procedure, the symbolic bits representing the speech data belonging to the positive half cycle or the negative half cycle are recorded, and then the speech data is taken to an absolute value, and the speech data is quantized using the positive half-cycle quantization table. The following is a list of 5 bit tables, and the first speech data (-5, -13, -1, 2, 5, 8, 15) is quantized using the quantization table 1. For example, the sign bit of the recorded voice data -5, -13, -1 is 1 and the sign bit of 2, 5, 8, and 15 is 0, and then all the data are taken as absolute values (5, 13, 1, 2, 5, 8, 15), wherein 5 according to the quantization table 1, the best index code is 3, and the corresponding quantized digit code is 00011, 13 according to the quantization table 1, the best index code is 7, and the corresponding The quantized digit code is 00111, and the digit data after the fifth bit plus the sign bit is 10011, 10111, respectively.

於是，第一語音資料(-5、-13、-1、2、5、8、15)的絕對值根據表1得到索引碼為(3、7、1、2、3、4、7)，其對應之二進位數位碼為(00011、00111、00001、00010、00011、00100、00111)，在第5位元替換符號位元之後的二進位數位碼為(10011、10111、10001、00010、00011、00100、00111)。其中，13所對映的表格碼，最接近者為15，因此，選擇其為對應的表值。Therefore, the absolute value of the first voice data (-5, -13, -1, 2, 5, 8, 15) is obtained according to Table 1 as (3, 7, 1, 2, 3, 4, 7). The corresponding binary digits are (00011, 00111, 00001, 00010, 00011, 00100, 00111), and the binary digits after the fifth digit replaces the sign bit are (10011, 10111, 10001, 00010, 00011) , 00100, 00111). Among them, the table code of the 13 pairs is the closest to 15, so it is selected as the corresponding table value.

Table 1

在實際的應用上，為了降低量化誤差，經常會使用多個量化表來對應不同動態範圍的語音資料。請參考第4圖，數位碼600由符號資料612再加上數字資料614構成。串接語音資料為多個數位碼600所組成而7筆數位碼600的資料量共有35 bit。串接語音資料通常亦包含訊框標記資料606(Frame Header)，訊框標記資料606記載當前的訊框所對應最佳量化表的索引，以及訊框切換控制碼(一般是採用和語音編碼資料相同位元數但不重複的特殊碼)。訊框標記資料606的資料長度端視量化表的個數而定，例如，當採用8個量化表時訊框標記資料606需要3bit來對應最佳量化表的索引，採用32個量化表時訊框標記資料606需要5bit來對應最佳量化表的索引。以10 bit訊框標記資料長度為例(5bit訊框切換控制碼加上5bit最佳量化表的索引),此串接語音資料經過編碼後的大小為35+10=45bit。於是原來7筆16bit的第一語音資料共有112bit,經過減碼轉換器220將7筆16bit的資料轉換成只有7筆8bit共56bit的第一語音資料。再利用量化器230的量化結果將每筆8bit資料(表格碼)變成5bit的索引碼資料，最後，7筆5bit的資料量是35 bit。由此可知，我們由原先的112bit的資料量經過減碼轉換器220與量化器230，最後變成只有35bit的資料量。之後，再加上10bit的訊框標記資料606，現在總資料量共為45bit。In practical applications, in order to reduce the quantization error, multiple quantization tables are often used to correspond to different dynamic range speech data. Referring to FIG. 4, the digit code 600 is composed of symbol data 612 plus digital data 614. The serial voice data is composed of a plurality of digit codes 600 and the data amount of the seven digit code 600 is 35 bits. The concatenated voice data usually also includes a frame header data 606 (Frame Header), and the frame tag data 606 records the index of the optimal quantization table corresponding to the current frame, and the frame switching control code (generally using the speech coding data). Special code with the same number of bits but not repeated). The length of the data of the frame mark data 606 depends on the number of quantization tables. For example, when eight quantization tables are used, the frame mark data 606 needs 3 bits to correspond to the index of the optimal quantization table, and 32 quantization tables are used. The box tag data 606 requires 5 bits to correspond to the index of the best quantization table. Taking the data length of the 10-bit frame as an example (the index of the 5-bit frame switching control code plus the 5-bit optimal quantization table), the size of the serialized speech data is 35+10=45 bits. Therefore, the original 7-bit 16-bit first voice data has a total of 112 bits, and the 16-bit data is converted into a first voice data of only 7 8 bits and a total of 56 bits through the down-converter 220. The quantized result of the quantizer 230 is used to convert each 8-bit data (table code) into 5-bit index code data. Finally, the data amount of the seven 5-bit data is 35 bits. From this, we can see that the original 112-bit data amount passes through the down-converter 220 and the quantizer 230, and finally becomes a data amount of only 35 bits. After that, plus 10bit frame mark data 606, the total amount of data is now 45bit.

由以上的先前技術可知，在做語音量化編碼時，其量化的每筆數位碼含有符號碼與數字碼，而每筆數位碼都包含有符號碼，無形中會多浪費儲存的資料量。所以為了減少浪費儲存的資料量，實有必要提出一種新的架構來減少儲存的資料量。It can be known from the above prior art that when performing speech quantization coding, each quantized digital code contains a symbol code and a digital code, and each digital code contains a symbol code, which inevitably wastes the stored data amount. Therefore, in order to reduce the amount of waste of stored data, it is necessary to propose a new architecture to reduce the amount of stored data.

本發明提供一種音訊量化編解碼裝置，運用一記憶體以進行訊號之編解碼，記憶體紀錄有複數個數位第一語音資料與一第二編碼資料串，包含：音訊量化編碼模組與音訊量化解碼模組。音訊量化編碼模組包含：量化器、訊號分割器與資料編碼器。訊號分割器讀取數位第一語音資料並進行複數個零交越點判斷而依序產生複數個第一符號資料，並將所有的數位第一語音資料切割為複數個訊框。量化器連接訊號分割器，接收每個訊框所對應的數位第一語音資料與第一符號資料，並將每次所接收之訊框所對應之多個數位第一語音資料量化後個別對應產生多個第一數字資料，並依據每個訊框所量化之結果對應產生一個第一訊框標記資料。資料編碼器連接量化器與訊號分割器，接收該些第一數字資料、該些第一符號資料與該些第一訊框標記資料並編碼成複數個第一編碼資料串。音訊量化解碼模組包含：資料解碼器與反量化器。資料解碼器連接記憶體，讀取第二編碼資料串並進行解碼而產生複數個第二解碼資料串，每個第二解碼資料串包含：一第二訊框標記資料、一第二符號資料、複數個第二數字資料。反量化器連接資料解碼器，接收第二解碼資料串，並依據第二訊框標記資料、第二符號資料之值進行該些第二數字資料之反量化而依序產生複數個數位第二語音資料。The invention provides an audio quantization codec device, which uses a memory for encoding and decoding signals, and the memory records a plurality of digital first speech data and a second encoded data string, including: audio quantization coding module and audio quantization Decoding module. The audio quantization coding module comprises: a quantizer, a signal divider and a data encoder. The signal splitter reads the digital first speech data and performs a plurality of zero crossing point judgments to sequentially generate a plurality of first symbol data, and cuts all digits of the first speech data into a plurality of frames. The quantizer is connected to the signal splitter, and receives the first digit of the first voice data and the first symbol data corresponding to each frame, and quantizes the plurality of digits of the first voice data corresponding to each frame received by the corresponding frame. a plurality of first digital data, and correspondingly generating a first frame mark data according to the quantized result of each frame. The data encoder is connected to the quantizer and the signal divider, and receives the first digital data, the first symbol data and the first frame label data and encodes the plurality of first encoded data strings. The audio quantization decoding module comprises: a data decoder and an inverse quantizer. The data decoder is connected to the memory, reads the second encoded data string and performs decoding to generate a plurality of second decoded data strings, each second decoded data string includes: a second frame mark data, a second symbol data, A plurality of second digital materials. The inverse quantizer is connected to the data decoder, receives the second decoded data string, and performs inverse quantization of the second digital data according to the values of the second frame mark data and the second symbol data to sequentially generate a plurality of digital second voices data.

本發明又提供一種音訊量化編碼方法，運用於語音數位訊號之編碼，包含：讀取複數個數位第一語音資料並進行複數個零交越點判斷而依序產生複數個第一符號資料，並將所有的數位第一語音資料切割為複數個訊框；接收每個訊框所對應之多個數位第一語音資料與第一符號資料，並將每次所接收之訊框所對應之該些數位第一語音資料量化後對應產生複數個第一數字資料，並依據每個訊框所量化之結果對應產生第一訊框標記資料；及接收該些第一數字資料、第一符號資料與第一訊框標記資料並編碼成複數個第一編碼資料串。The invention further provides an audio quantization coding method, which is applied to the encoding of a voice digital signal, comprising: reading a plurality of digital first speech data and performing a plurality of zero crossing point judgments to sequentially generate a plurality of first symbol data, and Cutting all digits of the first voice data into a plurality of frames; receiving a plurality of digits of the first voice data and the first symbol data corresponding to each frame, and corresponding to each of the received frames The digital first data is quantized to generate a plurality of first digital data, and corresponding to the quantized result of each frame, corresponding to the first frame marking data; and receiving the first digital data, the first symbol data and the first The frame marks the data and encodes into a plurality of first encoded data strings.

本發明另提供一種音訊量化解碼方法，運用於數位語音資料解碼，包含：讀取一第二編碼資料串並進行解碼而產生複數個第二解碼資料串，每個第二解碼資料串包含：一第二訊框標記資料、一第二符號資料、複數個第二數字資料；及接收該第二解碼資料串，並依據第二訊框標記資料、第二符號資料之值進行該些第二數字資料之反量化而依序產生複數個數位第二語音資料。The present invention further provides an audio quantization decoding method, which is applied to digital voice data decoding, comprising: reading a second encoded data string and performing decoding to generate a plurality of second decoded data strings, each second decoded data string comprising: a second frame mark data, a second symbol data, and a plurality of second digital data; and receiving the second decoded data string, and performing the second number according to the value of the second frame mark data and the second symbol data The inverse quantization of the data sequentially generates a plurality of digital second speech data.

本發明再提供一種音訊量化編解碼方法，包含：讀取該些數位第一語音資料並進行複數個零交越點判斷而依序產生複數個第一符號資料，並將該些數位第一語音資料切割為複數個訊框；接收每個訊框所對應之多個數位第一語音資料與第一符號資料，並將每次所接收之該訊框所對應之數位第一語音資料量化後產生複數個第一數字資料，並依據訊框所量化之結果對應產生一個第一訊框標記資料；接收該些第一數字資料、第一符號資料與第一訊框標記資料並編碼成第一編碼資料串；讀取一第二編碼資料串並進行解碼而產生複數個第二解碼資料串，每個第二解碼資料串包含：一第二訊框標記資料、一第二符號資料、複數個第二數字資料；及接收該第二解碼資料串，並依據第二訊框標記資料、第二符號資料之值進行第二數字資料之反量化而依序產生複數個數位第二語音資料。The present invention further provides an audio quantization and decoding method, comprising: reading the digital first speech data and performing a plurality of zero crossing point judgments to sequentially generate a plurality of first symbol data, and the digital first speech Cutting the data into a plurality of frames; receiving the plurality of digits of the first voice data and the first symbol data corresponding to each frame, and quantizing the digital first voice data corresponding to the frame received each time a plurality of first digital data, and correspondingly generating a first frame mark data according to the quantized result of the frame; receiving the first digital data, the first symbol data and the first frame mark data and encoding the first code Data string; reading a second encoded data string and decoding to generate a plurality of second decoded data strings, each second decoded data string comprising: a second frame mark data, a second symbol data, and a plurality of And receiving the second decoded data string, and performing inverse quantization of the second digital data according to the values of the second frame mark data and the second symbol data to sequentially generate the plurality of numbers The second voice data.

本發明提出一種更有效率的編碼裝置，現有的編碼裝置其每一筆量化的數位碼都包含有符號碼，無形中會多浪費儲存的資料量。本發明只提出只用一個符號碼，並串聯複數個數字資料，在維持解析度的前提下可以減少儲存的資料量，而在追求音質的前提下可以透過微幅增加資料量來達到顯著提升音質的功效。The present invention proposes a more efficient encoding device. In the prior art encoding device, each quantized digital code contains a symbol code, which inevitably wastes the stored data amount. The invention only proposes that only one symbol code is used, and a plurality of digital data are connected in series, and the amount of stored data can be reduced while maintaining the resolution, and in the pursuit of sound quality, the amount of data can be increased by a slight increase to achieve a significant improvement in sound quality. The effect.

為讓本發明之上述和其他目的、特徵、和優點能更明顯易懂，下文特舉數個較佳實施例，並配合所附圖式，作詳細說明如下：The above and other objects, features and advantages of the present invention will become more apparent and understood.

請參考第5A圖，係為本發明的音訊量化編碼模組200A之實施例，包含：量化器230、訊號分割器250與資料編碼器240。訊號分割器250包括了一個暫存器251，訊號分割器250從第一記憶體110讀取所儲存之數位第一語音資料並進行一零交越點判斷而依序產生複數個第一符號資料612，並將數位第一語音資料切割為複數個訊框。量化器230連接訊號分割器250，依序依據訊號分割器250所切割的訊框當中所對應的多個數位第一語音資料與與相對應的第一符號資料，並將訊框所對應之多個數位第一語音資料量化後一對一對應產生第一數字資料，並依據此次訊框所量化之結果產生第一訊框標記資料606。資料編碼器240則連接量化器230與訊號分割器250，接收第一數字資料、第一符號資料與訊框標記資料並將其編碼成第一編碼資料串，每個第一編碼資料串包含第一訊框標記資料、第一符號資料、多個第一數字資料。之後，再將第一編碼資料串儲存之第二記憶體300中。Please refer to FIG. 5A , which is an embodiment of the audio quantization coding module 200A of the present invention, including: a quantizer 230, a signal divider 250 and a data encoder 240. The signal divider 250 includes a register 251. The signal divider 250 reads the stored digital first voice data from the first memory 110 and performs a zero crossing point judgment to sequentially generate a plurality of first symbol data. 612, and cutting the digital first voice data into a plurality of frames. The quantizer 230 is connected to the signal divider 250, and sequentially according to the plurality of digits of the first voice data corresponding to the frame cut by the signal divider 250 and the corresponding first symbol data, and corresponding to the frame. The digitized first voice data is quantized and the first digital data is generated in a one-to-one correspondence, and the first frame marker data 606 is generated according to the result quantized by the frame. The data encoder 240 is connected to the quantizer 230 and the signal divider 250, and receives the first digital data, the first symbol data and the frame marker data and encodes the first encoded data string into a first encoded data string. A frame mark data, a first symbol data, and a plurality of first digital materials. Then, the first encoded data string is stored in the second memory 300.

實務上，第一記憶體110與第二記憶體300可以是一個相同記憶體當中的不同區塊。In practice, the first memory 110 and the second memory 300 may be different blocks in the same memory.

接著，請參考第5B圖，係為本發明的音訊量化編碼模組200B之實施例。第5B圖與第5A圖中主要的差異為，第5B圖中之量化器230每次所讀取之訊框所對應的多個數位第一語音資料係直接從第一記憶體110讀取後並進行量化。而第5A圖中，量化器230每次所讀取之訊框所對應的多個數位第一語音資料先由第一記憶體110讀取後放置於訊號分割器250之暫存器251中，再由訊號分割器250之暫存器251讀取數位第一語音資料後進行量化。Next, please refer to FIG. 5B, which is an embodiment of the audio quantization coding module 200B of the present invention. The main difference between FIG. 5B and FIG. 5A is that the plurality of digits of the first voice data corresponding to the frame read by the quantizer 230 in FIG. 5B are directly read from the first memory 110. And quantify. In FIG. 5A, the plurality of digits of the first voice data corresponding to the frame read by the quantizer 230 are first read by the first memory 110 and then placed in the register 251 of the signal divider 250. Then, the digital first data is read by the register 251 of the signal divider 250 and quantized.

接著，請參考第6A圖，其為本發明的音訊量化編碼模組200C之實施例。其為於第5A圖的實施例中，增加了一個減碼轉換器220。減碼轉換器220連接第一記憶體110與訊號分割器250之間，對暫存於第一記憶體110當中一個訊框的所有第一語音資料進行減碼之動作，再儲存於暫存器251當中。Next, please refer to FIG. 6A, which is an embodiment of the audio quantization coding module 200C of the present invention. In the embodiment of FIG. 5A, a down code converter 220 is added. The down code converter 220 is connected between the first memory 110 and the signal divider 250, and performs a code reduction operation on all the first voice data temporarily stored in one frame of the first memory 110, and then stored in the temporary register. 251.

接著，第6B圖，其為本發明的音訊量化編碼模組200D之實施例。其為於第5B圖的實施例中，增加了一個減碼轉換器220。減碼轉換器220連接於第一記憶體110與訊號分割器250、量化器230之間，對儲存於第一記憶體中110的第一語音資料進行減碼之動作，再傳送至量化器230。Next, FIG. 6B is an embodiment of the audio quantization coding module 200D of the present invention. In the embodiment of FIG. 5B, a down code converter 220 is added. The down code converter 220 is connected between the first memory 110 and the signal divider 250 and the quantizer 230, and performs the action of reducing the first voice data stored in the first memory 110 to be transmitted to the quantizer 230. .

其中，量化器230包含：控制單元及向量單元，控制單元依據每個訊框所對應之所有的第一語音資料或經減碼之第一語音資料並計算量化誤差等以供量化表的選擇，並產生訊框標記資料。向量單元接收第一語音資料並進行量化表的查尋而對應產生數字資料。The quantizer 230 includes: a control unit and a vector unit, and the control unit calculates a quantization error or the like according to all the first voice data or the subtracted first voice data corresponding to each frame for the selection of the quantization table, And generate frame marker data. The vector unit receives the first voice data and performs a lookup of the quantization table to generate digital data.

本發明的減碼轉換器220並不侷限只有16bit減少為8bit，亦可由16bit變成10bit，又或者24bit減少為12bit，本發明並沒有限定格式，完全依據系統的設計來加以選定。The down-converter 220 of the present invention is not limited to only 16 bits reduced to 8 bits, and can be changed from 16 bits to 10 bits, or 24 bits to 12 bits. The present invention does not have a format, and is selected according to the design of the system.

第一語音資料可為經過減碼後的資料，例如，由16bit減碼為8bit或10bit。在本發明的一些實施例中，當多個第一語音資料為負值時，將數位碼600當中代表正、負符號的符號資料省略並整合至訊框標記資料之後，並形成一個新的數字資料614，其僅包含了代表聲音資料的內容，而不包含正、負的資訊，如第7圖所示者。如此，可減少先前技術當中的數位碼600的位元數，例如，原先5bit，減少為4bit，以降低資料量。或者，增加數位碼600的位元數，例如，由原先數位碼當中的5bit當中只占了4bit的數字資料，在本發明中，增加1bit，也就是5bit的數字資料，以提高編碼資料的解析度。The first voice data may be a subtracted data, for example, 16bit down to 8bit or 10bit. In some embodiments of the present invention, when the plurality of first voice data is negative, the symbol data representing the positive and negative symbols in the digit code 600 is omitted and integrated into the frame marker material, and a new number is formed. The data 614, which only contains the content representing the sound material, does not contain positive and negative information, as shown in FIG. In this way, the number of bits of the digit code 600 in the prior art can be reduced, for example, the original 5 bits is reduced to 4 bits to reduce the amount of data. Or, the number of bits of the digital code 600 is increased. For example, only 4 bits of the 5 bits of the original digital code are used. In the present invention, 1 bit, that is, 5 bits of digital data is added to improve the analysis of the encoded data. degree.

請參考第7圖，其為本發明之串接語音資料之資料結構示意圖。個別的串接語音資料620、622均包括了訊框標記資料606、符號資料612與多筆數字資料614。換句話說，每筆串接語音資料起始於訊框標記資料606與符號資料612，而終於下一筆訊框標記資料606之前。串接語音資料620、622的長度取決於兩個零交越點的長度當中的數字資料的點數，亦即，訊框標記資料606+符號資料612+兩個零交越點的點數×數字資料的長度(例如，4bit或者5bit)。Please refer to FIG. 7, which is a schematic structural diagram of the data of the serial voice data of the present invention. The individual serial voice data 620, 622 includes frame mark data 606, symbol data 612 and multiple digital data 614. In other words, each concatenated speech data begins with frame tag data 606 and symbol data 612, and finally before the next frame tag data 606. The length of the concatenated speech data 620, 622 depends on the number of points of the digital data among the lengths of the two zero crossing points, that is, the frame mark data 606 + the symbol data 612 + the number of points of the two zero crossing points × The length of the digital material (for example, 4bit or 5bit).

換句話說，該符號資料612的正號或負號，係利用訊號分割器判斷該零交越點的產生，而零交越點的判斷，則是依據連續二個第一語音資料的資料變化而定。當本發明的訊號分割器250接收到接續一個正第一語音資料與一個負第一語音資料時(無論何者先出現)，即可判斷有零交越點，訊號分割器250就會產生代表零交越點產生的符號資料612，以提供給資料編碼器240。而符號資料612的正號，可以0來代表，而符號資料612的負號，可以1來代表。例如：第一個第一語音資料為A，第二個第一語音資料為B，當A<0，且B>=0時，訊號分割器250則產生符號資料612為”0”(正號)，即第一語音資料為由負轉正的情形，亦即，後續出現的第一語音資料將都為正；當A>=0，且B<0時，訊號分割器250則產生符號資料612為”1”(負號)，即第一語音資料為由正轉負的情形，亦即，後續出現的第一語音資料將都為負。上述只是本發明實施零交越點判斷的一實施例，本發明不侷限此種方式。In other words, the positive or negative sign of the symbol data 612 is determined by the signal divider to determine the zero crossing point, and the zero crossing point is judged according to the data of the two consecutive first speech data. And set. When the signal divider 250 of the present invention receives a positive first speech data and a negative first speech data (whichever occurs first), it can be determined that there is a zero crossing point, and the signal divider 250 generates a representative zero. The symbol data 612 generated by the crossover point is provided to the data encoder 240. The positive sign of the symbol data 612 can be represented by 0, and the negative sign of the symbol data 612 can be represented by 1. For example, the first first voice data is A, and the second first voice data is B. When A<0 and B>=0, the signal divider 250 generates the symbol data 612 as “0” (positive sign) ), that is, the first voice data is negatively positive, that is, the subsequent first voice data will be positive; when A>=0, and B<0, the signal divider 250 generates the symbol data 612. It is "1" (negative sign), that is, the first voice data is changed from positive to negative, that is, the subsequent first voice data will be negative. The above is only one embodiment of the zero crossing point judgment of the present invention, and the present invention is not limited to this manner.

範例一：Example 1:

本實施例係說明在代表語音訊號的數字資料編碼長度固定為4bit的情況。訊框標記資料606所對應的量化表可以有2個或2個以上，此時，訊框標記資料606須採用至少1個bit來指示採用哪個表。例如，當只用到二個量化表的狀況下，訊框標記資料606可以設定表2(本實施例的第1個表)所對應的表值為”0”，而表3(本實施例的第2個表)的表值可設定為”1”。當採用5個量化表時，此時，訊框標記資料606就需要3bit，分別對映的表值為000、001、010、011、100等。本發明可對映的量化表數，可以是單一個表，或者，複數個表。This embodiment illustrates the case where the digital data encoding length representing the voice signal is fixed to 4 bits. There may be two or more quantization tables corresponding to the frame mark data 606. At this time, the frame mark data 606 must use at least one bit to indicate which table to use. For example, when only two quantization tables are used, the frame mark data 606 can set the table value corresponding to Table 2 (the first table in this embodiment) to "0", and Table 3 (this embodiment) The table value of the second table) can be set to "1". When five quantization tables are used, at this time, the frame mark data 606 needs 3 bits, and the table values respectively mapped are 000, 001, 010, 011, 100, and the like. The number of quantifiable tables that can be mapped in the present invention may be a single table or a plurality of tables.

在有多個量化表的情形下，一般會依據資料的量化誤差大小來決定最適合的表。接著，請回頭參考第3圖，其為多個第一語音資料序列(-6、-12、-1、3、5、8、15、8、5)，其有二個零交越點的發生。假設共有8個表可供使用，在經過比對篩選後，表2與表3是最合適的表可分別運用於(-6、-12、-1)序列及(3、5、8、15、8、5)序列。適合的量化表選擇，係為熟習該項技藝者所熟知，不再贅述。In the case of multiple quantization tables, the most suitable table is generally determined based on the size of the quantization error of the data. Next, please refer back to Figure 3, which is a sequence of multiple first speech data (-6, -12, -1, 3, 5, 8, 15, 8, 5) with two zero crossing points. occur. Assuming a total of 8 tables are available for use, after comparison screening, Tables 2 and 3 are the most suitable tables for the (-6, -12, -1) sequence and (3, 5, 8, 15 respectively). , 8, 5) sequence. Suitable quantification table selections are well known to those skilled in the art and will not be described again.

首先，經過第一個零交越點後，先遇到(-6、-13、-1)時，先在前面取負的符號資料612為1，再把(-6、-13、-1)取絕對值變成(6、13、1)，首先表2的訊框標記資料606先設為000，最後再經由查表2後可以得到索引碼為(4、8、1)，最後再對映表2的數字資料得到的序列為(0100、1000、0001)。之後，再加上訊框標記資料606的值000、符號資料612的值1。最後得到的串接語音資料620為(000、1、0100、1000、0001)。First, after the first zero crossing point, when (-6, -13, -1) is encountered first, the negative symbol data 612 is taken as 1 in the front, and then (-6, -13, -1) The absolute value becomes (6, 13, 1). First, the frame mark data 606 of Table 2 is first set to 000, and finally, after looking up the table 2, the index code is (4, 8, 1), and finally The sequence obtained by mapping the digital data of Table 2 is (0100, 1000, 0001). Thereafter, the value 000 of the frame mark material 606 and the value 1 of the symbol data 612 are added. The resulting concatenated speech material 620 is (000, 1, 0100, 1000, 0001).

接著，經過第二個零交越點後，表3的訊框標記資料606先設為01，最後看(3、5、8、15、8、5)，本符號資料612為0代表為正值，再對映表3，找最接近的表格碼，此為熟習該項技藝所熟知，可以得到索引碼為(1、2、3、5、3、2)，最後再對映表3的數字資料的編碼為(0001、0010、0011、0101、0011、0010)。之後，再加上訊框標記資料606的值001以及符號資料612的值0得到的串接語音資料為(001、0、0001、0010、0011、0101、0011、0010)。Then, after the second zero crossing point, the frame mark data 606 of Table 3 is first set to 01, and finally (3, 5, 8, 15, 8, 5), and the symbol data 612 is 0 for positive Value, then map 3, find the closest form code, which is well known in the art, you can get the index code (1, 2, 3, 5, 3, 2), and finally map 3 The code of the digital data is (0001, 0010, 0101, 0101, 0011, 0010). Then, the concatenation voice data obtained by adding the value 001 of the frame mark data 606 and the value 0 of the symbol data 612 is (001, 0, 0001, 0010, 0011, 0101, 0011, 0010).

接著，請參考第7圖，其最後再將正的與負的訊框編碼資料結合並加上訊框切換控制碼1111得到(1111、000、1、0100、1000、0001、1111、001、0、0001、0010、0011、0101、0011、0010)完整的語音資料序列字串。Next, please refer to FIG. 7 , and finally combine the positive and negative frame coded data and add the frame switch control code 1111 to obtain (1111, 000, 1, 0100, 1000, 0001, 1111, 001, 0). , 0001, 0010, 0101, 0101, 0011, 0010) complete speech data sequence string.

在前述的實施例中，透過將起始碼當中增設一個位元的符號資料，即可省略後續帶正負號數位碼的符號位元。當每兩個零交越點的點數越多，可省略的資料量越多。In the foregoing embodiment, by adding a symbol data of one bit to the start code, the symbol bit of the subsequent signed digital code can be omitted. When the number of points per two zero crossing points is increased, the amount of data that can be omitted is increased.

從另一個觀點而言，以4bit的量化編碼為例，以先前技術經過編碼後的量化結果，每4個bit的數位碼當中，皆包含有1個符號資料，數值資料只有3bit。而在本發明中，在暨有的4bit架構當中，則可將原先的4個bit全部運用為數值資料。在此類查表法的應用中，可在相同的資料量的基礎下，將查表的結果，亦即，與音訊號編碼的解析度提高將近1倍，大幅提升語音訊號編碼的品質。From another point of view, taking 4 bit quantization coding as an example, the coded quantization result of the prior art includes one symbol data for each 4-bit digital code, and the numerical data is only 3 bits. In the present invention, in the 4 bit architecture of the cum, the original 4 bits can be used as the numerical data. In the application of such a look-up table method, the result of the look-up table, that is, the resolution of the audio signal coding can be nearly doubled under the same amount of data, and the quality of the voice signal coding is greatly improved.

前述所揭露者，係為編碼裝置的部分。接著，請參考第8圖，其說明了本發明之音訊量化解碼模組，可將本發明所編碼的串接語音資料予以解碼。音訊量化解碼模組包含：資料解碼器410與反量化器420。資料解碼器410讀取第二記憶體300內的第二編碼資料串並進行解碼而產生複數個第二解碼資料串，每個該第二解碼資料串包含：一第二訊框標記資料、一第二符號資料、複數個第二數字資料；及一反量化器420，連接該資料解碼器，接收該些第二解碼資料串，並依據該第二訊框標記資料、該第二符號資料之值進行該些第二數字資料之反量化而依序產生複數個數位第二語音資料後，儲存於第三記憶體510。The foregoing disclosure is part of an encoding device. Next, please refer to FIG. 8, which illustrates the audio quantization decoding module of the present invention, which can decode the serial voice data encoded by the present invention. The audio quantization decoding module includes a data decoder 410 and an inverse quantizer 420. The data decoder 410 reads the second encoded data string in the second memory 300 and performs decoding to generate a plurality of second decoded data strings, each of the second decoded data strings comprising: a second frame marking material, a second symbol data, a plurality of second digital data; and an inverse quantizer 420, connected to the data decoder, receiving the second decoded data strings, and according to the second frame marking data, the second symbol data The values are inverse quantized by the second digital data to sequentially generate a plurality of digital second speech data, and then stored in the third memory 510.

實務上，第二記憶體300與第三記憶體510可以是一個相同記憶體當中的不同區塊。In practice, the second memory 300 and the third memory 510 may be different blocks in the same memory.

例如：經過資料解碼器410解碼後可得到前述範例1的語音資料序列字串(1111、000、1、0100、1000、0001、1111、001、0、0001、0010、0011、0101、0011、0010)。For example, after decoding by the data decoder 410, the speech data sequence string of the foregoing example 1 can be obtained (1111, 000, 1, 0100, 1000, 0001, 1111, 001, 0, 0001, 0010, 0011, 0101, 0011, 0010). ).

接著，先去掉語音資料序列字串的訊框切換控制碼1111，再接著反量化器420取符合資料的表2，且取符號資料為1，代表接下來的數字資料會是負值，再經由表2得到索引碼為(4、8、1)，最後對映表2則可得到表格碼(6、12、1)，由於符號資料612為1代表為負，所以得到的多個語音資料為(-6、-12、-1)。相同的，第二個序列中，先先去掉語音資料序列字串的訊框切換控制碼1111，且001代表第二個量化表(表3)，符號資料612為0，索引碼為(2、3、4、8、4、2)，最後再個別對應表3，可個別得到表格碼(5、8、12、25、12、5)。最後反量器將還原資料為多個第一語音資料(-5、-12、-1、5、8、12、25、12、5)。最後，在將多個第一語音資料儲存到第三記憶體510中。Then, the frame switching control code 1111 of the voice data sequence string is first removed, and then the inverse quantizer 420 takes the table 2 of the matching data, and takes the symbol data as 1, indicating that the next digital data will be a negative value, and then Table 2 obtains the index code (4, 8, 1), and finally the mapping table 2 can get the table code (6, 12, 1). Since the symbol data 612 is 1 and is negative, the obtained multiple voice data is (-6, -12, -1). In the same, in the second sequence, the frame switching control code 1111 of the speech data sequence string is first removed, and 001 represents the second quantization table (Table 3), the symbol data 612 is 0, and the index code is (2). 3, 4, 8, 4, 2), and finally corresponding to Table 3, the form code (5, 8, 12, 25, 12, 5) can be obtained individually. Finally, the inverse meter will restore the data to a plurality of first voice data (-5, -12, -1, 5, 8, 12, 25, 12, 5). Finally, a plurality of first voice materials are stored in the third memory 510.

請參考第9圖，本發明的音訊量化編碼之流程圖，包含以下的步驟：步驟110：讀取複數個數位第一語音資料並進行複數個零交越點判斷而依序產生複數個第一符號資料，並將該些數位第一語音資料切割為複數個訊框。Referring to FIG. 9, the flowchart of the audio quantization coding of the present invention includes the following steps: Step 110: reading a plurality of digits of the first voice data and performing a plurality of zero-crossing point judgments to sequentially generate a plurality of firsts. Symbol data, and cutting the digital first speech data into a plurality of frames.

在步驟110中，更可對該些數位第一語音資料進行減碼之動作。In step 110, the digitizing the first voice data may be further coded.

步驟120：依序接收該些訊框與該些第一符號資料，並將每次所接收之該訊框所包括之該些數位第一語音資料量化後產生複數個第一數字資料，並依據該些訊框所量化之結果對應產生複數個第一訊框標記資料。Step 120: sequentially receiving the frames and the first symbol data, and quantizing the digital first speech data included in the received frame each time to generate a plurality of first digital data, and according to The quantized results of the frames correspond to generating a plurality of first frame tag data.

步驟130：接收該些第一數字資料、該些第一符號資料與該些第一訊框標記資料並編碼成複數個第一編碼資料串，每個該第一編碼資料串對應該訊框並包含該第一訊框標記資料、該第一符號資料、該訊框所對應之該些第一數字資料。Step 130: Receive the first digital data, the first symbol data, and the first frame label data, and encode the plurality of first encoded data strings, each of the first encoded data strings corresponding to the frame The first frame tag data, the first symbol data, and the first digital data corresponding to the frame are included.

其中第一語音資料以對映查法方式求得數字資料。其中零交越點的計算係利用連續二個第一語音資料相乘進行判斷，當連續二個第一語音資料相乘後為負值，代表零交越點成立。The first voice data is obtained by means of a mapping method. The calculation of the zero crossing point is judged by multiplying two consecutive first speech data, and when the two consecutive first speech data are multiplied, the value is negative, indicating that the zero crossing point is established.

請參考第10圖，本發明的音訊量化解碼之流程圖，包含以下的步驟：步驟210：讀取一第二編碼資料串並進行解碼而產生複數個第二解碼資料串，每個該第二解碼資料串包含：一第二訊框標記資料、一第二符號資料、複數個第二數字資料。Referring to FIG. 10, a flowchart of audio quantization decoding according to the present invention includes the following steps: Step 210: Read a second encoded data string and perform decoding to generate a plurality of second decoded data strings, each of the second The decoded data string includes: a second frame mark data, a second symbol data, and a plurality of second digital materials.

步驟220：接收該些第二解碼資料串，並依據該第二訊框標記資料、該第二符號資料之值進行該些第二數字資料之反量化而依序產生複數個數位第二語音資料。Step 220: Receive the second decoded data strings, and perform inverse quantization of the second digital data according to the second frame marking data and the second symbol data to sequentially generate a plurality of digital second voice data. .

請參考第11圖，本發明的音訊量化編解碼之流程圖，包含以下的步驟：步驟310：讀取該些數位第一語音資料並進行複數個零交越點判斷而依序產生複數個第一符號資料，並將該些數位第一語音資料切割為複數個訊框。Please refer to FIG. 11 , the flow chart of the audio quantization codec of the present invention includes the following steps: Step 310: Read the digital first speech data and perform a plurality of zero crossing point judgments to sequentially generate a plurality of numbers. a symbol data, and cutting the digital first speech data into a plurality of frames.

在步驟310中，更可對該些數位第一語音資料進行減碼之動作。In step 310, the digitizing the first voice data may be further coded.

步驟320：依序讀取該些訊框與該些第一符號資料，並將每次所接收之該訊框所包括之該些數位第一語音資料量化後產生複數個第一數字資料，並依據該些訊框所量化之結果對應產生複數個第一訊框標記資料。Step 320: sequentially reading the frames and the first symbol data, and quantizing the digital first speech data included in the received frame to generate a plurality of first digital data, and A plurality of first frame tag data are generated corresponding to the quantized results of the frames.

步驟330：接收該些第一數字資料、該些第一符號資料與該些第一訊框標記資料並編碼成複數個第一編碼資料串，每個該第一編碼資料串對應該訊框並包含該第一訊框標記資料、該第一符號資料、該訊框所對應之該些第一數字資料。Step 330: Receive the first digital data, the first symbol data, and the first frame label data, and encode the plurality of first encoded data strings, each of the first encoded data strings corresponding to the frame The first frame tag data, the first symbol data, and the first digital data corresponding to the frame are included.

步驟340：讀取一第二編碼資料串並進行解碼而產生複數個第二解碼資料串，每個該第二解碼資料串包含：一第二訊框標記資料、一第二符號資料、複數個第二數字資料。Step 340: Read a second encoded data string and perform decoding to generate a plurality of second decoded data strings, each of the second decoded data strings: a second frame mark data, a second symbol data, and a plurality of Second digital data.

步驟350：接收該些第二解碼資料串，並依據該第二訊框標記資料、該第二符號資料之值進行該些第二數字資料之反量化而依序產生複數個數位第二語音資料。Step 350: Receive the second decoded data strings, and perform inverse quantization of the second digital data according to the second frame mark data and the second symbol data to sequentially generate a plurality of digital second voice data. .

其中該些第一數字資料與該第一訊框標記資料係利用一或多個量化表以該些第一語音資料進行查表而產生。其中該些第二語音資料係利用一或多個量化表以該第二訊框標記資料、該第二符號資料、該些第二數字資料進行查表而產生。其中該零交越點的計算係以連續二個該第一語音資料相乘後為負值，判斷該零交越點成立。其中該第一符號資料、該第二符號資料係代表一正值或一負值。The first digital data and the first frame mark data are generated by using the one or more quantization tables to perform lookup by using the first voice data. The second voice data is generated by using the one or more quantization tables to perform the lookup table by using the second frame mark data, the second symbol data, and the second digital data. The zero crossing point is calculated by multiplying two consecutive first speech data to a negative value, and determining that the zero crossing point is established. The first symbol data and the second symbol data represent a positive value or a negative value.

接著，請參考本發明的語音編解碼系統圖，如第12A與12B圖所示，其分別為運用第5A圖、第5B圖之語音編碼器之編解碼器實施例。綜合第12A、12B圖的實施例，音訊量化編解碼裝置係運用記憶體(包括了第一記憶體110、第二記憶體300與第三記憶體510)以進行訊號編解碼，第一記憶體110紀錄有複數個數位第一語音資料，第二記憶體300記錄有第二編碼資料串。編解碼器包含：訊號分割器250、量化器230、資料編碼器240、資料解碼器410與反量化器420。Next, please refer to the speech codec system diagram of the present invention, as shown in FIGS. 12A and 12B, which are respectively codec embodiments of the speech encoder using the 5A and 5B diagrams. In the embodiment of FIGS. 12A and 12B, the audio quantization codec device uses the memory (including the first memory 110, the second memory 300, and the third memory 510) for signal encoding and decoding, the first memory. The 110 record has a plurality of digits of the first voice data, and the second memory 300 records the second coded data string. The codec includes a signal divider 250, a quantizer 230, a data encoder 240, a data decoder 410, and an inverse quantizer 420.

其中，訊號分割器250讀取多個數位第一語音資料並進行複數個零交越點判斷而依序產生複數個第一符號資料，並將多個數位第一語音資料切割為複數個訊框。量化器230連接訊號分割器250，接收每個訊框所對應的多個數位第一語音資料與第一符號資料，並將每次所接收之訊框所所對應之多個數位第一語音資料量化後產生複數個第一數字資料，並依據訊框量化之結果對應產生一第一訊框標記資料。資料編碼器240連接量化器230與訊號分割器250，接收量化器230對每個訊框所產生多個第一數字資料、第一符號資料與第一訊框標記資料並編碼成一第一編碼資料串。資料解碼器410連接第二記憶體300，由其讀取第二編碼資料串並進行解碼而產生複數個第二解碼資料串。每個第二解碼資料串包含：第二訊框標記資料、第二符號資料、複數個第二數字資料。反量化器420連接資料解碼器410，接收第二解碼資料串，並依據第二訊框標記資料、第二符號資料之值進行多個第二數字資料之反量化而依序產生複數個數位第二語音資料後，儲存至第三記憶體510。The signal divider 250 reads the plurality of digits of the first voice data and performs a plurality of zero-crossing point determinations to sequentially generate the plurality of first symbol data, and cuts the plurality of digits of the first voice data into a plurality of frames. . The quantizer 230 is connected to the signal divider 250, and receives a plurality of digital first voice data and a first symbol data corresponding to each frame, and receives a plurality of digital first voice data corresponding to each received frame. After the quantization, a plurality of first digital data are generated, and a first frame mark data is generated according to the result of the frame quantization. The data encoder 240 is connected to the quantizer 230 and the signal divider 250. The receiving quantizer 230 generates a plurality of first digital data, first symbol data and first frame label data for each frame and encodes the first encoded data into a first encoded data. string. The data decoder 410 is connected to the second memory 300, and reads the second encoded data string and decodes it to generate a plurality of second decoded data strings. Each second decoded data string includes: a second frame mark data, a second symbol data, and a plurality of second digital materials. The inverse quantizer 420 is connected to the data decoder 410, receives the second decoded data string, and performs inverse quantization of the plurality of second digital data according to the values of the second frame marking data and the second symbol data to sequentially generate a plurality of digits. After the two voice data, the data is stored in the third memory 510.

實務上，第一記憶體110、第二記憶體300與第三記憶體510可以是一個相同記憶體當中的不同區塊。In practice, the first memory 110, the second memory 300, and the third memory 510 may be different blocks among the same memory.

雖然本發明之較佳實施例揭露如上所述，然其並非用以限定本發明，任何熟習相關技藝者，在不脫離本發明之精神和範圍內，當可作些許之更動與潤飾，因此本發明之專利保護範圍須視本說明書所附之申請專利範圍所界定者為準。While the preferred embodiment of the invention has been described above, it is not intended to limit the invention, and it is obvious to those skilled in the art that the invention may be modified and modified without departing from the spirit and scope of the invention. The patent protection scope of the invention is subject to the definition of the scope of the patent application attached to the specification.

100．．．語音輸入訊號100. . . Voice input signal

110．．．第一記憶體110. . . First memory

200．．．語音編碼器200. . . Speech encoder

210．．．類比數位轉換器210. . . Analog digital converter

220．．．減碼轉換器220. . . Minus converter

230．．．量化器230. . . Quantizer

240．．．資料編碼器240. . . Data encoder

250．．．訊號分割器250. . . Signal splitter

251．．．暫存器251. . . Register

300．．．第二記憶體300. . . Second memory

400．．．語音解碼器400. . . Speech decoder

410．．．資料解碼器410. . . Data decoder

420．．．反量化器420. . . Inverse quantizer

500．．．語音輸出訊號500. . . Voice output signal

510．．．第三記憶體510. . . Third memory

600．．．數位碼600. . . Digital code

606．．．訊框標記資料606. . . Frame marker data

612．．．符號資料612. . . Symbolic data

614．．．數字資料614. . . Digital data

624．．．第一編碼資料串624. . . First encoded data string

626．．．第一編碼資料串626. . . First encoded data string

700．．．音訊編解碼器700. . . Audio codec

第1圖係為先前語音編碼與解碼系統圖(先前技術)；Figure 1 is a diagram of a prior speech coding and decoding system (prior art);

第2A圖係為先前語音編碼器之功能方塊圖之第一實施例(先前技術)；Figure 2A is a first embodiment of a functional block diagram of a prior speech coder (prior art);

第2B圖係為先前語音編碼器之功能方塊圖之第二實施例(先前技術)；Figure 2B is a second embodiment of the functional block diagram of the prior speech coder (prior art);

第3圖係為先前類比數位取樣圖(先前技術)；Figure 3 is a previous analog digital sampling map (prior art);

第4圖係為先前串接語音資料圖(先前技術)；Figure 4 is a diagram of a previously concatenated speech data (previous technique);

第5A圖係為本發明之語音編碼器之功能方塊圖之第一實施例；Figure 5A is a first embodiment of a functional block diagram of a speech coder of the present invention;

第5B圖係為本發明之語音編碼器之功能方塊圖之第二實施例；Figure 5B is a second embodiment of the functional block diagram of the speech coder of the present invention;

第6A圖係為本發明之語音編碼器之功能方塊圖之第三實施例；Figure 6A is a third embodiment of the functional block diagram of the speech coder of the present invention;

第6B圖係為本發明之語音編碼器之功能方塊圖之第四實施例；Figure 6B is a fourth embodiment of the functional block diagram of the speech coder of the present invention;

第7圖係為本發明之串接語音資料之實施例圖；Figure 7 is a diagram showing an embodiment of the serial voice data of the present invention;

第8圖係為本發明之語音解碼器之功能方塊圖；Figure 8 is a functional block diagram of the speech decoder of the present invention;

第9圖係為本發明之音訊量化編碼之流程圖；Figure 9 is a flow chart of the audio quantization coding of the present invention;

第10圖係為本發明之音訊量化解碼之流程圖；Figure 10 is a flow chart of the audio quantization decoding of the present invention;

第11圖係為本發明之音訊量化編解碼之流程圖；Figure 11 is a flow chart of the audio quantization codec of the present invention;

第12A圖係為本發明之語音編解碼器之功能方塊圖之第一實施例；及Figure 12A is a first embodiment of a functional block diagram of a speech codec of the present invention; and

第12B圖係為本發明之語音編解碼器之功能方塊圖之第二實施例。Figure 12B is a second embodiment of the functional block diagram of the speech codec of the present invention.

110．．．第一記憶體110. . . First memory

200．．．語音編碼器200. . . Speech encoder

230．．．量化器230. . . Quantizer

240．．．資料編碼器240. . . Data encoder

250．．．訊號分割器250. . . Signal splitter

300．．．第二記憶體300. . . Second memory

Claims

An audio quantization codec device uses a memory for signal encoding and decoding. The memory records a plurality of digital first speech data and a second encoded data string, and includes: an audio quantization coding module, including: a signal a splitter, connected to the memory, reading the digital first speech data and performing a plurality of zero crossing point judgments to sequentially generate a plurality of first symbol data, and according to the distribution of the first symbol data The digital first voice data is cut into a plurality of frames, and each of the digital first voice data contained in the frame belongs to the same positive half cycle or negative half cycle, and has the same first symbol data; a quantizer And connecting the signal divider to receive the digital first voice data corresponding to each of the frames and the first symbol data, and each of the digits of the first voice data corresponding to the received frame Quantizing and generating a plurality of first digital data, and calculating the quantization error according to the digital first speech data belonging to the same positive half cycle or negative half cycle corresponding to the frame a first frame marking data; and a data encoder connecting the quantizer and the signal divider, and receiving the first digital data, the first symbol data generated by the quantizer for each frame The first frame marking data is encoded into a first encoded data string; and an audio quantization decoding module includes: a data decoder connected to the memory, reading the second encoded data string and decoding to generate a plurality of a second decoded data string, each of the second decoded data strings includes: a second frame mark data, a second symbol data, and a plurality of second digital data; and an inverse quantizer connected to the data decoder to receive The second decoded data string is subjected to inverse quantization of the second digital data according to the second frame mark data and the second symbol data to sequentially generate a plurality of digital second voice data.

The audio quantization codec device of claim 1, further comprising: a down code converter, coupled between the memory and the signal divider, and performing the code reduction operation on the first voice data.

The audio quantization codec device of claim 1, further comprising: a down code converter connected between the memory and the signal splitter, and connected to the quantizer to subtract the first voice data The action of the code.

The audio quantization and codec device of claim 1, wherein the first encoded data string corresponds to the frame and includes the first frame mark data, the first symbol data, and the first corresponding to the frame Digital data.

The audio quantization and codec device of claim 1, wherein the digits of the first voice data corresponding to the frame read by the quantizer are read from the memory.

The audio-quantity codec device of claim 1, wherein the signal divider comprises a register, the register is configured to store the digitized first voice data corresponding to the frame, and the quantizer is The digitized first voice data corresponding to the read frame is read from the register of the signal divider.

The audio quantization and codec device of claim 1, wherein the quantizer uses one or more quantization tables to find the first speech data corresponding to each frame by looking up the table and calculating the quantization error. The quantization table corresponding to the minimum quantization error is obtained, and the first digital data is obtained according to the quantization table.

The audio quantization and codec device of claim 1, wherein the inverse quantizer uses the one or more quantization tables to include the second frame marker data, the second symbol data, and the second symbol data included in each of the decoded data strings. The second digital data corresponding to the quantization table corresponding to the second frame mark data corresponding to the second voice data.

The audio quantization codec device of claim 1, wherein each of the zero crossing points is calculated The two consecutive first speech data are multiplied, and when they are negative, the zero crossing point is established.

The audio quantization and codec device of claim 1, wherein the first symbol data and the second symbol data represent a positive value or a negative value.

An audio quantization coding method includes: reading a plurality of digital first speech data and performing a plurality of zero crossing point judgments to sequentially generate a plurality of first symbol data, and according to the distribution of the first symbol data The digital first voice data is cut into a plurality of frames, so that the digital first voice data included in each frame belongs to the same positive half cycle or negative half cycle, and has the same first symbol data; The digitized first voice data and the first symbol data corresponding to the frame, and quantizing the digitized first voice data corresponding to the frame received each time to generate a plurality of first digital data And correspondingly generating a first frame mark data according to the calculation of the quantization error corresponding to the digital first voice data belonging to the same positive half cycle or negative half cycle corresponding to the frame; and receiving corresponding to each of the The first digital data, the first symbol data and the first frame label data are generated by the frame and encoded into a first encoded data string.

The method as claimed in claim 11, wherein the first encoded data string corresponds to the frame and includes the first frame mark data, the first symbol data, and the first numbers corresponding to the frame. data.

The audio quantization coding method of claim 11, further comprising: performing a code reduction operation on the digital first speech data.

The audio quantization coding method of claim 11, wherein the digital data and the frame mark data are generated by using the one or more quantization tables to perform a lookup table using the first voice data.

The audio quantization coding method according to claim 11, wherein the zero-crossing point judgment determines that the zero-crossing point is established by multiplying two or more first speech data by a negative value.

An audio quantization decoding method includes: reading a second encoded data string and decoding to generate a plurality of second decoded data strings, each of the second decoded data strings comprising: a second frame mark data, a second a symbol data, a plurality of second digital data, each of the second decoded data strings belonging to the same positive half cycle or negative half cycle; and receiving the second decoded data string, and according to the second frame marking data, the first The value of the two symbol data is subjected to inverse quantization of the second digital data to sequentially generate a plurality of digits of the second voice data.

The audio quantization decoding method of claim 16, wherein the second voice data is generated by using one or more quantization tables to perform table lookup by using the frame mark data, the symbol data, and the digital data.

An audio quantization codec method includes: reading the digital first speech data and performing a plurality of zero crossing point judgments to sequentially generate a plurality of first symbol data, and according to the distribution of the first symbol data The digital first speech data is cut into a plurality of frames, such that the digital first speech data included in each frame belongs to the same positive half cycle or negative half cycle, and has the same first symbol data; Each of the digitized first voice data corresponding to the frame and the first symbol data, and each of the digitized first voice data corresponding to the received frame is quantized to generate a plurality of first Digital data, and corresponding to the calculation of quantization error corresponding to the digital first data of the same positive half cycle or negative half cycle corresponding to the frame, corresponding to generating a first frame mark data; receiving corresponding to each The first digital data generated by the frame, the first symbol data and the first frame label data are encoded into a first encoded data string; a second encoded data string is read and decoded A plurality of second decoding data a string, each of the second decoded data strings comprising: a second frame mark data, a second symbol data, a plurality of second digital data; and receiving the second decoded data strings, and according to the second frame The value of the mark data and the second symbol data is subjected to inverse quantization of the second digital data to sequentially generate a plurality of digital second voice data.

The method for encoding and decoding an audio code according to claim 18, wherein the first encoded data string corresponds to the frame and includes the first frame mark data, the first symbol data, and the first corresponding to the frame. Digital data.

The audio quantization and coding and decoding method of claim 18, further comprising: performing a code reduction operation on the digital first speech data.

The method of claim 18, wherein the first digital data and the first frame marking data use the one or more quantization tables to include the first voices included in each of the frames. The data is obtained by looking up the table and calculating the quantization error, and the quantization table corresponding to the minimum quantization error is obtained, and the first digital data is obtained according to the quantization table.

The audio quantization and coding and decoding method of claim 18, wherein the second voice data uses the one or more quantization tables to include the second frame marker data and the second symbol included in each of the decoded data strings. The data and the second digital data are generated according to a corresponding one of the quantization tables corresponding to the second frame mark data.

The audio quantization codec method of claim 18, wherein each of the zero crossing points is calculated by multiplying two consecutive first speech data, and when it is a negative value, the zero crossing point is established.

The audio quantization decoding method of claim 18, wherein the first symbol data and the second symbol data represent a positive value or a negative value.