KR0144935B1

KR0144935B1 - Coding and decoding apparatus for bit rate

Info

Publication number: KR0144935B1
Application number: KR1019940040670A
Authority: KR
Inventors: 김상욱
Original assignee: 김광호; 삼성전자주식회사
Priority date: 1994-12-31
Filing date: 1994-12-31
Publication date: 1998-08-17
Also published as: KR960027372A

Abstract

본 발명에 의한 비트율 조정이 가능한 부호화 및 복호화장치에서는 입력되는 오디오신호와 비디오신호를 사용자의 선택에 따라 다양한 비트율로 별도로 부호화하고, 부호화된 비트스트림을 형성하기 위한 부호화기와 부호화기에서 형성된 비트스트림을 오디오신호와 비디오신호로 분류하여 복원하기 위한 복호화기로 구성된다.In the encoding and decoding apparatus capable of adjusting the bit rate according to the present invention, the input audio signal and the video signal are separately encoded at various bit rates according to a user's selection, and the bit stream formed by the encoder and the encoder for forming the encoded bit stream are audio. And a decoder for classifying and restoring the signal and the video signal.

따라서 저장되어 있는 데이타 또는 전송되고 있는 데이타를 미리 정해진 몇 개의 비트율 중 하나로 선택해서 부호화 혹은 복호화하고, 사용자의 선택에 따라 오디오신호나 비디오신호에 대해 별도의 비트율 조정이 가능하고, 오디오신호 및 비디오신호 상호간의 비트율 조절이 가능하다.Therefore, the stored or transmitted data can be selected and encoded or decoded at one of several predetermined bit rates, and the bit rate can be adjusted separately for the audio signal or the video signal according to the user's selection. Mutual bit rate adjustment is possible.

Description

Encoding and decoding device with adjustable bit rate

제1도는 본 발명에 의한 비트율 조정이 가능한 부호화장치의 블록도.1 is a block diagram of an encoding apparatus capable of adjusting a bit rate according to the present invention.

제2도는 본 발명에 의한 비트율 조정이 가능한 복호화장치의 블록도.2 is a block diagram of a decoding apparatus capable of adjusting a bit rate according to the present invention.

제3도는 제1도에 도시된 부호화장치의 상세블럭도.3 is a detailed block diagram of the encoding apparatus shown in FIG.

제4도는 제2도에 도시된 복호화장치의 상세블록도4 is a detailed block diagram of the decoding apparatus shown in FIG.

제5a∼5e도는 비트 사용예를 나타낸 도면.5A to 5E are diagrams showing examples of bit use.

제6a,6b도는 양자화 및 양자화단계의 예를 나타낸 도면.6A and 6B show examples of quantization and quantization steps.

제7a∼7c도는 각 모드에 대한 처리단위 프레임을 나타낸 도면.7A to 7C show processing unit frames for each mode.

본 발명은 비트율 조정이 가능한 부호화 및 복호화장치에 관한 것으로서, 특히 사용자가 선택적으로 비트율을 조정할 수 있는 부호화 및 복호화장치에 관한 것이다.The present invention relates to an encoding and decoding apparatus capable of adjusting a bit rate, and more particularly, to an encoding and decoding apparatus capable of selectively adjusting a bit rate by a user.

오디오기기와 비디오기기는 사용자가 필요시에 음성정보나 화상정보를 얻을 수 있는 기계장치로서, 인간의 정보 입수 채널인 오감 가운데 귀와 눈을 통해 정보를 얻게 하는 장치이다.Audio and video equipment is a device that allows a user to obtain voice information or image information when needed, and is a device that obtains information through the ears and eyes among the five senses, which are channels of human information acquisition.

자연계에 존재하는 신호들은 일반적으로 연속된 신호형태인 아날로그로 존재한다. 이러한 아날로그 신호들은 재현장치 또는 전송장치에 사용시 열화가 쉽게 발생하는 문제와, 발생가능한 오차신호에 대한 보상이 어려운 문제, 시스템을 구성했을 경우 시스템의 크기가 커지는 문제등을 가지고 있다. 이와 같은 문제들은 아날로그신호를 샘플링과 양자화처리를 통해 디지탈신호로 변형해 줌으로써 개선할 수 있다.The signals present in nature generally exist in analog form, a continuous signal form. These analog signals have a problem of easily deterioration when used in a reproducing apparatus or a transmission apparatus, a problem of difficult compensation for a possible error signal, and a problem of increasing the size of a system when a system is configured. These problems can be solved by transforming analog signals into digital signals through sampling and quantization.

입수되는 정보는 시간의 흐름에 따라 변화하기 때문에 매 시간간격마다 데이타를 입수해 주어야 하고, 이런 점을 고려하여 임의의 시스템을 꾸밀때 기준이 되는 데이타의 실시간처리에 적합한 샘플링 간격이 결정된다. 시스템이 구성되었을때 이러한 점들이 만족되어야 실시간 구현되었다고 하는데 이러한 사실의 일예로는 영화관에서 보는 영화의 필름이 일초에 최소한 몇장이 반복되어야 사람이 눈으로 볼때 연속된 정보로 인식한다는 점이나 CD의 샘플링 주파수를 44.1 kHz로 하는 것과 관계가 있다.Since the information obtained changes over time, data must be obtained at every time interval. In consideration of this, a sampling interval suitable for real-time processing of the reference data is determined when arranging an arbitrary system. When the system is configured, these points must be satisfied in real time. An example of this fact is that a film of a movie in a movie theater must be repeated at least several times a second to be recognized as continuous information by human eyes or sampling of a CD. It is related to setting the frequency to 44.1 kHz.

결정된 샘플링 간격에 의해 시간영역의 데이타들이 얻어지고, 샘플링된 값들을 양자화처리하여 디지탈신호로 나타낼 수 있다. 신호를 디지탈 방식으로 처리하면, 아날로그 방식에 비해 음질이 향상되는 효과가 있는 반면 아날로그 방식에 비해 데이타량이 증대되는 역효과가 있다.Data in the time domain are obtained by the determined sampling interval, and the sampled values may be quantized and represented as a digital signal. When the signal is processed in the digital method, the sound quality is improved compared to the analog method, while the data amount is increased compared to the analog method.

이러한 데이타의 증가에 대한 영향을 감소시키기 위해 데이타압축을 수행한다.Data compression is performed to reduce the impact of this increase in data.

만약 데이타 압축을 수행하지 않을 경우에는 영상정보 혹은 오디오정보의 전달에 있어서 많은 경우 채널이 요구하는 대역폭을 제공하지 못하기 때문에 전송이 불가능하다. 일예로, 영상의 경우 1000 x 1000 라인의 해상도를 갖는 그레이-스케일 영상에 있어서, 각 화소가 8비트로 구성되어 각 화소에 대해 256계조의 다른 인텐시티로 표현하는 경우, 압축을 하지 않으면 이 영상의 전송에 사용되는 비트수는 8백만비트에 달할 것이다. 일반적인 전화선의 경우, 1초에 9600 비트들을 전송할 수 있기 때문에 이 영상을 전송하기 위해서는 10분 이상이 소요된다. 다른 예로, 44.1 kHz로 샘플링되는 오디오의 경우 각 샘플링된 정보가 16비트로 구성되어, 65536 계조를 갖는다고 할 때 한개의 채널에 대한 비트수가 약 700 k 이고, 일반 전화선 사용시 약 73초가 소요되어 실시간처리가 불가능한 문제가 있다.If data compression is not performed, transmission of video information or audio information is impossible because the channel cannot provide the bandwidth required in many cases. For example, in a gray-scale image having a resolution of 1000 x 1000 lines in the case of an image, when each pixel is composed of 8 bits and represented with different intensity of 256 gray levels for each pixel, transmission of the image is performed without compression. The number of bits used will reach 8 million bits. In a typical telephone line, 9600 bits can be transmitted per second, so it takes more than 10 minutes to transmit this image. As another example, in the case of audio sampled at 44.1 kHz, each sampled information consists of 16 bits. When the 65536 gray level is used, the number of bits for one channel is about 700 k and about 73 seconds when using a general telephone line. There is an impossible problem.

이와 같은 데이타 전송에서는 전송채널의 특징에 따라 전송에 사용되는 비트율이 결정되고, 실시간 처리를 위한 비트율이 결정된다. 이러한 전송채널의 결정은 각 전송되는 데이타를 최적으로 복원해내기 위한 장치와 그 전송율을 최대한 효과적으로 사용하도록 부호화해주는 장치들의 개발을 하게끔 하였다. 이에 전송이나 저장한계를 효과적으로 사용하기 위해 데이타압축에 대한 많은 연구들이 수행되고 있다.In such data transmission, the bit rate used for transmission is determined according to the characteristics of the transmission channel, and the bit rate for real time processing is determined. The determination of the transport channel led to the development of devices for optimally reconstructing each transmitted data and encoding devices to use the data rate as effectively as possible. In order to effectively use transmission and storage limits, many studies on data compression have been conducted.

이러한 이유로 데이타 압축으로서, 파형 부호화, 변환 부호화, 대역분할 부호화, 프랙탈 코딩, 예측 부호화 및 움직임보상부호화 등 여러 방식들이 개발되어 있고, 동영상과 오디오의 기록 및 전송에 대한 표준화안을 만들기 위한 노력이 ISO 산하에서 ISC/IEC JTC1/SC29/WG11(MPEG)에 의해 1989년부터 시작되었다.For this reason, as data compression, various methods such as waveform coding, transform coding, band division coding, fractal coding, predictive coding, and motion compensation encoding have been developed, and efforts to create a standardization scheme for recording and transmitting video and audio are under ISO. Was started in 1989 by ISC / IEC JTC1 / SC29 / WG11 (MPEG).

그러나, 종래에는 전송채널이 결정되어 있는 경우 전송가능한 최대 비트율이 고정되어 있기 때문에 그 전송율에서 최적의 효과적인 특정 재현장치의 설계가 가능하였으나, 여러가지 시스템의 각 전송율에 맞도록 데이타를 형성해 보내 주는 방법은 개발되지 않았다. 즉, 어떤 정보가 전송된다고 할때 전송되는 데이타의 비트율을 사용자가 임의로 선택하여 복원하는 것이 불가능하였고, 특정 재현장치만이 복원이 가능하였다.However, in the related art, since the maximum bit rate that can be transmitted is fixed when a transmission channel is determined, it is possible to design a specific reproducing apparatus that is optimally effective at the transmission rate. Not developed. That is, when certain information is transmitted, it is impossible for the user to arbitrarily select and restore the bit rate of the data to be transmitted, and only a specific reproduction apparatus can restore it.

또한, 광디스크에 영상과 음성정보가 저장되어 있을때 영상과 음성정보에 대한 비트율을 선택적으로 조절해서 재현하는 것이 불가능하였다. 즉 전체 비트율 가운데 영상정보에 할당된 비트율과 음성정보에 할당된 비트율을 조절하면서 화질과 음질을 조절하는 처리가 불가능하였다.In addition, when video and audio information are stored on an optical disc, it is impossible to selectively adjust the bit rate for the video and audio information. That is, it was impossible to control image quality and sound quality while adjusting the bit rate allocated to the image information and the bit rate allocated to the audio information among the total bit rates.

따라서 본 발명의 목적은 상술한 문제점을 해결하기 위하여 저장되어 있는 데이타 또는 전송되고 있는 데이타를 미리 정해진 몇개의 비트율 중 하나로 선택해서 부호화 혹은 복호화하기 위한 장치를 제공하는데 있다.Accordingly, an object of the present invention is to provide an apparatus for encoding or decoding by selecting one of several predetermined bit rates of stored data or transmitted data in order to solve the above problems.

상기 목적을 달성하기 위하여 본 발명에 의한 비트율 조정이 가능한 부호화 및 복호화장치는 입력되는 오디오신호와 비디오신호를 사용자의 선택에 따라 다양한 비트율로 별도로 부호화하고, 부호화된 비트스트림을 형성하기 위한 부호화기; 및 상기 부호화기에서 형성된 비트스트림을 오디오신호와 비디오신호로 분류하여 복원하기 위한 복호화기를 포함하는 것을 특징으로 한다.In order to achieve the above object, an encoding and decoding apparatus capable of adjusting a bit rate according to the present invention includes an encoder for separately encoding an input audio signal and a video signal at various bit rates according to a user's selection, and forming an encoded bitstream; And a decoder for classifying and restoring the bitstream formed by the encoder into an audio signal and a video signal.

이하, 첨부된 도면을 참조하여 본 발명에 대하여 상세히 설명하기로 한다.Hereinafter, with reference to the accompanying drawings will be described in detail with respect to the present invention.

제1도는 본 발명에 의한 비트율 조정이 가능한 부호화장치의 블럭도로서, 비디오신호 처리부(11), 오디오신호 처리부(12), 제1 및 제2엔코더(13,14). 제어부(15)와 비트팩킹부(16)로 구성되며 상세블럭도는 제3도에 도시되어 있다.1 is a block diagram of a coding apparatus capable of adjusting a bit rate according to the present invention, wherein a video signal processor 11, an audio signal processor 12, and first and second encoders 13 and 14 are shown. It is composed of a control unit 15 and a bit packing unit 16 and a detailed block diagram is shown in FIG.

제2도는 본 발명에 의한 비트율 조정이 가능한 복호화장치의 블럭도로서, 비트 언팩킹부(21), 오디오신호 처리부(22), 비디오신호 처리부(23), 제1 및 제2디코더(24, 25)와 제어부(26)로 구성되며 상세블럭도는 제4도에 도시되어 있다.2 is a block diagram of a decoding apparatus capable of adjusting the bit rate according to the present invention. The bit unpacking unit 21, the audio signal processor 22, the video signal processor 23, the first and second decoders 24 and 25 are shown in FIG. ) And a control unit 26, the detailed block diagram of which is shown in FIG.

제5a∼5e도는 비트 사용예를 나타낸 도면으로서, 제5a도는 표준 비트를 사용한 경우, 제5b도는 오디오신호가 강조된 경우, 제5c도는 비디오신호가 강조된 경우, 제5d도는 비디오신호만 있는 경우, 제5e도는 오디오신호만 있는 경우를 각각 나타낸다.5a to 5e show examples of bit use, FIG. 5a shows standard bits, FIG. 5b shows audio signals highlighted, FIG. 5c shows video signals highlighted, and FIG. 5d shows video signals only. 5e shows a case where only an audio signal is present.

제6a,6b도는 양자화 및 양자화단계의 예를 나타낸 도면이다.6A and 6B are diagrams showing examples of quantization and quantization steps.

제7a∼7c도는 각 모드에 대한 처리단위 프레임을 나타낸 도면으로서, 제7a도는 고정된 한 유니트식 비트율을 증가하는 경우, 제7b도는 고정된 최대한의 유니트를 사용한 비트율 조정의 경우, 제7c도는 고정되지 않은 비트율 증가의 경우를 각각 나타낸다. 여기서, SNC는 동기화, nf는 정규화 팩터, bai는 비트 할당정보, ba2는 비트할당2이고, 2n은 0≤2n≤2k의값이다.7A to 7C show processing unit frames for each mode, and FIG. 7A shows a case where a fixed unitary bit rate is increased, and FIG. 7B shows a bit rate adjustment using a fixed maximum unit. Each case of bit rate increase that is not shown is shown. Here, SNC is synchronization, nf is normalization factor, bai is bit allocation information, ba2 is bit allocation 2, and 2n is 0≤2n≤2k.

그러면 본 발명의 동작을 제1도 내지 제7도를 참조하여 설명하면 다음과 같다.Next, the operation of the present invention will be described with reference to FIGS. 1 to 7.

본 발명은, 부호화가에 의해 오디오신호와 비디오신호를 각각 부호화하여 부호화된 비트스트림을 형성하고, 복호화기에 의해 비트스트림을 오디오신호와 비디오신호들로 분류하여 복원하는 형태로 되어 있다. 부호화장치는 제1 및 제3도에서와 같이, 시간/주파수 변환부, 비트율, 비트율 표현방식, 시간영역처리 혹은 주파수영역처리를 제어하는 제어부, 가장 비트율이 작은 경우에 대한 제1부호화부, 가장 비트율이 작은 경우에 대한 복호화부, 복호화된 신호와 원신호 사이의 차이를 계산하는 오차계산부, 제어부로부터 선택된 비트율 표현방식에 따라 모드 1, 2, 3 중 하나를 선택해 오차신호로부터 2차 비트스트림을 형성하는 2차 비트스트림 형성부와 가장 비트율이 작은 경우의 비트스트림과 2차 비트스트림을 이용하여 최종 비트스트림을 형성하는 제2부호화부로 이루어진다.According to the present invention, an audio signal and a video signal are encoded by an encoder to form an encoded bitstream, and the decoder classifies and restores the bitstream into audio and video signals. As shown in FIGS. 1 and 3, the encoding apparatus includes a time / frequency converter, a controller for controlling a bit rate, a bit rate representation method, a time domain processing or a frequency domain processing, a first encoder for the smallest bit rate, Decoding unit for the case where the bit rate is small, error calculation unit for calculating the difference between the decoded signal and the original signal, the second bit stream from the error signal by selecting one of modes 1, 2, 3 according to the bit rate representation method selected from the control The secondary bitstream forming unit for forming the first bitstream and the second encoding unit for forming the final bitstream using the secondary bitstream and the bitstream having the smallest bit rate.

한편, 복호화장치는 제2도 및 제4도에서와 같이 입력신호를 동기하여 1차 비트와 2차 비트로 구분하여 1차 비트들을 복호화하고, 1차 비트들의 부가정보를 이용해 2차 비트들을 제어부의 제어신호에 따른 비트율에 따라 읽어들여 복호화하는 처리를 하고, 1차 비트들에 의한 정보에 2차 비트들에 의한 정보를 합쳐 스케일러빌러티(scalability)에 대한 구현을 하는 비트형성부로 이루어진다.Meanwhile, the decoding apparatus decodes primary bits by dividing the input signal into primary bits and secondary bits by synchronizing the input signal as shown in FIGS. 2 and 4, and uses the additional information of the primary bits to control the secondary bits. A bit forming unit is configured to read and decode according to a bit rate according to a control signal, and implements scalability by combining information of secondary bits with information of secondary bits.

먼저 부호화장치를 살펴보면, 오디오, 비디오, 또는 a/v를 선택해 저장하고자 하는 데이타의 종류를 결정한다. 저장하고자 하는 데이타의 종류가 결정되면 채널의 저장능력을 고려하여 저장될 수 있는 비트율을 결정하고, 그 비트율에 따라 프레임의 크기를 정한다. 일예로 비디오에 사용되는 비트율(Vbit)을 k로 정하는 경우, 채널의 특성에 의해 결정된 오디오/비디오에 사용되는 비트율(AVbit)을 이용하여 오디오에 사용되는 비트율(Abit)을 계산하는 과정은 Vbit가 AVbit 보다 작거나 같은 경우 Abit=AVbit-Vbit로 나타낼 수 있다.First, referring to the encoding apparatus, audio, video, or a / v is selected to determine the type of data to be stored. When the type of data to be stored is determined, the bit rate that can be stored is determined in consideration of the storage capacity of the channel, and the frame size is determined according to the bit rate. For example, if the bit rate (Vbit) used for video is set to k, the process of calculating the bit rate (Abit) used for audio using the bit rate (AVbit) used for audio / video determined by the characteristics of the channel is Vbit. If it is less than or equal to AVbit, it can be represented as Abit = AVbit-Vbit.

즉, 제5a도에서 오디오신호의 비트율을 높여서 오디오를 보다 양질로 듣고자 할 경우에는 Vbit 값을 줄여 제5b도와 같이 사용하고, 이와는 달리 비디오신호의 비트율을 높여서 보다 양질의 화질로 보고자 할 경우 Vbit를 늘려 제5c도에서와 같이 사용한다. 극단적인 경우 AVbit 전체를 비디오 혹은 오디오에 사용하며 제5d도 혹은 제5e도와 같게 된다.That is, in the case of FIG. 5a, if you want to listen to the audio with higher quality by increasing the bit rate of the audio signal, decrease the Vbit value and use it as in FIG. 5b.In contrast, if you want to see the higher quality of the video signal by increasing the bit rate of the video signal, Vbit. Increase the to use as shown in Figure 5c. In the extreme case, the entire AVbit is used for video or audio, which is the same as either 5d or 5e.

앞에서 정해진 데이타의 종류에 따라 결정된 프레임의 크기에서 미리 정해진 최소 비트율 Abit(min), Vbit(min)에 따라 데이타 압축을 행한다. 미리 정해진 최소 비트율은 스케일러빌러티를 제공하기 위한 기초복원신호를 만드는데 사용된다. 미리 정해진 최소 비트율 외의 남는 비트수들은 스케일러빌러티를 제공하기 위한 다양한 비트율을 제공하는데 사용된다.Data compression is performed according to the predetermined minimum bit rates Abit (min) and Vbit (min) at the size of the frame determined according to the type of data previously determined. The predetermined minimum bit rate is used to create the base restoration signal to provide scalability. The remaining number of bits other than the predetermined minimum bit rate is used to provide various bit rates for providing scalability.

즉, 오디오의 경우,In other words, for audio,

Abit2 = Abit - Abit(min)Abit2 = Abit-Abit (min)

비디오의 경우,For video,

Vbit2 = Vbit - Vbit(min)Vbit2 = Vbit-Vbit (min)

에 의해 계산되는 Abit2, Vbit2를 스케일러빌러티를 제공하기 위한 비트율을 제공하는데 사용된다.The Abit2, Vbit2, computed by is used to provide the bitrate to provide scalability.

남는 비트들을 다양한 비트율을 이용, 저장 및 복원하는데에는 여러가지 방식이 있을 수 있다. 이 방식은 비트율 증가가 고정된 값으로 증가하느냐, 고정되지 않은 값으로 증가하느냐, 질의 증가에 따라 비트율을 한 유니트씩 증가시키느냐, 아니면 여분의 정보를 놓고 복호화시 비트율을 고려해 계산해 주느냐에 따라서 다음과 같은 3가지 방식이 있을 수 있다.There may be many ways to use, store and restore the remaining bits at various bit rates. This method depends on whether the bit rate increases to a fixed value, to a non-fixed value, to increase the bit rate by one unit as the query increases, or to consider the bit rate when decoding extra information. There can be three same ways.

방식1:고정된 한 유니트씩의 비트율 증가Method 1: increase bit rate by fixed unit

방식2:고정된 최대한의 유니트를 사용한 비트율 조절Method 2: Bit rate adjustment using fixed maximum unit

방식3:고정되지 않은 비트율 증가Method 3: Unfixed Bit Rate Increase

각 방식들을 보다 자세히 살펴보면 다음과 같다.Looking at each method in more detail as follows.

방식1:고정된 한 유니트씩의 비트율 증가.Method 1: Increase bit rate by fixed unit.

데이타 처리에 사용되는 값이 주파수영역의 값이냐, 시간영역의 값이냐에 따라 다음과 같이 처리해 준다. 주파수영역의 값인 경우 주파수영역의 해상도에 따라 예컨데 k개의 주파수영역값이 존재하는 경우, 음질을 하나씩 높임에 따라 2k 비트를 더 읽어주고, 음질을 하나씩 낮춤에 따라 2k 비트를 덜 읽어주는 처리를 함으로써 음질의 조절이 가능하게 한다. 시간영역의 값인 경우, k개의 시간영역값에 대하여 2k비트를 더 읽어 줌으로써 음질을 높이고 덜 읽어 줌으로써 음질을 낮춘다. 이때 오디오신호 또는 비디오신호에 대한 프레임의 크기가 결정된 다음 그 프레임의 크기가 M1, 각 신호에 대한 최저 비트율의 크기가 M2이면, 비트율조정에 사용할 수 있는 여분의 비트는 M1-M2가 된다. 여기서 각 신호의 해상도에 대한 값이 k이면 각 신호에 있어서 최대한 더 사용할 수 있는 값들은 M3=(M1-M2)/k가 된다. 이 k를 2로 나누어주면 2차 비트스트림에 대한 유니트의 갯수가 계산된다.Depending on whether the value used for data processing is the value of the frequency domain or the time domain, the processing is as follows. In the case of the value of the frequency domain, for example, if there are k frequency domain values according to the resolution of the frequency domain, the process of reading 2k bits more by increasing the sound quality one by one and reading 2k bits less by decreasing the sound quality one by one Allows adjustment of sound quality. In the case of time-domain values, the sound quality is increased by reading 2k bits more for k time-domain values, and lowered by reading less. At this time, if the size of the frame for the audio signal or the video signal is determined and the size of the frame is M1 and the size of the minimum bit rate for each signal is M2, the extra bits available for bit rate adjustment are M1-M2. If the value of the resolution of each signal is k, the more usable values for each signal are M3 = (M1-M2) / k. Dividing k by 2 calculates the number of units for the secondary bitstream.

여기서 비트수의 증감에 따라서 음질의 조절이 되는 이유는 비트수가 늘어남에 따라 양자화오차를 줄여주는 처리를 해 줄 수 있고, 양자화오차가 줄어듦에 따라 보다 원음에 가까와지기 때문이다. 비트율을 변화시키는데 있어서, 2k 씩의 비트가 사용되는 이유는 +1, * Δ, 0, -1 * Δ의 처리를 고려하기 위해서이다. 여기서 Δ는 양자화 스텝을 델타라 한 경우 델타의 반에 해당하는 값이다.The reason why the sound quality is adjusted according to the increase or decrease of the number of beats is that the processing of reducing the quantization error can be performed as the number of bits increases, and the closer to the original sound as the quantization error decreases. The reason why 2k bits are used in changing the bit rate is to consider the processing of +1, * Δ, 0, -1 * Δ. Here, Δ is a value corresponding to half of the delta when the quantization step is called delta.

여기서는 DPCM에서 사용되는 경우와 마찬가지로 앞의 신호와의 차이를 이용하여 원신호를 복원한다. 제6도에 신호와 양자화기의 양자화단계가 작용되는 예를 나타내었다. 이전에 M인 값이 비트수가 증가함으로써 M + Δ, M -Δ, M으로 변화함으로서 보다 원신호에 가까와진다. 일예로 양자화된 값들이 0,10,20,30,40,....,250 인 값들을 가진다고 할 때 원신호가 77이면 양자화된 값은 80으로 된다. 이 경우 Δ는 5인데, 비트수를 2개 늘려 -Δ에 대한 처리를 해 주면 새로 양자회된 값은 75로 80인 경우인 +3인 오차보다 적은 -2의 오차를 가지게 되어 보다 원신호에 가깝게 된다. 이 경우 새로운 Δ는 2.5가 되고, 비트수를 2개 더 늘려 주면 양자화값은 77.5로 오차를 0.5로 줄일 수 있고 원음에 더 가까워질 수 있다.Here, as in the case of the DPCM, the original signal is restored by using the difference from the previous signal. 6 shows an example in which a quantization step of a signal and a quantizer is operated. As the value of M increases, the number of bits increases to M + Δ, M-Δ, and M, which is closer to the original signal. For example, if the quantized values have values of 0, 10, 20, 30, 40, ..., 250, if the original signal is 77, the quantized value is 80. In this case, Δ is 5, but if you increase the number of bits by 2 and process for -Δ, the newly quantized value is 75, which has an error of -2 less than the error of +3, which is 80. Come close. In this case, the new Δ becomes 2.5, and if the number of bits is increased by two more, the quantization value is 77.5, which can reduce the error to 0.5 and get closer to the original sound.

고정된 한 유니트씩의 비트율 증가를 사용하면, 매번 혹은 -의 부호에 대해서 고려해 주어야 하는 번거로움이 있다. 이러한 문제를 해결하고, 기존의 비트가 2개의 비트로 3가지 경우만(+Δ,0,-Δ)표현해주던 것을 비트 k로 2EXP(k)-１인 경우에 대해 표현해 줌으로써부호표현의 중복성을 줄임으로써 데이타를 효율적으로 사용하기 위해 고정된 최대한의 유니트를 사용하는 방식이 있다. 방식1에서와 같이 오디오신호 또는 비디오신호에 대한 프레임의 크기가 결정된 다음 그 프레임의 크기가 M1, 각 신호에 대한 최저 비트율의 크기가 M2이면, 비트율조정에 사용할 수 있는 여분의 비트는 M1-M2가 된다. 여기서 각 신호의 해상도에 대한 값이 k이면 각 신호에 있어서 최대한 더 사용할 수 있는 값들은 M3=(M1-M2)/k가 된다. 즉, 각 신호에 대해서 M3 개의 비트가 더 있을 수 있어 최대한 Δ * 2 EXP(M3-1) 로 양자화오차를 줄일 수 있다. 이러한 경우 비트율 조절은 다음과 같이 한다. 방식1에서의 일예로 설명하면 77의 값이 80이 되었을때 M3개의비트가 가령 4라면 부호표현을 위한 1비트는 빼고 나머지 3비트로 차이를 표현하면 '011'과 같이 -3인 오차를 표현할 수 있다.Using a fixed bit-rate increment of one unit is a hassle to consider for each or every minus sign. It solves this problem and reduces the redundancy of the code expression by expressing the case where the existing bits represent three cases (+ Δ, 0, -Δ) with two bits for the case of 2EXP (k) -1 with the bit k. One way is to use a fixed maximum number of units to efficiently use data. If the size of the frame for the audio signal or the video signal is determined as in the method 1, and then the size of the frame is M1 and the size of the lowest bit rate for each signal is M2, the extra bits available for bit rate adjustment are M1-M2. Becomes If the value of the resolution of each signal is k, the more usable values for each signal are M3 = (M1-M2) / k. That is, there may be M3 more bits for each signal, thereby reducing the quantization error with Δ * 2 EXP (M3-1) as much as possible. In this case, bit rate adjustment is as follows. As an example of the method 1, when the value of 77 becomes 80, if M3 bits are 4, for example, subtracting 1 bit for sign representation and expressing the difference with the remaining 3 bits can express an error of -3 as in '011'. have.

여기서, 사용자가 음질을 더 높이고자 하는 경우에는 여분의 비트인 M3로 표현되는 값을 읽어 비교에 의해 출력값에 영향을 미치도록 한다.In this case, when the user wants to improve the sound quality, the value represented by the extra bit M3 is read to influence the output value by comparison.

즉, L =-3인 경우That is, when L = -3

Case 1 : 0 |L| ≤ 1,Case 1: 0 | L | ≤ 1,

Case 2 : 1 |L| ≤ 2,Case 2: 1 | L | ≤ 2,

Case 3 : 2 |L| ≤ 3,Case 3: 2 | L | ≤ 3,

에 의해 차례로 오차값을 줄여가면서 음질 또는 화질을 개선할 수 있다.By reducing the error value in turn can improve the sound quality or image quality.

가령 Case 2인 경우로서, 사용자가 음성데이타나 화상데이타를 즐기다가 요구에 따라 Case 1으로 변화하면 음질 또는 화질을 낮추어 제현해 듣는 처리가 가능하게 된다. 이 경우 확장 부분에 대한 비트들을 읽은 뒤에 사용자의 요구에 따라 변형된 값들을 계산한다.For example, in case 2, when a user enjoys voice data or image data, and then changes to case 1 as required, the user can lower the sound quality or the image quality to perform the listening process. In this case, after reading the bits for the extended part, the modified values are calculated according to the user's request.

방식3:고정되지 않은 비트율 증가Method 3: Unfixed Bit Rate Increase

방식1 및 방식2에서의 고정된 비트율 증감의 경우, 비트들의 증감에 따라 효율적으로 비트들을 할당하기 위해서, 오디오의 경우, 임계대역에 근거하여 처리한다. 임계대역에 근거하여 처리함으로써 양자화처리에 의한 오차의 영향을 국부적으로 해 주고, 보다 인간의 청각심리에 적합한 처리가 가능하게 한다. 이 경우 방식1 혹은 방식2에서 주파수 해상도가 k 인 경우 2k 또는 M3 * k에 의한 비트할당을 해 준것과는 달리 주파수영역 또는 시간영역을 처리에 사용하는 단위블럭으로 나누어 단위블럭별로 비트들을 할당해 준다. 일예로 오디오 데이타의경우 임계대역 24 대역으로 하는 경우 각 대역별로 비트들을 더 할당해 줄 것인가 혹은 할당하지 않을 것인가에 대한 플래그부를 가짐으로써 비트들을 효율적으로 사용할 수 있다.In the case of the fixed bit rate increase and decrease in the method 1 and the method 2, in order to efficiently allocate bits according to the increase or decrease of the bits, the audio is processed based on the threshold band. By processing based on the critical band, the effect of error due to quantization processing is localized, and the processing more suitable for human hearing psychology is possible. In this case, if the frequency resolution is k in the scheme 1 or scheme 2, unlike the bit allocation by 2k or M3 * k, the bits are allocated to the unit blocks by dividing the frequency domain or the time domain into unit blocks used for processing. . For example, in the case of audio data having a threshold band of 24 bands, bits can be efficiently used by having a flag unit for assigning or not allocating bits for each band.

일예로, 처리블럭 3에만 한 비트씩을 더 사용함으로서 질을 높일 수 있다고 하면 플래그부의 24 비트에는 '0 0 1 0 0... 0'과 같은 값을 갖고, 3번째 처리블럭의 처리만 보다 세밀하게 한다는 정보를 주고, 플래그부의 뒤에는 3번째 처리블럭 신호에 대한 +Δ, 0, -Δ에 대한 값을 줌으로써 음질 또는 화질의 변화를 가질 수 있다.For example, if the quality can be improved by using one more bit for only the processing block 3, the 24 bits of the flag portion have a value equal to '0 0 1 0 0 ... 0', and only the processing of the third processing block is more detailed. In addition, the flag unit may have a change in sound quality or image quality by giving values for + Δ, 0, and -Δ for the third processing block signal.

상기 앞에서 결정한 최소비트율에 의해 압축된 정보에 최소비트율외의 남는 비트수에 의한 정보들을 합해 오디오부분 혹은 비디오부분의 처리단위 프레임을 형성시킨다.The processing unit frame of the audio portion or the video portion is formed by adding the information compressed by the minimum bit rate determined above to the information by the remaining number of bits other than the minimum bit rate.

처리결과 각 모드에 대한 처리단위 프레임의 모양은 제7a∼7c도와 같다. 이와 같은 처리단위 프레임으로 디스크나 데이프에 데이타를 저장하거나 무선 혹은 유선으로 데이타 전송을 하는데 사용한다.As a result of the processing, the shape of the processing unit frame for each mode is shown in Figs. 7A to 7C. This processing unit frame is used to store data on disk or tape or to transmit data wirelessly or by wire.

한편, 저장된 데이타나 전송된 데이타를 읽거나 받아서 처리해 주는 과정은 제2도 및 제4도의 복호화장치에서 수행된다. 복원하는데에는 우선, 오디오신호인지 비디오신호인지에 따라 처리대기버퍼에 읽는 신호들을 다르게 하고, 비트율에 따라 버퍼의 신호들의 양을 조절한다. 최소비트율과 원하는 비트율 사이의 차이에 따라 계산된 값과 유사한 값의 다양한 비트율을 제공하기 위한 값을 계산한다.Meanwhile, the process of reading or receiving stored data or transmitted data is performed by the decoding apparatus of FIGS. 2 and 4. In order to recover, first, signals to be read to the processing buffer are different depending on whether they are audio or video signals, and the amount of signals in the buffer is adjusted according to the bit rate. Depending on the difference between the minimum bit rate and the desired bit rate, a value is calculated to provide various bit rates with values similar to the calculated values.

처리대기버퍼에 신호들이 차면, 미리 정해진 비트율에 따른 복호화 처리를 수행하고, 선택된 모드에 의해 제공된 다양한 비트율 제공을 위한 값들을 복호화하여, 최소비트율에 따른 복호화처리를 해 준 값의 결과에 따라 변형시켜 비트율의 변형에 따른 음질 또는 화질의 변화를 갖는다. 이 방식에 의한 처리예는 다음과 같다.When the signals are filled in the processing standby buffer, decoding processing according to a predetermined bit rate is performed, and values for providing various bit rates provided by the selected mode are decoded and modified according to the result of the decoding process according to the minimum bit rate. It has a change in sound quality or image quality according to the variation of the bit rate. The processing example by this method is as follows.

최소비트율에 의해 계산된 신호가 m*delta 인 경우 2차 비트에 의해 복원된 신호를 n이라 하면 이 비트율에서의 신호는 m*delta+(delta/2)*n이 된다. 역시 delta는 하나의 유니트로 10이고, m이 80, n이 -1인 예의 경우를 보면 80+(10/2)*(-1)로 75가 되고, 이때 비트를 한 유니트 더 증가하면 m이 75, n이 +1, delta가 5이므로 75+(5/2)*(+1)로 77.5가 되는 것을 알 수 있다.When the signal calculated by the minimum bit rate is m * delta, if the signal reconstructed by the second bit is n, the signal at this bit rate becomes m * delta + (delta / 2) * n. Also, delta is 10 as one unit, and m is 80 and n is -1. In the example, 80 + (10/2) * (-1) is 75, and if the bit is increased by one more unit, m is Since 75, n is +1 and delta is 5, it can be seen that 75+ (5/2) * (+ 1) becomes 77.5.

상술한 바와 같이 본 발명에 의한 비트율 조정이 가능한 영상 및 음성 부호화 및 복호화장치에서는 사용자의 선택에 따라 오디오신호나 비디오신호에 대해 별도의 비트율 조정이 가능하고, 오디오신호 및 비디오신호 상호간의 비트율 조절이 가능하다.As described above, in the video and audio encoding and decoding apparatus capable of adjusting the bit rate according to the present invention, it is possible to separately adjust the bit rate of the audio signal or the video signal according to the user's selection, and the bit rate control between the audio signal and the video signal is It is possible.

또한, 다양한 비트율에 대해 동시에 기록재생 또는 전송됨으로써 사용자가 필요에 따라 비트율을 선택할 수 있을 뿐 아니라 다양한 비트율에 대해 여러가지 모드로 표현함으로써 데이타 표현의 중복성을 막아주어 효율적으로 데이타를 표현할 수 있다. 또한, 다양한 비트율에 대한 비트스트림을 이용하여 전송시 전송선로의 특성에 따른 효과적인 전송이 가능한 이점이 있다.In addition, by simultaneously recording and reproducing or transmitting the various bit rates, the user can not only select the bit rate if necessary, but also express the data efficiently by preventing the redundancy of the data representation by expressing the various bit rates in various modes. In addition, there is an advantage that can be effectively transmitted according to the characteristics of the transmission line during transmission using the bitstream for various bit rates.

Claims

An encoding apparatus for forming a bitstream by encoding an input audio signal and a video signal at a predetermined bit rate, the encoding device comprising: a predetermined minimum bit rate determined for each of the minimum bits necessary for representing the input audio signal and the video signal. A first encoder configured to encode the audio signal and the video signal, respectively, to form a first bitstream; a decoder to decode the first bitstream signal encoded by the first encoder; an audio signal decoded by the decoder; An error calculator which calculates an error signal between the video signal and the input audio signal and the video signal, respectively; differently set bit rates for the audio signal and the video signal according to a predetermined selected bit rate, and the audio signal and the video signal Encode an error signal for A second encoder for forming a second bit stream; And a formatter for forming a final bitstream by combining the first bitstream generated by the first encoder and the second bitstream generated by the second encoder.

A first encoder configured to form a first bitstream by encoding the audio signal and the video signal according to a predetermined minimum bit rate, respectively, for the minimum bit required for the representation of the input audio signal and the video signal; A second encoder which encodes an audio signal and a video signal obtained by decoding a bitstream signal and an error signal between the input audio signal and the video signal according to a bit rate selected according to a predetermined setting mode to form a second bit stream; And a formatter configured to add a first bitstream generated by the first encoder and a second bitstream generated by the second encoder to form a final bitstream. A decoding apparatus for restoring a bitstream transmitted from an encoding apparatus encoding a bit rate to form a bitstream, into an original audio signal and a video signal, comprising: an input for separating and storing an audio signal and a video signal from the transmitted bitstream A first decoder to decode a first bitstream included in the audio signal and a video signal; a second decoder to decode a second bitstream included in the audio signal and the video signal, respectively; And an output buffer for transforming the value decoded by the first decoder according to the value decoded by the second decoder.

The encoding apparatus according to claim 1, wherein the bit rate of the second encoder is determined according to a bit rate of one of the audio signal and the video signal.

4. The encoding apparatus of claim 3, wherein the bit rate of the audio signal is determined by subtracting the minimum video bit rate from the bit rates of the entire audio signal and the video signal.

4. The encoding apparatus of claim 3, wherein the bit rate of the video signal is determined by subtracting the minimum audio bit rate from the bit rates of the entire audio signal and the video signal.

An encoding and decoding apparatus comprising an encoder for encoding and transmitting an input audio signal and a video signal at a predetermined bit rate, and a decoder for reconstructing a signal transmitted from the encoder into an original audio signal and a video signal. A first encoder configured to form a first bitstream by encoding the audio signal and the video signal according to a predetermined minimum bit rate, respectively, for a minimum bit necessary for representing the input audio signal and the video signal; A decoder configured to decode the first bitstream signal encoded by the encoder; an error calculator configured to calculate the error signals of the audio and video signals and the audio input and video signals decoded by the decoder; The audio signal according to a bit rate selected according to a setting mode A second encoder for forming a second bit stream by coding each of the error signal for the video signal; And a formatter configured to combine the first bitstream generated by the first encoder and the second bitstream generated by the second encoder to form a final bitstream. The decoder includes audio from the transmitted bitstream. An input buffer configured to separately store a signal and a video signal; a first decoder configured to decode a first bitstream included in the audio signal and the video signal, respectively; decoding a second bitstream included in the audio signal and the video signal, respectively; A second decoding unit; And an output buffer for transforming and outputting the value decoded by the first decoder according to the value decoded by the second decoder.