CN101385349A

CN101385349A - Inter-Layer Prediction Method for Video Signal

Info

Publication number: CN101385349A
Application number: CNA2007800056109A
Authority: CN
Inventors: 朴胜煜; 全柄文; 朴志皓
Original assignee: LG Electronics Inc
Current assignee: LG Electronics Inc
Priority date: 2006-01-09
Filing date: 2007-01-09
Publication date: 2009-03-11
Anticipated expiration: 2027-01-09
Also published as: CN101385351A; CN101385352A; CN101416522A; CN101385351B; CN101385348A; CN101385350B; CN101385355A; CN101385349B; CN101385354B; CN101385353B; KR20070074451A; KR20070074452A; CN101385352B; KR20070074453A; CN101385354A; CN101385350A; CN101385353A; CN101385348B; CN101385355B

Abstract

The present invention relates to a method for using interlaced video signal of a base layer in interlayer texture prediction. The present method separates interaced video signal of a base layer into even- field and odd- field components, interpolates the even- field and the odd- field components respectively in vertical and/or horizontal direction, and constructs a combined video data by interleaving the interpolated even-field and odd-field components.

Description

Inter-Layer Prediction Method for Video Signal

1.技术领域 1. Technical field

本发明涉及用于在编码/解码视频信号时进行层间预测的方法。The present invention relates to methods for inter-layer prediction when encoding/decoding video signals.

2.背景技术 2. Background technology

可升降级视频编解码器(SVC)将视频编码成具有最高图像质量的画面序列，同时确保经编码的画面序列的部分(具体地是，从整个帧序列中间歇地选出的部分帧序列)能够被解码并用于以低图像质量来表现该视频。A scalable video codec (SVC) encodes video into a picture sequence with the highest image quality while ensuring that parts of the picture sequence are encoded (specifically, a partial frame sequence intermittently selected from the entire frame sequence) Can be decoded and used to represent the video with low image quality.

尽管可通过接收和处理根据可升降级方案编码的画面序列的部分来表现低图像质量视频，但仍然存在如果比特率降低则图像质量显著下降的问题。该问题的一种解决方案是提供低比特率的辅助画面序列——例如具有小屏幕尺寸和/或低帧率的画面序列——作为阶层结构中的至少一层。Although low image quality video can be represented by receiving and processing parts of a sequence of pictures encoded according to a scalable downscaling scheme, there remains a problem that image quality drops significantly if the bit rate is reduced. One solution to this problem is to provide a low bitrate auxiliary picture sequence, eg a picture sequence with a small screen size and/or a low frame rate, as at least one layer in the hierarchy.

当假设提供两个序列时，辅助(下)画面序列被称为基层，而主(上)画面序列被称为增强或加强层。基层和增强层的视频信号具有冗余性，因为相同的视频信号源被编码成两层。为了提高增强层的编解码效率，增强层的视频信号使用基层的经编解码的信息(运动信息或纹理信息)来编解码。When it is assumed that two sequences are provided, the auxiliary (lower) picture sequence is called the base layer, and the main (upper) picture sequence is called the enhancement or enhancement layer. The base layer and enhancement layer video signals are redundant because the same video source is coded into both layers. In order to improve the codec efficiency of the enhancement layer, the video signal of the enhancement layer is coded using the coded information (motion information or texture information) of the base layer.

尽管可如图1a所示将单个视频源1编码成具有不同传递率的多个层，但也可如图1b所示将包含相同内容2a的不同扫描模式下的多个视频源2b编码成相应各层。同样，在这种情况下，编码上层的编码器可通过利用下层的经编码的信息执行层间预测来提高编解码增益，因为两个源2b提供相同的内容2a。While a single video source 1 can be encoded into multiple layers with different transfer rates as shown in Figure 1a, multiple video sources 2b in different scan modes containing the same content 2a can also be encoded into corresponding layers as shown in Figure 1b. layers. Also in this case, the encoder encoding the upper layer can increase the codec gain by performing inter-layer prediction with the encoded information of the lower layer, since both sources 2b provide the same content 2a.

因此，需要提供一种在将不同源编码成相应各层时把视频信号的扫描模式纳入考虑的层间预测方法。当编码隔行视频时，它可被编码成偶场和奇场，并且也可被编码成一帧中的奇和偶宏块对。相应地，对于层间预测还必须考虑用于编解码隔行视频信号的画面类型。Therefore, there is a need to provide an inter-layer prediction method that takes into account the scanning mode of the video signal when encoding different sources into respective layers. When encoding interlaced video, it can be encoded as even and odd fields, and also as odd and even pairs of macroblocks in a frame. Correspondingly, for inter-layer prediction, the picture type used for encoding and decoding interlaced video signals must also be considered.

一般而言，增强层提供分辨率高于基层分辨率的画面。相应地，如果在将不同的源编码成相应各层时诸层的画面具有不同的分辨率，则还需要执行内插来提高画面分辨率(即，画面大小)。因为对于预测编解码而言在层间预测中使用的基层画面的图像越接近于增强层画面的图像，编解码率就越高，所以需要提供一种将诸层的视频信号的扫描模式纳入考虑的内插方法。In general, the enhancement layer provides a picture with a higher resolution than the base layer. Correspondingly, if the pictures of different layers have different resolutions when encoding different sources into corresponding layers, it is also necessary to perform interpolation to increase the picture resolution (ie, the picture size). Because for predictive coding and decoding, the closer the image of the base layer picture used in the inter-layer prediction is to the image of the enhancement layer picture, the higher the codec rate is, so it is necessary to provide a scanning mode that takes the video signals of the layers into consideration interpolation method.

3.发明内容 3. Contents of the invention

本发明的目的是提供一种在两层中至少有一层具有隔行视频信号分量的状况下执行层间预测的方法。It is an object of the present invention to provide a method for performing inter-layer prediction in the case where at least one of the two layers has an interlaced video signal component.

本发明的另一个目的是提供一种根据画面类型执行对具有不同空间分辨率(可升降级性)的画面的诸层的层间运动预测的方法。Another object of the present invention is to provide a method of performing inter-layer motion prediction for layers of pictures having different spatial resolutions (scalability) according to picture types.

本发明的又一个目的是提供一种执行对具有不同空间和/或时间分辨率(可升降级性)的画面的诸层的层间纹理预测的方法。Yet another object of the present invention is to provide a method of performing inter-layer texture prediction for layers of pictures with different spatial and/or temporal resolutions (scalability).

一种根据本发明的层间运动预测方法包括：将内模式宏块的运动相关信息设置成间模式宏块的运动相关信息，该内模式和间模式宏块是基层的两个垂直毗邻的宏块；然后基于这两个垂直毗邻的宏块获得垂直毗邻宏块对的运动信息用于层间运动预测。An inter-layer motion prediction method according to the present invention includes: setting motion-related information of an intra-mode macroblock as motion-related information of an inter-mode macroblock, the intra-mode and inter-mode macroblocks being two vertically adjacent macroblocks of a base layer block; then obtain the motion information of the vertically adjacent macroblock pair based on these two vertically adjacent macroblocks for inter-layer motion prediction.

另一种根据本发明的层间运动预测方法包括：将作为基层的两个垂直毗邻的内模式和间模式宏块之一的内模式宏块设置成具有0运动相关信息的间模式块；然后基于这两个垂直毗邻的宏块获得垂直毗邻宏块对的运动信息用于层间运动预测。Another inter-layer motion prediction method according to the present invention includes: setting an intra-mode macroblock as one of two vertically adjacent intra-mode and inter-mode macroblocks of the base layer as an inter-mode block with 0 motion-related information; and then Motion information for a vertically adjacent macroblock pair is obtained based on the two vertically adjacent macroblocks for inter-layer motion prediction.

另一种根据本发明的层间运动预测方法包括：从基层的垂直毗邻帧宏块对的运动信息推导单个宏块的运动信息；以及将所推导出的运动信息用作当前层中的场宏块的运动信息或当前层中的场宏块对的各自的运动信息的预测信息。Another method of inter-layer motion prediction according to the present invention includes: deriving motion information for a single macroblock from motion information for a pair of vertically adjacent frame macroblocks in a base layer; and using the derived motion information as a field macroblock in the current layer The motion information of the block or the prediction information of the respective motion information of the field macroblock pair in the current layer.

另一种根据本发明的层间运动预测方法包括从基层的单个场宏块的运动信息或选自基层的垂直毗邻场宏块对的单个场宏块的运动信息推导两个宏块各自的运动信息；以及将所推导出的各自的运动信息用作当前层的帧宏块对各自的运动信息的预测信息。Another inter-layer motion prediction method according to the invention consists in deriving the respective motion of two macroblocks from the motion information of a single field macroblock of the base layer or of a single field macroblock selected from a pair of vertically adjacent field macroblocks of the base layer information; and using the derived respective motion information as prediction information for the respective motion information of the frame macroblocks of the current layer.

一种根据本发明用于具有不同分辨率的画面的诸层的层间运动预测方法包括：通过根据画面的类型和画面中宏块的类型选择性地使用变换成帧宏块的预测方法将下层的画面变换成相同分辨率的帧画面；升采样该帧画面以使其具有与上层的分辨率相同的分辨率；然后应用适用于此经升采样的帧画面中的帧宏块的类型和上层画面中的宏块类型的层间预测方法。An interlayer motion prediction method for layers of pictures having different resolutions according to the present invention includes: converting lower layers into frames by selectively using a prediction method according to the type of the picture and the type of macroblocks in the picture to a frame of the same resolution; upsample the frame to have the same resolution as the upper layer; then apply the type and upper layer for the frame macroblocks in this upsampled frame Inter-layer prediction method for macroblock types in a picture.

另一种根据本发明用于具有不同分辨率的画面的诸层的层间运动预测方法包括：标识出下层和上层的画面的类型和/或包括在这些画面中的宏块的类型；根据标识出的结果对下层画面应用从单个场宏块预测帧宏块对的方法以构造具有与上层画面的纵横比相同的纵横比的虚拟画面；升采样该虚拟画面；然后利用此经升采样的虚拟画面对上层应用层间运动预测。Another inter-layer motion prediction method according to the present invention for layers of pictures with different resolutions includes: identifying the types of pictures of lower and upper layers and/or the types of macroblocks included in these pictures; Apply the method of predicting frame macroblock pairs from a single field macroblock to the lower layer picture to construct a virtual picture with the same aspect ratio as the upper layer picture; upsample the virtual picture; then use this upsampled virtual The picture applies inter-layer motion prediction to the upper layer.

另一种根据本发明用于具有不同分辨率的画面的诸层的层间运动预测方法包括：标识出下层和上层的画面的类型和/或包括在这些画面中的宏块的类型；根据标识出的结果对下层画面应用从单个场宏块预测帧宏块对的方法以构造具有与上层画面的纵横比相同的纵横比的虚拟画面；以及利用所构造出的虚拟画面对上层的画面应用层间运动预测。Another inter-layer motion prediction method according to the present invention for layers of pictures with different resolutions includes: identifying the types of pictures of lower and upper layers and/or the types of macroblocks included in these pictures; The obtained result applies the method of predicting frame macroblock pairs from single field macroblocks to the lower layer picture to construct a virtual picture having the same aspect ratio as that of the upper layer picture; and applies the layer to the upper layer picture using the constructed virtual picture motion prediction.

另一种根据本发明用于具有不同分辨率的画面的诸层的层间运动预测方法包括：标识出下层和上层画面的类型；如果下层画面的类型是场且上层画面的类型是逐行，则拷贝下层画面中的块的运动信息以构造虚拟画面；升采样该虚拟画面；以及在此经升采样的虚拟画面和上层画面之间应用帧宏块-宏块运动预测方法。Another inter-layer motion prediction method according to the present invention for layers of pictures with different resolutions includes: identifying the types of lower-layer and upper-layer pictures; if the type of the lower-layer picture is field and the type of the upper-layer picture is progressive, The motion information of the blocks in the lower layer picture is then copied to construct a virtual picture; the virtual picture is upsampled; and a frame macroblock-macroblock motion prediction method is applied between this upsampled virtual picture and the upper layer picture.

另一种根据本发明用于具有不同分辨率的画面的诸层的层间运动预测方法包括：标识出下层和上层画面的类型；如果下层画面的类型是场且上层画面的类型是逐行，则拷贝下层的块的运动信息以构造虚拟画面；以及使用该虚拟画面来对上层画面应用层间运动预测。Another inter-layer motion prediction method according to the present invention for layers of pictures with different resolutions includes: identifying the types of lower-layer and upper-layer pictures; if the type of the lower-layer picture is field and the type of the upper-layer picture is progressive, The motion information of the blocks of the lower layer is then copied to construct a virtual picture; and the virtual picture is used to apply inter-layer motion prediction to the upper layer picture.

在本发明的实施例中，在层间运动预测中顺序地预测划分模式、参考索引、和运动向量。In an embodiment of the present invention, a division mode, a reference index, and a motion vector are sequentially predicted in inter-layer motion prediction.

在本发明的另一个实施例中，顺序地预测参考索引、运动向量、和划分模式。In another embodiment of the present invention, the reference index, motion vector, and partition mode are predicted sequentially.

在本发明的另一个实施例中，要用于层间运动预测的虚拟基层的场宏块对的运动信息是从基层的帧宏块对的运动信息推导出的。In another embodiment of the invention, the motion information of the field macroblock pairs of the virtual base layer to be used for inter-layer motion prediction is derived from the motion information of the frame macroblock pairs of the base layer.

在本发明的另一个实施例中，要用于层间运动预测的虚拟基层的偶或奇场画面中的场宏块的运动信息是从基层的帧宏块对的运动信息推导出的。In another embodiment of the invention, the motion information of the field macroblocks in the even or odd field picture of the virtual base layer to be used for inter-layer motion prediction is derived from the motion information of the frame macroblock pairs of the base layer.

在本发明的另一个实施例中，从基层的场宏块对中选择宏块，且要用于层间运动预测的虚拟基层的帧宏块对的运动信息是从所选宏块的运动信息推导出的。In another embodiment of the present invention, macroblocks are selected from field macroblock pairs of the base layer, and the motion information of the frame macroblock pairs of the virtual base layer to be used for inter-layer motion prediction is selected from the motion information of the selected macroblocks derived.

在本发明的另一个实施例中，要用于层间运动预测的虚拟基层的帧宏块对的运动信息是从基层的偶或奇场画面中的场宏块的运动信息推导出的。In another embodiment of the invention, the motion information of frame macroblock pairs of a virtual base layer to be used for inter-layer motion prediction is derived from the motion information of field macroblocks in even or odd field pictures of the base layer.

在本发明的另一个实施例中，基层的偶或奇场画面中的场宏块的信息被拷贝以另外构造虚拟场宏块，且要用于层间运动预测的虚拟基层的帧宏块对的运动信息是从以此方式构造出的场宏块对的运动信息推导出的。In another embodiment of the present invention, the information of the field macroblocks in the even or odd field picture of the base layer is copied to additionally construct a virtual field macroblock, and the frame macroblock pairs of the virtual base layer to be used for inter-layer motion prediction The motion information of is derived from the motion information of field macroblock pairs constructed in this way.

一种根据本发明的层间纹理预测方法包括：由基层的垂直毗邻帧宏块对构造场宏块对；以及将所构造出的场宏块对各自的纹理信息用作当前层的场宏块对各自的纹理预测信息。An inter-layer texture prediction method according to the present invention includes: constructing field macroblock pairs from vertically adjacent frame macroblock pairs of the base layer; and using the texture information of the constructed field macroblock pairs as field macroblocks of the current layer For the respective texture prediction information.

另一种根据本发明的层间纹理预测方法包括：由基层的垂直毗邻帧宏块对构造单个场宏块；以及将所构造出的单个场宏块的纹理信息用作当前层的场宏块的纹理预测信息。Another inter-layer texture prediction method according to the present invention includes: constructing a single field macroblock from vertically adjacent frame macroblock pairs of the base layer; and using the texture information of the constructed single field macroblock as the field macroblock of the current layer texture prediction information.

另一种根据本发明的层间纹理预测方法包括：由基层的单个场宏块或垂直毗邻场宏块对构造帧宏块对；以及将所构造出的帧宏块对各自的纹理信息用作当前层的帧宏块对各自的纹理预测信息。Another inter-layer texture prediction method according to the present invention includes: constructing a frame macroblock pair from a single field macroblock or a vertically adjacent field macroblock pair of the base layer; and using the constructed frame macroblock pair's respective texture information as The frame macroblocks of the current layer have their respective texture prediction information.

另一种根据本发明的层间纹理预测方法包括由基层的垂直毗邻场宏块对构造N对帧宏块，其中N是大于1的整数；以及将所构造出的N对帧宏块各自的纹理信息用作当前层中位于不同时间位置的N对帧宏块各自的纹理预测信息。Another inter-layer texture prediction method according to the present invention comprises constructing N pairs of frame macroblocks from pairs of vertically adjacent field macroblocks of the base layer, wherein N is an integer greater than 1; The texture information is used as the respective texture prediction information of N pairs of frame macroblocks located at different time positions in the current layer.

另一种根据本发明的层间纹理预测方法包括：将下层的每一帧分成多个场画面以允许下层具有与上层相同的时间分辨率；在垂直方向上升采样每一个所分离出的场画面以在垂直方向上扩展每一个所分离出的场画面；然后将每一个经升采样的场画面用于上层的每一帧的层间纹理预测。Another inter-layer texture prediction method according to the present invention includes: dividing each frame of the lower layer into multiple field pictures to allow the lower layer to have the same temporal resolution as the upper layer; upsampling each separated field picture in the vertical direction to expand each separated field picture in the vertical direction; and then use each up-sampled field picture for inter-layer texture prediction of each frame of the upper layer.

另一种根据本发明的层间纹理预测方法包括：在垂直方向上升采样下层的每一个场画面以在垂直方向上扩展每一个场画面；以及将每一个经升采样的场画面用于上层的每一帧的层间纹理预测。Another inter-layer texture prediction method according to the present invention includes: upsampling each field picture of the lower layer in the vertical direction to expand each field picture in the vertical direction; and using each up-sampled field picture for the upper layer Inter-layer texture prediction for each frame.

另一种根据本发明的层间纹理预测方法包括：将上层的每一帧分成多个场画面；降采样下层的画面以在垂直方向上缩小下层的画面；然后将经降采样的画面用于上层的分离出的场画面的层间纹理预测。Another inter-layer texture prediction method according to the present invention includes: dividing each frame of the upper layer into a plurality of field pictures; down-sampling the pictures of the lower layer to shrink the pictures of the lower layer in the vertical direction; and then using the down-sampled pictures for Inter-layer texture prediction of the separated field picture of the upper layer.

一种根据本发明利用层间预测编码视频信号的方法包括：确定在层间纹理预测中是使用通过交替地选择基层的任意性画面中的2N块的行然后以选择的次序编排所选行来构造的2N块各自的纹理信息，还是使用通过内插选自基层的2N块的一个块来构造的2N块各自的纹理信息；并将指示该确定的信息纳入到编码的信息中。A method of encoding a video signal using inter-layer prediction according to the present invention comprises: determining whether to use in inter-layer texture prediction by alternately selecting rows of 2N blocks in an arbitrary picture of a base layer and then arranging the selected rows in the selected order The texture information of each of the constructed 2N blocks, also using the texture information of each of the 2N blocks constructed by interpolating one block selected from the 2N blocks of the base layer; and incorporating information indicating this determination into the encoded information.

一种根据本发明利用层间预测来解码视频信号的方法包括：检查特定指示信息是否被包括在接收到的信号中；并基于所检查出的结果确定在层间纹理预测中是使用通过交替地选择基层的任意性画面中的2N块的行然后按选择的次序编排所选行来构造的2N块各自的纹理信息，还是使用通过内插选自基层的2N块的一个块来构造的2N块各自的纹理信息。A method for decoding a video signal using inter-layer prediction according to the present invention includes: checking whether specific indication information is included in a received signal; and determining whether to use in inter-layer texture prediction by alternately Select the rows of 2N blocks in the arbitrary picture of the base layer and then arrange the texture information of each of the 2N blocks constructed by arranging the selected rows in the selected order, or use the 2N blocks constructed by interpolating a block selected from the 2N blocks of the base layer Respective texture information.

在本发明的实施例中，上层或下层的每一帧被分成两个场画面。In an embodiment of the present invention, each frame of the upper or lower layer is divided into two field pictures.

在本发明的实施例中，如果特定指示信息未被包括在所接收到的信号中，则将该情况视为与接收到包括已被设为0的指示信息的信号且确定了其各自纹理信息将用于层间预测的块的情况相同。In an embodiment of the invention, if the specific indication information is not included in the received signal, the situation is considered to be related to receiving a signal including the indication information which has been set to 0 and determining its respective texture information The same is true for blocks to be used for inter-layer prediction.

一种根据本发明将基层的视频信号用于层间纹理预测的方法包括：将基层的隔行视频信号分成偶和奇场分量；在垂直和/或水平方向上将偶和奇场分量各自放大；然后将经放大的偶和奇场分量组组合用于层间纹理预测。A method for using the video signal of the base layer for inter-layer texture prediction according to the present invention includes: dividing the interlaced video signal of the base layer into even and odd field components; respectively amplifying the even and odd field components in the vertical and/or horizontal direction; The upscaled sets of even and odd field components are then combined for inter-layer texture prediction.

另一种根据本发明将基层的视频信号用于层间纹理预测的方法包括：将基层的逐行视频信号分成偶行组和奇行组；在垂直和/或水平方向上将偶和奇行组各自放大；将经放大的偶和奇行组组合并用于层间纹理预测。Another method for using the video signal of the base layer for inter-layer texture prediction according to the present invention includes: dividing the progressive video signal of the base layer into an even row group and an odd row group; Upscaling; the upscaled even and odd row groups are combined and used for inter-layer texture prediction.

另一种根据本发明将基层的视频信号用于层间纹理预测的方法包括：在垂直和/或水平方向上放大基层的隔行视频信号以使其具有与上层的逐行视频信号相同的分辨率；以及基于经放大的视频信号执行上层的视频信号的层间纹理预测。Another method of using the video signal of the base layer for inter-layer texture prediction according to the present invention includes: amplifying the interlaced video signal of the base layer in the vertical and/or horizontal direction to have the same resolution as the progressive video signal of the upper layer ; and performing inter-layer texture prediction of the video signal of the upper layer based on the upscaled video signal.

另一种根据本发明将基层的视频信号用于层间纹理预测的方法包括：在垂直和/或水平方向上放大基层的逐行视频信号以使其具有与上层的隔行视频信号相同的分辨率；以及基于经放大的视频信号执行上层的视频信号的层间纹理预测。Another method of using the video signal of the base layer for inter-layer texture prediction according to the present invention includes: amplifying the progressive video signal of the base layer in the vertical and/or horizontal direction to have the same resolution as the interlaced video signal of the upper layer ; and performing inter-layer texture prediction of the video signal of the upper layer based on the upscaled video signal.

在本发明的一个实施例中，视频信号分离和放大是在宏块级别(或即在宏块的基础上)执行的。In one embodiment of the invention, video signal separation and amplification is performed at the macroblock level (or, ie, on a macroblock basis).

在本发明的另一个实施例中，视频信号分离和放大是在画面级别上执行的。In another embodiment of the present invention, video signal separation and amplification is performed at the picture level.

在本发明的另一个实施例中，如果要对其应用层间纹理预测的两个层的画面格式不同，即如果一层包括逐行画面而另一层包括隔行画面，则执行视频信号分离和放大。In another embodiment of the present invention, video signal separation and enlarge.

在本发明的另一个实施例中，如果要对其应用层间纹理预测的两层的画面都是隔行的，则执行视频信号分离和放大。In another embodiment of the present invention, video signal separation and upscaling are performed if the pictures of both layers to which inter-layer texture prediction is to be applied are interlaced.

4.附图简述4. Brief description of the drawings

图1a和1b示出将单个视频源编码成多个层的方法；Figures 1a and 1b illustrate a method of encoding a single video source into multiple layers;

图2a和2b简要示出应用根据本发明的层间预测方法的视频信号编码装置的配置；Fig. 2 a and 2 b schematically show the configuration of the video signal coding apparatus applying the inter-layer prediction method according to the present invention;

图2c和2d示出用于编码隔行视频信号的画面序列的类型；Figures 2c and 2d illustrate the types of picture sequences used to encode interlaced video signals;

图3a和3b示意性示出根据本发明的实施例的其中为层间纹理预测构造基层画面并执行解块滤波的过程；Figures 3a and 3b schematically illustrate a process in which base layer pictures are constructed for inter-layer texture prediction and deblocking filtering is performed, according to an embodiment of the present invention;

图4a至4f示意性示出根据本发明的实施例的其中要用于MBAFF帧中场宏块的层间运动预测的虚拟基层的场宏块的运动信息利用帧宏块的运动信息来推导的过程；Figures 4a to 4f schematically illustrate how motion information of field macroblocks of a virtual base layer to be used for inter-layer motion prediction of field macroblocks of an MBAFF frame is derived using motion information of frame macroblocks according to an embodiment of the present invention process;

图4g示意性示出根据本发明的实施例的其中宏块对的纹理信息被用于MBAFF帧中的场宏块对的纹理预测的程序；Fig. 4g schematically shows a procedure in which texture information of macroblock pairs is used for texture prediction of field macroblock pairs in an MBAFF frame according to an embodiment of the present invention;

图4h示出根据本发明的实施例将帧宏块对变换成场宏块对的方法；Figure 4h shows a method for transforming a frame macroblock pair into a field macroblock pair according to an embodiment of the present invention;

图5a和5b示出根据本发明的另一个实施例的参考索引和运动信息推导程序；Figures 5a and 5b illustrate a reference index and motion information derivation procedure according to another embodiment of the present invention;

图6a至6c示意性示出根据本发明的实施例的其中利用帧宏块的运动信息推导虚拟基层中的场宏块的运动信息的程序；6a to 6c schematically illustrate a procedure in which motion information of field macroblocks in a virtual base layer is derived using motion information of frame macroblocks according to an embodiment of the present invention;

图6d示意性示出根据本发明的实施例的其中帧宏块对的纹理信息被用于场画面中的场宏块的纹理预测的程序；Fig. 6d schematically shows a procedure in which texture information of a frame macroblock pair is used for texture prediction of a field macroblock in a field picture according to an embodiment of the present invention;

图7a和7b示出根据本发明的另一个实施例的参考索引和运动信息推导程序；Figures 7a and 7b illustrate a reference index and motion information derivation procedure according to another embodiment of the present invention;

图8a至8c示意性示出根据本发明的实施例的其中要用于层间运动预测的虚拟基层的场宏块帧宏块的运动信息是利用MBAFF帧中的场宏块的运动信息来推导的程序；Figures 8a to 8c schematically illustrate a field macroblock frame macroblock of a virtual base layer to be used for inter-layer motion prediction in which motion information of field macroblocks in an MBAFF frame is derived using motion information of field macroblocks in an MBAFF frame according to an embodiment of the present invention program of;

图8d示意性示出根据本发明的实施例的其中MBAFF帧中的场宏块对的纹理信息被用于帧宏块对的纹理预测的程序；Fig. 8d schematically shows a procedure in which texture information of a field macroblock pair in an MBAFF frame is used for texture prediction of a frame macroblock pair according to an embodiment of the present invention;

图8e示出根据本发明的实施例将场宏块对变换成帧宏块对的方法；Figure 8e shows a method of transforming a field macroblock pair into a frame macroblock pair according to an embodiment of the present invention;

图8f和8g示意性示出根据本发明的实施例的当场宏块对中仅一个宏块是间模式时将MBAFF帧中的场宏块对的纹理信息用于帧宏块对的层间预测的程序；Figures 8f and 8g schematically illustrate the use of texture information of a field macroblock pair in an MBAFF frame for inter-layer prediction of a frame macroblock pair when only one macroblock in the field macroblock pair is inter-mode according to an embodiment of the present invention program of;

图8h示意性示出根据本发明的实施例的其中MBAFF帧中的场宏块对的纹理信息被用于多对帧宏块的纹理预测的程序；Figure 8h schematically shows a procedure in which texture information of field macroblock pairs in an MBAFF frame is used for texture prediction of multiple pairs of frame macroblocks according to an embodiment of the present invention;

图9a和9b示出根据本发明的另一个实施例的参考索引和运动信息推导程序；Figures 9a and 9b illustrate a reference index and motion information derivation procedure according to another embodiment of the present invention;

图10a至10c示意性示出根据本发明的实施例的在其中要用于层间运动预测的虚拟基层的帧宏块的运动信息是利用场画面中的场宏块的运动信息来推导的程序；Figures 10a to 10c schematically illustrate a procedure in which motion information of frame macroblocks of a virtual base layer to be used for inter-layer motion prediction is derived using motion information of field macroblocks in field pictures according to an embodiment of the present invention ;

图10d示意性示出根据本发明的实施例的其中场画面中的场宏块的纹理信息被用于帧宏块对的纹理预测的程序；Fig. 10d schematically shows a procedure in which texture information of field macroblocks in a field picture is used for texture prediction of frame macroblock pairs according to an embodiment of the present invention;

图11示出根据本发明的另一个实施例的参考索引和运动信息推导程序；FIG. 11 shows a reference index and motion information derivation procedure according to another embodiment of the present invention;

图12a和12b示意性示出根据本发明的另一个实施例的其中要用于层间运动预测的虚拟基层的帧宏块的运动信息是利用场画面中的场宏块的运动信息来推导的程序；Figures 12a and 12b schematically illustrate a motion information of a frame macroblock of a virtual base layer to be used for inter-layer motion prediction in which motion information of a field macroblock in a field picture is derived according to another embodiment of the present invention program;

图13a至13d分别根据画面的类型来示意性示出根据本发明的实施例的要用于层间运动预测的虚拟基层的场宏块的运动信息利用场宏块的运动信息来推导的程序；Figures 13a to 13d schematically show the procedure of deriving the motion information of the field macroblock of the virtual base layer to be used for inter-layer motion prediction according to the type of the picture, respectively, using the motion information of the field macroblock;

图14a至14k分别根据画面的类型来示出根据本发明的各种实施例的在诸层的空间分辨率不同时执行层间运动预测的方法；14a to 14k illustrate methods of performing inter-layer motion prediction when spatial resolutions of layers are different according to various embodiments of the present invention according to types of pictures, respectively;

图15a和15b示意性示出根据本发明的实施例的在增强层是逐行的而基层是隔行的时候将具有不同空间分辨率的基层的画面用于层间纹理预测的程序；Figures 15a and 15b schematically illustrate a procedure for using pictures of base layers with different spatial resolutions for inter-layer texture prediction when the enhancement layer is progressive and the base layer is interlaced, according to an embodiment of the present invention;

图16a和16b示意性示出根据本发明的实施例的其中为了将基层的画面用于层间纹理预测而将画面中的宏块对分成宏块且分离出的宏块被放大的程序；16a and 16b schematically illustrate a procedure in which pairs of macroblocks in a picture are divided into macroblocks and the separated macroblocks are enlarged in order to use a picture of a base layer for inter-layer texture prediction according to an embodiment of the present invention;

图17a和17b示意性示出根据本发明的实施例的在增强层是隔行的而基层是逐行的时候将具有不同空间分辨率的基层的画面用于层间纹理预测的程序；Figures 17a and 17b schematically illustrate a procedure for using pictures of a base layer with different spatial resolutions for inter-layer texture prediction when the enhancement layer is interlaced and the base layer is progressive according to an embodiment of the present invention;

图18示意性示出根据本发明的实施例的在增强层和基层都是隔行的时候将具有不同空间分辨率的基层的画面用于层间预测的程序；FIG. 18 schematically shows a procedure for using pictures of base layers with different spatial resolutions for inter-layer prediction when both the enhancement layer and the base layer are interlaced according to an embodiment of the present invention;

图19a示出根据本发明的实施例的在增强层是逐行帧序列且两层的画面类型和时间分辨率不同时应用层间预测的程序；Figure 19a shows a procedure for applying inter-layer prediction when the enhancement layer is a progressive frame sequence and the picture types and temporal resolutions of the two layers are different according to an embodiment of the present invention;

图19b示出根据本发明的实施例的在增强层是逐行帧序列且两层具有不同的画面类型和相同的分辨率时应用层间预测的程序；Figure 19b shows a procedure for applying inter-layer prediction when the enhancement layer is a progressive frame sequence and the two layers have different picture types and the same resolution according to an embodiment of the present invention;

图20示出根据本发明的实施例的在基层是逐行帧序列且两层的画面类型和时间分辨率不同时应用层间预测的程序；以及20 shows a procedure for applying inter-layer prediction when the base layer is a progressive frame sequence and the picture types and temporal resolutions of the two layers are different according to an embodiment of the present invention; and

图21示出根据本发明的实施例的在基层是逐行帧序列且两层具有不同的画面类型和相同的分辨率时应用层间预测的程序。FIG. 21 shows a procedure for applying inter-layer prediction when the base layer is a progressive frame sequence and the two layers have different picture types and the same resolution according to an embodiment of the present invention.

5.本发明的实施方式5. Embodiments of the present invention

现在参考附图详细描述本发明的实施例。Embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

图2a示意性示出应用根据本发明的层间预测方法的视频信号编码装置的构件块。尽管图2a的装置被实现成将输入视频信号编码成两层，但以下描述的本发明的原理也适用于在视频信号被编码成三层或甚至更多层时的层间过程。Fig. 2a schematically shows building blocks of a video signal encoding device applying the inter-layer prediction method according to the present invention. Although the apparatus of Fig. 2a is implemented to encode an input video signal into two layers, the principles of the invention described below also apply to the inter-layer process when the video signal is encoded into three or even more layers.

根据本发明的层间预测方法在图2a的装置中的增强层(EL)编码器20处执行。经编码的信息(运动信息和纹理信息)在基层(EL)编码器21处接收。基于所接收的信息执行层间纹理预测或运动预测。如有需要，则解码所接收的信息并基于解码出的信息执行预测。当然，在本发明中，如图2b所示，输入视频信号可以是使用已经被编码的基层的视频源3来编解码的。在这种情形中如下所描述的层间预测方法同样适用。The inter-layer prediction method according to the invention is performed at the enhancement layer (EL) encoder 20 in the arrangement of Fig. 2a. The encoded information (motion information and texture information) is received at the base layer (EL) encoder 21 . Inter-layer texture prediction or motion prediction is performed based on the received information. If necessary, the received information is decoded and a prediction is performed based on the decoded information. Certainly, in the present invention, as shown in FIG. 2 b , the input video signal may be encoded and decoded using the already encoded video source 3 of the base layer. In this case the inter-layer prediction method described below also applies.

在图2a的情形中，可以有在其中BS编码器21编码隔行视频信号或在其中图2b的经编码的视频源3被编解码的两种方法。具体地，在这两种方法之一中，如图3a所示，将隔行视频信号在逐场的基础上简单地编码成场序列，而在另一种方法中，如图3b所示，通过以两个(偶和奇)场的宏块对来构造序列的每一帧来将帧编码成帧序列。以此方式编码的帧中的宏块对中上宏块被称为“顶宏块”，而下宏块被称为“底宏块”。如果顶宏块由偶(或奇)场图像分量构成，则底宏块由奇(或偶)场图像分量构成。以此方式构造的帧被称为宏块自适应帧场(MBAFF)帧。MBAFF帧不仅可包括各自包含奇和偶场宏块的宏块对，还可包括各自包含两个帧宏块的宏块对。In the case of Fig. 2a, there may be two methods in which the BS encoder 21 codes the interlaced video signal or in which the coded video source 3 of Fig. 2b is coded. Specifically, in one of the two methods, as shown in Figure 3a, the interlaced video signal is simply encoded as a sequence of fields on a field-by-field basis, while in the other method, as shown in Figure 3b, the Frames are coded into a sequence of frames by constructing each frame of the sequence in pairs of macroblocks of two (even and odd) fields. The upper macroblock of a pair of macroblocks in a frame encoded in this way is called the "top macroblock" and the lower macroblock is called the "bottom macroblock". If the top macroblock is composed of even (or odd) field image components, the bottom macroblock is composed of odd (or even) field image components. Frames constructed in this manner are referred to as macroblock adaptive frame field (MBAFF) frames. An MBAFF frame may include not only macroblock pairs each containing odd and even field macroblocks, but also macroblock pairs each containing two frame macroblocks.

相应地，当画面中的宏块具有隔行图像分量时，它可能是场中的宏块，并且也可能是帧中的宏块。每一个具有隔行图像分量的宏块被称为场宏块，而每一个具有逐行(扫描)图像分量的宏块称为帧宏块。Correspondingly, when a macroblock in a picture has an interlaced image component, it may be a macroblock in a field, and it may also be a macroblock in a frame. Each macroblock with an interlaced image component is called a field macroblock, and each macroblock with a progressive (scanning) image component is called a frame macroblock.

因此，需要通过确定要在EL编码器20处编码的宏块和要在宏块的层间预测中使用的基层宏块各自的类型是帧宏块类型还是场宏块类型来确定层间预测方法。如果宏块是场宏块，则需要通过确定它是场中的还是MBAFF帧中的场宏块来确定层间预测方法。Therefore, it is necessary to determine the interlayer prediction method by determining whether the respective types of the macroblock to be encoded at the EL encoder 20 and the base layer macroblock to be used in the interlayer prediction of the macroblock are the frame macroblock type or the field macroblock type . If the macroblock is a field macroblock, the inter-layer prediction method needs to be determined by determining whether it is a field macroblock in a field or an MBAFF frame.

将分别针对每一种情况描述该方法。在描述之前，假设当前层的分辨率等于基层的分辨率。即，假设SpatialScalabilityType()是0。当前层的分辨率高于基层分辨率时的描述将在稍后给出。在以下的描述和附图中，术语“顶”和“偶”(或奇)被可互换地使用，并且术语“底”和“奇”(或偶)被可互换地使用。The method will be described for each case separately. Before the description, it is assumed that the resolution of the current layer is equal to the resolution of the base layer. That is, assume that SpatialScalabilityType() is 0. A description of when the resolution of the current layer is higher than that of the base layer will be given later. In the following description and drawings, the terms "top" and "even" (or odd) are used interchangeably, and the terms "bottom" and "odd" (or even) are used interchangeably.

为了利用基层来执行层间预测以编码或解码增强层，首先需要解码基层。因此，首先描述基层解码如下。In order to perform inter-layer prediction using a base layer to encode or decode an enhancement layer, it is first necessary to decode the base layer. Therefore, base layer decoding is first described as follows.

在解码基层时，不仅解码诸如划分模式、参考索引、和运动向量之类的基层运动信息，还解码基层的纹理。When decoding the base layer, not only base layer motion information such as division mode, reference index, and motion vector but also texture of the base layer is decoded.

当基层的纹理被解码用于层间纹理预测时，并不是基层的所有图像样本数据都被解码，这是为了降低解码器的负荷。内模式宏块的图像样本数据被解码出来，而间模式宏块是仅残差数据——即图像样本数据之间的误差数据——被解码出来而不用毗邻画面进行运动补偿。When the texture of the base layer is decoded for inter-layer texture prediction, not all image sample data of the base layer are decoded, which is to reduce the load of the decoder. Image sample data of intra-mode macroblocks is decoded, while inter-mode macroblocks only residual data—that is, error data between image sample data—is decoded without motion compensation for adjacent frames.

此外，用于层间纹理预测的基层纹理解码不是在逐宏块的基础上而是在逐画面的基础上执行，以构造时间上与增强层画面一致的基层画面。基层画面是如上所述由从内模式宏块重构出的图像样本数据和从间模式宏块解码出的残差数据来构造的。Furthermore, base layer texture decoding for inter-layer texture prediction is performed not on a macroblock-by-macroblock basis but on a picture-by-picture basis to construct base layer pictures that are temporally consistent with enhancement layer pictures. A base layer picture is constructed as described above from image sample data reconstructed from intra-mode macroblocks and residual data decoded from inter-mode macroblocks.

诸如DCT和量化之类的内模式或间模式运动补偿和变换是在图像块基础上进行的，例如在16 x 16宏块基础上或在4 x 4子块基础上进行。这导致块边界处的分块伪像使图像畸变。应用解块滤波来减少这些分块伪像。解块滤波器使图像块的边沿平滑以提高视频帧的质量。Intra-mode or inter-mode motion compensation and transformations such as DCT and quantization are performed on a block basis, e.g. on a 16 x 16 macroblock basis or on a 4 x 4 subblock basis. This results in blocking artifacts at block boundaries that distort the image. Deblocking filtering is applied to reduce these blocking artifacts. The deblocking filter smoothes the edges of image blocks to improve the quality of video frames.

是否应用解块滤波来减少分块畸变取决于图像块在边界处的强度和边界周围像素的梯度。解块滤波器的力度或程度由量化参数、内模式、间模式、指示块大小等的图像块划分模式、运动向量、解块滤波前的像素值等确定。Whether to apply deblocking filtering to reduce block distortion depends on the intensity of the image block at the boundary and the gradient of the pixels around the boundary. The strength or degree of the deblocking filter is determined by quantization parameters, intra mode, inter mode, image block division mode indicating block size, etc., motion vectors, pixel values before deblocking filtering, and the like.

层间预测中的解块滤波器是被应用于作为增强层的基内模式(intraBL或层间内模式)宏块的纹理预测的基础的基层画面中的内模式宏块。A deblocking filter in inter-layer prediction is applied to an intra-mode macroblock in a base layer picture that is the basis for texture prediction of an intra-base mode (intraBL or inter-layer intra-mode) macroblock of the enhancement layer.

当要根据层间预测方法编码的两层全被如图2c所示地编码成场画面序列时，这两层全被看作是帧格式，从而使得从针对帧格式的编解码过程可容易地推导出包括解块滤波的编码/解码过程。When the two layers to be coded according to the inter-layer prediction method are both coded into a sequence of field pictures as shown in Fig. An encoding/decoding process including deblocking filtering is derived.

现在将针对基层的画面格式与增强层的画面格式不同的情况——即基层为帧(或即逐行)格式而基层为场(或即隔行)格式的情况、基层为场格式而基层为帧格式的情况、或是如图2c和2d所示的尽管增强层和基层两者都为场格式但增强层和基层之一被编码成场画面序列而另一个被编码成MBAFF帧的情况——来描述根据本发明的实施例执行解块滤波的方法。Now we will focus on the case where the picture format of the base layer is different from the picture format of the enhancement layer—that is, the case where the base layer is a frame (or progressive) format and the base layer is a field (or interlaced) format, the base layer is a field format and the base layer is a frame format, or the case where one of the enhancement layer and the base layer is coded as a sequence of field pictures and the other is coded as an MBAFF frame even though both the enhancement layer and the base layer are in field format as shown in Figures 2c and 2d— A method for performing deblocking filtering according to an embodiment of the present invention will be described.

图3a和3b示意性示出根据本发明的实施例的在其中构造基层画面以执行用于层间纹理预测的解块滤波的过程。Figures 3a and 3b schematically illustrate a process in which a base layer picture is constructed to perform deblocking filtering for inter-layer texture prediction according to an embodiment of the present invention.

图3a示出其中增强层为帧格式而基层为场格式的实施例，而图3b示出其中基层为场格式而基层为帧格式的实施例。Figure 3a shows an embodiment in which the enhancement layer is in frame format and the base layer is in field format, while Figure 3b shows an embodiment in which the base layer is in field format and the base layer is in frame format.

在这些实施例中，为了层间纹理预测，基层的间模式宏块和内模式宏块的纹理被解码，以构造包括图像样本数据和残差数据的基层画面，且在将解块滤波器应用于所构造出的画面以减少分块伪像之后根据基层的分辨率(或即屏幕大小)与增强层的分辨率之比来升采样所构造出的画面。In these embodiments, for inter-layer texture prediction, the texture of the inter-mode macroblocks and intra-mode macroblocks of the base layer is decoded to construct a base layer picture including image sample data and residual data, and after applying the deblocking filter After the constructed picture is used to reduce blocking artifacts, the constructed picture is up-sampled according to the ratio of the resolution of the base layer (or screen size) to the resolution of the enhancement layer.

图3a和3b中的第一方法(方法1)是其中基层被分成两个场画面以执行解块滤波的方法。在该方法中，当利用以不同画面格式编码的基层来创建增强层时，基层画面被分成偶行场画面和奇行场画面，且这两个场画面被解块(即，进行用于解块的滤波)并升采样。然后将这两个画面拼接成单个画面，并基于该单个画面执行层间纹理预测。The first method (method 1) in FIGS. 3a and 3b is a method in which a base layer is divided into two field pictures to perform deblocking filtering. In this method, when an enhancement layer is created using a base layer encoded in a different picture format, the base layer picture is divided into an even field picture and an odd field picture, and the two field pictures are deblocked (i.e., processed for deblocking filtering) and upsampling. These two pictures are then spliced into a single picture, and inter-layer texture prediction is performed based on the single picture.

该第一方法包括以下三个步骤。This first method includes the following three steps.

在分离步骤(步骤1)，基层画面被分成包括偶行的顶场(或奇场)画面和包括奇行的底场(或偶场)画面。基层画面是包括通过运动补偿从基层的数据流重构出的残差数据(间模式数据)和图像样本数据(内模式数据)的视频画面。In the separation step (step 1), the base layer picture is divided into a top field (or odd field) picture including even lines and a bottom field (or even field) picture including odd lines. A base layer picture is a video picture including residual data (inter-mode data) and image sample data (intra-mode data) reconstructed from the data stream of the base layer by motion compensation.

在解块步骤(步骤2)，在分离步骤中被分开的场画面通过解块滤波器被解块。这里，可使用常规的解块滤波器作为该解块滤波器。In the deblocking step (step 2), the field pictures separated in the separating step are deblocked by a deblocking filter. Here, a conventional deblocking filter may be used as the deblocking filter.

当增强层的分辨率与基层的分辨率不同时，经解块的场画面根据增强层的分辨率和基层的分辨率之比来升采样。When the resolution of the enhancement layer is different from that of the base layer, the deblocked field pictures are upsampled according to the ratio of the resolution of the enhancement layer to the resolution of the base layer.

在拼接步骤(步骤3)，经升采样的顶场画面和经升采样的底场画面以交替方式被隔行扫描以拼接成单个画面。之后，基于此单个画面执行增强层的纹理预测。In the stitching step (step 3), the up-sampled top field picture and the up-sampled bottom field picture are interlaced in an alternate manner to be stitched into a single picture. Afterwards, texture prediction of the enhancement layer is performed based on this single picture.

在图3a和3b中的第二方法(方法2)中，当利用以不同画面格式编码的基层来创建增强层时，不将基层画面分成两个场画面而是将其直接对其解块并升采样，并基于结果所得的画面执行层间纹理预测。In the second method (method 2) in Figures 3a and 3b, when creating an enhancement layer with a base layer coded in a different picture format, the base layer picture is not split into two field pictures but is directly deblocked and Upsampling and performing inter-layer texture prediction based on the resulting frame.

在该第二方法中，与要通过层间纹理预测来编码的增强层画面相对应的基层画面不被分成顶和底场画面而是被立即解块，然后升采样。之后，基于此经升采样的画面执行增强层的纹理预测。In this second approach, base layer pictures corresponding to enhancement layer pictures to be coded by inter-layer texture prediction are not split into top and bottom field pictures but are immediately deblocked and then upsampled. Then, texture prediction of the enhancement layer is performed based on this upsampled picture.

应用于为层间运动预测而构造的基层画面的解块滤波器仅被应用于包括从内模式宏块解码出的图像样本数据的区域，而不被应用于包括残差数据的区域。A deblocking filter applied to a base layer picture constructed for inter-layer motion prediction is applied only to a region including image sample data decoded from an intra-mode macroblock, and not to a region including residual data.

在图3a中的基层被编码成场格式——即基层如图2c所示地被编码成场画面序列或如图2d所示地被编码成MBAFF帧的情况下，为了应用第二方法，需要执行交替地隔行扫描顶和底场画面的行以将其组合成单个画面(在图2c的情况下)或交替地隔行扫描场宏块对的顶和底宏块的行以将其组合成单个画面(在图2d的情况下)的过程。该过程将参考图8d和8e详细描述。要被隔行扫描的顶和底场画面或者顶和底宏块是包括通过运动补偿重构出的残差数据(间模式数据)和图像样本数据(内模式数据)的场画面或宏块。In the case where the base layer in Fig. 3a is coded in field format - i.e. the base layer is coded as a sequence of field pictures as shown in Fig. 2c or as an MBAFF frame as shown in Fig. 2d, in order to apply the second method, one needs Alternately interleaving the lines of the top and bottom field pictures to combine them into a single picture (in the case of Figure 2c) or alternately interleaving the lines of the top and bottom macroblocks of a field macroblock pair to combine them into a single The process of the picture (in the case of Figure 2d). This process will be described in detail with reference to Figures 8d and 8e. The top and bottom field pictures or top and bottom macroblocks to be interlaced are field pictures or macroblocks including residual data (inter mode data) and image sample data (intra mode data) reconstructed by motion compensation.

此外，在如图2d所示的MBAFF帧中的(基层的)场宏块对的顶和底宏块是不同的模式并且从这些宏块中选择内模式块用于增强层的宏块对的层间纹理预测的情况下(在稍后描述的图8g的情况下)，在如图2d所示的编码成MBAFF帧中的场宏块对的基层中的任何帧(画面)在时间上与增强层画面不一致的情况下(在稍后描述的图8h的情况下)，或在具有宏块对的增强层的纹理是从如图2c所示的具有场画面的场宏块的基层预测的情况下(在稍后描述的图10d的情况下)，场宏块中选中的一个被升采样成临时的宏块对(图8g中的“841”和图8h中的“851”和“852”)或两个临时宏块(图10d中的“1021”)，并将解块滤波器应用于这些宏块中的内模式宏块。Furthermore, the top and bottom macroblocks of the field macroblock pairs (of the base layer) in an MBAFF frame as shown in Fig. 2d are of different modes and the intra-mode blocks are selected from these macroblocks for the In the case of inter-layer texture prediction (in the case of Figure 8g described later), any frame (picture) in the base layer encoded as a pair of field macroblocks in an MBAFF frame as shown in Figure 2d is temporally related to In the case of non-uniform enhancement layer pictures (in the case of Fig. 8h described later), or when the texture of the enhancement layer with macroblock pairs is predicted from the base layer with field macroblocks of field pictures as shown in Fig. 2c In the case (in the case of Fig. 10d described later), a selected one of the field macroblocks is upsampled into a temporary macroblock pair ("841" in Fig. 8g and "851" and "852" in Fig. 8h ") or two temporary macroblocks ("1021" in Fig. 10d), and apply the deblocking filter to the intra-mode macroblocks in these macroblocks.

在以下各种实施例中描述的层间纹理预测是基于图3a和3b的实施例中描述的经解块基层画面来执行的。The inter-layer texture prediction described in the following various embodiments is performed based on the deblocked base layer pictures described in the embodiments of Figs. 3a and 3b.

现在将针对根据要编解码的当前层中的宏块类型和要用于当前层的宏块的层间预测的基层的宏块类型分类的每一种情况分别描述层间预测方法。在本描述中，如上所述地假设当前层的空间分辨率等于基层的空间分辨率。The inter-layer prediction method will now be described separately for each case classified according to the macroblock type in the current layer to be coded and the macroblock type of the base layer to be used for inter-layer prediction of the macroblock of the current layer. In this description, it is assumed that the spatial resolution of the current layer is equal to the spatial resolution of the base layer as described above.

I.帧MB->MBAFF帧中的场MB的情况I. Case of field MB in frame MB->MBAFF frame

在这种情况下，当前层(EL)中的宏块被编码成MBAFF帧中的场宏块，并且要用于当前层的宏块的层间预测的基层中的宏块被编码成帧宏块。基层中的上宏块和下宏块两者中所包括的视频信号成分与当前层中一对同位的宏块中所包括的视频信号成分是相同的。上和下(顶和底)宏块将被称为宏块对，且术语“对”在以下的描述中将用于描述一对垂直毗邻的块。首先，描述层间运动预测如下。In this case, macroblocks in the current layer (EL) are coded as field macroblocks in an MBAFF frame, and macroblocks in the base layer to be used for inter-layer prediction of macroblocks in the current layer are coded as frame macroblocks piece. The video signal components included in both the upper macroblock and the lower macroblock in the base layer are the same as those included in a pair of co-located macroblocks in the current layer. The upper and lower (top and bottom) macroblocks will be referred to as macroblock pairs, and the term "pair" will be used in the following description to describe a pair of vertically adjacent blocks. First, inter-layer motion prediction is described as follows.

EL编码器20使用通过将基层的宏块对410归并成单个宏块(通过在垂直方向上压缩至一半大小)来获得的宏块划分模式作为当前宏块的划分模式。图4a示出该过程的详细例子。如所示，首先，将基层的相应宏块对410归并成单个宏块(S41)，且通过归并获得的宏块的划分模式被拷贝到另一个宏块以构造出宏块对411(S42)。之后，将该对宏块411各自的划分模式应用于虚拟基层的宏块对412(S43)。The EL encoder 20 uses the macroblock division mode obtained by merging the macroblock pairs 410 of the base layer into a single macroblock (by compressing to half the size in the vertical direction) as the division mode of the current macroblock. Figure 4a shows a detailed example of this process. As shown, first, the corresponding macroblock pairs 410 of the base layer are merged into a single macroblock (S41), and the division mode of the macroblock obtained by merging is copied to another macroblock to construct the macroblock pair 411 (S42). . After that, the respective division modes of the pair of macroblocks 411 are applied to the macroblock pair 412 of the virtual base layer (S43).

然而，在相应的宏块对410被归并成单个宏块时，可能会生成在划分模式中不允许的划分区域。为了防止这种情况，EL编码器20根据以下规则确定划分模式。However, when the corresponding macroblock pairs 410 are merged into a single macroblock, a split area that is not allowed in the split mode may be generated. In order to prevent this, the EL encoder 20 determines the division pattern according to the following rules.

1)基层的宏块对中的顶和底两个8 x 8块(图4a中的“B8_0”和“B8_2”)被归并成单个8 x 8块。但是，如果相应的8 x 8块中的任何块都未被细分，则它们被归并成两个8 x 4块，而如果相应的8 x 8块中有任何块已被细分，则它们被归并成四个4 x 4块(图4a中的“401”)。1) The top and bottom two 8 x 8 blocks ("B8_0" and "B8_2" in Figure 4a) of the macroblock pair in the base layer are merged into a single 8 x 8 block. However, if any of the corresponding 8 x 8 blocks are not subdivided, they are merged into two 8 x 4 blocks, and if any of the corresponding 8 x 8 blocks are subdivided, they are are merged into four 4 x 4 blocks (“401” in Figure 4a).

2)基层的8 x 16块缩小成8 x 8块，16 x 8块缩小成两个毗邻的8 x 4块，并且16 x 16块缩小成16 x 8块。2) The 8 x 16 block of the base layer is reduced to an 8 x 8 block, the 16 x 8 block is reduced to two adjacent 8 x 4 blocks, and the 16 x 16 block is reduced to a 16 x 8 block.

如果相应宏块对中至少有一个宏块是以内模式编码的，则EL编码器20在归并过程之前首先执行以下过程。If at least one macroblock in the corresponding macroblock pair is intra-mode coded, the EL encoder 20 first performs the following process before the merging process.

如果这两个宏块中仅有一个是内模式，则间宏块的诸如宏块划分模式、参考索引、和运动向量之类的运动信息如图4b所示被拷贝到内宏块，或者内宏块如图4c所示被认为是具有0运动向量和0参考索引的16 x 16间宏块。或者，如图4d所示，内宏块的参考索引通过将间宏块的参考索引拷贝到内宏块来设置，且将0运动向量分配给内宏块。然后，执行上面提及的归并过程，然后如下所述执行参考索引和运动向量推导程序。If only one of the two macroblocks is intra mode, the motion information of the inter macroblock such as macroblock partition mode, reference index, and motion vector is copied to the intra macroblock as shown in Figure 4b, or the intra macroblock A macroblock as shown in Figure 4c is considered to be a 16 x 16 inter-macroblock with a motion vector of 0 and a reference index of 0. Alternatively, as shown in Fig. 4d, the reference index of the inner macroblock is set by copying the reference index of the inter macroblock to the inner macroblock, and a 0 motion vector is assigned to the inner macroblock. Then, the merge process mentioned above is performed, followed by the reference index and motion vector derivation procedure as described below.

EL编码器20执行以下过程以从相应宏块对410的参考索引推导当前宏块对412的参考索引。The EL encoder 20 performs the following process to derive the reference index of the current macroblock pair 412 from the reference index of the corresponding macroblock pair 410 .

如果对应于当前8 x 8块的基层8 x 8块对中的每一块已被细分成相同数目个部分，则该8 x 8块对中一块(顶块或底块)的参考索引被确定为当前8 x 8块的参考索引。否则，该8 x 8块对中已被细分成较少数目个部分的那一块的参考索引被确定为当前8 x 8块的参考索引。If each block in the base layer 8 x 8 block pair corresponding to the current 8 x 8 block has been subdivided into the same number of parts, the reference index of a piece (top block or bottom block) in the 8 x 8 block pair is determined is the reference index of the current 8 x 8 block. Otherwise, the reference index of the block that has been subdivided into a smaller number of parts in the 8x8 block pair is determined to be the reference index of the current 8x8 block.

在本发明的另一个实施例中，为对应于当前8 x 8块的基层8 x 8块对设置的参考索引中较小的一个被确定为当前8 x 8块的参考索引。图4e的例子中的这种确定方法可表达如下：In another embodiment of the present invention, the smaller one of the reference indices set for the base layer 8 x 8 block pair corresponding to the current 8 x 8 block is determined as the reference index of the current 8 x 8 block. This method of determination in the example of Figure 4e can be expressed as follows:

当前B8_0的参考索引＝min(基顶帧MB的B8_0的参考索引，基顶帧MB的B8_2的参考索引)The current reference index of B8_0=min(the reference index of B8_0 of the base top frame MB, the reference index of B8_2 of the base top frame MB)

当前B8_1的参考索引＝min(基顶帧MB的B8_1的参考索引，基顶帧MB的B8_3的参考索引)Current reference index of B8_1=min(reference index of B8_1 of base top frame MB, reference index of B8_3 of base top frame MB)

当前B8_2的参考索引＝min(基底帧MB的B8_0的参考索引，基底帧MB的B8_2的参考索引)，以及Reference index of current B8_2=min(reference index of B8_0 of base frame MB, reference index of B8_2 of base frame MB), and

当前B8_3的参考索引＝min(基底帧MB的B8_1的参考索引，基底帧MB的B8_3的参考索引)。The current reference index of B8_3=min(the reference index of B8_1 of the base frame MB, the reference index of B8_3 of the base frame MB).

以上的参考索引推导程序可适用于顶和底场宏块两者。将以此方式确定的每一个8 x 8块的参考索引乘以2，并将相乘后的参考索引确定为其最终参考索引。作该乘法的原因是在解码时，画面的数目是帧序列中的数目的两倍，因为属于画面的场宏块被分成偶场和奇场。取决于解码算法，底场宏块的最终参考索引可通过将其参考索引乘以2然后将相乘后的参考索引加1来确定。The above reference index derivation procedure is applicable to both top and bottom field macroblocks. The reference index of each 8 x 8 block determined in this way is multiplied by 2, and the multiplied reference index is determined as its final reference index. The reason for this multiplication is that at the time of decoding, the number of pictures is twice that of the sequence of frames, since the field macroblocks belonging to a picture are divided into even and odd fields. Depending on the decoding algorithm, the final reference index of the bottom field macroblock can be determined by multiplying its reference index by 2 and adding 1 to the multiplied reference index.

以下是EL编码器20推导虚拟基层的宏块对的运动向量的程序。The following is the procedure for the EL encoder 20 to derive the motion vectors of the macroblock pairs of the virtual base layer.

运动向量是在4 x 4块的基础上确定的，因此基层的相应4 x 8块被标识出来，如图4f所示。如果该相应的4 x 8块已被细分，则其顶或底4 x 4块的运动向量被确定为当前4 x 4块的运动向量。否则，将对应的4 x 8块的运动向量确定为当前4 x 4块的运动向量。所确定的运动向量在其垂直分量除以2后被用作当前4 x 4块的最终运动向量。作该除法的原因是包括在两个帧宏块中的图像成分对应于一个场宏块的图像成分因而使得场图像的大小在垂直方向上减小一半。Motion vectors are determined on the basis of 4 x 4 blocks, so the corresponding 4 x 8 blocks of the base layer are identified, as shown in Fig. 4f. If the corresponding 4x8 block has been subdivided, the motion vector of its top or bottom 4x4 block is determined to be the motion vector of the current 4x4 block. Otherwise, determine the motion vector of the corresponding 4 x 8 block as the motion vector of the current 4 x 4 block. The determined motion vector is used as the final motion vector for the current 4x4 block after its vertical component is divided by 2. The reason for this division is that the image components included in two frame macroblocks correspond to the image components of one field macroblock thereby reducing the size of the field image by half in the vertical direction.

一旦虚拟基层的场宏块对412的运动信息以此方式确定，该运动信息就被用于增强层的目标场宏块对413的层间运动预测。同样，在以下的描述中，一旦虚拟基层的宏块或宏块对的运动信息被确定，该运动信息就被用于当前层的相应宏块或相应宏块对的层间运动预测。在以下的描述中，假设即使没有提及虚拟基层的宏块或宏块对的运动信息被用于当前层的相应宏块或相应宏块对的层间运动预测该过程也是被应用的。Once the motion information of the field macroblock pair 412 of the virtual base layer is determined in this way, this motion information is used for inter-layer motion prediction of the target field macroblock pair 413 of the enhancement layer. Likewise, in the following description, once the motion information of a macroblock or macroblock pair of a virtual base layer is determined, the motion information is used for inter-layer motion prediction of the corresponding macroblock or macroblock pair of the current layer. In the following description, it is assumed that the process is applied even without mentioning that motion information of a macroblock or a macroblock pair of a virtual base layer is used for inter-layer motion prediction of a corresponding macroblock or a corresponding macroblock pair of a current layer.

图5根据本发明的另一个实施例示意性示出要被用于层间预测的虚拟基层的场宏块对500的运动信息如何从对应于当前宏块对的基层帧宏块对的运动信息推导。在本实施例中，如图所示，基层的帧宏块对的顶宏块的顶或底8 x 8块的参考索引被用作虚拟基层的场宏块对500中的每一个宏块的顶8 x 8块的参考索引，且基层的底宏块的顶或底8 x 8块的参考索引被用作该场宏块对500中的每一个宏块的底8 x 8块的参考索引。另一方面，如图所示，基层的帧宏块对的顶宏块的最顶上的4 x 4块的运动向量被共用于虚拟基层的场宏块对500中的每一个宏块最顶上的4 x 4块，基层的帧宏块对的顶宏块的第三个4 x 4块的运动向量被共用于该场宏块对500中的每一个宏块的第二个4 x 4块，基层的帧宏块对的底宏块最顶上的4 x 4块的运动向量被共用于该场宏块对500中的每一个宏块的第三个4 x 4块，并且基层的帧宏块对的底宏块的第三个4 x 4块的运动向量被共用于该场宏块对500中的每一个宏块的第四个4 x 4块。5 schematically shows how the motion information of a field macroblock pair 500 of a virtual base layer to be used for inter-layer prediction is derived from the motion information of a base layer frame macroblock pair corresponding to the current macroblock pair according to another embodiment of the present invention. Derivation. In this embodiment, as shown, the reference index of the top or bottom 8x8 block of the top macroblock of the base layer's frame macroblock pair is used as the reference index of each macroblock in the virtual base layer's field macroblock pair 500 The reference index of the top 8 x 8 block, and the reference index of the top or bottom 8 x 8 block of the bottom macroblock of the base layer is used as the reference index of the bottom 8 x 8 block of each macroblock in the field macroblock pair 500 . On the other hand, the motion vectors for the topmost 4 x 4 block of the top macroblock of the base layer's frame macroblock pair 500 are shared for each of the topmost macroblocks in the virtual base layer's field macroblock pair 500, as shown 4 x 4 blocks above, the motion vector of the third 4 x 4 block of the top macroblock of the frame macroblock pair of the base layer is shared for the second 4 x 4 of each macroblock in the field macroblock pair 500 block, the motion vector of the topmost 4 x 4 block of the bottom macroblock of the frame macroblock pair of the base layer is shared for the third 4 x 4 block of each macroblock in the field macroblock pair 500, and the base layer's The motion vector for the third 4 x 4 block of the bottom macroblock of the frame macroblock pair is shared for the fourth 4 x 4 block of each macroblock in the field macroblock pair 500.

如图5a所示，为用于层间预测而构造的场宏块对500中8 x 8块中的8 x 8块中的顶4 x 4块501和底4 x 4块502使用基层的不同8 x 8块511和512中的4 x 4块的运动向量。这些运动向量可能是使用不同参考画面的运动向量。即，不同的8 x 8块511和512可能具有不同的参考索引。相应地，在这种情况下，为了构造虚拟基层的宏块对500，EL编码器20将为顶4 x 4块501选择的相应4 x 4块503的运动向量共用作虚拟基层的第二个4 x 4块502的运动向量，如图5b所示(521)。As shown in Figure 5a, the top 4 x 4 block 501 and the bottom 4 x 4 block 502 of the 8 x 8 block pairs 500 in a field macroblock constructed for use in inter-layer prediction use the difference in the base layer. Motion vectors for 4 x 4 blocks out of 8 x 8 blocks 511 and 512. These motion vectors may be motion vectors using different reference pictures. That is, different 8x8 blocks 511 and 512 may have different reference indices. Correspondingly, in this case, to construct the macroblock pair 500 of the virtual base layer, the EL encoder 20 will share the motion vectors of the corresponding 4 x 4 block 503 selected for the top 4 x 4 block 501 as the second one of the virtual base layer The motion vector for the 4 x 4 block 502, as shown in Figure 5b (521).

在参考图4a至4f描述的实施例中，为了构造虚拟基层的运动信息以预测当前宏块对的运动信息，EL编码器20基于基层的相应宏块对的运动信息顺序地推导划分模式、参考索引、和运动向量。然而，在参考图5a和5b所述的实施例中，EL编码器20首先基于基层的相应宏块对的运动信息推导虚拟基层的宏块对的参考索引和运动向量，然后基于所推导出的值最终确定虚拟基层的宏块对的划分模式。当划分模式被确定时，具有相同的推导出的运动向量和参考索引的4 x 4块单元被组合，且如果组合后的块模式是允许的划分模式，则将划分模式设置成此组合后的模式，否则将划分模式设置成组合前的模式。In the embodiment described with reference to FIGS. 4a to 4f, in order to construct the motion information of the virtual base layer to predict the motion information of the current macroblock pair, the EL encoder 20 sequentially derives the partitioning mode based on the motion information of the corresponding macroblock pair of the base layer, ref. index, and motion vector. However, in the embodiment described with reference to Figures 5a and 5b, the EL encoder 20 first derives the reference indices and motion vectors of the macroblock pairs of the virtual base layer based on the motion information of the corresponding macroblock pairs of the base layer, and then based on the derived The value ultimately determines the partitioning mode of the macroblock pairs of the virtual base layer. When the partition mode is determined, 4 x 4 block units with the same derived motion vector and reference index are combined, and if the combined block mode is an allowed partition mode, the partition mode is set to this combined mode, otherwise set the division mode to the mode before combination.

在上述的实施例中，如果基层的相应宏块对410中的两个宏块都是内模式，则对当前宏块对413只执行基内预测。在这种情况下，不执行运动预测。当然，在纹理预测的情况下不构造虚拟基层的宏块对。如果基层的相应宏块对410中只有一个宏块是内模式，则如图4b所示将间宏块的运动信息拷贝至内宏块，如图4c所示将内宏块的运动向量和参考索引设置成0，或者如图4d所示通过将间宏块的参考索引拷贝到内宏块来设置内宏块的参考索引并且将内宏块的运动向量设置成0。然后，虚拟基层的宏块对的运动信息如上所述地推导。In the above-mentioned embodiment, if both macroblocks in the corresponding macroblock pair 410 of the base layer are intra-mode, only intra-base prediction is performed on the current macroblock pair 413 . In this case, motion prediction is not performed. Of course, macroblock pairs of the virtual base layer are not constructed in the case of texture prediction. If only one macroblock in the corresponding macroblock pair 410 of the base layer is an intra mode, the motion information of the inter macroblock is copied to the intra macroblock as shown in Figure 4b, and the motion vector and reference The index is set to 0, or the reference index of the inner macroblock is set by copying the reference index of the inter macroblock to the inner macroblock and the motion vector of the inner macroblock is set to 0 as shown in Fig. 4d. Then, the motion information for the macroblock pairs of the virtual base layer is derived as described above.

在如上所述为层间运动预测构造虚拟基层的宏块对之后，EL编码器20使用所构造出的宏块对的运动信息来预测和编码当前场宏块对413的运动信息。After constructing the macroblock pairs of the virtual base layer for inter-layer motion prediction as described above, the EL encoder 20 uses the motion information of the constructed macroblock pairs to predict and encode the motion information of the current field macroblock pair 413 .

现在将描述层间纹理预测。图4g示出在“帧MB->MBAFF帧中的场MB”的情况下的示例层间纹理预测方法。EL编码器20标识出基层的相应帧宏块对410的块模式。如果相应帧宏块对410中的两个宏块或者都是内模式或者都是间模式，则EL编码器20将基层的相应宏块对410转换(变换)成临时的场宏块对421，以便或者执行当前场宏块对413的基内预测(当两个帧宏块410都是内模式时)或者以下面描述的方式执行其残差预测(当两个帧宏块410都是间模式时)。当相应的宏块对410中的两个宏块都是内模式时，该临时的场宏块对421包括如前所述的在内模式的情况下在完成解码后被解块(即，进行用于解块的滤波)的数据。在以下对各种实施例的描述中，对于从用于纹理预测的基层的宏块推导出的临时宏块对同样如此。Inter-layer texture prediction will now be described. Fig. 4g shows an example inter-layer texture prediction method in the case of "frame MB -> field MB in MBAFF frame". The EL encoder 20 identifies the block mode for the corresponding frame macroblock pair 410 of the base layer. If both macroblocks in a corresponding frame macroblock pair 410 are either both intra-mode or both inter-mode, the EL encoder 20 converts (transforms) the corresponding macroblock pair 410 of the base layer into a temporary field macroblock pair 421, In order to perform either the base intra prediction of the current field macroblock pair 413 (when both frame macroblocks 410 are intra mode) or its residual prediction in the manner described below (when both frame macroblocks 410 are inter mode hour). When both macroblocks in the corresponding macroblock pair 410 are intra-mode, the temporary field macroblock pair 421 is deblocked (i.e., performed Filtered data for deblocking). In the following description of various embodiments, the same is true for temporal macroblock pairs derived from macroblocks of the base layer used for texture prediction.

然而，在这两个宏块中仅有一个是间模式时不执行层间纹理预测。用于层间纹理预测的基层的宏块对410在宏块是内模式的情况下具有未经编码的原始图像数据(或经解码的图像数据)，而在宏块是间模式的情况下具有经编码的残差数据(或经解码的残差数据)。在以下对纹理预测的描述中对于基层的宏块对同样如此。However, inter-layer texture prediction is not performed when only one of the two macroblocks is inter mode. A macroblock pair 410 of a base layer for inter-layer texture prediction has uncoded raw image data (or decoded image data) if the macroblock is intra mode, and has uncoded raw image data (or decoded image data) if the macroblock is inter mode Encoded residual data (or decoded residual data). The same is true for macroblock pairs of the base layer in the following description of texture prediction.

图4h示出用于将帧宏块对转换成要用于层间纹理预测的场宏块对的方法。如图所示，顺序地选择一对帧宏块A和B的偶行以构造顶场宏块A′，并且顺序地选择该对帧宏块A和B的奇行以构造底场宏块B′。当用行来填充一个场宏块时，它首先以顶块A的偶(或奇)行(A_偶或A_奇)填充，然后以底块B的奇(或偶)行(B_偶或B_奇)来填充。Figure 4h shows a method for converting frame macroblock pairs into field macroblock pairs to be used for inter-layer texture prediction. As shown, the even rows of a pair of frame macroblocks A and B are sequentially selected to construct the top field macroblock A', and the odd rows of the pair of frame macroblocks A and B are sequentially selected to construct the bottom field macroblock B' . When a field macroblock is filled with rows, it is first filled with the even (or odd) row (A_even or A_odd) of the top block A, and then with the odd (or even) row (B_even) of the bottom block B (B_ even or B_odd) to pad.

II.帧MB->场画而中的场MB的情况II. The case of frame MB->field frame and field MB

在这种情况下，当前层中的宏块是被编码成场画面中的场宏块，并且要用于当前层的宏块的层间预测的基层中的宏块是被编码成帧宏块。基层中的宏块对中所包括的视频信号成分与当前层中的偶场或奇场中同位的宏块中所包括的视频信号成分相同。首先，层间运动预测描述如下。In this case, macroblocks in the current layer are coded as field macroblocks in field pictures, and macroblocks in the base layer to be used for inter-layer prediction of macroblocks in the current layer are coded as frame macroblocks . The video signal components included in the pair of macroblocks in the base layer are the same as those included in the co-located macroblocks in the even field or odd field in the current layer. First, inter-layer motion prediction is described as follows.

EL编码器20使用通过将基层的宏块对归并成单个宏块(通过在垂直方向上压缩至一半大小)获得的宏块划分模式作为虚拟基层的偶或奇宏块的划分模式。图6a示出该过程的详细示例。如图所示，首先将基层的相应宏块对610归并成单个宏块611(S61)，且将通过该归并获得的划分模式应用于要用于当前宏块613的层间运动预测的虚拟基层的宏块(S62)。归并规则与先前的情况I中的相同。在相应的宏块对610中至少有一个宏块以内模式编码时的处理方法与先前的情况I中的相同。The EL encoder 20 uses a macroblock division mode obtained by merging pairs of macroblocks of the base layer into a single macroblock (by compressing to half the size in the vertical direction) as a division mode of even or odd macroblocks of the virtual base layer. Figure 6a shows a detailed example of this process. As shown in the figure, the corresponding macroblock pairs 610 of the base layer are first merged into a single macroblock 611 (S61), and the division mode obtained by this merging is applied to the virtual base layer to be used for the inter-layer motion prediction of the current macroblock 613 macroblocks (S62). The merge rules are the same as in case I earlier. The processing method when at least one macroblock in the corresponding macroblock pair 610 is coded in the intra mode is the same as that in the previous case I.

用于推导参考索引和运动向量的程序也以与上面在先前的情况I中描述的方式相同方式执行。在情况I中，将相同的推导程序应用于顶和底宏块，因为偶和奇宏块对被携带在一个帧中。然而，本情况II与情况I不同之处在于将推导程序仅应用于一个场宏块，如图6b和6c所示，因为在要编解码的当前场画面中仅存在一个对应于基层宏块对610的宏块。The procedure for deriving reference indices and motion vectors is also performed in the same manner as described above in the previous case I. In case I, the same derivation procedure is applied to top and bottom macroblocks, since pairs of even and odd macroblocks are carried in one frame. However, this case II differs from case I in that the derivation procedure is only applied to one field macroblock, as shown in Figures 6b and 6c, because there is only one macroblock pair corresponding to the base layer macroblock 610 macroblocks.

在以上的实施例中，为了预测虚拟基层的宏块的运动信息，EL编码器20基于基层的相应宏块对的运动信息顺序地推导该宏块的划分模式、参考索引、和运动向量。In the above embodiments, in order to predict the motion information of a macroblock of a virtual base layer, the EL encoder 20 sequentially derives the division mode, reference index, and motion vector of the macroblock based on the motion information of the corresponding macroblock pair of the base layer.

在本发明的另一个实施例中，EL编码器20首先基于基层的相应宏块对的运动信息推导虚拟基层的宏块的参考索引和运动向量，然后，基于所推导出的值最终确定虚拟基层的宏块的块模式。图7a和7b示意性示出虚拟基层的场宏块的参考索引和运动向量的推导。在这种情况下用于推导的操作类似于参考图5a和5b描述的情况I中的操作，区别在于顶或底宏块的运动信息是利用基层的宏块对的运动信息来推导的。In another embodiment of the present invention, the EL encoder 20 first derives the reference index and the motion vector of the macroblocks of the virtual base layer based on the motion information of the corresponding macroblock pairs of the base layer, and then finally determines the virtual base layer based on the derived values The block mode of the macroblock. Figures 7a and 7b schematically illustrate the derivation of reference indices and motion vectors for field macroblocks of a virtual base layer. The operation for derivation in this case is similar to that in case I described with reference to Figures 5a and 5b, with the difference that the motion information for the top or bottom macroblock is derived using the motion information for the macroblock pair of the base layer.

当划分模式被最终确定时，具有相同的推导出的运动向量和参考索引的4 x 4块单元被组合，且如果组合后的块模式是允许的划分模式，则将划分模式设置成此组合后的模式，否则将划分模式设置成组合前的模式。When the partition mode is finalized, 4 x 4 block units with the same derived motion vector and reference index are combined, and if the combined block mode is an allowed partition mode, the partition mode is set to this combined mode, otherwise set the division mode to the mode before combination.

在上述的实施例中，如果基层的相应宏块对中的两个宏块都是内模式，则不执行运动预测，也不构造虚拟基层的宏块对的运动信息，而如果这两个宏块中仅有一个是内模式，则在该情况下如先前描述地执行运动预测。In the above-mentioned embodiments, if two macroblocks in the corresponding macroblock pair of the base layer are intra-mode, motion prediction is not performed, and motion information of the macroblock pair of the virtual base layer is not constructed, and if the two macroblocks Only one of the blocks is intra mode, in which case motion prediction is performed as previously described.

现在将描述层间纹理预测。图6d示出在“帧MB->场画面中的场MB”的情况下的示例层间纹理预测方法。EL编码器20标识出基层的相应宏块对610的块模式。如果该宏块对中的两个宏块或者都是内模式或者都是间模式，则EL编码器20由单对帧宏块610构造临时场宏块621。如果当前宏块613属于偶场画面，则EL编码器20由相应宏块对610的偶行构造临时场宏块621。如果当前宏块613属于奇场画面，则EL编码器20由相应宏块对610的奇行构造临时场宏块621。构造方法类似于图4h中构造单个场宏块A′或B′的方法。Inter-layer texture prediction will now be described. Fig. 6d shows an example inter-layer texture prediction method in the case of "frame MB -> field MB in field picture". EL encoder 20 identifies the block mode of the corresponding macroblock pair 610 of the base layer. The EL encoder 20 constructs a temporary field macroblock 621 from a single pair of frame macroblocks 610 if both macroblocks in the macroblock pair are either both intra-mode or both inter-mode. If the current macroblock 613 belongs to an even field picture, the EL encoder 20 constructs a temporary field macroblock 621 from the even row of the corresponding macroblock pair 610 . If the current macroblock 613 belongs to an odd field picture, the EL encoder 20 constructs a temporary field macroblock 621 from the odd row of the corresponding macroblock pair 610 . The method of construction is similar to the method of constructing a single field macroblock A' or B' in Fig. 4h.

一旦临时场宏块621被构造出来，EL编码器20就基于场宏块621中的纹理信息执行当前场宏块613的基内预测(当相应的宏块对610中的两个宏块都是内模式时)，或执行其残差预测(当相应的宏块对610中的两个宏块都是间模式时)。Once the temporary field macroblock 621 is constructed, the EL encoder 20 performs intra-base prediction of the current field macroblock 613 based on the texture information in the field macroblock 621 (when both macroblocks in the corresponding macroblock pair 610 are Intra mode), or perform its residual prediction (when both macroblocks in the corresponding macroblock pair 610 are inter mode).

如果相应的宏块对610中只有一个宏块是间模式，则EL编码器20不执行层间纹理预测。If only one macroblock in the corresponding macroblock pair 610 is inter mode, the EL encoder 20 does not perform inter-layer texture prediction.

III.MBAFF帧中的MB->帧MB的情况III. Case of MB in MBAFF frame -> frame MB

在这种情况下，当前层中的宏块是被编码成帧宏块，且要用于当前层的帧宏块的层间预测的基层中的宏块是被编码成MBAFF帧中的场宏块。基层中的场宏块中所包括的视频信号成分与当前层中的一对同位的宏块中所包括的视频信号成分是相同的。首先，层间运动预测描述如下。In this case, macroblocks in the current layer are coded as frame macroblocks, and macroblocks in the base layer to be used for inter-layer prediction of frame macroblocks in the current layer are coded as field macroblocks in MBAFF frames piece. The video signal components included in a field macroblock in the base layer are the same as those included in a pair of co-located macroblocks in the current layer. First, inter-layer motion prediction is described as follows.

EL编码器20使用通过扩展基层宏块对的顶或底宏块(在垂直方向上扩展到两倍)获得的宏块划分模式作为虚拟基层中的宏块对的划分模式。图8a示出该过程的详细例子。尽管在以下的描述和附图中是顶场宏块被选择，但在底场宏块被选择时下面所描述的同样适用。The EL encoder 20 uses a macroblock division pattern obtained by extending the top or bottom macroblock of a base layer macroblock pair (expanded to double in the vertical direction) as a division pattern of a macroblock pair in a virtual base layer. Figure 8a shows a detailed example of this process. Although the top field macroblock is selected in the following description and drawings, the same applies when the bottom field macroblock is selected.

如图8a所示，将基层的相应宏块对810的顶场宏块扩展到两倍以构造出两个宏块811(S81)，并将通过扩展获得的划分模式应用于虚拟基层的宏块对812(S82)。As shown in Figure 8a, the top field macroblocks of the corresponding macroblock pair 810 of the base layer are expanded twice to construct two macroblocks 811 (S81), and the division mode obtained by the expansion is applied to the macroblocks of the virtual base layer Pair 812 (S82).

然而，当相应场宏块在垂直方向上被扩展到两倍时，可能会生成在宏块划分模式中不允许的划分模式(或图案)。为了防止这种情况，EL编码器20按以下规则根据经扩展的划分模式来确定划分模式。However, when the macroblock of the corresponding field is expanded to double in the vertical direction, a division mode (or pattern) not allowed in the macroblock division mode may be generated. In order to prevent this, the EL encoder 20 determines the division pattern from the extended division pattern according to the following rule.

1)基层的4 x 4、8 x 4、和16 x 8块在扩展后被确定为通过将其在垂直方向上放大到两倍获得的4 x 8、8 x 8和16 x 16块。1) The 4 x 4, 8 x 4, and 16 x 8 blocks of the base layer after expansion are determined as 4 x 8, 8 x 8, and 16 x 16 blocks obtained by upscaling them to two times in the vertical direction.

2)基层的4 x 8、8 x 8、和16 x 16块在扩展后各自被确定为相同大小的顶和底两块。如图8a所示，基层的8 x 8块B8_0被确定为两个8 x 8块(801)。8 x 8块B8_0在扩展后未被设置成8 x 16块的原因是其左侧或右侧上毗邻的经扩展块可能不是8 x 16划分块，且在这种情况下没有哪种宏块划分模式得到支持。2) The 4 x 8, 8 x 8, and 16 x 16 blocks of the base layer are each determined to be the top and bottom two blocks of the same size after expansion. As shown in Figure 8a, the 8x8 block B8_0 of the base layer is determined as two 8x8 blocks (801). The reason why 8 x 8 block B8_0 is not set to 8 x 16 block after expansion is that the adjacent expanded blocks on the left or right side of it may not be 8 x 16 partition blocks, and in this case there is no kind of macroblock Partition mode is supported.

如果相应宏块对810中有一个宏块是以内模式编码的，则EL编码器20不是选择内模式的而是选择间模式的顶或底场宏块，并对其执行以上的扩展过程以确定虚拟基层中的宏块对812的划分模式。If one macroblock in the corresponding macroblock pair 810 is coded in intra mode, EL encoder 20 selects not intra mode but the top or bottom field macroblock in inter mode, and performs the above expansion process on it to determine The partition mode of macroblock pairs 812 in the virtual base layer.

如果相应的宏块对810中的两个宏块都是内模式，则EL编码器20只执行层间纹理预测，而不执行通过以上扩展过程进行的划分模式确定和以下描述的参考索引和运动向量推导过程。If both macroblocks in the corresponding macroblock pair 810 are intra-mode, the EL encoder 20 only performs inter-layer texture prediction, and does not perform the division mode determination through the above extension process and the reference index and motion described below Vector derivation process.

为了从相应场宏块的参考索引推导虚拟基层的宏块对的参考索引，EL编码器20将基层的相应8 x 8块B8_0的参考索引确定为该顶和底两个8 x 8块中的每一个的参考索引，如图8b所示，并将所确定的每一个8 x 8块的参考索引除以2以获得其最终的参考索引。作该除法的原因是为了能应用于帧序列，需要将画面数减少一半，因为场宏块的参考画面数目是基于分成偶和奇场的画面而设置的。In order to derive the reference index of the macroblock pair of the virtual base layer from the reference index of the corresponding field macroblock, the EL encoder 20 determines the reference index of the corresponding 8 x 8 block B8_0 of the base layer as the reference index of the top and bottom two 8 x 8 blocks The reference index of each is shown in Figure 8b, and the determined reference index of each 8 x 8 block is divided by 2 to obtain its final reference index. The reason for this division is that in order to be applicable to frame sequences, the number of pictures needs to be reduced by half, because the number of reference pictures for field macroblocks is set based on pictures divided into even and odd fields.

当推导虚拟基层的帧宏块对812的运动向量时，EL编码器20将基层的相应4 x 4块的运动向量确定为虚拟基层的宏块对812中的4 x 8块的运动向量，如图8c所示，并将所确定的运动向量在其垂直分量乘以2之后用作最终运动向量。作该乘法的原因是一个场宏块中所包括的图像成分对应于两个帧宏块的图像成分，因而使得帧图像的大小在垂直方向上增加到两倍。When deriving the motion vector of the frame macroblock pair 812 of the virtual base layer, the EL encoder 20 determines the motion vector of the corresponding 4x4 block of the base layer as the motion vector of the 4x8 block in the macroblock pair 812 of the virtual base layer, e.g. 8c, and use the determined motion vector after multiplying its vertical component by 2 as the final motion vector. The reason for this multiplication is that the image components included in one field macroblock correspond to the image components of two frame macroblocks, thereby doubling the size of the frame image in the vertical direction.

在上述的实施例中，为了预测虚拟基层的宏块对的运动信息，EL编码器20基于基层的相应场宏块的运动信息顺序地推导该宏块的划分模式、参考索引、和运动向量。In the above-described embodiments, in order to predict the motion information of a macroblock pair of a virtual base layer, the EL encoder 20 sequentially derives the division mode, reference index, and motion vector of the macroblock based on the motion information of the corresponding field macroblock of the base layer.

在本发明的另一个实施例中，当推导要用于当前宏块对的层间预测的虚拟基层的宏块对的运动信息时，EL编码器20首先基于基层的相应场宏块的运动信息获得虚拟基层的宏块对的参考索引和运动向量，然后基于所获得的值最终确定虚拟基层的宏块对中的每一个宏块的块模式，如图9a所示。当划分模式被最终确定时，具有相同的推导出的运动向量和参考索引的4 x 4块单元被组合，且如果组合后的块模式是允许的划分模式，则将划分模式设置成此组合后的模式，否则将划分模式设置成组合前的模式。In another embodiment of the present invention, when deriving the motion information of a macroblock pair of a virtual base layer to be used for inter-layer prediction of a current macroblock pair, the EL encoder 20 first bases the motion information of the corresponding field macroblock of the base layer on the basis of The reference index and the motion vector of the macroblock pair of the virtual base layer are obtained, and then the block mode of each macroblock in the macroblock pair of the virtual base layer is finally determined based on the obtained values, as shown in FIG. 9 a . When the partition mode is finalized, 4 x 4 block units with the same derived motion vector and reference index are combined, and if the combined block mode is an allowed partition mode, the partition mode is set to this combined mode, otherwise set the division mode to the mode before combination.

以下是图9a的实施例的更详细的描述。如图所示，基层的间模式场宏块被选择，并使用所选的宏块的运动向量和参考索引来推导要用于当前宏块对的运动预测的虚拟基层的帧宏块对的参考索引和运动向量。如果这两个宏块都是间模式，则顶和底宏块中任意性的一个被选择(901或902)，并使用所选宏块的运动向量和参考索引信息。如图所示，为了推导参考索引，所选宏块的顶8 x 8块的相应值被拷贝至虚拟基层的顶宏块的顶和底8 x 8块的参考索引，且所选宏块的底8 x 8块的相应值被拷贝至虚拟基层的底宏块的顶和底8 x 8块的参考索引。如图所示，为了推导运动向量，所选宏块的每一个4 x 4块的相应值被共用作虚拟基层的宏块对中相应的一对垂直毗邻的4 x 4块的运动向量。在本发明的另一个实施例中，基层的相应宏块对的运动信息可被混合并用于推导虚拟基层的帧宏块对的运动向量和参考索引，这与图9a所示的实施例不同。图9b示出根据该实施例的用于推导运动向量和参考索引的程序。虚拟基层的宏块对中的子块(8 x 8块和4 x 4块)的参考索引和运动向量的拷贝关联的详细描述在这里省略，因为其可从上述的运动信息推导程序的描述和图9b的插图中直观理解。A more detailed description of the embodiment of Figure 9a follows. As shown, the inter-mode field macroblock of the base layer is selected, and the motion vector and reference index of the selected macroblock are used to derive the reference of the frame macroblock pair of the virtual base layer to be used for motion prediction of the current macroblock pair index and motion vector. If both macroblocks are inter mode, any one of the top and bottom macroblocks is selected (901 or 902), and the motion vector and reference index information of the selected macroblock is used. As shown, to derive the reference index, the corresponding values of the top 8 x 8 block of the selected macroblock are copied to the reference indices of the top and bottom 8 x 8 blocks of the top macroblock of the virtual base layer, and the selected macroblock's The corresponding values of the bottom 8x8 block are copied to the reference indices of the top and bottom 8x8 blocks of the bottom macroblock of the virtual base layer. As shown, to derive motion vectors, the corresponding values of each 4 x 4 block of the selected macroblocks are shared as the motion vectors for a corresponding pair of vertically adjacent 4 x 4 blocks in the macroblock pair of the virtual base layer. In another embodiment of the present invention, the motion information of the corresponding macroblock pairs of the base layer can be mixed and used to derive the motion vector and reference index of the frame macroblock pairs of the virtual base layer, which is different from the embodiment shown in Fig. 9a. Fig. 9b shows the procedure for deriving motion vectors and reference indices according to this embodiment. A detailed description of the copy association of the reference index of the sub-block (8 x 8 block and 4 x 4 block) in the macroblock pair of the virtual base layer and the motion vector is omitted here, because it can be derived from the above-mentioned description of the motion information derivation procedure and Intuitive understanding in the inset of Figure 9b.

然而，因为在图9b的实施例中基层的场宏块对中的两个宏块的运动信息都被使用，所以如果基层的场宏块对中有一个宏块是内模式，则利用作为间模式宏块的另一个宏块的运动信息推导内模式宏块的运动信息。具体地，可在如图4b所示地通过将间模式宏块的相应信息拷贝至内模式宏块来构造内模式宏块的运动向量和参考索引之后，或在如图4c所示将内模式宏块视为具有0运动向量和0参考索引的间模式宏块之后，或在如图4d所示通过将间模式宏块的参考索引拷贝至内模式宏块来设置内模式宏块的参考索引并将其运动向量设置为0之后，如图9b所示来推导虚拟基层的宏块对的运动向量和参考索引信息。一旦推导出虚拟基层的宏块对的运动向量和参考索引信息，就如先前所述地基于所推导出的信息来确定宏块对的块模式。However, since the motion information of both macroblocks in the field macroblock pair of the base layer is used in the embodiment of FIG. The motion information of an intra-mode macroblock is derived from the motion information of another macroblock of the mode macroblock. Specifically, after constructing the motion vector and reference index of the intra-mode macroblock by copying the corresponding information of the inter-mode macroblock to the intra-mode macroblock as shown in Figure 4b, or after the intra-mode After the macroblock is considered as an inter-mode macroblock with 0 motion vector and 0 reference index, or after the reference index of the intra-mode macroblock is set by copying the reference index of the inter-mode macroblock to the intra-mode macroblock as shown in Figure 4d After the motion vector is set to 0, the motion vector and reference index information of the macroblock pair of the virtual base layer are derived as shown in FIG. 9b. Once the motion vector and reference index information of the macroblock pair of the virtual base layer is derived, the block mode of the macroblock pair is determined based on the derived information as previously described.

另一方面，如果基层的相应场宏块对中的两个宏块都是内模式，则不执行运动预测。On the other hand, if both macroblocks in the corresponding field macroblock pair of the base layer are intra-mode, no motion prediction is performed.

现在将描述层间纹理预测。图8d示出在“MBAFF帧中的场MB->帧MB”的情况下的示例层间纹理预测方法。EL编码器20标识出基层的相应场宏块对810的块模式。如果相应的帧宏块对810中的两个宏块或者都是内模式或者都是间模式，则EL编码器20将基层的相应场宏块对810转换成临时的帧宏块对821，以便或者执行当前帧宏块对813的基内预测(当这两个帧宏块810都是内模式时)或者以下面描述的方式执行其残差预测(当这两个帧宏块810都是间模式时)。当相应宏块对810中的两个宏块都是内模式时，宏块对810包括已被解码的数据，并如先前所述地将解块滤波器应用于帧宏块对821。图8e示出用于将场宏块对转换成帧宏块对的方法。如图所示，一对场宏块A和B的行从每一个宏块的顶部开始顺序地被交替选择(A->B->A->B->A->，...)，然后从顶部开始按所选次序顺序地排列以构造一对帧宏块A′和B′。因为是以此方式重新编排场宏块对的行，所以顶帧宏块A′是由该对场宏块A和B的上半部分的行构造的，而底帧宏块B′是由下半部分的行构造的。Inter-layer texture prediction will now be described. Fig. 8d shows an example inter-layer texture prediction method in the case of "field MB in MBAFF frame -> frame MB". EL encoder 20 identifies the block mode for the corresponding field macroblock pair 810 of the base layer. If both macroblocks in the corresponding frame macroblock pair 810 are either intra-mode or both inter-mode, the EL encoder 20 converts the corresponding field macroblock pair 810 of the base layer into a temporary frame macroblock pair 821 so that Either perform base intra prediction of the current frame macroblock pair 813 (when both frame macroblocks 810 are intra mode) or perform its residual prediction in the manner described below (when both frame macroblocks 810 are inter mode). When both macroblocks in a corresponding macroblock pair 810 are intra-mode, the macroblock pair 810 includes data that has already been decoded, and a deblocking filter is applied to the frame macroblock pair 821 as previously described. Figure 8e shows a method for converting field macroblock pairs into frame macroblock pairs. As shown, the rows of a pair of field macroblocks A and B are alternately selected sequentially from the top of each macroblock (A->B->A->B->A->, . . . ), They are then arranged sequentially from the top in the selected order to construct a pair of frame macroblocks A' and B'. Because the rows of a field pair of macroblocks are rearranged in this way, the top frame macroblock A' is constructed from the upper half rows of the pair of field macroblocks A and B, while the bottom frame macroblock B' is constructed from the lower half of the rows are constructed.

另一方面，如果基层的相应场宏块对810中只有一个宏块是间模式，则根据当前的帧宏块对813的块模式从基层的宏块对810中选择一个块，并将所选块用于层间纹理预测。或者，在确定当前的帧宏块对813的块模式前，可先应用以下描述的每一种方法来执行层间预测，然后可确定宏块对813的块模式。On the other hand, if only one macroblock in the corresponding field macroblock pair 810 of the base layer is inter mode, a block is selected from the macroblock pair 810 of the base layer according to the block mode of the current frame macroblock pair 813, and the selected Blocks are used for inter-layer texture prediction. Alternatively, before determining the block mode of the macroblock pair 813 of the current frame, each method described below may be applied to perform inter-layer prediction, and then the block mode of the macroblock pair 813 may be determined.

图8f和8g示出其中选择一个块以执行层间预测的示例。在当前的帧宏块对813是以间模式编码(或者在执行其间模式预测)的情况下，如图8f所示，从基层的场宏块对810中选择间模式块810a，且所选的块在垂直方向上被升采样以创建两个相应宏块831。然后将这两个宏块831用于当前的帧宏块对813的残差预测。在当前的帧宏块对813不是以间模式编码(或者在执行其内模式预测)的情况下，如图8g所示，从基层的场宏块对810中选择内模式块810b，且所选的块在垂直方向上被升采样以创建两个相应宏块841。在将解块滤波器应用于这两个宏块841之后，将这两个宏块841用于当前帧宏块对813的基内预测。Figures 8f and 8g show examples where one block is selected to perform inter-layer prediction. In the case that the current frame macroblock pair 813 is coded in inter mode (or is performing inter mode prediction), as shown in FIG. Blocks are up-sampled in the vertical direction to create two corresponding macroblocks 831 . These two macroblocks 831 are then used for residual prediction of the current frame macroblock pair 813 . In the case that the current frame macroblock pair 813 is not coded in inter mode (or is performing its intra mode prediction), as shown in Figure 8g, the intra mode block 810b is selected from the field macroblock pair 810 of the base layer, and The block of is vertically up-sampled to create two corresponding macroblocks 841. These two macroblocks 841 are used for intra-base prediction of the macroblock pair 813 of the current frame after applying a deblocking filter to the two macroblocks 841 .

图8f和8g中所示的其中一个块被选择并升采样以创建要用于层间纹理预测的宏块对的方法在各层具有不同的画面率时也能适用。当增强层的画面率高于基层的画面率时，增强层的画面序列中的某些画面可能在基层中没有时间上相对应的画面。在基层中没有时间上相对应的画面的增强层画面中所包括的帧宏块对的层间纹理预测可利用基层中时间上在前的画面中的一对空间上同位的场宏块中的一个宏块来执行。The method shown in Figures 8f and 8g where one block is selected and up-sampled to create pairs of macroblocks to be used for inter-layer texture prediction can also be applied when the layers have different picture rates. When the picture rate of the enhancement layer is higher than that of the base layer, some pictures in the picture sequence of the enhancement layer may not have corresponding pictures in time in the base layer. Inter-layer texture prediction of a frame macroblock pair included in an enhancement layer picture that has no temporally corresponding picture in the base layer may utilize A macroblock to execute.

图8h是增强层的画面率是基层画面率的两倍的情况下该方法的例子。Figure 8h is an example of this approach in the case where the frame rate of the enhancement layer is twice the frame rate of the base layer.

如图所示，增强层的画面率是基层的画面率的两倍。因此，增强层的每两个画面中有一个——诸如画面次序计数(POC)为“n2”的画面——在基层中没有画面顺序计数(POC)相同的画面。这里，相同的POC指示时间上的一致性。As shown, the frame rate of the enhancement layer is twice that of the base layer. Thus, for every two pictures in the enhancement layer - such as a picture with a picture order count (POC) of "n2" - there is no picture in the base layer with the same picture order count (POC). Here, the same POC indicates temporal consistency.

当基层中没有时间上一致的画面时(例如，在当前POC是n2时)，先前画面(即，POC比当前POC低1的画面)中的一对空间上同位的场宏块中所包括的底场宏块802被垂直升采样以创建临时宏块对852(S82)，然后使用此临时宏块对852来执行当前宏块对815的层间纹理预测。当基层中有时间上一致的画面时(例如，在当前POC是n1时)，此时间上一致的画面中的一对空间上同位的场宏块中所包括的顶场宏块801被垂直升采样以创建临时宏块对851(S82)，然后使用此临时宏块对851来执行当前宏块对814的层间纹理预测。当通过升采样创建的临时宏块对851或852中包括从内模式宏块解码的宏块对时，在对该宏块对应用解块滤波器之后将该宏块对用于层间纹理预测。When there is no temporally consistent picture in the base layer (for example, when the current POC is n2), a pair of spatially coherent field macroblocks included in a previous picture (that is, a picture with a POC lower than the current POC by 1) The bottom field macroblock 802 is vertically up-sampled to create a temporary macroblock pair 852 ( S82 ), which is then used to perform inter-layer texture prediction of the current macroblock pair 815 . When there is a temporally consistent picture in the base layer (for example, when the current POC is n1), the top field macroblock 801 included in a pair of spatially co-located field macroblocks in this temporally consistent picture is vertically upgraded Sampling to create a temporary macroblock pair 851 (S82), and then use this temporary macroblock pair 851 to perform inter-layer texture prediction of the current macroblock pair 814. When a macroblock pair decoded from an intra-mode macroblock is included in the temporary macroblock pair 851 or 852 created by upsampling, the macroblock pair is used for inter-layer texture prediction after applying a deblocking filter to the macroblock pair .

在本发明的另一个实施例中，当基层中有时间上一致的画面时(当图8h的示例中的当前POC是n1时)，帧宏块对不是使用图8h所示的方法而是可以根据图8d所示的实施例由场宏块对来创建，然后可将其用于层间纹理预测。此外，在当前画面在基层中没有时间上一致的画面时(当图8h的示例中的当前POC是n2时)，层间纹理预测可如图8h地来执行，或者可以不对当前画面中的宏块执行层间纹理预测。In another embodiment of the present invention, when there are temporally consistent pictures in the base layer (when the current POC in the example of Fig. 8h is n1), instead of using the method shown in Fig. 8h, the frame macroblock pair can be The embodiment shown in Fig. 8d is created from pairs of field macroblocks, which can then be used for inter-layer texture prediction. Furthermore, when the current picture has no temporally consistent picture in the base layer (when the current POC is n2 in the example of FIG. 8h ), inter-layer texture prediction may be performed as in FIG. block performs inter-layer texture prediction.

相应地，本发明的实施例分配标志‘field_base_flag(场基标志)’以指示层间纹理预测是根据图8d所示的方法执行还是根据图8h所示的方法执行的，并将此标志纳入在编码信息中。例如，在纹理预测是已根据如图8d的方法执行时将该标志设置为‘0’，而当纹理预测是已根据如图8h的方法执行时将该标志设置为‘1’。该标志被定义在要向解码器传送的增强层中的序列参数集、可升降级扩展中的序列参数、画面参数集、可升降级扩展中的画面参数集、切片头部、可升降级扩展中的切片头部、宏块层、或可升降级扩展中的宏块层中。Accordingly, an embodiment of the present invention assigns a flag 'field_base_flag' to indicate whether the inter-layer texture prediction is performed according to the method shown in FIG. 8d or according to the method shown in FIG. 8h, and includes this flag in coded information. For example, the flag is set to '0' when the texture prediction has been performed according to the method shown in Figure 8d, and set to '1' when the texture prediction has been performed according to the method shown in Figure 8h. This flag is defined in the sequence parameter set in the enhancement layer, the sequence parameter in the degradable extension, the picture parameter set, the picture parameter set in the degradable extension, the slice header, the degradable extension to be transmitted to the decoder In the slice header, in the macroblock layer, or in the macroblock layer in the scalable extension.

IV.场画面中的场MB->帧MB的情况IV. Case of Field MB -> Frame MB in Field Picture

在这种情况下，当前层(EL)中的宏块是被编码成帧宏块，而要用于当前层的帧宏块的层间预测的基层(BL)中的宏块是被编码成场画面中的场宏块。基层中的场宏块中所包括的视频信号成分与包括在当前层中一对同位的宏块中的视频信号成分是相同的。首先，层间运动预测描述如下。In this case, the macroblocks in the current layer (EL) are coded as frame macroblocks, and the macroblocks in the base layer (BL) to be used for inter-layer prediction of frame macroblocks in the current layer are coded as A field macroblock in a field picture. The video signal components included in a field macroblock in the base layer are the same as those included in a pair of co-located macroblocks in the current layer. First, inter-layer motion prediction is described as follows.

EL编码器20使用通过扩展基层的偶或奇场中的宏块(在垂直方向上扩展到两倍)获得的划分模式作为虚拟基层中的宏块的划分模式。图10a示出该过程的详细例子。图10a所示的程序与其中MBAFF帧中的顶或底场宏块被选择的情况III的程序的不同之处在于很自然地使用偶或奇场中的空间上同位的场宏块1010，而其与情况III的程序的类似之处在于同位的场宏块1010被扩展且通过扩展获得的两个宏块的划分模式被应用于虚拟基层的宏块对1012。当相应场宏块1010在垂直方向上被扩展到两倍时，可能会生成在宏块划分模式中不允许的划分模式(或图案)。为了防止该情况，EL编码器20按与在情况III中建议的规则1)和2)相同的规则根据经扩展的划分模式来确定划分模式。The EL encoder 20 uses, as a division mode of a macroblock in a virtual base layer, a division pattern obtained by extending a macroblock in an even or odd field of the base layer (expanding to double in the vertical direction). Figure 10a shows a detailed example of this process. The procedure shown in Figure 10a differs from that of case III where the top or bottom field macroblocks in the MBAFF frame are selected in that spatially co-located field macroblocks 1010 in even or odd fields are naturally used, whereas It is similar to the procedure of Case III in that the co-located field macroblock 1010 is extended and the division mode of the two macroblocks obtained by the extension is applied to the macroblock pair 1012 of the virtual base layer. When the corresponding field macroblock 1010 is expanded to double in the vertical direction, a division pattern (or pattern) that is not allowed in the macroblock division mode may be generated. In order to prevent this, the EL encoder 20 determines the division pattern from the extended division pattern by the same rules as the rules 1) and 2) suggested in Case III.

如果相应的宏块是已按内模式编码的，则EL编码器20只执行层间纹理预测，而不执行通过以上扩展过程进行的划分模式确定和以下描述的参考索引和运动向量推导过程。即，EL编码器20不执行层间运动预测。If the corresponding macroblock has been coded in the intra mode, the EL encoder 20 performs only inter-layer texture prediction without performing the division mode determination through the above extension process and the reference index and motion vector derivation process described below. That is, the EL encoder 20 does not perform inter-layer motion prediction.

参考索引和运动向量推导程序也与在前面的情况III中所描述的相类似。然而，本情况IV在以下方面不同于情况III。在情况III中，因为相应的基层宏块被携带在帧中的偶和奇宏块对中，所以顶和底宏块之一被选择并应用于推导程序。在本情况IV中，因为基层中仅存在一个对应于要编解码的当前宏块的宏块，所以虚拟基层的宏块对1012的运动信息从相应场宏块的运动信息推导，而没有如图10b和10c所示的宏块选择程序，并且推导出的运动信息被用于当前宏块对1013的层间运动预测。The reference index and motion vector derivation procedure is also similar to that described in Case III above. However, the present case IV differs from case III in the following respects. In case III, one of the top and bottom macroblocks is selected and used for the derivation procedure because the corresponding base layer macroblock is carried in the even and odd macroblock pair in the frame. In this case IV, since there is only one macroblock corresponding to the current macroblock to be coded in the base layer, the motion information of the macroblock pair 1012 of the virtual base layer is derived from the motion information of the corresponding field macroblock, without The macroblock selection procedure shown in 10b and 10c, and the derived motion information is used for the inter-layer motion prediction of the current macroblock pair 1013.

图11示意性示出根据本发明的另一个实施例的虚拟基层的宏块对的参考索引和运动向量的推导。在这种情况下，虚拟基层的宏块对的运动信息是从基层的偶或奇场宏块的运动信息推导的，这与以上参考图9a所述的情况不同。与图9a的情况相同的推导操作适用于本情况。然而，图9b所示的情况中的混合并使用宏块对的运动信息的过程在本情况IV中不适用，因为在基层中没有相应场中的顶和底宏块配对。Fig. 11 schematically shows the derivation of reference indices and motion vectors of macroblock pairs of a virtual base layer according to another embodiment of the present invention. In this case, the motion information of the macroblock pairs of the virtual base layer is derived from the motion information of the even or odd field macroblocks of the base layer, unlike the case described above with reference to Fig. 9a. The same derivation operation as in the case of Fig. 9a applies to this case. However, the process of mixing and using the motion information of pairs of macroblocks in the case shown in Fig. 9b does not apply in this case IV because there is no pair of top and bottom macroblocks in the corresponding field in the base layer.

在参考图10a至10c描述的实施例中，为了预测虚拟基层的宏块对的运动信息，EL编码器20基于基层的相应场宏块的运动信息顺序地推导划分模式、参考索引、和运动向量。然而，在图11的另一个实施例中，EL编码器20首先基于基层的相应宏块对的运动信息推导虚拟基层的宏块对的参考索引和运动向量，然后基于所推导出的值最终确定虚拟基层的宏块对的划分模式。当划分模式被确定时，具有相同的推导出的运动向量和参考索引的4x4块单元被组合，且如果组合后的块模式是允许的划分模式，则将划分模式设置成此组合后的模式，否则将划分模式设置成组合前的模式。In the embodiment described with reference to FIGS. 10a to 10c, in order to predict the motion information of a macroblock pair of a virtual base layer, the EL encoder 20 sequentially derives the division mode, reference index, and motion vector based on the motion information of the corresponding field macroblock of the base layer. . However, in another embodiment of FIG. 11, the EL encoder 20 first derives the reference index and the motion vector of the macroblock pair of the virtual base layer based on the motion information of the corresponding macroblock pair of the base layer, and then finally determines The division mode of the macroblock pair of the virtual base layer. When the partition mode is determined, the 4x4 block units with the same derived motion vector and reference index are combined, and if the combined block mode is an allowed partition mode, the partition mode is set to this combined mode, Otherwise, set the division mode to the mode before combining.

当在上述的实施例中执行纹理预测时，如果基层的相应场宏块是内模式，则对当前宏块执行基内预测编解码。如果相应场宏块是间模式，且如果当前宏块已以间模式编码，则执行层间残差预测编解码。这里，当然，在预测中使用的场宏块是在其在垂直方向上被升采样后用于纹理预测的。When texture prediction is performed in the above-mentioned embodiments, if the corresponding field macroblock of the base layer is an intra mode, the intra-base prediction codec is performed on the current macroblock. If the corresponding field macroblock is in inter mode, and if the current macroblock has been coded in inter mode, inter-layer residual prediction coding is performed. Here, of course, field macroblocks used in prediction are used for texture prediction after they are up-sampled in the vertical direction.

在本发明的另一个实施例中，由包括在奇或偶场中的场宏块创建虚拟宏块以构造宏块对，然后从所构造出的宏块对推导虚拟基层的宏块对的运动信息。图12a和图12b示出该实施例的例子。In another embodiment of the invention, virtual macroblocks are created from field macroblocks included in odd or even fields to construct macroblock pairs, and then the motion of the macroblock pairs of the virtual base layer is derived from the constructed macroblock pairs. information. Figures 12a and 12b show examples of this embodiment.

在该实施例中，基层的相应偶(或奇)场宏块的参考索引和运动向量被拷贝(1201和1202)以创建虚拟奇(或偶)场宏块来构造宏块对1211，且所构造出的宏块对1211的运动信息被混合以推导虚拟基层的宏块对1212的运动信息(1203和1204)。在混合并使用运动信息的示例方法中，如图12a和12b所示，相应顶宏块的顶8 x 8块的参考索引被应用于虚拟基层的宏块对1212的顶宏块的顶8 x 8块，底8 x 8块的参考索引被应用于底宏块的顶8 x 8块，相应底宏块的顶8 x 8块的参考索引被应用于虚拟基层的宏块对1212的顶宏块的底8 x 8块，且底8 x 8块的参考索引被应用于底宏块的底8 x 8块(1203)。根据参考索引应用运动向量(1204)。这里省略了该过程的描述，因为它可从图12a和12b直观地理解。In this embodiment, the reference indices and motion vectors of the corresponding even (or odd) field macroblocks of the base layer are copied (1201 and 1202) to create a virtual odd (or even) field macroblock to construct a macroblock pair 1211, and the The constructed motion information of the macroblock pair 1211 is mixed to derive the motion information of the macroblock pair 1212 of the virtual base layer (1203 and 1204). In an exemplary method of mixing and using motion information, as shown in Figures 12a and 12b, the reference index of the top 8 x 8 block of the corresponding top macroblock is applied to the top 8 x 8 of the top macroblock of the macroblock pair 1212 of the virtual base layer 8 blocks, the reference index of the bottom 8 x 8 block is applied to the top 8 x 8 block of the bottom macroblock, and the reference index of the top 8 x 8 block of the corresponding bottom macroblock is applied to the top macro of the macroblock pair 1212 of the virtual base layer The bottom 8x8 block of the block, and the reference index of the bottom 8x8 block is applied to the bottom 8x8 block of the bottom macroblock (1203). Motion vectors are applied according to the reference index (1204). The description of this process is omitted here because it can be intuitively understood from Figs. 12a and 12b.

在图12a和12b所示的实施例中，虚拟基层的宏块对1212的划分模式是使用与如上所述相同的方法基于推导出的参考索引和运动向量来确定的。In the embodiment shown in Figures 12a and 12b, the partition mode of the macroblock pair 1212 of the virtual base layer is determined based on the derived reference index and motion vector using the same method as described above.

现在将描述层间纹理预测。图10b示出针对“场画面中的场MB->帧MB”的情况的示例层间纹理预测方法。EL编码器20首先升采样基层的相应场宏块1010以创建两个临时宏块1021。如果相应场宏块1010是内模式，则EL编码器20将解块滤波器应用于所创建的这两个临时宏块1021，然后基于这两个临时宏块1021执行当前帧宏块对1013的基内预测。如果相应场宏块1010是间模式，则EL编码器20基于所创建的这两个临时宏块1021执行当前帧宏块对1013的残差预测。Inter-layer texture prediction will now be described. Fig. 10b shows an example inter-layer texture prediction method for the case of "field MB in field picture -> frame MB". The EL encoder 20 first upsamples the corresponding field macroblocks 1010 of the base layer to create two temporary macroblocks 1021 . If the corresponding field macroblock 1010 is intra-mode, the EL encoder 20 applies a deblocking filter to the created two temporary macroblocks 1021, and then performs the encoding of the current frame macroblock pair 1013 based on these two temporary macroblocks 1021. base forecast. If the corresponding field macroblock 1010 is inter mode, the EL encoder 20 performs residual prediction of the current frame macroblock pair 1013 based on the created two temporary macroblocks 1021 .

V.场MB->场MB的情况V. Case of field MB -> field MB

该情况被细分成以下四种情况，因为场宏块分成包括在场画面中的场宏块和包括在MBAFF帧中的场宏块。This case is subdivided into the following four cases because field macroblocks are divided into field macroblocks included in a field picture and field macroblocks included in an MBAFF frame.

i)基层和增强层是MBAFF帧的情况i) The case where the base layer and the enhancement layer are MBAFF frames

该情况在图13a中示出。如图所示，基层的相应宏块对的运动信息(划分模式、参考索引、和运动向量)是通过将相应宏块对的运动信息直接拷贝至虚拟基层的宏块对而被用作虚拟基层的宏块对的运动信息。这里，运动信息是在有相同奇偶性的宏块之间被拷贝的。具体地，偶场宏块的运动信息被拷贝至偶场宏块，而奇场宏块的运动信息被拷贝至奇场宏块，以构造用于当前层的宏块的运动预测的虚拟层的宏块。This situation is shown in Figure 13a. As shown in the figure, the motion information (partition mode, reference index, and motion vector) of the corresponding macroblock pair of the base layer is used as a virtual base layer by directly copying the motion information of the corresponding macroblock pair to the macroblock pair of the virtual base layer The motion information of the macroblock pair. Here, motion information is copied between macroblocks with the same parity. Specifically, the motion information of the even-field macroblock is copied to the even-field macroblock, and the motion information of the odd-field macroblock is copied to the odd-field macroblock, so as to construct a virtual layer for motion prediction of the macroblock of the current layer. macroblock.

在执行纹理预测时应用已知的帧宏块之间的层间纹理预测的方法。A known method of inter-layer texture prediction between frame macroblocks is applied when performing texture prediction.

ii)基层包括场画面而增强层包括MBAFF帧的情况ii) The case where the base layer includes field pictures and the enhancement layer includes MBAFF frames

该情况在图13b中示出。如图所示，基层的相应场宏块的运动信息(划分模式、参考索引、和运动向量)是通过将相应场宏块的运动信息直接拷贝至虚拟基层的宏块对中的每一个宏块而被用作虚拟基层的宏块对中的每一个宏块的运动信息。这里，相同奇偶性拷贝规则不适用，因为单个场宏块的运动信息被用于顶和底场宏块两者。This situation is shown in Figure 13b. As shown in the figure, the motion information (partition mode, reference index, and motion vector) of the corresponding field macroblock of the base layer is obtained by directly copying the motion information of the corresponding field macroblock to each macroblock in the macroblock pair of the virtual base layer Instead, it is used as motion information for each macroblock in the macroblock pair of the virtual base layer. Here, the same parity copy rule does not apply because the motion information of a single field macroblock is used for both top and bottom field macroblocks.

当执行纹理预测时，在具有相同(偶或奇)场属性的增强层和基层宏块之间应用基内预测(当基层的对应块是内模式时)或应用残差预测(当基层的相应块是间模式时)。When performing texture prediction, apply base intra prediction (when the corresponding block of the base layer is Intra mode) or apply residual prediction (when the corresponding block of the base layer block is an inter mode).

iii)基层包括MBAFF帧而增强层包括场画面的情况iii) The case where the base layer includes MBAFF frames and the enhancement layer includes field pictures

该情况在图13c中示出。如图所示，从对应于当前场宏块的基层宏块对中选择有相同奇偶性的场宏块，并通过将所选场宏块的运动信息直接拷贝至虚拟基层的场宏块来将所选场宏块的运动信息(划分模式、参考索引、和运动向量)用作虚拟基层的场宏块的运动信息。This situation is shown in Figure 13c. As shown in the figure, the field macroblocks with the same parity are selected from the base layer macroblock pairs corresponding to the current field macroblock, and the motion information of the selected field macroblock is directly copied to the field macroblock of the virtual base layer. The motion information (division mode, reference index, and motion vector) of the selected field macroblock is used as the motion information of the field macroblock of the virtual base layer.

iv)基层和增强层是场画面的情况iv) The case where the base layer and the enhancement layer are field pictures

该情况在图13d中示出。如图所示，通过将基层的相应场宏块的运动信息直接拷贝至虚拟基层的场宏块来将基层的相应场宏块的运动信息(划分模式、参考索引、和运动向量)用作虚拟基层的场宏块的运动信息。同样在这种情况下，运动信息是在有相同奇偶性的宏块之间被拷贝的。This situation is shown in Figure 13d. As shown in the figure, the motion information (partition mode, reference index, and motion vector) of the corresponding field macroblock of the base layer is used as a virtual Motion information for the field macroblocks of the base layer. Also in this case, motion information is copied between macroblocks with the same parity.

以上层间预测的描述是针对基层和增强层具有相同分辨率的情况给出的。以下的描述将就在增强层的分辨率高于基层分辨率时(即，当SpatialScalabilityType()大于0时)如何标识出每一层的画面类型(逐行帧、MBAFF帧、还是隔行场)和/或画面中宏块的类型、以及根据标识出的类型应用层间预测方法来给出。首先描述层间运动预测。The above description of inter-layer prediction is given for the case where the base layer and the enhancement layer have the same resolution. The following description will be how to identify the picture type of each layer (progressive frame, MBAFF frame, or interlaced field) and and/or the type of macroblock in the picture, and the application of the inter-layer prediction method according to the identified type is given. Inter-layer motion prediction will be described first.

M_A).基层(逐行帧)->增强层(MBAFF帧)M_A). Base layer (progressive frame) -> enhancement layer (MBAFF frame)

图14a示出针对该情况的处理方法。如图所示，首先，基层中的相应帧的所有宏块的运动信息被拷贝以创建虚拟帧。然后执行升采样。在该升采样中，利用基层画面的纹理信息以允许该画面的分辨率(或即画面大小)与当前层的分辨率相等的内插率来执行内插。此外，通过内插被放大的画面的每一个宏块的运动信息是基于该虚拟帧的每一个宏块的运动信息来构造的。可将多种已知方法中的一种用于该构造。以此方式构造出的临时性基层的画面具有与当前(增强)层的画面相同的分辨率。相应地，在这种情况下可应用上述的层间运动预测。Fig. 14a shows the processing method for this case. As shown, first, the motion information of all macroblocks of the corresponding frame in the base layer is copied to create a virtual frame. Then perform upsampling. In this upsampling, interpolation is performed using texture information of a base layer picture at an interpolation rate that allows the resolution (or ie picture size) of the picture to be equal to the resolution of the current layer. Furthermore, the motion information of each macroblock of the picture enlarged by interpolation is constructed based on the motion information of each macroblock of the virtual frame. One of a number of known methods can be used for this construction. The pictures of the temporary base layer constructed in this way have the same resolution as the pictures of the current (enhancement) layer. Accordingly, the inter-layer motion prediction described above can be applied in this case.

在这种情况下(图14a)，基层和当前层中的画面中的宏块是帧宏块和MBAFF帧中的场宏块，因为基层包括帧而当前层包括MBAFF帧。相应地，应用上述情况I的方法来执行层间运动预测。然而，如上所述不仅场宏块对，帧宏块对也可能被包括在同一MBAFF帧中。相应地，在对应于临时性基层的画面中的宏块对的当前层宏块对的类型已被标识出为帧宏块类型而不是场宏块类型时，应用已知的在帧宏块之间的包括运动信息的简单拷贝的运动预测的方法(帧-帧预测方法)。In this case (FIG. 14a), the macroblocks in the pictures in the base layer and current layer are frame macroblocks and field macroblocks in MBAFF frames, since the base layer includes frames and the current layer includes MBAFF frames. Accordingly, the method of case I above is applied to perform inter-layer motion prediction. However, not only field macroblock pairs but also frame macroblock pairs may be included in the same MBAFF frame as described above. Correspondingly, when the type of the current layer macroblock pair corresponding to the macroblock pair in the picture of the temporary base layer has been identified as a frame macroblock type instead of a field macroblock type, the known A method of motion prediction involving a simple copy of motion information between frames (frame-by-frame prediction method).

M_B).基层(逐行帧)->增强层(隔行场)M_B). Base layer (progressive frame) -> enhancement layer (interlaced field)

图14b示出针对该情况的处理方法。如图所示，首先，基层中的相应帧的所有宏块的运动信息被拷贝以创建虚拟帧。然后执行升采样。在该升采样中，利用基层画面的纹理信息，以允许画面的分辨率与当前层的分辨率相等的内插率执行内插。此外，通过内插被放大的画面的每一个宏块的运动信息是基于所创建的虚拟帧的每一个宏块的运动信息来构造的。Fig. 14b shows the processing method for this case. As shown, first, the motion information of all macroblocks of the corresponding frame in the base layer is copied to create a virtual frame. Then perform upsampling. In this upsampling, using the texture information of the base layer picture, interpolation is performed at an interpolation rate that allows the resolution of the picture to be equal to the resolution of the current layer. Also, the motion information of each macroblock of the picture enlarged by interpolation is constructed based on the motion information of each macroblock of the created virtual frame.

应用上述情况II的方法来执行层间运动预测，因为以此方式构造的临时性基层的画面的每一个宏块均是帧宏块，而当前层的每一个宏块均是场画面中的场宏块。Apply the method of case II above to perform inter-layer motion prediction, because each macroblock of the picture of the temporary base layer constructed in this way is a frame macroblock, and each macroblock of the current layer is a field in a field picture macroblock.

M_C).基层(MBAFF帧)->增强层(逐行帧)M_C). Base layer (MBAFF frame) -> enhancement layer (progressive frame)

图14c示出针对该情况的处理方法。如图所示，首先，将基层的相应MBAFF帧变换成逐行帧。上述情况III的方法适用于将MBAFF帧的场宏块对变换成逐行帧，并且已知的帧-帧预测方法适用于MBAFF帧的帧宏块对的变换。当然，当将情况III的方法应用于本情况中时，是利用通过不需执行对预测出的数据与实际要编解码的层的数据之差进行编解码的操作的层间预测获得的数据创建虚拟帧和该帧的每一个宏块的运动信息。Fig. 14c shows the processing method for this case. As shown, first, the corresponding MBAFF frame of the base layer is converted into a progressive frame. The method of case III above is suitable for transforming field macroblock pairs of MBAFF frames into progressive frames, and known frame-frame prediction methods are suitable for transforming frame macroblock pairs of MBAFF frames. Of course, when the method of Case III is applied to this case, it is to use the data created by the inter-layer prediction that does not need to perform the operation of encoding and decoding the difference between the predicted data and the data of the layer to be actually encoded and decoded. Virtual frame and motion information for each macroblock of that frame.

一旦获得虚拟帧，就对该虚拟帧执行升采样。在该升采样中，以允许基层的分辨率与当前层的分辨率相等的内插率执行内插。此外，利用多种已知方法中的一种基于虚拟帧的每一个宏块的运动信息构造经放大画面的每一个宏块的运动信息。这里，执行已知的帧宏块-宏块层间运动预测方法，因为以此方式构造出的临时性基层的画面的每一个宏块均是帧宏块，且当前层的每一个宏块均是帧宏块。Once a virtual frame is obtained, upsampling is performed on the virtual frame. In this upsampling, interpolation is performed at an interpolation rate that allows the resolution of the base layer to be equal to the resolution of the current layer. Furthermore, the motion information of each macroblock of the enlarged picture is constructed based on the motion information of each macroblock of the virtual frame using one of several known methods. Here, the known frame macroblock-macroblock inter-layer motion prediction method is performed, because each macroblock of the picture of the temporary base layer constructed in this way is a frame macroblock, and each macroblock of the current layer is a frame macroblock. is a frame macroblock.

M_D).基层(隔行场)->增强层(逐行帧)M_D). Base layer (interlaced field) -> enhancement layer (progressive frame)

图14d示出针对该情况的一种处理方法。在这种情况下，画面的类型与该画面的宏块的类型相同。如图所示，首先，将基层的相应场变换成逐行帧。变换出的帧具有与当前层的画面相同的垂直/水平(纵横)比。升采样过程和上述情况IV的方法适用于将隔行场变换成逐行帧。当然，当将情况IV的方法应用于本情况中时，是利用通过不需执行对预测出的数据与实际要编解码的层的数据之差进行编解码的操作的层间预测获得的数据来创建虚拟帧的纹理数据和该帧的每一个宏块的运动信息。Figure 14d shows one approach to this situation. In this case, the type of the picture is the same as the type of the macroblocks of the picture. As shown, first, the corresponding fields of the base layer are converted into progressive frames. The transformed frame has the same vertical/horizontal (aspect) ratio as the picture of the current layer. The up-sampling process and method of Case IV above apply to convert interlaced fields into progressive frames. Of course, when the method of case IV is applied to this case, the data obtained by inter-layer prediction without performing the operation of encoding and decoding the difference between the predicted data and the data of the layer to be actually encoded and decoded is used to Create the texture data of the virtual frame and the motion information of each macroblock of the frame.

一旦获得虚拟帧，就对该虚拟帧执行升采样。在该升采样中，执行内插以允许虚拟帧的分辨率等于当前层的分辨率。此外，使用多种已知方法中的一种基于虚拟帧的每一个宏块的运动信息构造内插出的画面的每一个宏块的运动信息。这里是执行已知的帧宏块-宏块层间运动预测方法，因为以此方式构造的临时性基层的画面的每一个宏块均是帧宏块，而当前层的每一个宏块是帧宏块。Once a virtual frame is obtained, upsampling is performed on the virtual frame. In this upsampling, interpolation is performed to allow the resolution of the virtual frame to be equal to the resolution of the current layer. Furthermore, the interpolated motion information for each macroblock of the picture is constructed based on the motion information for each macroblock of the virtual frame using one of several known methods. Here is to implement the known frame macroblock-macroblock inter-layer motion prediction method, because each macroblock of the picture of the temporary base layer constructed in this way is a frame macroblock, and each macroblock of the current layer is a frame macroblock.

图14e示出根据本发明的另一个实施例的针对以上情况M_D)的处理方法。如图所示，该实施例将奇或偶相应场变换成逐行帧。为了将隔行场变换成逐行帧，如图14d所示应用升采样和上述的情况IV的方法。一旦获得虚拟帧，就对虚拟帧应用具有相同纵横比的画面之间的运动预测的方法——其为多种已知方法中的一种——来进行当前层的画面与临时性层之间的运动预测，以执行当前层的逐行画面的每一个宏块的运动信息的预测编解码。Fig. 14e shows a processing method for the above case M_D) according to another embodiment of the present invention. As shown, this embodiment converts odd or even corresponding fields into progressive frames. To convert an interlaced field to a progressive frame, upsampling is applied as shown in Figure 14d and the method of Case IV above. Once a virtual frame is obtained, a method of motion prediction between pictures with the same aspect ratio is applied to the virtual frame - which is one of several known methods - between the pictures of the current layer and the temporary layer motion prediction to perform predictive coding and decoding of the motion information of each macroblock of the progressive picture of the current layer.

图14e所示的方法与图14d的方法的不同之处在于不生成临时的预测信号。The method shown in Fig. 14e differs from the method in Fig. 14d in that no temporary prediction signal is generated.

图14f示出根据本发明的另一个实施例的针对以上情况M_D)的处理方法。如图所示，该实施例拷贝基层的相应场的所有宏块的运动信息以创建虚拟画面。然后执行升采样。在该升采样中，使用基层的画面的纹理信息，并将不同的内插率用于垂直和水平内插以使得经放大的画面具有与当前层的画面相同的大小(或即分辨率)。此外，可将多种已知的预测方法中的一种(例如，扩展特殊可升降级性(ESS))应用于虚拟画面以构造经放大画面的各种句法信息和运动信息。在该过程中构造出的运动向量根据放大比率被扩展。一旦临时性基层的经升采样画面被构造出来，就将该画面用于执行当前层的画面中的每一个宏块的层间运动预测，以编解码当前层的画面的每一个宏块的运动信息。这里，应用已知的帧宏块＝宏块层间运动预测方法。Fig. 14f shows a processing method for the above case M_D) according to another embodiment of the present invention. As shown, this embodiment copies the motion information of all macroblocks of the corresponding field of the base layer to create a virtual picture. Then perform upsampling. In this upsampling, the texture information of the picture of the base layer is used, and different interpolation rates are used for vertical and horizontal interpolation so that the enlarged picture has the same size (or resolution) as the picture of the current layer. Furthermore, one of various known prediction methods such as Extended Specific Scalability (ESS) may be applied to the virtual picture to construct various syntax information and motion information of the enlarged picture. The motion vectors constructed in this process are expanded according to the magnification ratio. Once the upsampled picture of the temporary base layer is constructed, this picture is used to perform inter-layer motion prediction for each macroblock in the picture of the current layer to encode and decode the motion of each macroblock in the picture of the current layer information. Here, the known frame-macroblock=macroblock inter-layer motion prediction method is applied.

图14g示出根据本发明的另一个实施例的针对以上情况M_D)的处理方法。如图所示，该实施例首先拷贝基层的相应场的所有宏块的运动信息以创建虚拟画面。之后，使用基层的画面的纹理信息以对于垂直和水平内插不同的比率执行内插。通过该操作创建的纹理信息被用于层间纹理预测。此外，虚拟画面中的运动信息被用于执行当前层的画面中的每一个宏块的层间运动预测。这里，应用多种已知方法中的一种(例如，在联合可升降级视频模型(JSVM)中定义的扩展特殊可升降级性(ESS))来执行当前层的画面的运动预测编解码。Fig. 14g shows a processing method for the above case M_D) according to another embodiment of the present invention. As shown, this embodiment first copies the motion information of all macroblocks of the corresponding field of the base layer to create a virtual picture. Afterwards, interpolation is performed at different rates for vertical and horizontal interpolation using the texture information of the picture of the base layer. The texture information created by this operation is used for inter-layer texture prediction. Furthermore, the motion information in the virtual picture is used to perform inter-layer motion prediction for each macroblock in the picture of the current layer. Here, one of various known methods such as Extended Specific Scalability (ESS) defined in Joint Scalable Video Model (JSVM) is applied to perform motion prediction coding and decoding of pictures of the current layer.

图14g所示的方法与图14f的方法的不同之处在于不生成临时的预测信号。The method shown in Fig. 14g differs from the method in Fig. 14f in that no temporary prediction signal is generated.

M_E).基层(MBAFF帧)->增强层(MBAFF帧)M_E). Base layer (MBAFF frame) -> Enhancement layer (MBAFF frame)

图14h示出针对该情况的处理方法。如图所示，首先，将基层的相应MBAFF帧变换成逐行帧。为了将MBAFF帧变换成逐行帧，上述情况III的方法适用于MBAFF帧的场宏块对的变换，并且帧-帧预测方法适用于MBAFF帧的帧宏块对的变换。当然，当将情况III的方法应用于本情况中时，是利用通过不需执行编解码预测出的数据与实际要编解码的层的数据之差的操作的层间预测获得的数据来创建虚拟帧和该帧的每一个宏块的运动信息。Fig. 14h shows the processing method for this case. As shown, first, the corresponding MBAFF frame of the base layer is converted into a progressive frame. In order to transform MBAFF frames into progressive frames, the method of Case III above is applied to the transformation of field macroblock pairs of MBAFF frames, and the frame-to-frame prediction method is applied to the transformation of frame macroblock pairs of MBAFF frames. Of course, when the method of case III is applied to this case, the virtual Motion information for a frame and for each macroblock in that frame.

一旦获得虚拟帧，就对该虚拟帧执行升采样。在该升采样中，以允许基层的分辨率与当前层的分辨率相等的内插率执行内插。此外，利用多种已知方法中的一种基于虚拟帧的每一个宏块的运动信息构造经放大的画面的每一个宏块的运动信息。应用上述情况I的方法来执行层间运动预测，因为以此方式构造的临时性基层的画面的每一个宏块均是帧宏块，而当前层的每一个宏块均是MBAFF帧中的场宏块。然而，如上所述不仅场宏块对，帧宏块对也可被包括在同一MBAFF帧中。相应地，在对应于临时性基层的画面中的宏块对的当前层宏块对是帧宏块而不是场宏块时，应用已知的在帧宏块之间的包括运动信息的拷贝的运动预测的方法(帧-帧预测方法)。Once a virtual frame is obtained, upsampling is performed on the virtual frame. In this upsampling, interpolation is performed at an interpolation rate that allows the resolution of the base layer to be equal to the resolution of the current layer. Furthermore, the motion information of each macroblock of the enlarged picture is constructed based on the motion information of each macroblock of the virtual frame using one of various known methods. Apply the method of case I above to perform inter-layer motion prediction, because each macroblock of the picture of the temporary base layer constructed in this way is a frame macroblock, and each macroblock of the current layer is a field in an MBAFF frame macroblock. However, not only field macroblock pairs but also frame macroblock pairs may be included in the same MBAFF frame as described above. Correspondingly, when the current layer macroblock pair corresponding to the macroblock pair in the picture of the temporary base layer is a frame macroblock instead of a field macroblock, the known method of copying between frame macroblocks including motion information is applied A method of motion prediction (frame-by-frame prediction method).

M_F).基层(MBFF帧)->增强层(隔行场)M_F). Base layer (MBFF frame) -> enhancement layer (interlaced field)

图14i示出该情况的处理方法。如图所示，首先，将基层的相应MBAFF帧变换成逐行帧。为了将MBAFF帧变换成逐行帧，上述情况III的方法适用于MBAFF帧的场宏块对的变换，并且帧-帧预测方法适用于MBAFF帧的帧宏块对的变换。当然，同样，当将情况III的方法应用于本情况中时，是利用通过不需执行编解码预测出的数据与实际要编解码的层的数据之差的操作的层间预测获得的数据来创建虚拟帧和该帧的每一个宏块的运动信息。Figure 14i shows how this situation is handled. As shown, first, the corresponding MBAFF frame of the base layer is converted into a progressive frame. In order to transform MBAFF frames into progressive frames, the method of Case III above is applied to the transformation of field macroblock pairs of MBAFF frames, and the frame-to-frame prediction method is applied to the transformation of frame macroblock pairs of MBAFF frames. Of course, similarly, when the method of case III is applied to this case, the data obtained by the inter-layer prediction that does not need to perform the operation of the difference between the codec predicted data and the data of the layer actually to be codec is used to obtain Create a virtual frame and motion information for each macroblock in that frame.

一旦获取虚拟帧，就以允许分辨率等于当前层的分辨率的内插率对该虚拟帧执行内插。此外，使用多种已知方法中的一种基于虚拟帧的每一个宏块的运动信息构造经放大的画面的每一个宏块的运动信息。应用上述情况II的方法来执行层间运动预测，因为以此方式构造的临时性基层的画面的每一个宏块均是帧宏块，而当前层的每一个宏块均是偶或奇场中的场宏块。Once a virtual frame is acquired, it is interpolated at an interpolation rate that allows a resolution equal to that of the current layer. Furthermore, the motion information for each macroblock of the enlarged picture is constructed based on the motion information for each macroblock of the virtual frame using one of several known methods. Apply the method of the above case II to perform inter-layer motion prediction, because each macroblock of the picture of the temporary base layer constructed in this way is a frame macroblock, and each macroblock of the current layer is an even or odd field field macroblocks.

M_G).基层(隔行场)->增强层(MBAFF帧)M_G). Base layer (interlaced field) -> enhancement layer (MBAFF frame)

图14j示出针对该情况的处理方法。如图所示，首先，将基层的隔行场变换成逐行帧。应用升采样和上述情况IV的方法将隔行场变换成逐行帧。当然，同样，当将情况IV的方法应用于本情况中时，是利用通过不需执行对预测出的数据与实际要编解码的层的数据之差进行编解码的操作的层间预测获得的数据来创建虚拟帧和该帧的每一个宏块的运动信息。Fig. 14j shows the processing method for this case. As shown in the figure, first, the interlaced fields of the base layer are converted into progressive frames. Apply upsampling and the method of Case IV above to convert the interlaced fields into progressive frames. Of course, similarly, when the method of Case IV is applied to this case, it is obtained by inter-layer prediction without performing the operation of encoding and decoding the difference between the predicted data and the data of the layer to be actually coded and decoded. data to create a virtual frame and motion information for each macroblock of that frame.

一旦获得虚拟帧，就对该虚拟帧执行升采样以允许分辨率等于当前层的分辨率。此外，利用多种已知方法中的一种构造经放大的画面的每一个宏块的运动信息。应用上述情况I的方法来执行层间运动预测，因为以此方式构造的临时性基层的画面的每一个宏块均是帧宏块，而当前层的每一个宏块均是MBAFF帧中的场宏块。然而，如上所述不仅场宏块对，帧宏块对也可被包括在同一MBAFF帧中。因此，在对应于临时性基层的画面中的宏块对的当前层宏块对包括帧宏块而不是场宏块时，应用已知的在帧宏块之间的运动预测的方法(帧-帧预测方法)而不是上述情况I的预测方法。Once a virtual frame is obtained, upsampling is performed on the virtual frame to allow a resolution equal to that of the current layer. Furthermore, motion information for each macroblock of the enlarged picture is constructed using one of several known methods. Apply the method of case I above to perform inter-layer motion prediction, because each macroblock of the picture of the temporary base layer constructed in this way is a frame macroblock, and each macroblock of the current layer is a field in an MBAFF frame macroblock. However, not only field macroblock pairs but also frame macroblock pairs may be included in the same MBAFF frame as described above. Therefore, when the current layer macroblock pair corresponding to the macroblock pair in the picture of the temporary base layer comprises a frame macroblock instead of a field macroblock, the known method of motion prediction between frame macroblocks (frame- frame prediction method) instead of the prediction method of case I above.

M_H).基层(隔行场)->增强层(隔行场)M_H). Base layer (interlaced field) -> enhancement layer (interlaced field)

图14k示出针对该情况的处理方法。如图所示，首先，基层中的相应场的所有宏块的运动信息被拷贝以创建虚拟场，然后对该虚拟场执行升采样。该升采样以允许基层的分辨率与当前层的分辨率相等的升采样率执行。此外，使用多种已知方法中的一种基于所创建的虚拟帧的每一个宏块的运动信息构造经放大的画面的每一个宏块的运动信息。应用上述情况V中的情况iv)的方法来执行层间运动预测，因为以此方式构造的临时性基层的画面的每一个宏块均是场画面中的场宏块，而当前层的每一个宏块也均是场画面中的场宏块。Fig. 14k shows the processing method for this case. As shown in the figure, first, motion information of all macroblocks of a corresponding field in a base layer is copied to create a virtual field, and then upsampling is performed on the virtual field. This upsampling is performed at an upsampling rate that allows the resolution of the base layer to be equal to the resolution of the current layer. Furthermore, the motion information of each macroblock of the enlarged picture is constructed based on the motion information of each macroblock of the created virtual frame using one of several known methods. Apply the method of case iv) in the above case V to perform inter-layer motion prediction, because each macroblock of the picture of the temporary base layer constructed in this way is a field macroblock in the field picture, and each macroblock of the current layer Each macroblock is also a field macroblock in a field picture.

尽管在图14a至14k的实施例的描述中是使用临时性层的虚拟场或帧的纹理信息而不是基层的画面的纹理信息来进行升采样，但基层画面的纹理信息也可用于升采样。此外，如果不是必要的话，当推导要用于在后续级中执行的层间运动预测的临时性层的画面的运动信息时，在上述的升采样过程中可省略利用纹理信息的内插过程。Although in the description of the embodiment of Figs. 14a to 14k the texture information of the virtual field or frame of the temporary layer is used for upsampling instead of the texture information of the picture of the base layer, the texture information of the picture of the base layer can also be used for upsampling. Furthermore, if not necessary, when deriving motion information of a picture of a temporary layer to be used for inter-layer motion prediction performed in a subsequent stage, an interpolation process using texture information may be omitted in the above-described upsampling process.

另一方面，尽管纹理预测的描述是针对基层和增强层具有相同空间分辨率的情况给出的，但如上所述这两个层可能具有不同的空间分辨率。在增强层的分辨率高于基层的分辨率的情况下，首先，执行使基层的画面的分辨率等于增强层的画面的分辨率的操作，以创建具有与增强层的分辨率相同的分辨率的基层画面，并基于该画面中的每一个宏块选择与上述情况I-V中的每一种情况相对应的纹理预测方法以执行预测编解码。现在详细描述使基层画面的分辨率等于增强层画面的分辨率的程序。On the other hand, although the description of texture prediction is given for the case where the base layer and the enhancement layer have the same spatial resolution, the two layers may have different spatial resolutions as mentioned above. In the case where the resolution of the enhancement layer is higher than that of the base layer, first, an operation of making the resolution of the picture of the base layer equal to the resolution of the picture of the enhancement layer is performed to create a picture having the same resolution as that of the enhancement layer base layer picture, and select a texture prediction method corresponding to each of the above cases I-V on a per macroblock basis in the picture to perform predictive coding and decoding. The procedure for making the resolution of the base layer picture equal to the resolution of the enhancement layer picture is now described in detail.

当考虑用于层间预测的两层时，用于在两层之间编解码的画面格式(逐行和隔行格式)的组合数目是4，因为有两种视频信号扫描方法，一种是逐行扫描而另一种是隔行扫描。因此，将分别针对这四种情况中的每一种描述增加基层画面的分辨率以执行层间纹理预测的方法。When considering two layers for inter-layer prediction, the number of combinations of picture formats (progressive and interlaced formats) for encoding and decoding between the two layers is 4, because there are two video signal scanning methods, one is progressive One is line scanning and the other is interlaced scanning. Therefore, a method of increasing the resolution of a base layer picture to perform inter-layer texture prediction will be described for each of these four cases, respectively.

T_A).增强层是逐行的而基层是隔行的情况T_A). The case where the enhancement layer is progressive and the base layer is interlaced

图15a示出针对该情况将基层画面用于层间纹理预测的方法的实施例。如图所示，时间上对应于当前(增强)层的画面1500的基层画面1501包括在不同时间输出的偶和奇场。因此，首先，EL编码器20将基层的画面分成偶和奇场(S151)。基层画面1501的内模式宏块具有用于内模式预测的未被编码的原始图像数据(或已被解码的图像数据)，且其模式间宏块具有用于残差预测的经编码的残差数据(或经解码的残差数据)。当在下文中描述纹理预测时，对于基层宏块或画面同样如此。Figure 15a shows an embodiment of a method for using base layer pictures for inter-layer texture prediction for this case. As shown, a base layer picture 1501 corresponding in time to a picture 1500 of the current (enhancement) layer includes even and odd fields output at different times. Therefore, first, the EL encoder 20 divides the picture of the base layer into even and odd fields (S151). The intra-mode macroblocks of the base layer picture 1501 have uncoded raw image data (or decoded image data) for intra-mode prediction, and their inter-mode macroblocks have coded residuals for residual prediction. data (or decoded residual data). The same is true for base layer macroblocks or pictures when texture prediction is described below.

在将相应画面1501分成场分量后，EL编码器20在垂直和/或水平方向上执行分离出的场1501a和1501b的内插，以创建经放大的偶和奇画面1502a和1502b(S152)。该内插使用多种已知方法中的一种，诸如6抽头滤波和二进制线性滤波。用于通过内插增加画面的分辨率(即，大小)的垂直和水平比等于增强层画面1500的大小与基层画面1501的大小的垂直和水平比。垂直和水平比可彼此相等。例如，如果增强层和基层之间的分辨率是2，则对分离出的偶和奇场1501a和1501b执行内插，以在垂直和水平方向上在每个场中的每个像素之间再创建一个像素。After dividing the corresponding picture 1501 into field components, the EL encoder 20 performs interpolation of the separated fields 1501a and 1501b in vertical and/or horizontal directions to create enlarged even and odd pictures 1502a and 1502b (S152). This interpolation uses one of several known methods, such as 6-tap filtering and binary linear filtering. The vertical and horizontal ratios for increasing the resolution (ie, size) of a picture by interpolation are equal to the vertical and horizontal ratios of the size of the enhancement layer picture 1500 to the size of the base layer picture 1501 . The vertical and horizontal ratios may be equal to each other. For example, if the resolution between the enhancement layer and the base layer is 2, then interpolation is performed on the separated even and odd fields 1501a and 1501b to reconstruct between each pixel in each field both vertically and horizontally. Create a pixel.

一旦内插完成，则组合经放大的偶和奇场1502a和1502b以构造画面1503(S153)。在该组合中，交替地选择经放大的偶和奇场1502a和1502b的行(1502a->1502b->1502a->1502b->..)然后将其按选择的次序编排以构造出组合的画面1503。这里，确定组合的画面1503中的每一个宏块的块模式。例如，组合的画面1503的宏块的块模式被确定为与包括具有相同图像成分的区域的基层画面1501中的宏块的块模式相等。这种确定方法可应用于以下描述的经放大的画面的任何情况中。因为以此方式构造的组合画面1503具有与增强层的当前画面1500相同的空间分辨率，所以基于组合画面1503的相应宏块来执行当前逐行画面1500中的宏块的纹理预测(例如，帧-帧宏块间纹理预测)(S154)。Once the interpolation is complete, the amplified even and odd fields 1502a and 1502b are combined to construct the frame 1503 (S153). In this combination, rows of amplified even and odd fields 1502a and 1502b are alternately selected (1502a->1502b->1502a->1502b->..) and then arranged in the selected order to construct the combined picture 1503. Here, the block mode of each macroblock in the combined picture 1503 is determined. For example, the block mode of the macroblock of the combined picture 1503 is determined to be equal to the block mode of the macroblock in the base layer picture 1501 including an area having the same image component. This determination method is applicable in any case of the enlarged picture described below. Because the combined picture 1503 constructed in this way has the same spatial resolution as the current picture 1500 of the enhancement layer, the texture prediction of the macroblocks in the current progressive picture 1500 (e.g., frame - Inter-macroblock texture prediction) (S154).

图15b示出根据本发明的另一个实施例的在层间纹理预测中使用基层画面的方法。如图所示，该实施例不在场属性(奇偶性)的基础上分离基层画面，而是在垂直和/或水平方向上直接执行包括在不同时间输出的偶和奇场的基层画面的内插(S155)，以构造分辨率与增强层画面的分辨率(即，大小)相同的经放大画面。以此方式构造的经放大画面被用于执行增强层的当前逐行画面的层间纹理预测(S156)。Fig. 15b shows a method of using base layer pictures in inter-layer texture prediction according to another embodiment of the present invention. As shown, this embodiment does not separate base layer pictures on the basis of field attributes (parity), but directly performs interpolation of base layer pictures including even and odd fields output at different times in the vertical and/or horizontal direction (S155) to construct an enlarged picture with the same resolution (ie, size) as that of the enhancement layer picture. The enlarged picture constructed in this way is used to perform inter-layer texture prediction of the current progressive picture of the enhancement layer (S156).

图15a在画面级别上示出通过在场属性的基础上分离具有偶和奇场的画面来对其进行内插的程序。然而，EL编码器20可通过在宏块级别上执行图15a所示的程序来达成与图15a所示相同的结果。更具体地，当具有偶和奇场的基层是已被MBAFF编码时，画面1501中垂直毗邻宏块对——其与目前受到纹理预测编解码的增强层画面中的宏块对同位——可如图16a或16b中那样包括的偶和奇场分量的视频信号。图16a示出其中偶和奇场分量在一对宏块A和B中的每一个宏块中交织的帧MB对模式，而图16b示出其中一对宏块A和B中的每一个宏块包括具有相同场属性的视频行的场MB对模式。Figure 15a shows at picture level the procedure for interpolating pictures with even and odd fields by separating them on the basis of field attributes. However, the EL encoder 20 can achieve the same result as shown in FIG. 15a by performing the procedure shown in FIG. 15a at the macroblock level. More specifically, when the base layer with even and odd fields is MBAFF coded, vertically adjacent pairs of macroblocks in picture 1501 that are co-located with pairs of macroblocks in the enhancement layer picture currently subject to texture prediction codecs can be A video signal comprising even and odd field components as in Figure 16a or 16b. Figure 16a shows a frame MB pair mode in which even and odd field components are interleaved in each of a pair of macroblocks A and B, while Figure 16b shows a pattern in which each of a pair of macroblocks A and B A block consists of field MB pair patterns of video lines with the same field attributes.

在图16a的情况下，为了应用图15a中所示的方法，选择该对宏块A和B中每一个宏块的偶行来构造偶场块A′，并选择其奇行来构造奇场块B′，从而将每个宏块中都交织有偶和奇场分量的宏块对分成分别具有偶和奇场分量的两个块A′和B′。对以此方式分离出的两个宏块A′和B′中的每一个执行内插以构造经放大块。利用经放大块中与当前将受到纹理预测编解码的增强层画面中intra_BL(基层内)或residual_prediction(残差预测)模式的宏块相对应的区域中的数据来执行纹理预测。尽管图16a中未示出，但部分地在场属性基础上组合诸个体地放大的块可构造图15a中的经放大偶和奇画面1502a和1502b，因此图15a中的经放大偶和奇画面1502a和1502b可通过对每对宏块重复上面的操作来构造。In the case of Fig. 16a, to apply the method shown in Fig. 15a, the even row of each macroblock in the pair of macroblocks A and B is selected to construct the even-field block A', and its odd row is selected to construct the odd-field block B', thereby splitting the pair of macroblocks with even and odd field components interleaved in each macroblock into two blocks A' and B' with even and odd field components, respectively. Interpolation is performed on each of the two macroblocks A' and B' separated in this way to construct an enlarged block. Texture prediction is performed using data in an area of the upscaled block corresponding to a macroblock of intra_BL (intra-base layer) or residual_prediction (residual prediction) mode in an enhancement layer picture to be currently subject to texture prediction codec. Although not shown in FIG. 16a, combining the individually enlarged blocks on a field attribute basis may construct the enlarged even and odd frames 1502a and 1502b in FIG. 15a, so the enlarged even and odd frames 1502a in FIG. and 1502b can be constructed by repeating the above operation for each pair of macroblocks.

在如图16b那样基于场属性分割宏块对以构造每个宏块的情况下，上述的分离程序是从该宏块对简单地拷贝每个宏块以构造两个分离的宏块的过程。后续的程序类似于参考图16a所述的程序。In the case of splitting a pair of macroblocks based on field attributes to construct each macroblock as in FIG. 16b, the separation procedure described above is a process of simply copying each macroblock from the pair of macroblocks to construct two separate macroblocks. The subsequent procedure is similar to that described with reference to Figure 16a.

T_B).增强层是隔行的而基层是逐行的情况T_B). The case where the enhancement layer is interlaced and the base layer is progressive

图17a示出针对该情况将基层画面用于层间纹理预测的方法的实施例。如图所示，首先，EL编码器20为当前层画面1700构造两个画面(S171)。在应用构造两个画面的示例方法中，选择相应画面1701的偶行来构造一个画面1701a，并选择其奇行来构造另一个画面1701b。EL编码器20然后在垂直和/或水平方向上执行如此构造出的两个画面1701a和1701b的内插以创建两个经放大的画面1702a和1702b(S172)。该内插使用多种已知方法中的一种，诸如情况T_A)中的6抽头滤波和二进制线性滤波。用于增加分辨率的比也与情况T_A)中描述的那些相同。Figure 17a shows an embodiment of a method for using base layer pictures for inter-layer texture prediction for this case. As shown, first, the EL encoder 20 constructs two pictures for the current layer picture 1700 (S171). In applying an example method of constructing two pictures, an even row of a corresponding picture 1701 is selected to construct one picture 1701a, and an odd row thereof is selected to construct another picture 1701b. The EL encoder 20 then performs interpolation of the thus constructed two pictures 1701a and 1701b in the vertical and/or horizontal direction to create two enlarged pictures 1702a and 1702b (S172). This interpolation uses one of several known methods, such as 6-tap filtering and binary linear filtering in case T_A). The ratios for increasing the resolution are also the same as those described in case T_A).

一旦内插完成，就组合这两个经放大的场1702a和1702b以构造画面1703(S173)。在该组合中，交替地选择这两个经放大的场1702a和1702b的行(1702a->1702b->1702a->1702b->...)然后将其按选择的次序编排以构造组合的画面1703。因为以此方式构造出的组合画面1703具有与增强层的当前画面1700相同的空间分辨率，所以基于组合的画面1703的相应宏块来执行当前隔行画面1700中的宏块的纹理预测(例如，帧-帧宏块间纹理预测或参考图4g描述的纹理预测)(S174)。Once the interpolation is complete, the two upscaled fields 1702a and 1702b are combined to construct the frame 1703 (S173). In this combination, the lines of the two enlarged fields 1702a and 1702b are alternately selected (1702a->1702b->1702a->1702b->...) and then arranged in the selected order to construct the combined picture 1703. Because the combined picture 1703 constructed in this way has the same spatial resolution as the current picture 1700 of the enhancement layer, texture prediction of the macroblocks in the current interlaced picture 1700 is performed based on the corresponding macroblocks of the combined picture 1703 (e.g., Frame-to-frame inter-macroblock texture prediction or texture prediction described with reference to FIG. 4g) (S174).

图17b示出根据本发明的另一个实施例的在层间纹理预测中使用基层画面的方法。如图所示，该实施例不将基层画面分成两个画面，而是在垂直和/或水平方向上直接执行基层画面的内插(S175)，以构造分辨率与增强层画面分辨率(即，大小)相同的经放大画面。以此方式构造的经放大画面被用于执行增强层的当前隔行画面的层间纹理预测(S176)。Fig. 17b shows a method of using base layer pictures in inter-layer texture prediction according to another embodiment of the present invention. As shown in the figure, this embodiment does not divide the base layer picture into two pictures, but directly performs the interpolation of the base layer picture in the vertical and/or horizontal direction (S175), so as to construct a resolution that is consistent with the resolution of the enhancement layer picture (i.e. , size) the same enlarged picture. The enlarged picture constructed in this way is used to perform inter-layer texture prediction of the current interlaced picture of the enhancement layer (S176).

尽管图17a的描述也是在画面级别上给出的，但EL编码器20可如以上情况T_A)中所述在宏块级别上执行画面分离过程。当将单个画面1701视为垂直毗邻宏块对时，图17b的方法类似于图17a所示的分离和内插程序。这里省略了该程序的详细描述，因为其可从图17a直观地理解。Although the description of Fig. 17a is also given on the picture level, the EL encoder 20 may perform the picture separation process on the macroblock level as described in case T_A) above. The method of Fig. 17b is similar to the separation and interpolation procedure shown in Fig. 17a when a single picture 1701 is considered as a pair of vertically contiguous macroblocks. A detailed description of this procedure is omitted here because it can be intuitively understood from Fig. 17a.

T_C).增强层和基层两者都是隔行的情况T_C). The case where both the enhancement layer and the base layer are interlaced

图18示出针对该情况将基层画面用于层间纹理预测的方法的实施例。在这种情况下，如图所示，EL编码器20以与情况T_A)中相同的方式将时间上对应于当前层画面1800的基层画面1801分成偶和奇场(S181)。EL编码器20然后在垂直和/或水平方向上执行分离出的场1801a和1801b的内插以创建经放大的偶和奇画面1802a和1802b(S182)。EL编码器20然后组合经放大的偶和奇场1802a和1802b以构造画面1803(S182)。EL编码器20然后基于组合画面1803的相应宏块执行当前隔行画面1800中的宏块(MBAFF编码的帧宏块对)的层间纹理预测(例如，帧-帧宏块间纹理预测或参考图4g描述的纹理预测)(S184)。Fig. 18 shows an embodiment of a method for using base layer pictures for inter-layer texture prediction for this case. In this case, as shown in the drawing, the EL encoder 20 divides the base layer picture 1801 temporally corresponding to the current layer picture 1800 into even and odd fields in the same manner as in case T_A) (S181). The EL encoder 20 then performs interpolation of the separated fields 1801a and 1801b in the vertical and/or horizontal direction to create enlarged even and odd pictures 1802a and 1802b (S182). EL encoder 20 then combines the amplified even and odd fields 1802a and 1802b to construct picture 1803 (S182). The EL encoder 20 then performs inter-layer texture prediction (e.g., frame-to-frame inter-macroblock texture prediction or reference picture 4g described texture prediction) (S184).

尽管两个层具有相同的画面格式，但EL编码器20在场属性的基础上分离基层画面1801(S181)并个体地放大分离出的场(S182)然后组合经放大的画面(S183)，这是因为如果组合偶和奇场的画面1801在其具有偶和奇场的视频信号变化很大的特性时被直接内插，则与增强层的具有交织的偶和奇场的隔行画面1800相比，经放大的画面可能会具有畸变的图像(例如，具有伸展边界的图像)。相应地，即使两个层都是隔行的，根据本发明，EL编码器20也在将基层画面在场属性的基础上分离后使用其来获得两个场，并个体地放大这两个场，然后组合经放大的场。Although the two layers have the same picture format, the EL encoder 20 separates the base layer picture 1801 on the basis of field attributes (S181) and individually enlarges the separated fields (S182) and then combines the enlarged pictures (S183), which is Because if the combined even and odd field picture 1801 is directly interpolated when it has the even and odd field video signal's widely varying characteristics, compared to the enhancement layer's interlaced picture 1800 with interleaved even and odd fields, The enlarged frame may have a distorted image (eg, an image with stretched borders). Accordingly, even if both layers are interlaced, according to the present invention, the EL encoder 20 uses it to obtain two fields after separating the base layer picture on the basis of field attributes, and individually amplifies the two fields, and then Combine the amplified fields.

当然，可以并非在两个层的画面都是隔行时总是使用图18所示的方法，而是代之以根据画面的视频特性选择地使用该方法。Of course, the method shown in FIG. 18 may not always be used when the pictures of both layers are interlaced, but may be selectively used according to the video characteristics of the pictures instead.

图18在画面级别上示出根据本发明在场属性基础上分离并放大具有偶和奇场的画面的程序。然而，如以上T_A)中所述，EL编码器20可通过在宏块级别上执行图18所示的程序来达成与图18所示相同的结果，其包括参考图16a和16b描述的基于宏块的分离和内插过程(具体而言是将帧宏块对分成偶和奇行的块并个体地放大分离出的块)以及组合和层间纹理预测过程(具体而言是交替地选择经放大的块的行以构造一对经放大的块，并利用所构造出的经放大的块对来执行当前层的帧宏块对的纹理预测)。Fig. 18 shows at the picture level the procedure for separating and enlarging pictures with even and odd fields on the basis of field attributes according to the present invention. However, as described in T_A) above, the EL encoder 20 can achieve the same result as shown in FIG. 18 by executing the procedure shown in FIG. 18 at the macroblock level, which includes the macro-based Block separation and interpolation processes (specifically, splitting frame macroblock pairs into even and odd row blocks and upscaling the separated blocks individually) and combining and inter-layer texture prediction processes (specifically, alternately selecting the upscaled to construct a pair of enlarged blocks, and use the constructed pair of enlarged blocks to perform texture prediction of the frame macroblock pair of the current layer).

T_D).增强层和基层两者都是逐行的情况T_D). The case where both the enhancement layer and the base layer are progressive

在这种情况下，将基层画面放大至与增强层画面相同的大小，并将经放大的画面用于具有相同画面格式的当前增强层画面的层间纹理预测。In this case, the base layer picture is upscaled to the same size as the enhancement layer picture, and the upscaled picture is used for inter-layer texture prediction of the current enhancement layer picture with the same picture format.

尽管以上已经描述了在基层和增强层具有相同的时间分辨率时的纹理预测的实施例，但两个层可能具有不同的时间分辨率，即，不同的画面率。如果即使在诸层具有相同时间分辨率时诸层的画面也是不同的画面扫描类型，则这些画面可能包含具有不同输出时间的视频信号，即使它们是相同POC的画面(即，时间上彼此对应的画面)。现在将描述这种情况下的层间纹理预测方法。在以下的描述中，假设两个层最初具有相同的空间分辨率。如果两层具有不同的空间分辨率，则如上所述地在升采样基层的每一个画面以使空间分辨率等于增强层的分辨率之后再应用以下描述的方法。Although the above has described embodiments of texture prediction when the base layer and the enhancement layer have the same temporal resolution, the two layers may have different temporal resolutions, ie different frame rates. If the pictures of the layers are of different picture scan types even when the layers have the same temporal resolution, these pictures may contain video signals with different output times, even though they are pictures of the same POC (i.e., temporally corresponding to each other). screen). The inter-layer texture prediction method in this case will now be described. In the following description, it is assumed that the two layers initially have the same spatial resolution. If the two layers have different spatial resolutions, the method described below is applied after upsampling each picture of the base layer to make the spatial resolution equal to that of the enhancement layer, as described above.

a)增强层包括逐行帧、基层包括MBAFF帧、且增强层的时间分辨率达两倍之高的情况a) The case where the enhancement layer consists of progressive frames, the base layer consists of MBAFF frames, and the temporal resolution of the enhancement layer is twice as high

图19a示出针对这种情况的层间纹理预测方法。如图所示，基层的每一个MBAFF帧包括具有不同输出时间的偶和奇场，因此EL编码器20将每一个MABFF帧分成偶和奇场(S191)。EL编码器20将每一个MBAFF帧的偶场分量(例如，偶行)和奇场分量(例如，奇行)分别分成偶场和奇场。在以此方式将MBAFF帧分成两个场之后，EL编码器20在垂直方向上内插每个场以使其具有达两倍之高的分辨率(S192)。该内插使用多种已知方法中的一种，诸如6抽头滤波、二进制线性滤波、和样本行补零。一旦内插完成，增强层的每一帧在基层中就有时间上一致的画面，因此EL编码器20对增强层的每帧的宏块执行已知的层间纹理预测(例如，帧-帧宏块间预测)(S193)。Fig. 19a shows an inter-layer texture prediction method for this case. As shown, each MBAFF frame of the base layer includes even and odd fields with different output times, so the EL encoder 20 divides each MABFF frame into even and odd fields (S191). The EL encoder 20 separates even field components (eg, even lines) and odd field components (eg, odd lines) of each MBAFF frame into even fields and odd fields, respectively. After dividing the MBAFF frame into two fields in this way, the EL encoder 20 interpolates each field in the vertical direction to have up to twice as high resolution (S192). This interpolation uses one of several known methods, such as 6-tap filtering, binary linear filtering, and sample row zero padding. Once the interpolation is complete, each frame of the enhancement layer has temporally consistent pictures in the base layer, so the EL encoder 20 performs known inter-layer texture prediction (e.g., frame-by-frame inter-macroblock prediction) (S193).

还可将以上的程序应用于层间运动预测。这里，当将MBAFF帧分成两个场时，EL编码器20拷贝MBAFF帧中的场宏块对中的每一个宏块的运动信息作为具有相同场属性(奇偶性)的宏块的运动信息，以将其用于层间运动预测。即使在基层中没有时间上一致的画面时(在t1、t3...的情况下)，使用该方法也能根据上述方法创建出时间上一致的画面以执行层间运动预测。The above procedure can also be applied to inter-layer motion prediction. Here, when the MBAFF frame is divided into two fields, the EL encoder 20 copies the motion information of each macroblock in the field macroblock pair in the MBAFF frame as the motion information of the macroblock having the same field attribute (parity), to use it for inter-layer motion prediction. Even when there is no temporally consistent picture in the base layer (in the case of t1, t3, . . . ), using this method can create a temporally consistent picture according to the method described above to perform inter-layer motion prediction.

在两层之一的分辨率如图19a的例子中那样是另一层的分辨率的两倍之高时并且甚至在其是N倍(三倍或以上)之高时均能直接应用上述的方法。例如，当分辨率是三倍之高时，可另外拷贝这两个分离出的场之一以构造并使用三个场，并且当分辨率是四倍之高时，可再一次拷贝这两个分离出的场中的每一个以构造并使用四个场。显然，在有任何时间分辨率差异的情况下，本领域的技术人员无需任何创造性思考就可简单地通过应用本发明的原理来执行层间预测。因此，本说明书中没有描述的用于在有不同时间分辨率的层之间预测的任何方法自然落入本发明的范围内。以下描述的其他情况同样如此。The above can be directly applied when the resolution of one of the two layers is twice as high as that of the other layer as in the example of Fig. 19a and even when it is N times (three times or more) higher. method. For example, when the resolution is three times higher, one of the two separated fields can be additionally copied to construct and use three fields, and when the resolution is four times higher, the two can be copied again Each of the fields is separated to construct and use four fields. Obviously, in case of any temporal resolution difference, a person skilled in the art can perform inter-layer prediction simply by applying the principles of the present invention without any creative thinking. Therefore, any method for predicting between layers with different temporal resolutions not described in this specification naturally falls within the scope of the present invention. The same is true for other cases described below.

如果已经将基层编码成画面自适应场和帧(PAFF)而不是MBAFF帧，则两层可能如图19b中那样具有相同的时间分辨率。因此，在这种情况下，在通过无需进行将帧分成两场的过程而直接对帧进行内插来构造具有与当前层相同的时间分辨率的画面后再执行层间纹理预测。If the base layer has been coded as Picture Adaptive Fields and Frames (PAFF) instead of MBAFF frames, both layers may have the same temporal resolution as in Fig. 19b. Therefore, in this case, inter-layer texture prediction is performed after constructing a picture having the same temporal resolution as the current layer by directly interpolating a frame without performing a process of dividing the frame into two fields.

b)增强层包括MBAFF帧、基层包括逐行帧、且增强层的时间分辨率是基层的一半的情况b) The case where the enhancement layer includes MBAFF frames, the base layer includes progressive frames, and the temporal resolution of the enhancement layer is half of that of the base layer

图20示出针对这种情况的层间纹理预测方法。如图所示，增强层的每一个MBAFF帧包括具有不同输出时间的偶和奇场，因此EL编码器20将每一个MABFF帧分成偶和奇场(S201)。EL编码器20将每一个MBAFF帧的偶场分量(例如，偶行)和奇场分量(例如，奇行)分别分成偶场和奇场。EL编码器20在垂直方向上执行基层的每一帧的子采样以构造分辨率减半的画面(S202)。该子采样可使用行子采样或各种其它已知的降采样方法中的一种，在图20的例子中，EL编码器20选择具有偶画面索引的画面(画面t0、t2、t4...)的偶行以获得大小减半的画面，并选择具有奇画面索引的画面(画面t1、t3...)的奇行以获得大小减半的画面。也可以按倒序执行帧分离(S201)和子采样(S202)。Fig. 20 shows an inter-layer texture prediction method for this case. As shown, each MBAFF frame of the enhancement layer includes even and odd fields with different output times, so the EL encoder 20 divides each MABFF frame into even and odd fields (S201). The EL encoder 20 separates even field components (eg, even lines) and odd field components (eg, odd lines) of each MBAFF frame into even fields and odd fields, respectively. The EL encoder 20 performs subsampling of each frame of the base layer in the vertical direction to construct a half-resolution picture (S202). This subsampling can use row subsampling or one of various other known downsampling methods. In the example of FIG. .) to get half-sized pictures, and select odd lines of pictures with odd picture indices (pictures t1, t3...) to get half-sized pictures. Frame separation (S201) and subsampling (S202) may also be performed in reverse order.

一旦完成这两个过程S201和S202，从增强层的帧分离出的场2001在基层中就有了时间上与场2001一致且具有与场2001相同的空间分辨率的画面，由此EL编码器20对每个场中的宏块执行已知的层间纹理预测(例如，帧-帧宏块间预测)(S203)。Once these two processes S201 and S202 are completed, the field 2001 separated from the frame of the enhancement layer has a picture in the base layer that coincides in time with the field 2001 and has the same spatial resolution as the field 2001, whereby the EL encoder 20 performs known inter-layer texture prediction (for example, frame-to-frame inter-macroblock prediction) on macroblocks in each field (S203).

也可将以上的程序应用于层间运动预测。这里，当通过子采样从基层的每个帧获取大小减小的画面(S202)时，EL编码器20可根据适合的方法(例如，采用未被完全划分的块的运动信息的方法)从垂直毗邻宏块对中的每一个宏块的运动信息中获得相应宏块的运动信息，然后可将所获得的运动信息用于层间运动预测。The above procedure can also be applied to inter-layer motion prediction. Here, when acquiring a reduced-size picture from each frame of the base layer by subsampling (S202), the EL encoder 20 may vertically The motion information of each macroblock in the adjacent macroblock pair is obtained from the motion information of the corresponding macroblock, and then the obtained motion information can be used for inter-layer motion prediction.

在这种情况下，增强层的画面被PAFF编码以便传送，因为层间预测是对从MBAFF帧分离出的每个场画面2001执行的。In this case, the pictures of the enhancement layer are PAFF coded for transmission because inter-layer prediction is performed on each field picture 2001 separated from the MBAFF frame.

c)增强层包括MBAFF帧、基层包括逐行帧、且两层具有相同的时间分辨率的情况c) The enhancement layer includes MBAFF frames, the base layer includes progressive frames, and the two layers have the same temporal resolution

图21示出针对这种情况的层间纹理预测方法。如图所示，增强层的每一个MBAFF帧包括具有不同输出时间的偶和奇场，因此EL编码器20将每一个MABFF帧分成偶和奇场(S211)。EL编码器20将每一个MBAFF帧的偶场分量(例如，偶行)和奇场分量(例如，奇行)分别分成偶场和奇场。EL编码器20在垂直方向上执行基层的每一帧的子采样以构造分辨率减半的画面(S212)。该子采样可使用行子采样或各种其它已知的降采样方法中的一种。也可以按倒序执行帧分离(S211)和子采样(S212)。Fig. 21 shows an inter-layer texture prediction method for this case. As shown, each MBAFF frame of the enhancement layer includes even and odd fields with different output times, so the EL encoder 20 divides each MABFF frame into even and odd fields (S211). The EL encoder 20 separates even field components (eg, even lines) and odd field components (eg, odd lines) of each MBAFF frame into even fields and odd fields, respectively. The EL encoder 20 performs subsampling of each frame of the base layer in the vertical direction to construct a half-resolution picture (S212). This subsampling may use row subsampling or one of various other known downsampling methods. Frame separation (S211) and subsampling (S212) may also be performed in reverse order.

EL编码器20还可由MBAFF帧构造场(例如，偶场画面)，而不是将MBAFF帧分成两个场。这是因为两层具有相同的时间分辨率，因此从一个帧中分离出的两个场画面中仅有一个(而不是两者全都)在基层中具有可用于层间预测的相应帧。EL encoder 20 may also construct fields (eg, even field pictures) from MBAFF frames instead of splitting the MBAFF frame into two fields. This is because the two layers have the same temporal resolution, so only one of the two field pictures separated from a frame (not both) has a corresponding frame in the base layer that can be used for inter-layer prediction.

一旦完成这两个过程S211和S212，EL编码器20就基于基层中相应的经子采样的画面对从增强层的帧中分离出的场中的仅偶(奇)场执行已知的层间纹理预测(例如，帧-帧宏块间预测)(S213)。Once these two processes S211 and S212 are completed, the EL encoder 20 performs the known inter-layer Texture prediction (eg, frame-to-frame inter-macroblock prediction) (S213).

同样在这种情况下，可按与情况b)所述相同的方式对为其执行层间纹理预测的增强层的分离出的场执行层间运动预测。Also in this case, inter-layer motion prediction may be performed on the separated field of the enhancement layer for which inter-layer texture prediction is performed in the same manner as described in case b).

尽管以上的描述是就由图2a或2b的EL编码器20执行的层间预测操作给出的，但层间预测操作的所有描述可共同地适用于从基层接收经解码的信息并解码增强层流的EL解码器。在编码和解码程序中，上述的层间预测操作(包括用于分离、放大、和组合画面或宏块中的视频信号的操作)以相同的方式执行，但层间预测之后的操作以不同的方式执行。此差别的示例是：在执行运动和纹理预测之后，编码器编码预测出的信息或者预测出的信息与实际信息之间的差分信息，以便将其传送给解码器，而解码器通过将藉由执行与在编码器处所执行的相同的层间运动和纹理预测而获得的信息直接应用于当前宏块或通过另外使用实际接收到的宏块编解码信息来获得实际运动信息和纹理信息。本发明的以上从编码角度描述的详情和原理直接适用于解码接收到的两层的数据流的解码器。Although the above description is given in relation to the inter-layer prediction operation performed by the EL encoder 20 of Figure 2a or 2b, all descriptions of the inter-layer prediction operation are collectively applicable to receiving decoded information from the base layer and decoding the enhancement layer EL decoder for streams. In encoding and decoding procedures, the above-mentioned inter-layer prediction operations (including operations for separating, enlarging, and combining video signals in pictures or macroblocks) are performed in the same manner, but operations after inter-layer prediction are performed in different way to execute. An example of this difference is: After performing motion and texture prediction, the encoder encodes the predicted information or the difference between the predicted information and the actual information to transmit it to the decoder, and the decoder uses the The information obtained by performing the same inter-layer motion and texture prediction as performed at the encoder is directly applied to the current macroblock or the actual motion information and texture information are obtained by additionally using the actually received macroblock codec information. The details and principles of the present invention described above from an encoding perspective are directly applicable to a decoder decoding a received two-layer data stream.

然而，当EL编码器在如参考图20和21所述将增强层分离成场序列并执行层间预测之后以PAFF方式传送具有MBAFF帧的增强层时，解码器不对当前接收到的层执行上述将MBAFF帧分成场画面的程序。However, when an EL encoder transmits an enhancement layer with an MBAFF frame in a PAFF manner after separating the enhancement layer into field sequences and performing inter-layer prediction as described with reference to FIGS. 20 and 21 , the decoder does not perform the above-mentioned A program that divides MBAFF frames into field pictures.

此外，解码器然后从接收到信号解码出标识EL编码器20是已如图8d所示还是如图8h所示执行了宏块之间的层间纹理预测的标志′field_base_flag′。基于解码出的标志值，解码器确定宏块之间的预测是如图8d所示地执行还是如图8h所示地执行的，并根据此确定获取纹理预测信息。如果没有接收到标志′field_base_flag′，则EL解码器假设已接收具有“0”值的标志。即，EL解码器假设宏块之间的纹理预测是根据如图8d所示的方法执行的，并获得当前宏块对的预测信息以重建当前宏块或宏块对。Furthermore, the decoder then decodes from the received signal a flag 'field_base_flag' identifying whether the EL encoder 20 has performed inter-layer texture prediction between macroblocks as shown in Fig. 8d or as shown in Fig. 8h. Based on the decoded flag value, the decoder determines whether the prediction between macroblocks is performed as shown in FIG. 8d or as shown in FIG. 8h, and obtains texture prediction information based on this determination. If the flag 'field_base_flag' is not received, the EL decoder assumes that a flag with a value of "0" has been received. That is, the EL decoder assumes that the texture prediction between macroblocks is performed according to the method shown in Fig. 8d, and obtains the prediction information of the current macroblock pair to reconstruct the current macroblock or macroblock pair.

本发明的上述有限实施例中至少有一个甚至可在使用不同格式(或模式)的视频信号源时执行层间预测。因此，当编解码多个层时，可提高数据编码率，而不拘于视频信号的画面类型，诸如隔行信号、逐行信号、MBAFF帧画面、和场画面。此外，当两层之一是隔行视频信号源时，预测中使用的画面的图像可被构造成更类似于用于预测编解码的原始图像，从而提高数据编码率。At least one of the above-described limited embodiments of the present invention can perform inter-layer prediction even when video sources of different formats (or modes) are used. Therefore, when encoding and decoding multiple layers, the data encoding rate can be increased regardless of the picture type of the video signal, such as interlaced signal, progressive signal, MBAFF frame picture, and field picture. Furthermore, when one of the two layers is an interlaced video signal source, the image of the picture used in the prediction can be constructed to be more similar to the original image used for the prediction codec, thereby increasing the data encoding rate.

尽管已参考优选实施例描述了本发明，但对本领域的技术人员显而易见的是，可在本发明中进行各种改进、修改、替换、和增加而不会脱离本发明的范围和精神。因此，本发明旨在涵盖对本发明的改进、修改、替换、和增加，只要它们落在所附权利要求及其等效方案的范围内即可。Although the present invention has been described with reference to preferred embodiments, it will be apparent to those skilled in the art that various improvements, modifications, substitutions, and additions can be made in the present invention without departing from the scope and spirit of the invention. Thus, the present invention is intended to cover improvements, modifications, substitutions, and additions to this invention provided they come within the scope of the appended claims and their equivalents.

Claims

1. method that is used for when encoding/decoding video signal carrying out inter-layer motion prediction said method comprising the steps of:

A) from the derive movable information of single macro block of the right movable information that vertically adjoins the frame macro block of basic unit; And

B) with the movable information derived as the movable information of the field macro block in anterior layer or described field macro block in anterior layer to the information of forecasting of separately movable information.

2. the method for claim 1 is characterized in that, described step a) may further comprise the steps:

With described frame macro block to dwindling into single macro block, and based on the partition mode that obtains described single macro block according to the described partition mode that dwindles;

Related according to the amplification that described single macro block and described frame macro block are right, obtain the reference key of described single macro block based at least one reference key in 8 corresponding 8 x of 8 x in described frame macro block centering and the described single macro block, 16 zones; And

Related according to the amplification that described single macro block and described frame macro block are right, obtain the motion vector of described single macro block based at least one motion vector in 4 corresponding 4 x of 4 x in described frame macro block centering and the described single macro block, 8 zones.

3. the method for claim 1 is characterized in that, described step a) may further comprise the steps:

Related according to the amplification that described single macro block and described frame macro block are right, obtain the motion vector of described single macro block based at least one motion vector in 4 corresponding 4 x of 4 x, 8 zones of described frame macro block centering and described single macro block; And

Obtain the partition mode of described single macro block based on the reference key that is obtained and motion vector.

4. the method for claim 1, it is characterized in that, if described frame macro block is to comprising internal schema macro block and inter mode macro block, the movable information that then described step a) is included in described inter mode macro block is copied to after the movable information of described internal schema macro block, from the derive movable information of described single macro block of the right movable information of described frame macro block.

5. the method for claim 1, it is characterized in that, if described frame macro block is to comprising internal schema macro block and inter mode macro block, then described step a) is included in described internal schema macro block is considered as having after the inter mode macro block of 0 motion vector and 0 reference key, from the derive movable information of described single macro block of the right movable information of described frame macro block.

6. method that is used for when encoding/decoding video signal carrying out inter-layer motion prediction said method comprising the steps of:

A) from the movable information of single the macro block of basic unit or the movable information that vertically adjoins single right macro block of a macro block that is selected from described basic unit two macro blocks movable information separately of deriving; And

B) with the movable information separately derived as when the frame macro block of anterior layer information of forecasting to separately movable information.

7. method as claimed in claim 6 is characterized in that, described step a) may further comprise the steps:

Described single macro block is extended to two macro blocks, and obtains described two macro blocks partition mode separately based on the partition mode of expanding according to described expansion;

Related according to described single macro block with the amplification of described two macro blocks, by with in the reference key of 8 of 8 x in described single the macro block and described two macro blocks with described single field macro block in the reference key of 8 of 8 corresponding two 8 x of 8 x in each be associated, obtain 8 reference keys separately of described two 8 x; And

Related according to described single macro block with the amplification of described two macro blocks, by with in the motion vector of 4 of 4 x in described single the macro block and described two macro blocks with described single field macro block in the motion vector of 4 of 4 corresponding two 4 x of 4 x in each be associated, obtain 4 motion vectors separately of described two 4 x.

8. method as claimed in claim 7, it is characterized in that, the step of two macro blocks of described acquisition partition mode separately comprises and obtains described two macro blocks block mode separately as follows: the block mode that 4 x, 4,8 x 4 in described single the macro block and 16 x are 8 is configured to expand in vertical direction the block mode of twice, and other piece block mode separately is configured to two block modes of identical size.

9. method as claimed in claim 6 is characterized in that, described step a) may further comprise the steps:

Related according to described single macro block with the amplification of described two macro blocks, by with in the reference key of 8 of 8 x in described single the macro block and described two macro blocks with described single field macro block in the reference key of 8 of 8 corresponding two 8 x of 8 x in each be associated, obtain 8 reference keys separately of described two 8 x;

Related according to described single macro block with the amplification of described two macro blocks, by with in the motion vector of 4 of 4 x in described single the macro block and described two macro blocks with described single field macro block in the motion vector of 4 of 4 corresponding two 4 x of 4 x in each be associated, obtain 4 motion vectors separately of described two 4 x; And

Obtain described two macro blocks partition mode separately based on reference key that is obtained and motion vector.

10. method as claimed in claim 6, it is characterized in that, if described macro block is to comprising internal schema macro block and inter mode macro block, then described step a) comprises selects described inter mode macro block and with the movable information of selected inter mode macro block described two macro blocks movable information separately that is used to derive.

11. method as claimed in claim 6, it is characterized in that, described step a) comprises that the information of described single the macro block of copy is right with an other structure macro block, and from the macro block that constructed to described two macro blocks movable information separately of deriving of movable information separately.

12. a method that is used for carrying out inter-layer motion prediction when encoding/decoding video signal said method comprising the steps of:

A) confirm that two of basic unit adjoin vertically whether to have one in the macro block at least be internal schema; And

B) if confirm only have one to be internal schema in described two macro blocks, then the motion related information with inter mode macro block included in described two macro blocks is copied to described internal schema macro block described internal schema macro block is arranged to have the piece of movable information.

13. method as claimed in claim 12 is characterized in that, the motion related information that is copied is the information about motion vector and reference key.

14. method as claimed in claim 12 is characterized in that, and is further comprising the steps of:

After the step of the described piece of the internal schema macro block being arranged to have movable information, utilize the motion related information of described two macro blocks to determine the partition mode of described two macro blocks.

15. method as claimed in claim 12 is characterized in that, described two macro blocks that macro block is the frame component image.

16. method as claimed in claim 15 is characterized in that, the described motion related information of described two macro blocks is used to the right inter-layer motion prediction of macro block of field component image on the upper strata of described basic unit.