[go: up one dir, main page]

CN111699682A - Method and apparatus for encoding and decoding using selective information sharing between channels - Google Patents

Method and apparatus for encoding and decoding using selective information sharing between channels Download PDF

Info

Publication number
CN111699682A
CN111699682A CN201880088927.1A CN201880088927A CN111699682A CN 111699682 A CN111699682 A CN 111699682A CN 201880088927 A CN201880088927 A CN 201880088927A CN 111699682 A CN111699682 A CN 111699682A
Authority
CN
China
Prior art keywords
information
block
channel
transform
target block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201880088927.1A
Other languages
Chinese (zh)
Other versions
CN111699682B (en
Inventor
全东山
姜晶媛
高玄硕
林成昶
李镇浩
李河贤
全炳宇
金晖容
朴智允
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sungkyunkwan University School Industry Cooperation
Electronics and Telecommunications Research Institute ETRI
Original Assignee
Sungkyunkwan University School Industry Cooperation
Electronics and Telecommunications Research Institute ETRI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sungkyunkwan University School Industry Cooperation, Electronics and Telecommunications Research Institute ETRI filed Critical Sungkyunkwan University School Industry Cooperation
Priority claimed from PCT/KR2018/015573 external-priority patent/WO2019112394A1/en
Publication of CN111699682A publication Critical patent/CN111699682A/en
Application granted granted Critical
Publication of CN111699682B publication Critical patent/CN111699682B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • H04N19/463Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/119Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/12Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
    • H04N19/122Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/129Scanning of coding units, e.g. zig-zag scan of transform coefficients or flexible macroblock ordering [FMO]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/80Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
    • H04N19/82Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Discrete Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A video decoding method, a video decoding apparatus, a video encoding method, and a video encoding apparatus are disclosed. The encoding decision information of the representative channel of the target block is shared as the encoding decision information of the target channel of the target block, and decoding of the target block is performed using the encoding decision information of the target channel. Since the coding decision information of the representative channel is shared with other channels, it is possible to prevent the same coding decision information from being repeatedly signaled. By this prevention, the efficiency of encoding and decoding for the target block, and the like can be improved.

Description

用于使用通道之间的选择性信息共享进行编码和解码的方法 和设备Method for encoding and decoding using selective information sharing between channels and equipment

技术领域technical field

以下实施例总体上涉及一种视频解码方法和设备以及视频编码方法和设备,并且更具体地,涉及一种使用通道之间的选择性信息的共享的视频解码方法和设备以及视频编码方法和设备。The following embodiments relate generally to a video decoding method and apparatus and a video encoding method and apparatus, and more particularly, to a video decoding method and apparatus and video encoding method and apparatus using sharing of selective information between channels .

背景技术Background technique

随着信息与通信行业的持续发展,支持高清晰度(HD)分辨率的广播服务已经在全世界普及。通过这种普及,大量用户已经习惯了高分辨率和高清晰度图像和/或视频。With the continuous development of the information and communication industry, broadcasting services supporting high definition (HD) resolution have become widespread throughout the world. Through this popularity, a large number of users have become accustomed to high-resolution and high-definition images and/or videos.

为了满足用户对高清晰度的需求,大量机构已加速了对下一代成像装置的开发。除了高清TV(HDTV)和全高清(FHD)TV之外,用户对UHD TV的兴趣也已增加,其中,UHD TV的分辨率是全高清(FHD)TV的分辨率的四倍以上。随着其兴趣的增加,不断地需要针对具有更高分辨率和更高清晰度的图像的图像编码/解码技术。In order to meet users' demands for high definition, a large number of institutions have accelerated the development of next-generation imaging devices. In addition to high definition TV (HDTV) and full high definition (FHD) TV, user interest in UHD TV has also increased, wherein the resolution of UHD TV is more than four times that of full high definition (FHD) TV. As its interest increases, there is a constant need for image encoding/decoding techniques for images with higher resolution and higher definition.

图像编码/解码设备和方法可使用帧间预测技术、帧内预测技术、熵编码技术等,以便对高分辨率和高清晰度图像执行编码/解码。帧间预测技术可以是一种用于使用时间上在前的画面和/或时间上在后的画面对目标画面中包括的像素的值进行预测的技术。帧内预测技术可以是一种用于使用关于目标画面中的像素的信息对目标画面中包括的像素的值进行预测的技术。熵编码技术可以是一种用于将短码字分配给频繁出现的符号并且将长码字分配给很少出现的符号的技术。The image encoding/decoding apparatus and method may use an inter-frame prediction technique, an intra-frame prediction technique, an entropy encoding technique, etc., in order to perform encoding/decoding on high-resolution and high-definition images. The inter prediction technique may be a technique for predicting values of pixels included in a target picture using a temporally preceding picture and/or a temporally succeeding picture. The intra prediction technique may be a technique for predicting values of pixels included in a target picture using information about the pixels in the target picture. An entropy coding technique may be a technique for assigning short codewords to frequently occurring symbols and long codewords to infrequently occurring symbols.

近来,在各种应用领域中对能够提供高分辨率、进一步拓宽的颜色空间和优异的图像质量的高质量图像(诸如超高清(UHD)图像)的需求已经增加。随着图像趋向于更高分辨率和更高质量的图像,为了提供图像所需的图像数据量可能增加到超过现有图像数据量。在通过通信介质(诸如有线/无线宽带线路)或各种广播介质(诸如卫星、地面波、互联网协议(IP)网络、无线网络、电缆或移动通信网络)发送图像数据的情况下,或者在将图像数据存储在各种类型的存储介质(诸如光盘(CD)、数字通用盘(DVD)、通用串行总线(USB)介质和高清晰度(HD)-DVD)中的情况下,随着图像数据量的增加,传输成本和存储成本增加。Recently, there has been an increase in demand for high-quality images, such as Ultra High Definition (UHD) images, capable of providing high resolution, a further widened color space, and excellent image quality in various application fields. As images move toward higher resolution and higher quality images, the amount of image data required to provide an image may increase beyond the existing amount of image data. In the case of transmitting image data through a communication medium such as a wired/wireless broadband line or various broadcasting media such as satellite, ground wave, Internet Protocol (IP) network, wireless network, cable or mobile communication network, or in the case of In the case where image data is stored in various types of storage media such as compact disc (CD), digital versatile disc (DVD), universal serial bus (USB) media, and high definition (HD)-DVD, with the image As the amount of data increases, transmission costs and storage costs increase.

随着高分辨率和高质量图像的使用,为了解决图像数据中的不可避免的且更严重的问题并提供具有更高分辨率和更高图像质量的图像,需要高效的图像编码/解码技术。With the use of high-resolution and high-quality images, in order to solve inevitable and more serious problems in image data and provide images with higher resolution and higher image quality, efficient image encoding/decoding techniques are required.

发明内容SUMMARY OF THE INVENTION

技术问题technical problem

实施例旨在提供一种使用通道之间的选择性信息的共享的编码设备和方法以及解码设备和方法。Embodiments aim to provide an encoding apparatus and method and a decoding apparatus and method using sharing of selective information between channels.

技术方案Technical solutions

根据一方面,提供了一种解码方法,包括:共享目标块的代表性通道的编码决策信息作为目标块的目标通道的编码决策信息;并且对使用所述目标通道的编码决策信息的目标块执行解码。According to one aspect, there is provided a decoding method, comprising: sharing encoding decision information of a representative channel of a target block as encoding decision information of a target channel of the target block; and performing on the target block using the encoding decision information of the target channel decoding.

所述解码方法还可包括:接收包括关于目标块的信息的比特流。The decoding method may further include receiving a bitstream including information on the target block.

所述关于目标块的信息可包括所述代表性通道的编码决策信息。The information about the target block may include encoding decision information for the representative channel.

所述关于目标块的信息可不包括所述目标通道的编码决策信息。The information about the target block may not include coding decision information of the target channel.

所述代表性通道的编码决策信息可以是指示是否将跳过变换的变换跳过信息。The encoding decision information of the representative channel may be transform skip information indicating whether the transform will be skipped.

所述代表性通道的编码决策信息可指示哪种变换将被用于通道的变换块。The encoding decision information for the representative channel may indicate which transform is to be used for the channel's transform blocks.

所述代表性通道的编码决策信息可以是所述代表性通道的帧内编码决策信息。The encoding decision information of the representative channel may be intra-frame encoding decision information of the representative channel.

所述代表性通道和所述目标通道可以是YCbCr颜色空间中的通道。The representative channel and the target channel may be channels in the YCbCr color space.

所述代表性通道可以是亮度通道。The representative channel may be a luminance channel.

所述目标通道可以是色度通道。The target channel may be a chrominance channel.

所述代表性通道可以是与亮度信号具有最高相关性的颜色通道。The representative channel may be the color channel with the highest correlation with the luminance signal.

所述代表性通道可通过比特流中的指示所选择的代表性通道的索引来确定。The representative channel may be determined by an index in the bitstream indicating the selected representative channel.

当目标块的多个通道的图像属性彼此相似时,可执行所述共享操作。The sharing operation may be performed when the image attributes of the plurality of channels of the target block are similar to each other.

当目标块的色度通道的帧内预测模式是直接模式时,所述多个通道的图像属性可被确定为彼此相似。When the intra prediction mode of the chroma channel of the target block is the direct mode, the image properties of the plurality of channels may be determined to be similar to each other.

当使用跨通道预测时,可执行所述共享操作。The sharing operation may be performed when cross-channel prediction is used.

可基于从比特流获取的信息来推导是否使用跨通道预测。Whether to use cross-channel prediction may be derived based on information obtained from the bitstream.

当使用跨通道预测时,可执行所述共享操作。The sharing operation may be performed when cross-channel prediction is used.

可基于目标块的帧内预测模式来确定是否使用跨通道预测。Whether to use cross-channel prediction may be determined based on the intra prediction mode of the target block.

当目标块的帧内预测模式是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个时,可使用跨通道预测。When the intra prediction mode of the target block is one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode, cross-channel prediction may be used.

可基于目标块的尺寸来确定是否将执行共享。Whether sharing is to be performed may be determined based on the size of the target block.

目标块的多个通道中的代表性通道的编码决策信息可被用于所述多个通道中的所有通道。The encoding decision information of a representative channel of the plurality of channels of the target block may be used for all channels of the plurality of channels.

根据另一方面,提供了一种编码方法,包括:确定目标块的代表性通道的编码决策信息;并且对使用所述代表性通道的编码决策信息的目标块执行编码,其中,所述代表性通道的编码决策信息与目标块的另外通道共享。According to another aspect, there is provided an encoding method comprising: determining encoding decision information of a representative channel of a target block; and performing encoding on a target block using the encoding decision information of the representative channel, wherein the representative channel The encoding decision information of the channel is shared with the other channels of the target block.

所述编码方法还可包括:生成包括关于目标块的信息的比特流。The encoding method may further include generating a bitstream including information on the target block.

所述关于目标块的信息可包括所述代表性通道的编码决策信息。The information about the target block may include encoding decision information for the representative channel.

所述关于目标块的信息可不包括所述另外通道的编码决策信息。The information on the target block may not include coding decision information for the additional channel.

所述代表性通道和所述另外通道可以是YCbCr颜色空间中的通道。The representative channel and the additional channel may be channels in the YCbCr color space.

根据另一方面,提供了一种存储用于图像解码的比特流的计算机可读存储介质,所述比特流包括关于目标块的信息,其中,所述关于目标块的信息包括目标块的代表性通道的编码决策信息,其中,所述目标块的代表性通道的编码决策信息被使用并被共享作为目标块的目标通道的编码决策信息,并且其中,使用所述目标通道的编码决策信息来执行目标块的解码。According to another aspect, there is provided a computer-readable storage medium storing a bitstream for image decoding, the bitstream including information about a target block, wherein the information about the target block includes a representation of the target block encoding decision information of a channel, wherein encoding decision information of a representative channel of the target block is used and shared as encoding decision information of a target channel of a target block, and wherein the encoding decision information of the target channel is used to perform Decoding of the target block.

有益效果beneficial effect

提供了一种使用通道之间的选择性信息的共享的编码设备和方法以及解码设备和方法。Provided are an encoding apparatus and method and a decoding apparatus and method using sharing of selectivity information between channels.

附图说明Description of drawings

图1是示出被应用了本公开的编码设备的实施例的配置的框图;1 is a block diagram showing the configuration of an embodiment of an encoding apparatus to which the present disclosure is applied;

图2是示出被应用了本公开的解码设备的实施例的配置的框图;2 is a block diagram showing a configuration of an embodiment of a decoding apparatus to which the present disclosure is applied;

图3是示意性地示出当图像被编码和解码时图像的分区结构的示图;3 is a diagram schematically showing a partition structure of an image when the image is encoded and decoded;

图4是示出编码单元(CU)能够包括的预测单元(PU)的形式的示图;4 is a diagram illustrating a form of a prediction unit (PU) that a coding unit (CU) can include;

图5是示出能够被包括在CU中的变换单元(TU)的形式的示图;5 is a diagram illustrating the form of a transform unit (TU) that can be included in a CU;

图6示出根据示例的块的划分;Figure 6 shows the division of blocks according to an example;

图7是用于解释帧内预测过程的实施例的示图;7 is a diagram for explaining an embodiment of an intra prediction process;

图8是用于解释帧内预测过程中使用的参考样点的位置的示图;8 is a diagram for explaining the positions of reference samples used in the intra prediction process;

图9是用于解释帧间预测过程的实施例的示图;9 is a diagram for explaining an embodiment of an inter prediction process;

图10示出根据实施例的空间候选;Figure 10 illustrates a spatial candidate according to an embodiment;

图11示出根据实施例的将空间候选的运动信息添加到合并列表的顺序;11 illustrates the order of adding motion information of spatial candidates to a merge list, according to an embodiment;

图12示出根据示例的变换和量化处理;Figure 12 illustrates transform and quantization processing according to an example;

图13示出根据示例的对角线扫描;Figure 13 shows a diagonal scan according to an example;

图14示出根据示例的水平扫描;Figure 14 shows a horizontal scan according to an example;

图15示出根据示例的垂直扫描;Figure 15 shows vertical scanning according to an example;

图16是根据实施例的编码设备的配置图;16 is a configuration diagram of an encoding apparatus according to an embodiment;

图17是根据实施例的解码设备的配置图;17 is a configuration diagram of a decoding apparatus according to an embodiment;

图18是根据实施例的用于对编码决策信息进行解码的方法的流程图;18 is a flowchart of a method for decoding encoding decision information, according to an embodiment;

图19是根据实施例的用于确定是否将跳过变换的解码方法的流程图;19 is a flowchart of a decoding method for determining whether a transform is to be skipped, according to an embodiment;

图20是根据实施例的用于参照帧内模式来确定是否将跳过变换的解码方法的流程图;20 is a flowchart of a decoding method for determining whether a transform is to be skipped with reference to an intra-mode, according to an embodiment;

图21是根据实施例的用于共享变换选择信息的方法的流程图;21 is a flowchart of a method for sharing transform selection information, according to an embodiment;

图22示出根据示例的单树块分区结构;Figure 22 shows a single tree block partition structure according to an example;

图23示出根据示例的双树块分区结构;Figure 23 shows a dual tree block partition structure according to an example;

图24示出根据示例的用于基于对应区域中的位置来指定对应块的方案;24 illustrates a scheme for specifying a corresponding block based on a location in a corresponding region, according to an example;

图25示出根据示例的用于基于对应区域中的面积来指定对应块的方案;25 illustrates a scheme for specifying corresponding blocks based on areas in corresponding regions, according to an example;

图26示出根据示例的用于基于对应区域中的面积来指定对应块的另一方案;26 illustrates another scheme for specifying corresponding blocks based on areas in corresponding regions, according to an example;

图27示出根据示例的用于基于对应区域中的块的形式来指定对应块的方案;27 illustrates a scheme for specifying a corresponding block based on the form of the block in the corresponding region, according to an example;

图28示出根据示例的用于基于对应区域中的块的形式来指定对应块的另一方案;28 illustrates another scheme for specifying a corresponding block based on the form of the block in the corresponding region, according to an example;

图29示出根据示例的用于基于对应区域中的块的高宽比来指定对应块的方案;29 illustrates a scheme for specifying a corresponding block based on an aspect ratio of a block in a corresponding region, according to an example;

图30示出根据示例的用于基于对应区域中的块的高宽比来指定对应块的另一方案;30 illustrates another scheme for specifying a corresponding block based on an aspect ratio of a block in a corresponding region, according to an example;

图31示出根据示例的用于基于对应区域中的块的编码特征来指定对应块的方案;31 illustrates a scheme for specifying corresponding blocks based on coding characteristics of blocks in corresponding regions, according to an example;

图32是根据实施例的编码方法的流程图;以及Figure 32 is a flowchart of an encoding method according to an embodiment; and

图33是根据实施例的解码方法的流程图。33 is a flowchart of a decoding method according to an embodiment.

具体实施方式Detailed ways

本发明可被各种改变,并且可具有各种实施例,下面将参照附图详细地描述特定实施例。然而,应理解,这些实施例不意图将本发明限制为特定的公开形式,它们包括本发明的精神和范围内所包括的所有变化、等同形式或修改形式。The present invention may be variously modified and may have various embodiments, and specific embodiments will be described in detail below with reference to the accompanying drawings. It should be understood, however, that these examples are not intended to limit the invention to the particular disclosed forms, and that they include all changes, equivalents, or modifications included within the spirit and scope of the invention.

将参照示出特定实施例的附图对以下示例性实施例进行详细描述。描述这些实施例,使得本公开所属技术领域的普通技术人员能够容易地实践这些实施例。应注意,各种实施例彼此不同,但不需要彼此互斥。例如,在此描述的特定形状、结构和特性可在不脱离与一个实施例相关的多个实施例的精神和范围的情况下被实现为其它实施例。此外,应理解,在不脱离实施例的精神和范围的情况下,能够改变在每个公开的实施例中的各个组件的位置或布置。因此,所附的详细描述并非旨在限制本公开的范围,并且示例性实施例的范围仅由所附的权利要求及其等同物(只要它们被适当地描述)限定。The following exemplary embodiments will be described in detail with reference to the accompanying drawings that illustrate specific embodiments. These embodiments are described so that those of ordinary skill in the art to which this disclosure pertains can easily practice the embodiments. It should be noted that the various embodiments differ from each other, but need not be mutually exclusive. For example, the specific shapes, structures, and characteristics described herein can be implemented in other embodiments without departing from the spirit and scope of the various embodiments associated with one embodiment. Furthermore, it is to be understood that the location or arrangement of the various components in each disclosed embodiment can be changed without departing from the spirit and scope of the embodiments. Therefore, the appended detailed description is not intended to limit the scope of the present disclosure, and the scope of the exemplary embodiments is limited only by the appended claims and their equivalents, so long as they are appropriately described.

在附图中,相似的参考标号被用于在各个方面指定相同或相似的功能。附图中的组件的形状、尺寸等可被夸大以使得描述清楚。In the drawings, like reference numerals are used to designate the same or similar functions in various respects. The shapes, sizes, etc. of components in the drawings may be exaggerated for clarity of description.

诸如“第一”和“第二”的术语可用于描述各种组件,但是组件不受所述术语限制。所述术语仅用于将一个组件与另一个组件区分开。例如,在不脱离本说明书的范围的情况下,第一组件可被称为第二组件。类似地,第二组件可被称为第一组件。术语“和/或”可包括多个相关描述项的组合或者多个相关描述项中的任意一个。Terms such as "first" and "second" may be used to describe various components, but the components are not limited by the terms. The terms are only used to distinguish one component from another. For example, a first component could be termed a second component without departing from the scope of this specification. Similarly, the second component may be referred to as the first component. The term "and/or" can include a combination of multiple associated descriptive items or any one of the multiple associated descriptive items.

将理解的是,当组件被称为“连接”或“结合”到另一组件时,所述两个组件可彼此直接连接或结合,或者在所述两个组件之间可存在中间组件。将理解的是,当组件被称为“直接连接或结合”时,在所述两个组件之间不存在中间组件。It will be understood that when a component is referred to as being "connected" or "coupled" to another component, the two components can be directly connected or coupled to each other or intervening components may be present between the two components. It will be understood that when components are referred to as being "directly connected or coupled," there are no intervening components between the two components.

此外,在实施例中描述的组件被独立地示出以表示不同的特征功能,但是这并不意味着每个组件由一个单独的硬件或软件形成。也就是说,为了方便描述,多个组件被单独地布置和包括。例如,多个组件中的至少两个组件可被集成为单个组件。相反,一个组件可被划分为多个组件。只要不脱离本说明书的本质,多个组件被集成的实施例或一些组件被分离的实施例被包括在本说明书的范围中。Furthermore, the components described in the embodiments are shown independently to represent different feature functions, but this does not mean that each component is formed by a separate piece of hardware or software. That is, for convenience of description, a plurality of components are individually arranged and included. For example, at least two of the plurality of components may be integrated into a single component. Conversely, a component can be divided into multiple components. Embodiments in which a plurality of components are integrated or embodiments in which some components are separated are included in the scope of the present specification as long as they do not depart from the essence of the present specification.

此外,应注意,在示例性实施例中,描述组件“包括”特定组件的表述意指另外的组件可被包括在示例性实施例的实践或技术精神的范围内,但是不排除存在除了所述特定组件之外的组件。Furthermore, it should be noted that, in an exemplary embodiment, the expression describing that a component "includes" a particular component means that additional components may be included within the scope of the practical or technical spirit of the exemplary embodiment, but does not preclude the existence of Components other than a specific component.

在本说明书中使用的术语仅用于描述特定实施例,并不意图限制本发明。单数表述包括复数表述,除非在上下文中具体指出相反的描述。在本说明书中,应理解,诸如“包括”或“具有”的术语仅意图指示存在特征、数字、步骤、操作、组件、部件或它们的组合,而不意图排除将存在或添加一个或更多个其它特征、数字、步骤、操作、组件、部件或它们的组合的可能。The terms used in this specification are used to describe specific embodiments only, and are not intended to limit the present invention. Singular expressions include plural expressions unless the context specifically indicates the contrary. In this specification, it should be understood that terms such as "comprising" or "having" are only intended to indicate the presence of features, numbers, steps, operations, components, parts or combinations thereof, and are not intended to preclude the presence or addition of one or more the possibility of other features, numbers, steps, operations, components, parts or combinations thereof.

下面将参照附图来详细描述实施例,使得实施例所属技术领域的普通技术人员能够容易地实践实施例。在实施例的以下描述中,被视为使本说明书的要点模糊的公知功能或配置的详细描述将被省略。此外,相同的参考标号在整个附图中用于指定相同的组件,并且将省略对相同组件的重复描述。The embodiments will be described in detail below with reference to the accompanying drawings so that those skilled in the art to which the embodiments pertain can easily practice the embodiments. In the following description of the embodiments, detailed descriptions of well-known functions or configurations that are considered to obscure the gist of the present specification will be omitted. Also, the same reference numerals are used throughout the drawings to designate the same components, and repeated descriptions of the same components will be omitted.

在下文中,“图像”可表示构成视频的单个画面,或者可表示视频本身。例如,“对图像的编码和/或解码”可表示“对视频的编码和/或解码”,并且也可表示“对构成视频的多个图像中的任意一个图像的编码和/或解码”。Hereinafter, "image" may refer to a single picture that constitutes a video, or may refer to the video itself. For example, "encoding and/or decoding an image" may mean "encoding and/or decoding a video", and may also mean "encoding and/or decoding any one of a plurality of images constituting a video".

在下文中,术语“视频”和“运动画面”可被用于具有相同的含义,并且可彼此互换使用。Hereinafter, the terms "video" and "moving picture" may be used to have the same meaning and may be used interchangeably with each other.

在下文中,目标图像可以是作为将被编码的目标的编码目标图像和/或作为将被解码的目标的解码目标图像。此外,目标图像可以是被输入到编码设备的输入图像或者被输入到解码设备的输入图像。Hereinafter, the target image may be an encoding target image that is a target to be encoded and/or a decoding target image that is a target to be decoded. Also, the target image may be an input image input to an encoding device or an input image input to a decoding device.

在下文中,术语“图像”、“画面”、“帧”和“屏幕”可被用于具有相同的含义,并且可彼此互换使用。Hereinafter, the terms "image", "picture", "frame" and "screen" may be used to have the same meaning and may be used interchangeably with each other.

在下文中,目标块可以是编码目标块(即,将被编码的目标)和/或解码目标块(即,将被解码的目标)。此外,目标块可以是当前块,即,当前将被编码和/或解码的目标。这里,术语“目标块”和“当前块”可被用于具有相同的含义,并且可彼此互换使用。Hereinafter, the target block may be an encoding target block (ie, a target to be encoded) and/or a decoding target block (ie, a target to be decoded). Furthermore, the target block may be the current block, ie the target currently to be encoded and/or decoded. Here, the terms "target block" and "current block" may be used to have the same meaning and may be used interchangeably with each other.

在下文中,术语“块”和“单元”可被用于具有相同的含义,并且可彼此互换使用。可选地,“块”可表示特定单元。Hereinafter, the terms "block" and "unit" may be used to have the same meaning and may be used interchangeably with each other. Alternatively, a "block" may refer to a specific unit.

在下文中,术语“区域”和“片段”可彼此互换使用。In the following, the terms "region" and "fragment" are used interchangeably with each other.

在下文中,特定信号可以是指示特定块的信号。例如,原始信号可以是指示目标块的信号。预测信号可以是指示预测块的信号。残差信号可以是指示残差块的信号。Hereinafter, the specific signal may be a signal indicating a specific block. For example, the original signal may be a signal indicating the target block. The prediction signal may be a signal indicating a prediction block. The residual signal may be a signal indicative of a residual block.

在以下的实施例中,特定的信息、数据、标志、元素和属性可具有它们各自的值。与所述信息、数据、标志、元素和属性中的每一个对应的值“0”可指示逻辑假或第一预定义值。换句话说,值“0”、假、逻辑假和第一预定义值可彼此互换使用。与所述信息、数据、标志、元素和属性中的每一个对应的值“1”可指示逻辑真或第二预定义值。换句话说,值“1”、真、逻辑真和第二预定义值可彼此互换使用。In the following embodiments, certain information, data, flags, elements and attributes may have their respective values. A value of "0" corresponding to each of the information, data, flags, elements and attributes may indicate a logical false or a first predefined value. In other words, the value "0", false, logical false and the first predefined value can be used interchangeably with each other. A value of "1" corresponding to each of the information, data, flags, elements and attributes may indicate logical true or a second predefined value. In other words, the value "1", true, logical true and the second predefined value can be used interchangeably with each other.

当诸如i或j的变量被用于指示行、列或索引时,值i可以是整数0或大于0的整数,或者可以是整数1或大于1的整数。换句话说,在实施例中,行、列和索引中的每一个可从0开始计数,或者可以从1开始计数。When a variable such as i or j is used to denote a row, column, or index, the value i may be an integer of 0 or greater, or an integer of 1 or greater. In other words, in an embodiment, each of the row, column, and index may be counted from 0, or may be counted from 1.

下面,将描述将在实施例中使用的术语。Hereinafter, terms to be used in the embodiments will be described.

编码器:编码器表示用于执行编码的装置。Encoder: An encoder represents a device for performing encoding.

解码器:解码器表示用于执行解码的装置。Decoder: A decoder represents a device for performing decoding.

单元:“单元”可表示图像编码和解码的单元。术语“单元”和“块”可被用于具有相同的含义,并且可彼此互换使用。Unit: A "unit" may represent a unit of image encoding and decoding. The terms "unit" and "block" may be used to have the same meaning and may be used interchangeably with each other.

–“单元”可以是M×N样点阵列。M和N可分别是正整数。术语“单元”通常可表示二维(2D)样点阵列。– A "cell" can be an array of MxN samples. M and N may each be a positive integer. The term "cell" may generally refer to a two-dimensional (2D) array of samples.

–在图像的编码和解码过程中,“单元”可以是通过对一个图像进行分区而生成的区域。换句话说,“单元”可以是在一个图像中指定的区域。单个图像可被分区为多个单元。可选地,一个图像可被分区为子部分,并且单元可表示在对分区出的子部分执行编码或解码时每个分区出的子部分。– During the encoding and decoding of images, a "unit" can be an area generated by partitioning an image. In other words, a "cell" can be a designated area in an image. A single image can be partitioned into multiple cells. Alternatively, a picture may be partitioned into sub-portions, and a unit may represent each partitioned sub-portion when encoding or decoding is performed on the partitioned sub-portions.

–在图像的编码和解码过程中,可根据单元的类型对每个单元执行预定义的处理。– During the encoding and decoding of images, pre-defined processing can be performed on each cell according to the cell type.

–根据功能,单元类型可被分类为宏单元、编码单元(CU)、预测单元(PU)、残差单元、变换单元(TU)等。可选地,根据功能,单元可表示块、宏块、编码树单元、编码树块、编码单元、编码块、预测单元、预测块、残差单元、残差块、变换单元、变换块等。- Unit types can be classified into macrounits, coding units (CUs), prediction units (PUs), residual units, transform units (TUs), etc., according to function. Alternatively, depending on the function, a unit may represent a block, a macroblock, a coding tree unit, a coding tree block, a coding unit, a coding block, a prediction unit, a prediction block, a residual unit, a residual block, a transform unit, a transform block, or the like.

–术语“单元”可表示包括亮度(luma)分量块、与亮度分量块对应的色度(chroma)分量块、以及用于各个块的语法元素的信息使得单元被指定为与块区分开。- The term "unit" may denote information including a luma component block, a chroma component block corresponding to the luma component block, and syntax elements for each block such that the unit is designated as distinct from the block.

–单元的尺寸和形状可被不同地实现。此外,单元可具有各种尺寸和形状中的任意一种。具体地,单元的形状不仅可包括正方形,还可以包括可以以二维(2D)表示的几何形状(诸如,矩形、梯形、三角形和五边形)。- The size and shape of the cells can be implemented differently. Furthermore, the cells may have any of a variety of sizes and shapes. Specifically, the shape of the cells may include not only squares, but also geometrical shapes (such as rectangles, trapezoids, triangles, and pentagons) that can be represented in two dimensions (2D).

此外,单元信息可包括单元的类型、单元的尺寸、单元的深度、单元的编码顺序和单元的解码顺序等中的一个或更多个。例如,单元的类型可指示CU、PU、残差单元和TU中的一个。Also, the unit information may include one or more of the type of the unit, the size of the unit, the depth of the unit, the encoding order of the unit, the decoding order of the unit, and the like. For example, the type of unit may indicate one of CU, PU, residual unit, and TU.

–一个单元可被分区为子单元,每个子单元具有比相关单元的尺寸更小的尺寸。- A unit may be partitioned into subunits, each subunit having a smaller size than the size of the associated unit.

–深度:深度可表示单元被分区的程度。此外,单元深度可指示当以树结构表示单元时对应单元存在的等级。– Depth: Depth may represent the extent to which the cell is partitioned. Furthermore, the cell depth may indicate the level at which the corresponding cell exists when the cell is represented in a tree structure.

–单元分区信息可包括指示单元的深度的深度。深度可指示单元被分区的次数和/或单元被分区的程度。- The cell partition information may include a depth indicating the depth of the cell. The depth may indicate the number of times the cell is partitioned and/or the degree to which the cell is partitioned.

–在树结构中,可认为根节点的深度最小并且叶节点的深度最大。– In a tree structure, it can be considered that the depth of the root node is the smallest and the depth of the leaf nodes is the largest.

–单个单元可被分层分区为多个子单元,同时所述单个单元具有基于树结构的深度信息。换句话说,单元和通过对该单元进行分区而生成的子单元可分别对应于节点和该节点的子节点。每个被分区出的子单元可具有单元深度。由于深度指示单元被分区的次数和/或单元被分区的程度,因此子单元的分区信息可包括关于所述子单元的尺寸的信息。- A single unit can be hierarchically partitioned into multiple sub-units, while the single unit has depth information based on a tree structure. In other words, a unit and subunits generated by partitioning the unit may correspond to a node and a child node of the node, respectively. Each partitioned subunit may have a unit depth. Since the depth indicates the number of times the unit is partitioned and/or the degree to which the unit is partitioned, the partition information for a subunit may include information about the size of the subunit.

–在树结构中,顶部节点可对应于进行分区之前的初始节点。顶部节点可被称为“根节点”。此外,根节点可具有最小深度值。这里,顶部节点的深度可为等级“0”。– In a tree structure, the top node may correspond to the initial node before partitioning. The top node may be referred to as the "root node". Also, the root node may have a minimum depth value. Here, the depth of the top node may be level "0".

–深度为等级“1”的节点可表示在初始单元被分区一次时所生成的单元。深度为等级“2”的节点可表示在初始单元被分区两次时所生成的单元。- Nodes with a depth of level "1" may represent cells that were generated when the initial cell was partitioned once. Nodes with a depth of level "2" may represent cells that were generated when the original cell was partitioned twice.

–深度为等级“n”的叶节点可表示在初始单元被分区n次时所生成的单元。- A leaf node with a depth of level "n" may represent a cell generated when the original cell is partitioned n times.

–叶节点可以是不能被进一步分区的底部节点。叶节点的深度可以是最大等级。例如,针对最大等级的预定义值可以是3。– Leaf nodes can be bottom nodes that cannot be further partitioned. The depth of a leaf node can be a maximum level. For example, the predefined value for the maximum level may be 3.

–QT深度可表示针对四分区的深度。BT深度可表示针对二分区的深度。TT深度可表示针对三分区的深度。-QT depth may represent the depth for the quad partition. BT depth may represent the depth for two partitions. TT depth may represent the depth for three partitions.

–样点:样点可以是构成块的基本单元。可用从根据比特深度(Bd)的0到2Bd-1的值来表示样点。– Sample: A sample can be the basic unit that makes up a block. A sample can be represented by a value from 0 to 2 Bd -1 according to the bit depth (Bd).

–样点可以是像素或像素值。– Samples can be pixels or pixel values.

–在下文中,术语“像素”和“样点”可被用于具有相同含义,并且可彼此互换使用。- In the following, the terms "pixel" and "sample" may be used to have the same meaning and may be used interchangeably with each other.

编码树单元(CTU):CTU可由单个亮度分量(Y)编码树块和与亮度分量编码树块相关的两个色度分量(Cb,Cr)编码树块构成。此外,CTU可表示包括上述块以及用于每个块的语法元素的信息。Coding Tree Unit (CTU): A CTU may consist of a single luma component (Y) coding tree block and two chroma component (Cb, Cr) coding tree blocks associated with the luma component coding tree block. Also, the CTU may represent information including the above-described blocks and syntax elements for each block.

–可使用一个或更多个分区方法(诸如四叉树(QT)、二叉树(BT)和三叉树(TT))对每个编码树单元(CTU)进行分区,以便配置子单元,诸如编码单元、预测单元和变换单元。此外,可使用一个或更多个分区方法,利用多类型树对每个编码树单元进行分区。– Each coding tree unit (CTU) may be partitioned using one or more partitioning methods, such as quad-tree (QT), binary-tree (BT), and ternary-tree (TT), in order to configure subunits, such as coding units , prediction unit and transform unit. Furthermore, each coding tree unit may be partitioned with a multi-type tree using one or more partitioning methods.

–“CTU”可被用作指定在图像解码和编码处理中(如在对输入图像进行分区的情况下)作为处理单元的像素块的术语。- "CTU" may be used as a term to designate a block of pixels that is a processing unit in image decoding and encoding processes, such as in the case of partitioning an input image.

编码树块(CTB):“CTB”可被用作指定Y编码树块、Cb编码树块和Cr编码树块中的任意一个的术语。Coding Tree Block (CTB): "CTB" may be used as a term to designate any one of a Y coding tree block, a Cb coding tree block, and a Cr coding tree block.

邻近块:邻近块(或邻近的块)可表示与目标块相邻的块。邻近块可表示重建邻近块。Neighboring Blocks: Neighboring blocks (or neighboring blocks) may represent blocks that are adjacent to the target block. Neighboring blocks may represent reconstructed neighboring blocks.

在下文中,术语“邻近块”和“相邻块”可被用于具有相同的含义,并且可彼此互换地使用。Hereinafter, the terms "adjacent block" and "adjacent block" may be used to have the same meaning and may be used interchangeably with each other.

空间邻近块:空间邻近块可以是在空间上与目标块相邻的块。邻近块可包括空间邻近块。Spatial Neighboring Block: A spatially neighboring block may be a block that is spatially adjacent to the target block. Neighboring blocks may include spatially neighboring blocks.

–目标块和空间邻近块可被包括在目标画面中。- The target block and spatially adjacent blocks may be included in the target picture.

–空间邻近块可表示边界与目标块接触的块或者位于距目标块预定距离内的块。- A spatially adjacent block may represent a block whose boundary is in contact with the target block or a block located within a predetermined distance from the target block.

–空间邻近块可表示与目标块的顶点相邻的块。这里,与目标块的顶点相邻的块可表示与水平相邻于目标块的邻近块垂直相邻的块或者与垂直相邻于目标块的邻近块水平相邻的块。- A spatially adjacent block may represent a block adjacent to a vertex of the target block. Here, the block adjacent to the vertex of the target block may represent a block that is vertically adjacent to a neighboring block that is horizontally adjacent to the target block or a block that is horizontally adjacent to a neighboring block that is vertically adjacent to the target block.

时间邻近块:时间邻近块可以是在时间上与目标块相邻的块。邻近块可包括时间邻近块。Temporal Neighboring Block: A temporally neighboring block may be a block that is temporally adjacent to the target block. The adjacent blocks may include temporally adjacent blocks.

–时间邻近块可包括同位块(col块)。- Temporally adjacent blocks may include co-located blocks (col blocks).

–col块可以是先前重建的同位画面(col画面)中的块。col块在col画面中的位置可与目标块在目标画面中的位置对应。可选地,col块在col画面中的位置可等于目标块在目标画面中的位置。col画面可以是参考画面列表中包括的画面。The -col block may be a block in a previously reconstructed co-located picture (col picture). The position of the col block in the col picture may correspond to the position of the target block in the target picture. Alternatively, the position of the col block in the col picture may be equal to the position of the target block in the target picture. The col picture may be a picture included in the reference picture list.

–时间邻近块可以是在时间上与目标块的空间邻近块相邻的块。- A temporally adjacent block may be a block that is temporally adjacent to a spatially adjacent block of the target block.

预测单元:预测单元可以是用于预测(诸如帧间预测、帧内预测、帧间补偿、帧内补偿和运动补偿)的基本单元。Prediction unit: A prediction unit may be a basic unit for prediction such as inter prediction, intra prediction, inter compensation, intra compensation and motion compensation.

–单个预测单元可被划分为具有更小尺寸的多个分区或者子预测单元。所述多个分区也可以是在执行预测或补偿时的基本单元。通过对预测单元进行划分所生成的分区也可以是预测单元。- A single prediction unit may be divided into multiple partitions or sub-prediction units of smaller size. The plurality of partitions may also be basic units when performing prediction or compensation. A partition generated by dividing a prediction unit may also be a prediction unit.

预测单元分区:预测单元分区可以是预测单元被划分为的形状。Prediction unit partition: A prediction unit partition may be the shape into which the prediction unit is divided.

重建的邻近单元:重建的邻近单元可以是在目标单元周围已经被解码和重建的单元。Reconstructed Neighbors: Reconstructed neighbors may be units that have been decoded and reconstructed around the target unit.

–重建的邻近单元可以是在空间上与目标单元相邻或者在时间上与目标单元相邻的单元。- The reconstructed neighbor cells may be cells that are spatially adjacent to the target cell or temporally adjacent to the target cell.

–重建的空间邻近单元可以是在目标画面中包括的已经通过编码和/或解码被重建的单元。- A reconstructed spatially adjacent unit may be a unit included in the target picture that has been reconstructed by encoding and/or decoding.

–重建的时间邻近单元可以是在参考图像中包括的已经通过编码和/或解码被重建的单元。重建的时间邻近单元在参考图像中的位置可以与目标单元在目标画面中的位置相同,或者可以与目标单元在目标画面中的位置对应。- A reconstructed temporally adjacent unit may be a unit included in the reference picture that has been reconstructed by encoding and/or decoding. The position of the reconstructed temporally adjacent unit in the reference image may be the same as the position of the target unit in the target picture, or may correspond to the position of the target unit in the target picture.

参数集:参数集可以是比特流的结构中的头信息。例如,参数集可包括视频参数集、序列参数集、画面参数集、自适应参数集等。Parameter set: The parameter set can be header information in the structure of the bitstream. For example, parameter sets may include video parameter sets, sequence parameter sets, picture parameter sets, adaptation parameter sets, and the like.

此外,参数集可包括条带头信息和并行块头信息。Also, the parameter set may include slice header information and parallel block header information.

率失真优化:编码设备可使用率失真优化以便通过利用以下项的组合来提供高编码效率:编码单元(CU)的尺寸、预测模式、预测单元(PU)的尺寸、运动信息和变换单元(TU)的尺寸。Rate-distortion optimization: An encoding device may use rate-distortion optimization in order to provide high encoding efficiency by utilizing a combination of: coding unit (CU) size, prediction mode, prediction unit (PU) size, motion information, and transform unit (TU) )size of.

–率失真优化方案可计算各个组合的率失真代价以从这些组合中选择最优组合。可使用以下方程式1来计算率失真代价。通常,可将使率失真代价最小化的组合选为在率失真优化方案下的最优组合。- The rate-distortion optimization scheme can calculate the rate-distortion cost of each combination to select the optimal combination from these combinations. The rate-distortion penalty can be calculated using Equation 1 below. In general, the combination that minimizes the rate-distortion penalty can be selected as the optimal combination under the rate-distortion optimization scheme.

[方程式1][Equation 1]

D+λ*RD+λ*R

–D可表示失真。D可以是在变换单元中的原始变换系数与重建的变换系数之间的差值的平方的平均值(即,均方误差)。-D for distortion. D may be the average of the squares of the differences between the original transform coefficients and the reconstructed transform coefficients in the transform unit (ie, the mean squared error).

–R可表示所述率,其可使用相关上下文信息来表示比特率。-R may represent the rate, which may use relevant context information to represent the bit rate.

–λ表示拉格朗日乘数。R不仅可包括编码参数信息(诸如预测模式、运动信息和编码块标志),还可包括由于对变换系数进行编码而生成的比特。–λ represents the Lagrange multiplier. R may include not only encoding parameter information such as prediction mode, motion information, and encoding block flags, but also bits generated as a result of encoding transform coefficients.

–编码设备可执行诸如帧间预测和/或帧内预测、变换、量化、熵编码、逆量化(反量化)和逆变换的过程,以便计算精确的D和R。这些过程会大大增加编码设备的复杂度。- The encoding device may perform processes such as inter prediction and/or intra prediction, transform, quantization, entropy coding, inverse quantization (inverse quantization) and inverse transform in order to calculate exact D and R. These processes can greatly increase the complexity of the encoding device.

–比特流:比特流可表示包括编码图像信息的比特的流。- Bitstream: A bitstream may represent a stream of bits comprising encoded image information.

–参数集:参数集可以是比特流的结构中的头信息。- Parameter set: The parameter set may be header information in the structure of the bitstream.

参数集可包括视频参数集、序列参数集、画面参数集和自适应参数集中的至少一个。此外,参数集可包括关于条带头的信息和关于并行块头的信息。The parameter set may include at least one of a video parameter set, a sequence parameter set, a picture parameter set, and an adaptation parameter set. Furthermore, the parameter set may include information on slice headers and information on parallel block headers.

解析:解析可以是通过对比特流执行熵解码而做出的对语法元素的值的判定。可选地,术语“解析”可表示这种熵解码本身。Parsing: Parsing may be a decision on the value of a syntax element made by performing entropy decoding on the bitstream. Alternatively, the term "parse" may refer to this entropy decoding itself.

符号:符号可以是编码目标单元和/或解码目标单元的语法元素、编码参数和变换系数中的至少一个。此外,符号可以是熵编码的目标或熵解码的结果。Symbol: A symbol may be at least one of a syntax element, an encoding parameter, and a transform coefficient of the encoding target unit and/or the decoding target unit. Furthermore, the symbols can be the target of entropy encoding or the result of entropy decoding.

参考画面:参考画面可以是被单元参考以便执行帧间预测或运动补偿的图像。可选地,参考画面可以是包括被目标单元参考以便执行帧间预测或运动补偿的参考单元的图像。Reference picture: A reference picture may be an image that is referenced by a unit in order to perform inter prediction or motion compensation. Alternatively, the reference picture may be a picture including a reference unit that is referenced by the target unit in order to perform inter prediction or motion compensation.

在下文中,术语“参考画面”和“参考图像”可被用于具有相同的含义,并且可彼此互换使用。Hereinafter, the terms "reference picture" and "reference image" may be used to have the same meaning and may be used interchangeably with each other.

参考画面列表:参考画面列表可以是包括被用于帧间预测或运动补偿的一个或更多个参考图像的列表。Reference picture list: The reference picture list may be a list including one or more reference pictures used for inter prediction or motion compensation.

–参考画面列表的类型可包括合并的列表(LC)、列表0(L0)、列表1(L1)、列表2(L3)、列表3(L3)等。- Types of reference picture lists may include merged list (LC), list 0 (L0), list 1 (L1), list 2 (L3), list 3 (L3), etc.

–对于帧间预测,可使用一个或更多个参考画面列表。- For inter prediction, one or more reference picture lists may be used.

帧间预测指示符:帧间预测指示符可指示针对目标单元的帧间预测方向。帧间预测可以是单向预测和双向预测之一。可选地,帧间预测指示符可表示用于生成目标单元的预测单元的参考图像的数量。可选地,帧间预测指示符可表示用于目标单元的帧间预测或运动补偿的预测块的数量。Inter prediction indicator: The inter prediction indicator may indicate the inter prediction direction for the target unit. Inter prediction may be one of unidirectional prediction and bidirectional prediction. Alternatively, the inter prediction indicator may represent the number of reference pictures used to generate the prediction unit of the target unit. Alternatively, the inter prediction indicator may represent the number of prediction blocks used for inter prediction or motion compensation of the target unit.

参考画面索引:参考画面索引可以是指示在参考画面列表中的特定参考图像的索引。Reference picture index: The reference picture index may be an index indicating a specific reference picture in the reference picture list.

运动矢量(MV):运动矢量可以是用于帧间预测或运动补偿的2D矢量。运动矢量可表示目标图像与参考图像之间的偏移。Motion Vector (MV): A motion vector can be a 2D vector used for inter prediction or motion compensation. The motion vector can represent the offset between the target image and the reference image.

–例如,可以以诸如(mvx,mvy)的形式来表示MV。mvx可指示水平分量,mvy可指示垂直分量。- For example, the MV can be represented in a form such as (mv x , mv y ). mv x may indicate the horizontal component and mv y may indicate the vertical component.

–搜索范围:搜索范围可以是在帧间预测期间执行针对MV的搜索的2D区域。例如,搜索范围的尺寸可以是M×N。M和N可分别是正整数。- Search range: The search range may be the 2D area where the search for the MV is performed during inter prediction. For example, the size of the search range may be MxN. M and N may each be a positive integer.

运动矢量候选:运动矢量候选可以是在运动矢量被预测时作为预测候选的块或者作为预测候选的块的运动矢量。Motion vector candidate: A motion vector candidate may be a block that is a prediction candidate when the motion vector is predicted or a motion vector of a block that is a prediction candidate.

–运动矢量候选可被包括在运动矢量候选列表中。- Motion vector candidates may be included in the motion vector candidate list.

运动矢量候选列表:运动矢量候选列表可以是使用一个或更多个运动矢量候选配置的列表。Motion Vector Candidate List: A motion vector candidate list may be a list configured using one or more motion vector candidates.

运动矢量候选索引:运动矢量候选索引可以是用于指示运动矢量候选列表中的运动矢量候选的指示符。可选地,运动矢量候选索引可以是运动矢量预测因子的索引。Motion Vector Candidate Index: The motion vector candidate index may be an indicator for indicating a motion vector candidate in the motion vector candidate list. Alternatively, the motion vector candidate index may be an index of a motion vector predictor.

运动信息:运动信息可以是包括参考画面列表、参考图像、运动矢量候选、运动矢量候选索引、合并候选和合并索引中的至少一个以及运动矢量、参考画面索引和帧间预测指示符的信息。Motion information: The motion information may be information including at least one of a reference picture list, a reference picture, a motion vector candidate, a motion vector candidate index, a merge candidate, and a merge index, and a motion vector, a reference picture index, and an inter prediction indicator.

合并候选列表:合并候选列表可以是使用合并候选配置的列表。Merge candidate list: The merge candidate list may be a list using the merge candidate configuration.

合并候选:合并候选可以是空间合并候选、时间合并候选、组合合并候选、组合双预测合并候选、零合并候选等。合并候选可包括运动信息,诸如预测类型信息、用于每个列表的参考画面索引以及运动矢量。Merging candidate: The merging candidate may be a spatial merging candidate, a temporal merging candidate, a combined merging candidate, a combined bi-predictive merging candidate, a zero merging candidate, and the like. Merge candidates may include motion information, such as prediction type information, reference picture indices for each list, and motion vectors.

合并索引:合并索引可以是用于指示合并候选列表中的合并候选的指示符。Merge Index: The merge index may be an indicator for indicating a merge candidate in the merge candidate list.

–合并索引可指示在空间上与目标单元相邻的重建单元和在时间上与目标单元相邻的重建单元之间的用于推导合并候选的重建单元。- The merge index may indicate a reconstruction unit for deriving a merge candidate between a reconstruction unit that is spatially adjacent to the target unit and a reconstruction unit that is temporally adjacent to the target unit.

–合并索引可指示合并候选的多条运动信息中的至少一条。- The merge index may indicate at least one of multiple pieces of motion information of a merge candidate.

变换单元:变换单元可以是残差信号编码和/或残差信号解码(诸如变换、逆变换、量化、反量化、变换系数编码和变换系数解码)的基本单元。单个变换单元可被分区为具有更小尺寸的多个变换单元。Transform unit: A transform unit may be a basic unit for residual signal encoding and/or residual signal decoding, such as transform, inverse transform, quantization, inverse quantization, transform coefficient encoding, and transform coefficient decoding. A single transform unit may be partitioned into multiple transform units with smaller sizes.

缩放:缩放可表示将因子乘以变换系数等级的过程。Scaling: Scaling can refer to the process of multiplying a factor by a transform coefficient level.

–作为对变换系数等级进行缩放的结果,可生成变换系数。缩放也可被称为“反量化”。- Transform coefficients may be generated as a result of scaling transform coefficient levels. Scaling may also be referred to as "inverse quantization".

量化参数(QP):量化参数可以是用于在量化中生成用于变换系数的变换系数等级的值。可选地,量化参数也可以是用于在反量化中通过对变换系数等级进行缩放来生成变换系数的值。可选地,量化参数可以是被映射到量化步长的值。Quantization parameter (QP): A quantization parameter may be a value used to generate transform coefficient levels for transform coefficients in quantization. Optionally, the quantization parameter may also be a value used to generate transform coefficients by scaling transform coefficient levels in inverse quantization. Alternatively, the quantization parameter may be a value mapped to a quantization step size.

变量增量(Delta)量化参数:变量增量量化参数是目标单元的量化参数与预测出的量化参数之间的差值。Delta quantization parameter: The delta quantization parameter is the difference between the quantization parameter of the target unit and the predicted quantization parameter.

扫描:扫描可表示对单元、块或矩阵中的系数顺序进行排列的方法。例如,用于按照一维(1D)阵列的形式对2D阵列进行排列的方法可被称为“扫描”。可选地,用于按照2D阵列的形式对1D阵列进行排列的方法也可被称为“扫描”或“逆扫描”。Scanning: Scanning can represent a method of arranging the order of coefficients in a cell, block, or matrix. For example, a method for arranging a 2D array in the form of a one-dimensional (1D) array may be referred to as "scanning". Alternatively, the method for arranging a 1D array in the form of a 2D array may also be referred to as "scanning" or "inverse scanning".

变换系数:变换系数可以是在编码设备执行变换时生成的系数值。可选地,变换系数可以是在解码设备执行熵解码和反量化中的至少一个时生成的系数值。Transform coefficients: Transform coefficients may be coefficient values generated when an encoding device performs transform. Alternatively, the transform coefficients may be coefficient values generated when the decoding apparatus performs at least one of entropy decoding and inverse quantization.

–通过将量化应用于变换系数或残差信号而生成的量化的等级或量化的变换系数等级也可被包括在术语“变换系数”的含义中。- Quantized levels or quantized transform coefficient levels generated by applying quantization to transform coefficients or residual signals may also be included in the meaning of the term "transform coefficients".

量化的等级:量化的等级可以是在编码设备对变换系数或残差信号执行量化时生成的值。可选地,量化的等级可以是在解码设备执行反量化时作为反量化的目标的值。Level of quantization: The level of quantization may be a value generated when an encoding apparatus performs quantization on a transform coefficient or residual signal. Alternatively, the level of quantization may be a value that is a target of inverse quantization when the decoding apparatus performs inverse quantization.

–作为变换和量化的结果的量化的变换系数等级也可被包括在量化的等级的含义中。- Quantized transform coefficient levels that are the result of transform and quantization may also be included in the meaning of quantized levels.

非零变换系数:非零变换系数可以是具有除了0之外的值的变换系数,或者可以是具有除了0之外的值的变换系数等级。可选地,非零变换系数可以是值的幅度不为0的变换系数,或者可以是值的幅度不为0的变换系数等级。Non-zero transform coefficients: Non-zero transform coefficients may be transform coefficients having values other than zero, or may be transform coefficient levels having values other than zero. Alternatively, the non-zero transform coefficients may be transform coefficients whose magnitudes are not zero, or may be transform coefficient levels whose magnitudes are not zero.

量化矩阵:量化矩阵可以是在量化过程或反量化过程中使用以便提高图像的主观图像质量或客观图像质量的矩阵。量化矩阵也可被称为“缩放列表”。Quantization matrix: A quantization matrix can be a matrix used in the quantization process or inverse quantization process in order to improve the subjective image quality or objective image quality of an image. The quantization matrix may also be referred to as a "scale list".

量化矩阵系数:量化矩阵系数可以是量化矩阵中的每个元素。量化矩阵系数也可被称为“矩阵系数”。Quantization matrix coefficients: The quantization matrix coefficients can be each element in the quantization matrix. The quantization matrix coefficients may also be referred to as "matrix coefficients".

默认矩阵:默认矩阵可以是被编码设备和解码设备预先定义的量化矩阵。Default matrix: The default matrix can be a quantization matrix pre-defined by the encoding device and the decoding device.

非默认矩阵:非默认矩阵可以是未被编码设备和解码设备预先定义的量化矩阵。非默认矩阵可以由编码设备用信号传送给解码设备。Non-default matrix: The non-default matrix may be a quantization matrix that is not predefined by the encoding device and the decoding device. The non-default matrix may be signaled by the encoding device to the decoding device.

最可能模式(MPM):MPM可表示高概率被用于针对目标块的帧内预测的帧内预测模式。Most Probable Mode (MPM): MPM may represent an intra prediction mode that is used for intra prediction for a target block with a high probability.

编码设备和解码设备可基于与目标块相关的编码参数以及与目标块相关的实体的属性来确定一个或更多个MPM。The encoding apparatus and the decoding apparatus may determine one or more MPMs based on encoding parameters related to the target block and attributes of entities related to the target block.

编码设备和解码设备可基于参考块的帧内预测模式来确定一个或更多个MPM。参考块可包括多个参考块。多个参考块可包括与目标块的左侧相邻的空间邻近块和与目标块的上方相邻的空间邻近块。换句话说,根据哪些帧内预测模式已经被用于参考块,可确定一个或更多个不同的MPM。The encoding apparatus and the decoding apparatus may determine one or more MPMs based on the intra prediction mode of the reference block. A reference block may include multiple reference blocks. The plurality of reference blocks may include a spatially adjacent block adjacent to the left of the target block and a spatially adjacent block adjacent to the upper side of the target block. In other words, one or more different MPMs may be determined depending on which intra prediction modes have been used for the reference block.

可在编码设备和解码设备两者中以相同的方式确定一个或更多个MPM。也就是说,编码设备和解码设备可共享包括一个或更多个MPM的相同的MPM列表。One or more MPMs may be determined in the same manner in both the encoding device and the decoding device. That is, the encoding apparatus and the decoding apparatus may share the same MPM list including one or more MPMs.

MPM列表:MPM列表可以是包括一个或更多个MPM的列表。可预先定义MPM列表中的一个或更多个MPM的数量。MPM list: An MPM list can be a list that includes one or more MPMs. The number of one or more MPMs in the MPM list may be predefined.

MPM指示符:MPM指示符可指示MPM列表中的一个或更多个MPM之中的将被用于针对目标块的帧内预测的MPM。例如,MPM指示符可以是用于MPM列表的索引。MPM indicator: The MPM indicator may indicate an MPM among one or more MPMs in the MPM list to be used for intra prediction for the target block. For example, the MPM indicator may be an index for the MPM list.

由于在编码设备和解码设备两者中以相同的方式确定MPM列表,因此可不需要将MPM列表本身从编码设备发送到解码设备。Since the MPM list is determined in the same way in both the encoding device and the decoding device, it may not be necessary to send the MPM list itself from the encoding device to the decoding device.

MPM指示符可从编码设备被用信号发送到解码设备。由于MPM指示符被用信号发送,解码设备可确定MPM列表中的MPM之中的将被用于针对目标块的帧内预测的MPM。The MPM indicator may be signaled from the encoding device to the decoding device. Since the MPM indicator is signaled, the decoding apparatus may determine the MPM to be used for intra prediction for the target block among the MPMs in the MPM list.

MPM使用指示符:MPM使用指示符可指示MPM使用模式是否将被用于针对目标块的预测。MPM使用模式可以是使用MPM列表来确定将被用于针对目标块的帧内预测的MPM的模式。MPM usage indicator: The MPM usage indicator may indicate whether the MPM usage mode is to be used for prediction for the target block. The MPM usage mode may be a mode in which the MPM to be used for intra prediction for the target block is determined using the MPM list.

MPM使用指示符可从编码设备被用信号发送到解码设备。The MPM usage indicator may be signaled from the encoding device to the decoding device.

信令:“信令”可表示信息从编码设备被发送到解码设备。可选地,“信令”可表示信息被包括在比特流或存储介质中。由编码设备用信号发送的信息可被解码设备使用。Signaling: "Signaling" may mean that information is sent from an encoding device to a decoding device. Alternatively, "signaling" may mean that information is included in a bitstream or storage medium. The information signaled by the encoding device can be used by the decoding device.

图1是示出被应用了本公开的编码设备的实施例的配置的框图。FIG. 1 is a block diagram showing the configuration of an embodiment of an encoding apparatus to which the present disclosure is applied.

编码设备100可以是编码器、视频编码设备或图像编码设备。视频可包括一个或更多个图像(画面)。编码设备100可顺序地对视频的一个或更多个图像进行编码。The encoding device 100 may be an encoder, a video encoding device, or an image encoding device. A video may include one or more images (pictures). The encoding apparatus 100 may sequentially encode one or more pictures of the video.

参照图1,编码设备100包括帧间预测单元110、帧内预测单元120、切换器115、减法器125、变换单元130、量化单元140、熵编码单元150、反量化(逆量化)单元160、逆变换单元170、加法器175、滤波器单元180和参考画面缓冲器190。1, the encoding apparatus 100 includes an inter prediction unit 110, an intra prediction unit 120, a switch 115, a subtractor 125, a transform unit 130, a quantization unit 140, an entropy encoding unit 150, an inverse quantization (inverse quantization) unit 160, Inverse transform unit 170 , adder 175 , filter unit 180 and reference picture buffer 190 .

编码设备100可使用帧内模式和/或帧间模式对目标图像执行编码。The encoding apparatus 100 may perform encoding on the target image using the intra mode and/or the inter mode.

此外,编码设备100可通过对目标图像进行编码来生成包括关于编码的信息的比特流,并且可输出生成的比特流。生成的比特流可被存储在计算机可读存储介质中,并且可通过无线/有线传输介质被流传输。Also, the encoding apparatus 100 may generate a bitstream including information on encoding by encoding the target image, and may output the generated bitstream. The generated bitstream can be stored in a computer-readable storage medium, and can be streamed over a wireless/wired transmission medium.

当帧内模式被用作预测模式时,切换器115可切换到帧内模式。当帧间模式被用作预测模式时,切换器115可切换到帧间模式。When the intra mode is used as the prediction mode, the switch 115 may switch to the intra mode. When the inter mode is used as the prediction mode, the switcher 115 may switch to the inter mode.

编码设备100可生成目标块的预测块。此外,在已生成了预测块之后,编码设备100可对目标块与预测块之间的残差进行编码。The encoding apparatus 100 may generate a prediction block of the target block. Also, after the prediction block has been generated, the encoding apparatus 100 may encode the residual between the target block and the prediction block.

当预测模式是帧内模式时,帧内预测单元120可将在目标块周围的先前被编码/解码的邻近块的像素用作参考样点。帧内预测单元120可使用所述参考样点对目标块执行空间预测,并且可经由空间预测针对目标块生成预测样点。When the prediction mode is the intra mode, the intra prediction unit 120 may use the pixels of the previously encoded/decoded neighboring blocks around the target block as reference samples. Intra-prediction unit 120 may perform spatial prediction on the target block using the reference samples, and may generate prediction samples for the target block via spatial prediction.

帧间预测单元110可包括运动预测单元和运动补偿单元。The inter prediction unit 110 may include a motion prediction unit and a motion compensation unit.

当预测模式是帧间模式时,运动预测单元可在运动预测过程中在参考图像中搜索与目标块最匹配的区域,并且可基于找到的区域针对目标块和找到的区域推导运动矢量。When the prediction mode is the inter mode, the motion prediction unit may search the reference image for a region that best matches the target block in the motion prediction process, and may derive a motion vector for the target block and the found region based on the found region.

参考图像可被存储在参考画面缓冲器190中。更具体地讲,当参考图像的编码和/或解码已被处理时,参考图像可被存储在参考画面缓冲器190中。Reference images may be stored in the reference picture buffer 190 . More specifically, the reference picture may be stored in the reference picture buffer 190 when encoding and/or decoding of the reference picture has been processed.

运动补偿单元可通过使用运动矢量执行运动补偿来生成针对目标块的预测块。这里,运动矢量可以是用于帧间预测的二维(2D)矢量。此外,运动矢量可表示目标图像与参考图像之间的偏移。The motion compensation unit may generate a prediction block for the target block by performing motion compensation using the motion vector. Here, the motion vector may be a two-dimensional (2D) vector used for inter prediction. Furthermore, the motion vector may represent the offset between the target image and the reference image.

当运动矢量具有除了整数之外的值时,运动预测单元和运动补偿单元可通过将插值滤波器应用于参考图像的部分区域来生成预测块。为了执行帧间预测或运动补偿,可确定跳过模式、合并模式、高级运动矢量预测(AMVP)模式和当前画面参考模式中的哪一种模式对应于用于基于CU对CU中包括的PU的运动进行预测并对该运动进行补偿的方法,并且可根据该模式执行帧间预测或运动补偿。When the motion vector has a value other than an integer, the motion prediction unit and the motion compensation unit may generate a prediction block by applying an interpolation filter to a partial region of the reference image. In order to perform inter prediction or motion compensation, it may be determined which of skip mode, merge mode, advanced motion vector prediction (AMVP) mode, and current picture reference mode corresponds to a CU-based PU for PUs included in the CU A method of predicting and compensating for motion, and inter-prediction or motion compensation may be performed according to this mode.

减法器125可生成残差块,其中,残差块是目标块与预测块之间的差。残差块也可被称为“残差信号”。Subtractor 125 may generate a residual block, where the residual block is the difference between the target block and the prediction block. The residual block may also be referred to as a "residual signal".

残差信号可以是原始信号与预测信号之间的差。可选地,残差信号可以是通过对原始信号与预测信号之间的差进行变换或量化而生成的信号或者通过对该差进行变换和量化而生成的信号。残差块可以是针对块单元的残差信号。The residual signal may be the difference between the original signal and the predicted signal. Alternatively, the residual signal may be a signal generated by transforming or quantizing the difference between the original signal and the predicted signal or a signal generated by transforming and quantizing the difference. The residual block may be a residual signal for block units.

变换单元130可通过对残差块进行变换来生成变换系数,并且可输出生成的变换系数。这里,变换系数可以是通过对残差块进行变换而生成的系数值。The transform unit 130 may generate transform coefficients by transforming the residual block, and may output the generated transform coefficients. Here, the transform coefficients may be coefficient values generated by transforming the residual block.

变换单元130可在执行变换时使用多个预定义的变换方法中的一个。Transform unit 130 may use one of a number of predefined transform methods when performing the transform.

多个预定义的变换方法可包括离散余弦变换(DCT)、离散正弦变换(DST)、Karhunen-Loeve变换(KLT)等。A number of predefined transform methods may include discrete cosine transform (DCT), discrete sine transform (DST), Karhunen-Loeve transform (KLT), and the like.

可根据针对目标块和/或邻近块的编码参数中的至少一个来确定用于对残差块进行变换的变换方法。例如,可基于针对PU的帧间预测模式、针对PU的帧内预测模式、TU的尺寸以及TU的形状中的至少一个来确定变换方法。可选地,指示变换方法的变换信息可从编码设备100被用信号发送到解码设备200。A transform method for transforming the residual block may be determined according to at least one of encoding parameters for the target block and/or neighboring blocks. For example, the transform method may be determined based on at least one of the inter prediction mode for the PU, the intra prediction mode for the PU, the size of the TU, and the shape of the TU. Alternatively, transform information indicating the transform method may be signaled from the encoding apparatus 100 to the decoding apparatus 200 .

当使用变换跳过模式时,变换单元130可省略对残差块进行变换的操作。When transform skip mode is used, transform unit 130 may omit transforming the residual block.

通过对变换系数实施量化,可生成量化的变换系数等级或者量化的等级。在下文中,在实施例中,量化的变换系数等级和量化的等级中的每一个也可被称为“变换系数”。By quantizing the transform coefficients, a quantized transform coefficient level or a quantized level can be generated. Hereinafter, in an embodiment, each of the quantized transform coefficient level and the quantized level may also be referred to as a "transform coefficient".

量化单元140可通过根据量化参数对变换系数进行量化来生成量化的变换系数等级或量化的等级。量化单元140可输出生成的量化的变换系数等级或量化的等级。在这种情况下,量化单元140可使用量化矩阵对变换系数进行量化。The quantization unit 140 may generate a quantized transform coefficient level or a quantized level by quantizing the transform coefficient according to the quantization parameter. The quantization unit 140 may output the generated quantized transform coefficient level or quantized level. In this case, the quantization unit 140 may quantize the transform coefficients using a quantization matrix.

熵编码单元150可通过基于由量化单元140计算出的值和/或在编码过程中计算出的编码参数值执行基于概率分布的熵编码来生成比特流。熵编码单元150可输出生成的比特流。The entropy encoding unit 150 may generate a bitstream by performing probability distribution-based entropy encoding based on values calculated by the quantization unit 140 and/or encoding parameter values calculated during encoding. The entropy encoding unit 150 may output the generated bitstream.

熵编码单元150可对关于图像的像素的信息以及对图像进行解码所需的信息执行熵编码。例如,对图像进行解码所需的信息可包括语法元素等。The entropy encoding unit 150 may perform entropy encoding on information about pixels of the image and information required to decode the image. For example, information required to decode an image may include syntax elements and the like.

当应用熵编码时,可将更少的比特分配给更频繁出现的符号,并且可将更多的比特分配给很少出现的符号。由于通过该分配来表示符号,因此可减少用于将被编码的目标符号的比特串的大小。因此,通过熵编码可提高视频编码的压缩性能。When entropy coding is applied, fewer bits can be allocated to more frequently occurring symbols, and more bits can be allocated to infrequently occurring symbols. Since the symbols are represented by this allocation, the size of the bit string for the target symbol to be encoded can be reduced. Therefore, the compression performance of video coding can be improved by entropy coding.

此外,为了进行熵编码,熵编码单元150可使用诸如指数哥伦布、上下文自适应可变长度编码(CAVLC)或上下文自适应二进制算术编码(CABAC)的编码方法。例如,熵编码单元150可使用可变长度编码/码(VLC)表来执行熵编码。例如,熵编码单元150可推导用于目标符号的二值化方法。此外,熵编码单元150可推导用于目标符号/二进制位的概率模型。熵编码单元150可使用推导出的二值化方法、概率模型和上下文模型来执行算术编码。Also, for entropy encoding, the entropy encoding unit 150 may use an encoding method such as Exponential Golomb, Context Adaptive Variable Length Coding (CAVLC), or Context Adaptive Binary Arithmetic Coding (CABAC). For example, the entropy encoding unit 150 may perform entropy encoding using a variable length coding/code (VLC) table. For example, entropy encoding unit 150 may derive a binarization method for the target symbol. Additionally, entropy encoding unit 150 may derive a probability model for the target symbol/bin. The entropy encoding unit 150 may perform arithmetic encoding using the derived binarization method, probability model, and context model.

熵编码单元150可通过变换系数扫描方法将2D块形式的系数变换为1D矢量形式,以便对量化的变换系数等级进行编码。The entropy encoding unit 150 may transform coefficients in a 2D block form into a 1D vector form through a transform coefficient scanning method in order to encode the quantized transform coefficient levels.

编码参数可以是编码和/或解码所需的信息。编码参数可包括由编码设备100编码且从编码设备100发送到解码设备的信息,并且还可包括可在编码或解码过程中推导的信息。例如,发送到解码设备的信息可包括语法元素。Encoding parameters may be information required for encoding and/or decoding. The encoding parameters may include information encoded by the encoding apparatus 100 and transmitted from the encoding apparatus 100 to the decoding apparatus, and may also include information that may be derived during encoding or decoding. For example, the information sent to the decoding device may include syntax elements.

编码参数不仅可包括由编码设备编码并由编码设备用信号传送到解码设备的诸如语法元素的信息(或标志或索引),还可包括在编码或解码处理中推导出的信息。此外,编码参数可包括对图像进行编码或解码所需的信息。例如,编码参数可包括以下项中的至少一个值、以下项的组合或统计:单元/块的尺寸、单元/块的深度、单元/块的分区信息、单元/块的分区结构、指示单元/块是否以四叉树结构被分区的信息、指示单元/块是否以二叉树结构被分区的信息、二叉树结构的分区方向(水平方向或垂直方向)、二叉树结构的分区形式(对称分区或非对称分区)、指示单元/块是否以三叉树结构被分区的信息、三叉树结构的分区方向(水平方向或垂直方向)、三叉树结构的分区形式(对称分区或非对称分区等)、指示单元/块是否以复合树结构被分区的信息、复合树结构的分区的组合和方向(水平方向或垂直方向等)、预测方案(帧内预测或帧间预测)、帧内预测模式/方向、参考样点滤波方法、预测块滤波方法、预测块边界滤波方法、用于滤波的滤波器抽头、用于滤波的滤波器系数、帧间预测模式、运动信息、运动矢量、参考画面索引、帧间预测方向、帧间预测指示符、参考画面列表、参考图像、运动矢量预测因子、运动矢量预测候选、运动矢量候选列表、指示合并模式是否被使用的信息、合并候选、合并候选列表、指示跳过模式是否被使用的信息、插值滤波器的类型、插值滤波器的抽头、插值滤波器的滤波器系数、运动矢量的大小、运动矢量表示的精确度、变换类型、变换大小、指示初次变换是否被使用的信息、指示附加(二次)变换是否被使用的信息、首次变换选择信息(或首次变换索引)、二次变换选择信息(或二次变换索引)、指示残差信号存在或不存在的信息、编码块样式、编码块标志、量化参数、量化矩阵、关于环内滤波器的信息、指示环内滤波器是否被应用的信息、环内滤波器的系数、环内滤波器的抽头、环内滤波器的形状/形式、指示去块滤波器是否被应用的信息、去块滤波器的系数、去块滤波器的抽头、去块滤波器强度、去块滤波器的形状/形式、指示自适应样点偏移是否被应用的信息、自适应样点偏移的值、自适应样点偏移的类别、自适应样点偏移的类型、指示自适应环路滤波器是否被应用的信息、自适应环路滤波器的系数、自适应环路滤波器的抽头、自适应环路滤波器的形状/形式、二值化/反二值化方法、上下文模型、上下文模型决定方法、上下文模型更新方法、指示常规模式是否被执行的信息、指示旁路(bypass)模式是否被执行的信息、上下文二进制位、旁路二进制位、变换系数、变换系数等级、变换系数等级扫描方法、图像显示/输出顺序、条带识别信息、条带类型、条带分区信息、并行块识别信息、并行块类型、并行块分区信息、画面类型、比特深度、关于亮度信号的信息和关于色度信号的信息。预测方案可表示帧内预测模式和帧间预测模式中的一个预测模式。Encoding parameters may include not only information such as syntax elements (or flags or indices) encoded by the encoding device and signaled by the encoding device to the decoding device, but also information derived in the encoding or decoding process. Furthermore, the encoding parameters may include information required to encode or decode the image. For example, the coding parameters may include at least one of the following values, a combination or statistics of: size of unit/block, depth of unit/block, partition information of unit/block, partition structure of unit/block, indicating unit/block Information whether the block is partitioned in a quad-tree structure, information indicating whether the unit/block is partitioned in a binary tree structure, the partition direction (horizontal direction or vertical direction) of the binary tree structure, the partition form of the binary tree structure (symmetric partition or asymmetric partition) ), information indicating whether the unit/block is partitioned in a ternary tree structure, the partition direction of the ternary tree structure (horizontal direction or vertical direction), the partition form of the ternary tree structure (symmetrical or asymmetrical partition, etc.), indicating the unit/block Information on whether to be partitioned in a composite tree structure, combination and direction (horizontal direction or vertical direction, etc.) of the partitions of the composite tree structure, prediction scheme (intra prediction or inter prediction), intra prediction mode/direction, reference samples filtering method, predictive block filtering method, predictive block boundary filtering method, filter taps for filtering, filter coefficients for filtering, inter prediction mode, motion information, motion vector, reference picture index, inter prediction direction, Inter prediction indicator, reference picture list, reference picture, motion vector predictor, motion vector prediction candidate, motion vector candidate list, information indicating whether merge mode is used, merge candidate, merge candidate list, indicating whether skip mode is used Information used, type of interpolation filter, taps of interpolation filter, filter coefficients of interpolation filter, size of motion vector, accuracy of motion vector representation, transform type, transform size, information indicating whether primary transform is used or not , information indicating whether an additional (secondary) transform is used, first transform selection information (or first transform index), second transform selection information (or second transform index), information indicating the presence or absence of a residual signal, coding Block style, coding block flag, quantization parameter, quantization matrix, information about the in-loop filter, information indicating whether the in-loop filter is applied, coefficients of the in-loop filter, taps of the in-loop filter, in-loop filter shape/form of DF, information indicating whether DF is applied, DF coefficients, DF taps, DF strength, DF shape/form, indicating adaptive samples Information on whether the offset is applied, value of adaptive sample offset, type of adaptive sample offset, type of adaptive sample offset, information indicating whether adaptive loop filter is applied, adaptive Coefficients of loop filter, taps of adaptive loop filter, shape/form of adaptive loop filter, binarization/de-binarization method, context model, context model determination method, context model update method, Information indicating whether the normal mode is executed, information indicating whether the bypass mode is executed, context bins, bypass bins, transform coefficients, transform coefficient levels, transform coefficient level scanning method, image display/output order, Stripe identification information, stripe type, stripe partition information, parallel block identification information, Parallel block type, parallel block partition information, picture type, bit depth, information on luma signals and information on chroma signals. The prediction scheme may represent one of an intra prediction mode and an inter prediction mode.

首次变换选择信息可指示应用于目标块的首次变换。The first transform selection information may indicate the first transform applied to the target block.

二次变换选择信息可指示应用于目标块的二次变换。The secondary transform selection information may indicate the secondary transform applied to the target block.

残差信号可表示原始信号与预测信号之间的差。可选地,残差信号可以是对原始信号与预测信号之间的差进行变换而生成的信号。可选地,残差信号可以是对原始信号与预测信号之间的差进行变换和量化而生成的信号。残差块可以是针对块的残差信号。The residual signal may represent the difference between the original signal and the predicted signal. Alternatively, the residual signal may be a signal generated by transforming the difference between the original signal and the predicted signal. Alternatively, the residual signal may be a signal generated by transforming and quantizing the difference between the original signal and the predicted signal. The residual block may be a residual signal for the block.

这里,用信号传送标志或索引可表示编码设备100将通过对标志或索引执行熵编码而生成的熵编码的标志或熵编码的索引包括在比特流中,并且可表示解码设备200通过对从比特流提取的熵编码的标志或熵编码的索引执行熵解码来获取标志或索引。Here, signaling the flag or index may mean that the encoding apparatus 100 includes an entropy-coded flag or an entropy-coded index generated by performing entropy encoding on the flag or index in the bitstream, and may mean that the decoding apparatus 200 performs entropy coding by the decoding apparatus 200 The entropy-coded flag or entropy-coded index extracted from the stream performs entropy decoding to obtain the flag or index.

由于编码设备100经由帧间预测执行编码,因此编码的目标图像可被用作用于将被后续处理的另外的图像的参考图像。因此,编码设备100可对编码的目标图像进行重建或解码,并将重建或解码的图像作为参考图像存储在参考画面缓冲器190中。对于解码,可进行对编码的目标图像的反量化和逆变换。Since the encoding apparatus 100 performs encoding via inter prediction, the encoded target image can be used as a reference image for another image to be subsequently processed. Accordingly, the encoding apparatus 100 may reconstruct or decode the encoded target image, and store the reconstructed or decoded image in the reference picture buffer 190 as a reference image. For decoding, inverse quantization and inverse transformation of the encoded target image may be performed.

量化的等级可由反量化单元160进行反量化,并且可由逆变换单元170进行逆变换。可由加法器175将已被反量化和/或逆变换的系数与预测块相加。将反量化和/或逆变换的系数和预测块相加,然后可生成重建块。这里,反量化和/或逆变换的系数可表示被执行了反量化和逆变换中的一个或更多个的系数,并且也可以表示重建的残差块。The quantized levels may be inverse quantized by the inverse quantization unit 160 and inversely transformed by the inverse transform unit 170 . The coefficients that have been inverse quantized and/or inverse transformed may be added to the prediction block by an adder 175 . The inverse-quantized and/or inverse-transformed coefficients and the prediction block are added, and then the reconstructed block can be generated. Here, the inversely quantized and/or inversely transformed coefficients may represent coefficients on which one or more of inverse quantization and inverse transformations are performed, and may also represent a reconstructed residual block.

重建块可通过滤波器单元180进行滤波。滤波器单元180可将去块滤波器、样点自适应偏移(SAO)滤波器、自适应环路滤波器(ALF)和非局部滤波器(NLF)中的一个或更多个滤波器应用于重建块或重建画面。滤波器单元180也可被称为“环路滤波器”。The reconstructed block may be filtered by filter unit 180 . Filter unit 180 may apply one or more of a deblocking filter, a sample adaptive offset (SAO) filter, an adaptive loop filter (ALF), and a non-local filter (NLF) for reconstructing blocks or reconstructing pictures. Filter unit 180 may also be referred to as a "loop filter".

去块滤波器可消除在块之间的边界处出现的块失真。为了确定是否应用去块滤波器,可决定被包括在块中并且包括确定是否对目标块应用去块滤波器所基于的像素的列或行的数量。A deblocking filter removes block distortions that occur at boundaries between blocks. In order to determine whether to apply the deblocking filter, a decision may be made on the number of columns or rows of pixels that are included in the block and include the determination of whether to apply the deblocking filter to the target block.

当去块滤波器被应用于目标块时,所应用的滤波器可根据所需要的去块滤波的强度而不同。换句话说,在不同的滤波器中,可将考虑去块滤波的强度而决定的滤波器应用于目标块。当去块滤波器被应用于目标块时,可根据所需的去块滤波的强度将与强滤波器和弱滤波器中的任意一个对应的滤波器应用于目标块。When a deblocking filter is applied to a target block, the applied filter may vary according to the required strength of deblocking filtering. In other words, among different filters, a filter decided in consideration of the strength of deblocking filtering may be applied to the target block. When the deblocking filter is applied to the target block, a filter corresponding to any one of the strong filter and the weak filter may be applied to the target block according to the required strength of the deblocking filter.

此外,当对目标块执行垂直滤波和水平滤波时,可并行地进行水平滤波和垂直滤波。Also, when vertical filtering and horizontal filtering are performed on the target block, horizontal filtering and vertical filtering may be performed in parallel.

SAO可将适当的偏移与像素值相加以便对编码误差进行补偿。SAO可基于像素对被应用去块的图像执行校正,其中,该校正使用原始图像与被应用去块的图像之间的差的偏移。为了执行针对图像的偏移校正,可使用用于将图像中包括的像素划分为特定数量的区域、在划分出的区域之中确定将被应用偏移的区域并将偏移应用于所确定的区域的方法,并且也可使用用于考虑每个像素的边缘信息来应用偏移的方法。SAO can add the appropriate offset to the pixel value to compensate for encoding errors. SAO may perform a correction on the image to which deblocking is applied on a pixel-by-pixel basis, where the correction uses the offset of the difference between the original image and the image to which deblocking is applied. In order to perform offset correction for an image, a method for dividing pixels included in the image into a certain number of areas, determining an area to which offset is to be applied among the divided areas, and applying the offset to the determined area may be used. area method, and a method for applying offsets that takes into account edge information for each pixel may also be used.

ALF可基于通过将重建图像与原始图像进行比较而获得的值来执行滤波。在图像中包括的像素已被划分为预定数量的群组之后,可确定将被应用于每个群组的滤波器,并且可针对各个群组不同地执行滤波。对于亮度信号,可针对每个CU用信号发送与是否应用自适应环路滤波器相关的信息。将被应用于各个块的ALF的形状和滤波器系数可针对各个块而不同。可选地,不管块的特征如何,具有固定形式的ALF可被应用于所述块。The ALF may perform filtering based on values obtained by comparing the reconstructed image with the original image. After the pixels included in the image have been divided into a predetermined number of groups, a filter to be applied to each group may be determined, and filtering may be performed differently for each group. For luma signals, information on whether to apply an adaptive loop filter may be signaled for each CU. The shape and filter coefficients of the ALF to be applied to each block may be different for each block. Alternatively, ALF with a fixed form can be applied to the block regardless of the characteristics of the block.

非局部滤波器可基于与目标块相似的重建块执行滤波。可从重建画面选择与目标块相似的区域,并且可使用所选择的相似区域的统计属性来执行目标块的滤波。可针对编码单元(CU)用信号发送关于是否应用非局部滤波器的信息。此外,将应用于块的非局部滤波器的形状和滤波器系数可根据该块而不同。The non-local filter may perform filtering based on a reconstructed block similar to the target block. A region similar to the target block may be selected from the reconstructed picture, and the filtering of the target block may be performed using statistical properties of the selected similar region. Information on whether to apply a non-local filter may be signaled for a coding unit (CU). Furthermore, the shape and filter coefficients of the non-local filters to be applied to a block may differ from block to block.

通过滤波器单元180滤波的重建块或重建图像可被存储在参考画面缓冲器190中。通过滤波器单元180滤波的重建块可以是参考画面的一部分。换句话说,参考画面可以是由通过滤波器单元180滤波的重建块构成的重建画面。存储的参考画面随后可被用于帧间预测。The reconstructed blocks or reconstructed images filtered by the filter unit 180 may be stored in the reference picture buffer 190 . The reconstructed block filtered by the filter unit 180 may be part of the reference picture. In other words, the reference picture may be a reconstructed picture composed of reconstructed blocks filtered by the filter unit 180 . The stored reference pictures can then be used for inter prediction.

图2是示出被应用了本公开的解码设备的实施例的配置的框图。FIG. 2 is a block diagram showing the configuration of an embodiment of a decoding apparatus to which the present disclosure is applied.

解码设备200可以是解码器、视频解码设备或图像解码设备。The decoding device 200 may be a decoder, a video decoding device, or an image decoding device.

参照图2,解码设备200可包括熵解码单元210、反量化(逆量化)单元220、逆变换单元230、帧内预测单元240、帧间预测单元250、切换器245、加法器255、滤波器单元260和参考画面缓冲器270。2, the decoding apparatus 200 may include an entropy decoding unit 210, an inverse quantization (inverse quantization) unit 220, an inverse transform unit 230, an intra prediction unit 240, an inter prediction unit 250, a switch 245, an adder 255, a filter unit 260 and reference picture buffer 270.

解码设备200可接收从编码设备100输出的比特流。解码设备200可接收存储在计算机可读存储介质中的比特流,并且可接收通过有线/无线传输介质流传输的比特流。The decoding apparatus 200 may receive the bit stream output from the encoding apparatus 100 . The decoding apparatus 200 may receive a bitstream stored in a computer-readable storage medium, and may receive a bitstream streamed through a wired/wireless transmission medium.

解码设备200可在帧内模式和/或帧间模式下对比特流执行解码。此外,解码设备200可经由解码来生成重建图像或解码图像,并且可输出重建图像或解码图像。The decoding apparatus 200 may perform decoding on the bitstream in the intra mode and/or the inter mode. Also, the decoding apparatus 200 may generate a reconstructed image or a decoded image through decoding, and may output the reconstructed image or the decoded image.

例如,可通过切换器245来执行基于用于进行解码的预测模式而切换到帧内模式或帧间模式的操作。当用于解码的预测模式是帧内模式时,切换器245可被操作以切换到帧内模式。当用于解码的预测模式是帧间模式时,切换器245可被操作以切换到帧间模式。For example, the operation of switching to the intra mode or the inter mode based on the prediction mode used for decoding may be performed by the switch 245 . When the prediction mode for decoding is the intra mode, the switch 245 may be operated to switch to the intra mode. When the prediction mode for decoding is the inter mode, the switch 245 may be operated to switch to the inter mode.

解码设备200可通过对输入的比特流进行解码来获取重建的残差块,并且可生成预测块。当重建的残差块和预测块被获取时,解码设备200可通过将重建的残差块与预测块相加来生成作为被解码的目标的重建块。The decoding apparatus 200 may obtain a reconstructed residual block by decoding an input bitstream, and may generate a prediction block. When the reconstructed residual block and the prediction block are acquired, the decoding apparatus 200 may generate a reconstructed block that is a target of decoding by adding the reconstructed residual block and the prediction block.

熵解码单元210可通过基于比特流的概率分布对比特流执行熵解码来生成符号。生成的符号可包括量化的变换系数等级格式符号。这里,熵解码方法可与以上描述的熵编码方法相似。也就是说,熵解码方法可以是以上描述的熵编码方法的逆过程。The entropy decoding unit 210 may generate symbols by performing entropy decoding on the bitstream based on a probability distribution of the bitstream. The generated symbols may include quantized transform coefficient level format symbols. Here, the entropy decoding method may be similar to the entropy encoding method described above. That is, the entropy decoding method may be an inverse process of the entropy encoding method described above.

熵解码单元210可通过变换系数扫描方法将具有一维(1D)矢量形式的系数改变为2D块形状,以便对量化的变换系数等级进行解码。The entropy decoding unit 210 may change coefficients in the form of a one-dimensional (1D) vector into a 2D block shape through a transform coefficient scanning method in order to decode the quantized transform coefficient levels.

例如,可通过使用右上对角线扫描对块系数进行扫描来将块的系数改变为2D块形状。可选地,可根据对应块的尺寸和/或帧内预测模式来确定右上对角线扫描、垂直扫描和水平扫描中的哪一个将被使用。For example, the coefficients of the block can be changed to a 2D block shape by scanning the block coefficients using an upper right diagonal scan. Alternatively, which of the upper right diagonal scan, vertical scan and horizontal scan will be used may be determined according to the size of the corresponding block and/or the intra prediction mode.

量化的系数可由反量化单元220进行反量化。反量化单元220可通过对量化的系数执行反量化来生成反量化的系数。此外,反量化的系数可由逆变换单元230进行逆变换。逆变换单元230可通过对反量化的系数执行逆变换来生成重建的残差块。作为对量化的系数执行反量化和逆变换的结果,可生成重建的残差块。这里,当生成重建的残差块时,反量化单元220可将量化矩阵应用于量化的系数。The quantized coefficients may be inverse quantized by the inverse quantization unit 220 . The inverse quantization unit 220 may generate inverse quantized coefficients by performing inverse quantization on the quantized coefficients. In addition, the inversely quantized coefficients may be inversely transformed by the inverse transform unit 230 . The inverse transform unit 230 may generate a reconstructed residual block by performing an inverse transform on the inversely quantized coefficients. As a result of performing inverse quantization and inverse transform on the quantized coefficients, a reconstructed residual block may be generated. Here, when generating the reconstructed residual block, the inverse quantization unit 220 may apply a quantization matrix to the quantized coefficients.

当使用帧内模式时,帧内预测单元240可通过执行空间预测来生成预测块,其中,所述空间预测使用在目标块周围的先前被解码的邻近块的像素值。When using the intra mode, the intra prediction unit 240 may generate a prediction block by performing spatial prediction using pixel values of previously decoded neighboring blocks around the target block.

帧间预测单元250可包括运动补偿单元。可选地,帧间预测单元250可被指定为“运动补偿单元”。Inter prediction unit 250 may include a motion compensation unit. Alternatively, the inter prediction unit 250 may be designated as a "motion compensation unit".

当使用帧间模式时,运动补偿单元250可通过执行运动补偿来生成预测块,其中,所述运动补偿使用运动矢量和被存储在参考画面缓冲器270中的参考图像。When the inter mode is used, the motion compensation unit 250 may generate a prediction block by performing motion compensation using motion vectors and reference images stored in the reference picture buffer 270 .

运动补偿单元可在运动矢量具有除了整数之外的值时将插值滤波器应用于参考图像的部分区域,并且可使用被应用了插值滤波器的参考图像来生成预测块。为了执行运动补偿,运动补偿单元可基于CU确定跳过模式、合并模式、高级运动矢量预测(AMVP)模式和当前画面参考模式中的哪一种模式对应于用于CU中包括的PU的运动补偿方法,并且可根据所确定的模式来执行运动补偿。The motion compensation unit may apply an interpolation filter to a partial region of the reference image when the motion vector has a value other than an integer, and may generate a prediction block using the reference image to which the interpolation filter is applied. To perform motion compensation, the motion compensation unit may determine, on a CU basis, which of skip mode, merge mode, advanced motion vector prediction (AMVP) mode, and current picture reference mode corresponds to motion compensation for PUs included in the CU method, and motion compensation may be performed according to the determined mode.

重建的残差块和预测块可由加法器255彼此相加。加法器255可通过将重建的残差块和预测块相加来生成重建块。The reconstructed residual block and the prediction block may be added to each other by the adder 255 . The adder 255 may generate a reconstructed block by adding the reconstructed residual block and the prediction block.

重建块可通过滤波器单元260进行滤波。滤波器单元260可将去块滤波器、SAO滤波器、ALF和NLF中的至少一个应用于重建块或重建图像。重建图像可以是包括重建块的画面。The reconstructed block may be filtered by filter unit 260 . The filter unit 260 may apply at least one of a deblocking filter, a SAO filter, an ALF, and an NLF to the reconstructed block or the reconstructed image. The reconstructed image may be a picture including reconstructed blocks.

经过滤波的重建图像可被编码设备100输出,并且可被编码设备使用。The filtered reconstructed image may be output by the encoding device 100 and used by the encoding device.

通过滤波器单元260滤波的重建图像可作为参考画面被存储在参考画面缓冲器270中。通过滤波器单元260滤波的重建块可以是参考画面的一部分。换句话说,参考画面可以是由通过滤波器单元260滤波的重建块构成的图像。存储的参考画面随后可被用于帧间预测。The reconstructed image filtered by the filter unit 260 may be stored in the reference picture buffer 270 as a reference picture. The reconstructed block filtered by the filter unit 260 may be part of the reference picture. In other words, the reference picture may be an image composed of reconstructed blocks filtered by the filter unit 260 . The stored reference pictures can then be used for inter prediction.

图3是示意性地示出当图像被编码和解码时图像的分区结构的示图。FIG. 3 is a diagram schematically showing a partition structure of an image when the image is encoded and decoded.

图3可示意性地示出单个单元被分区为多个子单元的示例。FIG. 3 may schematically illustrate an example in which a single unit is partitioned into a plurality of subunits.

为了有效地对图像进行分区,可在编码和解码中使用编码单元(CU)。术语“单元”可被用于共同地指定1)包括图像样点的块和2)语法元素。例如,“单元的分区”可表示“与单元对应的块的分区”。To efficiently partition an image, coding units (CUs) may be used in encoding and decoding. The term "unit" may be used to collectively designate 1) a block comprising image samples and 2) a syntax element. For example, "partition of a unit" may mean "partition of a block corresponding to a unit".

CU可被用作用于图像编码/解码的基本单元。CU可被用作在图像编码/解码中从帧内模式和帧间模式选择的一个模式被应用到的单元。换句话说,在图像编码/解码中,可确定帧内模式和帧间模式中的哪一个模式将被应用于每个CU。A CU may be used as a basic unit for image encoding/decoding. The CU may be used as a unit to which one mode selected from the intra mode and the inter mode is applied in image encoding/decoding. In other words, in image encoding/decoding, it may be determined which of the intra mode and the inter mode is to be applied to each CU.

此外,CU可以是对变换系数进行预测、变换、量化、逆变换、反量化和编码/解码的基本单元。Also, a CU may be a basic unit for predicting, transforming, quantizing, inverse transforming, inverse quantizing, and encoding/decoding transform coefficients.

参照图3,图像300可被顺序地分区为与最大编码单元(LCU)对应的单元,并且分区结构可针对每个LCU被确定。这里,LCU可被用于具有与编码树单元(CTU)相同的含义。3, a picture 300 may be sequentially partitioned into units corresponding to a largest coding unit (LCU), and a partition structure may be determined for each LCU. Here, LCU may be used to have the same meaning as Coding Tree Unit (CTU).

对单元进行分区可表示对与单元对应的块进行分区。块分区信息可包括关于单元的深度的深度信息。深度信息可指示单元被分区的次数和/或单元被分区的程度。单个单元可被分层分区为子单元,同时所述单个单元具有基于树结构的深度信息。每个被分区出的子单元可具有深度信息。深度信息可以是指示CU的尺寸的信息。可针对每个CU存储深度信息。Partitioning a cell may mean partitioning a block corresponding to the cell. The block partition information may include depth information about the depth of the unit. The depth information may indicate the number of times the cell is partitioned and/or the extent to which the cell is partitioned. A single unit can be hierarchically partitioned into subunits, while the single unit has depth information based on a tree structure. Each partitioned subunit may have depth information. The depth information may be information indicating the size of the CU. Depth information may be stored for each CU.

每个CU可具有深度信息。当CU被分区时,从分区生成的CU的深度可从被分区的CU的深度增加1。Each CU may have depth information. When a CU is partitioned, the depth of the CU generated from the partition may be increased by 1 from the depth of the partitioned CU.

分区结构可表示LCU 310中的用于对图像进行有效编码的编码单元(CU)的分布。可根据单个CU是否将被分区为多个CU来确定这种分布。通过进行分区而生成的CU的数量可以是正整数2或更大,包括2、3、4、8、16等。根据通过进行分区而生成的CU的数量,通过进行分区而生成的每个CU的水平尺寸和垂直尺寸可小于被分区之前的CU的水平尺寸和垂直尺寸。The partition structure may represent the distribution of coding units (CUs) in the LCU 310 that are used to efficiently encode pictures. This distribution may be determined depending on whether a single CU is to be partitioned into multiple CUs. The number of CUs generated by partitioning may be a positive integer of 2 or more, including 2, 3, 4, 8, 16, and so on. According to the number of CUs generated by partitioning, the horizontal size and vertical size of each CU generated by partitioning may be smaller than those of the CU before being partitioned.

每个分区出的CU可按照相同的方式被递归地分区为四个CU。与被分区之前的CU的水平尺寸和垂直尺寸中的至少一个相比,经由递归分区,每个被分区出的CU的水平尺寸和垂直尺寸中的至少一个可被减小。Each partitioned CU may be recursively partitioned into four CUs in the same manner. Via recursive partitioning, at least one of the horizontal size and the vertical size of each partitioned CU may be reduced compared to at least one of the horizontal size and the vertical size of the CU before being partitioned.

CU的分区可被递归地执行直到预定义的深度或预定义的尺寸为止。例如,CU的深度可具有范围从0到3的值。CU的尺寸范围依据CU的深度而可以是从64×64的尺寸到8×8的尺寸。Partitioning of a CU may be performed recursively up to a predefined depth or a predefined size. For example, the depth of a CU may have a value ranging from 0 to 3. The size of the CU may range from a size of 64×64 to a size of 8×8 depending on the depth of the CU.

例如,LCU的深度可以是0,最小编码单元(SCU)的深度可以是预定义的最大深度。这里,如上所述,LCU可以是具有最大编码单元尺寸的CU,并且SCU可以是具有最小编码单元尺寸的CU。For example, the depth of the LCU may be 0, and the depth of the smallest coding unit (SCU) may be a predefined maximum depth. Here, as described above, the LCU may be the CU having the largest coding unit size, and the SCU may be the CU having the smallest coding unit size.

可在LCU 310处开始进行分区,并且每当CU的水平尺寸和/或垂直尺寸通过进行分区而减小时,CU的深度可增加1。Partitioning may begin at LCU 310, and the depth of the CU may increase by one whenever the horizontal size and/or vertical size of the CU is reduced by partitioning.

例如,对于各个深度,未被分区的CU可具有2N×2N的尺寸。此外,在CU被分区的情况下,尺寸为2N×2N的CU可被分区为尺寸均为N×N的四个CU。每当深度增加1时,N的值可减半。For example, an unpartitioned CU may have a size of 2Nx2N for each depth. Also, in the case where a CU is partitioned, a CU of size 2N×2N may be partitioned into four CUs each of size N×N. The value of N can be halved each time the depth increases by 1.

参照图3,深度为0的LCU可具有64×64个像素或64×64的块。0可以是最小深度。深度为3的SCU可具有8×8个像素或8×8的块。3可以是最大深度。这里,作为LCU的具有64×64的块的CU可用深度0来表示。具有32×32的块的CU可用深度1来表示。具有16×16的块的CU可用深度2来表示。作为SCU的具有8×8的块的CU可用深度3来表示。Referring to FIG. 3 , an LCU with a depth of 0 may have 64×64 pixels or a block of 64×64. 0 can be the minimum depth. A depth-3 SCU may have 8x8 pixels or 8x8 blocks. 3 can be the maximum depth. Here, a CU having a block of 64×64, which is an LCU, may be represented by depth 0. A CU with a block of 32x32 may be represented with a depth of 1. A CU with 16x16 blocks may be represented with a depth of 2. A CU having a block of 8×8 that is an SCU may be represented by a depth of 3.

关于对应CU是否被分区的信息可以用CU的分区信息来表示。分区信息可以是1比特信息。除了SCU之外的所有CU可包括分区信息。例如,未被分区的CU的分区信息的值可以是0。被分区的CU的分区信息的值可以是1。The information on whether the corresponding CU is partitioned may be represented by the partition information of the CU. The partition information may be 1-bit information. All CUs except SCUs may include partition information. For example, the value of partition information of a CU that is not partitioned may be 0. The value of the partition information of the partitioned CU may be 1.

例如,当单个CU被分区为四个CU时,通过进行分区而生成的四个CU中的每个CU的水平尺寸和垂直尺寸可以是在被分区之前的CU的水平尺寸和垂直尺寸的一半。当尺寸为32×32的CU被分区为四个CU时,被分区出的四个CU中的每个CU的尺寸可以是16×16。当单个CU被分区为四个CU时,可认为CU已经以四叉树结构被分区。For example, when a single CU is partitioned into four CUs, the horizontal and vertical sizes of each of the four CUs generated by partitioning may be half of those of the CU before being partitioned. When a CU having a size of 32×32 is partitioned into four CUs, the size of each of the four partitioned CUs may be 16×16. When a single CU is partitioned into four CUs, the CUs may be considered to have been partitioned in a quad-tree structure.

例如,当单个CU被分区成两个CU时,通过进行分区而生成的两个CU中的每个CU的水平尺寸或垂直尺寸可以是在被分区之前的CU的水平尺寸或垂直尺寸的一半。当尺寸为32×32的CU被垂直分区为两个CU时,被分区出的两个CU中的每个CU的尺寸可以是16×32。当尺寸为32×32的CU被水平分区为两个CU时,被分区出的两个CU中的每个CU的尺寸可以是32×16。当单个CU被分区为两个CU时,可认为CU已经以二叉树结构被分区。For example, when a single CU is partitioned into two CUs, the horizontal or vertical size of each of the two CUs generated by partitioning may be half the horizontal or vertical size of the CU before being partitioned. When a CU having a size of 32×32 is vertically partitioned into two CUs, the size of each of the two partitioned CUs may be 16×32. When a CU having a size of 32×32 is horizontally partitioned into two CUs, the size of each of the partitioned two CUs may be 32×16. When a single CU is partitioned into two CUs, the CUs may be considered to have been partitioned in a binary tree structure.

四叉树分区和二叉树分区两者被应用于图3的LCU310。Both quad-tree partitioning and binary-tree partitioning are applied to the LCU 310 of FIG. 3 .

在编码设备100中,可通过递归四叉树结构将尺寸为64×64的编码树单元(CTU)分区为多个更小的CU。单个CU可被分区为具有相同尺寸的四个CU。每个CU可被递归地分区并且可具有四叉树结构。In the encoding apparatus 100, a coding tree unit (CTU) having a size of 64×64 may be partitioned into a plurality of smaller CUs through a recursive quad-tree structure. A single CU may be partitioned into four CUs with the same size. Each CU may be recursively partitioned and may have a quadtree structure.

通过CU的递归分区,可选择引起最小率失真代价的最优分区方法。Through recursive partitioning of the CU, the optimal partitioning method that incurs the least rate-distortion cost can be selected.

图4是示出编码单元(CU)能够包括的预测单元(PU)的形式的示图。FIG. 4 is a diagram illustrating a form of a prediction unit (PU) that a coding unit (CU) can include.

在从LCU分区出的CU中,可将不再被分区的CU划分为一个或更多个预测单元(PU)。这种划分也被称为“分区”。Among the CUs partitioned from the LCU, the CUs that are no longer partitioned may be divided into one or more prediction units (PUs). This division is also called "partitioning".

PU可以是用于预测的基本单元。PU可在跳过模式、帧间模式和帧内模式中的任意一个模式下被编码和解码。可根据各个模式将PU分区为各种形状。例如,以上参照图1描述的目标块和以上参照图2描述的目标块可以均是PU。A PU may be a basic unit for prediction. The PU may be encoded and decoded in any one of skip mode, inter mode, and intra mode. The PU can be partitioned into various shapes according to each mode. For example, the target block described above with reference to FIG. 1 and the target block described above with reference to FIG. 2 may both be PUs.

CU可不被划分为PU。当CU不被划分为PU时,CU的尺寸和PU的尺寸可彼此相等。A CU may not be divided into PUs. When the CU is not divided into PUs, the size of the CU and the size of the PU may be equal to each other.

在跳过模式下,在CU中可不存在分区。在跳过模式下,可支持2N×2N模式410,而不进行分区,其中,在2N×2N模式410中,PU的尺寸和CU的尺寸彼此相同。In skip mode, there may be no partitions in the CU. In skip mode, the 2Nx2N mode 410, in which the size of the PU and the size of the CU are the same as each other, may be supported without partitioning.

在帧间模式下,在CU中可存在8种类型的分区形状。例如,在帧间模式下,可支持2N×2N模式410、2N×N模式415、N×2N模式420、N×N模式425、2N×nU模式430、2N×nD模式435、nL×2N模式440和nR×2N模式445。In inter mode, there can be 8 types of partition shapes in a CU. For example, in inter mode, 2N×2N mode 410, 2N×N mode 415, N×2N mode 420, N×N mode 425, 2N×nU mode 430, 2N×nD mode 435, nL×2N mode can be supported 440 and nR×2N mode 445.

在帧内模式下,可支持2N×2N模式410和N×N模式425。In intra mode, 2Nx2N mode 410 and NxN mode 425 may be supported.

在2N×2N模式410下,可对尺寸为2N×2N的PU进行编码。尺寸为2N×2N的PU可表示尺寸与CU的尺寸相同的PU。例如,尺寸为2N×2N的PU可具有尺寸64×64、32×32、16×16或8×8。In 2Nx2N mode 410, a PU of size 2Nx2N may be encoded. A PU of size 2N×2N may represent a PU of the same size as that of a CU. For example, a PU of size 2Nx2N may have size 64x64, 32x32, 16x16, or 8x8.

在N×N模式425下,可对尺寸为N×N的PU进行编码。In NxN mode 425, a PU of size NxN may be encoded.

例如,在帧内预测中,当PU的尺寸是8×8时,可对四个分区出的PU进行编码。每个分区出的PU的尺寸可以是4×4。For example, in intra prediction, when the size of the PU is 8×8, four partitioned PUs may be encoded. The size of each partitioned PU may be 4×4.

当在帧内模式下对PU进行编码时,可使用多个帧内预测模式中的任意一个对PU进行编码。例如,HEVC技术可提供35个帧内预测模式,PU可在所述35个帧内预测模式中的任意一个下被编码。When the PU is encoded in intra mode, the PU may be encoded using any one of multiple intra prediction modes. For example, HEVC techniques may provide 35 intra-prediction modes under which a PU may be encoded.

可基于率失真代价来确定2N×2N模式410和N×N模式425中的哪一个模式将被用于对PU进行编码。Which of the 2Nx2N mode 410 and the NxN mode 425 will be used to encode the PU may be determined based on a rate-distortion cost.

编码设备100可对尺寸为2N×2N的PU执行编码操作。这里,编码操作可以是在能够被编码设备100使用的多个帧内预测模式中的每个模式下对PU进行编码的操作。通过编码操作,可推导出用于尺寸为2N×2N的PU的最佳帧内预测模式。该最佳帧内预测模式可以是能够被编码设备100使用的多个帧内预测模式之中的在对尺寸为2N×2N的PU进行编码时出现最小率失真代价的帧内预测模式。The encoding apparatus 100 may perform an encoding operation on a PU having a size of 2N×2N. Here, the encoding operation may be an operation of encoding the PU in each of a plurality of intra prediction modes that can be used by the encoding apparatus 100 . Through encoding operations, the optimal intra prediction mode for a PU of size 2Nx2N can be derived. The optimal intra prediction mode may be an intra prediction mode in which the smallest rate-distortion cost occurs when encoding a PU of size 2N×2N among a plurality of intra prediction modes that can be used by the encoding apparatus 100 .

此外,编码设备100可顺序地对通过进行N×N分区而获得的各个PU执行编码操作。这里,编码操作可以是在能够被编码设备100使用的多个帧内预测模式中的每个模式下对PU进行编码的操作。通过编码操作,可推导出用于尺寸为N×N的PU的最佳帧内预测模式。该最佳帧内预测模式可以是能够被编码设备100使用的多个帧内预测模式之中的在对尺寸为N×N的PU进行编码时出现最小率失真代价的帧内预测模式。Also, the encoding apparatus 100 may sequentially perform encoding operations on respective PUs obtained by performing N×N partitioning. Here, the encoding operation may be an operation of encoding the PU in each of a plurality of intra prediction modes that can be used by the encoding apparatus 100 . Through encoding operations, the optimal intra prediction mode for a PU of size NxN can be derived. The optimal intra prediction mode may be an intra prediction mode in which the smallest rate-distortion cost occurs when encoding a PU of size N×N among a plurality of intra prediction modes that can be used by the encoding apparatus 100 .

编码设备100可基于尺寸为2N×2N的PU的率失真代价与尺寸为N×N的PU的率失真代价之间的比较来确定尺寸为2N×2N的PU和尺寸为N×N的PU中的哪一个将被编码。The encoding apparatus 100 may determine, based on a comparison between a rate-distortion cost of a PU of size 2N×2N and a rate-distortion cost of a PU of size N×N, among a PU of size 2N×2N and a PU of size N×N which will be encoded.

单个CU可被分区为一个或更多个PU,并且PU可被分区多个PU。A single CU may be partitioned into one or more PUs, and a PU may be partitioned into multiple PUs.

例如,当单个PU被分区为四个PU时,通过分区生成的四个PU中的每一个的水平尺寸和垂直尺寸可以是被分区之前的PU的水平尺寸和垂直尺寸的一半。当尺寸为32×32的PU被分区为四个PU时,四个被分区出的PU中的每一个的尺寸可以是16×16。当单个PU被分区为四个PU时,可认为PU已经以四叉树结构被分区。For example, when a single PU is partitioned into four PUs, the horizontal and vertical sizes of each of the four PUs generated by partitioning may be half of those of the PU before being partitioned. When a PU of size 32x32 is partitioned into four PUs, the size of each of the four partitioned PUs may be 16x16. When a single PU is partitioned into four PUs, the PUs can be considered to have been partitioned in a quad-tree structure.

例如,当单个PU被分区为两个PU时,通过分区生成的两个PU中的每一个的水平尺寸或垂直尺寸可以是被分区之前的PU的水平尺寸或垂直尺寸的一半。当尺寸为32×32的PU被垂直分区为两个PU时,两个被分区出的PU中的每一个的尺寸可以是16×32。当尺寸为32×32的PU被水平分区为两个PU时,两个被分区出的PU中的每一个的尺寸可以是32×16。当单个PU被分区为两个PU时,可认为PU已经以二叉树结构被分区。For example, when a single PU is partitioned into two PUs, the horizontal or vertical size of each of the two PUs generated by partitioning may be half the horizontal or vertical size of the PU before being partitioned. When a PU of size 32x32 is vertically partitioned into two PUs, the size of each of the two partitioned PUs may be 16x32. When a PU of size 32x32 is horizontally partitioned into two PUs, the size of each of the two partitioned PUs may be 32x16. When a single PU is partitioned into two PUs, the PUs can be considered to have been partitioned in a binary tree structure.

图5是示出能够被包括在CU中的变换单元(TU)的形式的示图。FIG. 5 is a diagram illustrating a form of a transform unit (TU) that can be included in a CU.

变换单元(TU)可以是CU中被用于诸如变换、量化、逆变换、反量化、熵编码和熵解码的过程的基本单元。A transform unit (TU) may be a basic unit in a CU that is used for processes such as transform, quantization, inverse transform, inverse quantization, entropy encoding, and entropy decoding.

TU可具有正方形形状或矩形形状。可基于CU的尺寸和/或形状来确定TU的形状。A TU may have a square shape or a rectangular shape. The shape of the TU may be determined based on the size and/or shape of the CU.

在从LCU分区出的CU中,可将不再被分区为CU的CU分区为一个或更多个TU。这里,TU的分区结构可以是四叉树结构。例如,如图5中所示,可根据四叉树结构将单个CU 510分区一次或更多次。通过这种分区,单个CU 510可由具有各种尺寸的TU组成。In a CU partitioned from an LCU, a CU that is no longer partitioned as a CU may be partitioned into one or more TUs. Here, the partition structure of the TU may be a quadtree structure. For example, as shown in FIG. 5, a single CU 510 may be partitioned one or more times according to a quadtree structure. Through such partitioning, a single CU 510 may be composed of TUs of various sizes.

可认为当单个CU被划分两次或更多次时CU被递归地划分。通过划分,单个CU可由具有各种尺寸的变换单元(TU)组成。It can be considered that a CU is recursively divided when a single CU is divided two or more times. Through partitioning, a single CU may be composed of transform units (TUs) with various sizes.

可选地,可基于划分CU的垂直线和/或水平线的数量将单个CU划分为一个或更多个TU。Optionally, a single CU may be divided into one or more TUs based on the number of vertical and/or horizontal lines dividing the CU.

CU可被划分为对称TU或非对称TU。为了划分为非对称TU,可将关于每个TU的尺寸和/或形状的信息从编码设备100用信号发送到解码设备200。可选地,可从关于CU的尺寸和/或形状的信息推导每个TU的尺寸和/或形状。CUs can be divided into symmetric TUs or asymmetric TUs. In order to divide into asymmetric TUs, information about the size and/or shape of each TU may be signaled from the encoding apparatus 100 to the decoding apparatus 200 . Optionally, the size and/or shape of each TU may be derived from information about the size and/or shape of the CU.

CU可不被划分为TU。当CU不被划分为TU时,CU的尺寸和TU的尺寸可彼此相等。A CU may not be divided into TUs. When the CU is not divided into TUs, the size of the CU and the size of the TU may be equal to each other.

单个CU可被分区为一个或更多个TU,并且TU可被分区为多个TU。A single CU may be partitioned into one or more TUs, and a TU may be partitioned into multiple TUs.

例如,当单个TU被分区为四个TU时,通过分区生成的四个TU中的每一个的水平尺寸和垂直尺寸可以是被分区之前的TU的水平尺寸和垂直尺寸的一半。当尺寸为32×32的TU被分区为四个TU时,四个被分区出的TU中的每一个的尺寸可以是16×16。当单个TU被分区为四个TU时,可认为TU已经以四叉树结构被分区。For example, when a single TU is partitioned into four TUs, the horizontal and vertical sizes of each of the four TUs generated by partitioning may be half of those of the TU before being partitioned. When a TU of size 32x32 is partitioned into four TUs, the size of each of the four partitioned TUs may be 16x16. When a single TU is partitioned into four TUs, the TUs can be considered to have been partitioned in a quad-tree structure.

例如,当单个TU被分区为两个TU时,通过分区生成的两个TU中的每一个的水平尺寸或垂直尺寸可以是被分区之前的TU的水平尺寸或垂直尺寸的一半。当尺寸为32×32的TU被垂直分区为两个TU时,两个被分区出的TU中的每一个的尺寸可以是16×32。当尺寸为32×32的TU被水平分区为两个TU时,两个被分区出的TU中的每一个的尺寸可以是32×16。当单个TU被分区为两个TU时,可认为TU已经以二叉树结构被分区。For example, when a single TU is partitioned into two TUs, the horizontal or vertical size of each of the two TUs generated by partitioning may be half the horizontal or vertical size of the TU before being partitioned. When a TU of size 32x32 is vertically partitioned into two TUs, the size of each of the two partitioned TUs may be 16x32. When a TU of size 32x32 is horizontally partitioned into two TUs, the size of each of the two partitioned TUs may be 32x16. When a single TU is partitioned into two TUs, the TUs can be considered to have been partitioned in a binary tree structure.

可以以与图5中所示的方式不同的方式对CU进行划分。The CUs may be divided in a different manner than that shown in FIG. 5 .

例如,可将单个CU划分为三个CU。通过划分生成的三个CU的水平尺寸或垂直尺寸可分别是被划分之前的原始CU的水平尺寸或垂直尺寸的1/4、1/2和1/4。For example, a single CU may be divided into three CUs. The horizontal size or vertical size of the three CUs generated by the division may be 1/4, 1/2, and 1/4 of the horizontal size or vertical size of the original CU before being divided, respectively.

例如,当尺寸为32×32的CU被垂直划分为三个CU时,通过划分生成的三个CU的尺寸可分别是8×32、16×32和8×32。以这种方式,当单个CU被划分为三个CU时,可认为该CU以三叉树的形式被划分。For example, when a CU having a size of 32×32 is vertically divided into three CUs, the sizes of the three CUs generated by the division may be 8×32, 16×32, and 8×32, respectively. In this way, when a single CU is divided into three CUs, the CU can be considered to be divided in the form of a ternary tree.

可将示例性划分形式(即,四叉树划分、二叉树划分和三叉树划分)中的一个应用于CU的划分,并且可将多种划分方案组合并一起用于CU的划分。这里,将多种划分方案组合并一起使用的情况可被称为“复合树格式划分”。One of the exemplary partitioning forms (ie, quadtree partitioning, binary tree partitioning, and ternary tree partitioning) may be applied to the partitioning of a CU, and multiple partitioning schemes may be combined and used together for the partitioning of a CU. Here, the case where a plurality of division schemes are combined and used together may be referred to as "compound tree format division".

图6示出根据示例的块的划分。FIG. 6 shows the division of blocks according to an example.

在视频编码和/或解码处理中,如图6中所示,目标块可被划分。In the video encoding and/or decoding process, as shown in FIG. 6, target blocks may be divided.

对于目标块的划分,指示划分信息的指示符可从编码设备100被用信号发送到解码设备200。划分信息可以是指示目标块如何被划分的信息。For the division of the target block, an indicator indicating division information may be signaled from the encoding apparatus 100 to the decoding apparatus 200 . The division information may be information indicating how the target block is divided.

划分信息可以是划分标志(以下称为“split_flag”)、四-二进制标志(以下称为“QB_flag”)、四叉树标志(以下称为“quadtree_flag”)、二叉树标志(以下称为“binarytree_flag”)和二进制类型标志(以下称为“Btype_flag”)中的一个或更多个。The division information may be a division flag (hereinafter referred to as "split_flag"), a quad-binary flag (hereinafter referred to as "QB_flag"), a quadtree flag (hereinafter referred to as "quadtree_flag"), a binary tree flag (hereinafter referred to as "binarytree_flag") ) and one or more of a binary type flag (hereinafter referred to as "Btype_flag").

“split_flag”可以是指示块是否被划分的标志。例如,split_flag值为1可指示对应块被划分。split_flag值为0可指示对应块不被划分。'split_flag' may be a flag indicating whether the block is divided. For example, a split_flag value of 1 may indicate that the corresponding block is split. A split_flag value of 0 may indicate that the corresponding block is not split.

“QB_flag”可以是指示四叉树形式和二叉树形式中的哪一个与块被划分的形状对应的标志。例如,QB_flag值为0可指示块以四叉树形式被划分。QB_flag值为1可指示块以二叉树形式被划分。可选地,QB_flag值为0可指示块以二叉树形式被划分。QB_flag值为1可指示块以四叉树形式被划分。"QB_flag" may be a flag indicating which of the quad-tree form and the binary-tree form corresponds to the shape in which the block is divided. For example, a QB_flag value of 0 may indicate that the block is partitioned in a quadtree. A QB_flag value of 1 may indicate that the block is partitioned in a binary tree. Optionally, a QB_flag value of 0 may indicate that the block is divided in a binary tree. A QB_flag value of 1 may indicate that the block is partitioned in a quadtree.

“quadtree_flag”可以是指示块是否以四叉树形式被划分的标志。例如,quadtree_flag值为1可指示块以四叉树形式被划分。quadtree_flag值为0可指示块不以四叉树形式被划分。'quadtree_flag' may be a flag indicating whether a block is divided in a quadtree form. For example, a quadtree_flag value of 1 may indicate that the block is divided in a quadtree. A quadtree_flag value of 0 may indicate that the block is not partitioned in a quadtree.

“binarytree_flag”可以是指示块是否以二叉树形式被划分的标志。例如,binarytree_flag值为1可指示块以二叉树形式被划分。binarytree_flag值为0可指示块不以二叉树形式被划分。"binarytree_flag" may be a flag indicating whether the block is divided in a binary tree form. For example, a binarytree_flag value of 1 may indicate that the block is divided in a binary tree. A binarytree_flag value of 0 may indicate that the block is not divided in a binary tree.

“Btype_flag”可以是指示当块以二叉树形式被划分时垂直划分和水平划分中的哪一个与划分方向对应的标志。例如,Btype_flag值为0可指示块在水平方向上被划分。Btype_flag值为1可指示块在垂直方向上被划分。可选地,Btype_flag值为0可指示块在垂直方向上被划分。Btype_flag值为1可指示块在水平方向上被划分。'Btype_flag' may be a flag indicating which of vertical division and horizontal division corresponds to the division direction when the block is divided in a binary tree form. For example, a Btype_flag value of 0 may indicate that the block is divided in the horizontal direction. A Btype_flag value of 1 may indicate that the block is divided in the vertical direction. Alternatively, a Btype_flag value of 0 may indicate that the block is divided in the vertical direction. A Btype_flag value of 1 may indicate that the block is divided in the horizontal direction.

例如,可通过用信号发送quadtree_flag、binarytree_flag和Btype_flag中的至少一个来推导图6中的块的划分信息,如下表1中所示。For example, partition information for the block in FIG. 6 may be derived by signaling at least one of quadtree_flag, binarytree_flag, and Btype_flag, as shown in Table 1 below.

表1Table 1

Figure BDA0002623340100000301
Figure BDA0002623340100000301

例如,可通过用信号发送split_flag、QB_flag和Btype_flag中的至少一个来推导图6中的块的划分信息,如下表2中所示。For example, the split information for the block in FIG. 6 may be derived by signaling at least one of split_flag, QB_flag, and Btype_flag, as shown in Table 2 below.

表2Table 2

Figure BDA0002623340100000311
Figure BDA0002623340100000311

划分方法可根据块的尺寸和/或形状而仅限于四叉树或二叉树。当这个限制被应用时,split_flag可以是指示块是否以四叉树形式被划分的标志或者指示块是否以二叉树形式被划分的标志。可根据块的深度信息来推导块的尺寸和形状,并且深度信息可从编码设备100被用信号发送到解码设备200。The partitioning method may be limited to quadtree or binary tree depending on the size and/or shape of the block. When this restriction is applied, split_flag may be a flag indicating whether the block is divided in the form of a quad-tree or a flag indicating whether the block is divided in the form of a binary tree. The size and shape of the block may be derived from the depth information of the block, and the depth information may be signaled from the encoding apparatus 100 to the decoding apparatus 200 .

当块的尺寸落在特定范围内时,仅以四叉树形式进行划分是可能的。例如,可由能够仅以四叉树形式进行划分的最大块尺寸和最小块尺寸中的至少一个来定义所述特定范围。Partitioning in the form of a quadtree is only possible when the size of the block falls within a certain range. For example, the specific range may be defined by at least one of a maximum block size and a minimum block size that can be divided only in the form of a quadtree.

可通过比特流将指示能够仅以四叉树形式进行划分的最大块尺寸和最小块尺寸的信息从编码设备100用信号发送到解码设备200。此外,可针对诸如视频、序列、画面和条带(或片段)的单元中的至少一个用信号发送这个信息。Information indicating the maximum block size and the minimum block size that can be divided only in the form of a quadtree may be signaled from the encoding apparatus 100 to the decoding apparatus 200 through a bitstream. Furthermore, this information may be signaled for at least one of units such as videos, sequences, pictures, and slices (or segments).

可选地,最大块尺寸和/或最小块尺寸可以是由编码设备100和解码设备200预先定义的固定尺寸。例如,当块的尺寸大于64×64且小于256×256时,仅以四叉树形式进行划分是可能的。在这种情况下,split_flag可以是指示是否执行以四叉树形式的划分的标志。Alternatively, the maximum block size and/or the minimum block size may be fixed sizes predefined by the encoding apparatus 100 and the decoding apparatus 200 . For example, when the size of the block is larger than 64x64 and smaller than 256x256, only division in the form of a quadtree is possible. In this case, split_flag may be a flag indicating whether to perform division in a quadtree form.

当块的尺寸落入特定范围内时,仅以二叉树形式进行划分是可能的。例如,可由能够仅以二叉树形式进行划分的最大块尺寸和最小块尺寸中的至少一个来定义所述特定范围。Partitioning in binary tree form is only possible when the size of the block falls within a certain range. For example, the specific range may be defined by at least one of a maximum block size and a minimum block size that can be divided only in the form of a binary tree.

可通过比特流将指示能够仅以二叉树形式进行划分的最大块尺寸和/或最小块尺寸的信息从编码设备100用信号发送到解码设备200。此外,可针对诸如序列、画面和条带(或片段)的单元中的至少一个用信号发送这个信息。Information indicating the maximum block size and/or the minimum block size that can be divided only in a binary tree form may be signaled from the encoding apparatus 100 to the decoding apparatus 200 through a bitstream. Furthermore, this information may be signaled for at least one of units such as sequences, pictures, and slices (or slices).

可选地,最大块尺寸和/或最小块尺寸可以是由编码设备100和解码设备200预先定义的固定尺寸。例如,当块的尺寸大于8×8且小于16×16时,仅以二叉树形式进行划分是可能的。在这种情况下,split_flag可以是指示是否执行以二叉树形式的划分的标志。Alternatively, the maximum block size and/or the minimum block size may be fixed sizes predefined by the encoding apparatus 100 and the decoding apparatus 200 . For example, when the size of the block is larger than 8×8 and smaller than 16×16, only division in the form of a binary tree is possible. In this case, split_flag may be a flag indicating whether to perform division in the form of a binary tree.

块的划分可受到先前划分的限制。例如,当块以二叉树形式被划分并生成了多个分区块时,可仅以二叉树形式另外划分每个分区块。The division of blocks may be restricted by previous divisions. For example, when a block is divided in the form of a binary tree and a plurality of divided blocks are generated, each divided block may be additionally divided only in the form of a binary tree.

当分区块的水平尺寸或垂直尺寸是不能被进一步划分的尺寸时,可不用信号发送上述指示符。When the horizontal size or vertical size of the partition block is a size that cannot be further divided, the above-mentioned indicator may not be signaled.

图7是用于解释帧内预测处理的实施例的示图。FIG. 7 is a diagram for explaining an embodiment of intra prediction processing.

从图7中的图的中心径向延伸的箭头表示帧内预测模式的预测方向。此外,出现在箭头附近的数字指示被分配给帧内预测模式或被分配给帧内预测模式的预测方向的模式值的示例。Arrows extending radially from the center of the graph in FIG. 7 indicate the prediction directions of the intra prediction modes. Also, numbers appearing near the arrows indicate examples of mode values assigned to the intra prediction mode or assigned to the prediction direction of the intra prediction mode.

可使用与目标块邻近的块的参考样点来执行帧内编码和/或解码。邻近块可以是邻近的重建块。例如,可使用在每个邻近的重建块中包括的参考样点的值或者邻近的重建块的编码参数来执行帧内编码和/或解码。Intra-coding and/or decoding may be performed using reference samples of blocks adjacent to the target block. Neighboring blocks may be neighboring reconstructed blocks. For example, intra-frame encoding and/or decoding may be performed using values of reference samples included in each adjacent reconstructed block or encoding parameters of adjacent reconstructed blocks.

编码设备100和/或解码设备200可通过基于关于目标图像中的样点的信息对目标块执行帧内预测来生成预测块。当帧内预测被执行时,编码设备100和/或解码设备200可通过基于关于目标图像中的样点的信息执行帧内预测来生成针对目标块的预测块。当帧内预测被执行时,编码设备100和/或解码设备200可基于至少一个重建的参考样点来执行方向预测和/或非方向预测。The encoding apparatus 100 and/or the decoding apparatus 200 may generate the prediction block by performing intra prediction on the target block based on the information on the samples in the target image. When intra prediction is performed, the encoding apparatus 100 and/or the decoding apparatus 200 may generate a prediction block for the target block by performing intra prediction based on information about samples in the target image. When intra prediction is performed, the encoding apparatus 100 and/or the decoding apparatus 200 may perform directional prediction and/or non-directional prediction based on at least one reconstructed reference sample.

预测块可以是作为执行帧内预测的结果而生成的块。预测块可对应于CU、PU和TU中的至少一个。The prediction block may be a block generated as a result of performing intra prediction. A prediction block may correspond to at least one of a CU, PU, and TU.

预测块的单元可具有与CU、PU和TU中的至少一个对应的尺寸。预测块可具有尺寸为2N×2N或N×N的正方形形状。尺寸N×N可包括尺寸4×4、8×8、16×16、32×32、64×64等。A unit of a prediction block may have a size corresponding to at least one of a CU, PU, and TU. The prediction block may have a square shape of size 2Nx2N or NxN. Dimensions NxN may include dimensions 4x4, 8x8, 16x16, 32x32, 64x64, and the like.

可选地,预测块可以是尺寸为2×2、4×4、8×8、16×16、32×32、64×64等的正方形块或者尺寸为2×8、4×8、2×16、4×16、8×16等的矩形块。Alternatively, the prediction block may be a square block of size 2x2, 4x4, 8x8, 16x16, 32x32, 64x64, etc. or a size of 2x8, 4x8, 2x Rectangular blocks of 16, 4x16, 8x16, etc.

可考虑用于目标块的帧内预测模式执行帧内预测。目标块可具有的帧内预测模式的数量可以是预定义的固定值,并且可以是根据预测块的属性不同地确定的值。例如,预测块的属性可包括预测块的尺寸、预测块的类型等。Intra prediction may be performed considering the intra prediction mode for the target block. The number of intra prediction modes that the target block may have may be a predefined fixed value, and may be a value determined differently according to properties of the prediction block. For example, the properties of the prediction block may include the size of the prediction block, the type of the prediction block, and the like.

例如,不论预测块的尺寸如何,帧内预测模式的数量都可被固定为35。可选地,帧内预测模式的数量可以是例如3、5、9、17、34、35或36。For example, the number of intra prediction modes may be fixed to 35 regardless of the size of the prediction block. Alternatively, the number of intra prediction modes may be, for example, 3, 5, 9, 17, 34, 35 or 36.

帧内预测模式可以是非方向模式或方向模式。例如,如图7中所示,帧内预测模式可包括两种非方向模式和33种方向模式。The intra prediction mode may be a non-directional mode or a directional mode. For example, as shown in FIG. 7, the intra prediction modes may include two non-directional modes and 33 directional modes.

所述两种非方向模式可包括DC模式和平面模式。The two non-directional modes may include DC mode and planar mode.

所述方向模式可以是具有特定方向或特定角度的模式。The directional pattern may be a pattern with a specific direction or a specific angle.

帧内预测模式皆可用模式编号、模式值和模式角度中的至少一个来表示。帧内预测模式的数量可以是M。M的值可以是1或更大。换句话说,帧内预测模式的数量可以是M,其中,M包括非方向模式的数量和方向模式的数量。All intra prediction modes can be represented by at least one of a mode number, a mode value, and a mode angle. The number of intra prediction modes may be M. The value of M can be 1 or greater. In other words, the number of intra prediction modes may be M, where M includes the number of non-directional modes and the number of directional modes.

帧内预测模式的数量可被固定为M,而不管块的尺寸和/或颜色分量如何。例如,帧内预测模式的数量可被固定为35和67中的任意一个,而不管块的尺寸如何。The number of intra prediction modes may be fixed to M regardless of the size and/or color components of the block. For example, the number of intra prediction modes may be fixed to any one of 35 and 67 regardless of the size of the block.

可选地,帧内预测模式的数量可根据块的尺寸和/或颜色分量的类型而不同。Alternatively, the number of intra prediction modes may vary according to the size of the block and/or the type of color component.

例如,块的尺寸越大,帧内预测模式的数量越多。可选地,块的尺寸越大,帧内预测模式的数量越少。当块的尺寸是4×4或8×8时,帧内预测模式的数量可以是67。当块的尺寸是16×16时,帧内预测模式的数量可以是35。当块的尺寸是32×32时,帧内预测模式的数量可以是19。当块的尺寸是64×64时,帧内预测模式的数量可以是7。For example, the larger the size of the block, the larger the number of intra prediction modes. Optionally, the larger the size of the block, the smaller the number of intra prediction modes. When the size of the block is 4×4 or 8×8, the number of intra prediction modes may be 67. When the size of the block is 16×16, the number of intra prediction modes may be 35. When the size of the block is 32×32, the number of intra prediction modes may be 19. When the size of the block is 64×64, the number of intra prediction modes may be seven.

例如,帧内预测模式的数量可根据颜色分量是亮度信号还是色度信号而不同。可选地,与亮度分量块对应的帧内预测模式的数量可大于与色度分量块对应的帧内预测模式的数量。For example, the number of intra prediction modes may differ depending on whether the color component is a luminance signal or a chrominance signal. Alternatively, the number of intra prediction modes corresponding to luma component blocks may be greater than the number of intra prediction modes corresponding to chroma component blocks.

例如,在模式值为26的垂直模式中,可基于参考样点的像素值沿垂直方向执行预测。例如,在模式值为10的水平模式中,可基于参考样点的像素值沿水平方向执行预测。For example, in a vertical mode with a mode value of 26, prediction may be performed in the vertical direction based on pixel values of reference samples. For example, in a horizontal mode with a mode value of 10, prediction may be performed in the horizontal direction based on pixel values of reference samples.

即使在除了上述模式之外的方向模式中,编码设备100和解码设备200仍可使用依据与方向模式对应的角度的参考样点对目标单元执行帧内预测。Even in a direction mode other than the above-described modes, the encoding apparatus 100 and the decoding apparatus 200 may perform intra prediction on the target unit using reference samples according to angles corresponding to the direction mode.

位于相对于垂直模式的右侧的帧内预测模式可被称为“垂直-右侧模式”。位于水平模式下方的帧内预测模式可被称为“水平-下方模式”。例如,在图7中,模式值是27、28、29、30、31、32、33和34之一的帧内预测模式可以是垂直-右侧模式613。模式值是2、3、4、5、6、7、8和9之一的帧内预测模式可以是水平-下方模式616。The intra prediction mode located on the right side with respect to the vertical mode may be referred to as "vertical-right mode". The intra prediction mode located below the horizontal mode may be referred to as a "horizontal-down mode". For example, in FIG. 7 , the intra prediction mode whose mode value is one of 27, 28, 29, 30, 31, 32, 33, and 34 may be the vertical-right mode 613. The intra prediction mode whose mode value is one of 2, 3, 4, 5, 6, 7, 8, and 9 may be the horizontal-down mode 616 .

非方向模式可包括DC模式和平面模式。例如,DC模式的值可以是1。平面模式的值可以是0。Non-directional modes may include DC mode and planar mode. For example, the value of DC mode may be 1. The value of flat mode can be 0.

方向模式可包括角度模式。在多个帧内预测模式中,除了DC模式和平面模式之外的其余模式可以是方向模式。Orientation patterns may include angle patterns. Among the plurality of intra prediction modes, the remaining modes other than the DC mode and the planar mode may be directional modes.

当帧内预测模式是DC模式时,可基于多个参考像素的像素值的平均值生成预测块。例如,可基于多个参考像素的像素值的平均值确定预测块的像素的值。When the intra prediction mode is the DC mode, the prediction block may be generated based on an average value of pixel values of a plurality of reference pixels. For example, the value of the pixel of the prediction block may be determined based on an average value of pixel values of a plurality of reference pixels.

以上描述的帧内预测模式的数量以及各个帧内预测模式的模式值仅是示例性的。可根据实施例、实现方式和/或要求来不同地定义以上描述的帧内预测模式的数量以及各个帧内预测模式的模式值。The number of intra prediction modes and the mode value of each intra prediction mode described above are only exemplary. The number of intra-prediction modes described above and the mode value of each intra-prediction mode may be differently defined according to embodiments, implementations, and/or requirements.

为了对目标块执行帧内预测,可执行检查在重建的邻近块中包括的样点是否可被用作目标块的参考样点的步骤。当在邻近块中的样点之中存在不能被用作目标块的参考样点的样点时,经由使用在重建的邻近块中包括的样点之中的至少一个样点值的插值和/或复制而生成的值可替换不能被用作参考样点的样点的样点值。当经由复制和/或插值而生成的值替换现有样点的样点值时,该样点可被用作目标块的参考样点。In order to perform intra prediction on the target block, a step of checking whether samples included in the reconstructed adjacent blocks can be used as reference samples for the target block may be performed. When a sample that cannot be used as a reference sample of the target block exists among the samples in the neighboring block, via interpolation using at least one sample value among the samples included in the reconstructed neighboring block and/or Or copy the resulting value to replace the sample value of a sample that cannot be used as a reference sample. When a sample value of an existing sample is replaced by a value generated via copying and/or interpolation, that sample may be used as a reference sample for the target block.

在帧内预测中,可基于帧内预测模式和目标块的尺寸中的至少一个将滤波器应用于参考样点和预测样点中的至少一个。In intra prediction, a filter may be applied to at least one of reference samples and prediction samples based on at least one of an intra prediction mode and a size of a target block.

将被应用于参考样点和预测样点中的至少一个的滤波器的类型可根据目标块的帧内预测模式、目标块的尺寸和目标块的形状中的至少一个而不同。滤波器的类型可根据滤波器抽头的数量、滤波器系数的值和滤波器强度中的一个或更多个来分类。The type of filter to be applied to at least one of the reference samples and the prediction samples may be different according to at least one of the intra prediction mode of the target block, the size of the target block, and the shape of the target block. The types of filters can be classified according to one or more of the number of filter taps, the value of the filter coefficients, and the filter strength.

当帧内预测模式是平面模式时,可在生成目标块的预测块时根据预测目标样点在预测块中的位置使用目标块的上参考样点、目标块的左参考样点、目标块的右上参考样点和目标块的左下参考样点的加权和来生成预测目标块的样点值。When the intra prediction mode is the plane mode, the upper reference sample of the target block, the left reference sample of the target block, the A weighted sum of the upper right reference samples and the lower left reference samples of the target block is used to generate the sample value of the prediction target block.

当帧内预测模式是DC模式时,可在生成目标块的预测块时使用目标块上方的参考样点和目标块左侧的参考样点的平均值。此外,可对目标块中的特定行或特定列执行使用参考样点的值的滤波。所述特定行可以是与参考样点相邻的一个或更多个上方行。所述特定列可以是与参考样点相邻的一个或更多个左侧列。When the intra prediction mode is the DC mode, the average value of the reference samples above the target block and the reference samples to the left of the target block may be used when generating the prediction block of the target block. Also, filtering using the values of the reference samples may be performed on a specific row or a specific column in the target block. The particular row may be one or more upper rows adjacent to the reference sample. The particular column may be one or more left columns adjacent to the reference sample.

当帧内预测模式是方向模式时,可使用目标块的上参考样点、左参考样点、右上参考样点和/或左下参考样点来生成预测块。When the intra prediction mode is the directional mode, the prediction block may be generated using upper reference samples, left reference samples, upper right reference samples and/or lower left reference samples of the target block.

为了生成上述预测样点,可执行基于实数的插值。In order to generate the predicted samples described above, real-number based interpolation may be performed.

可从与目标块相邻的邻近块的帧内预测模式预测目标块的帧内预测模式,并且用于预测的信息可被熵编码/熵解码。The intra prediction mode of the target block may be predicted from the intra prediction modes of neighboring blocks adjacent to the target block, and information for prediction may be entropy encoded/entropy decoded.

例如,当目标块和邻近块的帧内预测模式彼此相同时,可使用预定义的标志来用信号传送目标块和邻近块的帧内预测模式是相同的。For example, when the intra prediction modes of the target block and the adjacent blocks are the same as each other, a predefined flag may be used to signal that the intra prediction modes of the target block and the adjacent blocks are the same.

例如,可用信号传送用于指示在多个邻近块的帧内预测模式之中与目标块的帧内预测模式相同的帧内预测模式的指示符。For example, an indicator for indicating the same intra prediction mode as the intra prediction mode of the target block among the intra prediction modes of a plurality of adjacent blocks may be signaled.

当目标块和邻近块的帧内预测模式彼此不同时,可使用熵编码和/或熵解码对关于目标块的帧内预测模式的信息进行编码和/或解码。When the intra prediction modes of the target block and the neighboring blocks are different from each other, the information on the intra prediction mode of the target block may be encoded and/or decoded using entropy encoding and/or entropy decoding.

图8是用于解释在帧内预测过程中使用的参考样点的位置的示图。FIG. 8 is a diagram for explaining the positions of reference samples used in the intra prediction process.

图8示出了用于对目标块进行帧内预测的参考样点的位置。参照图8,用于对目标块进行帧内预测的重建参考样点可包括左下参考样点831、左参考样点833、左上角参考样点835、上参考样点837和右上参考样点839。FIG. 8 shows the positions of reference samples used for intra prediction of the target block. Referring to FIG. 8 , the reconstructed reference samples used for intra prediction of the target block may include a lower left reference sample 831, a left reference sample 833, an upper left reference sample 835, an upper reference sample 837, and an upper right reference sample 839 .

例如,左参考样点833可表示与目标块的左侧相邻的重建参考像素。上参考样点837可表示与目标块的顶部相邻的重建参考像素。左上角参考样点835可表示位于目标块的左上角处的重建参考像素。左下参考样点831可表示在位于与由左参考样点833组成的左侧样点线相同的线上的样点之中的位于所述左侧样点线下方的参考样点。右上参考样点839可表示在位于与由上参考样点837组成的上方样点线相同的线上的样点之中的位于所述上方样点线右侧的参考样点。For example, the left reference sample 833 may represent the reconstructed reference pixel adjacent to the left side of the target block. Upper reference samples 837 may represent reconstructed reference pixels adjacent to the top of the target block. The upper left reference sample 835 may represent the reconstructed reference pixel located at the upper left corner of the target block. The lower left reference sample 831 may represent a reference sample located below the left sample line among samples located on the same line as the left sample line composed of the left reference sample 833 . The upper right reference sample 839 may represent a reference sample located to the right of the upper sample line among samples located on the same line as the upper sample line composed of the upper reference sample 837 .

当目标块的尺寸是N×N时,左下参考样点831、左参考样点833、上参考样点837和右上参考样点839的数量可以均是N。When the size of the target block is N×N, the numbers of the lower left reference samples 831 , the left reference samples 833 , the upper reference samples 837 , and the upper right reference samples 839 may all be N.

通过对目标块执行帧内预测,可生成预测块。生成预测块的过程可包括确定预测块中的像素的值。目标块和预测块的尺寸可以相同。The predicted block may be generated by performing intra prediction on the target block. The process of generating the prediction block may include determining values for pixels in the prediction block. The size of the target block and the prediction block may be the same.

用于对目标块进行帧内预测的参考样点可根据目标块的帧内预测模式而改变。帧内预测模式的方向可表示参考样点与预测块的像素之间的依赖关系。例如,指定参考样点的值可被用作预测块中的一个或更多个指定像素的值。在这种情况下,所述指定参考样点和预测块中的所述一个或更多个指定像素可以是位于沿帧内预测模式的方向的直线上的样点和像素。换句话说,所述指定参考样点的值可被复制作为位于与帧内预测模式的方向相反的方向上的像素的值。可选地,预测块中的像素的值可以是相对于该像素的位置位于帧内预测模式的方向上的参考样点的值。The reference samples used for intra prediction of the target block may vary according to the intra prediction mode of the target block. The direction of the intra prediction mode may represent the dependency between the reference samples and the pixels of the prediction block. For example, a value specifying a reference sample may be used as a value specifying one or more pixels in the prediction block. In this case, the designated reference samples and the one or more designated pixels in the prediction block may be samples and pixels located on a straight line in the direction of the intra prediction mode. In other words, the value of the specified reference sample can be copied as the value of the pixel located in the direction opposite to that of the intra prediction mode. Alternatively, the value of a pixel in the prediction block may be the value of a reference sample located in the direction of the intra prediction mode with respect to the position of the pixel.

在示例中,当目标块的帧内预测模式是模式值为26的垂直模式时,上参考样点837可被用于帧内预测。当帧内预测模式是垂直模式时,预测块中的像素的值可以是垂直地位于该像素的位置上方的参考样点的值。因此,与目标块的顶部相邻的上参考样点837可被用于帧内预测。此外,在预测块的一行中的像素的值可与上参考样点837的像素的值相同。In an example, when the intra prediction mode of the target block is a vertical mode with a mode value of 26, the upper reference sample 837 may be used for intra prediction. When the intra prediction mode is the vertical mode, the value of a pixel in the prediction block may be the value of a reference sample located vertically above the position of the pixel. Therefore, the upper reference sample 837 adjacent to the top of the target block can be used for intra prediction. Furthermore, the values of the pixels in a row of the prediction block may be the same as the values of the pixels of the upper reference sample 837 .

在示例中,当目标块的帧内预测模式是模式值为10的水平模式时,左参考样点833可被用于帧内预测。当帧内预测模式是水平模式时,预测块中的像素的值可以是水平地位于该像素的位置左侧的参考样点的值。因此,与目标块的左侧相邻的左参考样点833可被用于帧内预测。此外,在预测块的一列中的像素的值可与左参考样点833的像素的值相同。In an example, when the intra prediction mode of the target block is a horizontal mode with a mode value of 10, the left reference sample 833 may be used for intra prediction. When the intra prediction mode is the horizontal mode, the value of a pixel in the prediction block may be the value of a reference sample located horizontally to the left of the position of the pixel. Therefore, the left reference sample 833 adjacent to the left side of the target block can be used for intra prediction. Furthermore, the values of the pixels in one column of the prediction block may be the same as the values of the pixels of the left reference sample 833 .

在示例中,当当前块的帧内预测模式的模式值是18时,左参考样点833中的至少一些、左上角参考样点835、以及上参考样点837中的至少一些可被用于帧内预测。当帧内预测模式的模式值是18时,预测块中的像素的值可以是对角地位于该像素的左上角处的参考样点的值。In an example, when the mode value of the intra prediction mode of the current block is 18, at least some of the left reference samples 833, the upper left reference samples 835, and the upper reference samples 837 may be used for Intra prediction. When the mode value of the intra prediction mode is 18, the value of the pixel in the prediction block may be the value of the reference sample diagonally located at the upper left corner of the pixel.

此外,在模式值为27、28、29、30、31、32、33或34的帧内预测模式被使用的情况下,右上参考样点839中的至少一部分可被用于帧内预测。Also, in the case where an intra prediction mode with a mode value of 27, 28, 29, 30, 31, 32, 33 or 34 is used, at least a part of the upper right reference samples 839 may be used for intra prediction.

此外,在模式值为2、3、4、5、6、7、8或9的帧内预测模式被使用的情况下,左下参考样点831中的至少一部分可被用于帧内预测。Also, in the case where an intra prediction mode with a mode value of 2, 3, 4, 5, 6, 7, 8, or 9 is used, at least a part of the lower left reference samples 831 may be used for intra prediction.

此外,在模式值是范围从11至25的值的帧内预测模式的情况下,左上角参考样点835可被用于帧内预测。Also, in the case of an intra prediction mode in which the mode value is a value ranging from 11 to 25, the upper left reference sample 835 may be used for intra prediction.

用于确定预测块中的一个像素的像素值的参考样点的数量可以是1或者2或者更多。The number of reference samples used to determine the pixel value of one pixel in the prediction block may be 1 or 2 or more.

如上所述,可根据像素的位置和由帧内预测模式的方向所指示的参考样点的位置来确定预测块中的像素的像素值。当像素的位置以及由帧内预测模式的方向所指示的参考样点的位置是整数位置时,由整数位置所指示的一个参考样点的值可被用于确定预测块中的像素的像素值。As described above, the pixel value of the pixel in the prediction block may be determined according to the position of the pixel and the position of the reference sample indicated by the direction of the intra prediction mode. When the position of the pixel and the position of the reference sample indicated by the direction of the intra prediction mode are integer positions, the value of one reference sample indicated by the integer position may be used to determine the pixel value of the pixel in the prediction block .

当像素的位置以及由帧内预测模式的方向所指示的参考样点的位置不是整数位置时,可生成基于与该参考样点的位置最接近的两个参考样点的插值参考样点。插值参考样点的值可被用于确定预测块中的像素的像素值。换句话说,当预测块中的像素的位置以及由帧内预测模式的方向所指示的参考样点的位置指示两个参考样点之间的位置时,可生成基于这两个样点的值的插值。When the position of the pixel and the position of the reference sample indicated by the direction of the intra prediction mode are not integer positions, an interpolated reference sample based on the two closest reference samples to the position of the reference sample may be generated. The values of the interpolated reference samples may be used to determine pixel values for pixels in the prediction block. In other words, when the position of the pixel in the prediction block and the position of the reference sample indicated by the direction of the intra prediction mode indicate the position between the two reference samples, a value based on the two samples may be generated interpolation.

经由预测而生成的预测块可以与原始目标块不同。换句话说,可能存在预测误差,该预测误差是目标块与预测块之间的差,并且也可能存在在目标块的像素与预测块的像素之间的预测误差。The prediction block generated via prediction may be different from the original target block. In other words, there may be a prediction error, which is the difference between the target block and the prediction block, and there may also be a prediction error between the pixels of the target block and the pixels of the prediction block.

在下文中,术语“差”、“误差”和“残差”可被用于具有相同的含义,并且可彼此互换使用。In the following, the terms "difference", "error" and "residual" may be used to have the same meaning and may be used interchangeably with each other.

例如,在方向帧内预测的情况下,预测块的像素与参考样点之间的距离越长,则可能发生的预测误差越大。这种预测误差可导致生成的预测块与邻近块之间的不连续性。For example, in the case of directional intra prediction, the longer the distance between the pixels of the prediction block and the reference samples, the larger the prediction error that may occur. Such prediction errors can lead to discontinuities between the generated prediction block and neighboring blocks.

为了减少预测误差,可使用针对预测块的滤波操作。滤波操作可被配置为自适应地将滤波器应用于预测块中的被认为具有较大预测误差的区域。例如,被认为具有较大预测误差的区域可以是预测块的边界。此外,在预测块中被认为具有较大预测误差的区域可根据帧内预测模式而不同,并且滤波器的特性也可根据帧内预测模式而不同。To reduce prediction errors, filtering operations for prediction blocks may be used. The filtering operation may be configured to adaptively apply filters to regions in the prediction block that are considered to have larger prediction errors. For example, a region considered to have a larger prediction error may be the boundary of a prediction block. Also, a region considered to have a large prediction error in a prediction block may differ according to intra prediction modes, and characteristics of filters may also differ according to intra prediction modes.

图9是用于解释帧间预测过程的实施例的示图。FIG. 9 is a diagram for explaining an embodiment of an inter prediction process.

图9中示出的矩形可表示图像(或画面)。此外,在图9中,箭头可表示预测方向。也就是说,可根据预测方向对每个图像进行编码和/或解码。The rectangles shown in FIG. 9 may represent images (or pictures). Furthermore, in FIG. 9, arrows may indicate prediction directions. That is, each picture may be encoded and/or decoded according to the prediction direction.

图像可根据编码类型被分类为帧内画面(I画面)、单预测画面或预测编码画面(P画面)、以及双预测画面或双预测编码画面(B画面)。可根据每个画面的编码类型对每个画面进行编码和/或解码。Pictures may be classified into intra pictures (I pictures), uni-predictive or predictively encoded pictures (P pictures), and bi-predictive or bi-predictive encoded pictures (B pictures) according to encoding types. Each picture may be encoded and/or decoded according to its encoding type.

当作为将被编码的目标的目标图像是I画面时,目标图像可在不进行参照其它图像的帧间预测的情况下使用图像本身包含的数据被编码。例如,I画面可仅经由帧内预测被编码。When a target image to be encoded is an I picture, the target image can be encoded using data contained in the image itself without performing inter prediction with reference to other images. For example, I pictures may be encoded via intra prediction only.

当目标图像是P画面时,可经由使用存在于一个方向上的参考画面的帧间预测对目标图像进行编码。这里,所述一个方向可以是前向方向或后向方向。When the target image is a P picture, the target image may be encoded via inter prediction using reference pictures existing in one direction. Here, the one direction may be a forward direction or a backward direction.

当目标图像是B画面时,可经由使用存在于两个方向上的参考画面的帧间预测对图像进行编码,或者可以经由使用存在于前向方向和后向方向之一上的参考画面的帧间预测对图像进行编码。这里,所述两个方向可以是前向方向和后向方向。When the target image is a B-picture, the image may be encoded via inter-frame prediction using reference pictures existing in both directions, or may be encoded via frames using reference pictures existing in one of the forward and backward directions Inter prediction encodes images. Here, the two directions may be a forward direction and a backward direction.

使用参考画面进行编码和/或解码的P画面和B画面可被视为使用帧间预测的图像。P pictures and B pictures encoded and/or decoded using reference pictures may be regarded as pictures using inter prediction.

下面,将详细地描述根据实施例的在帧间模式下的帧间预测。Hereinafter, the inter prediction in the inter mode according to the embodiment will be described in detail.

可使用运动信息来执行帧间预测。Inter prediction may be performed using motion information.

在帧间模式下,编码设备100可对目标块执行帧间预测和/或运动补偿。解码设备200可对目标块执行与由编码设备100执行的帧间预测和/或运动补偿对应的帧间预测和/或运动补偿。In the inter mode, the encoding apparatus 100 may perform inter prediction and/or motion compensation on the target block. The decoding apparatus 200 may perform inter prediction and/or motion compensation corresponding to the inter prediction and/or motion compensation performed by the encoding apparatus 100 on the target block.

可由编码设备100和解码设备200在帧间预测期间单独地推导目标块的运动信息。可使用重建的邻近块的运动信息、col块的运动信息和/或与col块相邻的块的运动信息来推导运动信息。The motion information of the target block may be derived separately by the encoding apparatus 100 and the decoding apparatus 200 during inter prediction. The motion information may be derived using motion information of reconstructed neighboring blocks, motion information of the col block, and/or motion information of blocks adjacent to the col block.

例如,编码设备100或解码设备200可通过将空间候选和/或时间候选的运动信息用作目标块的运动信息来执行预测和/或运动补偿。目标块可表示PU和/或PU分区。For example, the encoding apparatus 100 or the decoding apparatus 200 may perform prediction and/or motion compensation by using motion information of spatial candidates and/or temporal candidates as motion information of the target block. A target block may represent a PU and/or a PU partition.

空间候选可以是在空间上与目标块相邻的重建块。Spatial candidates may be reconstructed blocks that are spatially adjacent to the target block.

时间候选可以是在先前重建的同位画面(col画面)中的与目标块对应的重建块。A temporal candidate may be a reconstructed block corresponding to the target block in a previously reconstructed co-located picture (col picture).

在帧间预测中,编码设备100和解码设备200可通过利用空间候选和/或时间候选的运动信息来提高编码效率和解码效率。空间候选的运动信息可被称为“空间运动信息”。时间候选的运动信息可被称为“时间运动信息”。In inter prediction, the encoding apparatus 100 and the decoding apparatus 200 may improve encoding efficiency and decoding efficiency by utilizing motion information of spatial candidates and/or temporal candidates. The motion information of the spatial candidates may be referred to as "spatial motion information". The motion information of the temporal candidates may be referred to as "temporal motion information".

下面,空间候选的运动信息可以是包括空间候选的PU的运动信息。时间候选的运动信息可以是包括时间候选的PU的运动信息。候选块的运动信息可以是包括候选块的PU的运动信息。Hereinafter, the motion information of the spatial candidate may be motion information of a PU including the spatial candidate. The motion information of the temporal candidate may be motion information of a PU including the temporal candidate. The motion information of the candidate block may be motion information of a PU including the candidate block.

可使用参考画面执行帧间预测。Inter prediction may be performed using reference pictures.

参考画面可以是在目标画面之前的画面和在目标画面之后的画面中的至少一个。参考画面可以是用于目标块的预测的图像。The reference picture may be at least one of a picture preceding the target picture and a picture following the target picture. The reference picture may be a picture used for prediction of the target block.

在帧间预测中,可利用用于指示参考画面的参考画面索引(或refIdx)、随后将被描述的运动矢量等来指定参考画面中的区域。这里,在参考画面中指定的区域可指示参考块。In inter prediction, a region in a reference picture may be specified using a reference picture index (or refIdx) for indicating a reference picture, a motion vector to be described later, and the like. Here, the area specified in the reference picture may indicate a reference block.

帧间预测可选择参考画面,并且还可以从参考画面选择与目标块对应的参考块。此外,帧间预测可使用所选择的参考块来生成针对目标块的预测块。The inter prediction can select a reference picture, and can also select a reference block corresponding to the target block from the reference picture. Also, inter prediction may use the selected reference block to generate a prediction block for the target block.

可由编码设备100和解码设备200中的每一个在帧间预测期间推导运动信息。Motion information may be derived during inter prediction by each of the encoding apparatus 100 and the decoding apparatus 200 .

空间候选可以是1)存在于目标画面中的2)已经在先前经由编码和/或解码而重建并且3)与目标块相邻或位于目标块的拐角处的块。这里,“位于目标块的拐角处的块”可以是与水平相邻于目标块的邻近块垂直相邻的块,或者是与垂直相邻于目标块的邻近块水平相邻的块。此外,“位于目标块的拐角处的块”可具有与“与目标块的拐角相邻的块”相同的含义。“位于目标块的拐角处的块”的含义可被包括在“与目标块相邻的块”的含义中。Spatial candidates may be 1) blocks that exist in the target picture 2) have been previously reconstructed via encoding and/or decoding and 3) are adjacent to or at the corners of the target block. Here, the "block at the corner of the target block" may be a block that is vertically adjacent to a neighboring block that is horizontally adjacent to the target block, or a block that is horizontally adjacent to a neighboring block that is vertically adjacent to the target block. Also, "a block located at the corner of the target block" may have the same meaning as "a block adjacent to the corner of the target block". The meaning of "a block located at the corner of the target block" may be included in the meaning of "a block adjacent to the target block".

例如,空间候选可以是位于目标块左侧的重建块、位于目标块上方的重建块、位于目标块左下角的重建块、位于目标块右上角的重建块或位于目标块左上角的目标块。For example, a spatial candidate can be a reconstructed block located to the left of the target block, a reconstructed block located above the target block, a reconstructed block located in the lower left corner of the target block, a reconstructed block located in the upper right corner of the target block, or a target block located in the upper left corner of the target block.

编码设备100和解码设备200中的每一个可识别存在于col画面中的在空间上与目标块对应的位置的块。目标块在目标画面中的位置和所识别的块在col画面中的位置可彼此对应。Each of the encoding apparatus 100 and the decoding apparatus 200 may identify a block existing at a position spatially corresponding to the target block in the col picture. The position of the target block in the target picture and the position of the identified block in the col picture may correspond to each other.

编码设备100和解码设备200中的每一个可将存在于针对所识别的块的预定义相关位置处的col块确定为时间候选。所述预定义相关位置可以是存在于所识别的块内部和/或外部的位置。Each of the encoding apparatus 100 and the decoding apparatus 200 may determine a col block that exists at a predefined relevant position for the identified block as a temporal candidate. The predefined relevant locations may be locations that exist inside and/or outside the identified block.

例如,col块可包括第一col块和第二col块。当所识别的块的坐标是(xP,yP)并且所识别的块的尺寸用(nPSW,nPSH)表示时,第一col块可以是位于坐标(xP+nPSW,yP+nPSH)处的块。第二col块可以是位于坐标(xP+(nPSW>>1),yP+(nPSH>>1))处的块。当第一col块不可用时,可选择性地使用第二col块。For example, a col block may include a first col block and a second col block. When the coordinates of the identified block are (xP, yP) and the size of the identified block is denoted by (nPSW, nPSH), the first col block may be the block located at coordinates (xP+nPSW, yP+nPSH). The second col block may be the block located at coordinates (xP+(nPSW>>1), yP+(nPSH>>1)). The second col block is optionally used when the first col block is not available.

可基于col块的运动矢量确定目标块的运动矢量。编码设备100和解码设备200中的每一个可对col块的运动矢量进行缩放。col块的经缩放的运动矢量可被用作目标块的运动矢量。此外,存储在列表中的时间候选的运行信息的运动矢量可以是经缩放的运动矢量。The motion vector of the target block may be determined based on the motion vector of the col block. Each of the encoding apparatus 100 and the decoding apparatus 200 may scale the motion vector of the col block. The scaled motion vector of the col block may be used as the motion vector of the target block. Also, the motion vector of the running information of the temporal candidates stored in the list may be a scaled motion vector.

目标块的运动矢量与col块的运动矢量的比率可与第一距离与第二距离的比率相同。第一距离可以是参考画面与目标块的目标画面之间的距离。第二距离可以是参考画面与col块的col画面之间的距离。The ratio of the motion vector of the target block to the motion vector of the col block may be the same as the ratio of the first distance to the second distance. The first distance may be a distance between the reference picture and the target picture of the target block. The second distance may be a distance between the reference picture and the col picture of the col block.

用于推导运动信息的方案可根据目标块的帧间预测模式而改变。例如,作为被应用于帧间预测的帧间预测模式,可存在高级运动矢量预测因子(AMVP)模式、合并模式、跳过模式、当前画面参考模式等。合并模式也可被称为“运动合并模式”。下面将详细描述各个模式。The scheme for deriving motion information may vary according to the inter prediction mode of the target block. For example, as the inter prediction mode applied to the inter prediction, there may be an Advanced Motion Vector Predictor (AMVP) mode, a merge mode, a skip mode, a current picture reference mode, and the like. Merge mode may also be referred to as "motion merge mode". Each mode will be described in detail below.

1)AMVP模式1) AMVP mode

当使用AMVP模式时,编码设备100可在目标块的邻近区域中搜索相似块。编码设备100可通过使用找到的相似块的运动信息对目标块执行预测来获取预测块。编码设备100可对作为目标块与预测块之间的差的残差块进行编码。When the AMVP mode is used, the encoding apparatus 100 may search for similar blocks in a neighboring area of the target block. The encoding apparatus 100 may acquire a prediction block by performing prediction on a target block using motion information of the found similar blocks. The encoding apparatus 100 may encode the residual block, which is the difference between the target block and the prediction block.

1-1)创建预测运动矢量候选的列表1-1) Create a list of predicted motion vector candidates

当AMVP模式被用作预测模式时,编码设备100和解码设备200中的每一个可使用空间候选的运动矢量、时间候选的运动矢量和零矢量来创建预测运动矢量候选的列表。预测运动矢量候选列表可包括一个或更多个预测运动矢量候选。空间候选的运动矢量、时间候选的运动矢量和零矢量中的至少一个可被确定并被用作预测运动矢量候选。When the AMVP mode is used as the prediction mode, each of the encoding apparatus 100 and the decoding apparatus 200 may create a list of predicted motion vector candidates using motion vectors of spatial candidates, motion vectors of temporal candidates, and zero vectors. The motion vector predictor candidate list may include one or more motion vector predictor candidates. At least one of a motion vector of a spatial candidate, a motion vector of a temporal candidate, and a null vector may be determined and used as a predicted motion vector candidate.

在下文中,术语“预测运动矢量(候选)”和“运动矢量(候选)”可被用于具有相同的含义,并且可彼此互换使用。Hereinafter, the terms "prediction motion vector (candidate)" and "motion vector (candidate)" may be used to have the same meaning, and may be used interchangeably with each other.

在下文中,术语“预测运动矢量候选”和“AMVP候选”可被用于具有相同的含义,并且可彼此互换地使用。Hereinafter, the terms "motion vector predictor candidate" and "AMVP candidate" may be used to have the same meaning, and may be used interchangeably with each other.

在下文中,术语“预测运动矢量候选列表”和“AMVP候选列表”可被用于具有相同的含义,并且可彼此互换使用。Hereinafter, the terms "motion vector predictor candidate list" and "AMVP candidate list" may be used to have the same meaning, and may be used interchangeably with each other.

空间候选可包括重建的空间邻近块。换句话说,重建的邻近块的运动矢量可被称为“空间预测运动矢量候选”。Spatial candidates may include reconstructed spatial neighboring blocks. In other words, the motion vectors of the reconstructed neighboring blocks may be referred to as "spatially predicted motion vector candidates".

时间候选可包括col块和与col块相邻的块。换句话说,col块的运动矢量或与col块相邻的块的运动矢量可被称为“时间预测运动矢量候选”。Temporal candidates may include the col block and blocks adjacent to the col block. In other words, the motion vector of the col block or the motion vector of the block adjacent to the col block may be referred to as a "temporal motion vector predictor candidate".

零矢量可以是(0,0)运动矢量。The zero vector can be the (0,0) motion vector.

预测运动矢量候选可以是用于对运动矢量进行预测的运动矢量预测因子。此外,在编码设备100中,每个预测运动矢量候选可以是用于运动矢量的初始搜索位置。A motion vector predictor candidate may be a motion vector predictor for predicting a motion vector. Also, in the encoding apparatus 100, each predicted motion vector candidate may be an initial search position for a motion vector.

1-2)使用预测运动矢量候选的列表搜索运动矢量1-2) Searching for motion vectors using the list of predicted motion vector candidates

编码设备100可使用预测运动矢量候选的列表在搜索范围内确定将被用于对目标块进行编码的运动矢量。此外,编码设备100可在存在于预测运动矢量候选列表中的预测运动矢量候选之中确定将被用作目标块的预测运动矢量的预测运动矢量候选。The encoding apparatus 100 may determine a motion vector to be used for encoding the target block within the search range using the list of predicted motion vector candidates. Also, the encoding apparatus 100 may determine a motion vector predictor candidate to be used as a motion vector predictor of the target block among the motion vector predictor candidates existing in the motion vector predictor candidate list.

将被用于对目标块进行编码的运动矢量可以是可按最小代价编码的运动矢量。The motion vector to be used for encoding the target block may be a motion vector that can be encoded with minimum cost.

此外,编码设备100可确定是否使用AMVP模式对目标块进行编码。Also, the encoding apparatus 100 may determine whether to encode the target block using the AMVP mode.

1-3)对帧间预测信息的传输1-3) Transmission of inter-frame prediction information

编码设备100可生成包括帧间预测所需的帧间预测信息的比特流。解码设备200可使用比特流的帧间预测信息对目标块执行帧间预测。The encoding apparatus 100 may generate a bitstream including inter prediction information required for inter prediction. The decoding apparatus 200 may perform inter prediction on the target block using the inter prediction information of the bitstream.

帧间预测信息可包含1)指示AMVP模式是否被使用的模式信息、2)预测运动矢量索引、3)运动矢量差(MVD)、4)参考方向和5)参考画面索引。The inter prediction information may include 1) mode information indicating whether the AMVP mode is used, 2) a predicted motion vector index, 3) a motion vector difference (MVD), 4) a reference direction, and 5) a reference picture index.

在下文中,术语“预测运动矢量索引”和“AMVP索引”可被用于具有相同的含义,并且可彼此互换使用。Hereinafter, the terms "predictor motion vector index" and "AMVP index" may be used to have the same meaning, and may be used interchangeably with each other.

此外,帧间预测信息可包含残差信号。Furthermore, the inter prediction information may include residual signals.

当模式信息指示AMVP模式被使用时,解码设备200可通过熵解码从比特流获取预测运动矢量索引、MVD、参考方向和参考画面索引。When the mode information indicates that the AMVP mode is used, the decoding apparatus 200 may acquire the predicted motion vector index, the MVD, the reference direction, and the reference picture index from the bitstream through entropy decoding.

预测运动矢量索引可指示在预测运动矢量候选列表中包括的预测运动矢量候选之中的将被用于对目标块进行预测的预测运动矢量候选。The motion vector predictor index may indicate a motion vector predictor candidate to be used for prediction of the target block among the motion vector predictor candidates included in the motion vector predictor candidate list.

1-4)使用帧间预测信息的AMVP模式下的帧间预测1-4) Inter prediction in AMVP mode using inter prediction information

解码设备200可使用预测运动矢量候选列表来推导预测运动矢量候选,并且可基于推导的预测运动矢量候选来确定目标块的运动信息。The decoding apparatus 200 may derive motion vector predictor candidates using the motion vector predictor candidate list, and may determine motion information of the target block based on the derived motion vector predictor candidates.

解码设备200可使用预测运动矢量索引在预测运动矢量候选列表中包括的预测运动矢量候选之中确定用于目标块的运动矢量候选。解码设备200可从预测运动矢量候选列表中包括的预测运动矢量候选之中选择由预测运动矢量索引所指示的预测运动矢量候选作为目标块的预测运动矢量。The decoding apparatus 200 may determine a motion vector candidate for the target block from among the predicted motion vector candidates included in the predicted motion vector candidate list using the predicted motion vector index. The decoding apparatus 200 may select the motion vector predictor candidate indicated by the motion vector predictor index from among the motion vector predictor candidates included in the motion vector predictor candidate list as the motion vector predictor of the target block.

实际将被用于对目标块进行帧间预测的运动矢量可能不与预测运动矢量匹配。为了指示实际将被用于对目标块进行帧间预测的运动矢量与预测运动矢量之间的差,可使用MVD。编码设备100可推导与实际将被用于对目标块进行帧间预测的运动矢量相似的预测运动矢量,以便使用尽可能小的MVD。The motion vector that will actually be used for inter prediction of the target block may not match the predicted motion vector. In order to indicate the difference between the motion vector that will actually be used for inter-predicting the target block and the predicted motion vector, MVD may be used. The encoding apparatus 100 may derive a predicted motion vector similar to the motion vector actually to be used for inter prediction of the target block, so as to use the MVD as small as possible.

MVD可以是目标块的运动矢量与预测运动矢量之间的差。编码设备100可计算MVD,并可对MVD进行熵编码。The MVD may be the difference between the motion vector of the target block and the predicted motion vector. The encoding apparatus 100 may calculate the MVD, and may perform entropy encoding on the MVD.

可通过比特流将MVD从编码设备100发送到解码设备200。解码设备200可对接收的MVD进行解码。解码设备200可通过对解码的MVD和预测运动矢量进行求和来推导目标块的运动矢量。换句话说,由解码设备200推导出的目标块的运动矢量可以是熵解码的MVD和运动矢量候选之和。The MVD may be transmitted from the encoding apparatus 100 to the decoding apparatus 200 through a bitstream. The decoding apparatus 200 may decode the received MVD. The decoding apparatus 200 may derive the motion vector of the target block by summing the decoded MVD and the predicted motion vector. In other words, the motion vector of the target block derived by the decoding apparatus 200 may be the sum of the entropy decoded MVD and the motion vector candidates.

参考方向可指示将被用于对目标块进行预测的参考画面的列表。例如,参考方向可指示参考画面列表L0和参考画面列表L1中的一个。The reference direction may indicate a list of reference pictures to be used for prediction of the target block. For example, the reference direction may indicate one of the reference picture list L0 and the reference picture list L1.

参考方向仅指示将被用于对目标块进行预测的参考画面列表,并且可不意味着参考画面的方向被限制为前向方向或后向方向。换句话说,参考画面列表L0和参考画面列表L1中的每一个可包括前向方向和/或后向方向上的画面。The reference direction only indicates a list of reference pictures to be used for prediction of the target block, and may not mean that the direction of the reference pictures is limited to the forward direction or the backward direction. In other words, each of the reference picture list L0 and the reference picture list L1 may include pictures in the forward direction and/or the backward direction.

参考方向是单向的可意味着使用单个参考画面列表。参考方向是双向的可意味着使用两个参考画面列表。换句话说,参考方向可指示以下情况之一:仅使用参考画面列表L0的情况、仅使用参考画面列表L1的情况、以及使用两个参考画面列表的情况。The reference direction being unidirectional may mean using a single reference picture list. The reference direction being bidirectional may mean using two reference picture lists. In other words, the reference direction may indicate one of the following cases: a case where only the reference picture list L0 is used, a case where only the reference picture list L1 is used, and a case where two reference picture lists are used.

参考画面索引可指示在参考画面列表中的参考画面之中的将被用于对目标块进行预测的参考画面。可由编码设备100对参考画面索引进行熵编码。经熵编码的参考画面索引可通过比特流由编码设备100用信号传送到解码设备200。The reference picture index may indicate a reference picture to be used for prediction of the target block among the reference pictures in the reference picture list. The reference picture index may be entropy encoded by the encoding apparatus 100 . The entropy-encoded reference picture index may be signaled by the encoding apparatus 100 to the decoding apparatus 200 through a bitstream.

当两个参考画面列表被用于对目标块进行预测时,单个参考画面索引和单个运动矢量可被用于参考画面列表中的每一个。此外,当两个参考画面列表被用于对目标块进行预测时,可为目标块指定两个预测块。例如,可使用针对目标块的两个预测块的平均值或加权和来生成目标块的(最终)预测块。When two reference picture lists are used to predict the target block, a single reference picture index and a single motion vector may be used for each of the reference picture lists. Also, when two reference picture lists are used to predict the target block, two prediction blocks may be designated for the target block. For example, the average or weighted sum of the two predicted blocks for the target block may be used to generate the (final) predicted block of the target block.

可通过预测运动矢量索引、MVD、参考方向和参考画面索引来推导目标块的运动矢量。The motion vector of the target block may be derived through the predicted motion vector index, MVD, reference direction, and reference picture index.

解码设备200可基于推导出的运动矢量和参考画面索引来生成针对目标块的预测块。例如,预测块可以是在由参考画面索引所指示的参考画面中的由推导出的运动矢量所指示的参考块。The decoding apparatus 200 may generate a prediction block for the target block based on the derived motion vector and the reference picture index. For example, the prediction block may be the reference block indicated by the derived motion vector in the reference picture indicated by the reference picture index.

由于预测运动矢量索引和MVD被编码,而目标块的运动矢量自身不被编码,因此从编码设备100发送到解码设备200的比特的数量可减少,并且编码效率可提高。Since the predicted motion vector index and the MVD are encoded while the motion vector of the target block itself is not encoded, the number of bits transmitted from the encoding apparatus 100 to the decoding apparatus 200 can be reduced, and encoding efficiency can be improved.

可将重建的邻近块的运动信息用于目标块。在特定的帧间预测模式中,编码设备100可不单独对目标块的实际运动信息进行编码。目标块的运动信息不被编码,而是可对额外信息进行编码,其中,所述额外信息使得能够使用重建的邻近块的运动信息来推导目标块的运动信息。由于所述额外信息被编码,因此被发送到解码设备200的比特的数量可减少,并且编码效率可提高。The motion information of the reconstructed neighboring blocks may be used for the target block. In a specific inter prediction mode, the encoding apparatus 100 may not separately encode the actual motion information of the target block. The motion information of the target block is not encoded, but additional information may be encoded, wherein the additional information enables the motion information of the target block to be derived using the motion information of the reconstructed neighboring blocks. Since the additional information is encoded, the number of bits transmitted to the decoding apparatus 200 can be reduced, and encoding efficiency can be improved.

例如,作为目标块的运动信息不被直接编码的帧间预测模式,可存在跳过模式和/或合并模式。这里,编码设备100和解码设备200中的每一个可使用指示在重建的邻近单元之中的其运动信息将被用作目标单元的运动信息的单元的指示符和/或索引。For example, as an inter prediction mode in which the motion information of the target block is not directly encoded, there may be a skip mode and/or a merge mode. Here, each of the encoding apparatus 100 and the decoding apparatus 200 may use an indicator and/or an index indicating a unit whose motion information is to be used as the motion information of the target unit among reconstructed neighboring units.

2)合并模式2) Merge mode

作为用于推导目标块的运动信息的方案,存在合并。术语“合并”可意味着对多个块的运动进行合并。“合并”可意味着一个块的运动信息也被应用于其它块。换句话说,合并模式可以是从邻近块的运动信息推导目标块的运动信息的模式。As a scheme for deriving motion information of a target block, there is merging. The term "merging" may mean merging the motions of multiple blocks. "Merge" may mean that the motion information of one block is also applied to other blocks. In other words, the merge mode may be a mode in which the motion information of the target block is derived from the motion information of the neighboring blocks.

当使用合并模式时,编码设备100可使用空间候选的运动信息和/或时间候选的运动信息来预测目标块的运动信息。空间候选可包括在空间上与目标块相邻的重建的空间邻近块。空间邻近块可包括左侧邻近块和上方邻近块。时间候选可包括col块。术语“空间候选”和“空间合并候选”可被用于具有相同的含义,并且可彼此互换使用。术语“时间候选”和“时间合并候选”可被用于具有相同的含义,并且可彼此互换使用。When the merge mode is used, the encoding apparatus 100 may predict the motion information of the target block using the motion information of the spatial candidate and/or the motion information of the temporal candidate. The spatial candidates may include reconstructed spatially neighboring blocks that are spatially adjacent to the target block. The spatial neighboring blocks may include left neighboring blocks and upper neighboring blocks. Temporal candidates may include col blocks. The terms "spatial candidate" and "spatial merging candidate" may be used to have the same meaning and may be used interchangeably with each other. The terms "temporal candidate" and "temporal merge candidate" may be used to have the same meaning and may be used interchangeably with each other.

编码设备100可经由预测来获取预测块。编码设备100可对作为目标块与预测块之间的差的残差块进行编码。The encoding apparatus 100 may obtain a prediction block through prediction. The encoding apparatus 100 may encode the residual block, which is the difference between the target block and the prediction block.

2-1)创建合并候选列表2-1) Create a merge candidate list

当使用合并模式时,编码设备100和解码设备200中的每一个可使用空间候选的运动信息和/或时间候选的运动信息来创建合并候选列表。运动信息可包括1)运动矢量、2)参考画面索引和3)参考方向。参考方向可以是单向或双向。When the merge mode is used, each of the encoding apparatus 100 and the decoding apparatus 200 may create a merge candidate list using motion information of spatial candidates and/or motion information of temporal candidates. The motion information may include 1) a motion vector, 2) a reference picture index, and 3) a reference direction. The reference direction can be unidirectional or bidirectional.

合并候选列表可包括合并候选。合并候选可以是运动信息。换句话说,合并候选列表可以是存储多条运动信息的列表。The merge candidate list may include merge candidates. Merge candidates may be motion information. In other words, the merging candidate list may be a list storing a plurality of pieces of motion information.

合并候选可以是多条时间候选和/或空间候选的运动信息。此外,合并候选列表可包括通过对已存在于合并候选列表中的合并候选进行组合而生成的新的合并候选。换句话说,合并候选列表可包括通过对先前存在于合并候选列表中的多条运动信息进行组合而生成的新的运动信息。The merging candidates may be pieces of motion information of temporal candidates and/or spatial candidates. Also, the merge candidate list may include new merge candidates generated by combining merge candidates already existing in the merge candidate list. In other words, the merge candidate list may include new motion information generated by combining pieces of motion information previously existing in the merge candidate list.

合并候选可以是推导帧间预测信息的特定模式。合并候选可以是指示推导帧间预测信息的特定模式的信息。可根据合并候选所指示的特定模式来推导目标块的帧间预测信息。此外,所述特定模式可包括推导一系列帧间预测信息的处理。这种特定模式可以是帧间预测信息推导模式或运动信息推导模式。A merge candidate may be a specific mode in which inter prediction information is derived. The merge candidate may be information indicating a specific mode of deriving inter prediction information. The inter prediction information of the target block may be derived according to the specific mode indicated by the merge candidate. Furthermore, the specific mode may include a process of deriving a series of inter prediction information. This specific mode may be an inter prediction information derivation mode or a motion information derivation mode.

可根据通过合并索引在合并候选列表中的合并候选中选择的合并候选所指示的模式来推导目标块的帧间预测信息。The inter prediction information of the target block may be derived according to the mode indicated by the merge candidate selected from among the merge candidates in the merge candidate list by the merge index.

例如,合并候选列表中的运动信息推导模式可以是以下模式中的至少一个:1)针对子块单元的运动信息推导模式;2)仿射运动信息推导模式。For example, the motion information derivation mode in the merge candidate list may be at least one of the following modes: 1) a motion information derivation mode for sub-block units; 2) an affine motion information derivation mode.

此外,合并候选列表可包括零矢量的运动信息。零矢量也可被称为“零合并候选”。Also, the merge candidate list may include motion information of the zero vector. A zero vector may also be referred to as a "zero merge candidate".

换句话说,合并候选列表中的多条运动信息可以是以下信息中的至少一个:1)空间候选的运动信息、2)时间候选的运动信息、3)通过对先前存在于合并候选列表中的多条运动信息进行组合而生成的运动信息、以及4)零矢量。In other words, the pieces of motion information in the merge candidate list may be at least one of the following information: 1) motion information of spatial candidates, 2) motion information of temporal candidates, 3) motion information of Motion information generated by combining a plurality of pieces of motion information, and 4) a zero vector.

运动信息可包括1)运动矢量、2)参考画面索引和3)参考方向。参考方向也可被称为“帧间预测指示符”。参考方向可以是单向或双向。单向参考方向可指示L0预测或L1预测。The motion information may include 1) a motion vector, 2) a reference picture index, and 3) a reference direction. The reference direction may also be referred to as an "inter prediction indicator". The reference direction can be unidirectional or bidirectional. A unidirectional reference direction may indicate L0 prediction or L1 prediction.

可在执行合并模式下的预测之前创建合并候选列表。A merge candidate list may be created before prediction in merge mode is performed.

可预先定义合并候选列表中的合并候选的数量。编码设备100和解码设备200中的每一个可根据预定义的方案和预定义的优先级将合并候选添加到合并候选列表,使得合并候选列表具有预定义数量的合并候选。可使用预定义的方案和预定义的优先级将编码设备100的合并候选列表和解码设备200的合并候选列表制作为彼此相同。The number of merge candidates in the merge candidate list may be predefined. Each of the encoding apparatus 100 and the decoding apparatus 200 may add merging candidates to the merging candidate list according to a predefined scheme and a predefined priority so that the merging candidate list has a predefined number of merging candidates. The merging candidate list of the encoding apparatus 100 and the merging candidate list of the decoding apparatus 200 may be made identical to each other using a predefined scheme and a predefined priority.

可基于CU或PU来应用合并。当基于CU或PU执行合并时,编码设备100可将包括预定义的信息的比特流发送到解码设备200。例如,所述预定义的信息可包含1)指示是否针对各个块分区执行合并的信息、以及2)关于在作为针对目标块的空间候选和/或时间候选的块之中的将被执行合并的块的信息。Merging may be applied on a CU or PU basis. When combining is performed on a CU or PU basis, the encoding apparatus 100 may transmit a bitstream including predefined information to the decoding apparatus 200 . For example, the predefined information may include 1) information indicating whether or not to perform merging for each block partition, and 2) information on blocks to be merged among blocks that are spatial candidates and/or temporal candidates for the target block block information.

2-2)使用合并候选列表搜索运动矢量2-2) Search motion vector using merge candidate list

编码设备100可确定将被用于对目标块进行编码的合并候选。例如,编码设备100可使用合并候选列表中的合并候选对目标块执行预测,并且可生成针对合并候选的残差块。编码设备100可使用在预测和残差块的编码中生成最小代价的合并候选来对目标块进行编码。The encoding apparatus 100 may determine merge candidates to be used for encoding the target block. For example, the encoding apparatus 100 may perform prediction on the target block using the merge candidates in the merge candidate list, and may generate a residual block for the merge candidates. The encoding apparatus 100 may encode the target block using a merge candidate that generates the least cost in encoding of prediction and residual blocks.

此外,编码设备100可确定是否使用合并模式对目标块进行编码。Also, the encoding apparatus 100 may determine whether to encode the target block using the merge mode.

2-3)对帧间预测信息的传输2-3) Transmission of inter-frame prediction information

编码设备100可生成包括帧间预测所需的帧间预测信息的比特流。编码设备100可通过对帧间预测信息执行熵编码来生成经熵编码的帧间预测信息,并且可将包括经熵编码的帧间预测信息的比特流发送到解码设备200。经熵编码的帧间预测信息可由编码设备100通过比特流用信号传送到解码设备200。The encoding apparatus 100 may generate a bitstream including inter prediction information required for inter prediction. The encoding apparatus 100 may generate entropy-encoded inter-prediction information by performing entropy encoding on the inter-prediction information, and may transmit a bitstream including the entropy-encoded inter-prediction information to the decoding apparatus 200 . The entropy-encoded inter prediction information may be signaled by the encoding apparatus 100 to the decoding apparatus 200 through a bitstream.

解码设备200可使用比特流的帧间预测信息对目标块执行帧间预测。The decoding apparatus 200 may perform inter prediction on the target block using the inter prediction information of the bitstream.

帧间预测信息可包含1)指示合并模式是否被使用的模式信息和2)合并索引。The inter prediction information may include 1) mode information indicating whether a merge mode is used and 2) a merge index.

此外,帧间预测信息可包含残差信号。Furthermore, the inter prediction information may include residual signals.

解码设备200可仅在模式信息指示合并模式被使用时从比特流获取合并索引。The decoding apparatus 200 may acquire the merge index from the bitstream only when the mode information indicates that the merge mode is used.

模式信息可以是合并标志。模式信息的单位可以是块。关于块的信息可包括模式信息,并且模式信息可指示合并模式是否被应用于块。The mode information may be a merge flag. The unit of mode information may be a block. The information about the block may include mode information, and the mode information may indicate whether the merge mode is applied to the block.

合并索引可指示在合并候选列表中包括的合并候选之中的将被用于对目标块进行预测的合并候选。可选地,合并索引可指示在与目标块在空间上或时间上相邻的邻近块之中的将与目标块合并的块。The merge index may indicate a merge candidate to be used for prediction of the target block among the merge candidates included in the merge candidate list. Alternatively, the merge index may indicate a block to be merged with the target block among neighboring blocks that are spatially or temporally adjacent to the target block.

编码设备100可选择合并候选列表中包括的合并候选中的具有最高编码性能的合并候选,并且将合并索引的值设置为指示所选择的合并候选。The encoding apparatus 100 may select a merge candidate having the highest encoding performance among the merge candidates included in the merge candidate list, and set the value of the merge index to indicate the selected merge candidate.

2-4)使用帧间预测信息的合并模式的帧间预测2-4) Inter prediction of merge mode using inter prediction information

解码设备200可使用在合并候选列表中包括的合并候选之中的由合并索引指示的合并候选对目标块执行预测。The decoding apparatus 200 may perform prediction on the target block using the merging candidate indicated by the merging index among the merging candidates included in the merging candidate list.

可通过由合并索引指示的合并候选的运动矢量、参考画面索引和参考方向来指定目标块的运动矢量。The motion vector of the target block may be specified by the motion vector of the merge candidate indicated by the merge index, the reference picture index, and the reference direction.

3)跳过模式3) Skip mode

跳过模式可以是将空间候选的运动信息或时间候选的运动信息在没有改变的情况下应用于目标块的模式。此外,跳过模式可以是不使用残差信号的模式。换句话说,当使用跳过模式时,重建块可以是预测块。The skip mode may be a mode in which motion information of a spatial candidate or motion information of a temporal candidate is applied to the target block without change. Also, the skip mode may be a mode in which the residual signal is not used. In other words, when the skip mode is used, the reconstructed block may be the predicted block.

合并模式与跳过模式之间的差异在于是否发送或使用残差信号。也就是说,除了不发送或使用残差信号之外,跳过模式可类似于合并模式。The difference between merge mode and skip mode is whether a residual signal is sent or used. That is, skip mode may be similar to merge mode, except that no residual signal is sent or used.

当使用跳过模式时,编码设备100可通过比特流将与在作为空间候选或时间候选的块之中的其运动信息将被用作目标块的运动信息的块有关的信息发送到解码设备200。编码设备100可通过对该信息执行熵编码来生成经熵编码的信息,并且可通过比特流将经熵编码的信息用信号传送到解码设备200。When the skip mode is used, the encoding apparatus 100 may transmit, to the decoding apparatus 200 through a bitstream, information about a block whose motion information is to be used as the motion information of the target block among the blocks that are spatial candidates or temporal candidates . The encoding apparatus 100 may generate entropy-encoded information by performing entropy encoding on the information, and may signal the entropy-encoded information to the decoding apparatus 200 through a bitstream.

此外,当使用跳过模式时,编码设备100可不将其它语法信息(诸如MVD)发送到解码设备200。例如,当使用跳过模式时,编码设备100可不将与MVC、编码块标志和变换系数等级中的至少一个相关的语法元素用信号传送到解码设备200。Also, when the skip mode is used, the encoding apparatus 100 may not transmit other syntax information such as MVD to the decoding apparatus 200 . For example, when the skip mode is used, the encoding apparatus 100 may not signal syntax elements related to at least one of MVC, encoding block flags, and transform coefficient levels to the decoding apparatus 200 .

3-1)创建合并候选列表3-1) Create a merge candidate list

跳过模式也可使用合并候选列表。换句话说,可在合并模式和跳过模式两者中使用合并候选列表。在这方面,合并候选列表也可被称为“跳过候选列表”或“合并/跳过候选列表”。The skip mode can also use a merge candidate list. In other words, the merge candidate list can be used in both merge mode and skip mode. In this regard, the merge candidate list may also be referred to as a "skip candidate list" or a "merge/skip candidate list".

可选地,跳过模式可使用与合并模式的候选列表不同的额外候选列表。在这种情况下,在以下描述中,可分别用跳过候选列表和跳过候选来替换合并候选列表和合并候选。Optionally, skip mode may use an additional candidate list that is different from that of merge mode. In this case, in the following description, the merge candidate list and the merge candidate may be replaced with a skip candidate list and a skip candidate, respectively.

可在执行跳过模式下的预测之前创建合并候选列表。A merge candidate list may be created before prediction in skip mode is performed.

3-2)使用合并候选列表搜索运动矢量3-2) Search motion vector using merge candidate list

编码设备100可确定将被用于对目标块进行编码的合并候选。例如,编码设备100可使用合并候选列表中的合并候选对目标块执行预测。编码设备100可使用在预测中生成最小代价的合并候选对目标块进行编码。The encoding apparatus 100 may determine merge candidates to be used for encoding the target block. For example, the encoding apparatus 100 may perform prediction on the target block using merge candidates in the merge candidate list. The encoding apparatus 100 may encode the target block using a merge candidate that generates the least cost in prediction.

此外,编码设备100可确定是否使用跳过模式对目标块进行编码。Also, the encoding apparatus 100 may determine whether to encode the target block using the skip mode.

3-3)对帧间预测信息的传输3-3) Transmission of inter-frame prediction information

编码设备100可生成包括帧间预测所需的帧间预测信息的比特流。解码设备200可使用比特流的帧间预测信息对目标块执行帧间预测。The encoding apparatus 100 may generate a bitstream including inter prediction information required for inter prediction. The decoding apparatus 200 may perform inter prediction on the target block using the inter prediction information of the bitstream.

帧间预测信息可包括1)指示跳过模式是否被使用的模式信息和2)跳过索引。The inter prediction information may include 1) mode information indicating whether a skip mode is used and 2) a skip index.

跳过索引可与以上描述的合并索引相同。The skip index may be the same as the merge index described above.

当使用跳过模式时,可在不使用残差信号的情况下对目标块进行编码。帧间预测信息可不包含残差信号。可选地,比特流可不包括残差信号。When using skip mode, the target block can be encoded without using a residual signal. The inter prediction information may not include residual signals. Alternatively, the bitstream may not include a residual signal.

解码设备200可仅在模式信息指示跳过模式被使用时从比特流获取跳过索引。如上所述,合并索引和跳过索引可彼此相同。解码设备200可仅在模式信息指示合并模式或跳过模式被使用时从比特流获取跳过索引。The decoding apparatus 200 may acquire the skip index from the bitstream only when the mode information indicates that the skip mode is used. As described above, the merge index and the skip index may be the same as each other. The decoding apparatus 200 may acquire the skip index from the bitstream only when the mode information indicates that the merge mode or the skip mode is used.

跳过索引可指示在合并候选列表中包括的合并候选之中的将被用于对目标块进行预测的合并候选。The skip index may indicate a merging candidate to be used for prediction of the target block among the merging candidates included in the merging candidate list.

3-4)使用帧间预测信息的跳过模式下的帧间预测3-4) Inter prediction in skip mode using inter prediction information

解码设备200可使用在合并候选列表中包括的合并候选之中的由跳过索引指示的合并候选对目标块执行预测。The decoding apparatus 200 may perform prediction on the target block using the merge candidate indicated by the skip index among the merge candidates included in the merge candidate list.

可通过由跳过索引指示的合并候选的运动矢量、参考画面索引和参考方向来指定目标块的运动矢量。The motion vector of the target block may be specified by the motion vector of the merge candidate indicated by the skip index, the reference picture index, and the reference direction.

4)当前画面参考模式4) Current screen reference mode

当前画面参考模式可表示这样的预测模式:该预测模式使用目标块所属的目标画面中的先前重建的区域。The current picture reference mode may represent a prediction mode that uses a previously reconstructed region in the target picture to which the target block belongs.

可使用用于指定先前重建的区域的运动矢量。可使用目标块的参考画面索引来确定目标块是否已在当前画面参考模式下被编码。Motion vectors that specify previously reconstructed regions can be used. The reference picture index of the target block may be used to determine whether the target block has been encoded in the current picture reference mode.

指示目标块是否是以当前画面参考模式被编码的块的标志或索引可由编码设备100用信号传送到解码设备200。可选地,可通过目标块的参考画面索引来推断目标块是否是以当前画面参考模式被编码的块。A flag or index indicating whether the target block is a block encoded in the current picture reference mode may be signaled by the encoding apparatus 100 to the decoding apparatus 200 . Alternatively, whether the target block is a block encoded in the current picture reference mode may be inferred from the reference picture index of the target block.

当目标块以当前画面参考模式被编码时,当前画面可存在于针对目标块的参考画面列表中的固定位置或任意位置。When the target block is encoded in the current picture reference mode, the current picture may exist at a fixed position or an arbitrary position in the reference picture list for the target block.

例如,所述固定位置可以是参考画面索引的值是0的位置或最后位置。For example, the fixed position may be the position where the value of the reference picture index is 0 or the last position.

在目标画面存在于参考画面列表中的任意位置时,指示这样的任意位置的额外参考画面索引可由编码设备100用信号传送到解码设备200。When a target picture exists at an arbitrary position in the reference picture list, an additional reference picture index indicating such an arbitrary position may be signaled by the encoding apparatus 100 to the decoding apparatus 200 .

在以上描述的AMVP模式、合并模式和跳过模式中,可使用列表的索引来指定在列表中的多条运动信息之中的将被用于对目标块进行预测的运动信息。In the AMVP mode, merge mode, and skip mode described above, motion information to be used for prediction of a target block among pieces of motion information in the list may be specified using an index of the list.

为了提高编码效率,编码设备100可仅用信号传送在列表中的元素之中的在对目标块的帧间预测中生成最小代价的元素的索引。编码设备100可对该索引进行编码,并且可用信号传送编码后的索引。In order to improve encoding efficiency, the encoding apparatus 100 may signal only an index of an element that generates a minimum cost in inter prediction on the target block among the elements in the list. The encoding apparatus 100 may encode the index, and may signal the encoded index.

因此,必须能够由编码设备100和解码设备200使用相同的方案基于相同的数据来推导以上描述的列表(即,预测运动矢量候选列表和合并候选列表)。这里,所述相同的数据可包括重建画面和重建块。此外,为了使用索引指定元素,必须固定列表中的元素的顺序。Therefore, it must be possible to derive the above-described lists (ie, the motion vector predictor candidate list and the merge candidate list) based on the same data using the same scheme by the encoding apparatus 100 and the decoding apparatus 200 . Here, the same data may include reconstructed pictures and reconstructed blocks. Also, in order to specify an element using an index, the order of the elements in the list must be fixed.

图10示出根据实施例的空间候选。FIG. 10 shows spatial candidates according to an embodiment.

在图10中,示出了空间候选的位置。In Fig. 10, the positions of the spatial candidates are shown.

在图的中央处的大块可表示目标块。五个小块可表示空间候选。The large block at the center of the graph may represent the target block. Five tiles can represent spatial candidates.

目标块的坐标可以是(xP,yP),目标块的尺寸可用(nPSW,nPSH)来表示。The coordinates of the target block can be (xP, yP), and the size of the target block can be represented by (nPSW, nPSH).

空间候选A0可以是与目标块的左下角相邻的块。A0可以是占有位于坐标(xP-1,yP+nPSH+1)处的像素的块。The spatial candidate A 0 may be a block adjacent to the lower left corner of the target block. A 0 may be a block occupying the pixel located at coordinates (xP-1, yP+nPSH+1).

空间候选A1可以是与目标块的左侧相邻的块。A1可以是在与目标块的左侧相邻的块之中的最下方的块。可选地,A1可以是与A0的顶部相邻的块。A1可以是占有位于坐标(xP-1,yP+nPSH)处的像素的块。The spatial candidate A1 may be a block adjacent to the left of the target block. A 1 may be the lowermost block among the blocks adjacent to the left of the target block. Alternatively, A1 may be the block adjacent to the top of A0 . A1 may be a block occupying the pixel located at coordinates (xP-1, yP+nPSH).

空间候选B0可以是与目标块的右上角相邻的块。B0可以是占有位于坐标(xP+nPSW+1,yP-1)处的像素的块。The spatial candidate B 0 may be a block adjacent to the upper right corner of the target block. B 0 may be a block occupying the pixel located at coordinates (xP+nPSW+1, yP-1).

空间候选B1可以是与目标块的顶部相邻的块。B1可以是在与目标块的顶部相邻的块之中的最右侧的块。可选地,B1可以是与B0的左侧相邻的块。B1可以是占有位于坐标(xP+nPSW,yP-1)处的像素的块。The spatial candidate B1 may be a block adjacent to the top of the target block. B 1 may be the rightmost block among the blocks adjacent to the top of the target block. Alternatively, B1 may be the block adjacent to the left of B0 . B1 may be a block occupying the pixel located at coordinates (xP+nPSW, yP-1).

空间候选B2可以是与目标块的左上角相邻的块。B2可以是占有位于坐标(xP-1,yP-1)处的像素的块。The spatial candidate B2 may be a block adjacent to the upper left corner of the target block. B2 may be a block occupying the pixel located at coordinates (xP-1, yP-1).

对空间候选和时间候选的可用性的确定Determination of availability of spatial and temporal candidates

为了将空间候选的运动信息或时间候选的运动信息包括在列表中,必须确定空间候选的运动信息或时间候选的运动信息是否可用。In order to include the motion information of the spatial candidate or the motion information of the temporal candidate in the list, it must be determined whether the motion information of the spatial candidate or the motion information of the temporal candidate is available.

在下文中,候选块可包括空间候选和时间候选。Hereinafter, candidate blocks may include spatial candidates and temporal candidates.

例如,可通过顺序地应用以下步骤1)至步骤4)来执行所述确定。For example, the determination may be performed by sequentially applying the following steps 1) to 4).

步骤1)当包括候选块的PU位于画面的边界外时,候选块的可用性可被设置为“假”。表述“可用性被设置为假”可具有与“设置为不可用”相同的含义。Step 1) When the PU including the candidate block is located outside the boundary of the picture, the availability of the candidate block may be set to "false". The expression "availability is set to false" may have the same meaning as "set to unavailable".

步骤2)当包括候选块的PU位于条带的边界外时,候选块的可用性可被设置为“假”。当目标块和候选块位于不同条带中时,候选块的可用性可被设置为“假”。Step 2) When the PU including the candidate block is located outside the boundary of the slice, the availability of the candidate block may be set to "false". When the target block and the candidate block are located in different slices, the availability of the candidate block may be set to "false".

步骤3)当包括候选块的PU位于并行块的边界外时,候选块的可用性可被设置为“假”。当目标块和候选块位于不同并行块中时,候选块的可用性可被设置为“假”。Step 3) When the PU including the candidate block is located outside the boundary of the parallel block, the availability of the candidate block may be set to "false". When the target block and the candidate block are located in different parallel blocks, the availability of the candidate block may be set to "false".

步骤4)当包括候选块的PU的预测模式是帧内预测模式时,候选块的可用性可被设置为“假”。当包括候选块的PU不使用帧间预测时,候选块的可用性可被设置为“假”。Step 4) When the prediction mode of the PU including the candidate block is the intra prediction mode, the availability of the candidate block may be set to "false". When the PU including the candidate block does not use inter prediction, the availability of the candidate block may be set to "false".

图11示出根据实施例的将空间候选的运动信息添加到合并列表的顺序。FIG. 11 illustrates an order of adding motion information of spatial candidates to a merge list according to an embodiment.

如图11中所示,当空间候选的多条运动信息被添加到合并列表时,可使用A1、B1、B0、A0和B2的顺序。也就是说,可按照A1、B1、B0、A0和B2的顺序将可用空间候选的多条运动信息添加到合并列表。As shown in FIG. 11 , when pieces of motion information of spatial candidates are added to the merge list, the order of A 1 , B 1 , B 0 , A 0 , and B 2 may be used. That is, pieces of motion information of available space candidates may be added to the merge list in the order of A 1 , B 1 , B 0 , A 0 , and B 2 .

用于在合并模式和跳过模式下推导合并列表的方法Methods for deriving merged lists in merge mode and skip mode

如上所述,可设置合并列表中的合并候选的最大数量。可用“N”来指示设置的最大数量。设置的数量可从编码设备100发送到解码设备200。条带的条带头可包括N。换句话说,可通过条带头来设置用于条带的目标块的合并列表中的合并候选的最大数量。例如,N的值基本上可以是5。As described above, the maximum number of merge candidates in the merge list can be set. "N" can be used to indicate the maximum number of settings. The set number may be transmitted from the encoding apparatus 100 to the decoding apparatus 200 . The strip header of the strip may include N. In other words, the maximum number of merge candidates in the merge list for the target block of the slice can be set by the slice header. For example, the value of N can be substantially 5.

可按照以下步骤1)至4)的顺序将多条运动信息(即,合并候选)添加到合并列表。A plurality of pieces of motion information (ie, merge candidates) may be added to the merge list in the order of steps 1) to 4) below.

步骤1)在空间候选之中,可将可用空间候选添加到合并列表。可按照图10中示出的顺序来将可用空间候选的多条运动信息添加到合并列表。这里,当可用空间候选的运动信息与合并列表中已经存在的其它运动信息重叠时,可不将可用空间候选的运动信息添加到合并列表。检查对应运动信息是否与列表中存在的其它运动信息重叠的操作可被简称为“重叠检查”。 Step 1) Among the space candidates, the available space candidates may be added to the merged list. The pieces of motion information of the available space candidates may be added to the merge list in the order shown in FIG. 10 . Here, when the motion information of the available space candidates overlaps with other motion information already existing in the merge list, the motion information of the available space candidates may not be added to the merge list. The operation of checking whether the corresponding motion information overlaps with other motion information existing in the list may be simply referred to as "overlap checking".

被添加的运动信息的最大条数可以是N。The maximum number of pieces of motion information to be added may be N.

步骤2)当合并列表中的运动信息的条数小于N并且时间候选可用时,可将时间候选的运动信息添加到合并列表。这里,当可用时间候选的运动信息与合并列表中已经存在的其它运动信息重叠时,可不将可用时间候选的运动信息添加到合并列表。 Step 2) When the number of pieces of motion information in the merge list is less than N and the temporal candidates are available, the motion information of the temporal candidates can be added to the merge list. Here, when the motion information of the available temporal candidates overlaps with other motion information already existing in the merge list, the motion information of the available temporal candidates may not be added to the merge list.

步骤3)当合并列表中的运动信息的条数小于N并且目标条带的类型是“B”时,可将通过组合双向预测(双预测)而生成的组合运动信息添加到合并列表。 Step 3) When the number of pieces of motion information in the merge list is less than N and the type of the target slice is "B", the combined motion information generated by combining bidirectional prediction (bi-prediction) may be added to the merge list.

目标条带可以是包括目标块的条带。The target slice may be a slice including target blocks.

组合运动信息可以是L0运动信息和L1运动信息的组合。L0运动信息可以是仅参照参考画面列表L0的运动信息。L1运动信息可以是仅参照参考画面列表L1的运动信息。The combined motion information may be a combination of L0 motion information and L1 motion information. The L0 motion information may be motion information referring only to the reference picture list L0. The L1 motion information may be motion information referring only to the reference picture list L1.

在合并列表中,可存在一条或更多条L0运动信息。此外,在合并列表中,可存在一条或更多条L1运动信息。In the merged list, there may be one or more pieces of L0 motion information. Also, in the merged list, there may be one or more pieces of L1 motion information.

组合运动信息可包括一条或更多条组合运动信息。当生成组合运动信息时,可预先定义所述一条或更多条L0运动信息和所述一条或更多条L1运动信息之中的将被用于生成组合运动信息的步骤的L0运动信息和L1运动信息。可经由使用合并列表中的一对不同运动信息的组合双向预测按照预定义的顺序来生成一条或更多条组合运动信息。所述一对不同运动信息中的一条运动信息可以是L0运动信息,并且所述一对不同运动信息中的另一条运动信息可以是L1运动信息。The combined motion information may include one or more pieces of combined motion information. When generating combined motion information, L0 motion information and L1 to be used in the step of generating combined motion information among the one or more pieces of L0 motion information and the one or more pieces of L1 motion information may be predefined sports information. One or more pieces of combined motion information may be generated in a predefined order via combined bi-directional prediction using a pair of different motion information in the merged list. One piece of motion information in the pair of different motion information may be L0 motion information, and the other piece of motion information in the pair of different motion information may be L1 motion information.

例如,被添加有最高优先级的组合运动信息可以是具有合并索引0的L0运动信息和具有合并索引1的L1运动信息的组合。当具有合并索引0的运动信息不是L0运动信息时或者当具有合并索引1的运动信息不是L1运动信息时,可既不生成也不添加组合运动信息。接下来,被添加有下一优先级的组合运动信息可以是具有合并索引1的L0运动信息和具有合并索引0的L1运动信息的组合。随后的详细组合可符合视频编码/解码领域的其它组合。For example, the combined motion information added with the highest priority may be a combination of L0 motion information with merge index 0 and L1 motion information with merge index 1. When motion information with merge index 0 is not L0 motion information or when motion information with merge index 1 is not L1 motion information, combined motion information may neither be generated nor added. Next, the combined motion information added with the next priority may be a combination of L0 motion information with merge index 1 and L1 motion information with merge index 0. The detailed combinations that follow may conform to other combinations in the field of video encoding/decoding.

这里,当组合运动信息与合并列表中已经存在的其它运动信息重叠时,可不将组合运动信息添加到合并列表。Here, when the combined motion information overlaps with other motion information already existing in the merge list, the combined motion information may not be added to the merge list.

步骤4)当合并列表中的运动信息的条数小于N时,可将零矢量的运动信息添加到合并列表。 Step 4) When the number of pieces of motion information in the merge list is less than N, the motion information of the zero vector can be added to the merge list.

零矢量运动信息可以是运动矢量是零矢量的运动信息。The zero vector motion information may be motion information in which the motion vector is a zero vector.

零矢量运动信息的条数可以是一个或更多个。一条或更多条零矢量运动信息的参考画面索引可彼此不同。例如,第一零矢量运动信息的参考画面索引的值可以是0。第二零矢量运动信息的参考画面索引的值可以是1。The number of pieces of zero vector motion information may be one or more. The reference picture indices of one or more pieces of zero vector motion information may be different from each other. For example, the value of the reference picture index of the first zero vector motion information may be 0. The value of the reference picture index of the second zero vector motion information may be 1.

零矢量运动信息的条数可与参考画面列表中的参考画面的数量相同。The number of pieces of zero vector motion information may be the same as the number of reference pictures in the reference picture list.

零矢量运动信息的参考方向可以是双向的。两个运动矢量可以是零矢量。零矢量运动信息的条数可以是参考画面列表L0中的参考画面的数量和参考画面列表L1中的参考画面的数量中较小的一个。可选地,当参考画面列表L0中的参考画面的数量和参考画面列表L1中的参考画面的数量彼此不同时,作为单向的参考方向可被用于可仅应用于单个参考画面列表的参考画面索引。The reference direction of the zero vector motion information can be bidirectional. The two motion vectors can be zero vectors. The number of pieces of zero vector motion information may be the smaller one of the number of reference pictures in the reference picture list L0 and the number of reference pictures in the reference picture list L1. Alternatively, when the number of reference pictures in the reference picture list L0 and the number of reference pictures in the reference picture list L1 are different from each other, a reference direction that is a unidirectional can be used for a reference that can only be applied to a single reference picture list screen index.

编码设备100和/或解码设备200可随后将零矢量运动信息添加到合并列表,同时改变参考画面索引。The encoding apparatus 100 and/or the decoding apparatus 200 may then add the zero vector motion information to the merge list while changing the reference picture index.

当零矢量运动信息与合并列表中已经存在的其它运动信息重叠时,可不将零矢量运动信息添加到合并列表。When the zero vector motion information overlaps with other motion information already present in the merge list, the zero vector motion information may not be added to the merge list.

上述步骤1)至步骤4)的顺序仅是示例性的,并且可被改变。此外,可根据预定义的条件省略以上步骤中的一些步骤。The order of the above steps 1) to 4) is only exemplary and may be changed. Furthermore, some of the above steps may be omitted according to predefined conditions.

用于在AMVP模式下推导预测运动矢量候选列表的方法Method for deriving motion vector predictor candidate list in AMVP mode

可预先定义在预测运动矢量候选列表中的预测运动矢量候选的最大数量。可用N来指示预定义的最大数量。例如,预定义的最大数量可以是2。The maximum number of motion vector predictor candidates in the motion vector predictor candidate list may be predefined. N can be used to indicate a predefined maximum number. For example, the predefined maximum number can be 2.

可按照以下的步骤1)至步骤3)的顺序将多条运动信息(即,预测运动矢量候选)添加到预测运动矢量候选列表。A plurality of pieces of motion information (ie, motion vector predictor candidates) may be added to the motion vector predictor candidate list in the order of step 1) to step 3) below.

步骤1)可将空间候选之中的可用空间候选添加到预测运动矢量候选列表。空间候选可包括第一空间候选和第二空间候选。 Step 1) The available spatial candidates among the spatial candidates may be added to the predicted motion vector candidate list. The spatial candidates may include a first spatial candidate and a second spatial candidate.

第一空间候选可以是A0、A1、经缩放的A0和经缩放的A1中的一个。第二空间候选可以是B0、B1、B2、经缩放的B0、经缩放的B1和经缩放的B2中的一个。The first spatial candidate may be one of A 0 , A 1 , scaled A 0 , and scaled A 1 . The second spatial candidate may be one of B 0 , B 1 , B 2 , scaled B 0 , scaled B 1 , and scaled B 2 .

可按照第一空间候选和第二空间候选的顺序将可用空间候选的多条运动信息添加到预测运动矢量候选列表。在这种情况下,当可用空间候选的运动信息与预测运动矢量候选列表中已经存在的其它运动信息重叠时,可不将可用空间候选的运动信息添加到预测运动矢量候选列表。换句话说,当N的值是2时,如果第二空间候选的运动信息与第一空间候选的运动信息相同,则可不将第二空间候选的运动信息添加到预测运动矢量候选列表。The pieces of motion information of the available spatial candidates may be added to the motion vector predictor candidate list in the order of the first spatial candidate and the second spatial candidate. In this case, when the motion information of the available space candidates overlaps with other motion information already existing in the motion vector predictor candidate list, the motion information of the available space candidates may not be added to the motion vector predictor candidate list. In other words, when the value of N is 2, if the motion information of the second spatial candidate is the same as that of the first spatial candidate, the motion information of the second spatial candidate may not be added to the motion vector predictor candidate list.

被添加的运动信息的最大条数可以是N。The maximum number of pieces of motion information to be added may be N.

步骤2)当预测运动矢量候选列表中的运动信息的条数小于N并且时间候选可用时,可将时间候选的运动信息添加到预测运动矢量候选列表。在这种情况下,当可用时间候选的运动信息与预测运动矢量候选列表中已经存在的其它运动信息重叠时,可不将可用时间候选的运动信息添加到预测运动矢量候选列表。 Step 2) When the number of pieces of motion information in the motion vector predictor candidate list is less than N and the temporal candidates are available, the motion information of the temporal candidates may be added to the motion vector predictor candidate list. In this case, when the motion information of the available temporal candidates overlaps with other motion information already existing in the motion vector predictor candidate list, the motion information of the available temporal candidates may not be added to the motion vector predictor candidate list.

步骤3)当预测运动矢量候选列表中的运动信息的条数小于N时,可将零矢量运动信息添加到预测运动矢量候选列表。 Step 3) When the number of pieces of motion information in the motion vector predictor candidate list is less than N, zero vector motion information may be added to the motion vector predictor candidate list.

零矢量运动信息可包括一条或更多条零矢量运动信息。所述一条或更多条零矢量运动信息的参考画面索引可彼此不同。The zero vector motion information may include one or more pieces of zero vector motion information. The reference picture indices of the one or more pieces of zero vector motion information may be different from each other.

编码设备100和/或解码设备200可顺序地将多条零矢量运动信息添加到预测运动矢量候选列表,同时改变参考画面索引。The encoding apparatus 100 and/or the decoding apparatus 200 may sequentially add pieces of zero vector motion information to the predicted motion vector candidate list while changing the reference picture index.

当零矢量运动信息与预测运动矢量候选列表中已经存在的其它运动信息重叠时,可不将零矢量运动信息添加到预测运动矢量候选列表。When the zero vector motion information overlaps with other motion information already present in the motion vector predictor candidate list, the zero vector motion information may not be added to the motion vector predictor candidate list.

以上结合合并列表做出的对零矢量运动信息的描述也可应用于零矢量运动信息。将省略其重复的描述。The description of the zero vector motion information made above in connection with the merge list can also be applied to the zero vector motion information. Duplicate descriptions thereof will be omitted.

以上描述的步骤1)至步骤3)的顺序仅是示例性的,并且可被改变。此外,可根据预定义的条件省略所述步骤中的一些步骤。The order of step 1) to step 3) described above is only exemplary and may be changed. Furthermore, some of the steps may be omitted according to predefined conditions.

图12示出根据示例的变换和量化处理。Figure 12 shows transform and quantization processing according to an example.

如图12中所示,可通过对残差信号执行变换和/或量化处理来产生量化的等级。As shown in FIG. 12, the quantized levels may be generated by performing transform and/or quantization processing on the residual signal.

残差信号可以被生成为原始块与预测块之间的差。这里,预测块可以是经由帧内预测或帧间预测生成的块。The residual signal may be generated as the difference between the original block and the predicted block. Here, the prediction block may be a block generated via intra prediction or inter prediction.

残差信号可以通过作为量化过程的一部分的变换过程被变换为频域中的信号。The residual signal can be transformed into a signal in the frequency domain by a transformation process as part of the quantization process.

用于变换的变换核可包含各种DCT核,例如离散余弦变换(DCT)类型2(DCT-II)及离散正弦变换(DST)核。Transform kernels for transforms may include various DCT kernels, such as discrete cosine transform (DCT) type 2 (DCT-II) and discrete sine transform (DST) kernels.

这些变换核可对残差信号执行可分离变换或二维(2D)不可分离变换。可分离变换可为指示在水平方向及垂直方向中的每一者上对残差信号执行一维(1D)变换的变换。These transform kernels can perform a separable transform or a two-dimensional (2D) non-separable transform on the residual signal. A separable transform may be a transform that indicates that a one-dimensional (1D) transform is performed on the residual signal in each of the horizontal and vertical directions.

自适应地用于1D变换的DCT类型和DST类型除了DCT-II之外还可以包括DCT-V、DCT-VIII、DST-I和DST-VII,如下表3所示。The DCT types and DST types adaptively used for the 1D transform may include DCT-V, DCT-VIII, DST-I, and DST-VII in addition to DCT-II, as shown in Table 3 below.

表3table 3

Figure BDA0002623340100000531
Figure BDA0002623340100000531

Figure BDA0002623340100000541
Figure BDA0002623340100000541

如表3所示,当推导出将被用于变换的DCT类型或DST类型时,可以使用变换集。每个变换集可包括多个变换候选。每个变换候选可为DCT类型或DST类型。As shown in Table 3, a transform set can be used when deriving the DCT type or DST type to be used for the transform. Each transform set may include multiple transform candidates. Each transform candidate may be of DCT type or DST type.

下表4示出依据帧内预测模式而被应用于水平方向的变换集的示例。Table 4 below shows an example of a transform set applied to the horizontal direction according to the intra prediction mode.

表4Table 4

Figure BDA0002623340100000542
Figure BDA0002623340100000542

Figure BDA0002623340100000551
Figure BDA0002623340100000551

在表4中,依据目标块的帧内预测模式来指示将被应用于残差信号的水平方向的每个变换集的编号。In Table 4, the number of each transform set to be applied to the horizontal direction of the residual signal is indicated according to the intra prediction mode of the target block.

下表5示出依据帧内预测模式而被应用于残差信号的垂直方向的变换集的示例。Table 5 below shows an example of a transform set applied to the vertical direction of the residual signal according to the intra prediction mode.

表5table 5

Figure BDA0002623340100000552
Figure BDA0002623340100000552

Figure BDA0002623340100000561
Figure BDA0002623340100000561

如表4和表5中所例示,可根据目标块的帧内预测模式来预先定义将被应用于水平方向和垂直方向的变换集。编码设备100可使用包括在与目标块的帧内预测模式对应的变换集中的变换来对残差信号执行变换和逆变换。此外,解码设备200可使用包括在与目标块的帧内预测模式对应的变换集中的变换来对残差信号执行逆变换。As exemplified in Tables 4 and 5, transform sets to be applied to the horizontal and vertical directions may be predefined according to the intra prediction mode of the target block. The encoding apparatus 100 may perform transform and inverse transform on the residual signal using the transform included in the transform set corresponding to the intra prediction mode of the target block. Also, the decoding apparatus 200 may perform inverse transform on the residual signal using the transform included in the transform set corresponding to the intra prediction mode of the target block.

在变换和逆变换中,如表3、表4和表5中所例示,将被应用于残差信号的变换集可被确定,并且可不被用信号发送。可将变换指示信息从编码设备100用信号发送到解码设备200。变换指示信息可以是指示变换集中包括的将被应用于残差信号的多个变换候选中的哪一个被使用的信息。In transform and inverse transform, as exemplified in Table 3, Table 4, and Table 5, the set of transforms to be applied to the residual signal may be determined, and may not be signaled. The transformation indication information may be signaled from the encoding apparatus 100 to the decoding apparatus 200 . The transform indication information may be information indicating which of a plurality of transform candidates included in the transform set to be applied to the residual signal is used.

如上所述,使用各种变换的方法可被应用于经由帧内预测或帧间预测生成的残差信号。As described above, methods using various transforms may be applied to residual signals generated via intra prediction or inter prediction.

变换可包括首次变换和二次变换中的至少一个。可通过对残差信号执行首次变换来生成变换系数,并且可通过对变换系数执行二次变换来生成二次变换系数。The transformation may include at least one of a first transformation and a second transformation. The transform coefficients may be generated by performing a first transform on the residual signal, and the second transform coefficients may be generated by performing a second transform on the transform coefficients.

首次变换可被称为“初次变换”。此外,首次变换也可被称为“自适应多变换(AMT)方案”。如上所述,AMT可表示将不同的变换应用于各个1D方向(即,垂直方向和/或水平方向)或者选择的方向。The first transformation may be referred to as the "first transformation." Furthermore, the first transform may also be referred to as an "adaptive multi-transform (AMT) scheme". As mentioned above, AMT may represent the application of different transforms to each 1D direction (ie, vertical and/or horizontal) or selected directions.

可选地,AMT可被称为多变换选择(MTS)或扩展多变换(EMT)。Alternatively, AMT may be referred to as Multiple Transform Selection (MTS) or Extended Multiple Transform (EMT).

二次变换可以是用于提高由首次变换生成的变换系数的能量集中度的变换。类似于首次变换,二次变换可以是可分离变换或不可分离变换。这样的不可分离变换可以是不可分离二次变换(NSST)。The secondary transform may be a transform for increasing the energy concentration of the transform coefficients generated by the first transform. Similar to the first transform, the second transform can be a separable transform or a non-separable transform. Such a non-separable transform may be a non-separable quadratic transform (NSST).

可使用预定义的多种变换方法中的至少一种来执行首次变换。例如,所述预定义的多种变换方法可包括离散余弦变换(DCT)、离散正弦变换(DST)、Karhunen-Loeve变换(KLT)等。The first transformation may be performed using at least one of a number of predefined transformation methods. For example, the predefined multiple transform methods may include discrete cosine transform (DCT), discrete sine transform (DST), Karhunen-Loeve transform (KLT), and the like.

此外,根据定义DCT或DST的核函数,首次变换可以是具有各种类型的变换。Furthermore, the first transformation can be of various types according to the kernel function that defines the DCT or DST.

例如,根据下表6中呈现的变换核,首次变换可包括诸如DCT-2、DCT-5、DST-7、DCT-7、DST-8、DST-1和DCT-8的变换。在下表6中,例示了用于多变换选择(MTS)的各种变换类型和变换核函数。For example, according to the transform kernel presented in Table 6 below, the first transform may include transforms such as DCT-2, DCT-5, DST-7, DCT-7, DST-8, DST-1, and DCT-8. In Table 6 below, various transform types and transform kernels for multiple transform selection (MTS) are illustrated.

MTS可指对一个或更多个DCT和/或DST核的组合的选择,以便在水平和/或垂直方向上对残差信号进行变换。MTS may refer to the selection of a combination of one or more DCT and/or DST cores to transform the residual signal in horizontal and/or vertical directions.

表6Table 6

Figure BDA0002623340100000571
Figure BDA0002623340100000571

Figure BDA0002623340100000581
Figure BDA0002623340100000581

在表6中,i和j可以是等于或大于0并且小于或等于N-1的整数值。In Table 6, i and j may be integer values equal to or greater than 0 and less than or equal to N-1.

可对通过执行首次变换而生成的变换系数执行二次变换。The second transform may be performed on the transform coefficients generated by performing the first transform.

首次变换和/或二次变换可被应用于与亮度(luma)分量和色度(chroma)分量中的一个或更多个对应的信号分量。可根据针对目标块和/或邻近块的编码参数中的至少一个来确定是否应用首次变换和/或二次变换。例如,可根据目标块的尺寸和/或形状来确定是否应用首次变换和/或二次变换。The first transform and/or the second transform may be applied to signal components corresponding to one or more of a luma (luma) component and a chroma (chroma) component. Whether to apply the primary transform and/or the secondary transform may be determined according to at least one of encoding parameters for the target block and/or neighboring blocks. For example, whether to apply the primary transform and/or the secondary transform may be determined according to the size and/or shape of the target block.

可根据针对目标块和/或邻近块的编码参数中的至少一个来确定将被应用于首次变换和/或二次变换的变换方法。所确定的变换方法还可指示首次变换和/或二次变换不被使用。The transform method to be applied to the first transform and/or the second transform may be determined according to at least one of encoding parameters for the target block and/or neighboring blocks. The determined transform method may also indicate that the first transform and/or the second transform is not used.

可选地,指示变换方法的变换信息可从编码设备100被用信号发送到解码设备200。例如,变换信息可包括将被用于首次变换和/或二次变换的变换的索引。Alternatively, transform information indicating the transform method may be signaled from the encoding apparatus 100 to the decoding apparatus 200 . For example, the transform information may include indices of transforms to be used for the first transform and/or the second transform.

可通过对通过执行初次变换和/或二次变换而生成的结果执行量化或者对残差信号执行量化来生成量化的变换系数(即,量化的等级)。The quantized transform coefficients (ie, the levels of quantization) may be generated by performing quantization on the result generated by performing the primary transform and/or the secondary transform or by performing quantization on the residual signal.

图13示出根据示例的对角线扫描。Figure 13 shows a diagonal scan according to an example.

图14示出根据示例的水平扫描。FIG. 14 shows horizontal scanning according to an example.

图15示出根据示例的垂直扫描。FIG. 15 shows vertical scanning according to an example.

可根据帧内预测模式、块尺寸和块形状中的至少一个,经由(右上)对角线扫描、垂直扫描和水平扫描中的至少一个对量化的变换系数进行扫描。所述块可以是变换单元(TU)。The quantized transform coefficients may be scanned via (top right) at least one of diagonal scanning, vertical scanning, and horizontal scanning according to at least one of intra prediction mode, block size, and block shape. The block may be a transform unit (TU).

每个扫描可在特定的开始点处启动,并且可在特定的结束点处终止。Each scan can start at a specific start point and can end at a specific end point.

例如,可通过使用图13的对角线扫描对块的系数进行扫描来将量化的变换系数改变为1D矢量形式。可选地,可根据块的尺寸和/或帧内预测模式使用图14的水平扫描或图15的垂直扫描,而不使用对角线扫描。For example, the quantized transform coefficients may be changed into 1D vector form by scanning the coefficients of the block using the diagonal scanning of FIG. 13 . Alternatively, the horizontal scan of FIG. 14 or the vertical scan of FIG. 15 may be used instead of the diagonal scan according to the size of the block and/or the intra prediction mode.

垂直扫描可以是在列方向上扫描2D块类型系数的操作。水平扫描可以是在行方向上扫描2D块类型系数的操作。Vertical scanning may be an operation of scanning 2D block type coefficients in the column direction. Horizontal scanning may be an operation of scanning 2D block type coefficients in the row direction.

换句话说,可根据块的尺寸和/或帧间预测模式来确定对角线扫描、垂直扫描和水平扫描中的哪一个将被使用。In other words, which of the diagonal scan, the vertical scan and the horizontal scan will be used may be determined according to the size of the block and/or the inter prediction mode.

如图13、图14和图15中所示,可沿着对角线方向、水平方向或垂直方向扫描量化的变换系数。As shown in FIGS. 13 , 14 and 15 , the quantized transform coefficients may be scanned in a diagonal direction, a horizontal direction, or a vertical direction.

量化的变换系数可由块形状表示。每个块可包括多个子块。可根据最小块尺寸或最小块形状来定义每个子块。The quantized transform coefficients may be represented by a block shape. Each block may include multiple sub-blocks. Each sub-block may be defined according to a minimum block size or minimum block shape.

在扫描中,根据扫描的类型或方向的扫描顺序可被首先应用于子块。此外,根据扫描的方向的扫描顺序可被应用于每个子块中的量化的变换系数。In scanning, a scanning order according to the type or direction of scanning may be applied to the sub-blocks first. Also, a scanning order according to the direction of scanning may be applied to the quantized transform coefficients in each sub-block.

例如,如图13、图14和图15中所示,当目标块的尺寸是8×8时,可通过对目标块的残差信号的初次变换、二次变换和量化来生成量化的变换系数。因此,可将三种类型的扫描顺序中的一种应用于四个4×4的子块,并且还可根据所述扫描顺序针对每个4×4的子块扫描量化的变换系数。For example, as shown in FIGS. 13 , 14 and 15 , when the size of the target block is 8×8, quantized transform coefficients may be generated by primary transform, secondary transform, and quantization of the residual signal of the target block . Thus, one of three types of scan orders can be applied to the four 4x4 sub-blocks, and the quantized transform coefficients can also be scanned for each 4x4 sub-block according to the scan order.

经过扫描的量化的变换系数可被熵编码,并且比特流可包括经过熵编码的量化的变换系数。The scanned quantized transform coefficients may be entropy encoded, and the bitstream may include the entropy encoded quantized transform coefficients.

解码设备200可经由对比特流进行熵解码来生成量化的变换系数。可经由逆扫描按照2D块的形式对量化的变换系数进行排列。这里,作为逆扫描的方法,可执行右上对角线扫描、垂直扫描和水平扫描中的至少一个。The decoding apparatus 200 may generate quantized transform coefficients via entropy decoding of the bitstream. The quantized transform coefficients may be arranged in 2D blocks via inverse scanning. Here, as a method of inverse scanning, at least one of upper-right diagonal scanning, vertical scanning, and horizontal scanning may be performed.

可对量化的变换系数执行反量化。可根据是否执行二次逆变换,对通过执行反量化而生成的结果执行二次逆变换。此外,可根据是否将执行首次逆变换,对通过执行二次逆变换而生成的结果执行首次逆变换。可通过对经由执行二次逆变换而生成的结果执行首次逆变换来生成重建的残差信号。Inverse quantization may be performed on the quantized transform coefficients. The result generated by performing inverse quantization may be subjected to inverse quadratic transformation according to whether or not inverse quadratic transformation is performed. Also, the first inverse transformation may be performed on the result generated by performing the second inverse transformation according to whether the first inverse transformation will be performed. The reconstructed residual signal may be generated by performing a first inverse transform on a result generated by performing a second inverse transform.

图16是根据实施例的编码设备的配置图。FIG. 16 is a configuration diagram of an encoding apparatus according to an embodiment.

编码设备1600可与以上描述的编码设备100对应。The encoding apparatus 1600 may correspond to the encoding apparatus 100 described above.

编码设备1600可包括通过总线1690彼此通信的处理单元1610、内存1630、用户界面(UI)输入装置1650、UI输出装置1660和存储器1640。编码设备1600还可包括连接到网络1699的通信单元1620。The encoding apparatus 1600 may include a processing unit 1610, a memory 1630, a user interface (UI) input device 1650, a UI output device 1660, and a memory 1640 in communication with each other through a bus 1690. The encoding device 1600 may also include a communication unit 1620 connected to the network 1699 .

处理单元1610可以是用于运行存储在内存1630或存储器1640中的处理指令的中央处理器(CPU)或半导体器件。处理单元1610可以是至少一个硬件处理器。The processing unit 1610 may be a central processing unit (CPU) or a semiconductor device for executing processing instructions stored in the memory 1630 or the memory 1640 . The processing unit 1610 may be at least one hardware processor.

处理单元1610可生成并处理被输入到编码设备1600的、从编码设备1600输出的、或者在编码设备1600中使用的信号、数据或信息,并且可执行与信号、数据或信息相关的检查、比较、确定等。换句话说,在实施例中,可由处理单元1610执行数据或信息的生成和处理以及与数据或信息相关的检查、比较和确定。The processing unit 1610 may generate and process signals, data, or information input to the encoding apparatus 1600, output from the encoding apparatus 1600, or used in the encoding apparatus 1600, and may perform checking, comparison on the signals, data, or information , OK, etc. In other words, in an embodiment, the generation and processing of data or information and the checking, comparison and determination of data or information may be performed by the processing unit 1610 .

处理单元1610可包括帧间预测单元110、帧内预测单元120、切换器115、减法器125、变换单元130、量化单元140、熵编码单元150、反量化单元160、逆变换单元170、加法器175、滤波器单元180和参考画面缓冲器190。The processing unit 1610 may include an inter prediction unit 110, an intra prediction unit 120, a switch 115, a subtractor 125, a transform unit 130, a quantization unit 140, an entropy encoding unit 150, an inverse quantization unit 160, an inverse transform unit 170, an adder 175 , filter unit 180 and reference picture buffer 190 .

帧间预测单元110、帧内预测单元120、切换器115、减法器125、变换单元130、量化单元140、熵编码单元150、反量化单元160、逆变换单元170、加法器175、滤波器单元180和参考画面缓冲器190中的至少一些可以是程序模块,并且可与外部装置或系统进行通信。所述程序模块可以以操作系统、应用程序模块或其它程序模块的形式被包括在编码设备1600中。Inter prediction unit 110, intra prediction unit 120, switch 115, subtractor 125, transform unit 130, quantization unit 140, entropy encoding unit 150, inverse quantization unit 160, inverse transform unit 170, adder 175, filter unit At least some of 180 and reference picture buffer 190 may be program modules and may communicate with an external device or system. The program modules may be included in the encoding device 1600 in the form of an operating system, application program modules, or other program modules.

所述程序模块可被物理地存储在各种类型的公知的存储装置中。此外,所述程序模块中的至少一些也可被存储在能够与编码设备1200进行通信的远程存储装置中。The program modules may be physically stored in various types of well-known storage devices. Furthermore, at least some of the program modules may also be stored in a remote storage device capable of communicating with the encoding apparatus 1200 .

程序模块可包括但不限于用于执行根据实施例的功能或操作或者用于实现根据实施例的抽象数据类型的例程、子例程、程序、对象、组件和数据结构。Program modules may include, but are not limited to, routines, subroutines, programs, objects, components, and data structures for performing functions or operations according to embodiments or for implementing abstract data types according to embodiments.

可使用由编码设备1600的至少一个处理器运行的指令或代码来实现所述程序模块。The program modules may be implemented using instructions or code executed by at least one processor of the encoding device 1600 .

处理单元1610可运行帧间预测单元110、帧内预测单元120、切换器115、减法器125、变换单元130、量化单元140、熵编码单元150、反量化单元160、逆变换单元170、加法器175、滤波器单元180和参考画面缓冲器190中的指令或代码。Processing unit 1610 may operate inter prediction unit 110, intra prediction unit 120, switch 115, subtractor 125, transform unit 130, quantization unit 140, entropy encoding unit 150, inverse quantization unit 160, inverse transform unit 170, adders 175 , the filter unit 180 and the instructions or code in the reference picture buffer 190 .

存储单元可表示内存1630和/或存储器1640。内存1630和存储器1640中的每一个可以是各种类型的易失性或非易失性存储介质中的任意一种。例如,内存1630可包括只读存储器(ROM)1631和随机存取存储器(RAM)1632中的至少一个。The storage unit may represent memory 1630 and/or storage 1640 . Each of memory 1630 and storage 1640 may be any of various types of volatile or nonvolatile storage media. For example, the memory 1630 may include at least one of a read only memory (ROM) 1631 and a random access memory (RAM) 1632 .

存储单元可存储用于编码设备1600的操作的数据或信息。在实施例中,编码设备1600的数据或信息可被存储在存储单元中。The storage unit may store data or information used for the operation of the encoding apparatus 1600 . In an embodiment, the data or information of the encoding apparatus 1600 may be stored in a storage unit.

例如,存储单元可存储画面、块、列表、运动信息、帧间预测信息、比特流等。For example, the storage unit may store pictures, blocks, lists, motion information, inter prediction information, bitstreams, and the like.

编码设备1600可被实现在包括计算机可读存储介质的计算机系统中。The encoding apparatus 1600 may be implemented in a computer system including a computer-readable storage medium.

存储介质可存储编码设备1600的操作所需的至少一个模块。内存1630可存储至少一个模块,并且可被配置为使得所述至少一个模块由处理单元1610运行。The storage medium may store at least one module required for the operation of the encoding apparatus 1600 . The memory 1630 may store at least one module and may be configured such that the at least one module is executed by the processing unit 1610 .

可通过通信单元1620来执行与编码设备1600的数据或信息的通信相关的功能。Functions related to communication of data or information of the encoding apparatus 1600 may be performed through the communication unit 1620 .

例如,通信单元1620可将比特流发送到随后将描述的解码设备1600。For example, the communication unit 1620 may transmit the bitstream to the decoding apparatus 1600 which will be described later.

图17是根据实施例的解码设备的配置图。FIG. 17 is a configuration diagram of a decoding apparatus according to an embodiment.

解码设备1700可对应于以上描述的解码设备200。The decoding apparatus 1700 may correspond to the decoding apparatus 200 described above.

解码设备1700可包括通过总线1790彼此通信的处理单元1710、内存1730、用户界面(UI)输入装置1750、UI输出装置1760和存储器1740。解码设备1700还可包括连接到网络1799的通信单元1720。The decoding apparatus 1700 may include a processing unit 1710, a memory 1730, a user interface (UI) input device 1750, a UI output device 1760, and a memory 1740 in communication with each other through a bus 1790. The decoding device 1700 may also include a communication unit 1720 connected to the network 1799 .

处理单元1710可以是用于运行存储在内存1730或存储器1740中的处理指令的中央处理器(CPU)或半导体器件。处理单元1710可以是至少一个硬件处理器。The processing unit 1710 may be a central processing unit (CPU) or a semiconductor device for executing processing instructions stored in the memory 1730 or the memory 1740 . The processing unit 1710 may be at least one hardware processor.

处理单元1710可生成并处理被输入到解码设备1700的、从解码设备1700输出的、或者在解码设备1700中使用的信号、数据或信息,并且可执行与信号、数据或信息相关的检查、比较、确定等。换句话说,在实施例中,可由处理单元1710执行数据或信息的生成和处理以及与数据或信息相关的检查、比较和确定。The processing unit 1710 may generate and process signals, data or information input to the decoding apparatus 1700, output from the decoding apparatus 1700, or used in the decoding apparatus 1700, and may perform checking, comparison on the signals, data or information , OK, etc. In other words, in an embodiment, the generation and processing of data or information and the checking, comparison and determination of data or information may be performed by the processing unit 1710 .

处理单元1710可包括熵解码单元210、反量化单元220、逆变换单元230、帧内预测单元240、帧间预测单元250、切换器245、加法器255、滤波器单元260和参考画面缓冲器270。The processing unit 1710 may include an entropy decoding unit 210 , an inverse quantization unit 220 , an inverse transform unit 230 , an intra prediction unit 240 , an inter prediction unit 250 , a switch 245 , an adder 255 , a filter unit 260 , and a reference picture buffer 270 .

解码设备200的熵解码单元210、反量化单元220、逆变换单元230、帧内预测单元240、帧间预测单元250、加法器255、切换器245、滤波器单元260和参考画面缓冲器270中的至少一些可以是程序模块,并且可与外部装置或系统进行通信。所述程序模块可以以操作系统、应用程序模块或其它程序模块的形式被包括在解码设备1700中。In the entropy decoding unit 210 , the inverse quantization unit 220 , the inverse transform unit 230 , the intra prediction unit 240 , the inter prediction unit 250 , the adder 255 , the switch 245 , the filter unit 260 and the reference picture buffer 270 of the decoding apparatus 200 At least some of the modules may be program modules and may communicate with external devices or systems. The program modules may be included in the decoding device 1700 in the form of an operating system, application program modules, or other program modules.

程序模块可被物理地存储在各种类型的公知的存储装置中。此外,所述程序模块中的至少一些也可被存储在能够与解码设备1700进行通信的远程存储装置中。Program modules may be physically stored in various types of well-known storage devices. Furthermore, at least some of the program modules may also be stored in a remote storage device capable of communicating with the decoding apparatus 1700 .

程序模块可包括但不限于用于执行根据实施例的功能或操作或者用于实现根据实施例的抽象数据类型的例程、子例程、程序、对象、组件和数据结构。Program modules may include, but are not limited to, routines, subroutines, programs, objects, components, and data structures for performing functions or operations according to embodiments or for implementing abstract data types according to embodiments.

可使用由解码设备1700的至少一个处理器运行的指令或代码来实现所述程序模块。The program modules may be implemented using instructions or code executed by at least one processor of the decoding apparatus 1700 .

处理单元1710可运行熵解码单元210、反量化单元220、逆变换单元230、帧内预测单元240、帧间预测单元250、切换器245、加法器255、滤波器单元260和参考画面缓冲器270中的指令或代码。Processing unit 1710 may operate entropy decoding unit 210 , inverse quantization unit 220 , inverse transform unit 230 , intra prediction unit 240 , inter prediction unit 250 , switch 245 , adder 255 , filter unit 260 , and reference picture buffer 270 instruction or code in .

存储单元可表示内存1730和/或存储器1740。内存1730和存储器1740中的每一个可以是各种类型的易失性或非易失性存储介质中的任意一种。例如,内存1730可包括ROM1731和RAM 1732中的至少一个。The storage unit may represent memory 1730 and/or storage 1740 . Each of memory 1730 and storage 1740 may be any of various types of volatile or non-volatile storage media. For example, the memory 1730 may include at least one of the ROM 1731 and the RAM 1732 .

存储单元可存储用于解码设备1700的操作的数据或信息。在实施例中,解码设备1700的数据或信息可被存储在存储单元中。The storage unit may store data or information for the operation of the decoding apparatus 1700 . In an embodiment, the data or information of the decoding apparatus 1700 may be stored in a storage unit.

例如,存储单元可存储画面、块、列表、运动信息、帧间预测信息、比特流等。For example, the storage unit may store pictures, blocks, lists, motion information, inter prediction information, bitstreams, and the like.

解码设备1700可被实现在包括计算机可读存储介质的计算机系统中。The decoding apparatus 1700 may be implemented in a computer system including a computer-readable storage medium.

存储介质可存储解码设备1700的操作所需的至少一个模块。内存1730可存储至少一个模块,并且可被配置为使得所述至少一个模块由处理单元1710运行。The storage medium may store at least one module required for the operation of the decoding apparatus 1700 . The memory 1730 may store at least one module and may be configured such that the at least one module is executed by the processing unit 1710 .

可通过通信单元1720来执行与解码设备1700的数据或信息的通信相关的功能。Functions related to communication of data or information of the decoding apparatus 1700 may be performed through the communication unit 1720 .

例如,通信单元1720可从编码设备1600接收比特流。For example, the communication unit 1720 may receive the bitstream from the encoding apparatus 1600 .

使用通道之间的信息的共享的图像处理方法Image processing method using sharing of information between channels

根据实施例的方法和设备可将使用预测和各种变换的变换编码(转码)技术应用于高分辨率图像(诸如4K或8K分辨率图像),可通过共享通道之间的各种类型的预定义编码决策信息来对图像进行编码和/或解码,并且可通过共享通道之间的发送的编码决策信息来对针对经过编码的图像的压缩比特流或压缩数据进行解码。Methods and apparatuses according to embodiments may apply transform coding (transcoding) techniques using prediction and various transforms to high-resolution images, such as 4K or 8K resolution images, by sharing various types of The encoding decision information is predefined to encode and/or decode the image, and the compressed bitstream or compressed data for the encoded image may be decoded by sharing the transmitted encoding decision information between channels.

多个通道可表示代表块的多个分量。例如,多个通道可包括颜色通道、深度通道、阿尔法通道等。Multiple channels may represent multiple components representing a block. For example, the plurality of channels may include color channels, depth channels, alpha channels, and the like.

在下文中,术语“通道”和“颜色”可具有相同的含义,并且可彼此可互换地使用。此外,术语“颜色”可指示通道中的一个。术语“通道”可与术语“颜色”、“深度”和“阿尔法”中的一个或更多个可互换地使用。Hereinafter, the terms "channel" and "color" may have the same meaning and may be used interchangeably with each other. Furthermore, the term "color" may indicate one of the channels. The term "channel" may be used interchangeably with one or more of the terms "color," "depth," and "alpha."

可使用本实施例中的技术,并且因此可解决当传统技术被应用于图像的编码和解码时发生的压缩率和图像质量劣化的问题。具体地,当传统技术被应用于像素值的变化在空间上集中的图像时,压缩率和图像质量的劣化的问题可能是严重的。The technique in the present embodiment can be used, and thus the problems of degradation in compression ratio and image quality that occur when conventional techniques are applied to encoding and decoding of images can be solved. In particular, when the conventional technique is applied to an image in which changes in pixel values are spatially concentrated, the problems of degradation of compression ratio and image quality may be serious.

在示例中,作为根据实施例的在通道之间共享以执行编码和解码的各种类型的编码决策信息,存在以下多条信息。在以下信息的名称中,“标志”可被省略。In an example, as various types of encoding decision information shared between channels to perform encoding and decoding according to an embodiment, there are the following pieces of information. In the names of the following information, "flag" may be omitted.

1)变换跳过标志(transform_skip_flag)信息可指示是否选择性地跳过变换。可选地,transform_skip_flag信息可指示使用变换和跳过变换中的一种。1) Transform skip flag (transform_skip_flag) information may indicate whether to selectively skip transform. Alternatively, the transform_skip_flag information may indicate one of using transform and skipping transform.

2)帧内平滑滤波信息可指示平滑滤波是否被应用于帧内预测中使用的参考像素。2) Intra smoothing filtering information may indicate whether smoothing filtering is applied to reference pixels used in intra prediction.

3)位置相关帧内预测组合(PDPC)标志(PDPC_flag)信息可指示当特定帧内预测(例如,平面预测)被执行时是否将通过使用应用了平滑(即,滤波)的邻近像素和没有应用平滑(即,滤波)的邻近像素来执行帧内预测。3) Position Dependent Intra Prediction Combination (PDPC) flag (PDPC_flag) information may indicate whether a smoothing (i.e., filtering) is applied by using adjacent pixels and not applied when a specific intra prediction (eg, planar prediction) is performed Smoothing (ie, filtering) neighboring pixels to perform intra prediction.

4)残差差分脉冲编码调制(RDPCM)标志(rdpcm_flag)信息可指示另外对经由一次预测获取的残差信号执行差分脉冲编码调制(DPCM)并再次获取残差信号的RDPCM是否将被执行。4) Residual Differential Pulse Code Modulation (RDPCM) flag (rdpcm_flag) information may indicate whether RDPCM that additionally performs Differential Pulse Code Modulation (DPCM) on the residual signal obtained through one prediction and obtains the residual signal again will be performed.

5)多变换选择(MTS)标志(mts_flag)信息可指示基于扩展多变换(EMT)的编码方法是否将被使用。5) The multiple transform selection (MTS) flag (mts_flag) information may indicate whether an extended multiple transform (EMT) based encoding method is to be used.

EMT可以是选择并使用所提供的多个变换中的用于作为目标块的变换块的指定变换的编码方法。EMT may be an encoding method that selects and uses a specified transform for a transform block serving as a target block from among a plurality of transforms provided.

EMT还可代表“增强的多变换”,并且还可指示“多变换选择(MTS)”。EMT may also stand for "Enhanced Multi-Transform" and may also indicate "Multi-Transform Selection (MTS)".

6)EMT标志信息可指示EMT是否将被使用。6) EMT flag information may indicate whether EMT is to be used.

7)MTS索引(mts_idx)信息可指示当使用MTS时将在水平方向和垂直方向上使用哪些变换。7) The MTS index (mts_idx) information may indicate which transforms will be used in the horizontal and vertical directions when using MTS.

mts_idx信息的一部分(例如,mts_idx中的一个指定比特)可以是指示在残差信号的水平方向上使用的变换的信息。A part of the mts_idx information (eg, a designated bit in mts_idx) may be information indicating the transform used in the horizontal direction of the residual signal.

mts_idx信息的另一部分或mts_idx信息的其余部分的一部分(例如,mts_idx中的一个其他指定比特)可以是指示在残差信号的垂直方向上使用的变换的信息。Another part of the mts_idx information or part of the rest of the mts_idx information (eg, one other specified bit in mts_idx) may be information indicating the transform used in the vertical direction of the residual signal.

例如,根据mts_idx信息对变换的确定可如下表7和表8中所示被配置。For example, the determination of the transformation according to the mts_idx information may be configured as shown in Table 7 and Table 8 below.

[表7][Table 7]

Figure BDA0002623340100000641
Figure BDA0002623340100000641

Figure BDA0002623340100000651
Figure BDA0002623340100000651

[表8][Table 8]

Figure BDA0002623340100000652
Figure BDA0002623340100000652

在表7中,例示了根据帧内预测模式和mts_idx信息的值而使用的水平方向上的变换和垂直方向上的变换。In Table 7, the transform in the horizontal direction and the transform in the vertical direction used according to the intra prediction mode and the value of the mts_idx information are exemplified.

根据表7,当获取mts_idx信息以便执行目标块的帧内预测或帧间预测时,可根据mts_idx信息的值来确定将被用于目标块的变换的水平方向上的变换和垂直方向上的变换。According to Table 7, when the mts_idx information is acquired in order to perform intra prediction or inter prediction of the target block, the transformation in the horizontal direction and the transformation in the vertical direction to be used for the transformation of the target block may be determined according to the value of the mts_idx information .

例如,当目标块的帧内预测模式是6并且mts_idx信息的值是2时,DCT-7可被用作水平方向上的变换,并且DCT-8可被用作垂直方向上的变换。For example, when the intra prediction mode of the target block is 6 and the value of mts_idx information is 2, DCT-7 may be used as transform in the horizontal direction, and DCT-8 may be used as transform in the vertical direction.

表8示出表7的修改的示例。在表8中,MTS_CU_flag可指示基于CU确定并发送标志信息mts_flag,其中,所述标志信息mts_flag指示多变换选择(MTS)方法是否被使用。此外,MTS_Hor_flag和MTS_Ver_flag可分别指示在水平方向上使用的变换和在垂直方向上使用的变换。表8可通过MTS_Hor_flag和MTS_Ver_flag的值来例示在水平方向和垂直方向上使用的变换。Table 8 shows a modified example of Table 7. In Table 8, MTS_CU_flag may indicate that flag information mts_flag is determined and transmitted based on the CU, wherein the flag information mts_flag indicates whether a multiple transform selection (MTS) method is used. Also, MTS_Hor_flag and MTS_Ver_flag may indicate the transform used in the horizontal direction and the transform used in the vertical direction, respectively. Table 8 may illustrate transforms used in the horizontal and vertical directions by the values of MTS_Hor_flag and MTS_Ver_flag.

可选地,根据表7的mts_idx信息对变换的确定可如下表9中所示被配置。Optionally, the determination of the transformation according to the mts_idx information of Table 7 may be configured as shown in Table 9 below.

[表9][Table 9]

Figure BDA0002623340100000653
Figure BDA0002623340100000653

Figure BDA0002623340100000661
Figure BDA0002623340100000661

在表9中,在帧内预测和帧间预测中,例示了根据mts_idx信息的值而使用的水平变换类型和垂直变换类型。In Table 9, in intra prediction and inter prediction, the horizontal transform type and the vertical transform type used according to the value of mts_idx information are exemplified.

水平变换类型的各个值可指示特定变换。例如,水平变换类型的值“1”可表示DST-7。水平变换类型的值“2”可表示DCT-8。Each value of the horizontal transform type can indicate a specific transform. For example, a value of "1" for the horizontal transform type may represent DST-7. A value of "2" for the horizontal transform type may represent DCT-8.

8)不可分离二次变换(NSST)标志(nsst_flag)信息可指示用于对经由初次变换获取的全部或一些变换系数另外执行不可分离二次变换的NSST编码方法是否将被使用。8) Non-separable secondary transform (NSST) flag (nsst_flag) information may indicate whether an NSST encoding method for additionally performing non-separable secondary transform on all or some of the transform coefficients obtained through the primary transform is to be used.

9)NSST索引(nsst_idx)信息可指示当使用NSST编码方法时将被应用于全部或一些变换系数的二次变换的类型。9) NSST index (nsst_idx) information may indicate the type of secondary transform to be applied to all or some transform coefficients when the NSST encoding method is used.

nsst_idx信息可指示将被用于不可分离二次变换的变换。The nsst_idx information may indicate the transform to be used for the non-separable secondary transform.

10)CU跳过标志信息可指示发送关于CU的经过编码的数据的步骤是否将被跳过。10) CU skip flag information may indicate whether the step of sending encoded data for the CU is to be skipped.

11)CU局部照度补偿(LIC)标志(CU_lic_flag)信息可指示是否补偿块的亮度值之间的差。11) CU local illuminance compensation (LIC) flag (CU_lic_flag) information may indicate whether to compensate for differences between luminance values of blocks.

12)重叠块运动补偿(OBMC)标志(obmc_flag)信息可指示是否使用多个重叠运动补偿块来生成最终运动补偿块。12) Overlapping block motion compensation (OBMC) flag (obmc_flag) information may indicate whether multiple overlapping motion compensation blocks are used to generate the final motion compensation block.

13)codeAlfCtuEnable标志(codeAlfCtuEnable_flag)信息可指示自适应环路滤波器(ALF)是否可应用于当前CTU的像素值。13) The codeAlfCtuEnable flag (codeAlfCtuEnable_flag) information may indicate whether an adaptive loop filter (ALF) can be applied to the pixel value of the current CTU.

当在通道之间共享这样的编码决策信息时,可在提高图像压缩率的同时获得具有优异图像质量的图像。When such encoding decision information is shared among channels, an image with excellent image quality can be obtained while improving the image compression rate.

描述根据本实施例的将在通道之间共享的编码决策信息时,为了便于整个描述和理解实施例(诸如操作、附图和方程式的描述),可将transform_skip_flag信息用作将在通道之间共享的编码决策信息的示例。When describing encoding decision information to be shared between channels according to the present embodiment, in order to facilitate the overall description and understanding of the embodiment (such as the description of operations, figures, and equations), transform_skip_flag information may be used as the information to be shared between channels An example of encoding decision information.

然而,transform_skip_flag信息仅是单个示例,并且应用了本实施例的将在通道之间共享的编码决策信息不必仅表示transform_skip_flag信息。However, the transform_skip_flag information is only a single example, and the encoding decision information to be shared between channels to which the present embodiment is applied does not necessarily represent only the transform_skip_flag information.

例如,应理解,解码所需的上述多条编码决策信息中的一条或更多条(诸如1)rdpcm_flag信息、2)诸如mts_flag信息、mts_idx信息、nsst_flag信息和nsst_idx信息的多条变换相关选择信息、3)obmc_flag信息以及4)PDPC_flag信息)被包括在将在通道之间共享的编码决策信息中。For example, it should be understood that one or more pieces of the above-mentioned pieces of encoding decision information required for decoding (such as 1) rdpcm_flag information, 2) pieces of transform-related selection information such as mts_flag information, mts_idx information, nsst_flag information, and nsst_idx information , 3) obmc_flag information and 4) PDPC_flag information) are included in coding decision information to be shared between channels.

此外,当描述根据实施例的将共享解码所需的编码决策信息的通道时,可将YCbCr颜色空间用作示例。然而,YCbCr颜色空间仅是单个详细的示例,并且实施例可被应用于各种颜色空间,诸如YUV颜色空间、XYZ颜色空间和RGB颜色空间。Furthermore, the YCbCr color space may be used as an example when describing channels that will share encoding decision information required for decoding according to an embodiment. However, the YCbCr color space is only a single detailed example, and embodiments can be applied to various color spaces, such as YUV color space, XYZ color space, and RGB color space.

颜色索引cIDX可以是指示颜色空间中的通道中的一个通道的通道索引。The color index cIDX may be a channel index indicating one of the channels in the color space.

对于YCbCr颜色空间和YUV颜色空间,cIDX可针对在对应颜色空间中顺序显示的通道具有诸如“0/1/2”的值。值“a/b/c”可表示指示第一通道的cIDX的值是‘a’,指示第二通道的cIDX的值是‘b’,并且指示第三通道的cIDX的值是‘c’。For the YCbCr color space and the YUV color space, cIDX may have values such as "0/1/2" for the channels displayed sequentially in the corresponding color space. The value "a/b/c" may indicate that the value of cIDX indicating the first channel is 'a', the value of cIDX indicating the second channel is 'b', and the value of cIDX indicating the third channel is 'c'.

可选地,对于YCbCr颜色空间和YUV颜色空间,cIDX可针对在对应颜色空间中顺序显示的通道具有诸如“0/2/1”的值。Alternatively, for the YCbCr color space and the YUV color space, cIDX may have values such as "0/2/1" for the channels displayed sequentially in the corresponding color space.

对于RGB颜色空间和XYZ颜色空间,cIDX可针对在对应颜色空间中顺序显示的通道具有诸如“1/0/2”或“2/0/1”的值。For the RGB color space and the XYZ color space, cIDX may have values such as "1/0/2" or "2/0/1" for the channels displayed sequentially in the corresponding color space.

作为出于实现高效图像编码/解码的目的已经开发或正开发的图像压缩技术,可存在各种技术,诸如1)从目标画面之前或之后的画面对包括在目标画面中的像素的值进行预测的帧间预测技术、2)使用目标画面中的像素的信息对包括在当前目标画面中的像素的值进行预测的帧内预测技术、3)对作为预测误差剩余的残差信号的能量进行压缩的变换和量化技术、4)将短码分配给更频繁出现的值并将长码分配给较不频繁出现的值的熵编码技术以及算术编码技术。通过利用这些图像压缩技术,图像数据可被有效地压缩、被发送和被存储。As image compression techniques that have been developed or are being developed for the purpose of realizing high-efficiency image encoding/decoding, there may be various techniques such as 1) predicting values of pixels included in a target picture from pictures before or after the target picture 2) An intra prediction technique that predicts the value of a pixel included in the current target picture using the information of the pixel in the target picture, 3) Compresses the energy of the residual signal remaining as a prediction error 4) Entropy coding techniques that assign short codes to more frequently occurring values and long codes to less frequently occurring values, and arithmetic coding techniques. By utilizing these image compression techniques, image data can be efficiently compressed, transmitted and stored.

存在可被应用于图像的编码的各种压缩技术。此外,根据将被编码的图像的属性,特定压缩技术可能比其它压缩技术更有利。因此,编码设备1600可通过自适应地确定是否针对目标块使用各种类型的多种压缩技术中的任意一种来对目标块执行最有利的压缩。There are various compression techniques that can be applied to the encoding of images. Furthermore, depending on the properties of the image to be encoded, certain compression techniques may be more advantageous than others. Accordingly, the encoding apparatus 1600 may perform the most favorable compression on the target block by adaptively determining whether to use any one of various types of compression techniques for the target block.

因此,为了从各种可选择的压缩技术中选择对于目标块最有利的压缩技术,编码设备1600通常可执行率失真优化(RDO)。从率失真的角度来看,可能预先不知道可被选择以用于图像的编码的各种图像编码决策中的哪一个是最优的。因此,编码设备1600可通过对所有可用图像编码决策的各个组合执行编码(或简化编码)来计算针对所有可用图像编码决策的组合的率失真值,并且可确定并使用具有计算出的率失真值中的最小率失真值的图像编码决策作为用于目标块的最终图像编码决策。Therefore, in order to select the most advantageous compression technique for the target block from among various alternative compression techniques, the encoding apparatus 1600 typically may perform rate-distortion optimization (RDO). From a rate-distortion perspective, it may not be known in advance which of the various image encoding decisions that may be selected for encoding of an image is optimal. Accordingly, the encoding apparatus 1600 may calculate rate-distortion values for the combinations of all available image encoding decisions by performing encoding (or simplified encoding) for each combination of all available image encoding decisions, and may determine and use the rate-distortion values with the calculated rate-distortion values The image encoding decision with the smallest rate-distortion value in is used as the final image encoding decision for the target block.

此外,编码设备1600可将通过执行这样的RDO而推导出的或使用由编码设备1600选择的附加决策方法而推导出的编码决策记录在比特流中。解码设备1700可读取(即,解析)记录在比特流中的编码决策,并且可通过根据编码决策执行与编码对应的逆处理来准确地对目标块执行解码。In addition, the encoding apparatus 1600 may record in the bitstream encoding decisions derived by performing such RDO or derived using an additional decision method selected by the encoding apparatus 1600 . The decoding apparatus 1700 may read (ie, parse) the encoding decision recorded in the bitstream, and may accurately perform decoding on the target block by performing inverse processing corresponding to encoding according to the encoding decision.

这里,指示编码决策的信息可被称为解码所需的“编码决策信息”或“编码信息”。Here, the information indicating the encoding decision may be referred to as "encoding decision information" or "encoding information" required for decoding.

在下文中,术语“编码决策信息”和“编码信息”可具有相同的含义,并且可彼此可互换地使用。Hereinafter, the terms "coding decision information" and "coding information" may have the same meaning and may be used interchangeably with each other.

通常,用于图像的多个通道(例如,YUV、YCbCr、RGB和XYZ)可能不总是具有相同或相似的属性。因此,从提高压缩率的角度来看,独立于多个通道中的每一个进行编码决策通常可实现更好的性能。Often, the multiple channels used for an image (eg, YUV, YCbCr, RGB, and XYZ) may not always have the same or similar properties. Therefore, making encoding decisions independently of each of the multiple channels generally achieves better performance from a compression ratio perspective.

例如,作为上述编码决策中的一个,存在作为指示是否对目标块执行变换的编码决策的transform_skip_flag。也就是说,可确定是否将针对块中的每一个块跳过变换,并且指示这样的决策的transform_skip_flag信息可作为编码决策信息被记录在针对多个通道中的每一个通道的比特流中。For example, as one of the above-described encoding decisions, there is transform_skip_flag as an encoding decision indicating whether to perform transform on the target block. That is, it may be determined whether transform is to be skipped for each of the blocks, and transform_skip_flag information indicating such a decision may be recorded in the bitstream for each of the plurality of channels as encoding decision information.

通常,在用于图像压缩的编码中,已经考虑总是执行针对目标块的变换。然而,当作为压缩的目标的目标块中的像素的值的空间改变非常大时,或者特别是当像素值的改变非常局部受限时,即使应用了变换,图像能量集中在低频上的程度也可能不是很大,并且相反,可能出现具有相对大的值的针对高频区域的较大数量的变换系数。Generally, in encoding for image compression, it has been considered to always perform a transform for a target block. However, when the spatial change of the value of the pixel in the target block that is the target of compression is very large, or especially when the change of the pixel value is very locally limited, the degree to which the image energy is concentrated on the low frequency even if the transformation is applied It may not be very large, and instead, there may be a larger number of transform coefficients for high frequency regions with relatively large values.

因此,当通过变换和量化处理大量保持低频信号分量并消除高频信号分量时,或者当用于通过应用强量化来减少数据量的变换和量化处理被应用时,可能发生图像质量的严重劣化。特别地,当像素的值的空间改变非常大或者像素值的改变集中在非常局部受限的区域上时,这种图像质量的劣化可能进一步增强。Therefore, when the low-frequency signal components are largely maintained and the high-frequency signal components are eliminated by transform and quantization processing, or when transform and quantization processing for reducing the amount of data by applying strong quantization is applied, severe degradation of image quality may occur. In particular, this degradation of image quality may be further enhanced when the spatial changes in the values of pixels are very large or the changes in pixel values are concentrated on very locally restricted areas.

为了解决上述问题,可使用用于在无需变换的情况下直接对空间域中的像素的值进行编码的方法,而不是一致地将变换应用于目标块。根据这种方法,可确定是否对每个变换块执行变换。通过基于这样的决策执行变换或跳过变换,可执行变换块的编码。在比特流中,可记录作为指示是否将跳过执行变换的编码决策信息的transform_skip_flag信息。To address the above problems, instead of uniformly applying the transform to the target block, a method for directly encoding the value of a pixel in the spatial domain without transforming can be used. According to this method, it can be determined whether to perform transform on each transform block. By performing transforms or skipping transforms based on such decisions, encoding of transform blocks may be performed. In the bitstream, transform_skip_flag information, which is encoding decision information indicating whether or not to perform transform is to be skipped, may be recorded.

例如,当transform_skip_flag信息的值为1时,可跳过变换。当transform_skip_flag信息的值为0时,可执行变换。编码设备1600可通过transform_skip_flag信息将关于是否将针对目标块跳过变换的信息传送到解码设备1700,并且可借助于这种传送来解决上述问题。For example, when the value of the transform_skip_flag information is 1, the transform may be skipped. When the value of the transform_skip_flag information is 0, transformation can be performed. The encoding apparatus 1600 may transmit the information on whether to skip the transform for the target block to the decoding apparatus 1700 through the transform_skip_flag information, and may solve the above-mentioned problem by means of such transmission.

此外,可针对亮度通道(即,Y通道)和色度通道(即,Cb通道和Cr通道)分别设置多条transform_skip_flag信息,并且然后可发送所述多条transform_skip_flag信息。解码设备1700可根据从比特流读取(即,解析)的针对每个通道的transform_skip_flag信息的值,通过跳过或执行对目标块的通道的变换来对目标块执行解码。Also, pieces of transform_skip_flag information may be respectively set for a luma channel (ie, a Y channel) and a chroma channel (ie, a Cb channel and a Cr channel), and then the pieces of transform_skip_flag information may be transmitted. The decoding apparatus 1700 may perform decoding on the target block by skipping or performing transformation on the channels of the target block according to the value of transform_skip_flag information for each channel read (ie, parsed) from the bitstream.

然而,当针对所有变换块发送针对诸如Y、Cb和Cr的通道的多条transform_skip_flag信息时,可能出现开销会由于用信号发送多条transform_skip_flag信息而增加并且图像的压缩率可能劣化的另外问题。However, when multiple pieces of transform_skip_flag information for channels such as Y, Cb, and Cr are transmitted for all transform blocks, there may be additional problems that overhead may increase due to signaling multiple pieces of transform_skip_flag information and the compression rate of the image may be degraded.

为了减轻诸如压缩率的劣化的问题,仅当变换块的尺寸小于或等于特定变换块尺寸时,指示变换是否将被跳过的标志信息可被受限地发送。然而,即使使用这种方案,针对尺寸大于所述特定块尺寸的每个变换块也必须发送针对所有通道的指示是否将跳过变换的多条标志信息,这仍然可能使图像的压缩率劣化。此外,这种劣化的压缩率不可避免地降低了压缩图像的质量。In order to alleviate problems such as degradation of compression ratio, flag information indicating whether transform is to be skipped may be limitedly transmitted only when the size of the transform block is less than or equal to a specific transform block size. However, even with this scheme, pieces of flag information indicating whether the transform will be skipped must be sent for each transform block with a size larger than the specific block size for all channels, which may still degrade the compression rate of the image. Furthermore, this degraded compression ratio inevitably reduces the quality of the compressed image.

为了解决由针对所有通道发送由编码设备1600选择的多条编码决策信息引起的压缩率的劣化,在实施例中公开了使用通道之间的信息的共享的编码和/或解码方法。In order to solve the degradation of compression ratio caused by transmitting multiple pieces of encoding decision information selected by the encoding apparatus 1600 for all channels, encoding and/or decoding methods using sharing of information between channels are disclosed in the embodiments.

首先,可预先定义通道的图像属性被确定为彼此相似的条件。当满足这些条件时,可将由编码设备1600针对多个通道中的代表性通道确定的用于图像或块的编码决策信息发送到解码设备1700。First, the image properties of the predefinable channels are determined as conditions similar to each other. When these conditions are satisfied, the encoding decision information for the image or block determined by the encoding apparatus 1600 for a representative channel among the plurality of channels may be sent to the decoding apparatus 1700 .

对于多个通道中的除了代表性通道之外的所有其余通道或从所述其余通道中选择的通道,可共享和使用针对代表性通道传送的编码决策信息。通过这种共享和使用手段,可提高图像的压缩率。因此,即使不发送针对多个通道的各条编码决策信息,根据本实施例的编码和/或解码方法也可提供优异的编码效率。The encoding decision information communicated for the representative channel may be shared and used for all or selected channels of the plurality of channels except the representative channel. By this means of sharing and use, the compression ratio of the image can be improved. Therefore, the encoding and/or decoding method according to the present embodiment can provide excellent encoding efficiency even without transmitting pieces of encoding decision information for multiple channels.

这里,将被共享的编码决策信息可包括上述transform_skip_flag信息、帧内平滑滤波信息、rdpcm_flag信息、mts_flag信息、mts_idx信息、PDPC_flag信息、MTS_CU_flag信息、MTS_Hor_flag信息、MTS_Ver_flag信息、nsst_flag信息、nsst_idx信息、CU跳过标志信息、CU_lic_flag信息、obmc_flag信息、codeAlfCtuEnable_flag信息和PDPC_flag信息中的一个或更多个。Here, the coding decision information to be shared may include the above-mentioned transform_skip_flag information, intra smoothing filter information, rdpcm_flag information, mts_flag information, mts_idx information, PDPC_flag information, MTS_CU_flag information, MTS_Hor_flag information, MTS_Ver_flag information, nsst_flag information, nsst_idx information, CU skip One or more of flag information, CU_lic_flag information, obmc_flag information, codeAlfCtuEnable_flag information, and PDPC_flag information.

各个通道的图像属性被确定为彼此相似的条件The image properties of the individual channels are determined to be conditions that are similar to each other

为了确定通道的图像属性彼此相似,可检查跨通道预测(通道间预测)是否已经被用于目标块。In order to determine that the image properties of the channels are similar to each other, it may be checked whether cross-channel prediction (inter-channel prediction) has been used for the target block.

也就是说,为了对目标块的解码目标通道进行预测,可检查是否使用用于通过将特定模型应用于另外通道(例如,亮度通道)的重建信息来获取用于对目标通道进行解码的预测值的预测方法。例如,重建信息可以是重建像素的像素值或变换系数的值。所述特定模型可以是线性模型。That is, in order to predict the decoding target channel of the target block, it may be checked whether the prediction value for decoding the target channel is obtained using reconstruction information by applying a specific model to another channel (eg, the luma channel). forecasting method. For example, the reconstruction information may be pixel values of reconstructed pixels or values of transform coefficients. The specific model may be a linear model.

解码目标通道可以是多个通道中的作为当前将被解码的目标的通道。编码目标通道可以是多个通道中的作为当前将被编码的目标的通道。在下文中,也可将编码目标通道和/或解码目标通道简称为“目标通道”。The decoding target channel may be a channel that is currently a target to be decoded among the plurality of channels. The encoding target channel may be a channel that is a target to be currently encoded among the plurality of channels. In the following, the encoding target channel and/or the decoding target channel may also be simply referred to as "target channel".

例如,可检查用于目标块的帧内预测是否使用通过利用另外通道的重建信息来推导针对目标通道的预测值的帧内预测模式。For example, it may be checked whether the intra prediction for the target block uses an intra prediction mode in which a prediction value for the target channel is derived by using reconstruction information of another channel.

为了使用另外通道的重建信息来推导针对目标通道的预测值,使用单个线性模型的跨分量线性模型(CCLM)、使用多个线性模型的多模式线性模型(MMLM)以及使用多个滤波器的多滤波器线性模型可被使用。在CCLM中,可用“通道”来代替术语“分量”。In order to use the reconstruction information of another channel to derive the predicted value for the target channel, a cross-component linear model (CCLM) using a single linear model, a multimodal linear model (MMLM) using multiple linear models, and a multimodal linear model using multiple filters Filter linear models can be used. In CCLM, the term "component" may be replaced by "channel".

INTRA_CCLM模式可以是使用CCLM的帧内预测模式。INTRA_MMLM模式可以是使用MMLM的帧内预测模式。INTRA_MFLM模式可以是使用MFLM的帧内预测模式。The INTRA_CCLM mode may be an intra prediction mode using CCLM. INTRA_MMLM mode may be an intra prediction mode using MMLM. INTRA_MFLM mode may be an intra prediction mode using MFLM.

可选地,对通道的图像属性彼此相似的确定可通过检查目标块的帧内预测模式(例如,指示用于目标块的色度通道的帧内预测模式的intra_chroma_pred_mode信息)是否是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个来实现。Alternatively, the determination that the image attributes of the channels are similar to each other can be determined by checking whether the intra prediction mode of the target block (eg, intra_chroma_pred_mode information indicating the intra prediction mode for the chroma channel of the target block) is INTRA_CCLM mode, INTRA_MMLM mode and one of INTRA_MFLM mode.

可选地,对通道的图像属性彼此相似的确定可通过检查目标块的目标通道的编码模式是否在不经改变的情况下使用另外通道(例如,亮度通道)的编码模式来实现。例如,对通道的图像属性彼此相似的确定可通过检查目标块的帧内预测模式(例如,intra_chroma_pred_mode信息)是否是直接模式(DM)来实现。直接模式也可被称为“推导模式”。DM可以是指示亮度通道的帧内预测模式由于亮度通道与色度通道之间的相关性可能较高的特性而在不经改变的情况下被用作色度通道的帧内预测模式的模式。Alternatively, the determination that the image properties of the channels are similar to each other can be achieved by checking whether the encoding mode of the target channel of the target block uses the encoding mode of another channel (eg, the luma channel) unchanged. For example, the determination that the image properties of the channels are similar to each other may be realized by checking whether the intra prediction mode (eg, intra_chroma_pred_mode information) of the target block is direct mode (DM). Direct mode may also be referred to as "derived mode". DM may be a mode indicating that the intra prediction mode of the luma channel is used as the intra prediction mode of the chroma channel without change due to the characteristic that the correlation between the luma channel and the chroma channel may be high.

可参照下表10和表11更详细地定义作为帧内预测模式之一的DM的特征及其详细操作。Features of DM as one of the intra prediction modes and detailed operations thereof may be defined in more detail with reference to Table 10 and Table 11 below.

表10示出(当sps_cclm_enabled_flag信息的值为0时)用于设置用于色度信号的帧内预测的IntraPredModeC值的方法。Table 10 shows (when the value of sps_cclm_enabled_flag information is 0) a method for setting the IntraPredModeC value for intra prediction of chroma signals.

表11示出(当sps_cclm_enabled_flag信息的值为1时)用于设置用于色度信号的帧内预测的IntraPredModeC值的方法。Table 11 shows (when the value of sps_cclm_enabled_flag information is 1) a method for setting the IntraPredModeC value for intra prediction of chroma signals.

[表10][Table 10]

Figure BDA0002623340100000711
Figure BDA0002623340100000711

[表11][Table 11]

Figure BDA0002623340100000721
Figure BDA0002623340100000721

一般地,在帧内预测中,可确定是否使用帧内跨分量线性模型(CCLM)模式、帧内多模型LM(MMLM)模式或帧内多滤波器LM(MFLM)模式,其中,在所述模式中针对单个通道(例如,亮度通道,或更一般地,代表性通道)的重建像素的像素值被用于计算针对另外通道(例如,色度通道,或更一般地,目标通道)的预测值。In general, in intra prediction, it may be determined whether to use intra cross-component linear model (CCLM) mode, intra multi-model LM (MMLM) mode or intra multi-filter LM (MFLM) mode, where in the The pixel values of the reconstructed pixels in the mode for a single channel (eg, the luma channel, or more generally, the representative channel) are used to compute predictions for additional channels (eg, the chroma channel, or more generally, the target channel) value.

使用INTRA_CCLM模式、INTRA_MMLM模式或INTRA_MFLM模式的情况的指示可被分类为两种类型,并且根据sps_cclm_enabled_flag信息的值被详细定义,如表10和表11所示。The indication of the case of using INTRA_CCLM mode, INTRA_MMLM mode, or INTRA_MFLM mode can be classified into two types, and is defined in detail according to the value of sps_cclm_enabled_flag information, as shown in Table 10 and Table 11.

sps_cclm_enabled_flag信息可以是指示INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式是否将被启用的信息。可选地,sps_cclm_enabled_flag信息可以是指示INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式是否已经被启用的信息。sps_cclm_enabled_flag information may be information indicating whether INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode are to be enabled. Alternatively, sps_cclm_enabled_flag information may be information indicating whether INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode have been enabled.

当sps_cclm_enabled_flag的值为0时,可不使用INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式,而当sps_cclm_enabled_flag的值为1时,可使用INTRA_CCLM模式。可选地,当sps_cclm_enabled_flag的值为1时,可使用INTRA_MMLM模式和INTRA_MFLM模式中的至少一个。When the value of sps_cclm_enabled_flag is 0, the INTRA_CCLM mode, INTRA_MMLM mode and INTRA_MFLM mode may not be used, and when the value of sps_cclm_enabled_flag is 1, the INTRA_CCLM mode may be used. Optionally, when the value of sps_cclm_enabled_flag is 1, at least one of INTRA_MMLM mode and INTRA_MFLM mode may be used.

可通过检查色度通道的帧内预测模式(即,intra_chroma_pred_mode的值)是否是特定值(例如,表10中的4和表11中的5)来确定目标通道的帧内预测模式(例如,intra_chroma_pred_mode信息)是否是DM。在这种操作的描述中,当sps_cclm_enabled_flag的值为0时,可参考表10,而当sps_cclm_enabled_flag的值为1时,可参考表11。The intra prediction mode (eg, intra_chroma_pred_mode) of the target channel may be determined by checking whether the intra prediction mode of the chroma channel (ie, the value of intra_chroma_pred_mode) is a specific value (eg, 4 in Table 10 and 5 in Table 11) information) is a DM. In the description of this operation, when the value of sps_cclm_enabled_flag is 0, reference may be made to Table 10, and when the value of sps_cclm_enabled_flag is 1, reference may be made to Table 11.

当sps_cclm_enabled_flag的值为0并且目标通道的帧内预测模式(例如,intra_chroma_pred_mode)的值为4时,可考虑DM被应用。可选地,当sps_cclm_enabled_flag的值为1并且目标通道的帧内预测模式(例如,intra_chroma_pred_mode)的值为5时,可考虑DM被应用。对于由DM指示的目标块的目标通道(例如,色度通道)的帧内预测,指示代表性通道(例如,亮度通道)的帧内预测模式的IntraPredModeY的值可在不经改变的情况下被用作IntraPredModeC的值。When the value of sps_cclm_enabled_flag is 0 and the value of the intra prediction mode (eg, intra_chroma_pred_mode) of the target channel is 4, it may be considered that DM is applied. Optionally, when the value of sps_cclm_enabled_flag is 1 and the value of the intra prediction mode (eg, intra_chroma_pred_mode) of the target channel is 5, DM may be considered to be applied. For intra prediction of the target channel (eg, chroma channel) of the target block indicated by DM, the value of IntraPredModeY indicating the intra prediction mode of the representative channel (eg, luma channel) may be changed without change Used as the value for IntraPredModeC.

这里,色度信号的帧内预测模式intra_chroma_pred_mode可以是指示哪种类型的帧内预测将被用于色度信号的索引信息。Here, the intra prediction mode intra_chroma_pred_mode of the chroma signal may be index information indicating which type of intra prediction is to be used for the chroma signal.

借助于这种索引信息,指示实际被用于色度信号的帧内预测的帧内预测模式的最终值可以是IntraPredModeC的值。换句话说,IntraPredModeC可指示实际被用于色度信号的帧内预测的帧内预测模式。With this index information, the final value indicating the intra prediction mode actually used for the intra prediction of the chrominance signal may be the value of IntraPredModeC. In other words, IntraPredModeC may indicate an intra prediction mode actually used for intra prediction of chroma signals.

当sps_cclm_enabled_flag的值为0并且DM被应用(即,intra_chroma_pred_mode的值为4)时,如果IntraPredModeY的值为0、50、18或1,则IntraPredModeC的值也可以是0、50、18或1。When the value of sps_cclm_enabled_flag is 0 and DM is applied (ie, the value of intra_chroma_pred_mode is 4), if the value of IntraPredModeY is 0, 50, 18 or 1, the value of IntraPredModeC may also be 0, 50, 18 or 1.

这里,值0可表示平面模式(即,平面预测或平面方向),值1可表示DC模式,值18可表示水平模式,值50可表示垂直模式,并且值66可表示对角线模式。Here, a value of 0 may represent planar mode (ie, planar prediction or planar direction), a value of 1 may represent DC mode, a value of 18 may represent horizontal mode, a value of 50 may represent vertical mode, and a value of 66 may represent diagonal mode.

当IntraPredModeY的值是与四个值0、50、18和1中的任意一个不同的另外的值X时,IntraPredModeC的值此时也可以是等于IntraPredModeY的值的X。When the value of IntraPredModeY is another value X different from any one of the four values 0, 50, 18 and 1, the value of IntraPredModeC may also be X equal to the value of IntraPredModeY at this time.

另外,如表10中的前四行中所示,当cclm_enabled_flag的值为0时,如果IntraPredModeY的值为0、50、18或1,则IntraPredModeC的值可根据IntraPredModeY的值来确定。In addition, as shown in the first four rows in Table 10, when the value of cclm_enabled_flag is 0, if the value of IntraPredModeY is 0, 50, 18, or 1, the value of IntraPredModeC may be determined according to the value of IntraPredModeY.

例如,如表10中的第一行中所述,当IntraPredModeY的值为0、50、18或1时,IntraPredModeC的值可以是66、0、0或0。当IntraPredModeY的值是除了0、50、18和1之外的另外的值时,IntraPredModeC的值可以是0。For example, when the value of IntraPredModeY is 0, 50, 18, or 1, as described in the first row of Table 10, the value of IntraPredModeC can be 66, 0, 0, or 0. When the value of IntraPredModeY is another value other than 0, 50, 18, and 1, the value of IntraPredModeC may be 0.

此外,当sps_cclm_enabled_flag的值为1并且DM被应用(即,intra_chroma_pred_mode的值为5)时,如果IntraPredModeY的值为0、50、18或1,则IntraPredModeC的值也可以是0、50、18或1。Furthermore, when the value of sps_cclm_enabled_flag is 1 and DM is applied (ie, the value of intra_chroma_pred_mode is 5), if the value of IntraPredModeY is 0, 50, 18 or 1, the value of IntraPredModeC can also be 0, 50, 18 or 1 .

这里,值0可表示平面模式(即,平面预测或平面方向),值1可表示DC模式,值18可表示水平模式,值50可表示垂直模式,并且值66可表示对角线模式。Here, a value of 0 may represent planar mode (ie, planar prediction or planar direction), a value of 1 may represent DC mode, a value of 18 may represent horizontal mode, a value of 50 may represent vertical mode, and a value of 66 may represent diagonal mode.

当IntraPredModeY的值是与四个值0、50、18和1中的任意一个不同的另外的值X时,IntraPredModeC的值此时也可以是等于IntraPredModeY的值的X。When the value of IntraPredModeY is another value X different from any one of the four values 0, 50, 18 and 1, the value of IntraPredModeC may also be X equal to the value of IntraPredModeY at this time.

另外,如表11中的前五行中所示,当cclm_enabled_flag的值为1时,如果IntraPredModeY的值为0、50、18或1,则IntraPredModeC的值可根据IntraPredModeY的值来确定。In addition, as shown in the first five rows in Table 11, when the value of cclm_enabled_flag is 1, if the value of IntraPredModeY is 0, 50, 18, or 1, the value of IntraPredModeC may be determined according to the value of IntraPredModeY.

例如,如表11中的第一行中所述,当IntraPredModeY的值为0、50、18或1时,IntraPredModeC的值可以是66、0、0或0。当IntraPredModeY的值是除了0、50、18和1之外的另外的值时,IntraPredModeC的值可以是0。For example, when the value of IntraPredModeY is 0, 50, 18, or 1, the value of IntraPredModeC can be 66, 0, 0, or 0, as described in the first row in Table 11. When the value of IntraPredModeY is another value other than 0, 50, 18, and 1, the value of IntraPredModeC may be 0.

在另一实施例中,对通道的图像属性彼此相似的确定可通过检查是否将指示只有被另外通道(例如,亮度通道)的编码模式限制的特定模式将被使用的模式用作用于目标块的目标通道的编码模式来实现。例如,可通过检查目标通道的帧内预测模式是否是直接模式(DM)来进行对通道的图像属性彼此相似的确定。In another embodiment, the determination that the image properties of the channels are similar to each other can be determined by checking whether a mode indicating that only a specific mode limited by the encoding mode of another channel (eg, the luma channel) will be used is used as the mode for the target block The encoding mode of the target channel is implemented. For example, the determination that the image properties of the channels are similar to each other can be made by checking whether the intra prediction mode of the target channel is direct mode (DM).

使用通道之间的相关性的跨通道预测Cross-channel prediction using correlation between channels

跨通道预测可以是当对目标通道中的像素的像素值进预测时使用另外通道中的像素的像素值而非使用帧内预测或帧间预测的技术。Cross-channel prediction may be a technique that uses pixel values of pixels in another channel instead of intra- or inter-prediction when predicting pixel values of pixels in a target channel.

当目标块被编码时,跨通道预测的性能比其他类型的预测的性能好的事实可表示在目标块的通道中的像素的像素值之间存在相当大的相似性。When the target block is encoded, the fact that the performance of cross-channel prediction is better than that of other types of prediction may indicate that there is considerable similarity between the pixel values of the pixels in the channels of the target block.

因此,在这种情况下,当代表性通道的编码决策信息的值被确定时,将代表性通道的编码决策信息的所确定的值同样地用于另外通道的编码决策信息或者将由代表性通道的编码决策信息的值指示的特定值用于另外通道的编码决策信息可能是有利的。Therefore, in this case, when the value of the encoding decision information of the representative channel is determined, the determined value of the encoding decision information of the representative channel is likewise used for the encoding decision information of the other channel or will be determined by the representative channel It may be advantageous for the value of the encoding decision information to indicate a specific value for the encoding decision information of another channel.

例如,当代表性通道的transform_skip_flag信息的值为0(指示不跳过变换)时,即使在其它通道中,transform_skip_flag信息的所确定的值将为‘0’的概率也可能较高。For example, when the value of the transform_skip_flag information of the representative channel is 0 (indicating that no transform is skipped), the probability that the determined value of the transform_skip_flag information will be '0' may be high even in other channels.

因此,对于跨通道预测针对其是有效的图像或块,可能出现不需要针对多个通道单独指定多条transform_skip_flag信息的情况。其原因是通道之间的相似性较高,并且因此各个通道的多条transform_skip_flag信息将彼此相同的概率可能较高。Therefore, for a picture or block for which cross-channel prediction is valid, there may be cases where multiple pieces of transform_skip_flag information need not be individually specified for multiple channels. The reason for this is that the similarity between channels is high, and thus the probability that pieces of transform_skip_flag information of each channel will be the same as each other may be high.

尽管存在这样的图像属性,但是当针对图像的通道分别发送多条transform_skip_flag信息时,压缩率和图像质量可能劣化。Despite such image attributes, when multiple pieces of transform_skip_flag information are respectively transmitted for channels of an image, the compression rate and image quality may be degraded.

这种原理还可被应用于另外的编码决策信息,即,mts_flag信息、mts_idx信息、nsst_flag信息、nsst_idx信息、帧内平滑滤波信息、PDPC_flag信息和rdpcm_flag信息,并且针对代表性通道的编码决策信息的值将与针对另外通道的编码决策信息的值相同的概率可能较高。This principle can also be applied to additional coding decision information, namely mts_flag information, mts_idx information, nsst_flag information, nsst_idx information, intra smoothing information, PDPC_flag information and rdpcm_flag information, and the coding decision information for representative channels The probability that the value will be the same as the value of the encoding decision information for the other channel may be higher.

因此,基于通道的图像属性被确定为彼此相似的条件,可确定使用通道之间的相关性的跨通道预测是否已经被确定为目标块的编码模式。Therefore, based on the condition that the image attributes of the channels are determined to be similar to each other, it may be determined whether or not the cross-channel prediction using the correlation between the channels has been determined as the encoding mode of the target block.

例如,对跨通道预测是否已经被确定为目标块的编码模式的确定可旨在根据指示跨通道预测的颜色分量线性预测模式(CCLM)是否被应用于目标块来确定通道的图像属性是否彼此相似。为了确定CCLM是否被应用于目标块,可检查目标块的预测模式是否是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个。For example, the determination of whether cross-channel prediction has been determined as the encoding mode of the target block may be aimed at determining whether the image properties of the channels are similar to each other according to whether a color component linear prediction mode (CCLM) indicating cross-channel prediction is applied to the target block . In order to determine whether CCLM is applied to the target block, it may be checked whether the prediction mode of the target block is one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode.

例如,对跨通道预测是否已经被确定为目标块的编码模式的确定可旨在确定代表性通道(例如,亮度通道)的帧内模式是否在不经改变的情况下被用于另外通道(例如,色度通道Cb和Cr)。可选地,对跨通道预测是否已经被确定为目标块的编码模式的确定可旨在确定由代表性通道的帧内模式指示的特定帧内模式是否被用于另外通道。可选地,对跨通道预测是否已经被确定为目标块的编码模式的确定可旨在确定从代表性通道的帧内模式推导出的特定帧内模式是否被用于另外通道。For example, the determination of whether cross-channel prediction has been determined to be the encoding mode of the target block may be aimed at determining whether the intra-mode of a representative channel (eg, the luma channel) is used unchanged for another channel (eg, the luma channel) , chrominance channels Cb and Cr). Alternatively, the determination of whether the cross-channel prediction has been determined to be the coding mode of the target block may be aimed at determining whether a particular intra-mode indicated by the intra-mode of the representative channel is used for another channel. Alternatively, the determination of whether the cross-channel prediction has been determined to be the coding mode of the target block may be aimed at determining whether a particular intra-mode derived from the intra-mode of the representative channel is used for the other channel.

例如,对跨通道预测是否已经被确定为目标块的编码模式的确定可旨在确定编码设备1600和解码设备1700是否彼此一致地使用特定编码模式(例如,通道间共享模式)。For example, the determination of whether cross-channel prediction has been determined to be the encoding mode of the target block may be aimed at determining whether the encoding apparatus 1600 and the decoding apparatus 1700 use a particular encoding mode (eg, an inter-channel sharing mode) consistently with each other.

例如,对跨通道预测是否已经被确定为目标块的编码模式的确定可旨在确定色度通道的帧内预测模式是否是DM。For example, the determination of whether cross-channel prediction has been determined to be the encoding mode of the target block may be aimed at determining whether the intra-prediction mode of the chroma channel is DM.

这样的DM可以是指示由于亮度通道与色度通道之间的相关性可能较高的特性而将色度通道的帧内预测模式在不经改变的情况下用作亮度通道的帧内预测模式的模式。因此,当目标块的色度通道的帧内预测模式是DM时,可确定满足通道的图像属性被确定为彼此相似的条件。Such a DM may be an indication to use the intra-prediction mode of the chroma channel as the intra-prediction mode of the luma channel without change due to the characteristic that the correlation between the luma channel and the chroma channel may be high model. Therefore, when the intra prediction mode of the chroma channel of the target block is DM, it can be determined that the condition that the image attributes of the channels are determined to be similar to each other is satisfied.

除了以上示例中描述的条件之外,还可基于块的尺寸来确定跨通道预测是否已经被确定为目标块的编码模式。In addition to the conditions described in the above example, it may also be determined based on the size of the block whether cross-channel prediction has been determined as the encoding mode of the target block.

例如,块的尺寸越大,具有异质属性的像素将存在于对应块中的概率越高。因此,具有较大尺寸的块的通道之间的相似性可小于具有较小尺寸的块的通道之间的相似性。此外,当块的尺寸过小时,块的通道之间的相似性可能不稳定。For example, the larger the size of a block, the higher the probability that pixels with heterogeneous properties will exist in the corresponding block. Therefore, the similarity between channels of blocks with larger size may be less than the similarity between channels of blocks with smaller size. Furthermore, when the size of the blocks is too small, the similarity between the channels of the blocks may be unstable.

例如,可仅针对尺寸小于或等于特定尺寸的块执行通道之间的信息的共享。该特定尺寸可以是64×64、32×32或16×16。当块尺寸小于或等于该特定尺寸时,执行通道之间的信息的共享,并且因此可更可靠地满足通道的图像属性被确定为彼此相似的条件。For example, sharing of information between channels may be performed only for blocks whose size is less than or equal to a certain size. This particular size can be 64x64, 32x32 or 16x16. When the block size is less than or equal to the specific size, sharing of information between channels is performed, and thus the condition that the image attributes of the channels are determined to be similar to each other can be more reliably satisfied.

可选地,可仅对尺寸大于特定尺寸的块执行通道之间的信息的共享。该特定尺寸可以是4×4。当块的尺寸大于该特定尺寸时,执行通道之间的信息的共享,并且因此可更可靠地满足通道的图像属性被确定为彼此相似的条件。Alternatively, the sharing of information between channels may be performed only for blocks whose size is greater than a certain size. This particular size may be 4x4. When the size of the block is larger than the specific size, sharing of information between channels is performed, and thus the condition that the image attributes of the channels are determined to be similar to each other can be more reliably satisfied.

可选地,仅当块的尺寸大于第一特定尺寸(例如,4×4)并且小于或等于第二特定尺寸(例如,32×32或64×64)时,才可执行通道之间的信息的共享。仅当块的尺寸落入特定范围内时,才可执行通道之间的信息的共享,并且因此可更可靠地满足通道的图像属性被确定为彼此相似的条件。Optionally, inter-channel information can be performed only if the size of the block is larger than a first specific size (eg, 4×4) and less than or equal to a second specific size (eg, 32×32 or 64×64) of sharing. Sharing of information between channels can be performed only when the size of the block falls within a certain range, and thus the condition that the image attributes of the channels are determined to be similar to each other can be more reliably satisfied.

在实施例中,下面将描述用于通过通道之间的信息的共享对目标块进行编码的方法和设备,并且可提供以下功能。In an embodiment, a method and apparatus for encoding a target block through sharing of information between channels will be described below, and the following functions may be provided.

-可从压缩的比特流解析目标块的一个通道的编码决策信息,并且可使用所述一- Coding decision information for one channel of the target block can be parsed from the compressed bitstream, and the one can be used 个通道的编码决策信息来执行针对目标块的所有通道或一些选择的通道的解码。encoding decision information for each channel to perform decoding for all or some selected channels of the target block.

-比特流可被配置为使得仅针对目标块的代表性通道或一些选择的通道发送编码决策信息。- The bitstream may be configured such that encoding decision information is sent only for a representative channel of the target block or for some selected channels.

-可确定是否将针对一个通道跳过变换,并且可将对是否将跳过变换的确定应用于另外通道。- It may be determined whether the transform will be skipped for one channel, and the determination of whether the transform will be skipped may be applied to the other channel.

-对于变换块的一个通道,可从压缩的比特流解析transform_skip_flag信息。可通过利用经过解析的transform_skip_flag信息来针对变换块的一个通道或多个通道确定是否将跳过变换。- For one channel of a transform block, the transform_skip_flag information may be parsed from the compressed bitstream. Whether the transform is to be skipped may be determined for one channel or multiple channels of the transform block by utilizing the parsed transform_skip_flag information.

-对于变换块的一个通道,可用信号发送transform_skip_flag信息。针对一个通道的transform_skip_flag信息甚至可被用于另外通道。- For one channel of a transform block, transform_skip_flag information may be signaled. The transform_skip_flag information for one channel can even be used for another channel.

-可通过通道之间的信息的共享来有效地用信号发送编码决策信息。通过这种有效的信令,可提高编码效率和主观图像质量。- Encoding decision information can be efficiently signaled through the sharing of information between channels. Through this efficient signaling, coding efficiency and subjective image quality can be improved.

-特别地,当块中的像素值的空间改变非常大或非常急剧时,即使变换被应用于目标块,图像能量集中在低频上的程度也可能不大。此外,当通过将变换和量化处理应用于这样的块来大量保持低频信号分量并且消除高频信号分量时,或者当将强量化应用于这样的块时,可能发生图像质量的严重劣化。在实施例中,可基于编码设备1600的确定来节约地指示是否将跳过针对块的变换,而不引起大的开销。通过这种节约的指示,可提高图像的压缩率,并且可使图像质量的劣化最小化。- In particular, when the spatial changes of pixel values in a block are very large or very sharp, the degree of image energy concentration on low frequencies may not be large even if the transform is applied to the target block. Furthermore, when low-frequency signal components are largely maintained and high-frequency signal components are eliminated by applying transform and quantization processing to such blocks, or when strong quantization is applied to such blocks, severe degradation of image quality may occur. In an embodiment, whether the transform for a block is to be skipped may be indicated economically based on the determination of the encoding apparatus 1600 without incurring a large overhead. With this saving indication, the compression ratio of the image can be improved, and the degradation of the image quality can be minimized.

-当使用通道之间的相关性的跨通道预测技术被使用时,可不将多条编码决策信息分别用于多个通道。在实施例中,可针对一个通道发送编码决策信息,并且针对所述一个通道发送的编码决策信息可与所有其余通道或从其余通道中选择的一些通道共享和使用。通过这种共享,可解决压缩率和图像质量的劣化的问题。- When a cross-channel prediction technique using correlation between channels is used, multiple pieces of encoding decision information may not be used separately for multiple channels. In an embodiment, the encoding decision information may be sent for one channel, and the encoding decision information sent for the one channel may be shared and used with all the remaining channels or some channels selected from the remaining channels. By this sharing, the problems of degradation of compression rate and image quality can be solved.

根据颜色空间确定代表性通道Determining representative channels from color space

作为用于图像编码和解码的颜色空间,存在用于一般图像的编码和解码的YCbCr和YUV空间,并且此外,存在RGB、XYZ和YCoCg空间。当各种颜色空间中的一种是用于图像的编码和解码的目标颜色空间时,可将目标颜色空间的通道中的一个确定为目标颜色空间的代表性通道。As color spaces for image encoding and decoding, there are YCbCr and YUV spaces for encoding and decoding of general images, and further, RGB, XYZ, and YCoCg spaces exist. When one of the various color spaces is the target color space for encoding and decoding of the image, one of the channels of the target color space may be determined as a representative channel of the target color space.

在实施例中,可将通道中的与亮度信号具有最高相关性的颜色通道确定为代表性通道。例如,在RGB颜色空间中,G通道可与亮度信号具有最高相关性,并且因此G通道可被选为代表性通道。在XYZ颜色空间中,Y通道可与亮度信号具有最高相关性,并且因此Y通道可被选为代表性通道。在YCoCg颜色空间中,Y通道可与亮度信号具有最高相关性,并且因此Y通道可被选为代表性通道。In an embodiment, the color channel of the channels having the highest correlation with the luminance signal may be determined as the representative channel. For example, in the RGB color space, the G channel may have the highest correlation with the luminance signal, and thus the G channel may be selected as the representative channel. In the XYZ color space, the Y channel may have the highest correlation with the luminance signal, and thus the Y channel may be selected as the representative channel. In the YCoCg color space, the Y channel may have the highest correlation with the luminance signal, and thus the Y channel may be selected as the representative channel.

颜色空间中的通道可由诸如“0/1/2”的索引值来表示。SelectedCIDX可以是选择的颜色的索引。可选地,SelectedCIDX可以是指示所选择的代表性通道的索引值。可通过比特流中的关于目标块的多条信息中的指示所选择的代表性通道的索引SelectedCIDX来确定代表性通道。Channels in the color space may be represented by index values such as "0/1/2". SelectedCIDX can be the index of the selected color. Optionally, SelectedCIDX may be an index value indicating the selected representative channel. The representative channel may be determined by an index SelectedCIDX indicating the selected representative channel among the pieces of information about the target block in the bitstream.

例如,在YCbCr颜色空间中,SelectedCIDX的值可以是指示Y通道的0。For example, in the YCbCr color space, the value of SelectedCIDX may be 0 indicating the Y channel.

例如,在YCbCr颜色空间中,Cb通道可被确定为代表性通道。当Cb通道被确定为代表性通道时,SelectedCIDX的值可以是指示Cb通道的1。For example, in the YCbCr color space, the Cb channel can be determined as the representative channel. When the Cb channel is determined to be the representative channel, the value of SelectedCIDX may be 1 indicating the Cb channel.

例如,在YUV颜色空间中,U通道可被确定为代表性通道。当U通道被确定为代表性通道时,SelectedCIDX的值可以是指示U通道的1。For example, in the YUV color space, the U channel may be determined as the representative channel. When the U channel is determined to be the representative channel, the value of SelectedCIDX may be 1 indicating the U channel.

为了通道之间的编码决策信息的共享,可选择颜色空间中的特定通道作为代表性通道。在图像的编码和解码中,可在一个或更多个其余通道之间共享代表性通道的编码决策信息。For the sharing of coding decision information between channels, a specific channel in the color space can be selected as a representative channel. In encoding and decoding of images, encoding decision information for a representative channel may be shared among one or more of the remaining channels.

例如,编码设备1600可通过比特流仅将代表性通道的编码决策信息用信号发送到解码设备1700。可选地,解码设备1700可使用比特流来推导代表性通道的编码决策信息。可不单独用信号发送其余通道中的至少一些其余通道的编码决策信息。解码设备1700可使用代表性通道的编码决策信息来推导其余通道中的至少一些其余通道的编码决策信息。换句话说,代表性通道的编码决策信息可与其余通道中的至少一些其余通道共享。For example, the encoding apparatus 1600 may signal only the encoding decision information of the representative channel to the decoding apparatus 1700 through the bitstream. Alternatively, the decoding apparatus 1700 may use the bitstream to derive coding decision information for the representative channel. The encoding decision information for at least some of the remaining channels may not be separately signaled. The decoding apparatus 1700 may use the encoding decision information for the representative channel to derive encoding decision information for at least some of the remaining channels. In other words, the encoding decision information for the representative channel may be shared with at least some of the remaining channels.

例如,当与亮度信号具有最高相关性的Y通道被选择为YCbCr颜色空间中的代表性通道时,在亮度通道(即,Y)与色度通道(即,Cb和/或Cr)之间可存在相关性。因此,当预测被执行以用于图像压缩时,针对作为代表性通道的亮度通道的编码决策信息可被隐式地共享作为针对一个或更多个色度块的多条编码决策信息,而不是将独立的预测分别应用于颜色空间中的三个通道。所述一个或更多个色度块可包括Cb块和Cr块中的一个或更多个。For example, when the Y channel with the highest correlation with the luma signal is selected as the representative channel in the YCbCr color space, there may be differences between the luma channel (ie, Y) and the chroma channel (ie, Cb and/or Cr) There is a correlation. Therefore, when prediction is performed for image compression, encoding decision information for the luma channel as a representative channel can be implicitly shared as pieces of encoding decision information for one or more chroma blocks, instead of Apply independent predictions to each of the three channels in the color space. The one or more chroma blocks may include one or more of Cb blocks and Cr blocks.

例如,当Cb通道被选为YCbCr颜色空间中的代表性通道时,在构成色度通道的Cb信号与Cr信号之间可存在相关性。因此,当预测被执行以用于图像压缩时,针对作为代表性通道的Cb通道的编码决策信息可被隐式地共享作为针对Cr块的编码决策信息,而不是将独立的预测分别应用于两个色度通道。For example, when the Cb channel is selected as the representative channel in the YCbCr color space, there may be a correlation between the Cb signal and the Cr signal that make up the chrominance channel. Therefore, when prediction is performed for image compression, the encoding decision information for the Cb channel, which is the representative channel, can be implicitly shared as the encoding decision information for the Cr block, instead of applying independent prediction to the two chrominance channel.

在通道之间共享的编码决策信息Encoding decision information shared between channels

可在通道之间共享的编码决策信息可以是诸如语法元素的信息,其中,所述信息通过编码设备1600被编码并且作为包括在比特流中的信息被用信号发送到解码设备1700。例如,编码决策信息可包括标志、索引等。此外,编码决策信息可包括在编码和/或解码处理期间推导出的信息。此外,编码决策信息可表示对图像进行编码和/或解码所需的信息。The encoding decision information that can be shared between channels may be information such as syntax elements, which are encoded by the encoding device 1600 and signaled to the decoding device 1700 as information included in the bitstream. For example, encoding decision information may include flags, indexes, and the like. Additionally, encoding decision information may include information derived during encoding and/or decoding processes. Furthermore, encoding decision information may represent information required to encode and/or decode an image.

例如,编码决策信息可包括单元/块的尺寸、单元/块的深度、单元/块的分区信息、单元/块的分区结构、指示单元/块是否以四叉树的形式被分区的分区标志信息、指示单元/块是否以二叉树的形式被分区的分区标志信息、二叉树形式的分区方向(水平方向或垂直方向)、二叉树形式的分区形式(对称分区或非对称分区)、指示单元/块是否以三叉树的形式被分区的分区标志信息、三叉树形式的分区方向(水平方向或垂直方向)、三叉树形式的分区形式(对称分区或非对称分区)、指示单元/块是否以复合树的形式被分区的信息、复合树形式的分区的组合和方向(水平方向或垂直方向)、预测方案(帧内预测或帧间预测)、帧内预测模式/方向、参考样点滤波方法、预测块滤波方法、预测块边界滤波方法、滤波的滤波器抽头、滤波的滤波器系数、帧间预测模式、运动信息、运动矢量、参考画面索引、帧间预测方向、帧间预测指示符、参考画面列表、参考图像、运动矢量预测因子、运动矢量预测候选,运动矢量候选列表、指示合并模式是否被使用的信息、合并候选、合并候选列表、指示跳过模式是否被使用的信息、插值滤波器的类型、插值滤波器的滤波器抽头、插值滤波器的滤波器系数、运动矢量的大小、运动矢量的表示的精度、变换类型、变换尺寸、指示初次变换是否被使用的信息、指示附加(二次)变换是否被使用的信息、初次变换选择信息(或初次变换索引)、二次变换选择信息(或二次变换索引)、指示残差信号存在或不存在的信息、编码块样式、编码块标志、量化参数、量化矩阵、关于环内滤波器的信息、关于环内滤波器是否被应用的信息、环内滤波器的系数、环内滤波器的滤波器抽头、环内滤波器的形状/形式、指示去块滤波器是否被应用的信息、去块滤波器的系数、去块滤波器的抽头、去块滤波器的强度、去块滤波器的形状/形式、指示自适应样点偏移是否被应用的信息、指示自适应样点偏移是否被应用的信息、自适应样点偏移的值、自适应样点偏移的类别、自适应样点偏移的类型、指示自适应环路滤波器是否被应用的信息、自适应环路滤波器的系数、自适应环路滤波器的抽头、自适应环路滤波器的形状/形式、二值化/去二值化方法、上下文模型、上下文模型确定方法、上下文模型更新方法、指示常规模式是否被执行的信息、指示旁路模式是否被执行的信息、上下文二进制位、旁路二进制位、变换系数、变换系数等级、变换系数等级扫描方法、图像显示/输出序列、条带标识信息、条带类型、条带分区信息、并行块标识信息、并行块类型信息、并行块分区信息、画面类型、比特深度、关于亮度信号的信息和关于色度信号的信息、transform_skip_flag信息、初次变换选择信息、二次变换选择信息、参考样点滤波信息、PDPC_flag信息、rdpcm_flag信息、EMT标志信息、mts_flag信息、mts_idx信息、nsst_flag信息以及nsst_idx信息中的至少一个或组合。For example, the encoding decision information may include the size of the unit/block, the depth of the unit/block, the partition information of the unit/block, the partition structure of the unit/block, the partition flag information indicating whether the unit/block is partitioned in the form of a quadtree , partition flag information indicating whether the unit/block is partitioned in the form of a binary tree, the partition direction (horizontal or vertical direction) in the form of a binary tree, the partition form in the form of a binary tree (symmetrical or asymmetrical), whether the unit/block is partitioned in the form of a binary tree. Partition flag information to be partitioned in the form of a ternary tree, partition direction (horizontal direction or vertical direction) in the form of a ternary tree, partition form in the form of a ternary tree (symmetrical partition or asymmetrical partition), indicating whether the unit/block is in the form of a compound tree Partitioned information, combination and direction (horizontal direction or vertical direction) of partitions in composite tree, prediction scheme (intra prediction or inter prediction), intra prediction mode/direction, reference sample filtering method, prediction block filtering method, prediction block boundary filtering method, filtered filter taps, filtered filter coefficients, inter prediction mode, motion information, motion vector, reference picture index, inter prediction direction, inter prediction indicator, reference picture list, reference image, motion vector predictor, motion vector prediction candidate, motion vector candidate list, information indicating whether merge mode is used, merge candidate, merge candidate list, information indicating whether skip mode is used, type of interpolation filter, Filter taps of interpolation filter, filter coefficients of interpolation filter, size of motion vector, precision of representation of motion vector, transform type, transform size, information indicating whether primary transform is used, indicating additional (secondary) transform Information whether to be used, primary transform selection information (or primary transform index), secondary transform selection information (or secondary transform index), information indicating the presence or absence of residual signal, coding block style, coding block flag, quantization parameters, quantization matrix, information about the in-loop filter, information about whether the in-loop filter is applied, coefficients of the in-loop filter, filter taps of the in-loop filter, shape/form of the in-loop filter, indication Information whether the DF is applied, the coefficients of the DF, the taps of the DF, the strength of the DF, the shape/form of the DF, indicating whether the adaptive sample offset is applied or not information, information indicating whether adaptive sample offset is applied, value of adaptive sample offset, type of adaptive sample offset, type of adaptive sample offset, indicating adaptive loop filter Information whether it is applied, coefficients of the adaptive loop filter, taps of the adaptive loop filter, shape/form of the adaptive loop filter, binarization/debinarization method, context model, context model Determination method, context model update method, information indicating whether normal mode is executed, information indicating whether bypass mode is executed, context bin, bypass bin, transform coefficient, transform coefficient level, transform coefficient level scan method, image Display/output sequence, stripe identification information, stripe type, stripe partition information, Parallel block identification information, parallel block type information, parallel block partition information, picture type, bit depth, information on luminance signals and information on chroma signals, transform_skip_flag information, primary transform selection information, secondary transform selection information, reference samples At least one or a combination of point filter information, PDPC_flag information, rdpcm_flag information, EMT flag information, mts_flag information, mts_idx information, nsst_flag information, and nsst_idx information.

在可在通道之间共享的多条编码决策信息中,初次变换选择信息可以是为了使用与水平方向和/或垂直方向相关的一个或更多个DCT变换核和/或DST变换核的组合来对残差信号执行变换过程所需的变换信息。例如,初次变换选择信息可以是为了在初次变换中使用MTS所需的信息。初次变换选择信息可包括mts_flag信息和mts_idx信息。Among the pieces of encoding decision information that can be shared between channels, the primary transform selection information may be to use a combination of one or more DCT transform kernels and/or DST transform kernels related to the horizontal direction and/or the vertical direction. Transform information required to perform the transform process on the residual signal. For example, the primary transformation selection information may be information required to use the MTS in the primary transformation. The primary transform selection information may include mts_flag information and mts_idx information.

应用于目标块的初次变换选择信息可被显式地用信号发送,或者可选地,可由编码设备1600和解码设备1700使用目标块的编码决策信息和邻近块的编码决策信息来隐式地推导出。The primary transform selection information applied to the target block may be explicitly signaled, or alternatively, may be implicitly derived by the encoding apparatus 1600 and decoding apparatus 1700 using the encoding decision information for the target block and the encoding decision information for neighboring blocks out.

在编码设备1600中完成初次变换之后,可执行二次变换,以便提高变换系数的能量集中度。After the primary transform is completed in the encoding apparatus 1600, secondary transform may be performed in order to improve the energy concentration of the transform coefficients.

应用于目标块的二次变换选择信息可被显式地用信号发送,或者可选地,可由编码设备1600和解码设备1700使用目标块的编码决策信息和邻近块的编码决策信息来隐式地推导出。解码设备1700可根据二次逆变换是否将被执行来执行二次逆变换,并且可根据初次逆变换是否将被执行来对执行二次逆变换的结果执行初次逆变换。The secondary transform selection information applied to the target block may be explicitly signaled, or alternatively, may be implicitly signaled by the encoding apparatus 1600 and the decoding apparatus 1700 using the encoding decision information of the target block and the encoding decision information of neighboring blocks. Deduced. The decoding apparatus 1700 may perform the second inverse transformation according to whether the second inverse transformation is to be performed, and may perform the primary inverse transformation on the result of performing the second inverse transformation according to whether the first inverse transformation is to be performed.

编码设备1600可生成针对目标块的rdpcm_flag信息,并且可将rdpcm_flag信息记录在比特流中。解码设备1700可通过比特流获取rdpcm_flag信息,并且可根据由rdpcm_flag信息指示的信息来执行RDPCM,或者可不执行RDPCM。The encoding apparatus 1600 may generate rdpcm_flag information for the target block, and may record the rdpcm_flag information in the bitstream. The decoding apparatus 1700 may acquire rdpcm_flag information through a bitstream, and may perform RDPCM according to the information indicated by the rdpcm_flag information, or may not perform RDPCM.

图18是根据实施例的用于对编码决策信息进行解码的方法的流程图。18 is a flowchart of a method for decoding encoding decision information, according to an embodiment.

根据实施例,当颜色空间中的特定通道被选为代表性通道以便在多个通道之间共享信息时,目标块的代表性通道的编码决策信息可由多个通道中的除了代表性通道之外的一个或更多个其余通道共享。According to an embodiment, when a specific channel in the color space is selected as the representative channel in order to share information among the plurality of channels, the encoding decision information of the representative channel of the target block can be obtained from the plurality of channels other than the representative channel One or more of the remaining channels are shared.

例如,当在YCbCr颜色空间中执行针对目标块的帧内预测时,可将Y通道设置为代表性通道,并且此后,代表性通道的帧内编码决策信息可被共享并被用于执行除了代表性通道之外的通道(即,Cb通道和/或Cr通道)的解码,而不是分别独立地发送针对颜色空间中的三个通道的多条帧内编码决策信息。可选地,在Cb通道被假设为YCbCr颜色空间中的代表性通道之后,代表性通道的帧内编码决策信息可被隐式地用于Cr通道的解码,其中,所述Cr通道是除了代表性通道之外的通道。For example, when performing intra prediction for the target block in the YCbCr color space, the Y channel may be set as the representative channel, and thereafter, the intra coding decision information of the representative channel may be shared and used to perform in addition to the representative channel Instead of sending multiple pieces of intra-coding decision information for each of the three channels in the color space independently, the decoding of channels other than the sex channel (ie, the Cb channel and/or the Cr channel). Optionally, after the Cb channel is assumed to be the representative channel in the YCbCr color space, the intra-coding decision information of the representative channel can be implicitly used for decoding the Cr channel, which is in addition to the representative channel. channel outside the sexual channel.

例如,可由其余通道共享的代表性通道的帧内编码决策信息可包括帧内预测模式、帧内预测方向、预测块边界滤波方法、用于预测块边界滤波的滤波器抽头、用于预测块边界滤波的滤波器系数、transform_skip_flag信息、初次变换选择信息、二次变换选择信息、mts_flag信息、mts_idx信息、PDPC_flag信息、rdpcm_flag信息、EMT标志信息、nsst_flag信息、nsst_idx信息、帧内平滑滤波信息、CU跳过标志信息、CU_lic标志信息、obmc_flag信息、codeAlfCtuEnable_flag信息和PDPC_flag信息中的一个或更多个。For example, intra-coding decision information of a representative channel that can be shared by the remaining channels may include intra-prediction mode, intra-prediction direction, prediction block boundary filtering method, filter taps for prediction block boundary filtering, prediction block boundary filtering Filtered filter coefficients, transform_skip_flag information, primary transform selection information, secondary transform selection information, mts_flag information, mts_idx information, PDPC_flag information, rdpcm_flag information, EMT flag information, nsst_flag information, nsst_idx information, intra frame smoothing filter information, CU skip One or more of pass flag information, CU_lic flag information, obmc_flag information, codeAlfCtuEnable_flag information, and PDPC_flag information.

从用于预测色度信号的各种技术(包括角度预测、DC预测、平面预测等)中选择跨通道预测的情况或者跨通道预测更有利的情况可表示亮度信号(即,Y信号)的属性非常类似于色度信号(即,Cb信号和/或Cr信号)的属性。The case where cross-channel prediction is selected from among various techniques for predicting chrominance signals (including angle prediction, DC prediction, planar prediction, etc.), or the case where cross-channel prediction is more favorable, may represent properties of the luma signal (ie, the Y signal) Very similar properties to chrominance signals (ie, Cb and/or Cr signals).

在这种情况下,当Y信号块的通道是代表性通道时,在对代表性通道进行编码和/或解码的处理中确定的编码决策信息可同样地应用于色度块。通过这种应用,可减少为了发送编码决策信息所需的比特数。因此,可执行编码和解码,使得经由跨通道预测将单条编码决策信息用于多个通道。In this case, when the channel of the Y signal block is the representative channel, the coding decision information determined in the process of encoding and/or decoding the representative channel can be applied equally to the chroma block. With this application, the number of bits required to transmit coding decision information can be reduced. Thus, encoding and decoding can be performed such that a single piece of encoding decision information is used for multiple channels via cross-channel prediction.

例如,在针对亮度通道的编码决策信息已经被确定之后,可在其余通道之间共享所确定的编码决策信息,并且可基于所共享的编码决策信息执行编码。可选地,多条编码决策信息可被独立地应用于三个通道而不是在通道之间共享,并且所述三个通道可被独立地编码和/或解码。编码设备1600可在用于在通道之间共享编码决策信息的方法和用于独立地对通道进行编码的方法中,将从率失真的角度来看有利的方法确定为编码方法。根据所述确定,编码设备1600可显式地将关于编码决策信息是否在通道之间共享的信息写入比特流中,并且这个信息可通过比特流被发送到解码设备1700。For example, after encoding decision information for the luma channel has been determined, the determined encoding decision information may be shared among the remaining channels, and encoding may be performed based on the shared encoding decision information. Alternatively, pieces of encoding decision information may be applied independently to the three channels rather than shared between channels, and the three channels may be encoded and/or decoded independently. The encoding apparatus 1600 may determine a method that is advantageous from the viewpoint of rate-distortion as an encoding method in a method for sharing encoding decision information between channels and a method for encoding channels independently. Based on the determination, the encoding apparatus 1600 may explicitly write information on whether encoding decision information is shared between channels into the bitstream, and this information may be sent to the decoding apparatus 1700 through the bitstream.

例如,当在编码和/或解码处理中满足特定编码条件时,可不显式地用信号发送关于编码决策信息是否在通道之间共享的信息,并且代表性通道的编码决策信息可由其余通道共享。For example, when certain encoding conditions are met in the encoding and/or decoding process, information on whether encoding decision information is shared between channels may not be explicitly signaled, and encoding decision information for a representative channel may be shared by the remaining channels.

例如,特定编码条件可以是指示跨分量预测(CCP)、DM或跨分量线性模型(CCLM)是否被使用的条件。For example, the specific coding condition may be a condition indicating whether cross-component prediction (CCP), DM or cross-component linear model (CCLM) is used.

例如,当使用已经完成编码或解码的亮度信号的原始信号、重建信号、残差信号和预测信号中的至少一个来对其余色度信号(即,Cb信号和/或Cr信号)进行预测时,可应用通道之间的编码决策信息的共享。For example, when the remaining chrominance signals (ie, the Cb signal and/or the Cr signal) are predicted using at least one of the original signal, the reconstructed signal, the residual signal, and the predicted signal of the luma signal that has been encoded or decoded, Sharing of coding decision information between channels may be applied.

例如,当使用已经完成编码或解码的色度信号的原始信号、重建信号、残差信号和预测信号中的至少一个来对亮度信号进行预测时,可应用通道之间的编码决策信息的共享。For example, when a luma signal is predicted using at least one of an original signal, a reconstructed signal, a residual signal, and a prediction signal of a chroma signal that has been encoded or decoded, sharing of encoding decision information between channels may be applied.

例如,当使用已经完成编码或解码的Cb信号的原始信号、重建信号、残差信号和预测信号中的至少一个来对Cr信号进行预测时,可应用通道之间的编码决策信息的共享。For example, when a Cr signal is predicted using at least one of an original signal, a reconstructed signal, a residual signal, and a prediction signal of the Cb signal that has completed encoding or decoding, sharing of encoding decision information between channels may be applied.

例如,当使用已经完成编码或解码的Cr信号的原始信号、重建信号、残差信号和预测信号中的至少一个来对Cb信号进行预测时,可应用通道之间的编码决策信息的共享。For example, when the Cb signal is predicted using at least one of the original, reconstructed, residual and predicted signals of the Cr signal that has been encoded or decoded, sharing of coding decision information between channels may be applied.

当亮度通道被设置为YCbCr颜色空间中的代表性通道时,可仅针对亮度信号用信号发送编码决策信息,并且可不针对其余色度通道单独发送编码决策信息,而不是向除了亮度通道之外的其余通道用信号发送编码决策信息。通过这样的选择性的发送,可提高可压缩率。When the luma channel is set as the representative channel in the YCbCr color space, the encoding decision information may be signaled only for the luma signal, and may not be signaled separately for the rest of the chroma channels, rather than to all channels other than the luma channel. The remaining channels signal encoding decision information. By such selective transmission, the compressibility ratio can be improved.

此外,可仅针对Cb信号发送编码决策信息,并且所发送的编码决策信息可被共享以用于Cr信号,因此可提高压缩率。可选地,可仅对Cr信号发送编码决策信息,并且所发送的编码决策信息可被共享以用于Cb信号,因此,可提高压缩率。Also, the encoding decision information can be transmitted only for the Cb signal, and the transmitted encoding decision information can be shared for the Cr signal, so the compression rate can be improved. Alternatively, the encoding decision information may be transmitted only for the Cr signal, and the transmitted encoding decision information may be shared for the Cb signal, and thus, the compression rate may be improved.

在步骤1810,通信单元1720可接收比特流。比特流可包括编码决策信息。At step 1810, the communication unit 1720 may receive the bitstream. The bitstream may include coding decision information.

在步骤1820,处理单元1710可确定编码决策信息的共享是否将被用于目标块的目标通道。At step 1820, the processing unit 1710 may determine whether the sharing of encoding decision information is to be used for the target channel of the target block.

当不共享编码决策信息时,可执行步骤1830。When the encoding decision information is not shared, step 1830 may be performed.

当将共享编码决策信息时,可执行步骤1840。When the encoding decision information is to be shared, step 1840 may be performed.

在步骤1830,处理单元1710可从比特流获取目标通道的编码决策信息。处理单元1710可从比特流解析和读取目标通道的编码决策信息。In step 1830, the processing unit 1710 may obtain coding decision information of the target channel from the bitstream. The processing unit 1710 may parse and read the encoding decision information of the target channel from the bitstream.

在步骤1840,处理单元1710可设置编码决策信息,使得代表性通道的编码决策信息被用作目标通道的编码决策信息。In step 1840, the processing unit 1710 may set the encoding decision information such that the encoding decision information of the representative channel is used as the encoding decision information of the target channel.

步骤1820、步骤1830和步骤1840可由以下代码1表示:Step 1820, Step 1830 and Step 1840 can be represented by the following code 1:

[代码1][Code 1]

Figure BDA0002623340100000831
Figure BDA0002623340100000831

cIdx可指示目标块的目标通道。例如,当目标图像中的通道的数量是3时,cIdx可以是预定义的特定值中的一个,并且可以是{0,1,2}中的一个。cIdx may indicate the target channel of the target block. For example, when the number of channels in the target image is 3, cIdx may be one of predefined specific values, and may be one of {0, 1, 2}.

在实施例中,代表性通道的cIdx可被假设为0。In an embodiment, the cIdx of the representative channel may be assumed to be 0.

“cidx!=0”可指示目标通道不是代表性通道(例如,亮度通道)。换句话说,cIdx值“0”可指示目标通道是代表性通道。"cidx!=0" may indicate that the target channel is not a representative channel (eg, a luma channel). In other words, a cIdx value of "0" may indicate that the target channel is the representative channel.

换句话说,在步骤1820,当目标通道不是代表性通道并且跨通道预测被用于目标块时,处理单元1710可确定针对目标通道使用编码决策信息的共享。当目标通道是代表性通道时或者当跨通道预测不被用于目标块时,处理单元1710可确定针对目标通道不使用编码决策信息的共享。In other words, at step 1820, when the target channel is not a representative channel and cross-channel prediction is used for the target block, the processing unit 1710 may determine to use sharing of coding decision information for the target channel. When the target channel is a representative channel or when cross-channel prediction is not used for the target block, the processing unit 1710 may determine that sharing of coding decision information is not used for the target channel.

可1)基于从比特流获取的信息来推导并且2)根据是否满足特定条件来隐式地推导跨通道预测是否被用于目标块。Whether cross-channel prediction is used for the target block may be 1) derived based on information obtained from the bitstream and 2) implicitly derived according to whether certain conditions are met.

如上所述,可基于目标块的帧内预测模式来确定跨通道预测是否被使用。可基于目标块的帧内预测模式是否是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个来确定跨通道预测是否被使用。例如,当目标块的帧内预测模式是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个时,处理单元1710可确定跨通道预测被使用。As described above, whether cross-channel prediction is used may be determined based on the intra prediction mode of the target block. Whether cross-channel prediction is used may be determined based on whether the intra prediction mode of the target block is one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode. For example, when the intra prediction mode of the target block is one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode, the processing unit 1710 may determine that cross-channel prediction is used.

如上所述,可基于目标块的色度通道的帧内预测模式是否具有特定值来确定跨通道预测是否被使用。例如,当目标块的色度通道的帧内预测模式具有特定值时,处理单元1710可确定跨通道预测被使用。As described above, whether or not cross-channel prediction is used may be determined based on whether the intra prediction mode of the chroma channel of the target block has a specific value. For example, when the intra prediction mode of the chroma channel of the target block has a specific value, the processing unit 1710 may determine that cross-channel prediction is used.

目标块的色度通道的帧内预测模式可由intra_chroma_pred_mode信息指示。The intra prediction mode of the chroma channel of the target block may be indicated by intra_chroma_pred_mode information.

如上所述,可基于目标通道的帧内预测模式是否是DM来确定跨通道预测是否被使用。例如,当目标通道的帧内预测模式是DM时,处理单元1710可确定跨通道预测被使用。As described above, whether cross-channel prediction is used may be determined based on whether the intra prediction mode of the target channel is DM. For example, when the intra prediction mode of the target channel is DM, the processing unit 1710 may determine that cross-channel prediction is used.

在步骤1830,处理单元1710可从比特流获取由cIdx指示的通道的块的编码决策信息。At step 1830, the processing unit 1710 may obtain coding decision information for the block of the channel indicated by cIdx from the bitstream.

处理单元1710可从比特流解析和读取由cIdx指示的通道的块的编码决策信息。The processing unit 1710 may parse and read the coding decision information of the block of the channel indicated by cIdx from the bitstream.

在步骤1840,处理单元1710可共享代表性通道的编码决策信息作为由cIdx指示的通道的块的编码决策信息。换句话说,处理单元1710可将代表性通道的编码决策信息设置为由cIdx指示的通道的块的编码决策信息。At step 1840, the processing unit 1710 may share the encoding decision information for the representative channel as the encoding decision information for the block of the channel indicated by cIdx. In other words, the processing unit 1710 may set the encoding decision information of the representative channel as the encoding decision information of the block of the channel indicated by cIdx.

根据实施例,可在步骤1820之前或者在步骤1820与1840之间另外执行与条件或执行对应的操作。Depending on the embodiment, an operation corresponding to the condition or execution may be additionally performed before step 1820 or between steps 1820 and 1840 .

根据实施例,针对Cb信号,可发送编码决策信息,并且针对Cr信号,可共享Cb信号的编码决策信息而无需发送编码决策信息。在这种情况下,可将上述代码1修改为以下代码2:According to an embodiment, for Cb signals, coding decision information may be sent, and for Cr signals, coding decision information for Cb signals may be shared without sending coding decision information. In this case, the above code 1 can be modified to the following code 2:

[代码2][Code 2]

Figure BDA0002623340100000841
Figure BDA0002623340100000841

图19是根据实施例的用于确定是否将跳过变换的解码方法的流程图。19 is a flowchart of a decoding method for determining whether a transform is to be skipped, according to an embodiment.

编码设备1600可根据目标块的尺寸来确定是否将跳过变换(例如,初次变换和/或二次变换)。The encoding apparatus 1600 may determine whether to skip transform (eg, primary transform and/or secondary transform) according to the size of the target block.

目标块可以是变换块。The target block may be a transform block.

log2TrafoSize可表示目标块的尺寸。log2TrafoSize may represent the size of the target block.

例如,当目标块的尺寸小于或等于指示块尺寸的边界值的阈值时,编码设备1600可跳过目标块的变换。For example, when the size of the target block is less than or equal to a threshold value indicating a boundary value of the block size, the encoding apparatus 1600 may skip transformation of the target block.

Log2MaxTransformSkipSize可表示指示块尺寸的边界值的阈值。Log2MaxTransformSkipSize may represent a threshold value indicating the boundary value of the block size.

当跳过目标块的变换时,编码设备1600可将transform_skip_flag信息的值设置为1而不执行变换。transform_skip_flag信息可通过比特流被发送到解码设备1700。When the transformation of the target block is skipped, the encoding apparatus 1600 may set the value of transform_skip_flag information to 1 without performing transformation. transform_skip_flag information may be transmitted to the decoding apparatus 1700 through a bitstream.

此外,当执行目标块的变换时,编码设备1600可执行变换,并且可将transform_skip_flag信息的值设置为0。transform_skip_flag信息可通过比特流被发送到解码设备1700。Also, when performing transformation of the target block, the encoding apparatus 1600 may perform transformation, and may set the value of transform_skip_flag information to 0. transform_skip_flag information may be transmitted to the decoding apparatus 1700 through a bitstream.

这里,可针对构成图像的颜色空间的通道单独发送多条transform_skip_flag信息。Here, a plurality of pieces of transform_skip_flag information may be individually transmitted for channels constituting a color space of an image.

解码设备1700可从比特流获取transform_skip_flag信息的值。换句话说,解码设备1700可从比特流解析和读取transform_skip_flag信息。The decoding apparatus 1700 may acquire the value of transform_skip_flag information from the bitstream. In other words, the decoding apparatus 1700 may parse and read transform_skip_flag information from the bitstream.

这里,仅当块的尺寸小于或等于指示块尺寸的边界值的阈值时,解码设备1700才可从比特流获取transform_skip_flag信息的值。Here, the decoding apparatus 1700 may acquire the value of the transform_skip_flag information from the bitstream only when the size of the block is less than or equal to the threshold value indicating the boundary value of the block size.

此外,解码设备1700可获取针对图像的多个通道的多条transform_skip_flag信息。Also, the decoding apparatus 1700 may acquire pieces of transform_skip_flag information for multiple channels of an image.

transform_skip_flag信息的获取可由以下代码3表示:The acquisition of transform_skip_flag information can be represented by the following code 3:

[代码3][Code 3]

If(log2TrafoSize<=Log2MaxTransformSkipSize)If(log2TrafoSize<=Log2MaxTransformSkipSize)

transform_skip_flag[x0][y0][cIdx]transform_skip_flag[x0][y0][cIdx]

x0和y0可表示指示目标块的位置的空间坐标。x0 and y0 may represent spatial coordinates indicating the location of the target block.

cIdx可指示目标块信息的目标通道。cIdx may indicate the target channel of the target block information.

当存在三个图像通道时,cIdx可具有预定义值{0,1,2}中的一个。代表性通道的值可以是0。When there are three image channels, cIdx may have one of the predefined values {0, 1, 2}. The value of the representative channel can be 0.

代码3可被修改为以下代码4:Code 3 can be modified to code 4 below:

[代码4][Code 4]

If((log2TbWidth<=Log2MaxTransformSkipSize_W)&&(log2TbHeight<=Log2MaxTransformSkipSize_H))If((log2TbWidth<=Log2MaxTransformSkipSize_W)&&(log2TbHeight<=Log2MaxTransformSkipSize_H))

transform_skip_flag[x0][y0][cIdx]transform_skip_flag[x0][y0][cIdx]

log2TbWidth可以是基于以下方程式11的值。“width”可以是目标块的宽度(即,目标块的水平长度)。log2TbWidth may be a value based on Equation 11 below. "width" may be the width of the target block (ie, the horizontal length of the target block).

[方程式11][Equation 11]

log2TbWidth=log2widthlog2TbWidth= log2width

log2TbHeight可具有基于以下方程式12的值。“height”可以是目标块的高度(即,目标块的垂直长度)。log2TbHeight may have a value based on Equation 12 below. "height" may be the height of the target block (ie, the vertical length of the target block).

[方程式12][Equation 12]

log2TbHeight=log2heightlog2TbHeight=log 2 height

预定义阈值Log2MaxTransformSkipSize_W和Log2MaxTransformSkipSize_H可彼此相等或者可彼此不同。例如,Log2MaxTransformSkipSize_W的值可以是2,并且Log2MaxTransformSkipSize_H的值可以是2。The predefined thresholds Log2MaxTransformSkipSize_W and Log2MaxTransformSkipSize_H may be equal to each other or may be different from each other. For example, the value of Log2MaxTransformSkipSize_W may be 2, and the value of Log2MaxTransformSkipSize_H may be 2.

代码3可被修改为以下代码5:Code 3 can be modified to code 5 below:

[代码5][Code 5]

If((log2TbWidth<=2)&&(log2TbHeight<=2))If((log2TbWidth<=2)&&(log2TbHeight<=2))

transform_skip_flag[x0][y0][cIdx]transform_skip_flag[x0][y0][cIdx]

如上所述,代替针对多个通道单独用信号发送多条transform_skip_flag信息,可仅针对亮度(Y)信号用信号发送transform_skip_flag信息,并且可不针对其余色度通道单独用信号发送transform_skip_flag信息。可选地,可仅针对Cb信号用信号发送transform_skip_flag信息,并且可不针对Cr信号单独用信号发送transform_skip_flag信息,并且针对Cb信号发送的transform_skip_flag信息可被共享以用于Cr信号。As described above, instead of individually signaling multiple pieces of transform_skip_flag information for multiple channels, transform_skip_flag information may be signaled only for luma (Y) signals, and transform_skip_flag information may not be individually signaled for the remaining chroma channels. Alternatively, the transform_skip_flag information may be signaled only for the Cb signal, and the transform_skip_flag information may not be separately signaled for the Cr signal, and the transform_skip_flag information signaled for the Cb signal may be shared for the Cr signal.

下面,将描述共享transform_skip_flag信息的实施例。Next, an embodiment of sharing transform_skip_flag information will be described.

在实施例中,解码设备1700可从比特流获取transform_skip_flag信息。这种获取可由以下代码6表示:In an embodiment, the decoding apparatus 1700 may obtain transform_skip_flag information from the bitstream. This acquisition can be represented by the following code 6:

[代码6][Code 6]

Figure BDA0002623340100000861
Figure BDA0002623340100000861

Figure BDA0002623340100000871
Figure BDA0002623340100000871

x0和y0可以是指示目标块的位置的空间坐标。x0 and y0 may be spatial coordinates indicating the location of the target block.

cIdx可指示目标块的目标通道。cIdx may indicate the target channel of the target block.

在代码6以及包括条件“if(log2TrafoSize<=Log2MaxTransformSkipSize)”的其它代码中,可用条件“if((log2TbWidth<=Log2MaxTransformSkipSize_W)&&(log2TbHeight<=Log2MaxTransformSkipSize_H))”或条件“if((log2TbWidth<=2)&&(log2TbHeight<=2))”代替条件“if(log2TrafoSize<=Log2MaxTransformSkipSize)”。In code 6 and other codes including the condition "if(log2TrafoSize<=Log2MaxTransformSkipSize)", the condition "if((log2TbWidth<=Log2MaxTransformSkipSize_W)&&(log2TbHeight<=Log2MaxTransformSkipSize_H))" or the condition "if((log2TbWidth<=2 )&&(log2TbHeight<=2))" instead of the condition "if(log2TrafoSize<=Log2MaxTransformSkipSize)".

当图像的通道的数量是3时,cIdx可具有预定义值{0,1,2}中的一个。代表性通道的值可以是0。可选地,代表性通道的值可以是1或2。When the number of channels of an image is 3, cIdx may have one of the predefined values {0, 1, 2}. The value of the representative channel can be 0. Alternatively, the value of the representative channel may be 1 or 2.

在步骤1910,通信单元1720可接收比特流。At step 1910, the communication unit 1720 may receive the bitstream.

在步骤1920,处理单元1710可确定是否可能跳过针对目标块的变换。At step 1920, the processing unit 1710 may determine whether it is possible to skip the transform for the target block.

如果确定可能跳过变换,则可执行步骤1930。If it is determined that the transformation may be skipped, step 1930 may be performed.

如果确定不可能跳过变换,则可执行步骤1960。If it is determined that it is not possible to skip the transformation, step 1960 may be performed.

例如,当目标块的尺寸小于或等于特定尺寸时,处理单元1710可确定不可能跳过变换。For example, when the size of the target block is less than or equal to a certain size, the processing unit 1710 may determine that it is not possible to skip the transform.

例如,当目标块的尺寸大于特定尺寸时,处理单元1710可确定不可能跳过变换。For example, when the size of the target block is larger than a certain size, the processing unit 1710 may determine that it is not possible to skip the transform.

这里,特定尺寸可以是允许跳过变换的块尺寸的边界值。Here, the specific size may be a boundary value of a block size that allows skipping of transforms.

例如,当满足以下代码7中的条件时(即,当代码7中的条件的结果为真时),处理单元1710可确定不可能跳过变换,而当不满足以下代码7中的条件时(即,当代码7中的条件的结果为假时),处理单元1710可确定可能跳过变换。For example, the processing unit 1710 may determine that it is not possible to skip the transformation when the conditions in Code 7 below are satisfied (ie, when the results of the conditions in Code 7 below are true), and when the conditions in Code 7 below are not satisfied ( That is, when the result of the condition in Code 7 is false), processing unit 1710 may determine that the transform may be skipped.

[代码7][Code 7]

if(log2TrafoSize<=Log2MaxTransformSkipSize)if(log2TrafoSize<=Log2MaxTransformSkipSize)

在步骤1930,处理单元1710可确定与目标块的目标通道的transform_skip_flag信息的共享是否将被使用。At step 1930, the processing unit 1710 may determine whether sharing of transform_skip_flag information with the target channel of the target block is to be used.

如果确定将不共享transform_skip_flag信息,则可执行步骤1940。If it is determined that transform_skip_flag information will not be shared, step 1940 may be performed.

如果确定将共享transform_skip_flag信息,则可执行步骤1950。If it is determined that transform_skip_flag information is to be shared, step 1950 may be performed.

例如,当满足以下代码8中的条件时(即,当代码8中的条件的结果为真时),处理单元1710可确定将共享transform_skip_flag信息,而当不满足以下代码8中的条件时(即,当代码8中的条件的结果为假时),处理单元1710可确定将不共享transform_skip_flag信息。For example, the processing unit 1710 may determine that transform_skip_flag information will be shared when the conditions in Code 8 below are satisfied (ie, when the result of the conditions in Code 8 is true), and when the conditions in Code 8 below are not satisfied (ie, when the results of the conditions in Code 8 below are true) , when the result of the condition in code 8 is false), processing unit 1710 may determine that transform_skip_flag information will not be shared.

[代码8][Code 8]

if((cIdx!=0)&&(跨通道预测被使用))if((cIdx != 0) && (cross-channel prediction is used))

换句话说,在步骤1930,当目标通道不是代表性通道并且跨通道预测被用于目标块时,处理单元1710可确定将与目标通道共享transform_skip_flag信息。相反,当目标通道是代表性通道时或者当跨通道预测不被用于目标块时,处理单元1710可确定将不共享transform_skip_flag信息。In other words, at step 1930, when the target channel is not a representative channel and cross-channel prediction is used for the target block, the processing unit 1710 may determine that transform_skip_flag information is to be shared with the target channel. Conversely, when the target channel is a representative channel or when cross-channel prediction is not used for the target block, the processing unit 1710 may determine that transform_skip_flag information will not be shared.

在步骤1940,处理单元1710可从比特流获取目标通道的transform_skip_flag信息。处理单元1710可从比特流解析和读取目标通道的transform_skip_flag信息。In step 1940, the processing unit 1710 may obtain the transform_skip_flag information of the target channel from the bitstream. The processing unit 1710 may parse and read the transform_skip_flag information of the target channel from the bitstream.

transform_skip_flag信息可被存储在transform_skip_flag[x0][y0][cIdx]中。transform_skip_flag information may be stored in transform_skip_flag[x0][y0][cIdx].

在步骤1950,处理单元1710可设置transform_skip_flag信息,使得代表性通道的transform_skip_flag信息被用作目标通道的transform_skip_flag信息。In step 1950, the processing unit 1710 may set the transform_skip_flag information such that the transform_skip_flag information of the representative channel is used as the transform_skip_flag information of the target channel.

处理单元1710可将代表性通道的transform_skip_flag信息用作目标通道的transform_skip_flag信息,而不从比特流解析和读取目标通道的transform_skip_flag信息。换句话说,处理单元1710可将transform_skip_flag[x0][y0][0]的值存储在transform_skip_flag[x0][y0][cIdx]中。The processing unit 1710 may use the transform_skip_flag information of the representative channel as the transform_skip_flag information of the target channel without parsing and reading the transform_skip_flag information of the target channel from the bitstream. In other words, the processing unit 1710 may store the value of transform_skip_flag[x0][y0][0] in transform_skip_flag[x0][y0][cIdx].

也就是说,在不需要用于从比特流解析和读取目标通道的transform_skip_flag信息的过程的情况下,先前存储在transform_skip_flag[x0][y0][0]中的值可同样地在transform_skip_flag[x0][y0][cIdx]中被使用。That is, in the case where the process for parsing and reading the transform_skip_flag information of the target channel from the bitstream is not required, the value previously stored in transform_skip_flag[x0][y0][0] can likewise be stored in transform_skip_flag[x0 ][y0][cIdx] is used.

例如,针对作为代表性通道的亮度(Y)通道用信号发送的transform_skip_flag信息甚至可被用于色度通道(Cb和/或Cr)。For example, the transform_skip_flag information signaled for the luma (Y) channel as the representative channel may even be used for the chroma channels (Cb and/or Cr).

根据实施例,针对Cb信号,可发送transform_skip_flag信息,并且针对Cr信号,可在不发送transform_skip_flag信息的情况下共享针对Cb信号的transform_skip_flag信息。在这种情况下,上述代码6可被修改为以下代码9:According to an embodiment, for the Cb signal, transform_skip_flag information may be transmitted, and for the Cr signal, the transform_skip_flag information for the Cb signal may be shared without transmitting the transform_skip_flag information. In this case, the above code 6 can be modified to the following code 9:

[代码9][Code 9]

Figure BDA0002623340100000891
Figure BDA0002623340100000891

当不可能跳过针对目标块的变换时,可执行步骤1960。When it is not possible to skip the transform for the target block, step 1960 may be performed.

在步骤1960,可设置指示针对目标块将不跳过变换的信息。因为不允许跳过针对目标块的变换,所以transform_skip_flag[x0][y0][cIdx]的值可被设置为0。At step 1960, information indicating that the transform will not be skipped for the target block may be set. The value of transform_skip_flag[x0][y0][cIdx] may be set to 0 because it is not allowed to skip the transform for the target block.

图20是根据实施例的用于根据帧内模式确定是否将跳过变换的解码方法的流程图。20 is a flowchart of a decoding method for determining whether a transform is to be skipped according to an intra mode, according to an embodiment.

在图像的亮度通道(即,Y)通道与色度通道(即,Cb和/或Cr)之间可能存在相当大的相关性。例如,亮度通道可包括关于图像的纹理的大量信息,并且作为色度通道的Cb通道和Cr通道可另外提供将被添加到纹理的颜色信息。There may be considerable correlation between the luminance channel (ie, Y) and the chrominance channel (ie, Cb and/or Cr) of an image. For example, the luma channel may contain a lot of information about the texture of the image, and the Cb channel and the Cr channel, which are chroma channels, may additionally provide color information to be added to the texture.

因此,当执行图像的压缩和重建所需的预测时,可计算针对Cb块和Cr块的预测值,而不针对颜色空间的三个通道分别执行独立的预测,其中,根据先前通过解码获取的亮度通道的信号针对所述Cb块和Cr块执行预测。Therefore, when performing prediction required for compression and reconstruction of an image, the prediction values for the Cb block and the Cr block can be calculated without performing independent prediction for each of the three channels of the color space, which is based on the previously obtained through decoding The signal of the luma channel performs prediction for the Cb and Cr blocks.

用于计算这些预测值的技术可被称为“跨通道预测(CCP)”或上述“CCLM”。The technique used to calculate these predicted values may be referred to as "Cross-Channel Prediction (CCP)" or "CCLM" above.

解码设备1700可通过检查目标块的帧内预测模式是否是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个来确定跨通道预测是否已经被使用。The decoding apparatus 1700 may determine whether cross-channel prediction has been used by checking whether the intra prediction mode of the target block is one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode.

由于色度信号的纹理信息中的相当大的一部分也被包括在亮度信号中,因此这样的跨通道预测可能有效。类似地,可使用跨通道预测根据Cb通道的信号来计算针对作为预测的目标的Cr块的预测值。Such cross-channel prediction may be effective since a substantial portion of the texture information of the chroma signal is also included in the luma signal. Similarly, a prediction value for a Cr block targeted for prediction may be calculated from the signal of the Cb channel using cross-channel prediction.

从包括角度预测、DC预测和平面预测的各种技术中选择跨通道预测以用于色度信号的预测的情况或者跨通道预测有利的情况可表示与SelectedCIDX对应的通道的信号特性非常类似于另外通道的信号特性。The case where cross-channel prediction is selected for prediction of chrominance signals, or the case where cross-channel prediction is favorable, from various techniques including angle prediction, DC prediction, and planar prediction may indicate that the signal characteristics of the channel corresponding to SelectedCIDX are very similar to that of another Signal characteristics of the channel.

因此,当跳过(或执行)针对与SelectedCIDX对应的通道的块的变换有利时,也跳过(或执行)针对其余通道的块的变换可同样有利。Therefore, when it is advantageous to skip (or perform) the transform for the block for the channel corresponding to SelectedCIDX, it may be equally advantageous to also skip (or perform) the transform for the blocks of the remaining channels.

因此,当使用跨通道预测时,可不针对三个通道分别执行对比特流的解析以便获取transform_skip_flag信息。当代表性通道的transform_skip_flag信息被解析出时,可不单独解析其余通道的transform_skip_flag信息。代表性通道的transform_skip_flag信息可被共享并被用作其余通道的transform_skip_flag信息,并且指示这样的共享的信息可被记录在比特流中。例如,对于这样的共享,可使用与SelectedCIDX对应的通道来确定是否将跳过变换。Therefore, when cross-channel prediction is used, the parsing of the bitstream may not be performed separately for the three channels in order to obtain transform_skip_flag information. When the transform_skip_flag information of the representative channel is parsed out, the transform_skip_flag information of the remaining channels may not be parsed separately. The transform_skip_flag information of the representative channel may be shared and used as the transform_skip_flag information of the remaining channels, and information indicating such sharing may be recorded in the bitstream. For example, for such sharing, the channel corresponding to SelectedCIDX may be used to determine whether the transform is to be skipped.

可选地,可对针对三个通道同样地跳过变换的情况来计算率失真值,并且可对针对三个通道同样地执行变换的情况来计算率失真值。可将当跳过变换时计算出的率失真值与当执行变换时计算出的率失真值彼此进行比较,并且可基于比较的结果使用用于跳过变换的方案与用于执行变换的方案之间的更有利的方案来执行通道的编码。Alternatively, rate-distortion values may be calculated for cases where transforms are skipped equally for three channels, and rate-distortion values may be calculated for cases where transforms are performed equally for three channels. The rate-distortion value calculated when the transform is skipped and the rate-distortion value calculated when the transform is performed may be compared with each other, and one of the scheme for skipping the transform and the scheme for performing the transform may be used based on the result of the comparison. between the more advantageous schemes to perform the encoding of the channel.

在实施例中,代替针对多个通道用信号发送多条transform_skip_flag信息,可仅针对与SelectedCIDX对应的通道用信号发送transform_skip_flag信息,并且可不针对其余通道单独用信号发送transform_skip_flag信息。In an embodiment, instead of signaling multiple pieces of transform_skip_flag information for multiple lanes, transform_skip_flag information may be signaled only for lanes corresponding to SelectedCIDX, and transform_skip_flag information may not be separately signaled for the remaining lanes.

下面,将描述这样的transform_skip_flag信息被共享的实施例。Next, an embodiment in which such transform_skip_flag information is shared will be described.

在实施例中,解码设备1700可从比特流获取transform_skip_flag信息。这样的获取可由以下代码10表示:In an embodiment, the decoding apparatus 1700 may obtain transform_skip_flag information from the bitstream. Such acquisition can be represented by the following code 10:

[代码10][Code 10]

Figure BDA0002623340100000901
Figure BDA0002623340100000901

x0和y0可以是指示目标块的位置的空间坐标。x0 and y0 may be spatial coordinates indicating the location of the target block.

cIdx可指示目标块的目标通道。cIdx may indicate the target channel of the target block.

当图像中的通道的数量是3时,代码10中的cIdx的值可以是值{0,1,2}中的一个。例如,cIdx的值可以是预定义值{0,1,2}中的一个。When the number of channels in the image is 3, the value of cIdx in code 10 may be one of the values {0, 1, 2}. For example, the value of cIdx may be one of the predefined values {0,1,2}.

在步骤2010,通信单元1720可接收比特流。At step 2010, the communication unit 1720 may receive the bitstream.

在步骤2020,处理单元1710可确定是否可能跳过针对目标块的变换。At step 2020, the processing unit 1710 may determine whether it is possible to skip the transform for the target block.

当可能跳过变换时,可执行步骤2030。When it is possible to skip the transformation, step 2030 may be performed.

当不可能跳过变换时,可执行步骤2060。When it is not possible to skip the transformation, step 2060 may be performed.

例如,当目标块的尺寸小于或等于特定尺寸时,处理单元1710可确定不可能跳过变换。For example, when the size of the target block is less than or equal to a certain size, the processing unit 1710 may determine that it is not possible to skip the transform.

例如,当目标块的尺寸大于特定尺寸时,处理单元1710可确定不可能跳过变换。For example, when the size of the target block is larger than a certain size, the processing unit 1710 may determine that it is not possible to skip the transform.

这里,所述特定尺寸可以是允许跳过变换的块尺寸的边界值。Here, the specific size may be a boundary value of a block size that allows skipping of transforms.

例如,当满足以下代码7中的条件时(即,当代码7中的条件的结果为真时),处理单元1710可确定不可能跳过变换,而当不满足以下代码7中的条件时(即,当代码7中的条件的结果为假时),处理单元1710可确定可能跳过变换。For example, the processing unit 1710 may determine that it is not possible to skip the transformation when the conditions in Code 7 below are satisfied (ie, when the results of the conditions in Code 7 below are true), and when the conditions in Code 7 below are not satisfied ( That is, when the result of the condition in Code 7 is false), processing unit 1710 may determine that the transform may be skipped.

[代码11][Code 11]

if(log2TrafoSize<=Log2MaxTransformSkipSize)if(log2TrafoSize<=Log2MaxTransformSkipSize)

在步骤2030,处理单元1710可基于所选择的代表性通道来确定是否将使用与目标块的目标通道的transform_skip_flag信息的共享。At step 2030, the processing unit 1710 may determine whether to use sharing of transform_skip_flag information with the target channel of the target block based on the selected representative channel.

如果确定将不共享transform_skip_flag信息,则可执行步骤2040。If it is determined that the transform_skip_flag information will not be shared, step 2040 may be performed.

如果确定将共享transform_skip_flag信息,则可执行步骤2050。If it is determined that transform_skip_flag information is to be shared, step 2050 may be performed.

例如,当满足以下代码8中的条件时(即,当代码8中的条件的结果为真时),处理单元1710可确定将共享transform_skip_flag信息,而当不满足以下代码8中的条件时(即,当代码8中的条件的结果为假时),处理单元1710可确定将不共享transform_skip_flag信息。For example, the processing unit 1710 may determine that transform_skip_flag information will be shared when the conditions in Code 8 below are satisfied (ie, when the result of the conditions in Code 8 is true), and when the conditions in Code 8 below are not satisfied (ie, when the results of the conditions in Code 8 below are true) , when the result of the condition in code 8 is false), processing unit 1710 may determine that transform_skip_flag information will not be shared.

[代码12][Code 12]

if((cIdx!=SelectedCIDX)&&"跨通道预测被使用")if((cIdx !=SelectedCIDX) && "Cross-channel prediction is used")

换句话说,在步骤2030,当目标通道不是由SelectedCIDX指示的所选择的代表性通道并且跨通道预测被用于目标块时,处理单元1710可确定与目标通道共享transform_skip_flag信息。此外,当目标通道是由SelectedCIDX指示的所选择的代表性通道时或者当跨通道预测不被用于目标块时,处理单元1710可确定不共享transform_skip_flag信息。In other words, in step 2030, when the target channel is not the selected representative channel indicated by SelectedCIDX and cross-channel prediction is used for the target block, the processing unit 1710 may determine to share transform_skip_flag information with the target channel. Furthermore, when the target channel is the selected representative channel indicated by SelectedCIDX or when cross-channel prediction is not used for the target block, the processing unit 1710 may determine that transform_skip_flag information is not shared.

在步骤2040,处理单元1710可从比特流获取目标通道的transform_skip_flag信息。处理单元1710可从比特流解析和读取目标通道的transform_skip_flag信息。In step 2040, the processing unit 1710 may obtain the transform_skip_flag information of the target channel from the bitstream. The processing unit 1710 may parse and read the transform_skip_flag information of the target channel from the bitstream.

transform_skip_flag信息可被存储在transform_skip_flag[x0][y0][cIdx]中。transform_skip_flag information may be stored in transform_skip_flag[x0][y0][cIdx].

在步骤2050,处理单元1710可设置transform_skip_flag信息,使得由SelectedCIDX指示的所选择的代表性通道的transform_skip_flag信息被用作目标通道的transform_skip_flag信息。In step 2050, the processing unit 1710 may set the transform_skip_flag information such that the transform_skip_flag information of the selected representative channel indicated by SelectedCIDX is used as the transform_skip_flag information of the target channel.

处理单元1710可将由SelectedCIDX指示的所选择的代表性通道的transform_skip_flag信息用作目标通道的transform_skip_flag信息,而不从比特流解析和读取目标通道的transform_skip_flag信息。换句话说,处理单元1710可将transform_skip_flag[x0][y0][SelectedCIDX]的值存储在transform_skip_flag[x0][y0][cIdx]中。The processing unit 1710 may use the transform_skip_flag information of the selected representative channel indicated by SelectedCIDX as the transform_skip_flag information of the target channel without parsing and reading the transform_skip_flag information of the target channel from the bitstream. In other words, the processing unit 1710 may store the value of transform_skip_flag[x0][y0][SelectedCIDX] in transform_skip_flag[x0][y0][cIdx].

也就是说,先前存储在transform_skip_flag[x0][y0][SelectedCIDX]中的值也可同样地在transform_skip_flag[x0][y0][cIdx]中被使用,而不需要用于从比特流解析和读取目标通道的transform_skip_flag信息的过程。That is, the value previously stored in transform_skip_flag[x0][y0][SelectedCIDX] can likewise be used in transform_skip_flag[x0][y0][cIdx] without needing to be used for parsing and reading from the bitstream The process of taking the transform_skip_flag information of the target channel.

在步骤2060,可设置指示针对目标块的目标通道将不跳过变换的信息。因为不允许跳过针对目标块的变换,所以transform_skip_flag[x0][y0][cIdx]的值可被设置为0。In step 2060, information indicating that the target channel for the target block will not skip transforms may be set. The value of transform_skip_flag[x0][y0][cIdx] may be set to 0 because it is not allowed to skip the transform for the target block.

换句话说,对于由cIdx指示的目标块的目标通道,可在transform_skip_flag信息中设置预定义值0以便指示针对目标通道将不跳过变换,而不从比特流解析和读取指示是否将跳过变换的transform_skip_flag信息。In other words, for the target channel of the target block indicated by cIdx, a predefined value of 0 may be set in the transform_skip_flag information to indicate that the transform will not be skipped for the target channel, without parsing and reading from the bitstream indicating whether the skip will be skipped Transform_skip_flag information for transform.

如上所述,在上述实施例中,所选择的代表性通道的编码决策信息可与除了所选择的代表性通道之外的所有通道共享。As described above, in the above-described embodiments, the encoding decision information of the selected representative channel may be shared with all channels except the selected representative channel.

上述实施例可被部分地修改。换句话说,所选择的代表性通道的编码决策信息可被共享作为另一指定通道的编码决策信息。例如,编码决策信息可包括transform_skip_flag信息。The above-described embodiments may be partially modified. In other words, the encoding decision information of the selected representative channel can be shared as the encoding decision information of another designated channel. For example, the encoding decision information may include transform_skip_flag information.

例如,当SelectedCIDX的值为1时,cIDX值1指示Cb信号,并且cIDX值2指示Cr信号,针对Cb信号的编码决策信息可被共享作为Cr信号的编码决策信息。For example, when the value of SelectedCIDX is 1, a cIDX value of 1 indicates a Cb signal, and a cIDX value of 2 indicates a Cr signal, coding decision information for the Cb signal may be shared as coding decision information for the Cr signal.

换句话说,可从编码设备1600向解码设备1700发送针对Cb信号的编码决定信息,并且可使用针对Cb信号的编码决策信息来设置针对Cr信号的编码决策信息,而不单独发送针对Cr信号的编码决策信息。In other words, the encoding decision information for the Cb signal may be transmitted from the encoding apparatus 1600 to the decoding apparatus 1700, and the encoding decision information for the Cr signal may be set using the encoding decision information for the Cb signal without separately transmitting the encoding decision information for the Cr signal Encoding decision information.

当针对Cb信号的编码决策信息被共享作为针对Cr信号的编码决策信息时,上述代码10可被修改为以下代码13:When the coding decision information for the Cb signal is shared as the coding decision information for the Cr signal, the above code 10 can be modified to the following code 13:

[代码13][Code 13]

Figure BDA0002623340100000931
Figure BDA0002623340100000931

在步骤2030,当满足以下代码8中的条件时(即,当代码8中的条件的结果为真时),处理单元1710可确定将共享transform_skip_flag信息,而当不满足以下代码8中的条件时(即,当代码8中的条件的结果为假时),处理单元1710可确定将不共享transform_skip_flag信息。At step 2030, the processing unit 1710 may determine that transform_skip_flag information will be shared when the conditions in Code 8 below are satisfied (ie, when the results of the conditions in Code 8 below are true), and when the conditions in Code 8 below are not satisfied (ie, when the result of the condition in Code 8 is false), processing unit 1710 may determine that transform_skip_flag information will not be shared.

[代码14][Code 14]

if((cIdx!=2)or(!"跨通道预测被使用"))if((cIdx != 2)or(!"Cross-channel prediction is used"))

换句话说,在步骤2030,1)当目标通道不是共享由SelectedCIDX指示的所选择的代表性通道的编码决策信息的通道时,或者2)当跨通道预测不被用于目标块时,处理单元1710可确定不与目标通道共享transform_skip_flag信息。此外,1)当目标通道是共享由SelectedCIDX指示的所选择的代表性通道的编码决策信息的通道时,并且2)当跨通道预测被用于目标块时,处理单元1710可确定共享transform_skip_flag信息。In other words, at step 2030, 1) when the target channel is not a channel that shares the coding decision information of the selected representative channel indicated by SelectedCIDX, or 2) when cross-channel prediction is not used for the target block, the processing unit 1710 may determine not to share transform_skip_flag information with the target channel. Furthermore, 1) when the target channel is a channel that shares encoding decision information for the selected representative channel indicated by SelectedCIDX, and 2) when cross-channel prediction is used for the target block, the processing unit 1710 may determine to share transform_skip_flag information.

代码13中的步骤可被实现为保持相同含义的其他步骤。例如,代码13可被修改为以下代码15:The steps in Code 13 can be implemented as other steps that maintain the same meaning. For example, code 13 can be modified to code 15 as follows:

[代码15][Code 15]

Figure BDA0002623340100000941
Figure BDA0002623340100000941

变换选择信息的共享Sharing of transformation selection information

图21是根据实施例的用于共享变换选择信息的方法的流程图。21 is a flowchart of a method for sharing transform selection information, according to an embodiment.

在上述实施例中,transform_skip_flag信息已经被描述为将被共享的编码决策信息。上述实施例中的transform_skip_flag信息可被替换为另一种类型的编码决策信息。下面,将变换选择信息描述为将被共享的编码决策信息。In the above-described embodiments, the transform_skip_flag information has been described as encoding decision information to be shared. The transform_skip_flag information in the above embodiment may be replaced with another type of encoding decision information. In the following, transform selection information is described as encoding decision information to be shared.

变换选择信息可以是指示哪种变换将被用于目标通道的变换块的信息。变换选择信息可包括上述初次变换选择信息和/或二次变换选择信息。The transform selection information may be information indicating which transform is to be used for the transform block of the target channel. The transformation selection information may include the above-mentioned primary transformation selection information and/or secondary transformation selection information.

在图像的亮度通道(即,Y)与色度通道(即,Cb和/或Cr)之间可能存在相当大的相关性。例如,亮度通道可包括关于图像的纹理的大量信息,并且作为色度通道的Cb通道和Cr通道可另外提供将被添加到纹理的颜色信息。There may be considerable correlation between the luminance channel (ie, Y) and the chrominance channel (ie, Cb and/or Cr) of an image. For example, the luma channel may contain a lot of information about the texture of the image, and the Cb channel and the Cr channel, which are chroma channels, may additionally provide color information to be added to the texture.

因此,当执行图像的压缩和重建所需的预测时,可计算针对Cb块和Cr块的预测值,而不针对颜色空间的三个通道分别执行独立的预测,其中,根据先前通过解码获取的亮度通道的信号针对所述Cb块和Cr块执行预测。由于色度信号的相当大量的纹理信息可被包括在亮度信号中,因此这样的跨通道预测可能有效。Therefore, when performing prediction required for compression and reconstruction of an image, the prediction values for the Cb block and the Cr block can be calculated without performing independent prediction for each of the three channels of the color space, which is based on the previously obtained through decoding The signal of the luma channel performs prediction for the Cb and Cr blocks. Such cross-channel prediction may be effective since a considerable amount of texture information of the chroma signal can be included in the luma signal.

从用于预测色度信号的各种技术(包括角度预测、DC预测、平面预测等)中选择跨通道预测的情况或者跨通道预测更有利的情况可表示亮度通道的信号属性非常类似于色度通道(即,Cb和/或Cr)的信号属性。The case where cross-channel prediction is chosen from among various techniques for predicting chroma signals (including angular prediction, DC prediction, planar prediction, etc.), or the case where cross-channel prediction is more favorable, may indicate that the signal properties of the luma channel are very similar to those of chroma Signal properties of the channel (ie, Cb and/or Cr).

因此,当针对亮度块使用多个变换中的特定变换是有利的时,针对色度块(即,Cb块和/或Cr块)使用相同的变换可能是有利的。Thus, while it may be advantageous to use a particular one of multiple transforms for a luma block, it may be advantageous to use the same transform for a chroma block (ie, a Cb block and/or a Cr block).

换句话说,通常,可将相同的变换用于亮度块和色度块。可选地,当特定变换被用于亮度块时,与用于亮度块的特定变换对应的另外的特定变换可被用于色度块。In other words, in general, the same transform can be used for luma blocks and chroma blocks. Alternatively, when a specific transform is used for luma blocks, another specific transform corresponding to the specific transform used for luma blocks may be used for chroma blocks.

因此,当使用跨通道预测时,如果确定了一个变换,则所确定的变换可同样地被用于亮度通道和色度通道(即,三个通道),而不分别用信号发送用于亮度通道和色度通道的变换。Thus, when using cross-channel prediction, if a transform is determined, the determined transform can be equally used for the luma and chroma channels (ie, the three channels) without being separately signaled for the luma channel and chroma channel transformation.

可选地,如果针对亮度通道确定了一个变换,则与针对亮度通道确定的变换对应的变换可被用于色度通道。Alternatively, if one transform is determined for the luma channel, a transform corresponding to the transform determined for the luma channel may be used for the chroma channel.

当以这种方式确定了将被用于通道的变换时,可基于所述确定来对亮度通道和色度通道进行编码。对于这样的编码,可仅针对亮度通道选择多个可用变换中的一个,并且选择针对亮度通道的所述一个变换,因此可自动确定针对色度通道的变换。When the transform to be used for the channel is determined in this way, the luma channel and the chroma channel may be encoded based on the determination. For such encoding, only one of the available transforms can be selected for the luma channel, and the one transform for the luma channel is selected, so the transform for the chroma channel can be automatically determined.

可选地,可对三个通道执行使用相同变换的编码。借助于这种编码,可针对多个变换计算率失真值。此后,通过可应用的变换的率失真值之间的比较,可选择具有最有利的率失真值的变换,并且可根据所选择的变换来执行编码。Alternatively, encoding using the same transform can be performed on the three channels. With this encoding, rate-distortion values can be calculated for multiple transforms. Thereafter, through a comparison between the rate-distortion values of the applicable transforms, the transform with the most favorable rate-distortion value can be selected, and encoding can be performed according to the selected transform.

可选地,可对三个通道执行使用变换集的编码。变换集可包括用于亮度通道的特定变换以及用于色度通道且与该特定变换对应的变换。Optionally, encoding using transform sets may be performed on three channels. A transform set may include a specific transform for the luma channel and a transform for the chroma channel that corresponds to the specific transform.

借助于这种编码,可针对多个变换集计算率失真值。此后,通过可应用的变换集的率失真值之间的比较,可选择具有最有利的率失真值的变换集,并且可根据所选择的变换集来执行编码。With this encoding, rate-distortion values can be calculated for multiple transform sets. Thereafter, through a comparison between rate-distortion values of applicable transform sets, a transform set with the most favorable rate-distortion value can be selected, and encoding can be performed according to the selected transform set.

在实施例中,代替针对多个通道单独用信号发送多条变换选择信息,可仅针对亮度信号(亮度通道)用信号发送变换选择信息,并且可不针对作为色度通道的其余通道单独用信号发送变换选择信息。In an embodiment, instead of individually signaling multiple pieces of transform selection information for multiple channels, transform selection information may only be signaled for the luma signal (the luma channel), and may not be separately signaled for the remaining channels that are chroma channels Transform selection information.

在实施例中,解码设备1700可从比特流获取变换选择信息。In an embodiment, the decoding apparatus 1700 may obtain transform selection information from the bitstream.

在步骤2110,通信单元1720可接收比特流。At step 2110, the communication unit 1720 may receive the bitstream.

在步骤2120,处理单元1710可确定与目标块的目标通道的变换选择信息的共享是否将被使用。At step 2120, the processing unit 1710 may determine whether sharing of transform selection information with the target channel of the target block is to be used.

当将不共享变换选择信息时,可执行步骤2130。When the transform selection information is not to be shared, step 2130 may be performed.

当将共享变换选择信息时,可执行步骤2140。When the transform selection information is to be shared, step 2140 may be performed.

在步骤2130,处理单元1710可从比特流获取目标通道的变换选择信息。处理单元1710可从比特流解析和读取目标通道的变换选择信息。In step 2130, the processing unit 1710 may obtain transform selection information of the target channel from the bitstream. The processing unit 1710 may parse and read the transform selection information of the target channel from the bitstream.

在步骤2140,处理单元1710可设置变换选择信息,使得代表性通道的变换选择信息被用作目标通道的变换选择信息。In step 2140, the processing unit 1710 may set the transform selection information such that the transform selection information of the representative channel is used as the transform selection information of the target channel.

步骤2120、步骤2130和步骤2140可由以下代码16表示:Step 2120, Step 2130 and Step 2140 can be represented by the following code 16:

[代码16][Code 16]

Figure BDA0002623340100000961
Figure BDA0002623340100000961

x0和y0可以是指示目标块的位置的空间坐标。x0 and y0 may be spatial coordinates indicating the location of the target block.

cIdx可指示目标块的目标通道。cIdx may indicate the target channel of the target block.

当图像中的通道的数量是3时,代码10中的cIdx的值可以是值{0,1,2}中的一个。例如,cIdx的值可以是预定义值{0,1,2}中的一个。When the number of channels in the image is 3, the value of cIdx in code 10 may be one of the values {0, 1, 2}. For example, the value of cIdx may be one of the predefined values {0,1,2}.

代表性通道的cIdx可被假设为0。The cIdx of the representative channel can be assumed to be 0.

“cIdx!=0”可指示目标通道不是代表性通道(例如,亮度通道)。cIdx值0可指示目标通道是代表性通道。"cIdx!=0" may indicate that the target channel is not a representative channel (eg, a luma channel). A cIdx value of 0 may indicate that the target channel is the representative channel.

换句话说,在步骤2120,当目标通道不是代表性通道并且跨通道预测被用于目标块时,处理单元1710可确定与目标通道的变换选择信息的共享将被使用。当目标通道是代表性通道时或者当跨通道预测不被用于目标块时,处理单元1710可确定与目标通道的变换选择信息的共享将不被使用。In other words, at step 2120, when the target channel is not a representative channel and cross-channel prediction is used for the target block, the processing unit 1710 may determine that the sharing of transform selection information with the target channel is to be used. When the target channel is a representative channel or when cross-channel prediction is not used for the target block, the processing unit 1710 may determine that the sharing of transform selection information with the target channel will not be used.

在步骤2130,处理单元1710可从比特流获取目标通道的变换选择信息。处理单元1710可从比特流解析和读取目标通道的变换选择信息。In step 2130, the processing unit 1710 may obtain transform selection information of the target channel from the bitstream. The processing unit 1710 may parse and read the transform selection information of the target channel from the bitstream.

变换选择信息可被存储在transform selection information[x0][y0][cIdx]中。Transform selection information may be stored in transform selection information[x0][y0][cIdx].

在步骤2140,处理单元1710可设置变换选择信息,使得代表性通道的变换选择信息被用作目标通道的变换选择信息。In step 2140, the processing unit 1710 may set the transform selection information such that the transform selection information of the representative channel is used as the transform selection information of the target channel.

处理单元1710可将代表性通道的变换选择信息用作目标通道的变换选择信息,而不从比特流解析和读取目标通道的变换选择信息。也就是说,处理单元1710可将transformselection information[x0][y0][0]的值存储在transform selection information[x0][y0][cIdx]中。The processing unit 1710 may use the transform selection information of the representative channel as the transform selection information of the target channel without parsing and reading the transform selection information of the target channel from the bitstream. That is, the processing unit 1710 may store the value of transform selection information[x0][y0][0] in transform selection information[x0][y0][cIdx].

也就是说,在不需要用于从比特流解析和读取目标通道的变换选择信息的过程的情况下,先前存储在transform selection information[x0][y0][0]中的值可被同样地用于transform selection information[x0][y0][cIdx]。That is, the values previously stored in the transform selection information[x0][y0][0] can be equally For transform selection information[x0][y0][cIdx].

确定是否将使用跨通道预测Determine if cross-channel prediction will be used

在上面参照图18至图21描述的实施例中,已经例示了通过检查目标块的帧内预测模式是否是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个来确定是否将使用跨通道预测。换句话说,当目标块的帧内预测模式是INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的一个时,可使用跨通道预测。当目标块的帧内预测模式不是INTRA_CCLM模式、INTRA_MMLM模式及INTRA_MFLM模式中的一个时,可不使用跨通道预测。In the embodiments described above with reference to FIGS. 18 to 21 , it has been exemplified whether cross-channel prediction is to be used by checking whether the intra prediction mode of the target block is one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode. In other words, when the intra prediction mode of the target block is one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode, cross-channel prediction may be used. When the intra prediction mode of the target block is not one of INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode, cross-channel prediction may not be used.

上述确定是否将使用跨通道预测仅是示例,并且可通过以下代码17至代码23中的一个来确定是否将使用跨通道预测。例如,当以下代码中的每一个的条件的值为真时,可使用跨通道预测,而当以下代码中的每一个的条件的值为假时,可不使用跨通道预测。“intra_chroma_pred_mode”可以是色度通道的帧内预测模式。The above-described determination of whether cross-channel prediction will be used is merely an example, and whether cross-channel prediction will be used may be determined by one of the following codes 17 to 23. For example, cross-channel prediction may be used when the value of the condition of each of the following codes is true, and cross-channel prediction may not be used when the value of the condition of each of the following codes is false. "intra_chroma_pred_mode" may be the intra prediction mode of the chroma channel.

[代码17][Code 17]

if(intra_chroma_pred_mode==CCLM mode)if(intra_chroma_pred_mode==CCLM mode)

[代码18][Code 18]

if(intra_chroma_pred_mode==DM mode)if(intra_chroma_pred_mode==DM mode)

[代码19][Code 19]

if(intra_chroma_pred_mode==INTRA_CCLM mode)if(intra_chroma_pred_mode==INTRA_CCLM mode)

[代码20][Code 20]

if(intra_chroma_pred_mode==INTRA_MMLM mode)if(intra_chroma_pred_mode==INTRA_MMLM mode)

[代码21][Code 21]

if(intra_chroma_pred_mode==INTRA_MFLM mode)if(intra_chroma_pred_mode==INTRA_MFLM mode)

[代码22][Code 22]

if((intra_chroma_pred_mode==INTRA_CCLM mode)||if((intra_chroma_pred_mode==INTRA_CCLM mode)||

(intra_chroma_pred_mode==INTRA_MMLM mode)||(intra_chroma_pred_mode==INTRA_MMLM mode)||

(intra_chroma_pred_mode==INTRA_MFLM mode))(intra_chroma_pred_mode==INTRA_MFLM mode))

[代码23][Code 23]

if((intra_chroma_pred_mode==DM mode)||if((intra_chroma_pred_mode==DM mode)||

(intra_chroma_pred_mode==INTRA_CCLM mode)||(intra_chroma_pred_mode==INTRA_CCLM mode)||

(intra_chroma_pred_mode==INTRA_MMLM mode)||(intra_chroma_pred_mode==INTRA_MMLM mode)||

(intra_chroma_pred_mode==INTRA_MFLM mode))(intra_chroma_pred_mode==INTRA_MFLM mode))

代码17至代码23中描述的CCLM模式、DM模式、INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式中的每一个可指示在上述表10和表11中的第一列中呈现的intra_chroma_pred_mode的一个值。关于CCLM模式、DM模式、INTRA_CCLM模式、INTRA_MMLM模式和INTRA_MFLM模式,可参考上面关于表10和表11进行的前述描述。Each of CCLM mode, DM mode, INTRA_CCLM mode, INTRA_MMLM mode, and INTRA_MFLM mode described in Code 17 to Code 23 may indicate one value of intra_chroma_pred_mode presented in the first column in Table 10 and Table 11 above. Regarding the CCLM mode, the DM mode, the INTRA_CCLM mode, the INTRA_MMLM mode, and the INTRA_MFLM mode, reference may be made to the foregoing descriptions made with respect to Table 10 and Table 11 above.

当旨在确定跨通道预测是否将被使用时,在上述代码17至代码23中的方案中可另外考虑块的尺寸。The block size may additionally be considered in the schemes in Code 17 to Code 23 above when aiming to determine whether cross-channel prediction is to be used.

也可通过以下代码24至代码30中的一个来执行确定是否将使用跨通道预测。例如,当以下代码中的每一个中的条件的值为真时,可使用跨通道预测,而当以下代码中的每一个中的条件的值为假时,可不使用跨通道预测。Determining whether cross-channel prediction is to be used may also be performed by one of Code 24 to Code 30 below. For example, cross-channel prediction may be used when the value of the condition in each of the following codes is true, and may not be used when the value of the condition in each of the following code is false.

[代码24][Code 24]

if((intra_chroma_pred_mode==CCLM mode)&&块尺寸条件)if((intra_chroma_pred_mode==CCLM mode) && block size condition)

[代码25][Code 25]

if((intra_chroma_pred_mode==DM mode)&&块尺寸条件)if((intra_chroma_pred_mode==DM mode) && block size condition)

[代码26][Code 26]

if((intra_chroma_pred_mode==INTRA_CCLM mode)&&块尺寸条件)if((intra_chroma_pred_mode==INTRA_CCLM mode) && block size condition)

[代码27][Code 27]

if((intra_chroma_pred_mode==INTRA_MMLM mode)&&块尺寸条件)if((intra_chroma_pred_mode==INTRA_MMLM mode) && block size condition)

[代码28][Code 28]

if((intra_chroma_pred_mode==INTRA_MFLM mode)&&块尺寸条件)if((intra_chroma_pred_mode==INTRA_MFLM mode) && block size condition)

[代码29][Code 29]

if((intra_chroma_pred_mode==INTRA_CCLM mode)||if((intra_chroma_pred_mode==INTRA_CCLM mode)||

(intra_chroma_pred_mode==INTRA_MMLM mode)||(intra_chroma_pred_mode==INTRA_MMLM mode)||

(intra_chroma_pred_mode==INTRA_MFLM mode)&&块尺寸条件)(intra_chroma_pred_mode==INTRA_MFLM mode) && block size condition)

[代码30][Code 30]

if((intra_chroma_pred_mode==DM(direct mode)mode)||if((intra_chroma_pred_mode==DM(direct mode)mode)||

(intra_chroma_pred_mode==INTRA_CCLM mode)||(intra_chroma_pred_mode==INTRA_CCLM mode)||

(intra_chroma_pred_mode==INTRA_MMLM mode)||(intra_chroma_pred_mode==INTRA_MMLM mode)||

(intra_chroma_pred_mode==INTRA_MFLM mode)&&块尺寸条件)(intra_chroma_pred_mode==INTRA_MFLM mode) && block size condition)

代码24至代码30中呈现的块尺寸条件可被以下代码代码31、代码32、代码33和代码34中的一个代替。The block size condition presented in Code 24 to Code 30 may be replaced by one of the following codes Code 31, Code 32, Code 33 and Code 34.

[代码31][Code 31]

((log2TbWidth<=Log2MaxSizeWidth)&&(log2TbHeight<=Log2MaxSizeHeight))((log2TbWidth<=Log2MaxSizeWidth)&&(log2TbHeight<=Log2MaxSizeHeight))

[代码32][Code 32]

((log2TbWidth>=Log2MinSizeWidth)&&(log2TbHeight>=Log2MinSizeHeight))((log2TbWidth>=Log2MinSizeWidth)&&(log2TbHeight>=Log2MinSizeHeight))

[代码33][Code 33]

((log2TbWidth>Log2MinSizeWidth)&&(log2TbHeight<=Log2MaxSizeHeight))((log2TbWidth>Log2MinSizeWidth)&&(log2TbHeight<=Log2MaxSizeHeight))

[代码34][Code 34]

((log2TbWidth>Log2MinSizeWidth)&&(log2TbHeight>Log2MaxSizeHeight))((log2TbWidth>Log2MinSizeWidth)&&(log2TbHeight>Log2MaxSizeHeight))

上面已经参照方程式11和方程式12描述了log2TbWidth和log2TbHeight。log2TbWidth and log2TbHeight have been described above with reference to Equation 11 and Equation 12.

Log2MaxSizeWidth、Log2MaxSizeHeight、Log2MinSizeWidth和Log2MinSizeHeight可以是预定义值。Log2MaxSizeWidth可以是具有最大尺寸的块的宽度。Log2MaxSizeHeight可以是具有最大尺寸的块的高度。Log2MinSizeWidth可以是具有最小尺寸的块的宽度。Log2MinSizeHeight可以是具有最小尺寸的块的高度。Log2MaxSizeWidth, Log2MaxSizeHeight, Log2MinSizeWidth and Log2MinSizeHeight may be predefined values. Log2MaxSizeWidth may be the width of the block with the largest size. Log2MaxSizeHeight can be the height of the block with the largest size. Log2MinSizeWidth can be the width of the block with the smallest size. Log2MinSizeHeight can be the height of the block with the smallest size.

例如,Log2MaxSizeWidth的值可以是16,Log2MaxSizeHeight的值可以是16,Log2MinSizeWidth的值可以是4,并且Log2MinSizeHeight的值可以是4。For example, the value of Log2MaxSizeWidth may be 16, the value of Log2MaxSizeHeight may be 16, the value of Log2MinSizeWidth may be 4, and the value of Log2MinSizeHeight may be 4.

可选地,Log2MaxSizeWidth的值可以是32,并且Log2MaxSizeHeight的值可以是32。Alternatively, the value of Log2MaxSizeWidth may be 32, and the value of Log2MaxSizeHeight may be 32.

当使用DM时,色度信号的帧内预测模式可不被单独用信号发送。当使用DM时,针对亮度信号用信号发送的帧内预测模式也可在不经改变的情况下在色度模式中被使用。When using DM, the intra prediction mode of the chrominance signal may not be signaled separately. When DM is used, the intra-prediction mode signaled for the luma signal can also be used unchanged in the chroma mode.

在块划分结构下使用通道之间的选择性信息的共享的编码和解码Encoding and decoding using sharing of selective information between channels under a block partitioning structure

通常,当图像被编码时,可考虑图像中的空间特性将合适的编码方案单独用于多个空间区域。对于这种编码,可将图像分区为CU,并且可分别对从分区生成的CU进行编码。In general, when an image is encoded, an appropriate encoding scheme may be used individually for multiple spatial regions, taking into account the spatial characteristics in the image. For this encoding, a picture may be partitioned into CUs, and CUs generated from the partitions may be encoded separately.

为了执行这种编码,可将相同的块分区结构用于亮度通道和色度通道。To perform this encoding, the same block partition structure can be used for the luma and chroma channels.

然而,亮度信号的特性和色度信号的特性可彼此不同。考虑特性之间的差异,可分别将不同的块分区结构用于亮度通道和色度通道,以便实现更有效的编码。However, the characteristics of the luminance signal and the characteristics of the chrominance signal may be different from each other. Considering the difference between characteristics, different block partition structures can be used for luma channel and chroma channel respectively, in order to achieve more efficient coding.

在下文中,将图像的块分区结构针对亮度信号和色度信号(或多个通道)相同的情况称为“单树块分区结构”或“单树”。In the following, the case where the block partition structure of an image is the same for the luma signal and the chroma signal (or multiple channels) is referred to as "single tree block partition structure" or "single tree".

在下文中,将图像的块分区结构针对亮度信号和色度信号(或多个通道)不相同的情况称为“双树块分区结构”或“双树”。Hereinafter, the case where the block partition structure of an image is different for the luma signal and the chroma signal (or multiple channels) is referred to as "dual tree block partition structure" or "dual tree".

在实施例中,可在亮度通道和色度通道(或多个通道)之间指定与目标通道的目标块对应的另外通道的块。与目标通道的目标块对应的另外通道的块被称为“对应块(col-block)”。In an embodiment, a block of additional channels corresponding to the target block of the target channel may be specified between the luma channel and the chroma channel (or channels). A block of another channel corresponding to the target block of the target channel is called a "col-block".

在实施例中,当亮度通道和色度通道(或多个通道)具有相同的块分区结构时(即,当单树被应用时),可指定与目标通道的目标块对应的另外通道的块。In an embodiment, when the luma channel and the chroma channel (or channels) have the same block partition structure (ie, when a single tree is applied), the block of the additional channel may be specified corresponding to the target block of the target channel .

在实施例中,当亮度通道和色度通道(或多个通道)具有不同的块分区结构时(即,当双树被应用时),可指定与目标通道的目标块对应的另外通道的块。In an embodiment, when the luma channel and the chroma channel (or channels) have different block partition structures (ie, when a dual tree is applied), the block of the additional channel may be specified corresponding to the target block of the target channel .

当亮度通道和色度通道(或多个通道)具有不同的块分区结构时(即,当双树被应用时),对应块的编码决策信息可与目标通道的目标块共享。通过共享编码决策信息,对目标块进行编码,并且因此可提高编码效率。When the luma channel and the chroma channel (or channels) have different block partition structures (ie, when dual trees are applied), the coding decision information of the corresponding block can be shared with the target block of the target channel. By sharing the encoding decision information, the target block is encoded, and thus encoding efficiency can be improved.

可从压缩的比特流解析与目标通道的目标块对应的对应块的编码决策信息。The encoding decision information for the corresponding block corresponding to the target block of the target channel may be parsed from the compressed bitstream.

可从压缩的比特流解析与目标通道的目标块对应的对应块的编码决策信息,并且可使用对应块的编码决策信息来对目标通道的目标块进行解码。The encoding decision information of the corresponding block corresponding to the target block of the target channel may be parsed from the compressed bitstream, and the target block of the target channel may be decoded using the encoding decision information of the corresponding block.

例如,与目标通道的目标块对应的对应块的transform_skip_flag信息可被解析,并且对应块的transform_skip_flag信息可被用于确定针对目标块是否将跳过变换。For example, transform_skip_flag information of the corresponding block corresponding to the target block of the target channel may be parsed, and the transform_skip_flag information of the corresponding block may be used to determine whether transform is to be skipped for the target block.

共享信息可以是在亮度块与色度块之间共享的编码决策信息。Shared information may be coding decision information shared between luma blocks and chroma blocks.

当第一通道的块分区结构与第二通道的块分区结构彼此相同时,如果第一通道的第一块的空间位置与第二通道的第二块的空间位置对应,则第一块和第二块可彼此对应(即,可以是同位的)。换句话说,不同通道的对应块可以是具有对应(同位的)空间位置的不同通道中的块。与第一块对应的第二块的共享信息可被用于第一块的编码和/或解码。When the block partition structure of the first channel and the block partition structure of the second channel are the same as each other, if the spatial position of the first block of the first channel corresponds to the spatial position of the second block of the second channel, the first block and the second block The two blocks may correspond to each other (ie, may be co-located). In other words, corresponding blocks of different channels may be blocks in different channels with corresponding (co-located) spatial locations. The shared information of the second block corresponding to the first block may be used for encoding and/or decoding of the first block.

当色度通道的块分区结构与亮度通道的块分区结构相同时,与色度通道的特定色度块对应的亮度块可以是在与所述特定色度块的空间位置对应的空间位置处的亮度块。在这种情况下,与色度块对应的亮度块的共享信息可被用于色度块的编码和/或解码。When the block partition structure of the chroma channel is the same as the block partition structure of the luma channel, the luma block corresponding to a specific chroma block of the chroma channel may be at a spatial position corresponding to the spatial position of the specific chroma block Luminance block. In this case, shared information of luma blocks corresponding to chroma blocks may be used for encoding and/or decoding of chroma blocks.

图22示出单树块分区结构。Figure 22 shows a single tree block partition structure.

图23示出双树块分区结构。Figure 23 shows a dual tree block partition structure.

在4:2:0颜色子采样结构下,在空间上与色度块对应的亮度信号区域可占据是色度块的四倍大的面积。换句话说,亮度信号区域的水平长度(宽度)和垂直长度(高度)可以是色度块的水平长度和垂直长度的两倍。Under the 4:2:0 color subsampling structure, the luma signal region spatially corresponding to the chroma block may occupy an area four times larger than that of the chroma block. In other words, the horizontal length (width) and vertical length (height) of the luminance signal region may be twice as large as the horizontal length and vertical length of the chrominance block.

如图22中所示,在色度通道和亮度通道的对应图像区域中,色度通道的块分区结构和亮度通道的块分区结构可彼此相同。换句话说,可将单树用于色度通道和亮度通道。As shown in FIG. 22 , in the corresponding image regions of the chroma channel and the luma channel, the block partition structure of the chroma channel and the block partition structure of the luma channel may be the same as each other. In other words, a single tree can be used for both chroma and luma channels.

在图22和随后的附图中,彼此对应的图像区域由“对应区域”指示。In FIG. 22 and subsequent figures, image areas corresponding to each other are indicated by "corresponding areas".

如图23中所示,在色度通道和亮度通道的对应图像区域中,色度通道的块分区结构和亮度通道的块分区结构可彼此不同。换句话说,可将双树用于色度通道和亮度通道。As shown in FIG. 23 , in the corresponding image regions of the chroma channel and the luma channel, the block partition structure of the chroma channel and the block partition structure of the luma channel may be different from each other. In other words, dual trees can be used for both chroma and luma channels.

例如,在图23中,在空间上与一个色度块对应的亮度通道的区域可被分区为八个块。For example, in FIG. 23, the area of the luma channel spatially corresponding to one chroma block may be partitioned into eight blocks.

当亮度通道和色度通道的对应图像区域中的块分区结构彼此相同时,可明确地指定在空间上与特定色度块对应的亮度通道的块。When the block partition structures in the corresponding image regions of the luma channel and the chroma channel are identical to each other, the block of the luma channel that spatially corresponds to a particular chroma block can be unambiguously specified.

相反,当亮度通道的块分区结构与色度通道的块分区结构不相同时,对于通过根据色度通道的块分区结构进行分区而确定的指定色度块,与指定色度块对应的亮度块在亮度通道中可能未被明确地指定。如图23中所示,这种不明确归因于色度通道的块分区结构和亮度通道的块分区结构彼此不相同的事实。Conversely, when the block partition structure of the luma channel is different from the block partition structure of the chroma channel, for a specified chroma block determined by partitioning according to the block partition structure of the chroma channel, the luma block corresponding to the specified chroma block May not be explicitly specified in the luma channel. As shown in FIG. 23, this ambiguity is due to the fact that the block partition structure of the chroma channel and the block partition structure of the luma channel are different from each other.

在实施例中,对于亮度通道的块分区结构与色度通道的块分区结构不相同的情况,将描述用于指定与色度块对应的亮度块的方法。In an embodiment, a method for specifying a luma block corresponding to a chroma block will be described for a case where the block partition structure of the luma channel is different from that of the chroma channel.

例如,可指定与色度块对应的多个亮度块。借助于这种指定,可从与作为目标块的色度块对应的一个或更多个亮度块获取一条(多条)共享信息,并且可使用所获取的一条(多条)共享信息来执行色度块的编码和/或解码。For example, multiple luma blocks corresponding to chroma blocks may be specified. With this designation, one piece(s) of shared information can be acquired from one or more luma blocks corresponding to the chroma blocks as target blocks, and the chrominance can be performed using the acquired piece(s) of shared information Encoding and/or decoding of degree blocks.

下面,将针对第一通道的块分区结构与第二通道的块分区结构不相同的情况描述用于指定与第一通道的块对应的第二通道的一个或更多个块的方法。在以下描述中,尽管第一通道将被描述为色度通道并且第二通道将被描述为亮度通道,但是色度通道和亮度通道仅是示例性的,并且如上所述,第一通道和第二通道可以是不同类型的通道。Hereinafter, a method for specifying one or more blocks of the second channel corresponding to the blocks of the first channel will be described for a case where the block partition structure of the first channel is different from that of the second channel. In the following description, although the first channel will be described as a chrominance channel and the second channel will be described as a luma channel, the chrominance channel and the luma channel are only exemplary, and as described above, the first channel and the second channel will be described as a luma channel. The two channels can be different types of channels.

图24示出根据示例的用于基于对应区域中的位置来指定对应块的方案。24 illustrates a scheme for specifying corresponding blocks based on positions in corresponding regions, according to an example.

指示与色度块对应的亮度块的对应区域可被指定为矩形区域。该矩形区域中的最上最左像素的位置可以是(xCb,yCb)。该矩形区域中的最下最右像素的位置可以是(xCb+cbWidth-1,yCb+cbHeight-1)。A corresponding area indicating a luma block corresponding to a chroma block may be designated as a rectangular area. The position of the uppermost leftmost pixel in the rectangular area may be (xCb, yCb). The position of the lowermost and rightmost pixel in the rectangular area may be (xCb+cbWidth-1, yCb+cbHeight-1).

位置(xCb,yCb)可指示与色度块(即,色度编码块)中的最上最左像素的位置对应的亮度像素的位置。Position (xCb, yCb) may indicate the position of the luma pixel corresponding to the position of the uppermost leftmost pixel in the chroma block (ie, the chroma coded block).

cbWidth和cbHeight可以是分别指示基于亮度像素的目标块的宽度和高度的值。cbWidth and cbHeight may be values indicating the width and height of the target block based on luma pixels, respectively.

换句话说,指示与色度块对应的亮度块的对应区域可被定义为矩形区域,其中,在所述矩形区域中基于亮度像素的位置的最上最左像素的位置是(xCb,yCb),并且所述矩形区域具有水平宽度cbWidth和垂直高度cbHeight。In other words, a corresponding area indicating a luma block corresponding to a chroma block may be defined as a rectangular area in which the position of the uppermost leftmost pixel based on the position of the luma pixel is (xCb, yCb), And the rectangular area has a horizontal width cbWidth and a vertical height cbHeight.

指示与上述色度块对应的亮度块的对应区域可被应用于以上参照图24至图31描述的实施例。The corresponding regions indicating the luminance blocks corresponding to the above-described chrominance blocks may be applied to the embodiments described above with reference to FIGS. 24 to 31 .

在图24中,与色度块对应的亮度块可以是在空间上与色度块对应的亮度通道的对应区域中的预定义位置处存在的亮度块。In FIG. 24, a luma block corresponding to a chroma block may be a luma block existing at a predefined position in a corresponding region of a luma channel corresponding to a chroma block in space.

换句话说,在空间上与色度块对应的亮度通道的对应区域中的预定义位置处存在的亮度块可被指定为与色度块对应的一个或更多个亮度块。可选地,占据在空间上与色度块对应的亮度通道的对应区域中的预定义位置的亮度块可被指定为与色度块对应的一个或更多个亮度块。In other words, a luma block that exists at a predefined position in a corresponding region of a luma channel that spatially corresponds to a chroma block may be designated as one or more luma blocks corresponding to a chroma block. Alternatively, luma blocks occupying predefined positions in corresponding regions of the luma channel spatially corresponding to the chroma blocks may be designated as one or more luma blocks corresponding to the chroma blocks.

例如,预定义位置可以是在空间上与色度块对应的亮度通道的区域中的中心(CR)位置、左上(TL)位置、右上(TR)位置、左下(BL)位置和右下(BR)位置。For example, the predefined positions may be a center (CR) position, an upper left (TL) position, an upper right (TR) position, a lower left (BL) position, and a lower right (BR) position in the region of the luma channel spatially corresponding to the chroma block. )Location.

CR位置可指示(xCb+cbWidth/2,yCb+cbHeight/2)。TL位置可指示(xCb,yCb)。TR位置可指示(xCb+cbWidth-1,yCb)。BL位置可指示(xCb,yCb+cbHeight-1)。BR位置可指示(xCb+cbWidth-1,yCb+cbHeight-1)。The CR position may indicate (xCb+cbWidth/2, yCb+cbHeight/2). The TL position may indicate (xCb, yCb). The TR position may indicate (xCb+cbWidth-1, yCb). The BL position may indicate (xCb, yCb+cbHeight-1). The BR position may indicate (xCb+cbWidth-1, yCb+cbHeight-1).

在实施例中,可在多个通道(诸如亮度通道和色度通道)中指定与目标通道的目标块对应的另外通道的块。与目标通道的目标块对应的另外通道的块被称为“对应块”。In an embodiment, blocks of additional channels corresponding to the target block of the target channel may be specified in a plurality of channels, such as the luma channel and the chrominance channel. Blocks of another channel corresponding to the target block of the target channel are called "corresponding blocks".

在实施例中,在空间上与色度块对应的亮度通道的区域中,包括指示中心(CR)的位置(xCb+cbWidth/2,yCb+cbHeight/2)处存在的亮度像素的亮度块可以是对应块。因此,当目标块以双树的形式被分区时,可明确地指定多个通道中的与目标通道(例如,色度通道)的目标块对应的另外通道(例如,亮度通道)的特定块。在实施例中,在色度通道的编码和/或解码中,包括位置(xCb+cbWidth/2,yCb+cbHeight/2)处存在的亮度像素的亮度块可被指定为对应块。关于指定的对应块的信息可被用于对目标块进行编码和/或解码。In an embodiment, in the region of the luma channel spatially corresponding to the chroma block, the luma block including the luma pixel present at the position (xCb+cbWidth/2, yCb+cbHeight/2) indicating the center (CR) may be is the corresponding block. Thus, when a target block is partitioned in a dual-tree, a particular block of another channel (eg, luma channel) of the plurality of channels corresponding to the target block of the target channel (eg, chroma channel) can be explicitly specified. In an embodiment, in the encoding and/or decoding of the chroma channels, a luma block including a luma pixel present at position (xCb+cbWidth/2, yCb+cbHeight/2) may be designated as the corresponding block. Information about the specified corresponding block may be used to encode and/or decode the target block.

例如,预定义位置可以是在空间上与色度块对应的亮度通道的区域中的CR位置、TL位置、TR位置、BL位置和BR位置中的一些位置。For example, the predefined positions may be some of the CR positions, the TL positions, the TR positions, the BL positions, and the BR positions in the region of the luma channel that spatially corresponds to the chroma block.

例如,与色度块对应的亮度块可以是包括位于在空间上与色度块对应的亮度通道的区域中的以下位置处的像素中的至少一个像素的块:中心位置、左上位置、右上位置、左下位置和右下位置。For example, a luma block corresponding to a chroma block may be a block including at least one of pixels located at the following positions in the region of the luma channel spatially corresponding to the chroma block: center position, upper left position, upper right position , the lower left position and the lower right position.

例如,与色度块对应的亮度块可以是包括位于在空间上与色度块对应的亮度通道的区域中的以下位置处的像素中的至少一个像素的块中的一些块:中心位置、左上位置、右上位置、左下位置和右下位置。For example, the luma blocks corresponding to the chroma blocks may be some of the blocks including at least one of the pixels located in the area of the luma channel spatially corresponding to the chroma blocks: center position, upper left position, upper right position, lower left position, and lower right position.

例如,与色度块对应的亮度块可包括包含位于在空间上与色度块对应的亮度通道的区域中的以下位置处的像素中的至少一个像素的块:中心位置、左上位置、右上位置、左下位置和右下位置。For example, a luma block corresponding to a chroma block may include a block containing at least one of the pixels located at the following positions in the region of the luma channel spatially corresponding to the chroma block: center position, upper left position, upper right position , the lower left position and the lower right position.

图25示出根据示例的用于基于对应区域中的面积来指定对应块的方案。25 illustrates a scheme for specifying corresponding blocks based on areas in corresponding regions, according to an example.

图26示出根据示例的用于基于对应区域中的面积来指定对应块的另一方案。26 illustrates another scheme for specifying corresponding blocks based on areas in corresponding regions, according to an example.

与色度块对应的亮度块可指示在空间上与色度块对应的亮度通道的区域中的具有最大面积的亮度块。The luma block corresponding to the chroma block may indicate the luma block having the largest area in the region of the luma channel spatially corresponding to the chroma block.

可选地,与色度块对应的亮度块可以是在空间上与色度块对应的亮度通道的区域中的具有最大面积的预定义数量的亮度块。Alternatively, the luma block corresponding to the chroma block may be a predefined number of luma blocks with the largest area in the region of the luma channel spatially corresponding to the chroma block.

这样的指定方案可归因于以下事实:有很大可能性在与色度块对应的亮度通道的区域中的具有最大面积的亮度块或具有最大面积的预定义数量的亮度块的特征将与色度块的特征类似。Such a designation scheme can be attributed to the fact that there is a high probability that the feature of the luma block with the largest area or the predefined number of luma blocks with the largest area in the region of the luma channel corresponding to the chroma block will be the same as Chroma blocks have similar characteristics.

如图25中所示,所述预定数量可以是2。在图25中,可从与色度块对应的亮度通道的区域中的八个亮度块中选择具有最大面积的两个块(即,块1和块2)。As shown in FIG. 25, the predetermined number may be two. In FIG. 25, two blocks having the largest area (ie, block 1 and block 2) may be selected from the eight luma blocks in the region of the luma channel corresponding to the chroma block.

如图26中所示,所述预定数量可以是3。在图25中,可从与色度块对应的亮度通道的区域中的八个亮度块中选择具有最大面积的三个块(即,块1、块2和块3)。As shown in FIG. 26, the predetermined number may be three. In FIG. 25, three blocks having the largest area (ie, block 1, block 2, and block 3) may be selected from the eight luma blocks in the region of the luma channel corresponding to the chroma block.

通过这种指定方案,即使亮度通道和色度通道的块分区结构彼此不相同,也可通过利用共享信息来提高编码效率。With this designation scheme, even if the block partition structures of the luma channel and the chroma channel are different from each other, the coding efficiency can be improved by utilizing the shared information.

图27示出根据示例的用于基于对应区域中的块的形式指定对应块的方案。FIG. 27 illustrates a scheme for specifying a corresponding block based on the form of the block in the corresponding area, according to an example.

图28示出根据示例的用于基于对应区域中的块的形式指定对应块的另一方案。FIG. 28 illustrates another scheme for specifying corresponding blocks based on the form of blocks in corresponding regions, according to an example.

与色度块对应的亮度块可指示在空间上与色度块对应的亮度通道的区域中的具有与色度块相同的形式的亮度块。A luma block corresponding to a chroma block may indicate a luma block having the same form as a chroma block in a region of a luma channel that spatially corresponds to the chroma block.

例如,块的形式可包括块的尺寸。For example, the form of the block may include the size of the block.

根据双树块分区结构,诸如在图27和28中所示的那些,色度通道的CU的块分区结构可不同于与色度通道的CU对应的亮度通道的区域中的CU的块分区结构。即使在这种情况下,与色度通道的CU中的目标块的区域对应的亮度通道的区域可作为单个块存在。According to dual-tree block partition structures, such as those shown in FIGS. 27 and 28 , the block partition structure of CUs for chroma channels may be different from the block partition structure of CUs in regions of luma channels corresponding to CUs for chroma channels . Even in this case, the region of the luma channel corresponding to the region of the target block in the CU of the chroma channel may exist as a single block.

在这种情况下,与色度块对应的亮度通道的区域中的单个亮度块可准确地与色度通道进行匹配。匹配的亮度块的共享信息可被共享作为色度块的编码决策信息。In this case, a single luma block in the region of the luma channel corresponding to the chroma block can be accurately matched to the chroma channel. Shared information for matched luma blocks may be shared as coding decision information for chroma blocks.

图29示出根据示例的用于基于对应区域中的块的高宽比来指定对应块的方案。29 illustrates a scheme for specifying a corresponding block based on the aspect ratio of the block in the corresponding region, according to an example.

图30示出根据示例的用于基于对应区域中的块的高宽比来指定对应块的另一方案。30 illustrates another scheme for specifying a corresponding block based on an aspect ratio of a block in a corresponding region, according to an example.

与色度块对应的亮度块可以是在空间上与色度块对应的亮度通道的区域中的具有与色度块相同的高宽比的亮度块。The luma block corresponding to the chroma block may be a luma block having the same aspect ratio as the chroma block in the region of the luma channel spatially corresponding to the chroma block.

可选地,与色度块对应的亮度块可以是在空间上与色度块对应的亮度通道的区域中的具有与色度块的高宽比相似的高宽比的亮度块。Alternatively, the luma block corresponding to the chroma block may be a luma block having an aspect ratio similar to that of the chroma block in the region of the luma channel spatially corresponding to the chroma block.

例如,在图29的亮度通道的区域中,可选择亮度块2,并且在图30的亮度通道的区域中,可选择亮度块1。For example, in the area of the luma channel of FIG. 29 , luma block 2 may be selected, and in the area of the luma channel of FIG. 30 , luma block 1 may be selected.

这里,块的高宽比可以是对应块的水平长度与垂直长度的比例。换句话说,块的高宽比可以是通过将块的水平长度除以块的垂直长度而获得的值。Here, the aspect ratio of the block may be the ratio of the horizontal length to the vertical length of the corresponding block. In other words, the aspect ratio of the block may be a value obtained by dividing the horizontal length of the block by the vertical length of the block.

例如,可使用以下方程式13来确定块的高宽比是否彼此相等:For example, the following Equation 13 can be used to determine whether the aspect ratios of blocks are equal to each other:

[方程式13][Equation 13]

(log2WidthChroma-log2HeightChroma)==(log2WidthLuma-log2HeightLuma)(log 2 Width Chroma - log 2 Height Chroma )==(log 2 Width Luma - log 2 Height Luma )

WidthChroma可以是色度块的宽度。HeightChroma可以是色度块的高度。Width Chroma can be the width of the chroma block. Height Chroma can be the height of the chroma block.

WidthLuma可以是与色度块对应的亮度块的宽度。HeightLuma可以是与色度块对应的亮度块的高度。Width Luma may be the width of the luma block corresponding to the chroma block. Height Luma may be the height of the luma block corresponding to the chroma block.

可使用以下方程式14来确定高宽比是否彼此相似:The following Equation 14 can be used to determine whether the aspect ratios are similar to each other:

[方程式14][Equation 14]

|(log2WidthChroma-log2HeightChroma)==(log2WidthLuma-log2HeightLuma)|<THD|(log 2 WidthChroma-log 2 HeightChroma)==(log 2 WidthLuma-log 2 HeightLuma)|<THD

“|x|”可指示x的绝对值。"|x|" may indicate the absolute value of x.

THD可以是阈值。例如,THD的值可以是2。THD can be the threshold. For example, the value of THD may be 2.

根据方程式13和方程式14,具有与色度块相同的高宽比的一个或更多个亮度块可被指定为对应块。可选地,具有与色度块的高宽比相似的高宽比的一个或更多个亮度块可被指定为对应块。According to Equation 13 and Equation 14, one or more luma blocks having the same aspect ratio as the chroma blocks may be designated as corresponding blocks. Alternatively, one or more luma blocks having an aspect ratio similar to that of the chroma blocks may be designated as corresponding blocks.

图31示出根据示例的用于基于对应区域中的块的编码属性来指定对应块的方案。31 illustrates a scheme for specifying corresponding blocks based on coding attributes of blocks in corresponding regions, according to an example.

虽然色度通道的块分区结构和亮度通道的块分区结构彼此独立,但是如果在空间上与色度块对应的亮度通道的区域中的亮度块之中存在具有与色度块相同的编码决策信息的亮度块,则色度块的共享信息和亮度块的共享信息可彼此相同。Although the block partition structure of the chroma channel and the block partition structure of the luma channel are independent of each other, if there is a luma block in the region of the luma channel spatially corresponding to the chroma block, there is the same coding decision information as the chroma block. , the shared information of the chrominance block and the shared information of the luma block may be the same as each other.

为了利用这些特性,针对预定义编码决策信息具有与色度块相同的值的亮度块可被指定为与色度块对应的亮度块。可选地,针对预定义编码决策信息具有与色度块的值类似的值的亮度块可被指定为与色度块对应的亮度块。To take advantage of these properties, a luma block that has the same value as a chroma block for predefined coding decision information can be designated as a luma block corresponding to a chroma block. Alternatively, a luma block having a value similar to that of the chroma block for the predefined encoding decision information may be designated as the luma block corresponding to the chroma block.

例如,预定义编码决策信息可以是关于帧内预测是否被使用的信息、帧内预测模式、运动预测信息、运动矢量、关于合并模式是否被使用的信息、推导模式、变换选择信息等。For example, the predefined encoding decision information may be information on whether intra prediction is used, intra prediction mode, motion prediction information, motion vector, information on whether merge mode is used, derivation mode, transform selection information, and the like.

例如,帧内预测模式可被用作预定义编码决策信息。在图31中,亮度块1、亮度块2和亮度块3被示出在亮度通道的区域中。亮度块1、亮度块2和亮度块3可以是具有与色度块的帧内预测模式相同(或相似)的帧内预测模式的亮度块。亮度块1、亮度块2和亮度块3可被指定为与色度块对应的亮度块。For example, intra prediction mode may be used as predefined encoding decision information. In FIG. 31, Luma Block 1, Luma Block 2, and Luma Block 3 are shown in the region of the luma channel. Luma Block 1, Luma Block 2, and Luma Block 3 may be luma blocks having the same (or similar) intra prediction mode as that of the chroma blocks. Luma block 1, luma block 2, and luma block 3 may be designated as luma blocks corresponding to chroma blocks.

根据上述特定方案,与色度块对应的亮度块可包括多个亮度块。当多个亮度块的多条共享信息彼此相同时,在所述多条共享信息被用于色度块时可能不会发生问题。相反,当多个亮度块的多条共享信息彼此不相同时,将被用于色度块的编码和/或解码的共享信息可能不明确。According to the above specific scheme, a luma block corresponding to a chroma block may include a plurality of luma blocks. When pieces of shared information of a plurality of luma blocks are identical to each other, a problem may not occur when the pieces of shared information are used for chroma blocks. Conversely, when pieces of shared information of multiple luma blocks are different from each other, shared information to be used for encoding and/or decoding of chroma blocks may be ambiguous.

与多个对应块的多条共享信息的值中的大部分值对应的值可被用作色度块的共享信息的值。这种决策方法可被称为“基于多数的共享信息决策方法”。通过这种方法,可在没有另外信令的情况下有效地共享将被共享的信息。Values corresponding to most of the values of the pieces of shared information of the plurality of corresponding blocks may be used as the values of the shared information of the chroma blocks. This decision-making method can be called "majority-based shared information decision-making method". In this way, the information to be shared can be efficiently shared without additional signaling.

例如,当共享transform_skip_flag信息时,占据与色度块对应的多个亮度块的多条transform_skip_flag信息的值的大部分的值可被共享作为色度块的transform_skip_flag信息的值。借助于这种共享,可执行色度块的编码和/或解码。For example, when transform_skip_flag information is shared, a value occupying most of the values of a plurality of pieces of transform_skip_flag information of a plurality of luma blocks corresponding to a chroma block may be shared as a value of the transform_skip_flag information of the chroma block. With this sharing, encoding and/or decoding of chroma blocks may be performed.

例如,仅当与色度块对应的亮度块的数量仅为1时,才使用共享信息。仅当在空间上与色度块对应的亮度通道的区域中仅存在一个满足上述特定条件的亮度块时,才可使用共享信息。可选地,当在空间上与目标通道的目标块对应的另外通道的区域仅由单个块指定时,可在通道之间共享编码决策信息。For example, shared information is used only when the number of luma blocks corresponding to chroma blocks is only 1. Shared information can be used only when there is only one luma block satisfying the above-mentioned specific condition in the region of the luma channel spatially corresponding to the chroma block. Alternatively, coding decision information may be shared between the channels when the region of another channel that spatially corresponds to the target block of the target channel is specified by only a single block.

参照图27,当在空间上与色度通道的目标块对应的亮度通道的区域被分区为仅一个块时,所述一个块可以是亮度通道的对应块。该对应块的这种共享信息可被用于对色度块进行编码和/或解码。Referring to FIG. 27 , when a region of a luma channel spatially corresponding to a target block of a chroma channel is partitioned into only one block, the one block may be a corresponding block of the luma channel. This shared information for the corresponding block may be used to encode and/or decode chroma blocks.

当在空间上与色度通道的目标块对应的亮度通道的区域不被分区为一个块时,在没有单独信令的情况下可不共享编码决策信息。When the region of the luma channel spatially corresponding to the target block of the chroma channel is not partitioned into one block, the coding decision information may not be shared without separate signaling.

图32是根据实施例的编码方法的流程图。32 is a flowchart of an encoding method according to an embodiment.

根据本实施例的编码方法和比特流生成方法可由编码设备1600执行。本实施例可以是目标编码方法或视频编码方法的一部分。The encoding method and the bitstream generation method according to the present embodiment may be performed by the encoding apparatus 1600 . This embodiment may be part of a target encoding method or a video encoding method.

在步骤3210,处理单元1610可确定目标块的代表性通道的编码决策信息。At step 3210, the processing unit 1610 may determine encoding decision information for the representative channel of the target block.

在步骤3220,处理单元1610可通过对使用目标块的代表性通道的编码决策信息的目标块执行编码来生成关于目标块的信息。In step 3220, the processing unit 1610 may generate information about the target block by performing encoding on the target block using the encoding decision information of the representative channel of the target block.

在步骤3230,处理单元1610可生成包括关于目标块的信息的比特流。At step 3230, the processing unit 1610 may generate a bitstream including information about the target block.

关于目标块的信息可包括代表性通道的编码决策信息。此外,比特流和关于目标块的信息可不包括目标通道的编码决策信息。The information about the target block may include coding decision information of the representative channel. Also, the bitstream and the information about the target block may not include coding decision information of the target channel.

参照图32描述的实施例可与上述另外的实施例进行组合。这里将省略重复的描述The embodiment described with reference to FIG. 32 may be combined with the further embodiments described above. Duplicate description will be omitted here

图33是根据实施例的解码方法的流程图。33 is a flowchart of a decoding method according to an embodiment.

根据本实施例的解码方法可由解码设备1600执行。The decoding method according to the present embodiment may be performed by the decoding apparatus 1600 .

在步骤3310,通信单元1710可接收包括关于目标块的信息的比特流。At step 3310, the communication unit 1710 may receive a bitstream including information about the target block.

关于目标块的信息可包括代表性通道的编码决策信息。此外,比特流和关于目标块的信息可不包括目标通道的编码决策信息。The information about the target block may include coding decision information of the representative channel. Also, the bitstream and the information about the target block may not include coding decision information of the target channel.

在步骤3320,处理单元1720可共享目标块的代表性通道的编码决策信息作为目标块的目标通道的编码决策信息。In step 3320, the processing unit 1720 may share the encoding decision information of the representative channel of the target block as the encoding decision information of the target channel of the target block.

代表性通道的编码决策信息可被共享作为目标通道的编码决策信息。The encoding decision information of the representative channel may be shared as the encoding decision information of the target channel.

在步骤3320,处理单元1610可对使用目标通道的编码决策信息的目标块执行解码。In step 3320, the processing unit 1610 may perform decoding on the target block using the encoding decision information of the target channel.

参照图33描述的实施例可与上述另外的实施例进行组合。这里将省略重复的描述。The embodiment described with reference to FIG. 33 may be combined with the further embodiments described above. Duplicate descriptions will be omitted here.

可在编码设备1600和解码设备1700中使用相同的方法来执行上述实施例。The above-described embodiments may be performed using the same method in the encoding apparatus 1600 and the decoding apparatus 1700 .

将在实施例中应用的步骤、操作和过程的顺序在编码设备1600和解码设备1700中可彼此不同。可选地,将在实施例中应用的步骤、操作和过程的顺序在编码设备1600和解码设备1700中可彼此相同。The order of steps, operations, and processes to be applied in the embodiments may be different from each other in the encoding apparatus 1600 and the decoding apparatus 1700 . Optionally, the order of steps, operations and processes to be applied in the embodiment may be the same as each other in the encoding apparatus 1600 and the decoding apparatus 1700 .

可分别对亮度信号和色度信号执行实施例。可选地,可等同地对亮度信号和色度信号执行实施例。Embodiments may be performed separately for luminance and chrominance signals. Alternatively, the embodiments may be performed equally on luminance and chrominance signals.

将应用实施例的每个块的形式可以是正方形形式或非正方形形式。The form of each block to which the embodiment will be applied may be a square form or a non-square form.

可基于CU、PU、TU和目标块中的至少一个的尺寸来确定是否应用实施例。这里,尺寸可被定义为使实施例能够被应用于目标的最小尺寸和/或最大尺寸,并且可被定义为使实施例能够被应用于目标的固定尺寸。Whether to apply an embodiment may be determined based on a size of at least one of a CU, a PU, a TU, and a target block. Here, a size may be defined as a minimum size and/or a maximum size that enables an embodiment to be applied to a target, and may be defined as a fixed size that enables an embodiment to be applied to a target.

此外,第一实施例可被应用于第一尺寸,并且第二实施例可被应用于第二尺寸。也就是说,可根据目标的尺寸来复合地应用实施例。此外,实施例可仅被应用于目标的尺寸等于或大于最小尺寸并且小于或等于最大尺寸的情况。也就是说,实施例可仅被应用于目标的尺寸落入特定范围内的情况。Furthermore, the first embodiment can be applied to a first size, and the second embodiment can be applied to a second size. That is, the embodiments can be applied compoundly according to the size of the target. Furthermore, the embodiments may only be applied to the case where the size of the target is equal to or larger than the minimum size and smaller than or equal to the maximum size. That is, the embodiments may only be applied to the case where the size of the target falls within a specific range.

例如,实施例可仅被应用于目标块的尺寸等于或大于8×8的情况。例如,实施例可仅被应用于目标块的尺寸为4×4的情况。例如,实施例可仅被应用于目标块的尺寸小于或等于16×16的情况。例如,实施例可仅被应用于目标块的尺寸等于或大于16×16并且小于或等于64×64的情况。For example, the embodiments may only be applied to the case where the size of the target block is equal to or larger than 8×8. For example, the embodiments may only be applied to the case where the size of the target block is 4x4. For example, the embodiments may only be applied when the size of the target block is less than or equal to 16×16. For example, the embodiments may only be applied to the case where the size of the target block is equal to or greater than 16×16 and less than or equal to 64×64.

可根据时间层来确定是否应用实施例。为了识别将应用实施例的时间层,可用信号发送单独的标识符。实施例可被选择性地应用于由标识符指定的时间层。这里,这样的标识符可指示将应用实施例的最低层和/或最高层,并且也可指示将应用实施例的特定层。此外,可预先定义将应用实施例的时间层。Whether or not to apply an embodiment may be determined according to a time layer. In order to identify the time layer to which an embodiment is to be applied, a separate identifier may be signaled. Embodiments may be selectively applied to the time horizon specified by the identifier. Here, such an identifier may indicate the lowest layer and/or the highest layer to which the embodiment is to be applied, and may also indicate a specific layer to which the embodiment is to be applied. Furthermore, the time horizon to which the embodiments will be applied may be predefined.

例如,实施例可仅被应用于目标图像的时间层是最低层的情况。例如,实施例可仅被应用于目标图像的时间层标识符等于或大于1的情况。例如,实施例可仅被应用于目标图像的时间层是最高层的情况。For example, embodiments may be applied only to the case where the temporal layer of the target image is the lowest layer. For example, embodiments may only be applied when the temporal layer identifier of the target image is equal to or greater than one. For example, the embodiments may be applied only to the case where the temporal layer of the target image is the highest layer.

可定义将应用实施例的条带类型。根据条带类型,可选择性地应用实施例。The stripe type to which the embodiment will be applied can be defined. Embodiments may be selectively applied depending on the stripe type.

在以上描述的实施例中,虽然已基于作为一系列步骤或单元的流程图描述了方法,但是本公开不限于所述步骤的顺序,并且一些步骤可以按照与已描述的步骤的顺序不同的顺序来执行或者与其它步骤同时执行。此外,本领域技术人员将理解:在流程图中示出的步骤不是排他性的,并且还可包括其它步骤,或者,可在不脱离本公开的范围的情况下删除流程图中的一个或更多个步骤。In the above-described embodiments, although the method has been described based on the flowchart as a series of steps or units, the present disclosure is not limited to the order of the steps, and some steps may be in a different order than the described order of steps to be performed or concurrently with other steps. Furthermore, those skilled in the art will understand that the steps shown in the flowcharts are not exclusive and may include other steps, or that one or more of the flowcharts may be deleted without departing from the scope of the present disclosure steps.

以上描述的根据本公开的实施例可被实现为能够由各种计算机装置运行的程序,并且可被记录在计算机可读存储介质上。计算机可读存储介质可单独地或者组合地包括程序指令、数据文件和数据结构。在存储介质上记录的程序指令可被专门设计或配置用于本公开,或者对于计算机软件领域的普通技术人员而言可以是已知的或者可用的。The above-described embodiments according to the present disclosure can be implemented as programs that can be executed by various computer devices, and can be recorded on a computer-readable storage medium. Computer-readable storage media may include program instructions, data files, and data structures, alone or in combination. The program instructions recorded on the storage medium may be specially designed or configured for the present disclosure, or may be known or available to those of ordinary skill in the computer software arts.

计算机可读存储介质可包括本公开的实施例中使用的信息。例如,计算机可读存储介质可包括比特流,并且比特流可包含上面在本发明的实施例中描述的信息。Computer-readable storage media may include information used in embodiments of the present disclosure. For example, a computer-readable storage medium may include a bitstream, and the bitstream may contain the information described above in the embodiments of the present invention.

计算机可读存储介质可包括非暂时性计算机可读介质。Computer-readable storage media may include non-transitory computer-readable media.

计算机可读存储介质的示例可包括被专门配置用于记录和运行程序指令的所有类型的硬件装置,诸如,磁介质(诸如硬盘、软盘和磁带)、光学介质(诸如致密盘(CD)-ROM和数字多功能盘(DVD))、磁光介质(诸如软光盘、ROM、RAM和闪存)。程序指令的示例包括机器代码(诸如由编译器创建的代码)和能够由计算机使用解释器执行的高级语言代码。硬件装置可被配置为作为一个或更多个软件模块进行操作以执行本公开的操作,反之亦可。Examples of computer-readable storage media may include all types of hardware devices specially configured to record and execute program instructions, such as magnetic media (such as hard disks, floppy disks, and magnetic tapes), optical media (such as compact disk (CD)-ROMs) and Digital Versatile Disc (DVD)), magneto-optical media such as floppy disk, ROM, RAM and flash memory. Examples of program instructions include machine code, such as code created by a compiler, and high-level language code that can be executed by a computer using an interpreter. A hardware device may be configured to operate as one or more software modules to perform the operations of the present disclosure, and vice versa.

如上所述,虽然已基于特定细节(诸如详细组件和有限数量的实施例和附图)描述了本公开,但是所述特定细节仅被提供用于容易理解本公开,本公开不限于这些实施例,本领域技术人员将根据以上描述实践各种改变和修改。As described above, although the present disclosure has been described based on specific details, such as detailed components and a limited number of embodiments and drawings, the specific details are provided only to facilitate understanding of the present disclosure and the present disclosure is not limited to these embodiments , those skilled in the art will practice various changes and modifications based on the above description.

因此,应理解的是,本实施例的精神不受限于上述实施例,并且所附权利要求及其等同物和对它们的修改落入本公开的范围内。Therefore, it should be understood that the spirit of the present embodiment is not limited to the above-described embodiment and that the appended claims and their equivalents and modifications thereto fall within the scope of the present disclosure.

Claims (20)

1. A decoding method, comprising:
sharing the coding decision information of the representative channel of the target block as the coding decision information of the target channel of the target block; and is
Decoding a target block using coding decision information of the target channel.
2. The decoding method of claim 1, further comprising: receiving a bitstream including information on a target block,
wherein the information on the target block includes coding decision information of the representative channel, and
wherein the information on the target block does not include coding decision information of the target channel.
3. The decoding method of claim 1, wherein the coding decision information of the representative channel is transform skip information indicating whether a transform is to be skipped.
4. The decoding method of claim 1, wherein the coding decision information of the representative channel is transform selection information indicating which transform is to be used for a transform block of the channel.
5. The decoding method of claim 1, wherein the coding decision information of the representative channel is intra-coding decision information of the representative channel.
6. The decoding method of claim 1, wherein the representative channel and the target channel are channels in a YCbCr color space.
7. The decoding method of claim 1, wherein:
the representative channel is a luminance channel, and
the target channel is a chrominance channel.
8. The decoding method of claim 1, wherein the representative channel is a color channel having a highest correlation with a luminance signal.
9. The decoding method of claim 1, wherein the representative channel is determined by an index in the bitstream indicating the selected representative channel.
10. The decoding method according to claim 1, wherein the sharing operation is performed when image properties of a plurality of channels of the target block are similar to each other.
11. The decoding method of claim 10, wherein when the intra prediction mode of the chroma channel of the target block is a direct mode, the image properties of the plurality of channels are determined to be similar to each other.
12. The decoding method of claim 1, wherein:
when cross-channel prediction is used, the sharing operation is performed, and
whether cross-channel prediction is used is derived based on information obtained from the bitstream.
13. The decoding method of claim 1, wherein:
when cross-channel prediction is used, the sharing operation is performed, and
determining whether to use cross-channel prediction based on an intra prediction mode of a target block.
14. The decoding method of claim 13, wherein when the INTRA prediction mode of the target block is one of an INTRA _ CCLM mode, an INTRA _ MMLM mode, and an INTRA _ MFLM mode, the cross-channel prediction is used.
15. The decoding method of claim 1, wherein whether the sharing operation is to be performed is determined based on a size of a target block.
16. The decoding method of claim 1, wherein the coding decision information of the representative channel of the plurality of channels of the target block is used for all of the plurality of channels.
17. An encoding method, comprising:
determining coding decision information of a representative channel of a target block; and is
Performing encoding on a target block using the encoding decision information of the representative channel,
Wherein the coding decision information of the representative channel is shared with a further channel of the target block.
18. The encoding method of claim 17, further comprising: generating a bitstream including information on the target block,
wherein the information on the target block includes coding decision information of the representative channel, and
wherein the information on the target block does not include coding decision information of the further channel.
19. The encoding method of claim 17, wherein the representative channel and the further channel are channels in a YCbCr color space.
20. A computer-readable storage medium storing a bitstream for image decoding, the bitstream comprising:
information on the target block is transmitted to the mobile station,
wherein the information on the target block includes encoding decision information of a representative channel of the target block,
wherein the coding decision information of the representative channel of the target block is used and shared as the coding decision information of the target channel of the target block, and
wherein decoding of the target block is performed using the coding decision information of the target channel.
CN201880088927.1A 2017-12-07 2018-12-07 Method and apparatus for encoding and decoding using selective information sharing between channels Active CN111699682B (en)

Applications Claiming Priority (9)

Application Number Priority Date Filing Date Title
KR10-2017-0167729 2017-12-07
KR20170167729 2017-12-07
KR20180079012 2018-07-06
KR10-2018-0079012 2018-07-06
KR20180106479 2018-09-06
KR10-2018-0106479 2018-09-06
KR20180114333 2018-09-21
KR10-2018-0114333 2018-09-21
PCT/KR2018/015573 WO2019112394A1 (en) 2017-12-07 2018-12-07 Method and apparatus for encoding and decoding using selective information sharing between channels

Publications (2)

Publication Number Publication Date
CN111699682A true CN111699682A (en) 2020-09-22
CN111699682B CN111699682B (en) 2025-06-06

Family

ID=67064587

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880088927.1A Active CN111699682B (en) 2017-12-07 2018-12-07 Method and apparatus for encoding and decoding using selective information sharing between channels

Country Status (2)

Country Link
KR (2) KR20190067732A (en)
CN (1) CN111699682B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116257494A (en) * 2021-04-21 2023-06-13 华为技术有限公司 Method, system and computer equipment for aggregating communication

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7119236B2 (en) * 2019-01-09 2022-08-16 ベイジン、ターチア、インターネット、インフォメーション、テクノロジー、カンパニー、リミテッド Video Coding Using Cross-Component Linear Models
JP7676314B2 (en) 2019-04-12 2025-05-14 北京字節跳動網絡技術有限公司 Video processing method, device, storage medium and storage method
WO2020244663A1 (en) 2019-06-06 2020-12-10 Beijing Bytedance Network Technology Co., Ltd. Applicability of implicit transform selection
WO2020244662A1 (en) 2019-06-06 2020-12-10 Beijing Bytedance Network Technology Co., Ltd. Simplified transform coding tools
WO2020256524A1 (en) * 2019-06-20 2020-12-24 한국전자통신연구원 Method and apparatus for image encoding and image decoding using prediction based on block type
CN114026865B (en) * 2019-06-21 2025-01-07 北京字节跳动网络技术有限公司 Codec tools for chroma components
CN114258674B (en) * 2019-07-19 2024-12-31 Tcl王牌电器(惠州)有限公司 Image encoding/decoding method and apparatus using IBC
ES3023935T3 (en) * 2019-07-21 2025-06-03 Lg Electronics Inc Image encoding/decoding method and device for signaling chroma component prediction information according to whether palette mode is applicable, and method for transmitting bitstream
CN118741106A (en) * 2019-08-06 2024-10-01 现代自动车株式会社 Method and apparatus for intra-frame prediction encoding of video data
CN118764624A (en) * 2019-08-27 2024-10-11 现代自动车株式会社 Video encoding and decoding using differential coding
EP4018662A4 (en) * 2019-09-21 2023-05-03 Beijing Bytedance Network Technology Co., Ltd. SIZE LIMITATION BASED ON CHROMA INTRA MODE
KR20220050966A (en) * 2019-09-25 2022-04-25 엘지전자 주식회사 Transformation-based video coding method and apparatus
CN114731401A (en) * 2019-09-25 2022-07-08 Lg电子株式会社 Image encoding/decoding method and apparatus for determining segmentation mode based on color format, and method for transmitting bitstream
EP4017009A4 (en) * 2019-10-04 2022-11-23 LG Electronics Inc. TRANSFORMATION-BASED IMAGE CODING METHOD AND APPARATUS THEREOF
JP7404526B2 (en) * 2019-10-28 2023-12-25 北京字節跳動網絡技術有限公司 Syntactic signaling and parsing based on color components
WO2021086050A1 (en) * 2019-10-29 2021-05-06 엘지전자 주식회사 Transform-based image coding method and apparatus therefor
BR112022009084A2 (en) * 2019-11-11 2022-07-26 Lg Electronics Inc IMAGE CODING METHOD BASED ON A TRANSFORM, AND EQUIPMENT OF THE SAME
AU2019275553B2 (en) * 2019-12-03 2022-10-06 Canon Kabushiki Kaisha Method, apparatus and system for encoding and decoding a coding tree unit
AU2019275552B2 (en) * 2019-12-03 2022-10-13 Canon Kabushiki Kaisha Method, apparatus and system for encoding and decoding a coding tree unit
CN120151522A (en) * 2020-01-10 2025-06-13 Lg 电子株式会社 Image decoding method and device using BDPCM
BR112022020449A2 (en) 2020-04-10 2022-11-29 Beijing Bytedance Network Tech Co Ltd METHOD AND APPARATUS FOR PROCESSING VIDEO DATA, AND NON-TRANSITORY COMPUTER READABLE STORAGE AND RECORDING MEDIA
WO2021230618A1 (en) 2020-05-11 2021-11-18 엘지전자 주식회사 Image coding method and device therefor
CN114598871B (en) * 2020-12-03 2024-08-23 杭州海康威视数字技术股份有限公司 Encoding and decoding method, device and equipment
EP4510568A1 (en) * 2022-04-12 2025-02-19 LG Electronics Inc. Image encoding/decoding method, method for transmitting bitstream, and recording medium having bitstream stored therein
WO2024225846A1 (en) * 2023-04-28 2024-10-31 주식회사 윌러스표준기술연구소 Video signal processing method, and device therefor

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012172634A (en) * 2011-02-23 2012-09-10 Toyota Motor Corp Leakage inspection device, and leakage inspection method
KR20120112037A (en) * 2012-03-16 2012-10-11 주식회사 아이벡스피티홀딩스 Method of decoding moving pictures in intra prediction
US20140016693A1 (en) * 2012-07-10 2014-01-16 Broadcom Corporation Real-time video coding system of multiple temporally scaled video and of multiple profile and standards based on shared video coding information
CN105325004A (en) * 2013-04-19 2016-02-10 三星电子株式会社 Video encoding method and apparatus, and video decoding method and apparatus based on signaling of sample adaptive offset parameters
CN105979272A (en) * 2011-02-16 2016-09-28 联发科技股份有限公司 method and device for sharing information in video coding and decoding system
US20170272782A1 (en) * 2016-03-21 2017-09-21 Qualcomm Incorporated Coding video data using a two-level multi-type-tree framework
US20170280159A1 (en) * 2014-09-01 2017-09-28 Hfi Innovation Inc. Method of Intra Picture Block Copy for Screen Content and Video Coding
US20170332106A1 (en) * 2014-12-09 2017-11-16 Gurulogic Microsystems Oy Encoder, decoder and method for images, video and audio

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105979272A (en) * 2011-02-16 2016-09-28 联发科技股份有限公司 method and device for sharing information in video coding and decoding system
JP2012172634A (en) * 2011-02-23 2012-09-10 Toyota Motor Corp Leakage inspection device, and leakage inspection method
KR20120112037A (en) * 2012-03-16 2012-10-11 주식회사 아이벡스피티홀딩스 Method of decoding moving pictures in intra prediction
US20140016693A1 (en) * 2012-07-10 2014-01-16 Broadcom Corporation Real-time video coding system of multiple temporally scaled video and of multiple profile and standards based on shared video coding information
CN105325004A (en) * 2013-04-19 2016-02-10 三星电子株式会社 Video encoding method and apparatus, and video decoding method and apparatus based on signaling of sample adaptive offset parameters
US20170280159A1 (en) * 2014-09-01 2017-09-28 Hfi Innovation Inc. Method of Intra Picture Block Copy for Screen Content and Video Coding
US20170332106A1 (en) * 2014-12-09 2017-11-16 Gurulogic Microsystems Oy Encoder, decoder and method for images, video and audio
US20170272782A1 (en) * 2016-03-21 2017-09-21 Qualcomm Incorporated Coding video data using a two-level multi-type-tree framework

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116257494A (en) * 2021-04-21 2023-06-13 华为技术有限公司 Method, system and computer equipment for aggregating communication
CN116257494B (en) * 2021-04-21 2023-12-08 华为技术有限公司 A method, system and computer equipment for converged communication

Also Published As

Publication number Publication date
KR20190067732A (en) 2019-06-17
CN111699682B (en) 2025-06-06
KR20240066144A (en) 2024-05-14

Similar Documents

Publication Publication Date Title
CN111699682B (en) Method and apparatus for encoding and decoding using selective information sharing between channels
US11902553B2 (en) Method and apparatus for encoding and decoding using selective information sharing between channels
CN110463201B (en) Prediction method and device using reference block
CN110476425B (en) Prediction method and device based on block form
CN109314785B (en) Method and apparatus for deriving motion prediction information
CN111567045A (en) Method and apparatus for using inter prediction information
US20240348788A1 (en) Method, apparatus, and recording medium for encoding/decoding image by using geometric partitioning
CN114450946A (en) Method, apparatus and recording medium for encoding/decoding images by using geometric partitions
CN111684801B (en) Bidirectional intra-frame prediction method and device
CN111919448A (en) Method and apparatus for image encoding and image decoding using temporal motion information
CN116325730A (en) Method, device and recording medium for encoding/decoding images by using geometric partitioning
US20220272321A1 (en) Method, device, and recording medium for encoding/decoding image using reference picture
CN114270865A (en) Method, apparatus and recording medium for encoding/decoding images
US20220201295A1 (en) Method, apparatus and storage medium for image encoding/decoding using prediction
CN114270828A (en) Method and apparatus for image encoding and image decoding using block type-based prediction
CN119013993A (en) Method, apparatus and recording medium for image encoding/decoding
CN119318146A (en) Method, apparatus and recording medium for image encoding/decoding
US11838506B2 (en) Method, apparatus and storage medium for image encoding/decoding
CN115066895A (en) Method and apparatus for encoding/decoding image by using palette mode, and recording medium
CN114556922A (en) Method, apparatus and recording medium for encoding/decoding image by using partition
CN120035989A (en) Image encoding/decoding method and device and recording medium storing bit stream
CN114788288A (en) Transform information encoding/decoding method and apparatus, and bit stream storage medium
CN114342388A (en) Method and apparatus for image encoding and image decoding using region segmentation
CN118355663A (en) Method, device and recording medium for encoding/decoding an image
CN120359755A (en) Method and apparatus for image encoding/decoding and recording medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant