JP7652742B2

JP7652742B2 - VIDEO ENCODING METHOD, VIDEO ENCODING APPARATUS, STORAGE MEDIUM AND COMPUTER PROGRAM - Patent application

Info

Publication number: JP7652742B2
Application number: JP2022109740A
Authority: JP
Inventors: シュウ，シャオユウ; チェン，イーウェン; ワン，シャンリン
Original assignee: Beijing Dajia Internet Information Technology Co Ltd
Current assignee: Beijing Dajia Internet Information Technology Co Ltd
Priority date: 2019-01-09
Filing date: 2022-07-07
Publication date: 2025-03-27
Anticipated expiration: 2040-01-09
Also published as: JP7676485B2; JP7678041B2; JP2023162338A; JP2021192510A; JP2022119936A; JP7303255B2; JP2022172057A; JP2023156465A

Description

本願は、２０１９年１月９日に出願された仮出願第６２/７９０,４２１号に基づき優先
権を主張し、その全部の内容をここに援用する。 This application claims priority to Provisional Application No. 62/790,421, filed January 9, 2019, the entire contents of which are incorporated herein by reference.

本願は、ビデオコーディングと圧縮に関するものである。より具体的には、本願は、ビ
デオコーディングのための複合インターとイントラ予測（ＣＩＩＰ）方法に関する方法お
よび装置に関するものである。 This application relates to video coding and compression. More particularly, this application relates to a method and apparatus relating to a combined inter- and intra-prediction (CIIP) method for video coding.

ビデオデータを圧縮するために、様々なビデオコーディング技術を使用することができ
る。ビデオコーディングは、１つまたは複数のビデオコーディング規格に従って実行され
る。たとえば、ビデオコーディング規格には、多用途ビデオコーディング（ＶＶＣ）、共
同探査テストモデル（ＪＥＭ）、高効率ビデオコーディング（Ｈ.２６５/ＨＥＶＣ）、高
度なビデオコーディング（Ｈ.２６４/ＡＶＣ）、動画エキスパートグループ（ＭＰＥＧ）
コーディングなどが含まれる。ビデオコーディングは、一般に、ビデオ画像またはシーケ
ンスに存在する冗長性を利用する予測方法（例えば、インター予測、イントラ予測など）
を利用する。ビデオコーディング技術の重要な目標は、ビデオ品質の低下を回避または最
小限に抑えながら、ビデオデータを、より低いビットレートを使用する形式に圧縮するこ
とである。 Various video coding techniques may be used to compress the video data. Video coding may be performed according to one or more video coding standards. For example, video coding standards include Versatile Video Coding (VVC), Joint Exploration Test Model (JEM), High Efficiency Video Coding (H.265/HEVC), Advanced Video Coding (H.264/AVC), Moving Picture Experts Group (MPEG)
Video coding generally involves prediction methods (e.g., inter-prediction, intra-prediction, etc.) that exploit redundancy present in a video image or sequence.
An important goal of video coding techniques is to compress video data into a format that uses a lower bit rate while avoiding or minimizing degradation of video quality.

本開示の例は、マージ関連モードの構文シグナリングの効率を改善するための方法を提
供する。 Examples of the present disclosure provide methods for improving the efficiency of syntax signaling for merge-related modes.

本開示の態様によれば、ビデオエンコーディングの方法は、現在の画像の現在のコーディングブロックと関連付けされた、時間順で現在の画像より前にある第１の参照画像と、現在の画像より後にある第２の参照画像を取得すること、前記現在のコーディングブロックから前記第１の参照画像内の参照ブロックまでの第１の動きベクトルに基づく第１の予測を取得すること、前記現在のコーディングブロックから前記第２の参照画像内の参照ブロックまでの第２の動きベクトルに基づく第２の予測を取得すること、および少なくとも前記第１の予測および第２の予測に基づいて、前記現在のコーディングブロックの双方向予測を算出すること、を備え、該算出することは、前記現在のコーディングブロックの双方向予測の算出に複合インター・イントラ予測（ＣＩＩＰ）が適用されない条件下において、前記現在のコーディングブロックの双方向予測の算出時に双方向オプティカルフロー（ＢＤＯＦ）を有効にすることを含む。 According to an aspect of the present disclosure, a method of video encoding includes obtaining a first reference image associated with a current coding block of a current image, the first reference image being temporally before the current image and a second reference image being temporally after the current image; obtaining a first prediction based on a first motion vector from the current coding block to a reference block in the first reference image; obtaining a second prediction based on a second motion vector from the current coding block to a reference block in the second reference image; and calculating a bidirectional prediction of the current coding block based on at least the first prediction and the second prediction, wherein the calculating includes enabling bidirectional optical flow (BDOF) when calculating the bidirectional prediction of the current coding block under a condition that a composite inter-intra prediction (CIIP) is not applied to the calculation of the bidirectional prediction of the current coding block.

前述の一般的な説明および以下の詳細な説明の両方は単なる例であり、本開示を限定す
るものではないことを理解されたい。 It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the present disclosure.

本明細書に組み込まれ、その一部を構成する添付の図面は、本開示と一致する例を示し
、説明とともに、本開示の原理を説明するのに役立つ。
本開示の一例による、エンコーダのブロック図である。本開示の一例による、デコーダのブロック図である。本開示の一例による、複合インターとイントラ予測（ＣＩＩＰ）を生成するための方法を示すフローチャートである。本開示の一例による、ＣＩＩＰを生成するための方法を示すフローチャートである。本開示の一例による、マルチタイプツリー構造におけるブロックパーティションを示す図である。本開示の一例による、マルチタイプツリー構造におけるブロックパーティションを示す図である。本開示の一例による、マルチタイプツリー構造におけるブロックパーティションを示す図である。本開示の一例による、マルチタイプツリー構造におけるブロックパーティションを示す図である。本開示の一例による、マルチタイプツリー構造におけるブロックパーティションを示す図である。本開示の一例による、複合インターとイントラ予測（ＣＩＩＰ）を示す図である。本開示の一例による、複合インターとイントラ予測（ＣＩＩＰ）を示す図である。本開示の一例による、複合インターとイントラ予測（ＣＩＩＰ）を示す図である。本開示の一例による、ＭＰＭ候補リスト生成プロセスのフローチャートである。本開示の一例による、ＭＰＭ候補リスト生成プロセスのフローチャートである。本開示の一例による、ＶＶＣにおける既存のＣＩＩＰデザインのワークフローを示す図である。本開示の一例による、ＢＤＯＦを除去することによる提案されたＣＩＩＰ方法のワークフローを示す図である。本開示の一例による、ＰＯＣ距離に基づいて予測リストを選択する、単一予測ベースのＣＩＩＰのワークフローを示す図である。本開示の一例による、ＭＰＭ候補リスト生成のためにＣＩＩＰブロックを有効にするときの方法のフローチャートである。本開示の一例による、ＭＰＭ候補リスト生成のためにＣＩＩＰブロックを無効にするときの方法のフローチャートである。本開示の一例による、ユーザインターフェースと結合されたコンピューティング環境を示す図である。 The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate examples consistent with the present disclosure and, together with the description, serve to explain the principles of the disclosure.
FIG. 2 is a block diagram of an encoder according to an example of the present disclosure. FIG. 2 is a block diagram of a decoder according to an example of the present disclosure. 1 is a flowchart illustrating a method for generating a combined inter-and-intra prediction (CIIP) according to an example of the present disclosure. 4 is a flowchart illustrating a method for generating a CIIP according to an example of the present disclosure. FIG. 2 illustrates a block partition in a multi-type tree structure according to an example of the present disclosure. FIG. 2 illustrates a block partition in a multi-type tree structure according to an example of the present disclosure. FIG. 2 illustrates a block partition in a multi-type tree structure according to an example of the present disclosure. FIG. 2 illustrates a block partition in a multi-type tree structure according to an example of the present disclosure. FIG. 2 illustrates a block partition in a multi-type tree structure according to an example of the present disclosure. FIG. 2 illustrates a diagram showing combined inter-and-intra prediction (CIIP) according to an example of the present disclosure. FIG. 2 illustrates a diagram showing combined inter-and-intra prediction (CIIP) according to an example of the present disclosure. FIG. 2 illustrates a diagram showing combined inter-and-intra prediction (CIIP) according to an example of the present disclosure. 1 is a flowchart of an MPM candidate list generation process according to an example of the present disclosure. 1 is a flowchart of an MPM candidate list generation process according to an example of the present disclosure. FIG. 2 illustrates a workflow of an existing CIIP design in a VVC according to an example of the present disclosure. FIG. 1 illustrates a workflow of the proposed CIIP method by removing BDOF according to an example of the present disclosure. FIG. 1 illustrates a workflow of a single-prediction-based CIIP that selects a list of predictions based on POC distance, according to an example of the present disclosure. 1 is a flowchart of a method when enabling a CIIP block for MPM candidate list generation according to an example of the present disclosure. 13 is a flowchart of a method for disabling a CIIP block for MPM candidate list generation according to an example of the present disclosure. FIG. 1 illustrates a computing environment coupled with a user interface according to an example of the present disclosure.

ここで、本開示の例を詳細に参照し、その例を添付の図面に示す。以下の説明は、別段
の記載がない限り、異なる図面における同じ番号が同じまたは類似の要素を表す添付の図
面を参照している。本開示の例の以下の説明に記載されている実施の形態は、本開示と一
致するすべての実施の形態を表すわけではない。その代わり、それらは、添付の特許請求
の範囲に記載されている本開示に関連する態様と一致する装置および方法の単なる例であ
る。 Reference will now be made in detail to examples of the present disclosure, examples of which are illustrated in the accompanying drawings. The following description refers to the accompanying drawings, in which the same numbers in different drawings represent the same or similar elements, unless otherwise noted. The embodiments described in the following description of examples of the present disclosure do not represent all embodiments consistent with the present disclosure. Instead, they are merely examples of apparatus and methods consistent with aspects related to the present disclosure as set forth in the appended claims.

本開示で使用される用語は、特定の実施の形態を説明することのみを目的としており、
本開示を限定することを意図するものではない。本開示および添付の特許請求の範囲で使
用されるように、単数形「a」、「an」、および「the」は、文脈で明確に示されていない
限り、複数形も含むことを意図している。ここで使用される「および／または」という用
語は、関連するリストされたアイテムの１つまたは複数の任意またはすべての可能な組み
合わせを意味し、含むことを意図することも理解されたい。 The terminology used in this disclosure is for the purpose of describing particular embodiments only.
It is not intended to limit the disclosure. As used in this disclosure and the appended claims, the singular forms "a,""an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It is also to be understood that the term "and/or" as used herein is intended to mean and include any and all possible combinations of one or more of the associated listed items.

ここで、「第１」、「第２」、「第３」などの用語を使用して様々な情報を説明するこ
とができるが、情報はこれらの用語によって限定されるべきではないことを理解されたい
。これらの用語は、あるカテゴリの情報を別のカテゴリと区別するためにのみ使用される
。例えば、本開示の範囲から逸脱することなく、第１の情報は、第２の情報と呼ばれるこ
とができ、同様に、第２の情報は、第１の情報と呼ばれることもできる。ここで使用され
る場合、「もし」という用語は、文脈に応じて、「ときに」または「に際して」または「
判断に応じて」を意味すると理解され得る。 Herein, terms such as "first,""second," and "third" may be used to describe various pieces of information, but it should be understood that the information should not be limited by these terms. These terms are used only to distinguish one category of information from another. For example, the first information may be referred to as the second information, and similarly, the second information may be referred to as the first information, without departing from the scope of this disclosure. As used herein, the term "if" may also be used as "when" or "in the event of" or "when," depending on the context.
may be understood to mean "at discretion."

ＨＥＶＣ規格の第１のバージョンは、２０１３年１０月に完成し、これは、前世代のビ
デオコーディング規格Ｈ．２６４／ＭＰＥＧＡＶＣと比較して、約５０％のビットレー
ト節約または同等の知覚品質を提供する。ＨＥＶＣ規格は、その前身よりも大幅なコーデ
ィングの改善を提供しているが、ＨＥＶＣにコーディングツールを追加することで、優れ
たコーディング効率を達成できるという証拠がある。これに基づいて、ＶＣＥＧとＭＰＥ
Ｇの両方が、将来のビデオコーディング標準化のための新しいコーディングテクノロジー
の調査作業を開始した。コーディング効率の大幅な向上を可能にする高度なテクノロジー
の重要な研究が開始されるために、２０１５年１０月に、IＴＵ-ＴＶＥＣＧとIＳＯ/IＥ
ＣＭＰＥＧによって１つのJoint Video Exploration Team（ＪＶＥＴ）が結成され
た。共同探査モデル（ＪＥＭ）と呼ばれる１つの参照ソフトウェアは、ＨＥＶＣテストモ
デル（ＨＭ）の上にいくつかの追加のコーディングツールを統合することにより、ＪＶＥ
Ｔによって維持されていた。 The first version of the HEVC standard was finalized in October 2013, and it offers approximately 50% bitrate savings or equivalent perceptual quality compared to the previous generation video coding standard H.264/MPEG AVC. Although the HEVC standard offers significant coding improvements over its predecessor, there is evidence that superior coding efficiency can be achieved by adding additional coding tools to HEVC. Based on this, VCEG and MPEG AVC have agreed to work together to develop a standard that will allow for the creation of a standard that will provide a high level of coding efficiency.
Both the ITU-TVECG and the ISO/IE Group have begun work on investigating new coding technologies for future video coding standardization. In October 2015, the ITU-TVECG and the ISO/IE Group agreed to begin significant research into advanced technologies that could enable significant improvements in coding efficiency.
A Joint Video Exploration Team (JVET) was formed by C-MPEG. A reference software called the Joint Exploration Model (JEM) is a JVE-based test model that integrates some additional coding tools on top of the HEVC Test Model (HM).
It was maintained by T.

２０１７年１０月に、ＨＥＶＣを超える機能を備えたビデオ圧縮に関する共同提案募集
（ＣｆＰ）が、IＴＵ-ＴおよびIＳＯ/IＥＣによって発行された。２０１８年４月に、第
１０回ＪＶＥＴ会議で、２３のＣｆＰ応答が受信され評価され、ＨＥＶＣよりも約４０％
の圧縮効率ゲインが実証された。このような評価結果に基づいて、ＪＶＥＴは、Versatil
e Video Coding（ＶＶＣ）と呼ばれる新世代のビデオコーディング規格を開発するため
の新しいプロジェクトを立ち上げた。同じ月に、ＶＶＣ規格の参照実装を実証するために
、ＶＶＣテストモデル（ＶＴＭ）と呼ばれる１つの参照ソフトウェアコードベースが確立
された。 In October 2017, a Joint Call for Proposals (CfP) for video compression with capabilities beyond HEVC was issued by ITU-T and ISO/IEC. In April 2018, at the 10th JVET meeting, 23 CfP responses were received and evaluated, with approximately 40% more support than HEVC.
Based on these evaluation results, JVET has decided to develop a compression efficiency gain of 100% for Versatile.
In 2013, the National Instruments Board of Education launched a new project to develop a new generation video coding standard called Wide Video Coding (VVC). In the same month, a reference software code base called the VVC Test Model (VTM) was established to demonstrate a reference implementation of the VVC standard.

ＨＥＶＣと同様に、ＶＶＣは、ブロックベースのハイブリッドビデオコーディングフレ
ームワーク上に構成されている。図１（以下に説明）は、一般的なブロックベースのハイ
ブリッドビデオ符号化システムのブロック図を与える。入力ビデオ信号は、ブロック（コ
ーディングユニット（ＣＵ）と呼ばれる。）ごとに処理される。ＶＴＭ-１．０では、Ｃ
Ｕは最大１２８x１２８ピクセルにすることができる。ただし、クアッドツリーのみに基
づいてブロックを区分するＨＥＶＣとは異なり、ＶＶＣでは、クアッド/二元/ターナリー
ツリーに基づくさまざまなローカル特性に適応するために、１つのコーディングツリーユ
ニット（ＣＴＵ）がＣＵに分割される。さらに、ＨＥＶＣにおける複数のパーティション
ユニットタイプの概念が除去され、つまり、ＣＵと予測ユニット（ＰＵ）と変換ユニット
（ＴＵ）の分離がＶＶＣに存在しなくなり、その代わりに、各ＣＵは常に、追加のパーテ
ィションなしで予測と変換の両方の基本単位として使用される。マルチタイプツリー構造
では、１つのＣＴＵが最初にクアッドツリー構造によって区分される。次に、各クアッド
ツリーリーフノードが二元およびターナリツリー構造でさらに区分されることができる。
図図５Ａ、図５Ｂ、図５Ｃ、図５Ｄ、図５Ｄ、図５Ｅ（以下で説明する。）に示すように
、それぞれ、四元パーティショニング、水平二元パーティショニング、垂直二元パーティ
ショニング、水平三元パーティショニング、および垂直三元パーティショニングの５つの
分割タイプがある。 Like HEVC, VVC is built on a block-based hybrid video coding framework. Figure 1 (described below) gives a block diagram of a general block-based hybrid video coding system. The input video signal is processed block by block (called coding unit (CU)). In VTM-1.0,
U can be up to 128x128 pixels. However, unlike HEVC, which partitions blocks based only on quad trees, in VVC, one coding tree unit (CTU) is divided into CUs to accommodate different local characteristics based on quad/binary/ternary trees. In addition, the concept of multiple partition unit types in HEVC is removed, that is, the separation of CU, prediction unit (PU) and transform unit (TU) no longer exists in VVC, and instead, each CU is always used as the basic unit for both prediction and transformation without additional partitions. In the multi-type tree structure, one CTU is first partitioned by a quad tree structure. Then, each quad tree leaf node can be further partitioned by binary and ternary tree structures.
As shown in Figures 5A, 5B, 5C, 5D, 5E (described below), there are five partitioning types: quaternary partitioning, horizontal binary partitioning, vertical binary partitioning, horizontal ternary partitioning, and vertical ternary partitioning, respectively.

図１（以下に説明）では、空間予測および／または時間予測を実行することができる。
空間予測（または「イントラ予測」）は、同一のビデオ画像/スライスにおけるすでにコ
ーディングされた隣接ブロックのサンプル（参照サンプルと呼ばれる。）からのピクセル
を使用して、現在のビデオブロックを予測する。空間予測は、ビデオ信号に固有の空間的
冗長性を低減する。時間予測（「インター予測」または「動き補償予測」とも呼ばれる。
）は、すでにコーディングされたビデオ画像からの再構成されたピクセルを使用して、現
在のビデオブロックを予測する。時間予測は、ビデオ信号に固有の時間的冗長性を低減す
る。特定のＣＵについての時間予測信号は、通常、現在のＣＵとその時間参照との間の動
きの量と方向を示す１つまたは複数の動きベクトル（ＭＶ）によってシグナリングされる
。また、複数の参照画像がサポートされている場合には、１つの参照画像インデックスが
追加で送信される。これは、時間予測信号が参照画像ストアにおけるどの参照画像から来
るかを識別するために使用される。空間予測および／または時間予測の後、エンコーダに
おけるモード決定ブロックは、例えば、レート歪み最適化方法に基づいて、最適な予測モ
ードを選択する。次に、予測ブロックは、現在のビデオブロックから差し引かれ、予測残
差は、変換と量子化を使用して無相関化される。 In FIG. 1 (described below), spatial and/or temporal prediction can be performed.
Spatial prediction (or "intra prediction") predicts a current video block using pixels from samples of already coded neighboring blocks in the same video picture/slice (called reference samples). Spatial prediction reduces spatial redundancy inherent in video signals. Temporal prediction (also called "inter prediction" or "motion compensated prediction") predicts a current video block using pixels from samples of already coded neighboring blocks in the same video picture/slice (called reference samples).
) predicts a current video block using reconstructed pixels from already coded video pictures. Temporal prediction reduces the temporal redundancy inherent in video signals. The temporal prediction signal for a particular CU is typically signaled by one or more motion vectors (MVs), which indicate the amount and direction of motion between the current CU and its temporal references. Also, if multiple reference pictures are supported, one reference picture index is additionally transmitted, which is used to identify which reference picture in the reference picture store the temporal prediction signal comes from. After spatial and/or temporal prediction, a mode decision block in the encoder selects an optimal prediction mode, for example based on a rate-distortion optimization method. The prediction block is then subtracted from the current video block, and the prediction residual is decorrelated using transform and quantization.

量子化された残差係数は、逆量子化と逆変換されて、再構成された残差を形成し、次に
予測ブロックに追加されて、ＣＵの再構成された信号を形成する。デブロッキングフィル
ター、サンプルアダプティブオフセット（ＳＡＯ）、アダプティブインループフィルター
（ＡＬＦ）などのさらなるインループフィルタリングは、参照画像ストアに配置され将来
のビデオブロックのコーディングに使用される前に、再構成されたＣＵに適用できる。出
力ビデオビットストリームを形成するために、コーディングモード（インターまたはイン
トラ）、予測モード情報、動き情報、および量子化された残差係数は、すべてエントロピ
ーコーディングユニットに送信され、さらに圧縮およびパックされてビットストリームを
形成する。 The quantized residual coefficients are dequantized and inverse transformed to form a reconstructed residual, which is then added to the prediction block to form a reconstructed signal for the CU. Further in-loop filtering, such as a deblocking filter, sample adaptive offset (SAO), or adaptive in-loop filter (ALF), can be applied to the reconstructed CU before it is placed in the reference picture store and used for coding future video blocks. To form an output video bitstream, the coding mode (inter or intra), prediction mode information, motion information, and the quantized residual coefficients are all sent to an entropy coding unit for further compression and packing to form a bitstream.

図２（以下に説明）は、ブロックベースのビデオデコーダの一般的なブロック図を示す
。ビデオビットストリームは、最初にエントロピーデコードユニットでエントロピーデコ
ードされる。コーディングモードおよび予測情報は、空間予測ユニット（イントラコーデ
ィングされている場合）または時間予測ユニット（インターコーディングされている場合
）のいずれかに送信されて、予測ブロックを形成する。残差変換係数は、逆量子化ユニッ
トと逆変換ユニットに送信されて、残差ブロックを再構成する。次に、予測ブロックと残
差ブロックは、一緒に加算される。再構成されたブロックは、参照画像ストアに格納され
る前に、インループフィルタリングをさらに通過することができる。次に、参照画像スト
アにおける再構成されたビデオは、ディスプレイデバイスを駆動するために送出され、将
来のビデオブロックを予測するためにも使用される。 FIG. 2 (described below) shows a general block diagram of a block-based video decoder. The video bitstream is first entropy decoded in an entropy decoding unit. The coding mode and prediction information are sent to either a spatial prediction unit (if intra-coded) or a temporal prediction unit (if inter-coded) to form a prediction block. The residual transform coefficients are sent to an inverse quantization unit and an inverse transform unit to reconstruct the residual block. The prediction block and the residual block are then added together. The reconstructed block may further go through in-loop filtering before being stored in a reference image store. The reconstructed video in the reference image store is then sent to drive a display device and is also used to predict future video blocks.

図１は、典型的なエンコーダ１００を示す。エンコーダ１００は、ビデオ入力１１０、
動き補償１１２、動き推定１１４、イントラ／インターモード決定１１６、ブロック予測
器１４０、加算器１２８、変換１３０、量子化１３２、予測関連情報１４２、イントラ予
測１１８、画像バッファ１２０、逆量子化１３４、逆変換１３６、加算器１２６、メモリ
１２４、インループフィルタ１２２、エントロピーコーディング１３８、およびビットス
トリーム１４４を有する。 1 shows a typical encoder 100. The encoder 100 receives a video input 110,
It has motion compensation 112, motion estimation 114, intra/inter mode decision 116, block predictor 140, adder 128, transform 130, quantization 132, prediction related information 142, intra prediction 118, image buffer 120, inverse quantization 134, inverse transform 136, adder 126, memory 124, in-loop filter 122, entropy coding 138, and bitstream 144.

図２は、典型的なデコーダ２００のブロック図を示す。デコーダ２００は、ビットスト
リーム２１０、エントロピーデコード２１２、逆量子化２１４、逆変換２１６、加算器２
１８、イントラ／インターモード選択２２０、イントラ予測２２２、メモリ２３０、イン
ループフィルタ２２８、動き補償２２４、画像バッファ２２６、予測関連情報２３４、お
よびビデオ出力２３２を有する。 2 shows a block diagram of an exemplary decoder 200. The decoder 200 includes a bitstream 210, an entropy decoder 212, an inverse quantizer 214, an inverse transform 216, an adder 218, and an inverse decoder 219.
18 , intra/inter mode selection 220 , intra prediction 222 , memory 230 , in-loop filter 228 , motion compensation 224 , image buffer 226 , prediction related information 234 , and video output 232 .

図３は、本開示による、複合インターとイントラ予測（ＣＩＩＰ）を生成するための例
示的な方法３００を示す。 FIG. 3 illustrates an example method 300 for generating a combined inter-and-intra prediction (CIIP) in accordance with this disclosure.

ステップ３１０において、現在の予測ブロックに関連付けられる第１の参照画像と第２
の参照画像を取得する。ここで、第１の参照画像は表示順で現在の画像の前にあり、第２
の参照画像は表示順で現在の画像の後にある。 In step 310, a first reference image and a second reference image associated with the current prediction block are
2, where the first reference image is before the current image in display order, and the second
The reference image is after the current image in display order.

ステップ３１２において、現在の予測ブロックから第１の参照画像内の参照ブロックへ
の第１の動きベクトルＭＶ０に基づいて、第１の予測Ｌ０を取得する。 In step 312, a first prediction L0 is obtained based on a first motion vector MV0 from the current prediction block to a reference block in a first reference image.

ステップ３１４において、現在の予測ブロックから第２の参照画像内の参照ブロックへ
の第２の動きベクトルＭＶ１に基づいて、第２の予測Ｌ１を取得する。 In step 314, a second prediction L1 is obtained based on a second motion vector MV1 from the current prediction block to a reference block in a second reference image.

図４は、本開示による、ＣＩＩＰを生成するための例示的な方法を示す。たとえば、当
該方法は、ＣＩＩＰを生成するために、単一予測ベースのインター予測とＭＰＭベースの
イントラ予測が含まれる。 4 illustrates an exemplary method for generating a CIIP according to the present disclosure, for example, the method includes uni-prediction based inter prediction and MPM based intra prediction to generate a CIIP.

ステップ４１０において、現在の予測ブロックに関連付けられる参照画像リストにおけ
る参照画像を取得する。 In step 410, a reference picture in the reference picture list associated with the current prediction block is obtained.

ステップ４１２において、現在の画像から第１の参照画像への第１の動きベクトルに基
づいて、インター予測を生成する。 In step 412, an inter prediction is generated based on a first motion vector from the current picture to the first reference picture.

ステップ４１４において、現在の予測ブロックに関連付けられるイントラ予測モードを
取得する。 At step 414, the intra-prediction mode associated with the current prediction block is obtained.

ステップ４１６において、イントラ予測に基づいて、現在の予測ブロックのイントラ予
測を生成する。 At step 416, an intra prediction of the current predicted block is generated based on the intra prediction.

ステップ４１８において、インター予測とイントラ予測を平均することにより、現在の
予測ブロックの最終予測を生成する。 In step 418, the inter prediction and the intra prediction are averaged to generate a final prediction for the current predicted block.

ステップ４２０において、現在の予測ブロックが、最も可能性の高いモード（ＭＰＭ）
ベースのイントラモード予測について、インターモードまたはイントラモードのどちらと
して扱われるかを特定する。 In step 420, the current prediction block is selected based on the most probable mode (MPM).
For the base intra-mode prediction, specify whether it is treated as inter-mode or intra-mode.

図５Ａは、本開示の一例による、マルチタイプツリー構造におけるブロック四元パーテ
ィションを示す図を示す。 FIG. 5A illustrates a diagram illustrating block quad partitions in a multi-type tree structure according to an example of the present disclosure.

図５Ｂは、本開示の一例による、マルチタイプツリー構造におけるブロック垂直二元パ
ーティションを示す図を示す。 FIG. 5B illustrates a diagram illustrating block vertical binary partitioning in a multi-type tree structure according to an example of the present disclosure.

図５Ｃは、本開示の一例による、マルチタイプツリー構造におけるブロック水平二元パ
ーティションを示す図を示す。 FIG. 5C illustrates a diagram illustrating block horizontal binary partitioning in a multi-type tree structure according to an example of the present disclosure.

図５Ｄは、本開示の一例による、マルチタイプツリー構造におけるブロック垂直三元パ
ーティションを示す図を示す。 FIG. 5D illustrates a diagram illustrating block vertical ternary partitioning in a multi-type tree structure according to an example of the present disclosure.

図５Ｅは、本開示の一例による、マルチタイプツリー構造におけるブロック水平三元パ
ーティションを示す図を示す。 FIG. 5E illustrates a diagram illustrating block horizontal ternary partitioning in a multi-type tree structure according to an example of the present disclosure.

複合インターとイントラ予測
図１、図２に示されるように、インターとイントラ予測方法は、ハイブリッドビデオコ
ーディングスキームで使用される。ここで、各ＰＵは、時間域または空間域のいずれかの
みで、相関性を利用するために、インター予測またはイントラ予測を選択することが許可
され、両方ではできない。ただし、従来の文献で指摘されているように、インター予測ブ
ロックとイントラ予測ブロックによって生成された残差信号は、互いに非常に異なる特性
を示す可能性がある。したがって、２種類の予測を効率的に組み合わせることができれば
、予測残差のエネルギーを削減してコーディング効率を向上させるために、もう１つの正
確な予測が期待できる。さらに、自然なビデオコンテンツでは、動くオブジェクトの動き
が複雑になる可能性がある。たとえば、古いコンテンツ（たとえば、以前にコーディング
された画像に含まれるオブジェクト）と新たな新しいコンテンツ（たとえば、以前にコー
ディングされた画像で除外されるオブジェクト）の両方を含む領域が存在する可能性があ
る。このようなシナリオでは、インター予測も、イントラ予測も、現在のブロックの１つ
の正確な予測を提供できない。 Hybrid Inter and Intra Prediction As shown in FIG. 1 and FIG. 2, the inter and intra prediction method is used in the hybrid video coding scheme. Here, each PU is allowed to select inter prediction or intra prediction to exploit correlations in either the time domain or the spatial domain only, but not both. However, as pointed out in the prior art, the residual signals generated by the inter prediction block and the intra prediction block may exhibit very different characteristics from each other. Therefore, if the two kinds of prediction can be efficiently combined, one more accurate prediction can be expected to reduce the energy of the prediction residual and improve the coding efficiency. Furthermore, in natural video content, the motion of moving objects may be complicated. For example, there may be regions that contain both old content (e.g., objects included in the previously coded image) and new new content (e.g., objects excluded in the previously coded image). In such a scenario, neither inter prediction nor intra prediction can provide one accurate prediction of the current block.

予測効率をさらに改善するために、ＶＶＣ規格には、マージモードによってコーディン
グされた１つのＣＵのイントラ予測とインター予測を組み合わせる複合インターとイント
ラ予測（ＣＩＩＰ）が採用されている。具体的には、マージＣＵごとに、１つの追加フラ
グは、ＣＩＩＰが現在のＣＵに対して有効になっているかどうかを示すために、シグナリ
ングされる。輝度コンポーネントに対して、ＣＩＩＰは、平面モード、ＤＣモード、水平
モード、垂直モードを含む頻繁に使用される４つのイントラモードをサポートする。彩度
コンポーネントに対して、ＤＭ（つまり、彩度は、輝度コンポーネントの同じイントラモ
ードを再利用する）は、追加のシグナリングなしで常に適用される。さらに、既存のＣＩ
ＩＰデザインでは、加重平均が適用され、１つのＣＩＩＰＣＵのインター予測サンプル
とイントラ予測サンプルが結合される。具体的には、平面モードまたはＤＣモードが選択
されている場合において、等しい重み（つまり、０.５）が適用される。それ以外の場合
（つまり、水平モードまたは垂直モードのいずれかが適用される。）、現在のＣＵは最初
に水平（水平モードの場合）または垂直（垂直モードの場合）に４つの同じサイズの領域
に分割される。 To further improve prediction efficiency, the VVC standard adopts Combined Inter and Intra Prediction (CIIP), which combines intra prediction and inter prediction for one CU coded by merge mode. Specifically, for each merge CU, one additional flag is signaled to indicate whether CIIP is enabled for the current CU. For the luma component, CIIP supports four frequently used intra modes, including planar mode, DC mode, horizontal mode, and vertical mode. For the chroma component, DM (i.e., chroma reuses the same intra mode of the luma component) is always applied without additional signaling. In addition, the existing CIIP is signaled to indicate whether CIIP is enabled for the current CU.
In the IP design, a weighted average is applied to combine the inter- and intra-predicted samples of one CIIP CU. Specifically, equal weights (i.e., 0.5) are applied when planar or DC mode is selected. Otherwise (i.e., either horizontal or vertical mode is applied), the current CU is first divided horizontally (for horizontal mode) or vertically (for vertical mode) into four equal-sized regions.

さらに、現在のＶＶＣ動作仕様では、１つのＣＩＩＰＣＵのイントラモードが、最も
可能性の高いモード（ＭＰＭ）メカニズムを介して、その隣接するＣＩＩＰＣＵのイン
トラモードを予測するための予測子として使用されることができる。具体的には、各ＣＩ
ＩＰＣＵについて、その隣接するブロックもＣＩＩＰＣＵである場合において、それ
らの隣接ブロックのイントラモードは、最初に、平面モード、ＤＣモード、水平モード、
および垂直モード内の最も近いモードに丸められ、次に、現在のＣＵのＭＰＭ候補リスト
に追加される。ただし、各イントラＣＵのＭＰＭリストを構成するときには、その隣接す
るブロックの１つは、ＣＩＩＰモードでコーディングされていると、使用不可と見なされ
る。つまり、１つのＣＩＩＰＣＵのイントラモードは、その隣接するイントラＣＵのイ
ントラモードを予測することを許可されていない。図７Ａと図７Ｂ（以下で説明する）は
、イントラＣＵとＣＩＩＰＣＵのＭＰＭリスト生成プロセスを比較する。 Furthermore, in the current VVC operation specification, the intra mode of one CIIP CU can be used as a predictor to predict the intra mode of its neighboring CIIP CUs via a Most Probable Mode (MPM) mechanism.
For an IP CU, if its neighboring blocks are also CIIP CUs, the intra modes of those neighboring blocks are first selected from the following: planar mode, DC mode, horizontal mode,
and the nearest mode within the vertical mode, and then added to the MPM candidate list of the current CU. However, when constructing the MPM list of each intra CU, if one of its neighboring blocks is coded in CIIP mode, it is considered as unavailable. That is, the intra mode of one CIIP CU is not allowed to predict the intra mode of its neighboring intra CU. Figures 7A and 7B (described below) compare the MPM list generation process of intra CU and CIIP CU.

ここで、shiftとo_offsetは、それぞれ、１５-ＢＤと１≪（１４-ＢＤ）+２・（１≪１３
）に等しく、二重予測のＬ０とＬ１予測信号を組み合わせるために適用される右シフト値
とオフセット値である。 Here, shift and _offset are 15-BD and 1<<(14-BD)+2*(1<<13
) which are the right shift and offset values applied to combine the L0 and L1 prediction signals of dual prediction.

図６Ａは、本開示の一例による、水平モードの複合インターとイントラ予測を示す図を
示す。 FIG. 6A shows a diagram illustrating hybrid inter and intra prediction in horizontal mode according to an example of this disclosure.

図６Ｂは、本開示の一例による、垂直モードの複合インターとイントラ予測を示す図を
示す。 FIG. 6B shows a diagram illustrating hybrid inter and intra prediction in vertical mode according to an example of this disclosure.

図６Ｃは、本開示の一例による、平面モードとＤＣモードの複合インターとイントラ予
測を示す図を示す。 FIG. 6C shows a diagram illustrating hybrid inter and intra prediction for planar and DC modes according to an example of this disclosure.

図７Ａは、本開示の一例による、イントラＣＵＳのＭＰＭ候補リスト生成プロセスのフ
ローチャートを示す。 FIG. 7A illustrates a flowchart of an MPM candidate list generation process for intra-CUS according to an example of the present disclosure.

図７Ｂは、本開示の一例による、ＣＩＩＰＣＵのＭＰＭ候補リスト生成プロセスのフ
ローチャートを示す。 FIG. 7B illustrates a flowchart of an MPM candidate list generation process for a CIIP CU according to an example of the present disclosure.

ＣＩＩＰに対する改善
ＣＩＩＰは、従来の動き補償予測の効率を高めることができるが、そのデザインをさら
に改善することができる。具体的には、ＶＶＣにおける既存のＣＩＩＰデザインにおける
以下の問題は、本開示で識別されている。 Improvements to CIIP Although CIIP can improve the efficiency of conventional motion compensated prediction, its design can be further improved. Specifically, the following problems in existing CIIP designs in VVC are identified in this disclosure:

まず、「複合インターとイントラ予測」のセクションで説明したように、ＣＩＩＰは、
インターとイントラ予測のサンプルを組み合わせるため、各ＣＩＩＰＣＵは、その再構
成された隣接サンプルを使用して予測信号を生成する必要がある。これは、１つのＣＩＩ
ＰＣＵのデコードが、その隣接ブロックの完全な再構成に依存していることを意味する
。このような相互依存性のため、実際のハードウェア実装では、ＣＩＩＰは、隣接する再
構成されたサンプルがイントラ予測に利用できるようになる再構成段階で実行する必要が
ある。再構成段階でのＣＵのデコードは、順次に（つまり、１つずつ）実行しなければな
らないため、ＣＩＩＰプロセスに含まれる計算演算（例えば、乗算、加算、ビットシフト
）の数は、リアルタイムデコードの十分なスループットを確保するために、高すぎるもの
とすることができない。 First, as explained in the "Combined Inter and Intra Prediction" section, CIIP is
To combine inter and intra prediction samples, each CIIP CU needs to generate a prediction signal using its reconstructed neighboring samples.
This means that the decoding of a P CU depends on the complete reconstruction of its neighboring blocks. Due to such interdependence, in a practical hardware implementation, CIIP needs to be performed in the reconstruction stage, where neighboring reconstructed samples become available for intra prediction. Since the decoding of CUs in the reconstruction stage must be performed sequentially (i.e., one by one), the number of computation operations (e.g., multiplications, additions, bit shifts) involved in the CIIP process cannot be too high to ensure sufficient throughput for real-time decoding.

「双方向オプティカルフロー」のセクションで述べたように、ＢＤＯＦは、前方および
後方の両方の時間方向からの２つの参照ブロックから、１つのインターコーディングされ
たＣＵが予測されるときに、予測品質が向上するように、有効にされる。図８（以下に説
明）に示すように、現在のＶＶＣでは、ＢＤＯＦも、ＣＩＩＰモードのインター予測サン
プルを生成するために関与している。ＢＤＯＦによるさらなる複雑性を考えると、このよ
うなデザインは、ＣＩＩＰが有効にされる場合、ハードウェアコーデックのエンコード/
デコードスループットが大幅に低下する可能性がある。 As mentioned in the "Bidirectional Optical Flow" section, BDOF is enabled to improve prediction quality when one inter-coded CU is predicted from two reference blocks from both forward and backward temporal directions. As shown in Figure 8 (described below), in current VVC, BDOF is also involved to generate inter prediction samples for CIIP mode. Considering the additional complexity due to BDOF, such a design requires a hardware codec encoding/decoding scheme when CIIP is enabled.
Decoding throughput may be significantly reduced.

次に、現在のＣＩＩＰデザインでは、１つのＣＩＩＰＣＵが、二重予測される１つの
マージ候補を参照する場合に、リストＬ０およびＬ１の両方の動き補償予測信号を生成す
る必要がある。１つまたは複数のＭＶが整数精度でない場合においては、部分的なサンプ
ル位置でサンプルを補間するために、追加の補間プロセスを呼び出しなければならない。
このようなプロセスは、計算上の複雑さを増すだけでなく、外部メモリからより多くの参
照サンプルにアクセスする必要がある場合、メモリ帯域幅も増やす。 Secondly, in the current CIIP design, when one CIIP CU references one merge candidate that is bi-predicted, it is necessary to generate motion compensation prediction signals for both lists L0 and L1. In the case where one or more MVs are not integer precision, an additional interpolation process must be invoked to interpolate samples at partial sample positions.
Such a process not only increases the computational complexity but also increases the memory bandwidth if more reference samples need to be accessed from external memory.

それから、「複合インターとイントラ予測」のセクションで論じたように、現在のＣＩ
ＩＰデザインでは、ＣＩＩＰＣＵのイントラモードとイントラＣＵのイントラモードは
、それらの隣接ブロックのＭＰＭリストを構成するときに異なって扱われる。具体的には
、１つの現在のＣＵがＣＩＩＰモードでコーディングされている場合には、その隣接する
ＣＩＩＰＣＵは、イントラと見なされ、つまり、隣接するＣＩＩＰＣＵのイントラモ
ードがＭＰＭ候補リストに追加されることができる。ただし、現在のＣＵがイントラモー
ドでコーディングされている場合には、その隣接するＣＩＩＰＣＵは、インターと見な
され、つまり、隣接するＣＩＩＰＣＵのイントラモードがＭＰＭ候補リストから除外さ
れている。このような統一されていないデザインは、ＶＶＣ規格の最終バージョンに最適
でない可能性がある。 Then, as discussed in the "Combined Inter and Intra Prediction" section, the current CI
In the IP design, the intra modes of CIIP CUs and intra modes of intra CUs are treated differently when constructing the MPM lists of their neighboring blocks. Specifically, if one current CU is coded in CIIP mode, its neighboring CIIP CUs are considered as intra, i.e., the intra modes of the neighboring CIIP CUs can be added to the MPM candidate list. However, if the current CU is coded in intra mode, its neighboring CIIP CUs are considered as inter, i.e., the intra modes of the neighboring CIIP CUs are excluded from the MPM candidate list. Such a non-uniform design may not be optimal for the final version of the VVC standard.

図８は、本開示の一例による、ＶＶＣにおける既存のＣＩＩＰデザインのワークフロー
を示す図を示す。 FIG. 8 illustrates a diagram showing a workflow of an existing CIIP design in a VVC according to an example of the present disclosure.

ＣＩＩＰの単純化
本開示では、ハードウェアコーデック実装を容易にするために既存のＣＩＩＰデザイン
を単純化するための方法が提供される。一般に、本開示で提案される技術の主なアスペク
トは、以下のように要約される。 Simplification of CIIP In this disclosure, a method is provided for simplifying the existing CIIP design to facilitate hardware codec implementation. In general, the main aspects of the technology proposed in this disclosure are summarized as follows:

まず、ＣＩＩＰコーディング／デコードスループットを改善するために、ＣＩＩＰモー
ドでのインター予測サンプルの生成からＢＤＯＦを除外することが提案される。 First, to improve the CIIP coding/decoding throughput, it is proposed to exclude BDOF from the generation of inter-prediction samples in CIIP mode.

次に、計算上の複雑さおよびメモリ帯域幅の消費を低減するためには、１つのＣＩＩＰ
ＣＵが二重予測される（すなわち、Ｌ０およびＬ１ＭＶの両方を有する）場合におい
ては、インター予測サンプルを生成するために、ブロックを二重予測から単一予測に変換
する方法が提案される。 Next, in order to reduce computational complexity and memory bandwidth consumption, one CIIP
In the case where a CU is bi-predicted (ie, has both L0 and L1 MVs), a method is proposed to convert the block from bi-prediction to uni-prediction to generate inter-predicted samples.

それから、２つの方法は、隣接するブロックのＭＰＭ候補を形成するときに、イントラ
ＣＵとＣＩＩＰのイントラモードを調和させるために提案される。 Then, two methods are proposed to harmonize intra-CU and CIIP intra-modes when forming MPM candidates for neighboring blocks.

ＢＤＯＦのないＣＩＩＰ
「問題ステートメント」のセクションで指摘されているように、ＢＤＯＦは、現在のＣ
Ｕが二重予測されるとき、ＣＩＩＰモードについてのインター予測サンプルを生成するよ
うに、常に有効にされている。ＢＤＯＦのさらなる複雑さのため、既存のＣＩＩＰデザイ
ンは、エンコード/デコードスループットが大幅に低下する可能性があり、特に、リアル
タイムデコードがＶＶＣデコーダーに対して困難になる可能性がある。一方、ＣＩＩＰ
ＣＵについては、その最終予測サンプルは、インター予測サンプルとイントラ予測サンプ
ルを平均することによって生成される。言い換えると、ＢＤＯＦによる改良した予測サン
プルは、ＣＩＩＰＣＵの予測信号として直接使用されない。したがって、従来の二重予
測ＣＵ（ここで、ＢＤＯＦは、予測サンプルを生成するために直接に適用される）と比較
すると、ＢＤＯＦから得られる対応する改善はＣＩＩＰＣＵでは効率が低くなる。した
がって、上記の事情に基づいて、ＣＩＩＰモードのインター予測サンプルを生成するとき
にＢＤＯＦを無効にすることが提案される。図９（以下に説明）は、ＢＤＯＦを除去した
後の提案されたＣＩＩＰプロセスの対応するワークフローを示す。 CIIP without BDOF
As pointed out in the "Problem Statement" section, BDOF is a
When U is bi-predicted, it is always enabled to generate inter-predicted samples for CIIP mode. Due to the additional complexity of BDOF, existing CIIP designs may suffer from significantly reduced encoding/decoding throughput, especially making real-time decoding difficult for VVC decoders. On the other hand, CIIP
For a CIIP CU, its final predicted sample is generated by averaging the inter-predicted sample and the intra-predicted sample. In other words, the improved predicted sample by BDOF is not directly used as the predicted signal of the CIIP CU. Therefore, compared with the traditional bi-predicted CU (where BDOF is directly applied to generate the predicted sample), the corresponding improvement obtained from BDOF is less efficient for the CIIP CU. Therefore, based on the above circumstances, it is proposed to disable BDOF when generating the inter-predicted sample for the CIIP mode. Figure 9 (described below) shows the corresponding workflow of the proposed CIIP process after removing BDOF.

図９は、本開示の一例による、ＢＤＯＦを除去することによる提案されたＣＩＩＰ方法
のワークフローを示す図を示す。 FIG. 9 shows a diagram illustrating the workflow of the proposed CIIP method by removing BDOF according to an example of the present disclosure.

単一予測に基づくＣＩＩＰ
上記のように、１つのＣＩＩＰＣＵによって参照されるマージ候補が二重予測される
ときには、Ｌ０およびＬ１予測信号の両方を生成し、ＣＵ内のサンプルを予測する。メモ
リ帯域幅および補間の複雑さを低減するために、本開示の一実施形態では、（現在のＣＵ
が二重予測されている場合でも）単一予測を使用して生成されたインター予測サンプルの
みを使用して、ＣＩＩＰモードにおけるイントラ予測サンプルと結合することになる。具
体的には、現在のＣＩＩＰＣＵが単一予測の場合において、インター予測サンプルは、
イントラ予測サンプルと直接結合される。それ以外の場合（つまり、現在のＣＵが二重予
測される場合）には、ＣＩＩＰによって使用されるインター予測サンプルは、１つの予測
リスト（Ｌ０またはＬ１）からの単一予測に基づいて生成される。予測リストを選択する
には、さまざまな方法が適用できる。第１の方法では、２つの参照画像によって予測され
る任意のＣＩＩＰブロックに対して、第１の予測（つまり、リストＬ０）を常に選択する
ことが提案されている。 CIIP based on single prediction
As mentioned above, when a merge candidate referenced by one CIIP CU is bi-predicted, both L0 and L1 prediction signals are generated to predict samples in the CU. In order to reduce memory bandwidth and interpolation complexity, in one embodiment of the present disclosure,
Only inter-predicted samples generated using uni-prediction (even if the current CIIP CU is bi-predicted) will be used to combine with intra-predicted samples in CIIP mode. Specifically, when the current CIIP CU is uni-predictive, the inter-predicted samples are
The inter-predicted samples used by the CIIP are directly combined with the intra-predicted samples. Otherwise (i.e., when the current CU is bi-predicted), the inter-predicted samples used by the CIIP are generated based on single prediction from one prediction list (L0 or L1). Different methods can be applied to select the prediction list. The first method proposes to always select the first prediction (i.e., list L0) for any CIIP block predicted by two reference pictures.

第２の方法では、２つの参照画像によって予測される任意のＣＩＩＰブロックに対して
、第２の予測（すなわち、リストＬ１）を常に選択することが提案される。第３の方法で
は、１つの適応方法は、現在の画像からの画像順序カウント（ＰＯＣ）距離が小さい１つ
の参照画像に関連付けられた予測リストが選択される場合に、適用される。図１０（以下
で説明）は、ＰＯＣ距離に基づいて予測リストを選択する、単一予測ベースのＣＩＩＰの
ワークフローを示す。 In the second method, it is proposed to always select the second prediction (i.e., list L1) for any CIIP block predicted by two reference pictures. In the third method, an adaptive method is applied in which the prediction list associated with one reference picture with a small picture order count (POC) distance from the current picture is selected. Figure 10 (described below) shows the workflow of a single prediction-based CIIP that selects a prediction list based on the POC distance.

最後に、最後の方法では、現在のＣＵが単一予測されている場合にのみＣＩＩＰモード
を有効にすることが提案されている。さらに、オーバーヘッドを削減するために、ＣＩＩ
Ｐの有効化/無効化フラグのシグナリングは、現在のＣＩＩＰＣＵの予測方向に依存す
る。現在のＣＵが単一予測される場合においては、ＣＩＩＰフラグがビットストリームで
シグナリングされ、ＣＩＩＰが有効か無効かが示される。それ以外の場合（つまり、現在
のＣＵが二重に予測される場合）は、ＣＩＩＰフラグのシグナリングはスキップされ、常
にfalseと推測され、つまり、ＣＩＩＰは常に無効にされる。 Finally, the last method proposes to enable the CIIP mode only if the current CU is uni-predicted.
The signaling of the P enable/disable flag depends on the prediction direction of the current CIIP CU. In the case where the current CU is mono-predicted, the CIIP flag is signaled in the bitstream to indicate whether CIIP is enabled or disabled. Otherwise (i.e., the current CU is bi-predicted), the signaling of the CIIP flag is skipped and always inferred as false, i.e., CIIP is always disabled.

図１０は、本開示の一例による、ＰＯＣ距離に基づいて予測リストを選択する、単一予
測ベースのＣＩＩＰのワークフローを示す図を示す。 FIG. 10 illustrates a diagram showing a workflow of a single-prediction based CIIP that selects a prediction list based on POC distance according to an example of the present disclosure.

ＭＰＭ候補リスト構成のためのイントラＣＵとＣＩＩＰのイントラモードの調和
上記のように、現在のＣＩＩＰデザインは、イントラＣＵとＣＩＩＰＣＵのイントラ
モードを使用してそれらの隣接ブロックのＭＰＭ候補リストを形成する方法に関して、統
一されていない。具体的には、イントラＣＵとＣＩＩＰＣＵのイントラモードの両方で
は、ＣＩＩＰモードでコーディングされた隣接ブロックのイントラモードが予測できる。
ただし、イントラＣＵのイントラモードのみでは、イントラＣＵのイントラモードが予測
できる。もう１つの統一されたデザインを実現するために、２つの方法は、ＭＰＭリスト
構成のためのイントラＣＵとＣＩＩＰのイントラモードの使用法を調和させて、このセク
ションで提案される。 Harmonization of intra-CU and CIIP intra-mode for MPM candidate list construction As mentioned above, current CIIP designs are not unified with respect to how intra-CU and CIIP CU intra-modes are used to form MPM candidate lists for their neighboring blocks. Specifically, both intra-CU and CIIP CU intra-modes can predict the intra-modes of neighboring blocks coded in CIIP mode.
However, only the intra mode of the intra CU can predict the intra mode of the intra CU. To achieve another unified design, two methods are proposed in this section to harmonize the usage of intra CU and intra mode of CIIP for MPM list construction.

第１の方法では、ＣＩＩＰモードをＭＰＭリスト構成のためのインターモードとして扱
うことが提案されている。具体的には、１つのＣＩＩＰＣＵまたは１つのイントラＣＵ
のいずれかのＭＰＭリストを生成するときには、隣接ブロックがＣＩＩＰモードでコーデ
ィングされている場合、隣接ブロックのイントラモードは使用不可としてマークされる。
このような方法では、ＣＩＩＰブロックのイントラモードを使用してＭＰＭリストを構成
することができない。逆に、第２の方法では、ＣＩＩＰモードをＭＰＭリスト構成のため
のイントラモードとして扱うことが提案されている。具体的には、この方法では、ＣＩＩ
ＰＣＵのイントラモードでは、隣接するＣＩＩＰブロックとイントラブロックの両方の
イントラモードが予測できる。図１１Ａと図１１Ｂ（以下に説明）は、上記の２つの方法
が適用される場合のＭＰＭ候補リスト生成プロセスを示す。 The first method proposes treating the CIIP mode as an inter mode for MPM list construction. Specifically, one CIIP CU or one intra CU
When generating any of the MPM lists, if the neighboring block is coded in CIIP mode, the intra mode of the neighboring block is marked as disabled.
In this method, the intra mode of the CIIP block cannot be used to construct the MPM list. Conversely, in the second method, it is proposed to treat the CIIP mode as an intra mode for the purpose of constructing the MPM list.
For the intra mode of a P CU, the intra modes of both the neighboring CIIP blocks and intra blocks can be predicted. Figures 11A and 11B (described below) show the MPM candidate list generation process when the above two methods are applied.

本開示の他の実施形態は、ここで開示される本開示の仕様および実施を考慮することか
ら当業者には明らかである。本願は、その一般原則に従い、当技術分野で知られているま
たは慣習的な慣行の範囲内にある本開示からの逸脱を含む、本開示の任意の変形、使用、
または適合をカバーすることを意図している。本開示の真の範囲および精神は以下の特許
請求の範囲によって示され、明細書および実施例は単なる例として見なされることが意図
されている。 Other embodiments of the present disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the present disclosure disclosed herein. This application is in accordance with its general principles and includes any modifications, uses, and variations of the present disclosure, including departures from the present disclosure that are within known or customary practice in the art.
It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.

本開示は、上記に記載され、添付の図面に示されている具体的な例に限定されず、その
範囲から逸脱することなく、様々な修正および変更を行うことができることを理解された
い。本開示の範囲は、添付の特許請求の範囲によってのみ制限されることが意図されてい
る。 It is to be understood that the present disclosure is not limited to the specific examples described above and illustrated in the accompanying drawings, and various modifications and changes can be made without departing from the scope thereof, which is intended to be limited only by the scope of the appended claims.

図１１Ａは、本開示の一例による、ＭＰＭ候補リスト生成のためにＣＩＩＰブロックを
有効にするときの方法のフローチャートを示す。 FIG. 11A illustrates a flowchart of a method when enabling a CIIP block for MPM candidate list generation according to an example of the present disclosure.

図１１Ｂは、本開示の一例による、ＭＰＭ候補リスト生成のためにＣＩＩＰブロックを
無効にするときの方法のフローチャートを示す。 FIG. 11B illustrates a flowchart of a method for disabling CIIP blocking for MPM candidate list generation according to an example of the present disclosure.

図１２は、ユーザインターフェース１２６０と結合されたコンピューティング環境１２
１０を示す。コンピューティング環境１２１０は、データ処理サーバーの一部であり得る
。コンピューティング環境１２１０は、プロセッサ１２２０と、メモリ１２４０と、Ｉ／
Ｏインターフェース１２５０とを含む。 FIG. 12 illustrates a computing environment 1260 coupled with a user interface 1260.
12. The computing environment 1210 may be part of a data processing server. The computing environment 1210 includes a processor 1220, memory 1240, and I/O.
and an O interface 1250.

プロセッサ１２２０は、通常、表示、データ取得、データ通信、および画像処理に関連
する操作など、コンピューティング環境１２１０の全体的な操作を制御する。プロセッサ
１２２０は、上記の方法のすべてまたはいくつかのステップを行うための命令を実行する
１つまたは複数のプロセッサを含み得る。さらに、プロセッサ１２２０は、プロセッサ１
２２０と他の構成要素との間の相互作用を容易にする１つまたは複数の回路を含み得る。
プロセッサは、中央処理ユニット（ＣＰＵ）、マイクロプロセッサ、シングルチップマシ
ン、ＧＰＵなどであり得る。 The processor 1220 typically controls the overall operation of the computing environment 1210, such as operations related to display, data acquisition, data communication, and image processing. The processor 1220 may include one or more processors that execute instructions for performing all or some of the steps of the methods described above.
220 and other components.
The processor may be a central processing unit (CPU), a microprocessor, a single chip machine, a GPU, or the like.

メモリ１２４０は、コンピューティング環境１２１０の動作をサポートするための様々
なタイプのデータを格納するように構成される。そのようなデータの例は、コンピューテ
ィング環境１２１０で動作する任意のアプリケーションまたは方法に用いる命令、ビデオ
データ、画像データなどを含む。メモリ１２４０は、任意のタイプの揮発性または非揮発
性メモリデバイス、または、それらの組み合わせ、例えば、静的ランダムアクセスメモリ
（ＳＲＡＭ）、電気的に消去可能なプログラマブル読み取り専用メモリ（ＥＥＰＲＯＭ）
、消去可能プログラマブル読み取り専用メモリ（ＥＰＲＯＭ）、プログラム可能な読み取
り専用メモリ（ＰＲＯＭ）、読み取り専用メモリ（ＲＯＭ）、磁気メモリ、フラッシュメ
モリ、磁気ディスクまたは光ディスクを使用して実現できる。 Memory 1240 is configured to store various types of data to support the operation of computing environment 1210. Examples of such data include instructions for any application or method operating in computing environment 1210, video data, image data, etc. Memory 1240 may be any type of volatile or non-volatile memory device, or combination thereof, such as, for example, static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), etc.
, erasable programmable read only memory (EPROM), programmable read only memory (PROM), read only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk.

Ｉ／Ｏインターフェース１２５０は、プロセッサ１２２０と、キーボード、クリックホ
イール、ボタンなどの周辺インターフェースモジュールとの間のインターフェースを提供
する。ボタンには、ホームボタン、スキャン開始ボタン、およびスキャン停止ボタンが含
まれるが、これらに限定されていない。Ｉ／Ｏインターフェース１２５０は、エンコーダ
およびデコーダと結合することができる。 The I/O interface 1250 provides an interface between the processor 1220 and a peripheral interface module such as a keyboard, a click wheel, buttons, including but not limited to a home button, a start scan button, and a stop scan button. The I/O interface 1250 can be coupled to an encoder and a decoder.

一実施形態では、上記した方法を実行するために、コンピューティング環境１２１０内
のプロセッサ１２２０によって実行可能である、メモリ１２４０に含まれるような複数の
プログラムを含む非一時的なコンピュータ可読記憶媒体も提供される。例えば、非一時的
なコンピュータ可読記憶媒体は、ＲＯＭ、ＲＡＭ、ＣＤ-ＲＯＭ、磁気テープ、フロッピ
ーディスク、光学データ記憶装置などであり得る。 In one embodiment, a non-transitory computer readable storage medium is also provided that includes a number of programs, such as those contained in memory 1240, executable by processor 1220 in computing environment 1210 to perform the methods described above. For example, the non-transitory computer readable storage medium may be a ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage device, etc.

非一時的なコンピュータ可読記憶媒体は、１つまたは複数のプロセッサを有するコンピ
ューティングデバイスによって実行するための複数のプログラムをその中に格納しており
、複数のプログラムは、１つまたは複数のプロセッサによって実行されると、コンピュー
ティングデバイスが上記した動作予測するための方法を実行するものである。 A non-transitory computer-readable storage medium has stored therein a plurality of programs for execution by a computing device having one or more processors, the plurality of programs, when executed by the one or more processors, causing the computing device to perform the method for predicting behavior described above.

一実施形態では、コンピューティング環境１２１０は、上述した方法を実行するために
、１つまたは複数の特定用途向け集積回路（ＡＳＩＣ）、デジタル信号プロセッサ（ＤＳ
Ｐ）、デジタル信号処理デバイス（ＤＳＰＤ）、プログラマブルロジックデバイス（ＰＬ
Ｄ）、フィールドプログラマブルゲートアレイ（ＦＰＧＡ）、グラフィカルプロセッシン
グユニット（ＧＰＵ）、コントローラー、マイクロコントローラー、マイクロプロセッサ
ー、またはその他の電子コンポーネントにより実現できる。 In one embodiment, the computing environment 1210 may include one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), and/or other processors for performing the methods described above.
P), digital signal processing device (DSPD), programmable logic device (PL
D) may be implemented by a field programmable gate array (FPGA), a graphical processing unit (GPU), a controller, a microcontroller, a microprocessor, or other electronic components.

Claims

obtaining a first reference image that is associated with a current coding block of the current image and that is temporally preceding the current image and a second reference image that is temporally following the current image;
obtaining a first prediction based on a first motion vector from the current coding block to a reference block in the first reference image;
obtaining a second prediction based on a second motion vector from the current coding block to a reference block in the second reference image; and calculating a bidirectional prediction of the current coding block based on at least the first prediction and the second prediction, the calculating including: enabling a bidirectional optical flow (BDOF) when calculating the bidirectional prediction of the current coding block under a condition that a composite inter-intra prediction (CIIP) is not applied to the calculation of the bidirectional prediction of the current coding block; and disabling BDOF when calculating the bidirectional prediction of the current coding block in response to a determination that a CIIP is applied to the calculation of the bidirectional prediction of the current coding block, and calculating a bidirectional prediction of the current coding block based on an average of the first prediction and the second prediction .
The intra prediction modes include planar modes,
The BDOF operation is as follows:
calculating a first horizontal gradient value {(∂I ⁽⁰⁾ /∂x)(i,j)} and a first vertical gradient value {(∂I ⁽⁰⁾ /∂y)(i,j)} for a prediction sample associated with the first prediction, and calculating a second horizontal gradient value {(∂I ⁽¹⁾ /∂x)(i,j)} and a second vertical gradient value {(∂I ⁽¹⁾ /∂y)(i,j)} for a prediction sample associated with the second prediction, where I ⁽⁰⁾ (i,j) represents the prediction sample at sample position (i,j) associated with the first prediction and I ⁽¹⁾ (i,j) represents the prediction sample at sample position (i,j) associated with the second prediction;
calculating a bi-directional prediction of the current coding block based on the first prediction, the second prediction, the first horizontal gradient value, the first vertical gradient value, the second horizontal gradient value, and the second vertical gradient value.
Video encoding method.

Calculating a bi-prediction of the current coding block in response to enabling the BDOF includes:
calculating a motion improvement value for each sub-block by minimizing a difference between a prediction sample associated with a first prediction and a prediction sample associated with a second prediction; and calculating a bidirectional prediction of the current coding block based on the motion improvement value, the first prediction, the second prediction, the first horizontal gradient value, the first vertical gradient value, the second horizontal gradient value, and the second vertical gradient value.
The method of claim 1 .

Calculating a bi-prediction of the current coding block in response to enabling the BDOF includes:
calculating a BDOF value based on the motion improvement value, the first horizontal gradient value, the first vertical gradient value, the second horizontal gradient value, and the second vertical gradient value; and calculating a bi-directional prediction of the current coding block based on the first prediction, the second prediction, and the BDOF value.
The method of claim 2 .

1. A video encoding device comprising: one or more processors; and one or more storage devices coupled to the one or more processors,
Configured to carry out the method according to any one of claims 1 to 3 ,
Video encoding equipment.

A non-transitory computer-readable storage medium storing a plurality of programs to be executed by a computing device having one or more processors,
The plurality of programs are executed by the one or more processors to cause the computing device to perform the method of any one of claims 1 to 3 .
Storage medium.

A computer program stored on a computer readable storage medium comprising instructions which, when executed by a processor, perform the method of any one of claims 1 to 3 .