HK1241609B - Image decoding method, video decoder, and non-transitory computer-readable storage medium - Google Patents
Image decoding method, video decoder, and non-transitory computer-readable storage medium Download PDFInfo
- Publication number
- HK1241609B HK1241609B HK18100653.8A HK18100653A HK1241609B HK 1241609 B HK1241609 B HK 1241609B HK 18100653 A HK18100653 A HK 18100653A HK 1241609 B HK1241609 B HK 1241609B
- Authority
- HK
- Hong Kong
- Prior art keywords
- image
- mmr
- prediction
- sub
- model
- Prior art date
Links
Description
本申请为2012年4月13日提交的国际申请号为PCT/US2012/033605、发明名称为“多颜色通道多元回归预测算子”的PCT申请的分案申请,该PCT申请进入中国国家阶段日期为2013年10月11日,国家申请号为201280018070.9。This application is a divisional application of PCT application with international application number PCT/US2012/033605 filed on April 13, 2012, and invention name “Multi-color channel multivariate regression prediction operator”. The date on which the PCT application entered the Chinese national phase is October 11, 2013, and the national application number is 201280018070.9.
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求于2011年4月14日提交的美国临时专利申请No.61/475,359的权益,其全部内容通过引用结合于此。This application claims the benefit of U.S. Provisional Patent Application No. 61/475,359, filed April 14, 2011, which is hereby incorporated by reference herein in its entirety.
本申请还涉及于2011年4月14日提交的共同未决的美国临时专利申请No.61/475,372,其全部内容通过引用结合于此。This application is also related to co-pending U.S. Provisional Patent Application No. 61/475,372, filed April 14, 2011, which is incorporated herein by reference in its entirety.
技术领域Technical Field
本发明总体上涉及图像。更具体地,本发明的实施例涉及高动态范围图像与标准动态范围图像之间的多颜色通道、多元回归预测算子。The present invention relates generally to images and, more particularly, to a multi-color channel, multivariate regression prediction operator between high dynamic range images and standard dynamic range images.
背景技术Background Art
如本文中所使用的术语“动态范围”(DR,dynamic range)涉及人类视觉系统(HVS,human psychovisual system)对图像中例如从最暗的暗部到最亮的亮部的强度(例如,亮度)范围进行感知的能力。从这个意义上讲,DR涉及“与场景相关”的强度。DR还可涉及显示设备适当地或近似地呈现特定宽度的强度范围的能力。从这个意义上讲,DR涉及“与显示器相关”的强度。除非在本文的描述中的任何一点处明确地指出特定的意义具有特定含义,否则应推断该术语可以(例如,互换地)用在任意一种意义中。As used herein, the term "dynamic range" (DR) refers to the ability of the human visual system (HVS) to perceive a range of intensities (e.g., brightness) in an image, for example, from the darkest darks to the brightest highlights. In this sense, DR refers to intensities that are "relative to the scene." DR may also refer to the ability of a display device to appropriately or approximately present a range of intensities of a particular width. In this sense, DR refers to intensities that are "relative to the display." Unless a particular meaning is explicitly stated at any point in the description herein as having a specific meaning, it should be inferred that the term can be used (e.g., interchangeably) in either sense.
如本文中所使用的术语高动态范围(HDR,high dynamic range)涉及跨越人类视觉系统(HVS)的某14-15个数量级的DR宽度。例如,基本正常的适应良好的人(例如,在一个或多个统计学意义上、计量生物学意义上或眼科学意义上)具有跨越约15个数量级的强度范围。适应的人可感知少到只有极少数光子的微弱光源。然而,同样的人可感知沙漠、海或雪中正午太阳的近乎刺痛的耀眼强度(或甚至看着太阳,然而只是短暂地看着,以防止伤害)。虽然这个跨度对“适应的”人而言是可达到的,但例如那些人的HVS具有进行重置和调节的时间段。As used herein, the term high dynamic range (HDR) refers to a DR width that spans a certain 14-15 orders of magnitude across the human visual system (HVS). For example, a substantially normal, well-adapted person (e.g., in one or more statistical, biometric, or ophthalmological senses) has an intensity range that spans about 15 orders of magnitude. An adapted person can perceive faint light sources, as few as a handful of photons. However, the same person can perceive the nearly painful glare of the midday sun in the desert, sea, or snow (or even look at the sun, however briefly, to prevent damage). While this span is achievable for "adapted" people, their HVS, for example, has a time period for reset and adjustment.
相比之下,较之HDR,人类可同步感知强度范围中的扩展宽度的DR可在一定程度上缩短。如在本文中所使用的术语“视觉动态范围”或“可变动态范围”(VDR,variabledynamic range)可单独地或互换地涉及可由HVS同时感知的DR。如本文中所使用的VDR可涉及跨越5-6个数量级的DR。因此,尽管与真实场景相关的HDR可能在一定程度上变窄,但VDR仍表示较宽的DR宽度。如本文中所使用的术语“同步动态范围”可涉及VDR。In contrast, the DR of the extended width in the intensity range that humans can perceive simultaneously can be shortened to a certain extent compared to HDR. As used herein, the terms "visual dynamic range" or "variable dynamic range" (VDR) may refer individually or interchangeably to the DR that can be perceived simultaneously by an HVS. As used herein, VDR may refer to a DR that spans 5-6 orders of magnitude. Therefore, although HDR associated with real scenes may be narrowed to a certain extent, VDR still represents a wider DR width. As used herein, the term "simultaneous dynamic range" may refer to VDR.
直至最近,显示器具有了比HDR或VDR明显更窄的DR。使用常规阴极射线管(CRT,cathode ray tube),带有恒定的荧光白背光照明的液晶显示器(LCD,liquid crystaldisplay)或等离子屏幕技术的电视(TV)和计算机监视装置在它们的DR呈现能力上会限制于约三个数量级。因此这种传统的显示器特征是低动态范围(LDR,low dynamic range),对于VDR和HDR,还称为标准动态范围(SDR,standard dynamic range)。Until recently, displays had a significantly narrower DR than either HDR or VDR. Televisions (TVs) and computer monitors using conventional cathode ray tube (CRT), liquid crystal display (LCD) with constant fluorescent white backlighting, or plasma screen technology were limited to about three orders of magnitude in their DR rendering capabilities. These traditional displays were therefore characterized by low dynamic range (LDR), also known as standard dynamic range (SDR) for VDR and HDR.
然而,它们的基础技术中的进步允许更新式的显示器设计,以便相对于呈现在不够新式的显示器上的图像和视频内容,以在多种质量特征上具有显著改进的方式来呈现该内容。例如,更新式的显示设备可能能够呈现高清晰度(HD,high definition)内容和/或可根据多种显示能力(诸如,图像缩放器(image scaler))进行缩放的内容。此外,某些更新式的显示器能够以比传统显示器的SDR更高的DR来呈现内容。However, advances in their underlying technologies have allowed newer display designs to present image and video content with significant improvements in various quality characteristics relative to that content presented on less modern displays. For example, newer display devices may be capable of presenting high-definition (HD) content and/or content that is scalable according to various display capabilities (such as image scalers). Furthermore, some newer displays are capable of presenting content at a higher DR than the SDR of conventional displays.
例如,某些新式LCD显示器具有包括发光二极管(LED,light emitting diode)阵列的背光单元(BLU,backlight unit)。BLU阵列的LED可与有源LCD元件的偏振态的调制分开地进行调制。这种双调制方法是可(诸如)通过BLU阵列与LCD屏幕元件之间的可控中间层来扩展的(例如,扩展成N调制层,其中N包括大于2的整数)。其基于LED阵列的BLU和双(或N)调制有效地增加了具有这种特征的LCD监视器的与显示器相关的DR。For example, some modern LCD displays have a backlight unit (BLU) that includes an array of light-emitting diodes (LEDs). The LEDs of the BLU array can be modulated separately from the modulation of the polarization state of the active LCD elements. This dual modulation approach is extendable (e.g., to an N-modulation layer, where N is an integer greater than 2), such as by using a controllable intermediate layer between the BLU array and the LCD screen elements. The LED-based BLU and dual (or N) modulation effectively increase the display-related DR of LCD monitors having this feature.
关于传统SDR显示器,通常所称的这种“HDR显示器”(尽管实际上,它们的能力可能更近似VDR的范围)和它们可能的DR扩展在显示图像、视频内容和其他视频信息的能力上表现出显著进步。这种HDR显示器可以呈现的色域还可显著地超出多数传统显示器的色域,甚至到能呈现宽色域(WCG,wide color gamut)的程度。与场景相关的HDR或VDR和WCG图像内容,诸如可通过“下一代”电影和TV摄像机产生,现在可通过“HDR”显示器(下文中称为“HDR显示器”)来更真实有效地显示。With respect to traditional SDR displays, these so-called "HDR displays" (although in reality, their capabilities may be closer to the range of VDR) and their possible DR extensions represent a significant advancement in the ability to display images, video content, and other video information. The color gamut that these HDR displays can present can also significantly exceed the color gamut of most traditional displays, even to the point of being able to present wide color gamut (WCG). Scene-related HDR or VDR and WCG image content, such as that produced by "next-generation" film and TV cameras, can now be more realistically and effectively displayed using "HDR" displays (hereinafter referred to as "HDR displays").
就可扩展视频编码和HDTV技术而言,扩展图像DR通常涉及分叉方法。例如,通过新式HDR功能摄像机获取的与场景相关的HDR内容可用于产生内容的SDR版本,该内容的SDR版本可显示在传统SDR显示器上。在一种方法中,根据所获取的VDR版本产生SDR版本,可能涉及将全局色调映射算子(TMO,global tone mapping operator)应用于在HDR内容中的与强度(例如,亮度)相关的像素值。在第二种方法(如为了所有的目的通过引用结合在本文中的、于2011年8月23日提交的国际专利申请NO.PCT/US2011/048861中所描述的)中,产生SDR图像可涉及将可逆算子(或预测算子)应用在VDR数据上。为了保留带宽或出于其他考虑,传输实际获取的VDR内容可能不是最好的方法。In the context of scalable video coding and HDTV technologies, extending image DR typically involves a bifurcated approach. For example, HDR content associated with a scene, captured by a new HDR-capable camera, can be used to generate an SDR version of the content, which can be displayed on a conventional SDR display. In one approach, generating an SDR version from the captured VDR version may involve applying a global tone mapping operator (TMO) to intensity-related (e.g., brightness) pixel values in the HDR content. In a second approach (as described in International Patent Application No. PCT/US2011/048861, filed on August 23, 2011, and incorporated herein by reference for all purposes), generating the SDR image may involve applying a reversible operator (or prediction operator) to the VDR data. To conserve bandwidth or for other considerations, transmitting the actual captured VDR content may not be the best approach.
因此,关于初始TMO的逆的逆色调映射算子(iTMO,inverse tonemappingoperator)或者关于初始预测算子的逆算子可应用于所产生的SDR内容版本,这允许预测VDR内容的版本。所预测的VDR内容版本可与初始获取的HDR内容相比较。例如,从初始VDR版本减去预测的VDR版本可产生残余图像。编码器可将所产生的SDR内容作为基层(BL,baselayer)发送,并且将所产生的SDR内容版本、任意残余图像以及iTMO或其他预测算子打包作为增强层(EL,enhancement layer)或作为元数据。Therefore, an inverse tone mapping operator (iTMO) that is the inverse of the initial TMO or an inverse operator of the initial prediction operator can be applied to the generated SDR content version, which allows a version of the VDR content to be predicted. The predicted VDR content version can be compared with the initially acquired HDR content. For example, subtracting the predicted VDR version from the initial VDR version can produce a residual image. The encoder can send the generated SDR content as a base layer (BL) and package the generated SDR content version, any residual image, and the iTMO or other prediction operator as an enhancement layer (EL) or as metadata.
相比于将HDR内容和SDR内容这两者直接发送进比特流所占用的带宽,将EL和元数据(具有其SDR内容、残余图像和预测算子)发送进比特流中通常占用更少的带宽。接收由编码器发送的比特流的可兼容解码器可对SDR进行解码并且呈现在传统显示器上。然而,可兼容解码器还可使用残余图像,iTMO预测算子或元数据,以根据它们来计算HDR内容的预定版本,以用在更多功能的显示器上。本发明的目的是提供用于产生预测算子的新方法,该预测算子允许利用对应的SDR数据对VDR数据进行有效的编码、传输以及解码。Sending the EL and metadata (with its SDR content, residual image and prediction operator) into the bitstream typically takes up less bandwidth than sending both HDR content and SDR content directly into the bitstream. A compatible decoder receiving the bitstream sent by the encoder can decode the SDR and present it on a conventional display. However, a compatible decoder can also use the residual image, the iTMO prediction operator or the metadata in order to calculate a predetermined version of the HDR content from them for use on a more versatile display. The object of the present invention is to provide a new method for generating a prediction operator that allows efficient encoding, transmission and decoding of VDR data using the corresponding SDR data.
在这一部分中描述的方法是能够执行的方法,但并不一定是以前所设想或执行的方法。因此,除非另有指示,否则不应认为在这一部分所描述的方法中的任何一种因包括在这一部分中而被限定为现有技术。类似地,关于一种或多种方法所确定的问题不应认为已基于这部分在任何现有技术中确定,除非另有指示。The approaches described in this section are approaches that could be pursued, but not necessarily approaches that have been previously conceived or pursued. Therefore, unless otherwise indicated, none of the approaches described in this section should be considered prior art by virtue of their inclusion in this section. Similarly, issues identified with respect to one or more approaches should not be considered established in any prior art based on this section, unless otherwise indicated.
发明内容Summary of the Invention
在下文中给出了关于本发明的简要概述,以便提供关于本发明的某些方面的基本理解。应当理解,该概述并不是关于本发明的穷举性概述,它并非意图确定本发明的关键或重要部分,也不是意图限定本发明的范围。其目的仅仅是以简化的形式给出某些概念,以此作为后文的具体实施方式部分的铺垫。The following is a brief overview of the present invention to provide a basic understanding of certain aspects of the present invention. It should be understood that this overview is not an exhaustive overview of the present invention, is not intended to identify key or important aspects of the present invention, or to limit the scope of the present invention. Its purpose is simply to present certain concepts in a simplified form as a foundation for the detailed description that follows.
根据本发明的一个方面,提供了一种使用处理器的图像解码方法,该方法包括:接收第一图像;接收元数据,该元数据包括用于多通道、多元回归MMR预测模型的预测参数,其中,MMR模型适于根据第一图像来预测第二图像;以及将第一图像和预测参数应用于MMR预测模型以生成对第二图像进行近似的输出图像,其中,基于第一图像中的至少两个颜色成分的像素值来计算输出图像中的至少一个颜色成分的像素值,其中,MMR模型包括根据下述公式的具有交叉相乘的一阶MMR模型:According to one aspect of the present invention, there is provided an image decoding method using a processor, the method comprising: receiving a first image; receiving metadata, the metadata comprising prediction parameters for a multi-channel, multivariate regression (MMR) prediction model, wherein the MMR model is adapted to predict a second image based on the first image; and applying the first image and the prediction parameters to the MMR prediction model to generate an output image that approximates the second image, wherein pixel values of at least one color component in the output image are calculated based on pixel values of at least two color components in the first image, wherein the MMR model comprises a first-order MMR model with cross multiplication according to the following formula:
其中,表示输出图像的第i像素的所预测的三个颜色成分,si=[si1 si2 si3]表示第一图像的第i像素的三个颜色成分,根据下式,是3×3预测参数矩阵并且n是1×3预测参数向量:where represents the predicted three color components of the i-th pixel of the output image, s i = [s i1 s i2 s i3 ] represents the three color components of the i-th pixel of the first image, is a 3×3 prediction parameter matrix and n is a 1×3 prediction parameter vector according to the following formula:
和n=[n11 n12 n13],and n = [n 11 n 12 n 13 ],
根据下式,是4×3预测参数矩阵:According to the following formula, it is a 4×3 prediction parameter matrix:
并且and
sci是根据sci=[si1·si2 si1·si3 si2·si3 si1·si2·si3]的1×4向量。sc i is a 1×4 vector according to sc i = [s i1 ·s i2 s i1 ·s i3 s i2 ·s i3 s i1 ·s i2 ·s i3 ].
根据本发明的一个方面,提供了一种视频解码器,该解码器包括:用于接收第一图像和元数据的输入端,其中,元数据包括用于多通道、多元回归MMR预测模型的预测参数,其中,MMR模型适于根据第一图像来预测第二图像;处理器;以及用于存储输出图像的存储器,其中处理器用于将第一图像和预测参数应用于MMR预测模型以生成对第二图像进行近似的输出图像,其中,基于第一图像中的至少两个颜色成分的像素值来计算输出图像中的至少一个颜色成分的像素值,其中,MMR模型包括根据下述公式的具有交叉相乘的一阶MMR模型:According to one aspect of the present invention, a video decoder is provided, the decoder comprising: an input for receiving a first image and metadata, wherein the metadata comprises prediction parameters for a multi-channel, multivariate regression (MMR) prediction model, wherein the MMR model is adapted to predict a second image based on the first image; a processor; and a memory for storing an output image, wherein the processor is configured to apply the first image and the prediction parameters to the MMR prediction model to generate an output image that approximates the second image, wherein pixel values of at least one color component in the output image are calculated based on pixel values of at least two color components in the first image, wherein the MMR model comprises a first-order MMR model with cross-multiplication according to the following formula:
其中,表示输出图像的第i像素的所预测的三个颜色成分,si=[si1 si2 si3]表示第一图像的第i像素的三个颜色成分,where represents the predicted three color components of the i-th pixel of the output image, s i =[s i1 s i2 s i3 ] represents the three color components of the i-th pixel of the first image,
根据下式,是3×3预测参数矩阵并且n是1×3预测参数向量:According to the following formula, is a 3×3 prediction parameter matrix and n is a 1×3 prediction parameter vector:
和n=[n11 n12 n13],and n = [n 11 n 12 n 13 ],
根据下式,是4×3预测参数矩阵:According to the following formula, it is a 4×3 prediction parameter matrix:
并且and
sci是根据sci=[si1·si2 si1·si3 si2·si3 si1·si2·si3]的1×4向量。sc i is a 1×4 vector according to sc i = [s i1 ·s i2 s i1 ·s i3 s i2 ·s i3 s i1 ·s i2 ·s i3 ].
根据本发明的一个方面,提供了一种非暂态计算机可读存储介质,其上存储有用于使用一个或更多个处理器执行上述使用处理器的图像解码方法的计算机可执行程序。According to one aspect of the present invention, a non-transitory computer-readable storage medium is provided, on which a computer-executable program for executing the above-mentioned image decoding method using one or more processors is stored.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
通过示例(但不是通过限制的方式)在附图部分的图中例示了本发明的实施例,附图中相同的附图标记指示类似的元件,其中:Embodiments of the invention are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings in which like reference numerals indicate similar elements and in which:
图1描绘了根据本发明的实施例的VDR-SDR系统的示例数据流;FIG1 depicts an example data flow of a VDR-SDR system according to an embodiment of the present invention;
图2描绘了根据本发明的实施例的示例VDR编码系统;FIG2 depicts an example VDR encoding system according to an embodiment of the present invention;
图3描绘了根据本发明的实施例的多变量多元回归预测算子的输入和输出接口;FIG3 depicts the input and output interfaces of a multivariate multiple regression prediction operator according to an embodiment of the present invention;
图4描绘了根据本发明的实施例的示例多变量多元回归预测处理;FIG4 depicts an example multivariate multiple regression prediction process according to an embodiment of the present invention;
图5描绘了根据本发明的实施例的关于确定多变量多元回归预测算子的模型的示例处理;FIG5 depicts an example process for determining a model for a multivariate multiple regression prediction operator according to an embodiment of the present invention;
图6描绘了带有根据本发明的实施例进行操作的预测算子的示例图像解码器。FIG6 depicts an example image decoder with a prediction operator operating in accordance with an embodiment of the present invention.
具体实施方式DETAILED DESCRIPTION
在本文中描述基于多变量多元回归建模的颜色间(inter-color)图像预测。给定一对对应的VDR和SDR图像,即,表示相同场景但是在不同的动态范围等级的图像,这部分描述了允许编码器根据多变量多元回归(MMR,multivariate multi-regression)预测算子和SDR图像对VDR图像进行近似的方法。在下面的描述中,为了解释的目的,陈述了许多特定细节以提供对本发明的充分理解。然而,显然的是,本发明无需这些特定细节仍可实现。在其他情形中,为了避免不必要的掩盖、模糊或者混淆了本发明,没有详尽地描述已知的结构和设备。This paper describes inter-color image prediction based on multivariate multivariate regression modeling. Given a pair of corresponding VDR and SDR images, i.e., images representing the same scene but at different dynamic range levels, this section describes a method that allows an encoder to approximate the VDR image based on a multivariate multivariate regression (MMR) prediction operator and the SDR image. In the following description, for purposes of explanation, numerous specific details are set forth to provide a full understanding of the present invention. However, it will be apparent that the present invention can be implemented without these specific details. In other cases, in order to avoid unnecessarily obscuring, obscuring, or confusing the present invention, known structures and devices are not described in detail.
概述Overview
本文中所描述的示范实施例涉及具有高动态范围的编码图像。实施例创建了MMR预测算子,该MMR预测算子允许VDR图像关于其对应的SDR表示来进行表达。The exemplary embodiments described herein relate to encoding images with high dynamic range.The embodiments create an MMR predictor that allows a VDR image to be expressed with respect to its corresponding SDR representation.
示例VDR-SDR系统Example VDR-SDR System
图1描绘了在根据本发明的实施例的VDR-SDR系统100中的示例数据流。利用HDR摄像机110获取HDR图像或视频序列。在获取之后,所获取的图像或视频通过灌制处理(mastering process)来进行处理以创建目标VDR图像125。灌制处理可包含多个处理步骤,诸如:编辑,一次和二次颜色校正、颜色变换以及噪声过滤。此处理的VDR输出125表示关于所获取的图像将如何在目标VDR显示器上进行显示的负责人的意图。FIG1 depicts an example data flow in a VDR-SDR system 100 according to an embodiment of the present invention. An HDR image or video sequence is acquired using an HDR camera 110. After acquisition, the acquired image or video is processed through a mastering process to create a target VDR image 125. The mastering process may include multiple processing steps, such as editing, primary and secondary color correction, color conversion, and noise filtering. The VDR output 125 of this processing represents the intended display of the acquired image on the target VDR display.
灌制处理还可输出对应的SDR图像145,其表示关于所获取的图像将如何在合法的SDR显示器上进行显示的负责人的意图。SDR输出145可直接从灌制电路120提供或者SDR输出145可通过分开的VDR至SDR转换器140来产生。The mastering process may also output a corresponding SDR image 145 that represents the intended purpose of the person responsible for how the acquired image will be displayed on a legal SDR display. The SDR output 145 may be provided directly from the mastering circuitry 120 or may be generated by a separate VDR to SDR converter 140.
在本示例实施例中,VDR 125和SDR 145信号被输入进编码器130。编码器130的目的是创建经编码的比特流,其中该经编码的比特流减少了传输VDR和SDR信号所需的带宽并且还允许对应的解码器150进行解码并且呈现SDR信号或者VDR信号。在示例实现方式中,编码器130可以是分层编码器,诸如通过MPEG-2和H.264编码标准定义的那些编码器中的一个,这将其输出表示为基层、可选增强层以及元数据。如本文中所使用的术语“元数据”涉及作为经编码的比特流的一部分被传输并且帮助解码器呈现经解码的图像的任何辅助信息。这种元数据可包括(但是不限于)如下这些数据:色空间或色域信息、动态范围信息、色调映射信息或者MMR预测算子,诸如本文所描述的那些。In this example embodiment, the VDR 125 and SDR 145 signals are input to an encoder 130. The purpose of the encoder 130 is to create an encoded bitstream that reduces the bandwidth required to transmit the VDR and SDR signals and also allows the corresponding decoder 150 to decode and render the SDR or VDR signals. In an example implementation, the encoder 130 can be a layered encoder, such as one of those defined by the MPEG-2 and H.264 coding standards, which represents its output as a base layer, an optional enhancement layer, and metadata. The term "metadata" as used herein refers to any auxiliary information that is transmitted as part of the encoded bitstream and helps the decoder render the decoded image. Such metadata may include (but is not limited to) such data as color space or color gamut information, dynamic range information, tone mapping information, or MMR prediction operators, such as those described herein.
在接收器上,解码器150使用所接收的经编码的比特流和元数据,以根据目标显示器的能力来呈现SDR图像或者VDR图像。例如,SDR显示器可仅使用基层和元数据来呈现SDR图像。相比之下,VDR显示器可使用来自所有输入层的信息和元数据来呈现VDR信号。At the receiver, the decoder 150 uses the received encoded bitstream and metadata to render either an SDR image or a VDR image, depending on the capabilities of the target display. For example, an SDR display may render an SDR image using only the base layer and metadata. In contrast, a VDR display may render a VDR signal using information and metadata from all input layers.
图2更详细地示出了包括本发明的方法的编码器130的示例实现方式。在图2中,SDR’表示增强的SDR信号。SDR视频现在是8比特、4:2:0、ITU Rec.709数据。SDR’可具有与SDR相同的色空间(原色和白点),但是,对在全空间分辨率下的所有颜色成分(例如,4:4:4RGB)可使用高精度,比方说每个像素12比特。根据图2,能够利用一组正变换从SDR’信号容易地导出SDR,所述一组正变换可包括从比方说每个像素12比特到每个像素8比特的量化,比方说从RGB到YUV的颜色变换以及比方说从4:4:4到4:2:0的颜色子采样。变换器210的SDR输出施加于压缩系统220。根据应用,压缩系统220可能是有损耗的(诸如H.264或MPEG-2)或者无损耗的。压缩系统220的输出可作为基层225传输。为了减小经编码的信号与经解码的信号之间的偏移,编码器130在压缩处理220之后紧接对应的解压缩处理230和对应于正变换210的逆变换240,这并非罕见。因此,预测算子250可具有下列输入:VDR输入205,以及SDR’信号245(当该信号将由对应的解码器接收时其对应于SDR’信号)或输入SDR’207。使用输入的VDR和SDR’数据的预测算子250将创建信号257,信号257表示输入VDR 205的近似或估计。加法器260从初始的VDR 205减去经预测的VDR 257以形成输出残余信号265。随后(未示出),残余265也可由另一有损耗或无损耗的编码器进行编码并且可作为增强层传输至解码器。FIG2 illustrates in more detail an example implementation of an encoder 130 including the method of the present invention. In FIG2 , SDR′ represents an enhanced SDR signal. SDR video is now 8-bit, 4:2:0, ITU Rec.709 data. SDR′ may have the same color space (primaries and white point) as SDR, but may use high precision, say 12 bits per pixel, for all color components at full spatial resolution (e.g., 4:4:4 RGB). Based on FIG2 , SDR can be easily derived from the SDR′ signal using a set of forward transforms, which may include quantization from, say, 12 bits per pixel to 8 bits per pixel, a color conversion from, say, RGB to YUV, and color subsampling from, say, 4:4:4 to 4:2:0. The SDR output of the converter 210 is applied to a compression system 220. Depending on the application, the compression system 220 may be lossy (such as H.264 or MPEG-2) or lossless. The output of the compression system 220 may be transmitted as a base layer 225. To reduce the offset between the encoded and decoded signals, it is not uncommon for encoder 130 to immediately follow compression process 220 with a corresponding decompression process 230 and an inverse transform 240 corresponding to forward transform 210. Thus, predictor 250 may have the following inputs: VDR input 205, and SDR' signal 245 (which corresponds to an SDR' signal when received by a corresponding decoder) or input SDR' 207. Using the input VDR and SDR' data, predictor 250 creates signal 257, which represents an approximation or estimate of input VDR 205. Adder 260 subtracts predicted VDR 257 from original VDR 205 to form output residual signal 265. Residual 265 may then be encoded by another lossy or lossless encoder (not shown) and transmitted to a decoder as an enhancement layer.
预测算子250还可提供在预测处理中使用的预测参数,作为元数据255。由于预测参数可在编码处理期间例如逐帧地或者逐场景地变化,所以这些元数据可作为数据中还包括基层及增强层的一部分被传输至解码器。Predictor 250 may also provide prediction parameters used in the prediction process as metadata 255. Since the prediction parameters may vary during the encoding process, for example, from frame to frame or scene to scene, these metadata may be transmitted to the decoder as part of the data that also includes the base layer and enhancement layer.
由于VDR 205和SDR’207都表示相同的场景,但针对的是具有不同特征(诸如,动态范围和色域)的不同显示器,所以期望这两个信号具有非常紧密的关联性。在本发明的示例实施方式中,开发了新的多变量、多元回归(MMR)预测算子250,其允许使用与VDR信号对应的SDR’信号和多变量MMR算子来预测输入VDR信号。Since both the VDR 205 and the SDR' 207 represent the same scene, but for different displays with different characteristics (such as dynamic range and color gamut), it is expected that the two signals have a very close correlation. In an example embodiment of the present invention, a new multivariate, multiple regression (MMR) prediction operator 250 is developed that allows the SDR' signal corresponding to the VDR signal and a multivariate MMR operator to be used to predict the input VDR signal.
示例预测模型Example Prediction Model
图3示出了根据本发明的示例实现方式的MMR预测算子300的输入和输出接口。根据图3,预测算子330接收输入向量v 310和s 320,它们分别表示VDR图像数据和SDR图像数据,并且预测算子330输出向量340,其表示输入v的预测值。Figure 3 shows the input and output interfaces of an MMR prediction operator 300 according to an example implementation of the present invention. According to Figure 3, the prediction operator 330 receives input vectors v 310 and s 320, which represent VDR image data and SDR image data, respectively, and the prediction operator 330 outputs a vector 340, which represents the predicted value of the input v.
示例标记法和命名法Example notation and nomenclature
SDR图像320中的第i个像素的三个颜色成分标记为:The three color components of the i-th pixel in the SDR image 320 are labeled:
si=[si1 si2 si3]。 (1)s i =[s i1 s i2 s i3 ]. (1)
VDR输入310中的第i个像素的三个颜色成分标记为:The three color components of the i-th pixel in the VDR input 310 are labeled:
Vi=[vi1 vi2 vi3]。 (2)V i =[v i1 v i2 v i3 ]. (2)
经预测的VDR 340中的第i个像素的经预测的三个颜色成分标记为:The predicted three color components of the i-th pixel in the predicted VDR 340 are labeled:
一个颜色成分中的像素总数标记为p。The total number of pixels in one color component is denoted as p.
在等式(1-3)中,颜色像素可以是RGB、YUV、YCbCr、XYZ或者任意其他颜色表示。尽管等式(1-3)针对每个图像或视频帧中的每个像素假定三个颜色表示,但还如后面所示,本文所描述的方法可容易地扩展至每个像素具有多于三个颜色成分的图像和视频表示,或者扩展至这样的图像表示,其中输入中的一个可具有颜色表示数量与其他输入不同的像素。In equations (1-3), color pixels can be RGB, YUV, YCbCr, XYZ, or any other color representation. Although equations (1-3) assume three color representations for each pixel in each image or video frame, as will be shown below, the methods described herein can be easily extended to image and video representations with more than three color components per pixel, or to image representations where one of the inputs may have pixels with a different number of color representations than the other inputs.
一阶模型(MMR-1)First-order model (MMR-1)
利用多变量多元回归(MMR)模型,一阶预测模型能够表示为:Using the multivariate multiple regression (MMR) model, the first-order prediction model can be expressed as:
其中,是3×3矩阵并且n是1×3向量,定义为:where is a 3×3 matrix and n is a 1×3 vector defined as:
和n=[n11 n12 n13]。 (5)and n=[n 11 n 12 n 13 ]. (5)
应当注意,这是多颜色通道预测模型。在等式(4)的中,每个颜色成分表示为输入中的所有颜色成分的线性组合。换言之,与其他的单个通道颜色预测算子(其中,针对每个输出像素,每个颜色通道对其自身进行处理并且彼此独立地进行处理)不同,本模型考虑了像素的所有颜色成分并且因此充分利用颜色间关联性和冗余度。It should be noted that this is a multi-color channel prediction model. In equation (4), each color component is represented as a linear combination of all color components in the input. In other words, unlike other single-channel color prediction operators (where each color channel is processed on its own and independently of each other for each output pixel), this model considers all color components of the pixel and thus fully exploits the correlation and redundancy between colors.
通过使用基于单个矩阵的表示能够将等式(4)简化为:By using a single matrix-based representation, equation (4) can be simplified to:
其中,in,
和S′i=[1 si1 si2 si3] (7)and S′ i = [1 s i1 s i2 s i3 ] (7)
通过将帧(或者输入的其他合适片段或部分)的所有p个像素集合在一起,可以具有下面的矩阵表示:By grouping together all p pixels of a frame (or other suitable fragment or portion of the input), we can have the following matrix representation:
其中,in,
和表示输入和经预测的输出数据,S’是p×4数据矩阵,是p×3矩阵,以及N(1)是4×3矩阵。如本文中所使用的,N(1)可互换地称为多变量算子或预测矩阵。and represent the input and predicted output data, S' is a p x 4 data matrix, is a p x 3 matrix, and N (1) is a 4 x 3 matrix. As used herein, N (1) is interchangeably referred to as a multivariate operator or a prediction matrix.
基于等式(8)的这个线性系统,能够将此MMR系统用公式表示为两个不同的问题:(a)最小二乘问题,或者(b)总体最小二乘问题;两个问题都能使用已知的数值方法求解。例如,使用最小二乘方法,用于求解M的问题可用公式表示为将残余或预测均方误差最小化,或者Based on this linear system of equation (8), the MMR system can be formulated as two different problems: (a) a least squares problem, or (b) a total least squares problem; both problems can be solved using known numerical methods. For example, using the least squares method, the problem for solving M can be formulated as minimizing the residual or prediction mean square error, or
其中V是使用对应的VDR输入数据形成的p×3矩阵。where V is a p-by-3 matrix formed using the corresponding VDR input data.
给定了等式(8)和(10),M(1)的最佳解给出为Given equations (8) and (10), the optimal solution of M (1) is given by
M(1)=(S′TS′)-1S′TV, (11)M (1) = (S′ T S′) -1 S′ T V, (11)
其中,S’T表示S’的转置矩阵,S’TS’是4×4矩阵。Where S' T represents the transposed matrix of S', and S' T S' is a 4×4 matrix.
如果S’是满列秩,例如,If S' is full column rank, for example,
rank(S′)=4≤p,rank(S′)=4≤p,
则,还可利用多种替代的数值技术(包括SVD、QR或者LU分解)来解出M(1)。Then, a variety of alternative numerical techniques (including SVD, QR or LU decomposition) can be used to solve M (1) .
二阶模型(MMR-2)Second-order model (MMR-2)
等式(4)表示一阶MMR预测模型。还可以考虑采用如接下来所描述的更高阶预测。Equation (4) represents a first-order MMR prediction model. Higher-order predictions can also be considered as described below.
二阶预测MMR模型可表示为:The second-order prediction MMR model can be expressed as:
其中是3×3矩阵,where is a 3×3 matrix,
以及as well as
等式(12)能够通过使用单个预测矩阵来简化,Equation (12) can be simplified by using a single prediction matrix,
其中,in,
并且and
通过将所有p个像素集合在一起,可定义下面的矩阵表示:By grouping all p pixels together, the following matrix representation can be defined:
其中,in,
能够利用在前面的部分中描述的相同优化和求解法来求解等式(14)。最小二乘问题的M(2)的最佳解是The same optimization and solution methods described in the previous section can be used to solve equation (14). The optimal solution to the least squares problem M (2) is
M(2)=(S(2)TS(2))-1S(2)TV, (19)M (2) = (S (2)T S (2) ) -1 S(2) T V, (19)
其中,S(2)TS(2)现在是7×7矩阵。where S (2)T S (2) is now a 7×7 matrix.
还能够以类似的方式构建三阶或更高阶的MMR模型。Third-order or higher-order MMR models can also be constructed in a similar manner.
具有交叉相乘的一阶模型(MMR-1C)First-order model with cross-multiplication (MMR-1C)
在替代MMR模型中,能够增强等式(4)的一阶预测模型以包括每个像素的颜色成分之间的交叉相乘(cross-multiplication),如下:In an alternative MMR model, the first-order prediction model of Equation (4) can be enhanced to include cross-multiplication between the color components of each pixel as follows:
其中,是3×3矩阵并且n是1×3向量,两者都如等式(5)中所定义的,并且where is a 3×3 matrix and n is a 1×3 vector, both as defined in equation (5), and
并且sci=[si1·si2 si1·si3 si2·si3 si1·si2·si3]。 (21)And sc i =[s i1 ·s i2 s i1 ·s i3 s i2 ·s i3 s i1 ·s i2 ·s i3 ]. (twenty one)
根据如之前一样的方法,等式(20)的MMR-1C模型能够通过利用单个预测矩阵MC简化,如下:Following the same approach as before, the MMR-IC model of equation (20) can be simplified by utilizing a single prediction matrix MC as follows:
其中,in,
并且and
通过将所有p个像素集合在一起,可以导出简化的矩阵表示,如下:By grouping all p pixels together, a simplified matrix representation can be derived as follows:
其中,in,
和and
SC是p×(1+7)矩阵并且能够利用前面描述的相同最小二乘解来求解。SC is a p x (1 + 7) matrix and can be solved using the same least squares solution described previously.
具有交叉相乘的二阶模型(MMR-2C)Second-order model with cross-multiplication (MMR-2C)
一阶MMR-1C能够扩展至还包括二阶数据。例如,The first-order MMR-1C can be extended to also include second-order data. For example,
其中,in,
并且and
并且等式(27)的其余分量与之前在等式(5-26)中定义的那些相同。And the remaining components of equation (27) are the same as those defined previously in equation (5-26).
与前面一样,等式(27)通过使用简单预测矩阵MC(2)来简化,As before, Equation (27) is simplified by using the simple prediction matrix MC (2) ,
其中,in,
并且and
通过将所有的p个像素集合在一起,可以具有简化的矩阵表示By grouping all p pixels together, we can have a simplified matrix representation
其中,in,
并且SC(2)是px(1+2*7)矩阵并且可以应用如之前所描述的相同最小二乘解。And SC (2) is a px(1+2*7) matrix and the same least squares solution as described before can be applied.
能够以类似的方式构建具有交叉相乘参数的三阶或更高阶模型。替代地,如在“Chaper 5.4.3of“Digital Color Imaging Handbook”,CRC Press,2002,Edited byGaurav Sharma”中所描述的,还能够利用下列的公式描述MMR交叉相乘模型的K阶表示。A third-order or higher-order model with cross-multiplication parameters can be constructed in a similar manner. Alternatively, as described in "Chaper 5.4.3 of "Digital Color Imaging Handbook", CRC Press, 2002, Edited by Gaurav Sharma", the following formula can also be used to describe the K-order representation of the MMR cross-multiplication model.
并且and
其中,K表示MMR预测算子的最高阶。Here, K represents the highest order of the MMR prediction operator.
基于空间扩展的MMR(MMR-C-S)MMR based on spatial expansion (MMR-C-S)
到目前为止所描述的所有MMR模型中,经预测的像素的值仅取决于对应的、通常配置的输入值si。在基于MMR的预测的情况下,还可以通过考虑来自邻近像素的数据而受益。此方法对应到将空间域中的输入的任意线性类型处理(诸如FIR型滤波)集成进MMR模型。In all the MMR models described so far, the value of the predicted pixel depends only on the corresponding, usually configured input value s i . In the case of MMR-based prediction, it is also possible to benefit from taking into account data from neighboring pixels. This approach corresponds to integrating any linear type processing of the input in the spatial domain (such as FIR-type filtering) into the MMR model.
如果在一个图像中考虑所有八个可能的邻近像素,则此方法可将每个颜色成分的多达八个多一阶变量添加进该预测矩阵M中。然而,在实际中,通常仅添加与两个水平邻近像素和两个垂直邻近像素对应的预测变量就足够了,忽略对角邻近像素。这将每个颜色成分的多达四个变量添加进预测矩阵中,即,所述四个变量与上边、左边、下边以及右边像素对应。类似地,还能够添加与邻近的像素值的更高阶数对应的参数。If all eight possible neighboring pixels in an image are considered, this method can add up to eight more first-order variables for each color component into the prediction matrix M. However, in practice, it is usually sufficient to add only the prediction variables corresponding to the two horizontally neighboring pixels and the two vertically neighboring pixels, ignoring the diagonal neighbors. This adds up to four variables for each color component into the prediction matrix, i.e., the four variables corresponding to the top, left, bottom, and right pixels. Similarly, parameters corresponding to higher-order numbers of neighboring pixel values can also be added.
为了简化这种MMR空间模型的复杂性和计算要求,可以考虑仅针对单个颜色成分,诸如亮度成分(如在亮度-色度表示中)或绿色成分(如在RGB表示中),将空间扩展添加至传统模型。例如,假定仅针对绿色的颜色成分添加基于空间的像素预测,则根据等式(34-36),预测绿色输出像素值的一般表示将是To simplify the complexity and computational requirements of such MMR spatial models, one can consider adding spatial extensions to the traditional model for only a single color component, such as the luminance component (as in luma-chrominance representation) or the green component (as in RGB representation). For example, assuming that spatial-based pixel prediction is added only for the green color component, the general representation of the predicted green output pixel value according to equations (34-36) would be
具有空间扩展的一阶模型(MMR-1-S)First-order model with spatial extension (MMR-1-S)
如另一示例实现方式,可以再考虑等式(4)的一阶MMR模型(MMR-1),但是,现在被增强了,以包括在一个或更多个颜色成分中的空间扩展。例如,当应用于第一颜色成分中的每个像素的四个邻近像素时,As another example implementation, the first-order MMR model (MMR-1) of equation (4) may be considered again, but now enhanced to include spatial extension in one or more color components. For example, when applied to the four neighboring pixels of each pixel in the first color component,
其中,是3×3矩阵并且n是1×3向量,两者与等式(5)中定义的一样,where is a 3×3 matrix and n is a 1×3 vector, both as defined in equation (5),
并且and
其中,等式(39)中的m表示具有m列和n行的输入帧中的列数,或者m×n=p总像素。等式(39)能够容易地扩展成将这些方法应用于其他颜色成分和应用于替代的邻近像素构造。Where m in equation (39) represents the number of columns in the input frame with m columns and n rows, or m×n=p total pixels. Equation (39) can be easily extended to apply these methods to other color components and to alternative neighbor pixel constructions.
根据与之前相同的方法,等式(38)能够容易地用公式表示为线性等式的系统Following the same approach as before, equation (38) can be easily formulated as a system of linear equations
其可以如之前描述的一样进行求解。It can be solved as described before.
具有多于三个原色的VDR信号的应用Applications of VDR signals with more than three primary colors
所有所提出的MMR预测模型能够容易地扩展至具有多于三个原色的信号空间。作为示例,可以考虑这样的情况,其中SDR信号具有三个原色,比方说RGB,但是,VDR信号以具有六个原色的P6色空间来定义。在这种情况下,等式(1-3)能够改写为All proposed MMR prediction models can be easily extended to signal spaces with more than three primary colors. As an example, consider the case where the SDR signal has three primary colors, say RGB, but the VDR signal is defined in a P6 color space with six primary colors. In this case, equations (1-3) can be rewritten as
si=[si1 si2 si3], (41)s i =[s i1 s i2 s i3 ], (41)
vi=[vi1 vi2 vi3 vi4 vi5 vi6], (42)v i = [v i1 v i2 v i3 v i4 v i5 v i6 ], (42)
以及as well as
如之前一样,在一个颜色成分中的像素的数量表示为p。现在考虑等式(4)的一阶MMR预测模型(MMR-1),As before, the number of pixels in a color component is denoted as p. Now consider the first-order MMR prediction model (MMR-1) of equation (4),
现在是3×6矩阵并且n是1×6向量,通过下式给出Now it is a 3×6 matrix and n is a 1×6 vector, given by
以及n=[n11 n12 n13 n14 n15 n16]。 (46)And n=[n 11 n 12 n 13 n 14 n 15 n 16 ]. (46)
等式(41)能够使用单个预测矩阵M(1)表示为:Equation (41) can be expressed using a single prediction matrix M (1) as:
其中,in,
以及s′i=[1 si1 si2 si3]。 (48)and s′ i = [1 s i1 s i2 s i3 ]. (48)
通过将所有p个像素集合在一起,这个预测问题能够描述为By grouping all p pixels together, the prediction problem can be described as
其中,in,
是p×6矩阵,是p×4矩阵,并且M(1)是4×6矩阵。is a p×6 matrix, is a p×4 matrix, and M (1) is a 4×6 matrix.
更高阶的MMR预测模型还能够以类似的方式扩展并且经由之前描述的方法能够获得预测矩阵的解。Higher-order MMR prediction models can also be expanded in a similar manner and the solution of the prediction matrix can be obtained via the method described previously.
多通道、多元回归预测的示例处理Example processing of multi-channel and multivariate regression prediction
图4示出了根据本发明的示例实现方式的多通道多元回归预测的示例处理。FIG4 illustrates an example process of multi-channel multiple regression prediction according to an example implementation of the present invention.
该处理开始于步骤410,其中,预测算子(诸如预测算子250)接收输入的VDR和SDR信号。给定两个输入的情形下,在步骤420中预测算子决定选择哪个MMR模型。如前面所描述的,预测算子能够在多种MMR模型当中进行选择,这些模型包括(但不限于):一阶模型(MMR-1)、二阶模型(MMR-2)、三阶或更高阶模型、具有交叉相乘的一阶模型(MMR-1C)、具有交叉相乘的二阶模型(MMR-2C)、具有交叉相乘的三阶模型(MMR-3C)、具有交叉相乘的三阶或更高阶模型、或者添加有空间扩展的上述模型中的任何一种。The process begins at step 410, where a predictor (such as predictor 250) receives input VDR and SDR signals. Given the two inputs, the predictor determines which MMR model to select in step 420. As previously described, the predictor can select from a variety of MMR models, including but not limited to: a first-order model (MMR-1), a second-order model (MMR-2), a third-order or higher-order model, a first-order model with cross-multiplication (MMR-1C), a second-order model with cross-multiplication (MMR-2C), a third-order model with cross-multiplication (MMR-3C), a third-order or higher-order model with cross-multiplication, or any of the above models with added spatial expansion.
能够使用考虑到多个准则的多种方法来进行MMR模型的选择,所述准则包括:关于SDR和VDR输入的现有知识、可获得的计算和存储器资源以及目标编码效率。图5示出了基于残余要比预定阈值低的要求的步骤420的示例实现方式。The selection of the MMR model can be done using a variety of methods that take into account a number of criteria, including prior knowledge about the SDR and VDR inputs, available computational and memory resources, and target coding efficiency. Figure 5 shows an example implementation of step 420 based on the requirement that the residual be lower than a predetermined threshold.
如之前所描述的,如下形式的一组线性等式能够表示任意MMR模型As described previously, a set of linear equations of the form
其中,M是预测矩阵。Where M is the prediction matrix.
在步骤430,能够使用多种数值方法来求解M。例如,在使V与其估计之间的残余的均方值最小的约束下,At step 430, a variety of numerical methods can be used to solve for M. For example, under the constraint of minimizing the mean square of the residual between V and its estimate,
M=(STS)-1STV。 (51)M=(S T S) -1 S T V。 (51)
最后,在步骤440,使用等式(50),预测算子输出和M。Finally, at step 440, the operator outputs and M are predicted using equation (50).
图5示出了在预测期间用于选择MMR模型的示例处理420。在步骤510中预测算子250可以开始于初始MMR模型,诸如,已在之前的帧或场景中使用的MMR模型,例如,二阶模型(MMR-2),或最简单的可能模型,诸如MMR-1。针对M进行求解之后,在步骤520中,预测算子计算输入V与其预测的值之间的预测误差。在步骤530中,如果预测误差低于给定的阈值,则预测算子选择现有的模型并且停止选择处理(540),否则,在步骤550,检查是否使用更复杂的模型。例如,如果当前模型是MMR-2,则预测算子可决定使用MMR-2-C或MMR-2-C-S。如前面所描述的,此决定可取决于多种准则,包括预测误差的值、处理功率要求以及目标编码效率。如果使用更复杂的模型切实可行,则在步骤560中选择新模型并且处理返回步骤520。否则,预测算子将使用现有的模型(540)。FIG5 shows an example process 420 for selecting an MMR model during prediction. In step 510, the predictor 250 may start with an initial MMR model, such as an MMR model that has been used in a previous frame or scene, for example, a second-order model (MMR-2), or the simplest possible model, such as MMR-1. After solving for M, in step 520, the predictor calculates the prediction error between the input V and its predicted value. In step 530, if the prediction error is below a given threshold, the predictor selects the existing model and stops the selection process (540). Otherwise, in step 550, a check is made to see whether a more complex model should be used. For example, if the current model is MMR-2, the predictor may decide to use MMR-2-C or MMR-2-C-S. As described above, this decision may depend on a variety of criteria, including the value of the prediction error, processing power requirements, and target coding efficiency. If using a more complex model is feasible, the new model is selected in step 560 and the process returns to step 520. Otherwise, the predictor uses the existing model (540).
可根据需要以多种间隔来重复预测处理400,以在利用可用的计算资源的同时保持编码效率。例如,当对视频信号进行编码时,针对每一帧、一组帧或者每当预测残余超过特定阈值时,可基于每个预定义的视频片段大小来重复处理400。The prediction process 400 may be repeated at various intervals as needed to maintain coding efficiency while utilizing available computing resources. For example, when encoding a video signal, the process 400 may be repeated for each frame, a group of frames, or each time the prediction residual exceeds a certain threshold, based on each predefined video segment size.
预测处理400还能够使用所有可用的输入像素或者这些像素的子采样。在一个示例实现方式中,可以使用来自输入数据的每第k个像素行和每第k个像素列的像素,其中k是等于或大于2的整数。在另一示例实现方式中,可以决定跳过处于特定裁剪阈值(例如,非常接近于0)以下的输入像素或者处于特定饱和阈值(例如,对于n比特数据,非常接近于2n-1的像素值)以上的像素。在另一实现方式中,可使用这种子采样和阈值化技术的结合,以减小像素采样大小并且适应特定实现方式的计算约束。The prediction process 400 can also use all available input pixels or a subsample of these pixels. In one example implementation, pixels from every k-th pixel row and every k-th pixel column of the input data can be used, where k is an integer equal to or greater than 2. In another example implementation, a decision can be made to skip input pixels that are below a particular clipping threshold (e.g., very close to 0) or pixels that are above a particular saturation threshold (e.g., very close to 2n -1 for n-bit data). In another implementation, a combination of such subsampling and thresholding techniques can be used to reduce the pixel sample size and accommodate the computational constraints of a particular implementation.
图像解码Image decoding
可在图像编码器或在图像解码器上实现本发明的实施例。图6示出了根据本发明的实施例的解码器150的示例实现方式。解码系统600接收经编码的比特流,该经编码的比特流可兼有基层690、可选的增强层(或残余)665以及元数据645,它们在解压缩630和多种逆变换640之后被提取。例如,在VDR-SDR系统中,基层690可表示经编码的信号的SDR表示,并且元数据645可包括与在编码器预测算子250中使用的MMR预测模型和对应的预测参数有关的信息。在一种示例实现方式中,当编码器使用根据本发明的方法的MMR预测算子时,元数据可包括所使用的模型的识别(例如,MMR-1、MMR-2、MMR-2C等)以及与具体模型相关联的所有矩阵系数。给定基层690和从元数据645提取的颜色MMR相关的参数,则预测算子650能够利用本文所描述的对应等式中的任意等式来计算预测的例如,如果所识别的模型是MMR-2C,则能够利用等式(32)计算如果不存在残余,或者残余可忽略,则预测的值680能够作为最后的VDR图像直接输出。否则,在加法器660中,预测算子(680)的输出添加至残余665,以输出VDR信号670。Embodiments of the present invention may be implemented at an image encoder or at an image decoder. FIG6 shows an example implementation of a decoder 150 according to an embodiment of the present invention. The decoding system 600 receives an encoded bitstream that may have a base layer 690, an optional enhancement layer (or residual) 665, and metadata 645, which are extracted after decompression 630 and multiple inverse transforms 640. For example, in a VDR-SDR system, the base layer 690 may represent an SDR representation of the encoded signal, and the metadata 645 may include information about the MMR prediction model and corresponding prediction parameters used in the encoder predictor 250. In an example implementation, when the encoder uses an MMR predictor according to the method of the present invention, the metadata may include an identification of the model used (e.g., MMR-1, MMR-2, MMR-2C, etc.) and all matrix coefficients associated with the specific model. Given the base layer 690 and the color MMR-related parameters extracted from the metadata 645, the prediction operator 650 can calculate the predicted VDR image using any of the corresponding equations described herein. For example, if the identified model is MMR-2C, it can be calculated using equation (32). If there is no residual, or the residual is negligible, the predicted value 680 can be directly output as the final VDR image. Otherwise, in the adder 660, the output of the prediction operator (680) is added to the residual 665 to output the VDR signal 670.
示例计算机系统实现方式Example Computer System Implementation
可通过计算机系统、以电子电路和组件配置的系统、集成电路(IC,integratedcircuit)器件(诸如微控制器、现场可编程门阵列(FPGA,field programmable gatearray)或另一可配置或可编程逻辑器件(PLD,programmable logic device))、离散时间或数字信号处理器(DSP,digital signal processor)、专用IC(ASIC,application specificIC)和/或包括一个或多个这样的系统、器件或组件的装置来实现本发明的实施例。计算机和/或IC可执行、控制或运行与基于MMR的预测(诸如,如本文中所描述的那些)有关的指令。计算机和/或IC可计算与如本文所描述的MMR预测有关的多种参数或值中的任一个。可以以硬件、软件、固件以及它们的多种组合来实现图像和视频动态范围扩展实施例。Embodiments of the present invention may be implemented by a computer system, a system configured with electronic circuits and components, an integrated circuit (IC) device (such as a microcontroller, a field programmable gate array (FPGA) or another configurable or programmable logic device (PLD)), a discrete-time or digital signal processor (DSP), an application-specific IC (ASIC) and/or an apparatus comprising one or more such systems, devices or components. The computer and/or IC may execute, control or run instructions related to MMR-based predictions such as those described herein. The computer and/or IC may calculate any of a variety of parameters or values related to MMR predictions as described herein. The image and video dynamic range extension embodiments may be implemented in hardware, software, firmware and various combinations thereof.
本发明的特定实现方式包括计算机处理器,其运行软件指令,该软件指令使处理器执行本发明的方法。例如,显示器、编码器、机顶盒、代码转换器等中的一个或多个处理器可通过运行处理器可访问的程序存储器中的软件指令来实现如上所述的基于MMR的预测方法。本发明还以程序产品的形式提供。程序产品可包括任意介质,所述任意介质承载包括指令的一套计算机可读信号,所述指令在通过数据处理器执行时使数据处理器执行本发明的方法。根据本发明的程序产品可以是多种多样的形式中的任意一种形式。程序产品可包括例如物理介质,诸如包括软盘、硬盘驱动器的磁性数据存储介质,包括CD ROM、DVD的光数据存储介质,包括ROM、闪存RAM的电子数据存储介质,等。程序产品上的计算机可读信号可选地可以进行压缩或加密。A specific implementation of the present invention includes a computer processor that runs software instructions that cause the processor to perform the method of the present invention. For example, one or more processors in a display, an encoder, a set-top box, a code converter, etc. can implement the above-mentioned MMR-based prediction method by running software instructions in a program memory accessible to the processor. The present invention is also provided in the form of a program product. The program product may include any medium that carries a set of computer-readable signals including instructions that, when executed by a data processor, cause the data processor to perform the method of the present invention. The program product according to the present invention may be in any of a variety of forms. The program product may include, for example, physical media, such as magnetic data storage media including floppy disks and hard drives, optical data storage media including CD ROMs and DVDs, electronic data storage media including ROMs and flash RAMs, etc. The computer-readable signals on the program product may optionally be compressed or encrypted.
上面所指的组件(例如,软件模块、处理器、配件、器件、电路等),除非另有指示,否则对该组件的引用(包括对“装置”的引用)应被解释为包括作为那个组件的等同物的、执行所描述的组件的功能(例如,功能上等同)的任意组件,包括结构上不等同于执行本发明的所例举示例实施例中的功能的被公开结构的组件。Unless otherwise indicated, references to the components referred to above (e.g., software modules, processors, accessories, devices, circuits, etc.) (including references to "means") should be interpreted as including any components that are equivalent to that component and perform the function of the described component (e.g., functionally equivalent), including components that are not structurally equivalent to the disclosed structures that perform the functions in the illustrated example embodiments of the present invention.
等同、扩展、替代以及多样化Equivalence, extension, substitution, and diversification
因此描述了涉及在对VDR和SDR图像进行编码的过程中应用MMR预测的示例实施例。在前述的说明中,已参考会因实现方式不同而变化的很多具体细节来描述了本发明的实施例。因此,本发明的技术方案以及申请人认为发明所涉及的技术方案的唯一指示是一套权利要求,所述一套权利要求源于本申请、按照公布这样的权利要求所遵循的特定形式、包括后续的修正。本文针对在这样的权利要求中所包含的术语所给出的任何明确定义应涵盖如权利要求中所使用的这种术语的含义。因此,权利要求中没有明确陈述的限制、元件、属性、特征、优点或标志不应以任何方式限制这种权利要求的范围。因此说明书和附图被看作是说明性的而没有限制的意思。Thus described are example embodiments relating to the application of MMR prediction in the encoding of VDR and SDR images. In the foregoing description, embodiments of the invention have been described with reference to many specific details that will vary from implementation to implementation. Therefore, the only indication of the technical solution of the invention, and of the technical solution to which the applicant regards the invention relates, is the set of claims arising from this application, in the specific form in which such claims are published, including subsequent amendments. Any express definitions given herein for terms contained in such claims shall encompass the meaning of such terms as used in the claims. Therefore, no limitation, element, attribute, feature, advantage or characterizing that is not expressly recited in a claim should in any way limit the scope of such claim. The specification and drawings are therefore to be regarded in an illustrative and not limiting sense.
第一组附记:The first set of notes:
1.一种方法,包括:1. A method comprising:
接收第一图像和第二图像,其中,所述第二图像具有与所述第一图像不同的动态范围;receiving a first image and a second image, wherein the second image has a different dynamic range than the first image;
从MMR模型的族中选择多通道、多元回归(MMR)预测模型;Selecting a multi-channel, multiple regression (MMR) prediction model from the family of MMR models;
求解所选择的MMR模型的预测参数;Solve for the prediction parameters of the selected MMR model;
利用所述第二图像和所述MMR模型的预测参数来计算表示所述第一图像的预测值的输出图像;calculating an output image representing a predicted value of the first image using the second image and prediction parameters of the MMR model;
输出所述MMR模型的预测参数和所述输出图像。Output the predicted parameters of the MMR model and the output image.
2.根据附记1所述的方法,其中,所述第一图像包括VDR图像并且所述第二图像包括SDR图像。2. The method according to Note 1, wherein the first image comprises a VDR image and the second image comprises an SDR image.
3.根据附记1所述的方法,其中,所述MMR模型是一阶MMR模型、二阶MMR模型、三阶MMR模型、具有交叉相乘的一阶MMR模型、具有交叉相乘的二阶MMR模型或者具有交叉相乘的三阶MMR模型中的至少一个。3. The method according to Note 1, wherein the MMR model is at least one of a first-order MMR model, a second-order MMR model, a third-order MMR model, a first-order MMR model with cross-multiplication, a second-order MMR model with cross-multiplication, or a third-order MMR model with cross-multiplication.
4.根据附记3所述的方法,其中,所述MMR模型中的任意一个进一步包括涉及相邻像素的预测参数。4. The method according to Note 3, wherein any one of the MMR models further includes prediction parameters involving adjacent pixels.
5.根据附记4所述的方法,其中,所考虑的相邻像素包括左边相邻像素、右边相邻像素、上边相邻像素以及下边相邻像素。5. The method according to Note 4, wherein the adjacent pixels considered include left adjacent pixels, right adjacent pixels, top adjacent pixels, and bottom adjacent pixels.
6.根据附记2所述的方法,其中,在所述VDR图像中的像素具有比所述SDR图像中的像素更多的颜色成分。6. The method according to Note 2, wherein pixels in the VDR image have more color components than pixels in the SDR image.
7.根据附记1所述的方法,其中,求解所选择的MMR模型的预测参数进一步包括应用使所述第一图像与所述输出图像之间的均方误差最小化的数值方法。7. The method according to Note 1, wherein solving the prediction parameters of the selected MMR model further includes applying a numerical method that minimizes the mean square error between the first image and the output image.
8.根据附记1所述的方法,其中,从MMR模型的族中选择MMR预测模型进一步包括迭代选择处理,包括:8. The method according to note 1, wherein selecting the MMR prediction model from the family of MMR models further comprises an iterative selection process, comprising:
(a)选择并应用初始MMR模型;(a) Select and apply the initial MMR model;
(b)计算所述第一图像与所述输出图像之间的残余误差;(b) calculating a residual error between the first image and the output image;
(c)如果所述残余误差小于阈值并且无其他MMR模型可用则选择现有MMR模型;否则,选择与先前的模型不同的新MMR模型;并且返回步骤(b)。(c) If the residual error is less than a threshold and no other MMR model is available, select the existing MMR model; otherwise, select a new MMR model that is different from the previous model; and return to step (b).
9.一种图像解码方法,包括:9. An image decoding method, comprising:
接收具有第一动态范围的第一图像;receiving a first image having a first dynamic range;
接收元数据,其中,所述元数据定义MMR预测模型和所述MMR预测模型的对应的预测参数;receiving metadata, wherein the metadata defines an MMR prediction model and corresponding prediction parameters of the MMR prediction model;
将所述第一图像和所述预测参数应用到所述MMR预测模型,以计算表示第二图像的预测值的输出图像,其中,所述第二图像具有与所述第一图像的动态范围不同的动态范围。The first image and the prediction parameters are applied to the MMR prediction model to calculate an output image representing a predicted value for a second image, wherein the second image has a dynamic range different from a dynamic range of the first image.
10.根据附记9所述的方法,其中,所述MMR模型是一阶MMR模型、二阶MMR模型、三阶MMR模型、具有交叉相乘的一阶MMR模型、具有交叉相乘的二阶MMR模型或者具有交叉相乘的三阶MMR模型中的至少一个。10. The method according to Note 9, wherein the MMR model is at least one of a first-order MMR model, a second-order MMR model, a third-order MMR model, a first-order MMR model with cross-multiplication, a second-order MMR model with cross-multiplication, or a third-order MMR model with cross-multiplication.
11.根据附记10所述的方法,其中,所述MMR模型中的任意一个进一步包括涉及相邻像素的预测参数。11. The method according to Note 10, wherein any one of the MMR models further includes prediction parameters involving neighboring pixels.
12.根据附记9所述的方法,其中,所述第一图像包括SDR图像,并且所述第二图像包括VDR图像。12. The method according to Note 9, wherein the first image comprises an SDR image and the second image comprises a VDR image.
13.一种装置,包括处理器并且配置成执行附记1-12中所述的方法中的任意一种方法。13. A device comprising a processor and configured to perform any one of the methods described in Notes 1-12.
14.一种计算机可读存储介质,存储有用于执行根据附记1-12中的任意一项所述的方法的计算机可执行指令。14. A computer-readable storage medium storing computer-executable instructions for executing the method according to any one of Notes 1 to 12.
第二组附记:Second set of notes:
1.一种方法,包括:1. A method comprising:
提供多种多通道、多元回归(MMR)预测模型,每个MMR预测模型适于根据下列项来近似具有第一动态范围的图像,A plurality of multi-channel, multivariate regression (MMR) prediction models are provided, each MMR prediction model being adapted to approximate an image having a first dynamic range according to,
具有第二动态范围的图像,以及an image having a second dynamic range, and
通过应用颜色间图像预测而获得的所述各个MMR预测模型的预测参数;prediction parameters of each of the MMR prediction models obtained by applying inter-color image prediction;
接收第一图像和第二图像,其中,所述第二图像具有与所述第一图像不同的动态范围;receiving a first image and a second image, wherein the second image has a different dynamic range than the first image;
从所述多种MMR模型中选择多通道、多元回归(MMR)预测模型;selecting a multi-channel, multiple regression (MMR) prediction model from the plurality of MMR models;
确定所选择的MMR模型的预测参数的值;determining values of prediction parameters of the selected MMR model;
基于所述第二图像和应用于所选择的MMR预测模型的预测参数的所确定值来计算对所述第一图像进行近似的输出图像;calculating an output image that approximates the first image based on the second image and the determined values of prediction parameters applied to the selected MMR prediction model;
输出所述预测参数的所确定的值和所计算的输出图像,其中,所述多种MMR模型包括根据如下公式的结合每个像素的颜色成分之间的交叉相乘的一阶多通道、多元回归预测模型,Outputting the determined values of the prediction parameters and the calculated output image, wherein the plurality of MMR models includes a first-order multi-channel, multivariate regression prediction model incorporating cross-multiplications between color components of each pixel according to the following formula,
其中,表示所述第一图像的第i像素的所预测的三个颜色成分,wherein represents the three predicted color components of the i-th pixel of the first image,
si=[si1 si2 si3]表示所述第二图像的第i像素的三个颜色成分,s i =[s i1 s i2 s i3 ] represents the three color components of the i-th pixel of the second image,
根据下式,是3×3矩阵并且n是1×3向量According to the following formula, is a 3×3 matrix and n is a 1×3 vector
和n=[n11 n12 n13],and n = [n 11 n 12 n 13 ],
sci=[si1·si2 si1·si3 si2·si3 si1·si2·si3],并且sc i =[s i1 ·s i2 s i1 ·s i3 s i2 ·s i3 s i1 ·s i2 ·s i3 ], and
其中,通过使所述第一图像与所述输出图像之间的均方误差最小化来在数值上获得所述一阶多通道、多元回归预测模型的预测参数。The prediction parameters of the first-order multi-channel, multivariate regression prediction model are numerically obtained by minimizing the mean square error between the first image and the output image.
2.根据附记1所述的方法,其中,所述第一图像包括VDR图像并且所述第二图像包括SDR图像。2. The method according to Note 1, wherein the first image comprises a VDR image and the second image comprises an SDR image.
3.根据附记1所述的方法,其中,所选择的MMR预测模型是一阶MMR模型、二阶MMR模型、三阶MMR模型、具有交叉相乘的一阶MMR模型、具有交叉相乘的二阶MMR模型或者具有交叉相乘的三阶MMR模型中的至少一个。3. The method according to Note 1, wherein the selected MMR prediction model is at least one of a first-order MMR model, a second-order MMR model, a third-order MMR model, a first-order MMR model with cross-multiplication, a second-order MMR model with cross-multiplication, or a third-order MMR model with cross-multiplication.
4.根据附记3所述的方法,其中,所述MMR模型中的任意一个进一步包括涉及相邻像素的预测参数。4. The method according to Note 3, wherein any one of the MMR models further includes prediction parameters involving adjacent pixels.
5.根据附记4所述的方法,其中,所述相邻像素包括左边相邻像素、右边相邻像素、上边相邻像素以及下边相邻像素。5. The method according to Note 4, wherein the adjacent pixels include left adjacent pixels, right adjacent pixels, top adjacent pixels, and bottom adjacent pixels.
6.根据附记2所述的方法,其中,在所述VDR图像中的像素具有比所述SDR图像中的像素更多的颜色成分。6. The method according to Note 2, wherein pixels in the VDR image have more color components than pixels in the SDR image.
7.根据附记1所述的方法,其中,从所述多种MMR预测模型中选择MMR预测模型进一步包括迭代选择处理,包括:7. The method according to Supplementary Note 1, wherein selecting the MMR prediction model from the plurality of MMR prediction models further comprises an iterative selection process, comprising:
(a)选择并应用初始MMR预测模型;(a) Select and apply an initial MMR prediction model;
(b)计算所述第一图像与所述输出图像之间的残余误差;(b) calculating a residual error between the first image and the output image;
(c)如果所述残余误差小于误差阈值并且无其他MMR预测模型能够选择,则选择所述初始MMR模型;否则,从所述多种MMR预测模型中选择新MMR预测模型,所述新MMR预测模型不同于之前选择的MMR预测模型;并且返回步骤(b)。(c) If the residual error is less than the error threshold and no other MMR prediction model can be selected, select the initial MMR model; otherwise, select a new MMR prediction model from the multiple MMR prediction models, and the new MMR prediction model is different from the previously selected MMR prediction model; and return to step (b).
8.一种图像解码方法,包括:8. An image decoding method, comprising:
接收具有第一动态范围的第一图像;receiving a first image having a first dynamic range;
接收元数据,其中,所述元数据包括Receive metadata, wherein the metadata includes
多元回归(MMR)预测模型,所述多元回归预测模型适于根据下列项来近似具有第二动态范围的第二图像,a multivariate regression (MMR) prediction model adapted to approximate a second image having a second dynamic range according to,
所述第一图像,以及the first image, and
通过应用颜色间图像预测而获得的所述MMR预测模型的预测参数,所述元数据进一步包括所述预测参数的先前确定值,以及prediction parameters of the MMR prediction model obtained by applying inter-color image prediction, the metadata further comprising previously determined values of the prediction parameters, and
将所述第一图像和所述预测参数的先前确定值应用于所述MMR预测模型,以计算用于近似所述第二图像的输出图像,其中,所述第二动态范围不同于所述第一动态范围,其中,所述MMR预测模型是根据如下公式的结合每个像素的颜色成分之间的交叉相乘的一阶多通道、多元回归预测模型,applying the first image and previously determined values of the prediction parameters to the MMR prediction model to calculate an output image for approximating the second image, wherein the second dynamic range is different from the first dynamic range, wherein the MMR prediction model is a first-order multi-channel, multivariate regression prediction model incorporating cross-multiplications between color components of each pixel according to the following formula,
其中,表示所述第一图像的第i像素的所预测的三个颜色成分,wherein represents the three predicted color components of the i-th pixel of the first image,
si=[si1 si2 si3]表示所述第二图像的第i像素的三个颜色成分,s i =[s i1 s i2 s i3 ] represents the three color components of the i-th pixel of the second image,
根据下式,是3×3矩阵并且n是1×3向量According to the following formula, is a 3×3 matrix and n is a 1×3 vector
和n=[n11 n12 n13],and n = [n 11 n 12 n 13 ],
sci=[si1·si2 si1·si3 si2·si3 si1·si2·si3],并且sc i =[s i1 ·s i2 s i1 ·s i3 s i2 ·s i3 s i1 ·s i2 ·s i3 ], and
9.根据附记8所述的方法,其中,将所述一阶MMR预测模型扩展成具有像素交叉相乘的二阶MMR预测模型或三阶MMR预测模型。9. The method according to Note 8, wherein the first-order MMR prediction model is expanded into a second-order MMR prediction model or a third-order MMR prediction model with pixel cross multiplication.
10.根据附记8或9所述的方法,其中,所述MMR预测模型进一步包括涉及相邻像素的预测参数。10. The method according to Note 8 or 9, wherein the MMR prediction model further includes prediction parameters involving adjacent pixels.
11.根据附记8所述的方法,其中,所述第一图像包括SDR图像和所述第二图像包括VDR图像。11. The method according to Note 8, wherein the first image comprises an SDR image and the second image comprises a VDR image.
12.一种装置,包括处理器并且配置成执行附记1-11中所述的方法中的任意一种方法。12. A device comprising a processor and configured to perform any one of the methods described in Notes 1-11.
13.一种计算机可读存储介质,存储有用于执行根据附记1-11中的任意一项所述的方法的计算机可执行指令。13. A computer-readable storage medium storing computer-executable instructions for executing the method according to any one of Notes 1-11.
14.一种方法,包括:14. A method comprising:
提供多种多通道、多元回归(MMR)预测模型,每个MMR预测模型适于根据下列项来近似具有第一动态范围的图像,A plurality of multi-channel, multivariate regression (MMR) prediction models are provided, each MMR prediction model being adapted to approximate an image having a first dynamic range according to,
具有第二动态范围的图像,以及an image having a second dynamic range, and
通过应用颜色间图像预测而获得的所述各个MMR预测模型的预测参数;prediction parameters of each of the MMR prediction models obtained by applying inter-color image prediction;
接收第一图像和第二图像,其中,所述第二图像具有与所述第一图像不同的动态范围;receiving a first image and a second image, wherein the second image has a different dynamic range than the first image;
从所述多种MMR模型中选择多通道、多元回归(MMR)预测模型;selecting a multi-channel, multiple regression (MMR) prediction model from the plurality of MMR models;
确定所选择的MMR模型的预测参数的值;determining values of prediction parameters of the selected MMR model;
基于所述第二图像和应用于所选择的MMR预测模型的预测参数的所确定值来计算对所述第一图像进行近似的输出图像;calculating an output image that approximates the first image based on the second image and the determined values of prediction parameters applied to the selected MMR prediction model;
输出所述预测参数的所确定值和所计算的输出图像,其中,所述多种MMR模型包括根据如下公式的二阶多通道、多元回归预测,Outputting the determined values of the prediction parameters and the calculated output image, wherein the multiple MMR models include second-order multi-channel, multivariate regression prediction according to the following formula,
其中,表示所述第一图像的第i像素的所预测的三个颜色成分,wherein represents the three predicted color components of the i-th pixel of the first image,
si=[si1 si2 si3]表示所述第二图像的第i像素的三个颜色成分,s i =[s i1 s i2 s i3 ] represents the three color components of the i-th pixel of the second image,
根据下式,和是3×3矩阵并且n是1×3向量,According to the following formula, and are 3×3 matrices and n is a 1×3 vector,
n=[n11 n12 n13],n=[n 11 n 12 n 13 ],
以及as well as
其中,通过使所述第一图像与所述输出图像之间的均方误差最小化来在数值上获得所述二阶多通道、多元回归预测模型的预测参数。The prediction parameters of the second-order multi-channel, multivariate regression prediction model are numerically obtained by minimizing the mean square error between the first image and the output image.
15.根据附记14所述的方法,其中,所述MMR模型中的任意一个包括涉及相邻像素的预测参数。15. The method according to Note 14, wherein any one of the MMR models includes prediction parameters involving neighboring pixels.
16.根据附记15所述的方法,其中,所述相邻像素包括左边相邻像素、右边相邻像素、上边相邻像素以及下边相邻像素。16. The method according to Note 15, wherein the adjacent pixels include left adjacent pixels, right adjacent pixels, top adjacent pixels, and bottom adjacent pixels.
17.根据附记14所述的方法,其中,从所述多种MMR预测模型中选择MMR预测模型进一步包括迭代选择处理,包括:17. The method according to supplementary note 14, wherein selecting the MMR prediction model from the plurality of MMR prediction models further comprises an iterative selection process, comprising:
(a)选择并应用初始MMR预测模型;(a) Select and apply an initial MMR prediction model;
(b)计算所述第一图像与所述输出图像之间的残余误差;(b) calculating a residual error between the first image and the output image;
(c)如果所述残余误差小于误差阈值并且无其他MMR模型能够选择则选择初始MMR模型;否则,从所述多种MMR预测模型中选择新MMR预测模型,所述新MMR预测模型不同于先前选择的MMR预测模型;并且返回步骤(b)。(c) If the residual error is less than the error threshold and no other MMR model can be selected, select the initial MMR model; otherwise, select a new MMR prediction model from the multiple MMR prediction models, and the new MMR prediction model is different from the previously selected MMR prediction model; and return to step (b).
18.一种图像解码方法,包括:18. An image decoding method, comprising:
接收具有第一动态范围的第一图像;receiving a first image having a first dynamic range;
接收元数据,其中,所述元数据包括Receive metadata, wherein the metadata includes
多元回归(MMR)预测模型,所述多元回归预测模型适于根据下列项来近似具有第二动态范围的第二图像,a multivariate regression (MMR) prediction model adapted to approximate a second image having a second dynamic range according to,
所述第一图像,以及the first image, and
通过应用颜色间图像预测而获得的所述MMR预测模型的预测参数,所述元数据进一步包括所述预测参数的先前确定值;以及prediction parameters of the MMR prediction model obtained by applying inter-color image prediction, the metadata further comprising previously determined values of the prediction parameters; and
将所述第一图像和所述预测参数的先前确定值应用于所述MMR预测模型,以计算用于近似所述第二图像的输出图像,其中,所述第二动态范围不同于所述第一动态范围,其中,所述MMR预测模型是根据如下公式的二阶多通道、多元回归预测,Applying the first image and previously determined values of the prediction parameters to the MMR prediction model to calculate an output image for approximating the second image, wherein the second dynamic range is different from the first dynamic range, wherein the MMR prediction model is a second-order multi-channel, multivariate regression prediction according to the following formula,
其中,表示所述第一图像的第i像素的所预测的三个颜色成分,wherein represents the three predicted color components of the i-th pixel of the first image,
表示所述第二图像的第i像素的三个颜色成分,represents the three color components of the i-th pixel of the second image,
根据下式,和是3×3矩阵并且n是1×3向量,According to the following formula, and are 3×3 matrices and n is a 1×3 vector,
n=[n11 n12 n13],n=[n 11 n 12 n 13 ],
以及as well as
其中,通过使所述第一图像与所述输出图像之间的均方误差最小化来在数值上获得所述二阶多通道、多元回归预测模型的预测参数。The prediction parameters of the second-order multi-channel, multivariate regression prediction model are numerically obtained by minimizing the mean square error between the first image and the output image.
19.根据附记18所述的方法,其中,将所述二阶MMR预测模型扩展成具有像素交叉相乘的二阶MMR预测模型或三阶MMR预测模型。19. The method according to Note 18, wherein the second-order MMR prediction model is expanded into a second-order MMR prediction model or a third-order MMR prediction model with pixel cross multiplication.
20.根据附记18或19所述的方法,其中,所述MMR预测模型进一步包括涉及相邻像素的预测参数。20. The method according to Note 18 or 19, wherein the MMR prediction model further includes prediction parameters involving neighboring pixels.
21.一种装置,包括处理器并且配置成执行附记14-17中所述的方法中的任意一种方法。21. An apparatus comprising a processor and configured to perform any one of the methods described in Notes 14-17.
22.一种计算机可读存储介质,存储有用于执行根据附记14-17中的任意一项所述的方法的计算机可执行指令。22. A computer-readable storage medium storing computer-executable instructions for executing the method according to any one of Notes 14-17.
Claims (10)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US61/475,359 | 2011-04-14 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1241609A1 HK1241609A1 (en) | 2018-06-08 |
| HK1241609B true HK1241609B (en) | 2020-10-30 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6490178B2 (en) | Multi-color channel multiple regression predictor | |
| US20140037205A1 (en) | Image Prediction Based on Primary Color Grading Model | |
| EP2807823B1 (en) | Piecewise cross color channel predictor | |
| US8971408B2 (en) | Piecewise cross color channel predictor | |
| HK1241609B (en) | Image decoding method, video decoder, and non-transitory computer-readable storage medium | |
| HK1241609A1 (en) | Image decoding method, video decoder, and non-transitory computer-readable storage medium | |
| HK1238458B (en) | Multiple color channel multiple regression predictor | |
| HK1238458A1 (en) | Multiple color channel multiple regression predictor | |
| HK1204741B (en) | Piecewise cross color channel predictor | |
| HK1193688B (en) | Multiple color channel multiple regression predictor |