[go: up one dir, main page]

CN109889829B - Fast sample adaptive compensation for 360 degree video - Google Patents

Fast sample adaptive compensation for 360 degree video Download PDF

Info

Publication number
CN109889829B
CN109889829B CN201910094768.2A CN201910094768A CN109889829B CN 109889829 B CN109889829 B CN 109889829B CN 201910094768 A CN201910094768 A CN 201910094768A CN 109889829 B CN109889829 B CN 109889829B
Authority
CN
China
Prior art keywords
video
sao
projection
ctu
threshold
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910094768.2A
Other languages
Chinese (zh)
Other versions
CN109889829A (en
Inventor
张萌萌
刘志
岳�文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Xiaoyu Interactive Co.,Ltd.
Original Assignee
North China University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by North China University of Technology filed Critical North China University of Technology
Priority to CN201910094768.2A priority Critical patent/CN109889829B/en
Publication of CN109889829A publication Critical patent/CN109889829A/en
Application granted granted Critical
Publication of CN109889829B publication Critical patent/CN109889829B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

一种在高效视频编码(HEVC)中针对360度视频的SAO的方法,包括:对360度视频执行ERP投影,以获得ERP投影视频;对所述ERP投影视频中的当前帧的CTU执行帧内预测或帧间预测,以确定最佳RD‑cost;将所述CTU的RD‑cost与阈值进行比较以判断是否执行SAO,其中,所述阈值是至少部分地基于针对所述ERP投影视频的量化参数和ERP投影权重来确定的,并且其中,所述ERP投影权重是至少部分地基于所述ERP投影视频的高度中的CTU数量以及所述CTU在所述ERP投影视频的当前帧中的位置来确定的。

Figure 201910094768

A method for SAO for 360-degree video in High Efficiency Video Coding (HEVC), comprising: performing ERP projection on the 360-degree video to obtain an ERP projection video; performing intra-frame on a CTU of a current frame in the ERP projection video prediction or inter prediction to determine the optimal RD-cost; comparing the RD-cost of the CTU to a threshold to determine whether to perform SAO, wherein the threshold is based at least in part on quantization for the ERP projection video parameters and ERP projection weights, and wherein the ERP projection weights are based at least in part on the number of CTUs in the height of the ERP projection video and the position of the CTUs in the current frame of the ERP projection video definite.

Figure 201910094768

Description

360度视频的快速样点自适应补偿Fast sample adaptive compensation for 360 video

技术领域technical field

本发明涉及图像与视频处理领域,更具体而言,涉及在高效视频编码(HEVC)中针对360度视频的快速样点自适应补偿(SAO)。The present invention relates to the field of image and video processing, and more particularly, to fast sample adaptive compensation (SAO) for 360-degree video in High Efficiency Video Coding (HEVC).

背景技术Background technique

2010年4月,两大国际视频编码标准组织VCEG和MPEG成立视频压缩联合小组JCT-VC(Joint collaborative Team on Video Coding),一同开发高效视频编码HEVC(Highefficiency video coding)标准,其也称为H.265。HEVC标准主要目标是与上一代标准H.264/AVC实现大幅度的编码效率的提高,尤其是针对高分辨率视频序列。其目标是在相同视频质量(PSNR)下码率降为H.264标准的50%。In April 2010, the two major international video coding standards organizations, VCEG and MPEG, established a joint video compression group, JCT-VC (Joint collaborative Team on Video Coding), to jointly develop the HEVC (Highefficiency video coding) standard for high-efficiency video coding, also known as H .265. The main goal of the HEVC standard is to achieve a substantial improvement in coding efficiency with the previous generation standard H.264/AVC, especially for high-resolution video sequences. The goal is to reduce the bit rate to 50% of the H.264 standard at the same video quality (PSNR).

就目前阶段,HEVC依然沿用H.264就开始采用的混合编码框架。帧间和帧内预测编码:消除时间域和空间域的相关性。变换编码:对残差进行变换编码以消除空间相关性。熵编码:消除统计上的冗余度。HEVC将在混合编码框架内,着力研究新的编码工具或技术,提高视频压缩效率。At the current stage, HEVC still uses the hybrid coding framework adopted by H.264. Inter and Intra Predictive Coding: De-correlation between temporal and spatial domains. Transform coding: Transform coding the residuals to remove spatial correlations. Entropy coding: Eliminate statistical redundancy. HEVC will focus on researching new coding tools or technologies within the hybrid coding framework to improve video compression efficiency.

目前,JCT-VC组织的讨论中已经提出的许多编码的新特性,有可能会加入HEVC标准中,各次讨论的具体文献可以从http://wftp3.itu.int获得。At present, many new features of coding that have been proposed in the discussions organized by JCT-VC may be added to the HEVC standard. The specific documents of each discussion can be obtained from http://wftp3.itu.int.

HEVC标准[4]的第一版已经在2013年的一月份完成。并于2013年4月、2014年10月和2015年4月相继发布的3个版本,这些版本能够很容易地从网络上获得,并且本申请将上述HEVC标准的三个版本并入本说明书中作为本发明的背景技术。The first version of the HEVC standard [4] was completed in January 2013. And three versions were successively released in April 2013, October 2014 and April 2015, these versions can be easily obtained from the Internet, and this application incorporates the three versions of the above HEVC standard into this specification as the background art of the present invention.

在HEVC中,由于仍然使用基于块的混合编码框架,因此仍然需要处理方块效应、振铃效应等等。为了降低此类失真对视频质量的影响,HEVC采用了环路滤波技术(In-loopfiltering),其包括去方块滤波(Deblocking filtering)和像素样本自适应补偿(DampleAdaptive Offset,SAO)。SAO是HEVC的许多新技术之一[5]。如图1所示,SAO位于去块滤波器之后。SAO对每个编码树单元(CTU)的每个像素进行分类和统计,计算补偿值,选择最佳SAO类型,并且将SAO类型和补偿值写入码流中。然后,将偏移值添加到重构帧的每个像素,以减少重建帧与原始帧之间的失真。SAO可以显着提高主客观视频质量[5]。SAO主要由三部分组成:统计收集,SAO类型决策和SAO过滤,如图2所示。In HEVC, since the block-based hybrid coding framework is still used, there is still a need to deal with blocking, ringing, etc. In order to reduce the influence of such distortion on video quality, HEVC adopts in-loop filtering technology, which includes deblocking filtering (Deblocking filtering) and pixel sample adaptive compensation (Dample Adaptive Offset, SAO). SAO is one of many new technologies for HEVC [5]. As shown in Figure 1, SAO is located after the deblocking filter. The SAO classifies and counts each pixel of each coding tree unit (CTU), calculates the compensation value, selects the best SAO type, and writes the SAO type and the compensation value into the code stream. Then, an offset value is added to each pixel of the reconstructed frame to reduce the distortion between the reconstructed frame and the original frame. SAO can significantly improve the subjective and objective video quality [5]. SAO mainly consists of three parts: statistics collection, SAO type decision and SAO filtering, as shown in Figure 2.

统计收集:SAO主要有两种需要统计收集过程的偏移类型:边界补偿(EO)和边带补偿(BO)。对于EO类型,有四种EO子类型(EO 0°,EO 90°,EO 135°和EO 45°)。根据分类规则对每个EO子类型进行分类,并计算每个类别中的像素数和失真总和。对于BO类型,像素强度被等分为32个边带,并且根据分类规则对32个边带进行分类,并且计算每个类别中的像素数和失真总和。EO和BO的分类规则如图2所示。Statistics Collection: SAO has two main types of excursions that require a statistics collection process: boundary compensation (EO) and sideband compensation (BO). For the EO type, there are four EO subtypes (EO 0°, EO 90°, EO 135° and EO 45°). Each EO subtype is classified according to the classification rules and the sum of the number of pixels and distortion in each class is calculated. For the BO type, the pixel intensities were equally divided into 32 sidebands, and the 32 sidebands were classified according to the classification rule, and the number of pixels and the sum of distortions in each class were calculated. The classification rules for EO and BO are shown in Figure 2.

SAO类型决策:可以选择四种SAO类型:EO,BO,OFF和MERGE,其中,OFF表示不应用SAO,其在视频码流中通过一个开关参数来实现,MERGE表示对于一个块,其SAO参数直接使用上方或左侧的块的SAO,这时只需要标识采用了哪个相邻块的SAO参数即可。根据统计收集的信息,SAO类型决策通过快速率失真优化(RDO)过程[6]计算每个SAO类型的最优补偿值,并选择最优SAO类型。SAO type decision: Four SAO types can be selected: EO, BO, OFF and MERGE, where OFF means that SAO is not applied, which is implemented by a switch parameter in the video stream, and MERGE means that for a block, its SAO parameter is directly To use the SAO of the block above or to the left, it is only necessary to identify which SAO parameter of the adjacent block is used. Based on the statistically collected information, the SAO type decision calculates the optimal compensation value for each SAO type through a rapid rate-distortion optimization (RDO) process [6], and selects the optimal SAO type.

SAO滤波:根据获得的最佳SAO类型和偏移值对CTU的每个像素进行分类和补偿。SAO filtering: Classifies and compensates each pixel of the CTU according to the obtained best SAO type and offset value.

图2显示SAO过程由三部分组成:统计收集,SAO类型决策和SAO过滤。[16]研究了各部分的计算复杂度。结果表明,统计收集约占SAO总处理时间的82%,SAO类型决策和SAO滤波分别为11%和7%。复杂的统计收集过程是制约SAO处理速度的主要因素。Figure 2 shows that the SAO process consists of three parts: statistics collection, SAO type decision and SAO filtering. [16] studied the computational complexity of each part. The results show that statistics collection accounts for about 82% of the total processing time of SAO, and SAO type decision and SAO filtering account for 11% and 7%, respectively. The complex statistical collection process is the main factor restricting the processing speed of SAO.

在虚拟现实系统中,多个摄像头用于捕捉360度场景,随后拍摄的场景被拼接成球形格式的360度视频。用户可以通过头戴式设备自由观看360度场景中的任何场景显示(HMD)并获得身临其境的体验[1]。360度视频是一种新的视频编码内容。虽然360度视频是在HEVC标准提出之后才流行的并且360度视频是球形视频,但是[2]已经提出了HEVC标准下的360度视频编码框架。在典型的360度视频压缩框架中,球形视频需要在编码前转换为平面视频,平面视频需要在编码后转换为球形视频[3]。转换方式称为投影。已经提出了多种投影格式,例如,等矩形投影(ERP),调整的等面积投影(AEP),立方体投影(CMP),等角立方图投影(EAC),截断正方形金字塔投影(TSP),紧凑的八面体投影(COHP),紧凑二十面体投影(CISP)等。当选择ERP作为投影格式时,360度视频的编码过程包括:将原始视频投影为ERP投影格式,并对ERP投影视频执行编解码,将ERP投影格式的重建视频重新反投影为重建的视频。投影过程对于360度视频编码是必不可少的。作为中间格式的投影格式影响360度视频的编码性能。实际上,目前尚未确定哪种投影格式具有最佳编码性能。但是,ERP被广泛使用,是360度视频的默认格式。因此,本文主要研究ERP投影格式的特点。In a virtual reality system, multiple cameras are used to capture a 360-degree scene, which is then stitched into a spherical format 360-degree video. Users can freely watch any scene display (HMD) in a 360-degree scene through a head-mounted device and get an immersive experience [1]. 360-degree video is a new video encoding content. Although 360-degree video became popular after the HEVC standard was proposed and 360-degree video is spherical video, [2] has proposed a 360-degree video coding framework under the HEVC standard. In a typical 360-degree video compression framework, spherical video needs to be converted to flat video before encoding, and flat video needs to be converted to spherical video after encoding [3]. The way of transformation is called projection. Various projection formats have been proposed, such as equirectangular projection (ERP), adjusted equal area projection (AEP), cube projection (CMP), equirectangular projection (EAC), truncated square pyramid projection (TSP), compact The octahedral projection (COHP), the compact icosahedral projection (CISP), etc. When ERP is selected as the projection format, the encoding process of the 360-degree video includes: projecting the original video into the ERP projection format, performing encoding and decoding on the ERP projection video, and re-projecting the reconstructed video in the ERP projection format into the reconstructed video. The projection process is essential for 360-degree video encoding. The projection format as an intermediate format affects the encoding performance of 360 video. In fact, it has not yet been determined which projection format has the best encoding performance. However, ERP is widely used and is the default format for 360-degree video. Therefore, this paper mainly studies the characteristics of ERP projection format.

与平面视频相比,360度视频具有不同的特征,现有的SAO快速算法的最佳参数和过程不适用于360度视频。在本申请中,基于360度视频的特点,提出了一种针对360度视频的快速SAO算法。Compared with flat videos, 360-degree videos have different characteristics, and the optimal parameters and procedures of existing SAO fast algorithms are not suitable for 360-degree videos. In this application, based on the characteristics of 360-degree video, a fast SAO algorithm for 360-degree video is proposed.

本申请是是对现有HEVC协议的改进,为了使得本领域技术人员能够充分理解本发明,以下附上了本申请中提及的多种概念的引用文献,这些文献被整体上并入本文并作为本申请说明书的一部分。This application is an improvement to the existing HEVC protocol. In order to enable those skilled in the art to fully understand the present invention, citations for various concepts mentioned in this application are attached below, which are incorporated herein in their entirety and incorporated herein by reference. as part of the specification of this application.

1.B.Luo,F.Xu,C.Richardt and J.Yong,″Parallax360:Stereoscopic 360°Scene Representation for Head-Motion Parallax,″in IEEE Transactions onVisualization and Computer Graphics,vol.24,no.4,pp.1545-1553,April 2018.1.B.Luo, F.Xu, C.Richardt and J.Yong, "Parallax360: Stereoscopic 360°Scene Representation for Head-Motion Parallax," in IEEE Transactions onVisualization and Computer Graphics, vol.24, no.4, pp .1545-1553, April 2018.

2.Y.Y,E.Alshina,J.Boyce,“Algorithm descriptions of projection formatconversion and video quality metrics in 360Lib”,Joint Video Exploration Teamof ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11,JVET-H1004,7th Meeting,July2017.2.Y.Y, E.Alshina, J.Boyce, "Algorithm descriptions of projection formatconversion and video quality metrics in 360Lib", Joint Video Exploration Teamof ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, JVET-H1004, 7th Meeting, July2017.

3.W.Zou,F.Yang and S.Wan,″Perceptual video quality metric forcompression artefacts:from two-dimensional to omnidirectional,″in IET InageProcessing,vol.12,no.3,pp.374-381,3 2018.3. W. Zou, F. Yang and S. Wan, "Perceptual video quality metric for compression artefacts: from two-dimensional to omnidirectional," in IET InageProcessing, vol.12, no.3, pp.374-381, 3 2018 .

4.Sullivan,Gary J.,et al.″Overview of the high efficiency videocoding(HEVC)standard.″Circuits and Systems for Video Technology,IEEETransactions on 22.12(2012):1649-1668.4. Sullivan, Gary J., et al. "Overview of the high efficiency videocoding (HEVC) standard." Circuits and Systems for Video Technology, IEEE Transactions on 22.12 (2012): 1649-1668.

5.C.-M.Fu,E.Alshina,A.Alshin,Y.-W.Huang,C.-Y.Chen,C.-Y.Tsai,C.-W.Hsu,S.-M.Lei,J.-H.Park,and W.-J.Han,”Sample adaptive offset in the HEVCstandard,”Circuits and Systems for Video Technology,IEEE Transactions on,vol.22,no.12,pp.1755-1764,2012.5.C.-M.Fu, E.Alshina, A.Alshin, Y.-W.Huang, C.-Y.Chen, C.-Y.Tsai, C.-W.Hsu, S.-M. Lei, J.-H.Park, and W.-J.Han, "Sample adaptive offset in the HEVCstandard," Circuits and Systems for Video Technology, IEEE Transactions on, vol.22, no.12, pp.1755-1764 , 2012.

6.Zhang M,Bai H,Lin C,et al.Texture Characteristics Based Fast CodingUnit Partition in HEVC Intra Coding,Data Compression Conference.IEEE,2015:477-477.6. Zhang M, Bai H, Lin C, et al. Texture Characteristics Based Fast CodingUnit Partition in HEVC Intra Coding, Data Compression Conference. IEEE, 2015: 477-477.

7.Z.Zhengyong,C.Zhiyun and P.Peng,″A fast SAO algorithm based oncoding unit partition for HEVC,″2015 6th IEEE International Conference onSoftware Engineering and Service Science(ICSESS),Beijing,2015,pp.392-395.7.Z.Zhengyong, C.Zhiyun and P.Peng, "A fast SAO algorithm based on coding unit partition for HEVC," 2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS), Beijing, 2015, pp.392-395 .

8.J.Joo,Y.Choi and K.Lee,″Fast sample adaptive offset encodingalgorithm for HEVC based on intra prediction mode,″2013 IEEE ThirdInternational Conference on Consumer Electronics Berlin(ICCE-Berlin),Berlin,2013,pp.50-53.8. J. Joo, Y. Choi and K. Lee, "Fast sample adaptive offset encoding algorithm for HEVC based on intra prediction mode," 2013 IEEE Third International Conference on Consumer Electronics Berlin (ICCE-Berlin), Berlin, 2013, pp.50 -53.

9.T.Y.Kuo,H.Chiu and F.Amirul,″Fast sample adaptive offset encodingfor HEVC,″2016 IEEE International Conference on Consumer Electronics-Taiwan(ICCE-TW),Nantou,2016,pp.1-2.9. T.Y.Kuo, H.Chiu and F.Amirul, "Fast sample adaptive offset encoding for HEVC," 2016 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), Nantou, 2016, pp.1-2.

10.S.Yin,X.Zhang and Z.Gao,″Efficient SAO coding algorithm for x265encoder,″2015 Visual Communications and Image Processing(VCIP),Singapore,2015,pp.1-4.10. S. Yin, X. Zhang and Z. Gao, "Efficient SAO coding algorithm for x265encoder," 2015 Visual Communications and Image Processing (VCIP), Singapore, 2015, pp.1-4.

11.S.E.Gendy,A.Shalaby and M.S.Sayed,″Fast parameter estimationalgorithm for sample adaptive offset in HEVC encoder,″2015 VisualCommunications and Image Processing(VCIP),Singapore,2015,pp.1-4.11. S.E.Gendy, A.Shalaby and M.S.Sayed, "Fast parameter estimation algorithm for sample adaptive offset in HEVC encoder," 2015 Visual Communications and Image Processing (VCIP), Singapore, 2015, pp.1-4.

12.K.Yang,S.Wan,Y.Gong,Y.Yang and Y.Feng,″Fast sample adaptive offsetfor H.265/HEVC based on temporal dependency,″2016 Asia-Pacific Signal andInformation Processing Association Annual Summit and Conference(APSIPA),Jeju,2016,pp.1-4.12. K. Yang, S. Wan, Y. Gong, Y. Yang and Y. Feng, "Fast sample adaptive offset for H.265/HEVC based on temporal dependency," 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Jeju, 2016, pp.1-4.

13.Sungjei Kim,Jinwoo Jeong,Jeong-Mee Moon and Yong-HWan Kim,″Fastsample adaptive offset parameter estimation algorithm based on earlytermination for HEVC encoder,″2017 IEEE International Conference on ConsumerElectronics(ICCE),Las Vegas,NV,2017,pp.241-242.13. Sungjei Kim, Jinwoo Jeong, Jeong-Mee Moon and Yong-HWan Kim, "Fastsample adaptive offset parameter estimation algorithm based on earlytermination for HEVC encoder," 2017 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, 2017, pp.241-242.

14.W.Zhang and C.Guo,″Design and implementation of parallelalgorithms for sample adaptive offset in HEVC based on GPU,″2016 SixthInternational Conference on Information Science and Technology(ICIST),Dalian,2016,pp.181-187.14. W. Zhang and C. Guo, "Design and implementation of parallelalgorithms for sample adaptive offset in HEVC based on GPU," 2016 Sixth International Conference on Information Science and Technology (ICIST), Dalian, 2016, pp.181-187.

15.Y.Wang,X.Guo,Y.Lu,X.Fan and D.Zhao,″GPU-based optimization forsample adaptive offset in HEVC,″2016 IEEE International Conference on ImageProcessing(ICIP),Phoenix,AZ,2016,pp.829-833.15.Y.Wang, X.Guo, Y.Lu, X.Fan and D.Zhao, "GPU-based optimization forsample adaptive offset in HEVC," 2016 IEEE International Conference on ImageProcessing(ICIP), Phoenix, AZ, 2016, pp.829-833.

16.Y.Choi and J.Joo,″Exploration of Practical HEVC/H.265 SampleAdaptive offset Encoding Policies,″in IEEE Signal Processing Letters,vol.22,no.4,pp.465-468,April 2015.16. Y. Choi and J. Joo, "Exploration of Practical HEVC/H.265 SampleAdaptive offset Encoding Policies," in IEEE Signal Processing Letters, vol. 22, no. 4, pp. 465-468, April 2015.

17.Y.Li,J.Xu and Z.Chen,″Spherical domain rate-distortionoptimization for 360-degree video coding,″2017 IEEE International Conferenceon Multimedia and Expo(ICME),Hong Kong,2017,pp.709-714.17. Y. Li, J. Xu and Z. Chen, "Spherical domain rate-distortion optimization for 360-degree video coding," 2017 IEEE International Conferenceon Multimedia and Expo (ICME), Hong Kong, 2017, pp.709-714.

18.Y.Sun and L.Yu,″Coding optimization based on weighted-to-spherically-uniform quality metric for 360 video,″2017 IEEE VisualCommunications and Image Processing(VCIP),St.Petersburg,FL,2017,pp.1-4.18. Y. Sun and L. Yu, "Coding optimization based on weighted-to-spherically-uniform quality metric for 360 video," 2017 IEEE Visual Communications and Image Processing (VCIP), St. Petersburg, FL, 2017, pp.1 -4.

19.Jill Boyce,Elena Alshina,Adeel Abbas,“JVET common test conditionsand evaluation procedures for 360°video”,Joint Video Exploration Team(JVET)ofITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11,JVET-H1030,8th Meeting,Oct.2017.19. Jill Boyce, Elena Alshina, Adeel Abbas, “JVET common test conditions and evaluation procedures for 360° video”, Joint Video Exploration Team (JVET) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11 , JVET-H1030, 8th Meeting, Oct.2017.

20.X.Xiu,Y.He,Y.Ye and B.Vishwanath,″An evaluation framework for 360-degtee video compression,″2017 IEEE Visual Communications and ImageProcessing(VCIP),St.Petersburg,FL,2017,pp.1-4.20. X. Xiu, Y. He, Y. Ye and B. Vishwanath, "An evaluation framework for 360-degtee video compression," 2017 IEEE Visual Communications and ImageProcessing (VCIP), St. Petersburg, FL, 2017, pp. 1-4.

21.H.Bai,C.Zhu and Y.Zhao,″Optimized Multiple Description LatticeVector Quantization for Wavelet Image Coding,″in IEEE Transactions onCircuits and Systems for Video Technology,vol.17,no.7,pp.912-917,July 2007.21. H. Bai, C. Zhu and Y. Zhao, "Optimized Multiple Description LatticeVector Quantization for Wavelet Image Coding," in IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.7, pp.912-917, July 2007.

22.C.Yeh,Z.Zhang,M.Chen and C.Lin,″HEVC Intra Frame Coding Based onConvolutional Neural Network,″in IEEE Access,vol.6,pp.50087-50095,2018.22. C. Yeh, Z. Zhang, M. Chen and C. Lin, "HEVC Intra Frame Coding Based on Convolutional Neural Network," in IEEE Access, vol.6, pp.50087-50095, 2018.

23.L.Chang,Z.Liu,L.Wang and X.Li,″Enhance the HEVC Fast Intra CU ModeDecision Based on Convolutional Neural Network by Corner Power Estimation,″2018 Data Compression Conference,Snowbird,UT,2018,pp.400-400。23. L. Chang, Z. Liu, L. Wang and X. Li, "Enhance the HEVC Fast Intra CU ModeDecision Based on Convolutional Neural Network by Corner Power Estimation," 2018 Data Compression Conference, Snowbird, UT, 2018, pp. 400-400.

24.T.Katayama,K.Kuroda,W.Shi,T.Song and T.Shimamoto,″Low-complexityintra coding algorithm based on convolutional neural network for HEVC,″2018International Conference on Information and Computer Technologies (ICICT),DeKalb,IL,2018,pp.115-118.24. T. Katayama, K. Kuroda, W. Shi, T. Song and T. Shimamoto, "Low-complexityintra coding algorithm based on convolutional neural network for HEVC," 2018International Conference on Information and Computer Technologies (ICICT), DeKalb, IL, 2018, pp.115-118.

25.G.Bjontegaard,“Calculation of average PSNR differences between RD-curves,”ITU-T VCEG-M33,2001.25. G. Bjontegaard, "Calculation of average PSNR differences between RD-curves," ITU-T VCEG-M33, 2001.

发明内容SUMMARY OF THE INVENTION

本发明针对360度视频的特性,提出了一种用于360度视频的快速SAO方法。所提出的算法改进了SAO过程,在保留整个SAO过程的基础上,增加了简化的SAO过程。经过基于阈值的SAO执行决策后,可以使用简化的SAO流程,这将大大减少统计数据收集的时间,从而降低SAO的计算复杂度。Aiming at the characteristics of 360-degree video, the present invention proposes a fast SAO method for 360-degree video. The proposed algorithm improves the SAO process by adding a simplified SAO process on the basis of retaining the entire SAO process. After the threshold-based SAO execution decision, a simplified SAO process can be used, which will greatly reduce the time for statistical data collection and thus reduce the computational complexity of SAO.

根据本发明的一个方面,提出了一种在高效视频编码(HEVC)中针对360度视频的样点自适应补偿(SAO)的方法,该方法包括:According to one aspect of the present invention, a method for sample adaptive compensation (SAO) for 360-degree video in High Efficiency Video Coding (HEVC) is proposed, the method comprising:

对所述360度视频执行投影,以获得投影视频;performing projection on the 360-degree video to obtain a projection video;

对所述ERP投影视频中的当前帧的编码树单元(CTU)执行帧内预测或帧间预测,以确定最佳RD-cost;performing intra-frame prediction or inter-frame prediction on the coding tree unit (CTU) of the current frame in the ERP projection video to determine the best RD-cost;

将所述CTU的RD-cost与阈值进行比较以判断是否执行SAO,comparing the RD-cost of the CTU with a threshold to determine whether to perform SAO,

其中,所述阈值是至少部分地基于针对所述投影视频的量化参数和投影权重来确定的,并且其中,所述投影权重是至少部分地基于所述投影视频的高度中的CTU数量以及所述CTU在所述投影视频的当前帧中的位置来确定的。wherein the threshold is determined based at least in part on a quantization parameter and a projection weight for the projected video, and wherein the projection weight is based at least in part on the number of CTUs in the height of the projected video and the The position of the CTU in the current frame of the projected video is determined.

根据本发明的进一步的方面,所述方法进一步包括:如果判定不执行SAO,则至少不对所述CTU执行边界补偿(EO)和边带补偿(BO);如果判定执行SAO,则针对对所述CTU执行OFF或MERGE操作之一。According to a further aspect of the present invention, the method further comprises: if it is determined not to perform SAO, at least not performing boundary compensation (EO) and sideband compensation (BO) on the CTU; if it is determined to perform SAO, The CTU performs one of the OFF or MERGE operations.

根据本发明的进一步的方面,其中,所述阈值至少部分地基于以下至少一项:所述投影权重的以2为底的对数,或所述量化参数的e的幂,或其组合。According to a further aspect of the invention, wherein the threshold is based, at least in part, on at least one of: the base-2 logarithm of the projection weight, or the quantization parameter raised to a power of e, or a combination thereof.

根据本发明的进一步的方面,其中,仅针对所述投影视频中的上方1/4和下方1/4的高度,将所述CTU的RD-cost与阈值进行比较以判断是否执行SAO。According to a further aspect of the present invention, the RD-cost of the CTU is compared with a threshold to determine whether to perform SAO only for the upper 1/4 and lower 1/4 heights in the projected video.

根据另一方面,一种高效视频编码(HEVC)硬件编码器,其适于针对360度视频的样点自适应补偿(SAO),所述编码器被配置为执行上述方法。According to another aspect, a High Efficiency Video Coding (HEVC) hardware encoder adapted for sample adaptive compensation (SAO) for 360 degree video, the encoder configured to perform the above method.

根据另一方面,本发明提出了一种对使用如所述的方法或如所述的编码器进行编码的360视频流进行解码的解码器。According to another aspect, the present invention proposes a decoder for decoding a 360 video stream encoded using the method as described or the encoder as described.

根据另一方面,本发明提出了一种用于执行上述方法的计算机程序产品。According to another aspect, the present invention proposes a computer program product for performing the above method.

根据另一方面,本发明提出了一种可用于视频编解码的设备,该设备包括:一个或多个处理器;存储器,其中存储有计算机代码,所述计算机代码当由所述处理器执行时,实现上述方法。According to another aspect, the present invention proposes a device usable for video encoding and decoding, the device comprising: one or more processors; a memory in which computer code is stored, the computer code when executed by the processor , to implement the above method.

根据另一方面,所述投影为等矩形投影(ERP)。According to another aspect, the projection is an equirectangular projection (ERP).

附图说明Description of drawings

图1示出了HEVC的编码器框图的一个实施例。Figure 1 shows one embodiment of an encoder block diagram for HEVC.

图2示出HEVC中的SAO的简要框图。Figure 2 shows a simplified block diagram of SAO in HEVC.

图3示出了ERP投影的权重分布。Figure 3 shows the weight distribution of ERP projections.

图4示出了根据本公开内容的各个方面的方法流程图。4 illustrates a method flow diagram in accordance with various aspects of the present disclosure.

图5示出了根据本公开内容的各个方面的用于视频编解码的设备的示意图。5 shows a schematic diagram of an apparatus for video encoding and decoding according to various aspects of the present disclosure.

具体实施方式Detailed ways

现在参考附图来描述各种方案。在以下描述中,为了进行解释,阐述了多个具体细节以便提供对一个或多个方案的透彻理解。然而,显然,在没有这些具体细节的情况下也能够实现这些方案。Various schemes are now described with reference to the figures. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of one or more aspects. Obviously, however, these schemes can be implemented without these specific details.

如在本申请中所使用的,术语“组件”、“模块”、“系统”等等旨在指代与计算机相关的实体,例如但不限于,硬件、固件、硬件和软件的组合、软件,或者是执行中的软件。例如,组件可以是但不限于:在处理器上运行的进程、处理器、对象、可执行体(executable)、执行线程、程序、和/或计算机。举例而言,运行在计算设备上的应用程序和该计算设备都可以是组件。一个或多个组件可以位于执行进程和/或者执行线程内,并且组件可以位于一台计算机上和/或者分布在两台或更多台计算机上。另外,这些组件可以从具有存储在其上的各种数据结构的各种计算机可读介质执行。组件可以借助于本地和/或远程进程进行通信,例如根据具有一个或多个数据分组的信号,例如,来自于借助于信号与本地系统、分布式系统中的另一组件交互和/或者与在诸如因特网之类的网络上借助于信号与其他系统交互的一个组件的数据。As used in this application, the terms "component," "module," "system," and the like are intended to refer to computer-related entities such as, but not limited to, hardware, firmware, a combination of hardware and software, software, or software in execution. For example, a component may be, but is not limited to, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer. For example, both an application running on a computing device and the computing device can be components. One or more components can reside within a process and/or thread of execution, and a component can be localized on one computer and/or distributed across two or more computers. In addition, these components can execute from various computer readable media having various data structures stored thereon. Components may communicate by means of local and/or remote processes, for example, based on signals having one or more data packets, for example, from interacting with another component in a local system, a distributed system by means of signals, and/or with another component in a distributed system. Data for a component on a network such as the Internet that interacts with other systems by means of signals.

本发明针对360度视频的特性,提出了一种用于360度视频的快速SAO方法。所提出的算法改进了SAO过程,在保留整个SAO过程的基础上,增加了简化的SAO过程。经过基于阈值的SAO执行决策后,可以选择执行HEVC规定的常规SAO流程或者选择不执行SAO或仅执行MERGE处理,从而实现了简化的SAO流程,这将大大减少统计数据收集的时间,从而降低SAO的计算复杂度。Aiming at the characteristics of 360-degree video, the present invention proposes a fast SAO method for 360-degree video. The proposed algorithm improves the SAO process by adding a simplified SAO process on the basis of retaining the entire SAO process. After a threshold-based SAO execution decision, one can choose to execute the regular SAO process specified by HEVC or choose not to execute SAO or only execute MERGE processing, thus realizing a simplified SAO process, which will greatly reduce the time for statistical data collection, thereby reducing SAO computational complexity.

I.算法概述I. Algorithm Overview

(1)360度视频中的权重(weight)(1) Weight in 360-degree video

360度视频是一种球形视频,是360度视频与传统视频的最大区别。为了在HEVC标准下编码360度视频,必须将360度视频投影到平面视频中。虽然投影视频和传统视频都是平面视频,但投影视频具有球形视频的失真和拉伸。因此,传统视频的客观质量评价指标PSNR不适合投影视频。WS-PSNR被提出作为投影视频的客观质量评估指标[18]。WS-PSNR为投影视频设计权重,投影视频在失真和拉伸区域的权重较小,反之亦然,然后通过加权平均法计算WS-PSNR。WS-PSNR被联合视频探索团队(JVET)认可为360度视频质量的客观质量评估指标。因此,权重是投影视频和传统视频之间的最大差异。360-degree video is a spherical video, which is the biggest difference between 360-degree video and traditional video. In order to encode a 360-degree video under the HEVC standard, the 360-degree video must be projected into a flat video. While both projected video and traditional video are flat videos, projected video has the distortion and stretching of spherical video. Therefore, the objective quality evaluation index PSNR of traditional video is not suitable for projection video. WS-PSNR was proposed as an objective quality assessment metric for projected videos [18]. WS-PSNR designs weights for the projected video, and the projected video has less weight in the distorted and stretched regions, and vice versa, and then calculates the WS-PSNR by the weighted average method. WS-PSNR is recognized by the Joint Video Exploration Team (JVET) as an objective quality assessment metric for 360-degree video quality. Therefore, weight is the biggest difference between projected video and traditional video.

Figure GSB0000181112430000081
Figure GSB0000181112430000081

其中(i,j)表示像素位置,height表示视频的高度。图3显示了ERP权重分布。颜色越深,越接近0。颜色越浅,越接近1。Region0定义为两极附近区域,权重小;Region1定义为赤道附近的区域,权重大。如图3所示,Region0包括视频的上1/4区域和下1/4区域;Region1代表视频的中间1/2区域。where (i, j) represents the pixel position and height represents the height of the video. Figure 3 shows the ERP weight distribution. The darker the color, the closer it is to 0. The lighter the color, the closer to 1. Region0 is defined as the area near the poles with a small weight; Region1 is defined as the area near the equator with a heavy weight. As shown in Figure 3, Region0 includes the upper 1/4 area and the lower 1/4 area of the video; Region1 represents the middle 1/2 area of the video.

(2)RD-cost(2)RD-cost

在HEVC中,通过速率失真优化(RDO)[17]递归地计算HEVC标准中的帧内预测和帧间预测的最佳预测模式和最佳CU划分。In HEVC, the optimal prediction modes and optimal CU partitions for intra prediction and inter prediction in the HEVC standard are recursively calculated by Rate Distortion Optimization (RDO) [17].

J=D+λ·R (2)J=D+λ·R (2)

其中D表示当前预测模式中的失真,R表示在当前预测模式中对所有信息进行编码所需的比特数,λ是拉格朗日因子,并且J表示拉格朗日代价(RD-cost)。RD-cost越小,预测模式的编码效率越高,并且RD-cost越大,预测模式的编码效率越低。where D represents the distortion in the current prediction mode, R represents the number of bits required to encode all the information in the current prediction mode, λ is the Lagrangian factor, and J represents the Lagrangian cost (RD-cost). The smaller the RD-cost, the higher the coding efficiency of the prediction mode, and the larger the RD-cost, the lower the coding efficiency of the prediction mode.

(3)所提出的算法(3) The proposed algorithm

在本发明的一个实施例中,设置阈值(Threshold)以预先确定是否执行SAO过程。当RD-cost>Threshold时执行SAO过程,否则,不执行SAO过程。通过基于阈值来跳过SAO处理,能够大大减少编码计算量。In one embodiment of the present invention, a threshold (Threshold) is set to predetermine whether to perform the SAO process. When RD-cost>Threshold, the SAO process is performed, otherwise, the SAO process is not performed. By skipping SAO processing based on a threshold, the amount of encoding computation can be greatly reduced.

在本发明的一个实施例中,用于确定是否执行SAO过程的阈值是至少部分地基于用于360度视频的量化参数和ERP投影权重来确定的。In one embodiment of the invention, the threshold for determining whether to perform the SAO process is determined based at least in part on the quantization parameters and ERP projection weights for the 360-degree video.

在本发明的一个实施例中,量化系数对于是否执行SAO的判定具有影响。根据实验统计,随着量化系数的增加,CTU不应用SAO的概率增加;随着量化系数的增加,同一CTU的RD-cost也增加。因此,在确定阈值时考虑量化系数,可以所设定的阈值对是否执行SAO的判定更为准确。In one embodiment of the present invention, the quantization coefficients have an influence on the decision whether to perform SAO. According to experimental statistics, with the increase of quantization coefficient, the probability of CTU not applying SAO increases; with the increase of quantization coefficient, the RD-cost of the same CTU also increases. Therefore, the quantization coefficient is considered when determining the threshold value, and the determination of whether to execute SAO can be more accurate with the set threshold value.

在一个示例而非限制性的实施例中,可以考虑量化参数的e的幂。In one exemplary and non-limiting embodiment, the quantization parameter raised to the power of e may be considered.

在本发明的一个实施例中,投影视频的权重对于是否执行SAO的判定具有影响。由于在本文中使用ERP投影,因此所述权重也被称为ERP投影权重。在研究中,发明人注意到对于ERP投影的权重,Region0中CTU的权重接近0,因此Region0中CTU的失真对最终视频质量影响不大。因此,在对于是否执行SAO的判定中考虑投影视频的权重符合了ERP投影的投影视频的特点,使得该判定不会造成编码视频质量的下降。In one embodiment of the present invention, the weight of the projected video has an impact on the decision whether to perform SAO. Since ERP projections are used in this paper, the weights are also referred to as ERP projection weights. During the research, the inventors noticed that for the weight of the ERP projection, the weight of the CTU in Region0 is close to 0, so the distortion of the CTU in Region0 has little effect on the final video quality. Therefore, the weight of the projected video is considered in the determination of whether to execute the SAO, which conforms to the characteristics of the projected video projected by the ERP, so that the determination will not cause degradation of the quality of the encoded video.

在本发明的一个示例而非限制性的实施例中,发明人提出如下权重设置方式:In an exemplary but non-limiting embodiment of the present invention, the inventor proposes the following weight setting methods:

Figure GSB0000181112430000091
Figure GSB0000181112430000091

其中(x,y)表示CTU的位置,m是视频高度中的CTU数量,weight(x,y)表示CTU的所有像素的平均权重。因此,不同位置的权重是不同的,在Region1中CTU的失真对视频质量的影响要远小于Region0中CTU的失真对视频质量的影响。因此,对Region1中的CTU进行详细决策并对Region0中的CTU进行粗略决策是一种适应投影视频特性的方法。where (x, y) represents the position of the CTU, m is the number of CTUs in the video height, and weight(x, y) represents the average weight of all pixels of the CTU. Therefore, the weights of different positions are different, and the influence of the distortion of the CTU in Region1 on the video quality is much smaller than the influence of the distortion of the CTU in Region0 on the video quality. Therefore, making detailed decisions for CTUs in Region1 and coarse decisions for CTUs in Region0 is a way to adapt to the characteristics of projected video.

在本发明的一个实施例中,发明人根据不同纬度(对应于y值)的权重修改阈值。发明人使用

Figure GSB0000181112430000103
来表示不同纬度的比例因子。纬度越大,因子越大。In one embodiment of the present invention, the inventor modifies the threshold according to the weight of different latitudes (corresponding to the y value). Inventor uses
Figure GSB0000181112430000103
to represent scale factors for different latitudes. The larger the latitude, the larger the factor.

在一个示例而非限制性的实施例中,给出了如以下表1所示的阈值设置。In an exemplary, non-limiting embodiment, threshold settings are given as shown in Table 1 below.

表1.ERP投影视频在不同QP下改进的阈值Table 1. Improved thresholds for ERP projection videos at different QPs

Figure GSB0000181112430000101
Figure GSB0000181112430000101

其中,α和1-α分别表示固定阈值与可变阈值的百分比,QP为量化参数。Among them, α and 1-α represent the percentage of fixed threshold and variable threshold, respectively, and QP is the quantization parameter.

在本发明的一个实施例中,发明人考虑量化参数的e的幂。例如,在另一个示例而非限制性的实施例中,阈值可以如下确定:In one embodiment of the present invention, the inventors consider the quantization parameter raised to the power of e. For example, in another exemplary and non-limiting embodiment, the threshold may be determined as follows:

Figure GSB0000181112430000102
Figure GSB0000181112430000102

在本发明的一个实施例中,如上所述地,发明人注意到在Region0中CTU的失真对视频质量的影响要远小于Region1中CTU的失真对视频质量的影响。因此为了在减小由SAO引起的计算量的同时确保视频质量,可以仅针对ERP投影视频中的Region0(即当前帧上方1/4和下方1/4的高度)来进行上述操作,而对Region1(当前帧中间1/2高度的区域)仍然进行HEVC协议规定的SAO处理。In one embodiment of the present invention, as described above, the inventor noticed that the influence of the distortion of the CTU in Region0 on the video quality is much smaller than the influence of the distortion of the CTU in the Region1 on the video quality. Therefore, in order to ensure the video quality while reducing the amount of computation caused by SAO, the above operations can be performed only for Region0 in the ERP projection video (that is, the height above 1/4 and below the current frame), while for Region1 (The area of 1/2 height in the middle of the current frame) still performs the SAO processing specified by the HEVC protocol.

在本发明的一个实施例中,由于MERGE操作的计算量较小,因此在进行上述操作时,可以考虑在满足阈值条件(例如小于阈值)时,执行MERGE操作或者不执行SAO操作(OFF操作)。In an embodiment of the present invention, since the calculation amount of the MERGE operation is small, when the above operation is performed, it may be considered that when the threshold condition (for example, less than the threshold) is satisfied, the MERGE operation is performed or the SAO operation (OFF operation) is not performed. .

图4示出了根据本公开内容的各个方面的方法流程图。该方法用于在高效视频编码(HEVC)中针对360度视频的样点自适应补偿(SAO)。4 illustrates a method flow diagram in accordance with various aspects of the present disclosure. This method is used for Sample Adaptive Compensation (SAO) for 360-degree video in High Efficiency Video Coding (HEVC).

根据一个实施例,该方法包括:对所述360度视频执行等矩形投影(ERP),以获得ERP投影视频。本领域技术人员容易理解,可以执行除了ERP之外的其他投影方法,具体的投影方法并不是本发明的关注点。According to one embodiment, the method includes performing an equirectangular projection (ERP) on the 360-degree video to obtain an ERP projection video. Those skilled in the art can easily understand that other projection methods other than ERP can be implemented, and the specific projection method is not the focus of the present invention.

根据另一个实施例,该方法还包括:对所述ERP投影视频中的当前帧的编码树单元(CTU)执行帧内预测或帧间预测,以确定最佳RD-cost。According to another embodiment, the method further comprises: performing intra-frame prediction or inter-frame prediction on the coding tree unit (CTU) of the current frame in the ERP projection video to determine the best RD-cost.

根据另一个实施例,该方法还包括:将CTU的RD-cost与阈值进行比较以判断是否执行SAO,其中,所述阈值是至少部分地基于针对ERP投影视频的量化参数和ERP投影权重来确定的,并且其中,所述ERP投影权重是至少部分地基于所述ERP投影视频的高度中的CTU数量以及所述CTU在所述ERP投影视频的当前帧中的位置来确定的。According to another embodiment, the method further includes comparing the RD-cost of the CTU to a threshold value determined based at least in part on quantization parameters for the ERP projection video and ERP projection weights to determine whether to perform SAO , and wherein the ERP projection weight is determined based at least in part on the number of CTUs in the height of the ERP projection video and the position of the CTU in the current frame of the ERP projection video.

根据另一个实施例,如果判定不执行SAO,则至少不对所述CTU执行边界补偿(EO)和边带补偿(BO);而如果判定执行SAO,则针对对所述CTU执行OFF或MERGE操作之一。According to another embodiment, if it is determined not to perform SAO, at least boundary compensation (EO) and sideband compensation (BO) are not performed on the CTU; and if it is determined to perform SAO, then at least the CTU is not performed with an OFF or MERGE operation. one.

根据另一个实施例,所述阈值至少部分地基于以下至少一项:所述ERP投影权重的以2为底的对数,或所述量化参数的e的幂,或其组合。According to another embodiment, the threshold is based, at least in part, on at least one of: a base-2 logarithm of the ERP projection weight, or a power of e of the quantization parameter, or a combination thereof.

根据另一个实施例,仅针对ERP投影视频中的上方1/4和下方1/4的高度,将所述CTU的RD-cost与阈值进行比较以判断是否执行SAO。According to another embodiment, the RD-cost of the CTU is compared with a threshold to determine whether to perform SAO only for the upper 1/4 and lower 1/4 heights in the ERP projection video.

图5示出了根据本公开内容的各个方面的用于视频编解码的设备的示意图。如图5所示,该设备可以包括一个或多个处理器和存储器,所述存储器中存储有计算机代码,所述计算机代码当由所述处理器执行时,实现如本文所述的在高效视频编码(HEVC)中针对360度视频的样点自适应补偿(SAO)的方法。5 shows a schematic diagram of an apparatus for video encoding and decoding according to various aspects of the present disclosure. As shown in FIG. 5, the apparatus may include one or more processors and a memory having computer code stored in the memory that, when executed by the processor, implements high-efficiency video as described herein. A method of sample adaptive compensation (SAO) for 360-degree video in coding (HEVC).

根据另一方面,本公开内容还可以涉及用于实现上述编码方法的编码器。该编码器可以是专用硬件。According to another aspect, the present disclosure may also relate to an encoder for implementing the above encoding method. The encoder can be dedicated hardware.

根据另一方面,本公开内容还可以涉及对应的对编码后的360视频流进行解码的解码器。According to another aspect, the present disclosure may also relate to a corresponding decoder for decoding the encoded 360 video stream.

根据另一方面,本公开内容还可以涉及执行本文所述方法的计算机程序产品。According to another aspect, the present disclosure may also relate to a computer program product for performing the methods described herein.

当用硬件实现时,视频编码器可以用通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或其它可编程逻辑器件、分立门或晶体管逻辑器件、分立硬件组件或者设计为执行本文所述功能的其任意组合,来实现或执行。通用处理器可以是微处理器,但是可替换地,该处理器也可以是任何常规的处理器、控制器、微控制器或者状态机。处理器也可以实现为计算设备的组合,例如,DSP和微处理器的组合、多个微处理器的组合、一个或多个微处理器与DSP内核的组合或者任何其它此种结构。另外,至少一个处理器可以包括可操作以执行上述的一个或多个步骤和/或操作的一个或多个模块。When implemented in hardware, the video encoder may use a general purpose processor, digital signal processor (DSP), application specific integrated circuit (ASIC), field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic A device, discrete hardware component, or any combination thereof designed to perform the functions described herein is implemented or performed. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, eg, a combination of a DSP and a microprocessor, a combination of multiple microprocessors, a combination of one or more microprocessors and a DSP core, or any other such configuration. Additionally, at least one processor may include one or more modules operable to perform one or more of the steps and/or operations described above.

当用ASIC、FPGA等硬件电路来实现视频编码器时,其可以包括被配置为执行各种功能的各种电路块。本领域技术人员可以根据施加在整个系统上的各种约束条件来以各种方式设计和实现这些电路,来实现本发明所公开的各种功能。When the video encoder is implemented with hardware circuits such as ASIC, FPGA, etc., it may include various circuit blocks configured to perform various functions. Those skilled in the art can design and implement these circuits in various ways according to various constraints imposed on the entire system to achieve various functions disclosed in the present invention.

尽管前述公开文件论述了示例性方案和/或实施例,但应注意,在不背离由权利要求书定义的描述的方案和/或实施例的范围的情况下,可以在此做出许多变化和修改。而且,尽管以单数形式描述或要求的所述方案和/或实施例的要素,但也可以设想复数的情况,除非明确表示了限于单数。另外,任意方案和/或实施例的全部或部分都可以与任意其它方案和/或实施例的全部或部分结合使用,除非表明了有所不同。While the foregoing disclosure discusses exemplary aspects and/or embodiments, it should be noted that many changes and/or changes may be made herein without departing from the scope of the described aspects and/or embodiments as defined by the claims. Revise. Furthermore, although elements of the described aspects and/or embodiments are described or claimed in the singular, the plural is contemplated unless limitation to the singular is expressly stated. Additionally, all or a portion of any aspect and/or embodiment may be used in conjunction with all or a portion of any other aspect and/or embodiment, unless indicated to the contrary.

Claims (6)

1.一种在高效视频编码(HEVC)中针对360度视频的样点自适应补偿(SAO)的方法,包括:1. A method of sample adaptive compensation (SAO) for 360-degree video in High Efficiency Video Coding (HEVC), comprising: 对所述360度视频执行投影,以获得投影视频;performing projection on the 360-degree video to obtain a projection video; 对所述投影视频中的当前帧的编码树单元(CTU)执行帧内预测或帧间预测,以确定最佳RD-cost;performing intra-frame prediction or inter-frame prediction on the coding tree unit (CTU) of the current frame in the projected video to determine the optimal RD-cost; 将所述CTU的RD-cost与阈值进行比较以判断是否执行SAO,comparing the RD-cost of the CTU with a threshold to determine whether to perform SAO, 其中,所述阈值是至少部分地基于针对所述投影视频的量化参数和投影权重来确定的,并且其中,所述投影权重是至少部分地基于所述投影视频的高度中的CTU数量以及所述CTU在所述投影视频的当前帧中的位置来确定的。wherein the threshold is determined based at least in part on a quantization parameter and a projection weight for the projected video, and wherein the projection weight is based at least in part on the number of CTUs in the height of the projected video and the The position of the CTU in the current frame of the projected video is determined. 2.如权利要求1所述的方法,进一步包括:2. The method of claim 1, further comprising: 如果判定不执行SAO,则至少不对所述CTU执行边界补偿(EO)和边带补偿(BO);If it is determined not to perform SAO, at least not perform boundary compensation (EO) and sideband compensation (BO) for the CTU; 如果判定执行SAO,则针对对所述CTU执行OFF或MERGE操作之一。If it is determined to perform SAO, one of OFF or MERGE operations is performed for the CTU. 3.如权利要求1或2所述的方法,其中,所述阈值至少部分地基于以下至少一项:所述投影权重的以2为底的对数,或所述量化参数的e的幂,或其组合。3. The method of claim 1 or 2, wherein the threshold is based, at least in part, on at least one of the base-2 logarithms of the projection weights, or the quantization parameter raised to a power of e, or a combination thereof. 4.如权利要求1或2所述的方法,其中,仅针对所述投影视频中的上方1/4和下方1/4的高度,将所述CTU的RD-cost与阈值进行比较以判断是否执行SAO。4. The method of claim 1 or 2, wherein the RD-cost of the CTU is compared with a threshold only for the height of the upper 1/4 and the lower 1/4 in the projected video to determine whether Execute SAO. 5.如权利要求1或2所述的方法,其中,所述投影为等矩形投影(ERP)。5. The method of claim 1 or 2, wherein the projection is an equirectangular projection (ERP). 6.一种可用于视频编解码的设备,该设备包括:6. A device that can be used for video encoding and decoding, the device comprising: 一个或多个处理器;one or more processors; 存储器,其中存储有计算机代码,所述计算机代码当由所述处理器执行时,实现在高效视频编码(HEVC)中针对360度视频的样点自适应补偿(SAO)的方法,所述方法包括:a memory having stored therein computer code that, when executed by the processor, implements a method of Sample Adaptive Compensation (SAO) for 360-degree video in High Efficiency Video Coding (HEVC), the method comprising : 对所述360度视频执行投影,以获得投影视频;performing projection on the 360-degree video to obtain a projection video; 对所述投影视频中的当前帧的编码树单元(CTU)执行帧内预测或帧间预测,以确定最佳RD-cost;performing intra-frame prediction or inter-frame prediction on the coding tree unit (CTU) of the current frame in the projected video to determine the optimal RD-cost; 将所述CTU的RD-cost与阈值进行比较以判断是否执行SAO,comparing the RD-cost of the CTU with a threshold to determine whether to perform SAO, 其中,所述阈值是至少部分地基于针对所述投影视频的量化参数和投影权重来确定的,并且其中,所述投影权重是至少部分地基于所述投影视频的高度中的CTU数量以及所述CTU在所述投影视频的当前帧中的位置来确定的。wherein the threshold is determined based at least in part on a quantization parameter and a projection weight for the projected video, and wherein the projection weight is based at least in part on the number of CTUs in the height of the projected video and the The position of the CTU in the current frame of the projected video is determined.
CN201910094768.2A 2019-01-23 2019-01-23 Fast sample adaptive compensation for 360 degree video Active CN109889829B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910094768.2A CN109889829B (en) 2019-01-23 2019-01-23 Fast sample adaptive compensation for 360 degree video

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910094768.2A CN109889829B (en) 2019-01-23 2019-01-23 Fast sample adaptive compensation for 360 degree video

Publications (2)

Publication Number Publication Date
CN109889829A CN109889829A (en) 2019-06-14
CN109889829B true CN109889829B (en) 2022-08-09

Family

ID=66927466

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910094768.2A Active CN109889829B (en) 2019-01-23 2019-01-23 Fast sample adaptive compensation for 360 degree video

Country Status (1)

Country Link
CN (1) CN109889829B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110636293B (en) * 2019-09-27 2024-03-15 腾讯科技(深圳)有限公司 Video encoding and decoding methods and devices, storage medium and electronic device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108293136A (en) * 2015-09-23 2018-07-17 诺基亚技术有限公司 Method, apparatus and computer program product for encoding 360 degree of panoramic videos
CN108924554A (en) * 2018-07-13 2018-11-30 宁波大学 A kind of panorama video code Rate-distortion optimization method of spherical shape weighting structures similarity
CN109089115A (en) * 2018-05-28 2018-12-25 北方工业大学 The adaptive QP compensation and CU high-speed decision of 360 degree of Video codings

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2509707B (en) * 2013-01-04 2016-03-16 Canon Kk A method, device, computer program, and information storage means for encoding or decoding a video sequence
US10904531B2 (en) * 2017-03-23 2021-01-26 Qualcomm Incorporated Adaptive parameters for coding of 360-degree video

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108293136A (en) * 2015-09-23 2018-07-17 诺基亚技术有限公司 Method, apparatus and computer program product for encoding 360 degree of panoramic videos
CN109089115A (en) * 2018-05-28 2018-12-25 北方工业大学 The adaptive QP compensation and CU high-speed decision of 360 degree of Video codings
CN108924554A (en) * 2018-07-13 2018-11-30 宁波大学 A kind of panorama video code Rate-distortion optimization method of spherical shape weighting structures similarity

Also Published As

Publication number Publication date
CN109889829A (en) 2019-06-14

Similar Documents

Publication Publication Date Title
CN101102495B (en) A region-based video image encoding and decoding method and device
CN110199521B (en) Low complexity mixed domain collaborative in-loop filter for lossy video coding
JP5054826B2 (en) Coding mode determination method and apparatus using spatio-temporal complexity
CN104919796B (en) Video encoding method using at least evaluated visual quality and related video encoding device
WO2010078759A1 (en) Method for image temporal and spatial resolution processing based on code rate control
CN111819854A (en) Method and apparatus for coordinating multi-sign bit concealment and residual sign prediction
CN110996102B (en) Video coding method and device for inhibiting intra-frame block respiration effect in P/B frame
WO2007033551A1 (en) A loop filtering method for use in image encoding process
CN1875637A (en) Method and apparatus for minimizing number of reference pictures used for inter-coding
CN109451310B (en) A rate-distortion optimization method and device based on saliency weighting
TW202103495A (en) Method for decoding image data, method for encoding image data and non-transitory computer-readable storage medium storing encoded video information
JP4794147B2 (en) Method for encoding frame sequence, method for decoding frame sequence, apparatus for implementing the method, computer program for executing the method, and storage medium for storing the computer program
CN103442228B (en) Code-transferring method and transcoder thereof in from standard H.264/AVC to the fast frame of HEVC standard
CN113592746B (en) Method for enhancing quality of compressed video by fusing space-time information from coarse to fine
CN103647975B (en) Improved sample adaptive offset filtering method based on histogram analysis
WO2006133613A1 (en) Method for reducing image block effects
CN112106372B (en) Method and apparatus for hybrid intra prediction
CN109889829B (en) Fast sample adaptive compensation for 360 degree video
CN104320671A (en) Online video transcoding method based on H.264 standard
US9736485B2 (en) Encoding apparatus, encoding method, and image capture apparatus
CN117941352A (en) Inter prediction method, encoder, decoder, and storage medium
WO2022077490A1 (en) Intra prediction method, encoder, decoder, and storage medium
CN110446042B (en) Coding method for improving P frame quality in H.264
CN110868593A (en) Fast Division of Video CU Based on Regional Decision Tree
JP2015027097A (en) Moving picture decoding method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
OL01 Intention to license declared
OL01 Intention to license declared
OL02 Intention to license withdrawn
OL02 Intention to license withdrawn
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20250318

Address after: 230000 B-1015, wo Yuan Garden, 81 Ganquan Road, Shushan District, Hefei, Anhui.

Patentee after: HEFEI MINGLONG ELECTRONIC TECHNOLOGY Co.,Ltd.

Country or region after: China

Address before: 100144 Beijing City, Shijingshan District Jin Yuan Zhuang Road No. 5, North China University of Technology

Patentee before: NORTH CHINA University OF TECHNOLOGY

Country or region before: China

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20250610

Address after: 518000 Guangdong Province Shenzhen City Nanshan District Shahhe Street Xiangshan Street Community Qiaocheng East Road 99 Industrial Building 503

Patentee after: Shenzhen Xiaoyu Interactive Co.,Ltd.

Country or region after: China

Address before: Ganquan road Shushan District of Hefei City, Anhui Province, Jiangmen 230000 Business Garden No. 81 floor B-1015

Patentee before: HEFEI MINGLONG ELECTRONIC TECHNOLOGY Co.,Ltd.

Country or region before: China