WO2023015808A1 - Digital retina video processing method and apparatus, electronic device, and storage medium - Google Patents
Digital retina video processing method and apparatus, electronic device, and storage medium Download PDFInfo
- Publication number
- WO2023015808A1 WO2023015808A1 PCT/CN2021/139127 CN2021139127W WO2023015808A1 WO 2023015808 A1 WO2023015808 A1 WO 2023015808A1 CN 2021139127 W CN2021139127 W CN 2021139127W WO 2023015808 A1 WO2023015808 A1 WO 2023015808A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- reference frame
- frame
- feature
- inter
- prediction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/109—Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
Definitions
- the present application relates to the field of computer image processing, in particular to a digital retinal video processing method, device, electronic equipment and storage medium.
- the digital retina technology requires the front-end camera or a video processing device close to the front-end to have both video feature processing and video compression processing capabilities
- the technical complexity and processing power requirements are high
- the inventors found that the selection of the feature reference frame selected in the feature encoding process and the selection of the inter-frame encoding reference frame in the video compression process have at least some results that can be referred to each other Even directly reused. Therefore, if we can design a coordinated mechanism for feature reference frame and inter-frame prediction reference frame selection, it will help reduce the complexity of video processing in digital retinal systems.
- the embodiment of the present application proposes a digital retinal video processing method, device, electronic equipment, and computer-readable storage medium to solve the video processing requirements of the front-end camera or close to the front-end in the digital retinal technology.
- the device has both video feature processing and video compression processing capabilities, which requires high technical complexity and processing capabilities.
- the first aspect of the embodiment of the present application provides a digital retinal video processing method, including:
- the reference frame indication information is the association relationship between the feature reference frame and the inter-frame prediction reference frame, including the reference frame Select instructions.
- the inter-frame prediction reference frame is at least partially replaced by a feature reference frame or formed by combining several feature reference frames.
- the feature reference frame is at least partly replaced by an inter-frame prediction reference frame or formed by a combination of several inter-frame prediction reference frames.
- the reference frame selection indication is an indication that the feature reference frame is determined based on the inter-frame prediction reference frame and an indication that the inter-frame prediction reference frame is determined based on the feature reference frame, including: whether the frame coding mode indicates whether a reference frame is required, At least one of frame encoding mode sequence, forward or backward reference frame and whether to clear reference frame buffer.
- the method also includes:
- the method also includes:
- the second aspect of the embodiment of the present application provides a digital retinal video processing device, including:
- the reference frame determination module determines an inter-frame prediction reference frame based at least in part on the determined feature coding reference frame; and/or determines a feature coding reference frame based at least in part on the determined inter-frame prediction reference frame;
- a reference frame indication information sending module configured to send reference frame indication information in a video compression coded stream or a feature processing coded stream; wherein, the reference frame indication information is the feature reference frame and the inter-frame prediction reference frame The association relationship between, including the reference frame selection indication.
- the inter-frame prediction reference frame is replaced by a feature reference frame or formed by combining several feature reference frames.
- the feature reference frame is formed by replacing or combining several inter-frame prediction reference frames.
- the reference frame selection indication is an indication that the feature reference frame is determined based on the inter-frame prediction reference frame and an indication that the inter-frame prediction reference frame is determined based on the feature reference frame, including: whether the frame coding mode indicates whether a reference frame is required, At least one of frame encoding mode sequence, forward or backward reference frame and whether to clear reference frame buffer.
- the device also includes:
- a reference frame feedback module configured to feed back a reference frame selection result based on at least one reference frame selection indication.
- the reference frame determination module also includes:
- a feature coding reference frame determination module configured to determine a feature coding reference frame based on the reference frame selection result fed back from the determined inter-frame prediction reference frame;
- the inter-frame prediction reference frame determination module is configured to determine the inter-frame prediction reference frame based on the reference frame selection result fed back from the determined feature coding reference frame.
- a third aspect of the embodiments of the present application provides an electronic device, including:
- the memory is connected in communication with the one or more processors, the memory stores instructions executable by the one or more processors, and the instructions are executed by the one or more processors , the electronic device is used to implement the methods described in the foregoing embodiments.
- the fourth aspect of the embodiments of the present application provides a computer-readable storage medium on which computer-executable instructions are stored.
- the computer-executable instructions When executed by a computing device, they can be used to implement the above-mentioned embodiments. Methods.
- a fifth aspect of the embodiments of the present application provides a computer program product, the computer program product includes a computer program stored on a computer-readable storage medium, the computer program includes program instructions, and when the program instructions are executed by the computer , it can be used to implement the methods described in the foregoing embodiments.
- the technical solution of the embodiment of the present application provides a coordination mechanism for selecting a feature reference frame and an inter-frame prediction reference frame in the digital retina technology, so that the feature encoding reference frame selected in the encoding process is the same as that in the video compression process. At least part of the results of the selected inter-frame prediction reference frames can be referred to each other or even directly used, which is beneficial to reduce the complexity of video processing in the digital retina system.
- Fig. 1 is a schematic diagram of a frame interpolation image processing method according to some embodiments of the present application
- FIG. 2 is a schematic diagram of a determination process of an inter-frame prediction reference frame according to some embodiments of the present application
- Fig. 3 is a schematic diagram of a process of sending reference frame indication information in a characteristic coded code stream according to some embodiments of the present application
- Fig. 4 is a schematic diagram of a frame interpolation image processing device according to some embodiments of the present application.
- Fig. 5 is a schematic diagram of a logical structure of an electronic device according to some embodiments of the present application.
- Fig. 6 is a schematic structural diagram of a general-purpose computer node according to some embodiments of the present application.
- An important feature of digital retina technology is the simultaneous dual-stream or multi-stream transmission of video streams and visual feature streams, even abstract video streams, which facilitates video retrieval, video analysis and storage, but the existing digital retina technology It is impossible to require the front-end camera or the video processing device close to the front-end to have video feature processing and video compression processing capabilities at the same time, and the relationship between the feature encoding reference frame and the inter-frame prediction reference frame is not reasonably used, which not only increases the complexity of the technology, but also Higher requirements are placed on the processing power of technology.
- the embodiment of the present application provides a digital retinal video processing method.
- a coordination mechanism is provided for the selection of feature reference frames and inter-frame prediction reference frames, so that the selected At least part of the results of the feature coding reference frame and the selected inter-frame prediction reference frame in the video compression process can be referred to each other or even directly used, which is beneficial to reduce the complexity of video processing in the digital retina system.
- the frame insertion image processing method includes steps:
- video compression is mainly realized by eliminating redundant information.
- Continuous video images have a lot of redundant information in time and space, so by reducing redundant information in video , which can store and transmit video image information more efficiently.
- motion compensation technology is a widely used video information redundancy elimination technology, which is widely used in various video codec standards, including MPEG2, MPEG4, H.264, H.265/HEVC, H.266/ VCC.
- Coded pictures are classified into three types: I-frames, P-frames, and B-frames. The pictures used to predict the coded pictures are called reference frames.
- the I frame is an intra-coded frame, which does not need to refer to other frames, and uses the spatial correlation of the video image to perform independent compression coding.
- the P frame is a forward prediction frame, and the I frame is used as a reference frame to find out the prediction value and motion vector of a "certain point" in the P frame in the I frame, and transmit the prediction difference and the motion vector together.
- the B frame is a bidirectional interpolation frame, with the preceding I or P frame and the following P frame as reference frames.
- Early video coding standards only supported a single reference picture. However, starting from H.263+, multi-reference frame technology is supported, while H.264/AVC and HEVC standards can support up to 15 reference frames.
- the reference frame of the P frame can be an encoded I frame or P frame
- the reference frame of the B frame can be an encoded I frame, P frame or B frame.
- one or two reference images may be used, and these reference images may be forward images or backward images. It can be seen that selecting an appropriate reference frame is beneficial to reducing non-zero residual data and improving compression efficiency.
- I-frames, P-frames and B-frames appear in a predetermined pattern sequence, for example, a frame encoding pattern sequence composed of 15 frames has the following form IBBPBBPBBPBBPBB.
- the frame sequence can also be an adaptive flexible mode, that is, the coding unit will select the most suitable frame type according to the purpose of improving the compression rate.
- Network transmission errors may cause video decoding to fail, and I frames may need to be resent, which will result in readjustment of the frame encoding mode sequence.
- the image frame encoding mode determines whether inter-frame encoding is required and determines an appropriate reference frame.
- the reference frame is determined by matching among other frames within a certain frame number range before or after the current frame at the encoder/decoder.
- the I frame needs to be resent, the buffered video frame to be sent is cleared, and the video frame after the I frame is used as a reference frame for the video frame sent subsequently.
- video feature processing may generally include functions such as video feature extraction, feature location compression, and video feature compression.
- Visual features include information about color, pattern, texture, grayscale, etc. in video frames
- deep learning features can include information such as pedestrian recognition, license plate recognition, traffic accidents, and traffic instruction recognition.
- generation and compression of summary video streams are further included.
- video features include any one or more of visual features and machine learning features. The positions or feature positions of the points of interest determined after video feature extraction are provided to the video compression unit. Similar to inter-frame predictive video coding, some feature frames also use intra-frame coding to compress feature value data. Some other feature frames use the temporal correlation of feature values and adopt the inter-frame differential coding mode.
- the feature frame adopts intra-frame or inter-frame coding, and can also be defined by a predetermined sequence of feature frame coding modes, such as IBBPBBPBBPBBPBB (similar to video coding, define I, P, and B feature frames as intra-frame coding and forward frame respectively) Between differential coding and forward and backward bidirectional differential coding frames).
- IBBPBBPBBPBBPBB similar to video coding, define I, P, and B feature frames as intra-frame coding and forward frame respectively
- differential coding and forward and backward bidirectional differential coding frames there is a correlation between the feature value compressed reference frame and the video compressed frame.
- FIG. 2 shows a process of determining a reference frame for inter-frame prediction.
- a feature processing unit and a video compression unit are included.
- the feature processing unit uses the inter-frame prediction reference frame determined by the video compression unit as the feature coding inter-frame differential coding reference frame.
- the feature processing unit needs to send a reference frame request to the video compression unit, including one or more of these indications (feature frame inter-frame or intra-frame coding, or indication of feature frame types such as I, B, P frames, or providing fixed Coding mode sequences such as IBBPP, etc., whether it is necessary to refresh the cache feature reference frame, etc.).
- the video compression unit determines the encoding mode (I, P, B frame, etc.) of a specific video frame according to the instruction of the feature processing unit, and determines one or more reference frames for the P and B frames and feeds it back to the feature processing unit. If it is indicated that the currently cached reference frame needs to be cleared, the best reference frame is searched after the designated specific frame and the reference frame index is provided to the feature processing unit.
- One way for the feature processing unit to implement inter-frame differential coding of feature frames is to compare feature values of other frames with the closest reference frame to form feature value residuals. Perform transformation coding (such as DCT transformation) on the feature value, then quantize the transformation coefficient, perform run-length coding and entropy coding after quantization, and finally form a feature code stream. Obviously, the feature processing unit does not need to select an independent reference frame, which significantly reduces the amount of calculation.
- the determination of the inter-frame prediction reference frame is at least based on the result of feature reference frame selection by the feature processing unit.
- the feature processing unit follows the instructions of the video compression processing unit (inter-frame or intra-frame coding, or indicates with frame types such as I, B, P frames, or provides a fixed encoding mode sequence such as IBBPP, etc., whether it is necessary to refresh the cache reference frame, etc.), Determines the encoding mode for a specific video frame. If a certain video frame is encoded by inter-frame prediction, then the feature processing module determines one or more feature reference frames for the video frame, which are also used for video compression inter-frame prediction reference frames.
- One embodiment of the present invention is: comparing the current feature frame with several previous feature frames, and selecting one or more frames with the smallest sum of absolute values of differences as reference frames.
- the video compression processing unit instructs to clear the feature frames before a certain frame in the cache, and the reference frame can only be selected from the feature frames after the frame.
- the feature reference frame determined by the feature processing unit is fed back to the video compression unit.
- the video compression unit directly uses it as a video compression reference frame or at least determines the video compression inter-frame prediction reference frame based on the feature reference frame determination result.
- a possible implementation is to combine several feature reference frames to form multiple video compression intra-frame reference frames of the same video frame. It can be seen that in this implementation manner, the selection of the reference frame for video compression inter-frame prediction becomes very simple, which is beneficial to reduce the computing load of the processor. In order to reduce the transmission bandwidth and compression space of eigenvalues, eigenvalues need to be compressed.
- a simple implementation is: compare the eigenvalues of other frames with the closest reference frame in front to form eigenvalue residuals. Perform transformation coding (such as DCT transformation) on the feature value, then quantize the transformation coefficient, perform run-length coding and entropy coding after quantization, and finally form a feature code stream.
- transformation coding such as DCT transformation
- the present invention associates the feature reference frame with the video compression inter-frame prediction reference frame, and can also reduce the overhead of the reference frame indication information.
- the reference frame indication information generally includes one or more reference frame numbers.
- the reference frame indication information in the video coded stream is also applicable to the decoding of feature data, and the feature stream will not repeatedly transmit the reference frame indication information.
- the reference frame indication information in the feature coded code stream is also applicable to the decoding of the video compression code stream, and the video code stream will not repeatedly transmit the reference frame indication information.
- An indication information is required to indicate the above association relationship, including but not limited to: the characteristic reference frame is determined by the video compression reference frame or vice versa, the reference frame indication information is transmitted through the video compression stream or the feature stream.
- the indication information can be transmitted in any mode of the video compression code stream or the feature information code stream, so that the receiving and processing device can determine the correct reference frame.
- Fig. 3 shows the process of sending reference frame indication information in the characteristic encoding code stream according to the embodiment of the present application.
- the indication information indicates the association relationship between the characteristic reference frame and the inter-frame prediction reference frame, including but not limited to:
- the inter-prediction reference frame is based on the indication determined by the feature reference frame.
- the same video processing unit can support multiple video analysis (including video retrieval) applications, and one video analysis application may include multiple video features.
- the compressed video data and video feature data are sent out through the data sending unit.
- Data transmission may be through a priority network, such as an Ethernet interface; it may also be a WIFI network or a cellular communication network such as 4G/5G.
- one, several or all of the compressed video data and feature data are sent to one or more data receiving devices.
- Data reception can be through wired or wireless (WIFI, cellular network, etc.), which have been described above and will not be repeated here. It is easy to know that different video features may have a unified encoding and decoding and/or feature matching manner, or may adopt independent encoding and decoding and/or feature matching manners.
- Fig. 4 is a schematic diagram of a frame interpolation image processing device according to some embodiments of the present application. As shown in Figure 4, the device 400 includes:
- the reference frame determining module 410 determines an inter-frame prediction reference frame based at least in part on the determined feature coding reference frame; and/or determines a feature coding reference frame based at least in part on the determined inter-frame prediction reference frame;
- the reference frame indication information sending module 420 is configured to send the reference frame indication information in the video compression coded stream or the characteristic processing coded stream; wherein, the reference frame indication information is the characteristic reference frame and the inter-frame prediction reference The association relationship between frames, including reference frame selection instructions.
- the inter-frame prediction reference frame is replaced by a feature reference frame or formed by combining several feature reference frames.
- the feature reference frame is formed by replacing or combining several inter-frame prediction reference frames.
- the reference frame selection indication is an indication that the feature reference frame is determined based on the inter-frame prediction reference frame and an indication that the inter-frame prediction reference frame is determined based on the feature reference frame, including: whether the frame coding mode indication needs At least one of the reference frame, frame coding mode sequence, forward or backward reference frame and whether the reference frame buffer area needs to be cleared.
- the device also includes:
- a reference frame feedback module 430 configured to feed back a reference frame selection result based on at least one reference frame selection indication.
- the reference frame determination module further includes:
- a feature coding reference frame determination module 411 configured to determine a feature coding reference frame based on the reference frame selection result fed back from the determined inter-frame prediction reference frame;
- the inter-frame prediction reference frame determination module 412 is configured to determine the inter-frame prediction reference frame based on the reference frame selection result fed back from the determined feature coding reference frame.
- the electronic device 500 includes:
- memory 530 and one or more processors 510;
- the memory 530 is connected in communication with the one or more processors 510, the memory 530 stores a program 532 executable by the one or more processors, and the program 532 is executed by the one or more processors.
- a plurality of processors 510 are executed, so that the one or more processors 510 execute the methods in the foregoing embodiments of the present application.
- the processor 510 and the memory 530 may be connected through a bus or in other ways. In FIG. 5 , connection through a bus 540 is taken as an example.
- the processor 510 may be a central processing unit (Central Processing Unit, CPU).
- the processor 510 can also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application-specific integrated circuits (Application Specific Integrated Circuit, ASIC), field-programmable gate array (Field-Programmable Gate Array, FPGA) or Other chips such as programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or combinations of the above-mentioned types of chips.
- DSP Digital Signal Processor
- ASIC Application Specific Integrated Circuit
- FPGA Field-Programmable Gate Array
- Other chips such as programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or combinations of the above-mentioned types of chips.
- the memory 530 can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as the cascaded progressive network in the embodiment of the present application.
- the processor 510 executes various functional applications and data processing of the processor by running non-transitory software programs, programs 532 and functional modules stored in the memory 530 .
- the memory 530 may include a program storage area and a data storage area, wherein the program storage area may store an operating system and an application program required by at least one function; the data storage area may store data created by the processor 510 and the like.
- the memory 530 may include a high-speed random access memory, and may also include a non-transitory memory, such as at least one magnetic disk storage device, a flash memory device, or other non-transitory solid-state storage devices.
- the memory 530 may optionally include memory located remotely from the processor 510, and these remote memories may be connected to the processor 510 through a network (such as through the communication interface 520). Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
- An embodiment of the present application provides a computer-readable storage medium, wherein computer-executable instructions are stored in the computer-readable storage medium, and each step in the foregoing method embodiments is executed after the computer-executable instructions are executed.
- program modules include routines, programs, components, data structures, and other types of structures that perform particular tasks or implement particular abstract data types.
- program modules may be located in both local and remote memory storage devices.
- the functions described above are realized in the form of software function units and sold or used as independent products, they can be stored in a computer-readable storage medium.
- the technical solution of the present application can be embodied in the form of a software product in essence or the part that contributes to the prior art or the technical solution, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
- a computer device which may be a personal computer, a server, or a network device, etc.
- the technical solutions of the present application can be implemented and/or propagated through at least one general-purpose computer node 610 as shown in FIG. 6 .
- FIG. 6 In FIG.
- a general-purpose computer node 610 includes: a computer system/server 612, peripherals 614, and a display device 616; wherein, the computer system/server 612 includes a processing unit 620, an input/output interface 622, a network adapter 624, and
- the memory 630 usually implements data transmission through a bus inside; further, the memory 630 is usually composed of various storage devices, such as a RAM (Random Access Memory, random access memory) 632, a cache 634, and a storage system (generally composed of one or more large-capacity non- Volatile storage media) 636, etc.; the program 640 that realizes part or all of the functions of the technical solution of the present application is stored in the memory 630, usually in the form of multiple program modules 642.
- the aforementioned computer-readable storage medium includes physically volatile and non-volatile, removable and non-removable media implemented in any manner or technology for storing information such as computer-readable instructions, data structures, program modules, or other data. Indong medium.
- Computer-readable storage media specifically include, but are not limited to, U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), erasable programmable read-only Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Flash or other solid-state memory technology, CD-ROM, Digital Versatile Disk (DVD), HD-DVD, Blue-Ray or other optical storage device, tape, disk storage or other magnetic storage device, or any other medium that can be used to store the desired information and that can be accessed by a computer.
- ROM read-only memory
- RAM random access memory
- EPROM erasable programmable read-only Memory
- EEPROM Electrically Erasable Programmable Read-Only
- the present application proposes a multi-table splicing method, device, electronic equipment and computer-readable storage medium thereof.
- the embodiment of this application controls the generation of multi-table features through the visualization of ontology view, and at the same time uses the meta-path to operate and control the transfer of features, effectively realizing the controllable aggregation of features during multi-table splicing, which can be flexible while reducing complexity. Efficiently control multi-table feature spaces.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
本申请涉及计算机图像处理领域,具体涉及一种数字视网膜视频处理方法、装置、电子设备及存储介质。The present application relates to the field of computer image processing, in particular to a digital retinal video processing method, device, electronic equipment and storage medium.
自从数字视网膜概念提出以来,其在视频编解码、视频监控等领域引起了较大的关注。数字视网膜技术的一个重要特点就是视频流和视觉特征流、甚至包括摘要视频流同时进行的双流或多流传输,这对视频检索、视频分析和存储均提供了便利。Since the concept of digital retina was put forward, it has attracted great attention in the fields of video codec, video surveillance and so on. An important feature of digital retina technology is the simultaneous dual-stream or multi-stream transmission of video streams, visual feature streams, and even summary video streams, which facilitates video retrieval, video analysis, and storage.
然而,发明人在实现本发明实施例相关技术方案的过程中发现,现有技术至少存在以下问题:数字视网膜技术要求前端摄像机或靠近前端的视频处理装置同时具备视频特征处理和视频压缩处理能力,对技术复杂度和处理能力要求都较高,并且发明人发现在特征编码处理中所选择的特征参考帧的选择与视频压缩处理中的帧间编码参考帧的选择存在至少部分结果是可以互相参考甚至直接重用的。因此,如果能设计一种在特征参考帧和帧间预测参考帧选择的协调机制,有利于减少数字视网膜系统中视频处理复杂度。However, the inventor found in the process of implementing the related technical solutions of the embodiments of the present invention that at least the following problems exist in the prior art: the digital retina technology requires the front-end camera or a video processing device close to the front-end to have both video feature processing and video compression processing capabilities, The technical complexity and processing power requirements are high, and the inventors found that the selection of the feature reference frame selected in the feature encoding process and the selection of the inter-frame encoding reference frame in the video compression process have at least some results that can be referred to each other Even directly reused. Therefore, if we can design a coordinated mechanism for feature reference frame and inter-frame prediction reference frame selection, it will help reduce the complexity of video processing in digital retinal systems.
发明内容Contents of the invention
针对现有技术中的上述技术问题,本申请实施例提出了一种数字视网膜视频处理方法、装置、电子设备及计算机可读存储介质,以解决数字视网膜技术中要求前端摄像机或靠近前端的视频处理装置同时具备视频特征处理和视频压缩处理能力,对技术复杂度和处理能力要求都较高的问题。Aiming at the above-mentioned technical problems in the prior art, the embodiment of the present application proposes a digital retinal video processing method, device, electronic equipment, and computer-readable storage medium to solve the video processing requirements of the front-end camera or close to the front-end in the digital retinal technology. The device has both video feature processing and video compression processing capabilities, which requires high technical complexity and processing capabilities.
本申请实施例的第一方面提供了一种数字视网膜视频处理方法,包括:The first aspect of the embodiment of the present application provides a digital retinal video processing method, including:
至少部分基于确定的特征编码参考帧确定帧间预测参考帧;和/或determining an inter prediction reference frame based at least in part on the determined feature coding reference frame; and/or
至少部分基于确定的帧间预测参考帧确定特征编码参考帧;determining a feature encoding reference frame based at least in part on the determined inter prediction reference frame;
在视频压缩编码码流或特征处理编码码流中发送参考帧指示信息;其中,所述参考帧指示信息为所述特征参考帧与所述帧间预测参考帧之间的关联关系,包括参考帧选择指示。Send reference frame indication information in the video compression coded stream or feature processing coded stream; wherein, the reference frame indication information is the association relationship between the feature reference frame and the inter-frame prediction reference frame, including the reference frame Select instructions.
在一些实施例中,所述帧间预测参考帧至少部分由特征参考帧替代形成或由若干特征参考帧组合形成。In some embodiments, the inter-frame prediction reference frame is at least partially replaced by a feature reference frame or formed by combining several feature reference frames.
在一些实施例中,所述特征参考帧至少部分由帧间预测参考帧替代形成或有若干帧间预测参考帧组合形成。In some embodiments, the feature reference frame is at least partly replaced by an inter-frame prediction reference frame or formed by a combination of several inter-frame prediction reference frames.
在一些实施例中,所述参考帧选择指示为特征参考帧基于帧间预测参考帧确定的指 示和帧间预测参考帧基于特征参考帧确定的指示,包括:帧编码模式指示是否需要参考帧、帧编码模式序列、前向或后向参考帧和是否需要清除参考帧缓存区中的至少一种。In some embodiments, the reference frame selection indication is an indication that the feature reference frame is determined based on the inter-frame prediction reference frame and an indication that the inter-frame prediction reference frame is determined based on the feature reference frame, including: whether the frame coding mode indicates whether a reference frame is required, At least one of frame encoding mode sequence, forward or backward reference frame and whether to clear reference frame buffer.
在一些实施例中,所述方法还包括:In some embodiments, the method also includes:
至少基于一个所述参考帧选择指示,反馈参考帧选择结果。Feedback a reference frame selection result based on at least one reference frame selection indication.
在一些实施例中,所述方法还包括:In some embodiments, the method also includes:
基于所述确定的帧间预测参考帧反馈的参考帧选择结果,确定特征编码参考帧;和/或Based on the determined reference frame selection result fed back by the inter-frame prediction reference frame, determine a feature encoding reference frame; and/or
基于所述确定的特征编码参考帧反馈的参考帧选择结果,确定帧间预测参考帧。Determine the reference frame for inter-frame prediction based on the reference frame selection result fed back by the determined feature coding reference frame.
本申请实施例的第二方面提供了一种数字视网膜视频处理装置,包括:The second aspect of the embodiment of the present application provides a digital retinal video processing device, including:
参考帧确定模块,至少部分基于确定的特征编码参考帧确定帧间预测参考帧;和/或至少部分基于确定的帧间预测参考帧确定特征编码参考帧;The reference frame determination module determines an inter-frame prediction reference frame based at least in part on the determined feature coding reference frame; and/or determines a feature coding reference frame based at least in part on the determined inter-frame prediction reference frame;
参考帧指示信息发送模块,用于在视频压缩编码码流或特征处理编码码流中发送参考帧指示信息;其中,所述参考帧指示信息为所述特征参考帧与所述帧间预测参考帧之间的关联关系,包括参考帧选择指示。A reference frame indication information sending module, configured to send reference frame indication information in a video compression coded stream or a feature processing coded stream; wherein, the reference frame indication information is the feature reference frame and the inter-frame prediction reference frame The association relationship between, including the reference frame selection indication.
在一些实施例中,所述帧间预测参考帧由特征参考帧替代形成或由若干特征参考帧组合形成。In some embodiments, the inter-frame prediction reference frame is replaced by a feature reference frame or formed by combining several feature reference frames.
在一些实施例中,所述特征参考帧由帧间预测参考帧替代形成或有若干帧间预测参考帧组合形成。In some embodiments, the feature reference frame is formed by replacing or combining several inter-frame prediction reference frames.
在一些实施例中,所述参考帧选择指示为特征参考帧基于帧间预测参考帧确定的指示和帧间预测参考帧基于特征参考帧确定的指示,包括:帧编码模式指示是否需要参考帧、帧编码模式序列、前向或后向参考帧和是否需要清除参考帧缓存区中的至少一种。In some embodiments, the reference frame selection indication is an indication that the feature reference frame is determined based on the inter-frame prediction reference frame and an indication that the inter-frame prediction reference frame is determined based on the feature reference frame, including: whether the frame coding mode indicates whether a reference frame is required, At least one of frame encoding mode sequence, forward or backward reference frame and whether to clear reference frame buffer.
在一些实施例中,所述装置还包括:In some embodiments, the device also includes:
参考帧反馈模块,用于至少基于一个所述参考帧选择指示,反馈参考帧选择结果。A reference frame feedback module, configured to feed back a reference frame selection result based on at least one reference frame selection indication.
在一些实施例中,所述参考帧确定模块还包括:In some embodiments, the reference frame determination module also includes:
特征编码参考帧确定模块,用于基于所述确定的帧间预测参考帧反馈的参考帧选择结果,确定特征编码参考帧;A feature coding reference frame determination module, configured to determine a feature coding reference frame based on the reference frame selection result fed back from the determined inter-frame prediction reference frame;
帧间预测参考帧确定模块,用于基于所述确定的特征编码参考帧反馈的参考帧选择结果,确定帧间预测参考帧。The inter-frame prediction reference frame determination module is configured to determine the inter-frame prediction reference frame based on the reference frame selection result fed back from the determined feature coding reference frame.
本申请实施例的第三方面提供了一种电子设备,包括:A third aspect of the embodiments of the present application provides an electronic device, including:
存储器以及一个或多个处理器;memory and one or more processors;
其中,所述存储器与所述一个或多个处理器通信连接,所述存储器中存储有可被所述一个或 多个处理器执行的指令,所述指令被所述一个或多个处理器执行时,所述电子设备用于实现如前述各实施例所述的方法。Wherein, the memory is connected in communication with the one or more processors, the memory stores instructions executable by the one or more processors, and the instructions are executed by the one or more processors , the electronic device is used to implement the methods described in the foregoing embodiments.
本申请实施例的第四方面提供了一种计算机可读存储介质,其上存储有计算机可执行指令,当所述计算机可执行指令被计算装置执行时,可用来实现如前述各实施例所述的方法。The fourth aspect of the embodiments of the present application provides a computer-readable storage medium on which computer-executable instructions are stored. When the computer-executable instructions are executed by a computing device, they can be used to implement the above-mentioned embodiments. Methods.
本申请实施例的第五方面提供了一种计算机程序产品,所述计算机程序产品包括存储在计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,可用来实现如前述各实施例所述的方法。A fifth aspect of the embodiments of the present application provides a computer program product, the computer program product includes a computer program stored on a computer-readable storage medium, the computer program includes program instructions, and when the program instructions are executed by the computer , it can be used to implement the methods described in the foregoing embodiments.
本申请实施例的技术方案,在数字视网膜技术中提供了一种在特征参考帧和帧间预测参考帧选择的协调机制,使得在编码处理中所选择的特征编码参考帧与在视频压缩处理中所选择的帧间预测参考帧至少部分结果可以互相参考甚至直接使用,有利于减少数字视网膜系统中视频处理复杂度。The technical solution of the embodiment of the present application provides a coordination mechanism for selecting a feature reference frame and an inter-frame prediction reference frame in the digital retina technology, so that the feature encoding reference frame selected in the encoding process is the same as that in the video compression process. At least part of the results of the selected inter-frame prediction reference frames can be referred to each other or even directly used, which is beneficial to reduce the complexity of video processing in the digital retina system.
通过参考附图会更加清楚的理解本申请的特征和优点,附图是示意性的而不应理解为对本申请进行任何限制,在附图中:The features and advantages of the present application will be more clearly understood by referring to the accompanying drawings, which are schematic and should not be construed as limiting the application in any way. In the accompanying drawings:
图1是根据本申请的一些实施例所示的一种插帧图像处理方法示意图;Fig. 1 is a schematic diagram of a frame interpolation image processing method according to some embodiments of the present application;
图2是根据本申请的一些实施例所示的一种帧间预测参考帧的确定过程示意图;FIG. 2 is a schematic diagram of a determination process of an inter-frame prediction reference frame according to some embodiments of the present application;
图3是根据本申请的一些实施例所示的一种特征编码码流中发送参考帧指示信息的过程示意图;Fig. 3 is a schematic diagram of a process of sending reference frame indication information in a characteristic coded code stream according to some embodiments of the present application;
图4是根据本申请的一些实施例所示的一种插帧图像处理装置示意图。Fig. 4 is a schematic diagram of a frame interpolation image processing device according to some embodiments of the present application.
图5是根据本申请的一些实施例所示的一种电子设备的逻辑结构示意图;Fig. 5 is a schematic diagram of a logical structure of an electronic device according to some embodiments of the present application;
图6是根据本申请的一些实施例所示的一种通用型计算机节点的架构示意图。Fig. 6 is a schematic structural diagram of a general-purpose computer node according to some embodiments of the present application.
在下面的详细描述中,通过示例阐述了本申请的许多具体细节,以便提供对相关披露的透彻理解。然而,对于本领域的普通技术人员来讲,本申请显而易见的可以在没有这些细节的情况下实施。应当理解的是,本申请中使用“系统”、“装置”、“单元”和/或“模块”术语,是用于区分在顺序排列中不同级别的不同部件、元件、部分或组件的一种方法。然而,如果其他表达式可以实现相同的目的,这些术语可以被其他表达式替换。In the following detailed description, numerous specific details of the application are set forth by way of example in order to provide a thorough understanding of the relevant disclosure. It will be apparent, however, to one skilled in the art that the application may be practiced without these details. It should be understood that the terms "system", "device", "unit" and/or "module" used in this application are used as a means to distinguish between different components, elements, parts or assemblies at different levels in a sequential arrangement. method. However, these terms may be replaced by other expressions if the same purpose can be achieved by other expressions.
应当理解的是,当设备、单元或模块被称为“在……上”、“连接到”或“耦合到”另一设备、单元或模块时,其可以直接在另一设备、单元或模块上,连接或耦合到或与 其他设备、单元或模块通信,或者可以存在中间设备、单元或模块,除非上下文明确提示例外情形。例如,本申请所使用的术语“和/或”包括一个或多个相关所列条目的任何一个和所有组合。It will be understood that when a device, unit or module is referred to as being "on," "connected to" or "coupled to" another device, unit or module, it can be directly on the other device, unit or module. connected or coupled to or communicate with other devices, units or modules, or intervening devices, units or modules may be present, unless the context clearly suggests an exception. For example, as used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
本申请所用术语仅为了描述特定实施例,而非限制本申请范围。如本申请说明书和权利要求书中所示,除非上下文明确提示例外情形,“一”、“一个”、“一种”和/或“该”等词并非特指单数,也可包括复数。一般说来,术语“包括”与“包含”仅提示包括已明确标识的特征、整体、步骤、操作、元素和/或组件,而该类表述并不构成一个排它性的罗列,其他特征、整体、步骤、操作、元素和/或组件也可以包含在内。The terminology used in the present application is for describing specific embodiments only, and does not limit the scope of the present application. As shown in the specification and claims of this application, words such as "a", "an", "an" and/or "the" do not refer to the singular, and may also include the plural, unless the context clearly indicates an exception. Generally speaking, the terms "comprising" and "comprising" only suggest the inclusion of clearly identified features, integers, steps, operations, elements and/or components, and such expressions do not constitute an exclusive list, other features, Integrals, steps, operations, elements and/or assemblies may also be included.
参看下面的说明以及附图,本申请的这些或其他特征和特点、操作方法、结构的相关元素的功能、部分的结合以及制造的经济性可以被更好地理解,其中说明和附图形成了说明书的一部分。然而,可以清楚地理解,附图仅用作说明和描述的目的,并不意在限定本申请的保护范围。可以理解的是,附图并非按比例绘制。These and other features and characteristics, methods of operation, functions of relevant elements of structure, combinations of parts, and economies of manufacture of the present application may be better understood with reference to the following description and drawings, which form a part of the manual. However, it can be clearly understood that the drawings are only for the purpose of illustration and description, and are not intended to limit the protection scope of the present application. It is understood that the drawings are not drawn to scale.
本申请中使用了多种结构图用来说明根据本申请的实施例的各种变形。应当理解的是,前面或下面的结构并不是用来限定本申请。本申请的保护范围以权利要求为准。Various structural diagrams are used in this application to illustrate various modifications of the embodiments according to this application. It should be understood that the preceding or following structures are not intended to limit the present application. The protection scope of the present application shall be determined by the claims.
数字视网膜技术的一个重要特点就是视频流和视觉特征流、甚至包括摘要视频流同时进行的双流或多流传输,这对视频检索、视频分析和存储均提供了便利,但是现有的数字视网膜技术无法要求前端摄像机或靠近前端的视频处理装置同时具备视频特征处理和视频压缩处理能力,没有合理运用特征编码参考帧和帧间预测参考帧之间的联系,这不仅增加了技术的复杂度,同时对技术的处理能力提出了更高的要求。An important feature of digital retina technology is the simultaneous dual-stream or multi-stream transmission of video streams and visual feature streams, even abstract video streams, which facilitates video retrieval, video analysis and storage, but the existing digital retina technology It is impossible to require the front-end camera or the video processing device close to the front-end to have video feature processing and video compression processing capabilities at the same time, and the relationship between the feature encoding reference frame and the inter-frame prediction reference frame is not reasonably used, which not only increases the complexity of the technology, but also Higher requirements are placed on the processing power of technology.
有鉴于此,本申请实施例提供了一种数字视网膜视频处理方法,在数字视网膜技术中提供了一种在特征参考帧和帧间预测参考帧选择的协调机制,使得在编码处理中所选择的特征编码参考帧与在视频压缩处理中所选择的帧间预测参考帧至少部分结果可以互相参考甚至直接使用,有利于减少数字视网膜系统中视频处理复杂度。进一步地,如图1所示,在本申请的一个实施例中,插帧图像处理方法包括步骤:In view of this, the embodiment of the present application provides a digital retinal video processing method. In the digital retinal technology, a coordination mechanism is provided for the selection of feature reference frames and inter-frame prediction reference frames, so that the selected At least part of the results of the feature coding reference frame and the selected inter-frame prediction reference frame in the video compression process can be referred to each other or even directly used, which is beneficial to reduce the complexity of video processing in the digital retina system. Further, as shown in FIG. 1, in one embodiment of the present application, the frame insertion image processing method includes steps:
至少部分基于确定的特征编码参考帧确定帧间预测参考帧;和/或至少部分基于确定的帧间预测参考帧确定特征编码参考帧;在视频压缩编码码流或特征处理编码码流中发送参考帧指示信息。Determining an inter-frame prediction reference frame based at least in part on the determined feature encoding reference frame; and/or determining a feature encoding reference frame based at least in part on the determined inter-frame prediction reference frame; sending the reference in the video compression encoding code stream or feature processing code stream Frame indication information.
其中,在本申请的实施例中,视频压缩主要是通过对冗余信息的消除实现的,连续的视频图像在时间和空间上都存在大量的冗余信息,因此通过减少视频中的冗余信息,可以更高效地进行存储和传输视频图像信息。其中,运动补偿技术是一种应用非常广泛的视频信 息冗余消除技术,被广泛应用在各种视频编解码标准中,包括MPEG2、MPEG4、H.264、H.265/HEVC、H.266/VCC中。编码图像分成三种类型:I帧、P帧和B帧。用来对编码图像进行预测的图像称为参考帧。其中,I帧是内部编码帧,不需要参考其他帧,利用视频图像的空间相关性进行独立压缩编码。P帧是前向预测帧,以I帧为参考帧,在I帧中找出P帧“某点”的预测值和运动矢量,取预测差值和运动矢量一起传送。B帧是双向内插帧,以前面的I或P帧和后面的P帧为参考帧。早期的视频编码标准只支持单个参考图像。但从H.263+开始支持多参考帧技术,而H.264/AVC、HEVC标准最多可支持15个参考帧。P帧的参考帧可以是已编码的I帧或P帧,B帧的参考帧可以是已编码的I帧或P帧或B帧。在B帧的预测过程中,可能使用到一个或两个参考图像,而这些参考图像可能是前向图像,也可能是后向图像。可以看到,选择合适的参考帧,有利于减少非零残差数据,有利于提高压缩效率。在一种现有技术中,I帧、P帧和B帧按预定的模式顺序出现,比如一个15个帧组成帧编码模式序列具有以下形式IBBPBBPBBPBBPBB。帧序列也可以是自适应的灵活模式即在编码单元会根据有利于提高压缩率等目的选择最合适的帧类型。网络传输错误可能会导致视频解码失败,可能需要重新发送I帧,这将导致帧编码模式序列的重新调整。图像帧编码模式决定是否需要需要进行帧间编码以及确定一个合适的参考帧。参考帧在编码器/解码器距当前帧之前或之后一定帧数范围内的其它帧中通过匹配确定。在某些情况包括网络传输发生错误时,需要重新发送I帧,缓存的以发送视频帧被清空,后续发送的视频帧以此I帧之后的视频帧作为参考帧。Among them, in the embodiment of the present application, video compression is mainly realized by eliminating redundant information. Continuous video images have a lot of redundant information in time and space, so by reducing redundant information in video , which can store and transmit video image information more efficiently. Among them, motion compensation technology is a widely used video information redundancy elimination technology, which is widely used in various video codec standards, including MPEG2, MPEG4, H.264, H.265/HEVC, H.266/ VCC. Coded pictures are classified into three types: I-frames, P-frames, and B-frames. The pictures used to predict the coded pictures are called reference frames. Among them, the I frame is an intra-coded frame, which does not need to refer to other frames, and uses the spatial correlation of the video image to perform independent compression coding. The P frame is a forward prediction frame, and the I frame is used as a reference frame to find out the prediction value and motion vector of a "certain point" in the P frame in the I frame, and transmit the prediction difference and the motion vector together. The B frame is a bidirectional interpolation frame, with the preceding I or P frame and the following P frame as reference frames. Early video coding standards only supported a single reference picture. However, starting from H.263+, multi-reference frame technology is supported, while H.264/AVC and HEVC standards can support up to 15 reference frames. The reference frame of the P frame can be an encoded I frame or P frame, and the reference frame of the B frame can be an encoded I frame, P frame or B frame. During the prediction process of the B frame, one or two reference images may be used, and these reference images may be forward images or backward images. It can be seen that selecting an appropriate reference frame is beneficial to reducing non-zero residual data and improving compression efficiency. In one prior art, I-frames, P-frames and B-frames appear in a predetermined pattern sequence, for example, a frame encoding pattern sequence composed of 15 frames has the following form IBBPBBPBBPBBPBB. The frame sequence can also be an adaptive flexible mode, that is, the coding unit will select the most suitable frame type according to the purpose of improving the compression rate. Network transmission errors may cause video decoding to fail, and I frames may need to be resent, which will result in readjustment of the frame encoding mode sequence. The image frame encoding mode determines whether inter-frame encoding is required and determines an appropriate reference frame. The reference frame is determined by matching among other frames within a certain frame number range before or after the current frame at the encoder/decoder. In some cases, when an error occurs in the network transmission, the I frame needs to be resent, the buffered video frame to be sent is cleared, and the video frame after the I frame is used as a reference frame for the video frame sent subsequently.
进一步地,在本申请的一个实施例中,视频特征处理一般可以包括视频特征提取、特征位置压缩、视频特征压缩等功能。视觉特征包括视频帧中关于颜色、图案、纹理、灰度等信息,深度学习特征可以包括行人识别、车牌识别、交通事故、交通指示识别等信息。在某些实施例中,还进一步包括摘要视频流的生成和压缩。为描述方便,在本发明中,视频特征包括视觉特征、机器学习特征的任一种或多种。视频特征提取后确定的兴趣点的位置或特征位置提供给视频压缩单元。与帧间预测视频编码类似,也存在某些特征帧采用帧内编码方式进行特征值数据压缩。其它某些特征帧则利用特征值时间相关性,采取帧间差分编码模式。以某一帧为参考帧,比较当前帧与参考帧之间,对所形成特征值残差进行编码,能够显著提高特征值压缩效率。特征帧采用帧内还是帧间编码,也可以由预先确定的特征帧编码模式序列来定义,比如IBBPBBPBBPBBPBB(与视频编码类似的,定义I、P、B特征帧分别为帧内编码、前向帧间差分编码和前后双向差分编码帧)。在本发明中,特征值压缩参考帧与视频压缩帧间具有相关性。Further, in an embodiment of the present application, video feature processing may generally include functions such as video feature extraction, feature location compression, and video feature compression. Visual features include information about color, pattern, texture, grayscale, etc. in video frames, and deep learning features can include information such as pedestrian recognition, license plate recognition, traffic accidents, and traffic instruction recognition. In some embodiments, generation and compression of summary video streams are further included. For the convenience of description, in the present invention, video features include any one or more of visual features and machine learning features. The positions or feature positions of the points of interest determined after video feature extraction are provided to the video compression unit. Similar to inter-frame predictive video coding, some feature frames also use intra-frame coding to compress feature value data. Some other feature frames use the temporal correlation of feature values and adopt the inter-frame differential coding mode. Taking a certain frame as a reference frame, comparing the current frame with the reference frame, and encoding the resulting eigenvalue residuals can significantly improve the eigenvalue compression efficiency. The feature frame adopts intra-frame or inter-frame coding, and can also be defined by a predetermined sequence of feature frame coding modes, such as IBBPBBPBBPBBPBB (similar to video coding, define I, P, and B feature frames as intra-frame coding and forward frame respectively) Between differential coding and forward and backward bidirectional differential coding frames). In the present invention, there is a correlation between the feature value compressed reference frame and the video compressed frame.
具体地,图2给出了帧间预测参考帧的确定过程。在本发明的一些实施例中,包括特征处理单元和视频压缩单元。特征处理单元根据视频压缩单元确定的帧间预测参考帧作为特征编码帧间差分编码参考帧。特征处理单元需向视频压缩单元发送参考帧请求,包括这些指示中的一种或多种(特征帧帧间或帧内编码,或以I、B、P帧等特征帧类型指示,或提供固定的编码模式序列比如IBBPP等,是否需要刷新缓存特征参考帧等)。视频压缩单元按照特征处理单元的指示确定特定视频帧的编码模式(I、P、B帧等),并对P和B帧确定一个或多个参考帧并反馈给特征处理单元。如果指示需要清空当前缓存的参考帧,则在所指定的特定帧之后搜索最佳参考帧并将参考帧索引提供给特征处理单元。特征处理单元实现特征帧帧间差分编码的一种方式是:其它帧特征值与前面最接近的参考帧进行比较,形成特征值残差。对特征值进行变换编码(比如DCT变换),再对变换系数进行量化,量化后进行行程编码和熵编码,最后形成特征码流。显然,特征处理单元不需进行独立的参考帧选择,显著减少了计算量。Specifically, FIG. 2 shows a process of determining a reference frame for inter-frame prediction. In some embodiments of the present invention, a feature processing unit and a video compression unit are included. The feature processing unit uses the inter-frame prediction reference frame determined by the video compression unit as the feature coding inter-frame differential coding reference frame. The feature processing unit needs to send a reference frame request to the video compression unit, including one or more of these indications (feature frame inter-frame or intra-frame coding, or indication of feature frame types such as I, B, P frames, or providing fixed Coding mode sequences such as IBBPP, etc., whether it is necessary to refresh the cache feature reference frame, etc.). The video compression unit determines the encoding mode (I, P, B frame, etc.) of a specific video frame according to the instruction of the feature processing unit, and determines one or more reference frames for the P and B frames and feeds it back to the feature processing unit. If it is indicated that the currently cached reference frame needs to be cleared, the best reference frame is searched after the designated specific frame and the reference frame index is provided to the feature processing unit. One way for the feature processing unit to implement inter-frame differential coding of feature frames is to compare feature values of other frames with the closest reference frame to form feature value residuals. Perform transformation coding (such as DCT transformation) on the feature value, then quantize the transformation coefficient, perform run-length coding and entropy coding after quantization, and finally form a feature code stream. Obviously, the feature processing unit does not need to select an independent reference frame, which significantly reduces the amount of calculation.
更进一步地,在基于本发明的一个实施例中,帧间预测参考帧的确定至少基于特征处理单元特征参考帧选择的结果。特征处理单元按照视频压缩处理单元的指示(帧间或帧内编码,或以I、B、P帧等帧类型指示,或提供固定的编码模式序列比如IBBPP等,是否需要刷新缓存参考帧等),确定特定视频帧的编码模式。如果某一视频帧是帧间预测编码,那么特征处理模块为该视频帧确定一个或多个特征参考帧,也用于视频压缩帧间预测参考帧。Furthermore, in an embodiment based on the present invention, the determination of the inter-frame prediction reference frame is at least based on the result of feature reference frame selection by the feature processing unit. The feature processing unit follows the instructions of the video compression processing unit (inter-frame or intra-frame coding, or indicates with frame types such as I, B, P frames, or provides a fixed encoding mode sequence such as IBBPP, etc., whether it is necessary to refresh the cache reference frame, etc.), Determines the encoding mode for a specific video frame. If a certain video frame is encoded by inter-frame prediction, then the feature processing module determines one or more feature reference frames for the video frame, which are also used for video compression inter-frame prediction reference frames.
本发明的一个实施方式是:将当前特征帧与之前若干特征帧进行比较,选择差值绝对值之和最小的一帧或更多帧作为参考帧。对于B帧,还需将当前特征帧与其后采样/显示若干特征帧进行比较,选择差值绝对值之和最小的一帧或更多帧作为后向参考帧。在某些情形下,视频压缩处理单元指示清空缓存中某一帧之前的特征帧,参考帧只能从该帧之后的特征帧中选择。特征处理单元所确定的特征参考帧反馈给视频压缩单元。视频压缩单元将其直接作为视频压缩参考帧或至少基于特征参考帧确定结果确定视频压缩帧间预测参考帧。一种可能的实施方式是将组合若干特征参考帧形成同一视频帧的多个视频压缩帧内参考帧。可以看到,在这个实施方式中,视频压缩帧间预测参考帧的选择变得非常简单,有利于减少处理器运算负担。为减少特征值传输带宽和压缩空间,特征值需要进行压缩。简单的一种实现方式是:其它帧特征值与前面最接近的参考帧进行比较,形成特征值残差。对特征值进行变换编码(比如DCT变换),再对变换系数进行量化,量化后进行行程编码和熵编码,最后形成特征码流。One embodiment of the present invention is: comparing the current feature frame with several previous feature frames, and selecting one or more frames with the smallest sum of absolute values of differences as reference frames. For the B frame, it is also necessary to compare the current feature frame with several subsequent sampled/displayed feature frames, and select one or more frames with the smallest sum of absolute values of differences as the backward reference frame. In some cases, the video compression processing unit instructs to clear the feature frames before a certain frame in the cache, and the reference frame can only be selected from the feature frames after the frame. The feature reference frame determined by the feature processing unit is fed back to the video compression unit. The video compression unit directly uses it as a video compression reference frame or at least determines the video compression inter-frame prediction reference frame based on the feature reference frame determination result. A possible implementation is to combine several feature reference frames to form multiple video compression intra-frame reference frames of the same video frame. It can be seen that in this implementation manner, the selection of the reference frame for video compression inter-frame prediction becomes very simple, which is beneficial to reduce the computing load of the processor. In order to reduce the transmission bandwidth and compression space of eigenvalues, eigenvalues need to be compressed. A simple implementation is: compare the eigenvalues of other frames with the closest reference frame in front to form eigenvalue residuals. Perform transformation coding (such as DCT transformation) on the feature value, then quantize the transformation coefficient, perform run-length coding and entropy coding after quantization, and finally form a feature code stream.
本发明将特征参考帧和视频压缩帧间预测参考帧进行关联,还可以减少参考帧指示 信息的开销。参考帧指示信息一般包括一个或多个参考帧帧号。在一种实施方式中,视频编码码流中的参考帧指示信息同样适用于特征数据的解码,而特征流将不再重复地传输参考帧指示信息。The present invention associates the feature reference frame with the video compression inter-frame prediction reference frame, and can also reduce the overhead of the reference frame indication information. The reference frame indication information generally includes one or more reference frame numbers. In one embodiment, the reference frame indication information in the video coded stream is also applicable to the decoding of feature data, and the feature stream will not repeatedly transmit the reference frame indication information.
在一种实施方式中,特征编码码流中的参考帧指示信息同样适用于视频压缩码流的解码,而视频码流将不再重复地传输参考帧指示信息。需要一个指示信息指示上述关联关系,包括但不限于:特征参考帧由视频压缩参考帧确定或者反过来,参考帧指示信息通过视频压缩流或特征流传递。这个指示信息可以通过视频压缩码流或特征信息码流中任一方式进行传输,这样接收处理装置才可以确实正确的参考帧确定方式。In one embodiment, the reference frame indication information in the feature coded code stream is also applicable to the decoding of the video compression code stream, and the video code stream will not repeatedly transmit the reference frame indication information. An indication information is required to indicate the above association relationship, including but not limited to: the characteristic reference frame is determined by the video compression reference frame or vice versa, the reference frame indication information is transmitted through the video compression stream or the feature stream. The indication information can be transmitted in any mode of the video compression code stream or the feature information code stream, so that the receiving and processing device can determine the correct reference frame.
具体地,图3给出了本申请实施例在特征编码码流中发送参考帧指示信息的过程,该指示信息指示特征参考帧与帧间预测参考帧之间的关联关系,包括但不限于:帧间预测参考帧基于特征参考帧确定的指示。Specifically, Fig. 3 shows the process of sending reference frame indication information in the characteristic encoding code stream according to the embodiment of the present application. The indication information indicates the association relationship between the characteristic reference frame and the inter-frame prediction reference frame, including but not limited to: The inter-prediction reference frame is based on the indication determined by the feature reference frame.
在本发明中,同一视频处理单元可以支持多种视频分析(包括视频检索)应用,而一个视频分析应用可能包括多种视频特征。压缩后的视频数据和视频特征数据通过数据发送单元发送出去。数据发送可能是通过优先网络,比如以太网接口;也可以是WIFI网络或4G/5G等蜂窝通信网络。In the present invention, the same video processing unit can support multiple video analysis (including video retrieval) applications, and one video analysis application may include multiple video features. The compressed video data and video feature data are sent out through the data sending unit. Data transmission may be through a priority network, such as an Ethernet interface; it may also be a WIFI network or a cellular communication network such as 4G/5G.
其中,压缩后的视频数据和特征数据中的一种、若干种或全部发个一个或多个数据接收装置。数据接收可以通过有线、无线(WIFI、蜂窝网络等),前面已有描述,不再赘述。容易知道,不同视频特征可以有统一的编码和解码和/或特征匹配方式,也可以采取独立的编码和解码和/或特征匹配方式。Wherein, one, several or all of the compressed video data and feature data are sent to one or more data receiving devices. Data reception can be through wired or wireless (WIFI, cellular network, etc.), which have been described above and will not be repeated here. It is easy to know that different video features may have a unified encoding and decoding and/or feature matching manner, or may adopt independent encoding and decoding and/or feature matching manners.
图4是根据本申请的一些实施例所示的插帧图像处理装置示意图。如图4所示,该装置400包括:Fig. 4 is a schematic diagram of a frame interpolation image processing device according to some embodiments of the present application. As shown in Figure 4, the
参考帧确定模块410,至少部分基于确定的特征编码参考帧确定帧间预测参考帧;和/或至少部分基于确定的帧间预测参考帧确定特征编码参考帧;The reference
参考帧指示信息发送模块420,用于在视频压缩编码码流或特征处理编码码流中发送参考帧指示信息;其中,所述参考帧指示信息为所述特征参考帧与所述帧间预测参考帧之间的关联关系,包括参考帧选择指示。The reference frame indication
在本申请的一些实施例中,所述帧间预测参考帧由特征参考帧替代形成或由若干特征参考帧组合形成。In some embodiments of the present application, the inter-frame prediction reference frame is replaced by a feature reference frame or formed by combining several feature reference frames.
在本申请的一些实施例中,所述特征参考帧由帧间预测参考帧替代形成或有若干帧间预测参考帧组合形成。In some embodiments of the present application, the feature reference frame is formed by replacing or combining several inter-frame prediction reference frames.
在本申请的一些实施例中,所述参考帧选择指示为特征参考帧基于帧间预测参考帧确定的指示和帧间预测参考帧基于特征参考帧确定的指示,包括:帧编码模式指示是否需要参考帧、帧编码模式序列、前向或后向参考帧和是否需要清除参考帧缓存区中的至少一种。In some embodiments of the present application, the reference frame selection indication is an indication that the feature reference frame is determined based on the inter-frame prediction reference frame and an indication that the inter-frame prediction reference frame is determined based on the feature reference frame, including: whether the frame coding mode indication needs At least one of the reference frame, frame coding mode sequence, forward or backward reference frame and whether the reference frame buffer area needs to be cleared.
在本申请的一些实施例中,所述装置还包括:In some embodiments of the present application, the device also includes:
参考帧反馈模块430,用于至少基于一个所述参考帧选择指示,反馈参考帧选择结果。A reference
在本申请的一些实施例中,所述参考帧确定模块还包括:In some embodiments of the present application, the reference frame determination module further includes:
特征编码参考帧确定模块411,用于基于所述确定的帧间预测参考帧反馈的参考帧选择结果,确定特征编码参考帧;A feature coding reference
帧间预测参考帧确定模块412,用于基于所述确定的特征编码参考帧反馈的参考帧选择结果,确定帧间预测参考帧。The inter-frame prediction reference
参考附图5,为本申请一个实施例提供的电子设备示意图。如图5所示,该电子设备500包括:Referring to FIG. 5 , it is a schematic diagram of an electronic device provided by an embodiment of the present application. As shown in Figure 5, the
存储器530以及一个或多个处理器510;memory 530 and one or more processors 510;
其中,所述存储器530与所述一个或多个处理器510通信连接,所述存储器530中存储有可被所述一个或多个处理器执行的程序532,所述程序532被所述一个或多个处理器510执行,以使所述一个或多个处理器510执行本申请前述实施例中的方法。Wherein, the memory 530 is connected in communication with the one or more processors 510, the memory 530 stores a
具体地,处理器510和存储器530可以通过总线或者其他方式连接,图5中以通过总线540连接为例。处理器510可以为中央处理器(Central Processing Unit,CPU)。处理器510还可以为其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等芯片,或者上述各类芯片的组合。Specifically, the processor 510 and the memory 530 may be connected through a bus or in other ways. In FIG. 5 , connection through a bus 540 is taken as an example. The processor 510 may be a central processing unit (Central Processing Unit, CPU). The processor 510 can also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application-specific integrated circuits (Application Specific Integrated Circuit, ASIC), field-programmable gate array (Field-Programmable Gate Array, FPGA) or Other chips such as programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or combinations of the above-mentioned types of chips.
存储器530作为一种非暂态计算机可读存储介质,可用于存储非暂态软件程序、非暂态计算机可执行程序以及模块,如本申请实施例中的级联渐进网络等。处理器510通过运行存储在存储器530中的非暂态软件程序、程序532以及功能模块,从而执行处理器的各种功能应用以及数据处理。As a non-transitory computer-readable storage medium, the memory 530 can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as the cascaded progressive network in the embodiment of the present application. The processor 510 executes various functional applications and data processing of the processor by running non-transitory software programs,
存储器530可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储处理器510所创建的数据等。此外,存储器530可以包括高速随机存取存储器,还可以包括非暂态存储器,例如至少一个磁盘存储器件、闪存器件、或其他非暂态固态存储器件。在一些实施例中,存储器530可选包 括相对于处理器510远程设置的存储器,这些远程存储器可以通过网络(比如通过通信接口520)连接至处理器510。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 530 may include a program storage area and a data storage area, wherein the program storage area may store an operating system and an application program required by at least one function; the data storage area may store data created by the processor 510 and the like. In addition, the memory 530 may include a high-speed random access memory, and may also include a non-transitory memory, such as at least one magnetic disk storage device, a flash memory device, or other non-transitory solid-state storage devices. In some embodiments, the memory 530 may optionally include memory located remotely from the processor 510, and these remote memories may be connected to the processor 510 through a network (such as through the communication interface 520). Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
本申请的一个实施例提供一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机可执行指令,所述计算机可执行指令被执行后执行上述方法实施例中的各个步骤。An embodiment of the present application provides a computer-readable storage medium, wherein computer-executable instructions are stored in the computer-readable storage medium, and each step in the foregoing method embodiments is executed after the computer-executable instructions are executed.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的设备和模块的具体工作过程,可以参考前述方法和/或装置实施例中的对应描述,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described devices and modules can refer to the corresponding descriptions in the foregoing method and/or device embodiments, and details are not repeated here.
尽管此处所述的主题是在结合操作系统和应用程序在计算机系统上的执行而执行的一般上下文中提供的,但本领域技术人员可以认识到,还可结合其他类型的程序模块来执行其他实现。一般而言,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、组件、数据结构和其他类型的结构。本领域技术人员可以理解,此处所述的本主题可以使用其他计算机系统配置来实践,包括手持式设备、多处理器系统、基于微处理器或可编程消费电子产品、小型计算机、大型计算机等,也可使用在其中任务由通过通信网络连接的远程处理设备执行的分布式计算环境中。在分布式计算环境中,程序模块可位于本地和远程存储器存储设备的两者中。Although the subject matter described herein is presented in the general context of being executed in connection with the execution of operating systems and application programs on a computer system, those skilled in the art will recognize that other types of program modules can also be used to execute other programs. accomplish. Generally, program modules include routines, programs, components, data structures, and other types of structures that perform particular tasks or implement particular abstract data types. Those skilled in the art will appreciate that the subject matter described herein may be practiced using other computer system configurations, including handheld devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, etc. , can also be used in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及方法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those skilled in the art can appreciate that the units and method steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present application.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对原有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。比如,典型地,本申请的技术方案可通过至少一个如图6所示的通用型计算机节点610来实现和/或传播。在图6中,通用型计算机节点610包括:计算机系统/服务器612、外设614和显示设备616;其中,所述计算机系统/服务器612包括处理单元620、输入/输出接口622、网络适配器624和存储器630,内部通常通过总线实现数据传输;进一步地,存储器630通常由 多种存储设备组成,比如,RAM(RandomAccessMemory,随机存储器)632、缓存634和存储系统(一般由一个或多个大容量非易失性存储介质组成)636等;实现本申请技术方案的部分或全部功能的程序640保存在存储器630中,通常以多个程序模块642的形式存在。If the functions described above are realized in the form of software function units and sold or used as independent products, they can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence or the part that contributes to the prior art or the technical solution, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application. For example, typically, the technical solutions of the present application can be implemented and/or propagated through at least one general-
而前述的计算机可读取存储介质包括以存储如计算机可读指令、数据结构、程序模块或其他数据等信息的任何方式或技术来实现的物理易失性和非易失性、可移动和不可因东介质。计算机可读取存储介质具体包括,但不限于,U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、可擦除可编程只读存储器(EPROM)、电可擦可编程只读存储器(EEPROM)、闪存或其他固态存储器技术、CD-ROM、数字多功能盘(DVD)、HD-DVD、蓝光(Blue-Ray)或其他光存储设备、磁带、磁盘存储或其他磁性存储设备、或能用于存储所需信息且可以由计算机访问的任何其他介质。The aforementioned computer-readable storage medium includes physically volatile and non-volatile, removable and non-removable media implemented in any manner or technology for storing information such as computer-readable instructions, data structures, program modules, or other data. Indong medium. Computer-readable storage media specifically include, but are not limited to, U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), erasable programmable read-only Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Flash or other solid-state memory technology, CD-ROM, Digital Versatile Disk (DVD), HD-DVD, Blue-Ray or other optical storage device, tape, disk storage or other magnetic storage device, or any other medium that can be used to store the desired information and that can be accessed by a computer.
综上所述,本申请提出了一种多表拼接方法、装置、电子设备及其计算机可读存储介质。本申请实施例通过本体视图的可视化方式来控制多表特征的生成,同时利用元路径来操作控制特征的传递,有效实现了多表拼接时特征的可控聚合,在降低复杂度的同时可以灵活有效地控制多表特征空间。To sum up, the present application proposes a multi-table splicing method, device, electronic equipment and computer-readable storage medium thereof. The embodiment of this application controls the generation of multi-table features through the visualization of ontology view, and at the same time uses the meta-path to operate and control the transfer of features, effectively realizing the controllable aggregation of features during multi-table splicing, which can be flexible while reducing complexity. Efficiently control multi-table feature spaces.
应当理解的是,本申请的上述具体实施方式仅仅用于示例性说明或解释本申请的原理,而不构成对本申请的限制。因此,在不偏离本申请的精神和范围的情况下所做的任何修改、等同替换、改进等,均应包含在本申请的保护范围之内。此外,本申请所附权利要求旨在涵盖落入所附权利要求范围和边界、或者这种范围和边界的等同形式内的全部变化和修改例。It should be understood that the above specific implementation manners of the present application are only used to illustrate or explain the principle of the present application, but not to limit the present application. Therefore, any modification, equivalent replacement, improvement, etc. made without departing from the spirit and scope of the present application shall fall within the protection scope of the present application. Furthermore, the claims appended to this application are intended to embrace all changes and modifications that come within the scope and metes and bounds of the appended claims, or equivalents of such scope and metes and bounds.
Claims (14)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110917723.8A CN113382235B (en) | 2021-08-11 | 2021-08-11 | Digital retina video processing method and device, electronic equipment and storage medium |
| CN202110917723.8 | 2021-08-11 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023015808A1 true WO2023015808A1 (en) | 2023-02-16 |
Family
ID=77576681
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2021/139127 Ceased WO2023015808A1 (en) | 2021-08-11 | 2021-12-17 | Digital retina video processing method and apparatus, electronic device, and storage medium |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN113382235B (en) |
| WO (1) | WO2023015808A1 (en) |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113382235B (en) * | 2021-08-11 | 2021-11-16 | 浙江智慧视频安防创新中心有限公司 | Digital retina video processing method and device, electronic equipment and storage medium |
| CN114157863B (en) * | 2022-02-07 | 2022-07-22 | 浙江智慧视频安防创新中心有限公司 | Video coding method, system and storage medium based on digital retina |
| CN114422805B (en) * | 2022-03-30 | 2022-09-02 | 浙江智慧视频安防创新中心有限公司 | Video coding and decoding method, device and equipment |
| CN114449280B (en) * | 2022-03-30 | 2022-10-04 | 浙江智慧视频安防创新中心有限公司 | Video coding and decoding method, device and equipment |
| CN114422803B (en) * | 2022-03-30 | 2022-08-05 | 浙江智慧视频安防创新中心有限公司 | Video processing method, device and equipment |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7043048B1 (en) * | 2000-06-01 | 2006-05-09 | Digimarc Corporation | Capturing and encoding unique user attributes in media signals |
| CN111092926A (en) * | 2019-08-28 | 2020-05-01 | 北京大学 | Digital retina multivariate data rapid association method |
| CN111787218A (en) * | 2020-06-18 | 2020-10-16 | 安徽超清科技股份有限公司 | Monitoring camera based on digital retina technology |
| CN113382235A (en) * | 2021-08-11 | 2021-09-10 | 浙江智慧视频安防创新中心有限公司 | Digital retina video processing method and device, electronic equipment and storage medium |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10282615B2 (en) * | 2016-12-21 | 2019-05-07 | Volkswagen Ag | System and method for root association in image data |
| CN110719438A (en) * | 2019-08-28 | 2020-01-21 | 北京大学 | Synchronous transmission control method for digital retina video stream and characteristic stream |
| CN111090773B (en) * | 2019-08-28 | 2023-02-07 | 北京大学 | Digital retina system structure and software architecture method and system |
-
2021
- 2021-08-11 CN CN202110917723.8A patent/CN113382235B/en active Active
- 2021-12-17 WO PCT/CN2021/139127 patent/WO2023015808A1/en not_active Ceased
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7043048B1 (en) * | 2000-06-01 | 2006-05-09 | Digimarc Corporation | Capturing and encoding unique user attributes in media signals |
| CN111092926A (en) * | 2019-08-28 | 2020-05-01 | 北京大学 | Digital retina multivariate data rapid association method |
| CN111787218A (en) * | 2020-06-18 | 2020-10-16 | 安徽超清科技股份有限公司 | Monitoring camera based on digital retina technology |
| CN113382235A (en) * | 2021-08-11 | 2021-09-10 | 浙江智慧视频安防创新中心有限公司 | Digital retina video processing method and device, electronic equipment and storage medium |
Non-Patent Citations (2)
| Title |
|---|
| GAO WEN, TIAN YONGHONG, WANG JIAN: "Digital retina: revolutionizing camera systems for the smart city", SCIENTIA SINICA INFORMATIONIS, vol. 48, no. 8, 1 August 2018 (2018-08-01), pages 1076 - 1082, XP093034864, ISSN: 1674-7267, DOI: 10.1360/N112018-00025 * |
| LOU YIHANG; DUAN LING-YU; LUO YONG; CHEN ZIQIAN; LIU TONGLIANG; WANG SHIQI; GAO WEN: "Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm", IEEE TRANSACTIONS ON MULTIMEDIA, IEEE, USA, vol. 22, no. 11, 14 January 2020 (2020-01-14), USA, pages 3002 - 3013, XP011816234, ISSN: 1520-9210, DOI: 10.1109/TMM.2020.2966885 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN113382235A (en) | 2021-09-10 |
| CN113382235B (en) | 2021-11-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2023015808A1 (en) | Digital retina video processing method and apparatus, electronic device, and storage medium | |
| US12348769B2 (en) | Representative motion information for temporal motion prediction in video encoding and decoding | |
| US12101487B2 (en) | Block level bi-prediction with weighted averaging | |
| US9271013B2 (en) | Merge mode for motion information prediction | |
| US9635374B2 (en) | Systems and methods for coding video data using switchable encoders and decoders | |
| KR20220024773A (en) | Method and device for storing motion field in video coding | |
| US20120195356A1 (en) | Resource usage control for real time video encoding | |
| US20230017193A1 (en) | Video decoding method, video encoding method, electronic device, and storage medium | |
| WO2022022299A1 (en) | Method, apparatus, and device for constructing motion information list in video coding and decoding | |
| WO2022078150A1 (en) | Candidate motion information list determination method and apparatus, and electronic device and storage medium | |
| US10142633B2 (en) | Flexible coding unit ordering and block sizing | |
| WO2022227622A1 (en) | Weight-configurable inter-frame and intra-frame joint prediction coding and decoding methods and devices | |
| CN111953987B (en) | Video transcoding method, computer device and storage medium | |
| CN112118447A (en) | Construction method and device of fusion candidate motion information list and coder/decoder | |
| US20250071319A1 (en) | Motion Vector Resolution Based Motion Vector Prediction For Video Coding | |
| CN119299674B (en) | Video frame processing method, device, electronic equipment and storage medium | |
| WO2020061784A1 (en) | Encoding method, decoding method, encoding device and decoding device | |
| TWI907873B (en) | On block level bi-prediction with weighted averaging | |
| WO2025190011A1 (en) | Template matching-based prediction method and apparatus, device, and storage medium | |
| KR20250105653A (en) | Multiple lists of block-based weighting factors | |
| CN119767017A (en) | Video encoding method and apparatus, electronic device, storage medium, program product, and method of generating bit stream | |
| WO2025096760A1 (en) | Motion vector coding based on precision adaptive codebooks | |
| WO2024072438A1 (en) | Motion vector candidate signaling | |
| CN121531142A (en) | Video encoding method and apparatus, video decoding method and apparatus | |
| CN118786668A (en) | System and method for improving motion vector prediction sublist |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21953411 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 21953411 Country of ref document: EP Kind code of ref document: A1 |