CN108460751A

CN108460751A - The 4K ultra high-definition image quality evaluating methods of view-based access control model nerve

Info

Publication number: CN108460751A
Application number: CN201710098240.3A
Authority: CN
Inventors: 袁政; 许颖浩; 褚灵伟
Original assignee: WENGUANG INTERDYANMIC TV CO Ltd SHANGHAI
Current assignee: WENGUANG INTERDYANMIC TV CO Ltd SHANGHAI
Priority date: 2017-02-22
Filing date: 2017-02-22
Publication date: 2018-08-28

Abstract

The invention discloses a kind of 4K ultra high-definition image quality evaluating methods of view-based access control model nerve, include the following steps：Step 1, original video and video to be measured are respectively divided into several frame images；Step 2, the notable figure in each frame image of video to be measured is calculated；Step 3, the structure similar diagram of each frame in original video is calculated；Step 4, the notable figure of corresponding frame and structure similar diagram are done into weighted comprehensive, calculates the picture quality of video to be measured.The method of the present invention has the characteristics that speed is fast, expense is low and can be embedded into video system itself, and vision significance algorithm is applied in video evaluation.

Description

The 4K ultra high-definition image quality evaluating methods of view-based access control model nerve

Technical field

The present invention relates to method for evaluating video quality, more specifically to a kind of 4K ultra high-definitions of view-based access control model nerve Image quality evaluating method.

Background technology

Vision attention is a kind of significant psychological regulation mechanism in human visual system's process of information processing, and people is allowed to select The notable feature for obtaining to selecting property institute's observation object, to substantially reduce information processing capacity.Vision attention is in computer vision Play important role.Many tasks of computer vision, such as scene analysis, object detection, video tracking, object are known Not, retrieval, estimation and image recovery etc., all studied and improve performance using vision attention.

What you were seen is that you want to see, this is a main Research foundation of vision significance.At any time, ring The visual information that border is showed can be handled considerably beyond human eye.Vision significantly allow people choose with instantly oneself into The relevant information of capable behavior.In order to adapt to this potential burden, human brain also has a series of corresponding conspicuousness mechanism. Main includes two aspects：It is conspicuousness first to be used for choosing relevant information and ignoring incoherent information and can use To speculate information.In addition conspicuousness can adjust according to the target of behavior and enhance selected information.The research of conspicuousness It can be divided into many types according to different situations.And time and space significance is wherein than major one kind.

When we observe ambient enviroment, often due to attention is had selection by behavior purpose demand or local scene clue Ground concentrates on some or certain scenery, to select the representative of certain point or region as scenery.The sheet of vision attention Matter is a psychological regulation mechanism of human vision, and the mankind choose interested area from the bulk information of external world's input whereby Domain, and the notable feature of target of interest is selectively obtained to a certain extent, to reduce information processing capacity.

Conspicuousness is introduced into image quality evaluation and has been obtained for studying.Due to the introducing of vision significance, obtain more Meet the evaluating objective quality result of human eye.But vision significance algorithm is primarily directed to image at present, and video is special It is that ultra high-definition video wants the more of complexity compared to image, cannot directly applies mechanically the conspicuousness algorithm for image, because the third dimension Temporal signatures and movable information need to be considered.

Invention content

The object of the present invention is to provide a kind of 4K ultra high-definition image quality evaluating methods of view-based access control model nerve, solve existing Vision significance algorithm is used only for picture appraisal in technology, and the problem of cannot be used for video evaluation.

To achieve the above object, the present invention adopts the following technical scheme that：

A kind of 4K ultra high-definition image quality evaluating methods of view-based access control model nerve, include the following steps：Step 1, original to regard Frequency and video to be measured are respectively divided into several frame images；Step 2, the notable figure in each frame image of video to be measured is calculated；Step 3, Calculate the structure similar diagram of each frame in original video；Step 4, that the notable figure of corresponding frame and structure similar diagram are done weighting is comprehensive It closes, calculates the picture quality of video to be measured.

Preferably, step 2 further comprises：Step 2.1, video is formed by continuous frame by one and is decomposed into 3 significantly Property body C_i(i=1,2,3) corresponds to 3 kinds of different features, brightness, color and movement respectively；Step 2.2, each conspicuousness body Different scale j is resolved into progress, establishes a series of Gauss tower C={ C_ij, wherein i=1,2,3, j=1 ..., L；Step 2.3, energy function E is minimized, it includes a data item E_dWith smoothing factor E_s, wherein：E (C)=λ_d·E_d(C)+λ_s·E_s (C), wherein λ_dAnd λ_sFor corrected parameter；Step 2.4, last conspicuousness is that each scale and feature are average：Wherein, j=1 ..., L.

Preferably, step 3 further comprises：Enable V_oAnd V_dOriginal and distortion video is indicated respectively, and dimension is M*N*F；G and f V is indicated respectively_dA frame image.For the structure chart SS of calculated distortion video_f, g and f are divided into many fritter x and y, x and y The calculating of SSIM values is as follows：Wherein μ_x, μ_yAnd σ_x, σ_yIt is the equal of x and y Value and variance, σ_xyIt is the covariance of x and y, C₁、C₂It is constant.The SSIM values of all small image blocks in one frame image constitute Structure chart SS_f(f=1 ..., F).

Preferably, step 4 further comprises：Quality per frame

Preferably, final video quality value VQ is：

In the above-mentioned technical solutions, the 4K ultra high-definition image quality evaluating methods of view-based access control model nerve of the invention have speed The characteristics of degree is fast, expense is low and can be embedded into video system itself, and vision significance algorithm is applied to video evaluation In.

Description of the drawings

Fig. 1 is the flow chart of the method for the present invention.

Specific implementation mode

The technical solution further illustrated the present invention with reference to the accompanying drawings and examples.

The method of the present invention is based on 2 important vision significance principles, i.e. spatial domain conspicuousness and time domain conspicuousness.

Spatial domain conspicuousness：In the task of any visual search, the mode that human eye is watched attentively can be divided into inherence and external two Kind.Inherence pays attention to assuming to think that object controls vision attention.This is also referred to as from top to bottom, the vision attention of target excitation. It is a kind of voluntary, needs the process made great efforts.In addition vision attention can be automatically driven to by external driving by extrinsic motivation Some position.This is referred to as the vision attention for encouraging driving from down to up.Flashing lamp on highway is regarded from external driving Feel attention.External vision attention automatically drives note that response process is rapid.Vision attention external from down to up includes very The driving attribute of multiple types.For example, in visual search, spatial cues and the setting of unexpected vision can cause vision attention. Other visual signatures such as single feature (the vertical target in red objects or horizontal in green color background), can Effectively arouse attention.

Time domain conspicuousness：It is the feature of vision selection in time.The input of visual signal is constantly to change at any time , observer needs extraction and the relevant information of behavior from vision inlet flow.A kind of standard technique of research time domain conspicuousness It is that object sequence is played with the speed of 20 objects per second.This processing method and selection mechanism can allow researcher to find to regard Feel the video playout speed that information can be extracted.

In view of this, the present invention has used for reference the dense time and space significance of Konstantinos Rapantzikos et al. propositions Model (Dense Spatiotemporal Salient Model), abbreviation KR models are the conspicuousness moulds suitable for video One of type.The model characterizes video using multiple dimensioned body, and space-time operation is carried out on three dimensions.The calculating of conspicuousness is logical The process of a global minimization is crossed to realize.The constraints of the minimum is related to a series of visual signature information , including spatial domain proximity, scale and characteristic similarity (brightness, color, movement).The extreme value of above-mentioned conspicuousness response is just chosen For distinguishing feature, it has therefore proved that this method has reached good balance between brightness and information.

Based on above-mentioned principle, commented as shown in Figure 1, the present invention discloses a kind of 4K ultra high-definition picture qualities of view-based access control model nerve Valence method comprising key step below：

S1：Original video and video to be measured are respectively divided into several frame images.

S2：Calculate the notable figure in each frame image of video to be measured.

S3：Calculate the structure similar diagram of each frame in original video.

S4：The notable figure of corresponding frame and structure similar diagram are done into weighted comprehensive, calculate the picture quality of video to be measured.

Assuming that V represents a video being made of continuous frame, q=(x, y, t) represents an individual moment point.So Q means that the element of volume in an individual.V (q) is enabled to indicate pixel values of the V at q points.At this point, the main calculating of the notable figure of S2 Step includes：

S2.1：Video is formed by continuous frame by one and is decomposed into 3 conspicuousness body C_i(i=1,2,3) corresponds to 3 respectively Kind different feature, brightness, color and movement；

S2.2：Different scale j is resolved into each conspicuousness body progress, establishes a series of Gauss tower C={ C_ij, Middle i=1,2,3, j=1 ..., L；

S2.3：Energy function E is minimized, it includes a data item E_dWith smoothing factor E_s, wherein：E (C)=λ_d·E_d (C)+λ_s·E_s(C), wherein λ_dAnd λ_sFor corrected parameter；

S2.4：Last conspicuousness is that each scale and feature are average：Wherein, j=1 ..., L.

In above-mentioned steps, it is assumed that the result of the first scale is chosen as final Saliency maps, i.e., such as enables j=1, then：

Obviously, if certain part of video image is relatively more prominent, human eye is also relatively more to their concern, then managing institute This certain partial content weight shared in the evaluation result of video also should be big.Therefore, salient region is determining entirety Video evaluation value in should occupy an important position.Based on such idea, the present invention is parallel to execute while executing S2 S3。

Enable V_oAnd V_dOriginal and distortion video is indicated respectively, and dimension is M*N*F.G and f indicate V respectively_dA frame image.For The structure chart SS of calculated distortion video_f, the calculating for the SSIM values that g and f are divided into many fritter x and y, x and y is as follows：

Wherein, μ_x, μ_yAnd σ_x, σ_yIt is the mean value and variance of x and y, σ_xyIt is the covariance of x and y, C₁、C₂It is constant.One frame figure The SSIM values of all small image blocks as in constitute structure chart SS_f(f=1 ..., F).

Finally, use notable figure as the weighted factor of structure similar diagram, and the final video mass value of entire video is each Frame weighted graph is averaged, specifically：

Quality per frame

According to above-mentioned it is assumed that assuming that the result of the first scale is chosen as final Saliency maps：

At this timeThen：

Quality per frame

Final video quality value VQ is：

Those of ordinary skill in the art it should be appreciated that more than embodiment be intended merely to illustrate the present invention, And be not used as limitation of the invention, as long as in the spirit of the present invention, the change to embodiment described above Change, modification will all be fallen within the scope of claims of the present invention.

Claims

1. a kind of 4K ultra high-definition image quality evaluating methods of view-based access control model nerve, which is characterized in that include the following steps：

Step 1, original video and video to be measured are respectively divided into several frame images；

Step 2, the notable figure in each frame image of video to be measured is calculated；

Step 3, the structure similar diagram of each frame in original video is calculated；

Step 4, the notable figure of corresponding frame and structure similar diagram are done into weighted comprehensive, calculates the picture quality of video to be measured.

2. the 4K ultra high-definition image quality evaluating methods of view-based access control model nerve as described in claim 1, which is characterized in that step 2 further comprise：

Step 2.1, video is formed by continuous frame by one and is decomposed into 3 conspicuousness body C_i(i=1,2,3) corresponds to 3 kinds respectively Different feature, brightness, color and movements；

Step 2.2, different scale j is resolved into each conspicuousness body progress, establishes a series of Gauss tower C={ C_ij, Middle i=1,2,3, j=1 ..., L；

Step 2.3, energy function E is minimized, it includes a data item E_dWith smoothing factor E_s, wherein：

E (C)=λ_d·E_d(C)+λ_s·E_s(C), wherein λ_dAnd λ_sFor corrected parameter；

Step 2.4, last conspicuousness is that each scale and feature are average：

Wherein, j=1 ..., L.

3. the 4K ultra high-definition image quality evaluating methods of view-based access control model nerve as claimed in claim 2, which is characterized in that step 3 further comprise：

Enable V_oAnd V_dOriginal and distortion video is indicated respectively, and dimension is M*N*F；G and f indicate V respectively_dA frame image.In order to count Calculate the structure chart SS of distortion video_f, the calculating for the SSIM values that g and f are divided into many fritter x and y, x and y is as follows：

Wherein μ_x, μ_yAnd σ_x, σ_yIt is the mean value and variance of x and y, σ_xyIt is the covariance of x and y, C₁、C₂It is constant.In one frame image The SSIM values of all small image blocks constitute structure chart SS_f(f=1 ..., F).

4. the 4K ultra high-definition image quality evaluating methods of view-based access control model nerve as claimed in claim 3, which is characterized in that step 4 further comprise：

Quality per frame

5. the 4K ultra high-definition image quality evaluating methods of view-based access control model nerve as claimed in claim 4, which is characterized in that final Video quality value VQ be：