CN117726963A - Picture data processing method and device, electronic equipment and medium - Google Patents
Picture data processing method and device, electronic equipment and medium Download PDFInfo
- Publication number
- CN117726963A CN117726963A CN202310076423.0A CN202310076423A CN117726963A CN 117726963 A CN117726963 A CN 117726963A CN 202310076423 A CN202310076423 A CN 202310076423A CN 117726963 A CN117726963 A CN 117726963A
- Authority
- CN
- China
- Prior art keywords
- picture
- data
- evaluation parameters
- frame
- features
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 18
- 238000011156 evaluation Methods 0.000 claims abstract description 252
- 238000012545 processing Methods 0.000 claims abstract description 132
- 238000001514 detection method Methods 0.000 claims abstract description 63
- 238000000034 method Methods 0.000 claims abstract description 46
- 230000003068 static effect Effects 0.000 claims description 165
- 230000000875 corresponding effect Effects 0.000 claims description 41
- 238000013441 quality evaluation Methods 0.000 claims description 24
- 230000015654 memory Effects 0.000 claims description 23
- 238000004590 computer program Methods 0.000 claims description 20
- 230000002596 correlated effect Effects 0.000 claims description 3
- 230000000007 visual effect Effects 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 abstract description 5
- 238000010586 diagram Methods 0.000 description 11
- 239000000284 extract Substances 0.000 description 8
- 238000004891 communication Methods 0.000 description 7
- 238000007499 fusion processing Methods 0.000 description 7
- 238000007689 inspection Methods 0.000 description 7
- 238000013507 mapping Methods 0.000 description 6
- 230000008447 perception Effects 0.000 description 5
- 238000012549 training Methods 0.000 description 4
- 230000000306 recurrent effect Effects 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 230000001537 neural effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000011157 data evaluation Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000001303 quality assessment method Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Landscapes
- Image Analysis (AREA)
Abstract
本申请实施例公开了一种画面数据处理方法、装置、电子设备及介质,应用于多媒体技术领域。其中方法包括:获取待检测的画面数据,分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征,根据关联画面特征确定画面数据的画面评估参数,根据画面评估参数生成针对画面数据的质量检测结果。采用本申请实施例,可以提高画面数据的质量检测结果的准确性。
The embodiments of the present application disclose a picture data processing method, device, electronic equipment and medium, which are applied in the field of multimedia technology. The method includes: obtaining the picture data to be detected, extracting the picture features of each picture frame from each picture frame, and obtaining the picture association based on the picture characteristics of each picture frame based on the adjacent picture frames of each picture frame. Processing to obtain associated screen features of the screen data, determining screen evaluation parameters of the screen data based on the associated screen features, and generating quality detection results for the screen data based on the screen evaluation parameters. By adopting the embodiments of the present application, the accuracy of the quality detection results of picture data can be improved.
Description
技术领域Technical field
本申请涉及多媒体技术领域,尤其涉及一种画面数据处理方法、装置、电子设备及介质。The present application relates to the field of multimedia technology, and in particular to a picture data processing method, device, electronic equipment and media.
背景技术Background technique
随着多媒体信息时代的到来,各类多媒体处理技术层出不穷。因而,画面数据的质量检测技术显得日益重要。比如,在视频推荐场景中,可以结合视频数据的质量检测结果要推送给用户终端的视频数据。目前,现有质量检测方法主要是通过人工标注的方式进行,由运营人员观看画面数据中的画面片断,给出评分,从而实现对画面数据的质量检测,该方式下需要消耗大量的时间成本,效率较低,导致画面数据的质量检测结果不够准确性。With the advent of the multimedia information age, various multimedia processing technologies emerge in endlessly. Therefore, the quality detection technology of picture data becomes increasingly important. For example, in the video recommendation scenario, the video data to be pushed to the user terminal can be combined with the quality detection results of the video data. At present, the existing quality detection methods are mainly carried out through manual annotation. The operators watch the picture fragments in the picture data and give scores to achieve the quality detection of the picture data. This method requires a lot of time and cost. The efficiency is low, resulting in inaccurate quality detection results of screen data.
发明内容Contents of the invention
本申请实施例提供了一种画面数据处理方法、装置、电子设备及介质,可以提高画面数据的质量检测结果的准确性。Embodiments of the present application provide a screen data processing method, device, electronic equipment and media, which can improve the accuracy of the quality detection results of screen data.
一方面,本申请实施例提供了一种画面数据处理方法,该方法包括:On the one hand, embodiments of the present application provide a picture data processing method, which method includes:
获取待检测的画面数据;画面数据包括按照画面数据中的时间顺序排列的多个画面帧;Obtain the picture data to be detected; the picture data includes multiple picture frames arranged in time order in the picture data;
分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征;Extract the picture features of each picture frame from each picture frame respectively, and perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain the associated picture features of the picture data;
根据关联画面特征确定画面数据的画面评估参数;画面评估参数包括如下一种或多种:画面静态评估参数、画面动态评估参数;Determine the picture evaluation parameters of the picture data according to the associated picture characteristics; the picture evaluation parameters include one or more of the following: picture static evaluation parameters, picture dynamic evaluation parameters;
根据画面评估参数生成针对画面数据的质量检测结果。Generate quality inspection results for the picture data based on the picture evaluation parameters.
一方面,本申请实施例提供了一种画面数据处理装置,该装置包括:On the one hand, embodiments of the present application provide a screen data processing device, which includes:
获取模块,用于获取待检测的画面数据;画面数据包括按照画面数据中的时间顺序排列的多个画面帧;The acquisition module is used to obtain the picture data to be detected; the picture data includes multiple picture frames arranged in time order in the picture data;
处理模块,用于分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征;The processing module is used to extract the picture features of each picture frame from each picture frame, and perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain the picture data. associated picture features;
处理模块,还用于根据关联画面特征确定画面数据的画面评估参数;画面评估参数包括如下一种或多种:画面静态评估参数、画面动态评估参数。The processing module is also used to determine the picture evaluation parameters of the picture data according to the associated picture characteristics; the picture evaluation parameters include one or more of the following: static picture evaluation parameters and dynamic picture evaluation parameters.
处理模块,还用于根据画面评估参数生成针对画面数据的质量检测结果。The processing module is also used to generate quality detection results for the picture data according to the picture evaluation parameters.
一方面,本申请实施例提供了一种电子设备,该电子设备包括处理器和存储器,其中,存储器用于存储计算机程序,该计算机程序包括程序指令,处理器被配置用于调用该程序指令,执行上述方法中的部分或全部步骤。On the one hand, embodiments of the present application provide an electronic device, which includes a processor and a memory, wherein the memory is used to store a computer program, the computer program includes program instructions, and the processor is configured to call the program instructions, Perform some or all of the steps in the above method.
一方面,本申请实施例提供了一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,该计算机程序包括程序指令,该程序指令被处理器执行时,用于执行上述方法中的部分或全部步骤。On the one hand, embodiments of the present application provide a computer-readable storage medium. The computer-readable storage medium stores a computer program. The computer program includes program instructions. When the program instructions are executed by a processor, they are used to perform the above method. some or all of the steps.
相应地,根据本申请的一个方面,提供了一种计算机程序产品或者计算机程序,该计算机程序产品或计算机程序包括程序指令,该程序指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该程序指令,处理器执行该程序指令,使得该计算机设备执行上述方法中的部分或全部步骤。Accordingly, according to one aspect of the present application, a computer program product or computer program is provided, which computer program product or computer program includes program instructions stored in a computer-readable storage medium. The processor of the computer device reads the program instructions from the computer-readable storage medium, and the processor executes the program instructions, so that the computer device performs some or all of the steps in the above method.
本申请实施例中,可以获取待检测的画面数据,分别从画面数据包括的每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征,根据关联画面特征确定画面数据的画面评估参数;该可以提高关联画面特征涵盖的特征信息丰富度,从而得到准确的画面评估参数;根据画面评估参数生成针对画面数据的质量检测结果。该画面评估参数可以包括如下一种或多种:画面静态评估参数、画面动态评估参数。该可以通过画面评估参数综合确定出画面数据的质量检测结果,以提高质量检测结果的准确性和可靠性。In the embodiment of the present application, the picture data to be detected can be obtained, the picture characteristics of each picture frame are extracted from each picture frame included in the picture data, and each picture frame is compared based on the adjacent picture frames of each picture frame. Perform picture association processing on the picture features to obtain the associated picture features of the picture data, and determine the picture evaluation parameters of the picture data based on the related picture characteristics; this can improve the richness of feature information covered by the related picture features, thereby obtaining accurate picture evaluation parameters; Generate quality inspection results for the picture data based on the picture evaluation parameters. The picture evaluation parameters may include one or more of the following: static picture evaluation parameters and dynamic picture evaluation parameters. The quality detection results of the screen data can be comprehensively determined through the screen evaluation parameters to improve the accuracy and reliability of the quality detection results.
附图说明Description of the drawings
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings needed to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are some embodiments of the present application, which are of great significance to this field. Ordinary technicians can also obtain other drawings based on these drawings without exerting creative work.
图1为本申请实施例提供的一种画面数据处理场景的示意图;Figure 1 is a schematic diagram of a picture data processing scenario provided by an embodiment of the present application;
图2为本申请实施例提供的一种画面数据处理方法的流程示意图;Figure 2 is a schematic flowchart of a screen data processing method provided by an embodiment of the present application;
图3为本申请实施例提供的一种确定质量检测结果的场景示意图;Figure 3 is a schematic diagram of a scenario for determining quality detection results provided by an embodiment of the present application;
图4为本申请实施例提供的一种画面数据处理方法的流程示意图;Figure 4 is a schematic flow chart of a screen data processing method provided by an embodiment of the present application;
图5a为本申请实施例提供的一种确定画面评估参数的场景示意图;Figure 5a is a schematic diagram of a scenario for determining picture evaluation parameters provided by an embodiment of the present application;
图5b为本申请实施例提供的一种确定画面评估参数的场景示意图;Figure 5b is a schematic diagram of a scenario for determining picture evaluation parameters provided by an embodiment of the present application;
图6为本申请实施例提供的一种画面数据处理装置的结构示意图;Figure 6 is a schematic structural diagram of a picture data processing device provided by an embodiment of the present application;
图7为本申请实施例提供的一种电子设备的结构示意图。FIG. 7 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only some of the embodiments of the present application, rather than all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative efforts fall within the scope of protection of this application.
本申请实施例提出的画面数据处理方法实现于电子设备,该电子设备可以是服务器,也可以是终端。其中,服务器可以是独立的物理服务器,也可以是多个物理服务器构成的服务器集群或者分布式系统,还可以是提供云服务、云计算、云函数、云存储、网络服务、云通信、中间件服务、以及大数据和人工智能平台等基础云计算服务的云服务器。终端可以是智能手机、平板电脑、笔记本电脑、台式计算机等,但并不局限于此。The screen data processing method proposed in the embodiment of the present application is implemented in an electronic device, and the electronic device may be a server or a terminal. Among them, the server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers. It can also provide cloud services, cloud computing, cloud functions, cloud storage, network services, cloud communications, and middleware. services, as well as cloud servers for basic cloud computing services such as big data and artificial intelligence platforms. The terminal can be a smartphone, a tablet, a laptop, a desktop computer, etc., but is not limited thereto.
本申请实施例提出一种画面数据处理方法,该方法可以获取待检测的画面数据,并从画面数据中确定出多个画面帧,该多个画面帧按照画面数据中的时间顺序依次排列;分别从每个画面帧提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行画面关联处理,得到画面数据的关联画面特征,该关联画面特征中涵盖了更丰富的特征信息,从而可以根据关联画面特征确定画面数据更准确的画面评估参数,可以根据该画面评估参数生成针对画面数据的质量检测结果。其中,画面评估参数可以包括一种或多种:画面静态评估参数和画面动态评估参数。可以理解,该质量检测结果可以用于评估画面数据的画面质量,可以用于衡量用户对画面数据的实际观感。此外,画面静态评估参数用于描述针对画面数据的静态观感,比如画面数据的画面质量观感等。画面动态评估参数用于描述针对画面数据的动态观感,比如画面数据的画面切换观感等。也就是说,画面评估参数可以用于描述用户对画面数据的各种类型的观感。因此,通过画面评估参数综合确定出的质量检测结果可以更准确可靠。The embodiment of the present application proposes a picture data processing method, which can obtain the picture data to be detected and determine multiple picture frames from the picture data. The multiple picture frames are arranged in sequence according to the time order in the picture data; respectively Extract the picture features of each picture frame from each picture frame, and perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain the associated picture features of the picture data. The associated picture features It contains richer feature information, so that more accurate picture evaluation parameters for the picture data can be determined based on the associated picture characteristics, and quality detection results for the picture data can be generated based on the picture evaluation parameters. The picture evaluation parameters may include one or more: static picture evaluation parameters and dynamic picture evaluation parameters. It can be understood that the quality detection results can be used to evaluate the picture quality of the picture data and can be used to measure the user's actual perception of the picture data. In addition, the picture static evaluation parameters are used to describe the static look and feel of the picture data, such as the picture quality look and feel of the picture data. Screen dynamic evaluation parameters are used to describe the dynamic look and feel of screen data, such as the look and feel of screen switching of screen data. That is to say, the picture evaluation parameters can be used to describe various types of user perceptions of the picture data. Therefore, the quality detection results determined comprehensively through the picture evaluation parameters can be more accurate and reliable.
由此,基于该画面数据处理方法提出的一种画面数据处理场景的示意图可如图1所示,图1提出一种网络架构,该网络架构可以包括业务服务器以及用户终端集群,其中,用户终端集群可以包括一个或多个用户终端,这里将不对用户终端集群中的用户终端的数量进行限定。用户终端集群中的用户终端之间可以存在通信连接。同时,用户终端集群中的任一用户终端可以与业务服务器存在通信连接,以便于用户终端集群中的每个用户终端均可以通过该通信连接与业务服务器进行数据交互。其中,上述通信连接不限定连接方式,可以通过有线通信方式进行直接或间接地连接,也可以通过无线通信方式进行直接或间接地连接,还可以通过其它方式,本申请在此不做限制。此外,可以理解的是,本申请实施例所涉及的电子设备可以是图1所示的业务服务器,也可以是图1所示的用户终端集群中的任意一个用户终端。Therefore, a schematic diagram of a picture data processing scenario proposed based on the picture data processing method can be shown in Figure 1. Figure 1 proposes a network architecture. The network architecture can include a business server and a user terminal cluster, where the user terminal The cluster may include one or more user terminals, and the number of user terminals in the user terminal cluster will not be limited here. Communication connections may exist between user terminals in a user terminal cluster. At the same time, any user terminal in the user terminal cluster can have a communication connection with the business server, so that each user terminal in the user terminal cluster can interact with the business server through the communication connection. The above-mentioned communication connection is not limited to a connection method. It can be connected directly or indirectly through wired communication, or directly or indirectly through wireless communication. It can also be connected through other methods, which is not limited in this application. In addition, it can be understood that the electronic device involved in the embodiment of the present application may be the service server shown in Figure 1, or any user terminal in the user terminal cluster shown in Figure 1.
可以理解,服务器可以获取任一用户终端上传的画面数据,并通过本申请所提出的画面数据处理方法确定该画面数据的质量检测结果。比如,服务器可以按照画面数据中的时间顺序依次将画面数据划分为多个画面帧,并依次提取每个画面帧的画面特征,基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行画面关联处理,得到画面数据的关联画面特征,根据关联画面特征确定画面数据的画面评估参数,可以根据该画面评估参数生成针对画面数据的质量检测结果,并将该质量检测结果和画面数据关联存储至数据库中。It can be understood that the server can obtain the picture data uploaded by any user terminal, and determine the quality detection result of the picture data through the picture data processing method proposed in this application. For example, the server can divide the picture data into multiple picture frames in sequence according to the time order in the picture data, and extract the picture features of each picture frame in turn, and compare the picture data of each picture frame based on the adjacent picture frames of each picture frame. Features are processed by screen association to obtain the associated screen features of the screen data. The screen evaluation parameters of the screen data are determined based on the associated screen features. The quality detection results for the screen data can be generated based on the screen evaluation parameters, and the quality detection results and the screen data can be combined. The association is stored in the database.
可选的,在一些实施例中,电子设备可根据实际的业务需求,执行该画面数据处理方法以确定出画面数据的质量检测结果。本申请技术方案可以应用于任意画面数据处理场景中。例如可以在画面数据的推荐场景中,电子设备在确定待推荐的画面数据时,可以结合画面数据的质量检测结果以确定出对于用户而言,观感更好的待推荐画面数据。Optionally, in some embodiments, the electronic device can execute the picture data processing method to determine the quality detection result of the picture data according to actual business requirements. The technical solution of this application can be applied to any picture data processing scenario. For example, in the scene of recommending picture data, when the electronic device determines the picture data to be recommended, it can combine the quality detection results of the picture data to determine the picture data to be recommended that has a better look and feel for the user.
可选的,本申请涉及的数据如画面数据、画面数据的质量检测结果等,可以存储于数据库中,或者可以存储于区块链中,如通过区块链分布式系统存储,本申请不做限定。Optionally, the data involved in this application, such as screen data, quality test results of screen data, etc., can be stored in a database, or can be stored in a blockchain, such as through a blockchain distributed system. This application does not limited.
需要说明的是,在本申请的具体实施方式中,涉及到获取用户信息等相关数据的场景时,如获取用户上传的画面数据等,需要获得用户许可或者同意。即在本申请实施例运用到具体产品或技术中时,相关用户数据的收集、使用和处理遵守相关国家和地区的相关法律法规和标准。例如可以通过交互界面的形式发出提示信息以用于提示具体会收集或者获取哪些数据,具体可以通过列表等方式将这些数据的类型、内容等提示给用户,只有在交互界面上接收到允许收集数据的确认操作或者指令之后,才会进一步进行相关数据的收集、处理等。It should be noted that in the specific implementation of the present application, when it comes to scenarios of obtaining user information and other related data, such as obtaining screen data uploaded by users, etc., user permission or consent needs to be obtained. That is, when the embodiments of this application are applied to specific products or technologies, the collection, use and processing of relevant user data comply with the relevant laws, regulations and standards of the relevant countries and regions. For example, prompt information can be sent in the form of an interactive interface to prompt the specific data that will be collected or obtained. Specifically, the type and content of these data can be prompted to the user through a list or other methods. Only users who receive permission to collect data on the interactive interface Only after the confirmation operation or instruction is completed, the relevant data will be further collected and processed.
可以理解,上述场景仅是作为示例,并不构成对于本申请实施例提供的技术方案的应用场景的限定,本申请的技术方案还可应用于其他场景。例如,本领域普通技术人员可知,随着系统架构的演变和新业务场景的出现,本申请实施例提供的技术方案对于类似的技术问题,同样适用。It can be understood that the above scenarios are only examples and do not constitute a limitation on the application scenarios of the technical solutions provided by the embodiments of the present application. The technical solutions of the present application can also be applied to other scenarios. For example, those of ordinary skill in the art know that with the evolution of system architecture and the emergence of new business scenarios, the technical solutions provided in the embodiments of this application are also applicable to similar technical problems.
基于上述的描述,本申请实施例提出了一种画面数据处理方法,该方法可以由上述提及的电子设备来执行。请参见图2,图2为本申请实施例提供的一种画面数据处理方法的流程示意图。如图2所示,本申请实施例的画面数据处理方法的流程可以包括如下:Based on the above description, the embodiment of the present application proposes a picture data processing method, which can be executed by the above-mentioned electronic device. Please refer to Figure 2. Figure 2 is a schematic flow chart of a screen data processing method provided by an embodiment of the present application. As shown in Figure 2, the flow of the screen data processing method in this embodiment of the present application may include the following:
S201、获取待检测的画面数据。S201. Obtain the screen data to be detected.
其中,画面数据可以又称为视频数据、图像数据等,可以是实时流媒体画面数据、对象终端(比如用户终端)拍摄并上传的画面数据,或者是根据指定地址(如用户终端上传的地址)下载的画面数据,等等。在此对画面数据的时长、类型不做限定。该画面数据可以是任意需要进行质量检测的画面数据。画面数据可以包括按照画面数据中的时间顺序排列的多个画面帧(又可称为视频帧、图像帧)。该多个画面帧比如可以依次称为:第1帧、第2帧……第i-1帧、第i帧、第i+1帧……第n-1帧、第n帧等等。Among them, the picture data can also be called video data, image data, etc., and can be real-time streaming picture data, picture data shot and uploaded by the target terminal (such as a user terminal), or based on a specified address (such as an address uploaded by a user terminal). Downloaded screen data, etc. The duration and type of screen data are not limited here. The picture data can be any picture data that requires quality inspection. The picture data may include a plurality of picture frames (also called video frames or image frames) arranged in time order in the picture data. For example, the plurality of picture frames may be called: the 1st frame, the 2nd frame...the i-1th frame, the i-th frame, the i+1th frame...the n-1th frame, the nth frame, and so on.
S202、分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征。S202. Extract the picture features of each picture frame from each picture frame respectively, and perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain the associated picture features of the picture data. .
其中,每个画面帧的画面特征至少包含画面帧中与画面内容相关的特征信息以及与画面质量相关的特征信息。可以理解,画面特征用于表征画面帧中的空间特征。可选地,提取每个画面帧的画面特征可以是,将每个画面帧依次输入预训练的第一特征处理网络,由第一特征处理网络输出每个画面帧的画面特征。其中,该第一特征处理网络可以是基于深度卷积神经网络训练得到,比如,该第一特征处理网络可以采用视觉几何组(VisualGeometry Group,VGG)网络结构等。本申请实施例不对画面特征的具体提取方式进行限定。The picture characteristics of each picture frame include at least the characteristic information related to the picture content and the characteristic information related to the picture quality in the picture frame. It can be understood that picture features are used to characterize spatial features in picture frames. Optionally, extracting the picture features of each picture frame may include inputting each picture frame into a pre-trained first feature processing network in turn, and having the first feature processing network output the picture features of each picture frame. The first feature processing network may be trained based on a deep convolutional neural network. For example, the first feature processing network may adopt a visual geometry group (VisualGeometry Group, VGG) network structure or the like. The embodiments of this application do not limit the specific extraction method of picture features.
可以理解,画面特征仅包含对应画面帧中的特征信息,对于画面数据而言,用户的观感不仅体现在每个画面帧内,还体现在画面数据播放时每个画面帧之间的切换,因此可以对相邻画面帧进行画面关联处理,得到画面数据的关联画面特征。其具体可以是,分别对每个画面帧的画面特征和每个画面帧的相邻画面帧的画面特征进行画面关联处理,得到每个画面帧的关联画面特征,基于每个画面帧的关联画面特征确定画面数据的关联画面特征。比如,可以是将每个画面帧的关联画面特征作为画面数据的关联画面特征。It can be understood that picture features only include feature information in corresponding picture frames. For picture data, the user's perception is not only reflected in each picture frame, but also reflected in the switching between each picture frame when the picture data is played. Therefore Picture association processing can be performed on adjacent picture frames to obtain the associated picture characteristics of the picture data. Specifically, the picture characteristics of each picture frame and the picture characteristics of adjacent picture frames of each picture frame are separately subjected to picture association processing to obtain the associated picture characteristics of each picture frame, based on the associated pictures of each picture frame. The characteristics determine the associated picture characteristics of the picture data. For example, the associated picture features of each picture frame may be used as the related picture features of the picture data.
其中,每个画面帧的相邻画面帧可以是指每个画面帧的上一个画面帧,或者可以是指每个画面帧的上一个画面帧以及下一个画面帧。比如,对于第i个画面帧的相邻画面帧可以是第i-1个画面帧,也可以是第i-1个画面帧和第i+1个画面帧。可以理解,当i=1时,第i-1个画面帧可以为预设的默认画面帧(比如可以是一个初始化的画面帧,也可以是画面数据的封面帧);当i=n时,第i+1个画面帧可以为预设的默认画面帧。可以理解,画面数据的关联画面特征即包含画面帧本身的特征信息,还包含画面帧在切换时涵盖的特征信息。关联画面特征用于表征画面帧中的时空特征。The adjacent picture frames of each picture frame may refer to the previous picture frame of each picture frame, or may refer to the previous picture frame and the next picture frame of each picture frame. For example, the adjacent picture frame to the i-th picture frame can be the i-1th picture frame, or it can be the i-1th picture frame and the i+1th picture frame. It can be understood that when i=1, the i-1th picture frame can be a preset default picture frame (for example, it can be an initialized picture frame or a cover frame of picture data); when i=n, The i+1th picture frame may be a preset default picture frame. It can be understood that the associated picture characteristics of the picture data include the characteristic information of the picture frame itself, and also include the characteristic information covered when the picture frame is switched. Correlated picture features are used to characterize spatiotemporal features in picture frames.
在一些实施例中,以一个画面帧为例,对画面关联处理的过程进行说明。为例便于理解,以第i个画面帧、第i个画面帧的相邻画面帧为第i-1个画面帧为例,确定第i个画面帧的关联画面特征可以是,获取第i-1个画面帧与第i个画面帧之间的帧差画面特征,对帧差画面特征与第i个画面帧的画面特征进行特征融合处理,得到第i个画面帧的关联画面特征。可以理解,关联画面特征包含第i个画面帧本身的特征信息,还包含从第i-1个画面帧切换至第i个画面帧时第i-1个画面帧与第i个画面帧之间的画面变动所带来的特征信息。In some embodiments, a picture frame is taken as an example to illustrate the process of picture association processing. For ease of understanding, take the i-th picture frame and the adjacent picture frame of the i-th picture frame as the i-1th picture frame as an example. Determining the associated picture characteristics of the i-th picture frame can be to obtain the i-th picture frame. The frame difference picture features between one picture frame and the i-th picture frame are subjected to feature fusion processing to obtain the associated picture features of the i-th picture frame. It can be understood that the associated picture characteristics include the characteristic information of the i-th picture frame itself, and also include the information between the i-1th picture frame and the i-th picture frame when switching from the i-1th picture frame to the i-th picture frame. Characteristic information brought about by screen changes.
其中,获取第i-1个画面帧与第i个画面帧之间的帧差画面特征可以是,获取第i-1个画面帧与第i个画面帧之间的帧差信息,并将第i-1个画面帧与第i个画面帧之间的帧差信息输入第一特征处理网络,由第一特征处理网络输出帧差信息对应的帧差画面特征。可以理解,此处可以将第i-1个画面帧的像素信息与第i个画面帧的像素信息之间的像素差作为帧差信息。比如可以是对第i个画面帧的像素信息与第i-1个画面帧的像素信息进行作差处理,以得到帧差信息。Wherein, obtaining the frame difference picture feature between the i-1th picture frame and the i-th picture frame may be: obtaining the frame difference information between the i-1th picture frame and the i-th picture frame, and converting the i-th picture frame to the i-th picture frame. The frame difference information between the i-1 picture frame and the i-th picture frame is input into the first feature processing network, and the first feature processing network outputs frame difference picture features corresponding to the frame difference information. It can be understood that the pixel difference between the pixel information of the i-1th picture frame and the pixel information of the i-th picture frame can be used as the frame difference information. For example, a difference process may be performed on the pixel information of the i-th picture frame and the pixel information of the i-1th picture frame to obtain the frame difference information.
此外,对帧差画面特征与第i个画面帧的画面特征进行特征融合处理的具体方法可以是将帧差画面特征与第i个画面帧的画面特征的特征乘积作为关联画面特征。或者,也可以是将帧差画面特征与第i个画面帧的画面特征输入第二特征处理网络,由第二特征处理网络输出关联画面特征。其中,第二特征处理网络可以是循环神经网络(GRU,GatedRecurrent Unit)。In addition, a specific method for performing feature fusion processing on the frame difference picture feature and the picture feature of the i-th picture frame may be to use the feature product of the frame difference picture feature and the picture feature of the i-th picture frame as the associated picture feature. Alternatively, the frame difference picture feature and the picture feature of the i-th picture frame may be input into the second feature processing network, and the second feature processing network outputs the associated picture feature. The second feature processing network may be a recurrent neural network (GRU, GatedRecurrent Unit).
相应地,第i个画面帧的相邻画面帧为第i-1个画面帧和第i+1个画面帧为例,确定第i个画面帧的关联画面特征可以是,获取第i-1个画面帧与第i个画面帧之间的第一帧差画面特征,获取第i个画面帧与第i+1个画面帧之间的第二帧差画面特征,对第一帧差画面特征、第i个画面帧的画面特征和第二帧差画面特征进行特征融合处理,得到第i个画面帧的关联画面特征。可以理解,关联画面特征包含第i个画面帧本身的特征信息,还包含从第i-1个画面帧切换至第i个画面帧时第i-1个画面帧与第i个画面帧之间的画面变动所带来的特征信息,以及从第i个画面帧切换至第i+1个画面帧时第i个画面帧与第i+1个画面帧之间的画面变动所带来的特征信息。可以理解,可以是对第i个画面帧的像素信息与第i-1个画面帧的像素信息进行作差处理,以得到第一帧差信息;对第i+1个画面帧的像素信息与第i个画面帧的像素信息进行作差处理,以得到第二帧差信息。Correspondingly, the adjacent picture frames of the i-th picture frame are the i-1th picture frame and the i+1th picture frame. For example, determining the associated picture characteristics of the i-th picture frame can be to obtain the i-1th picture frame. The first frame difference picture feature between the i-th picture frame and the i-th picture frame is obtained, and the second frame difference picture feature between the i-th picture frame and the i+1-th picture frame is obtained. For the first frame difference picture feature , perform feature fusion processing on the picture features of the i-th picture frame and the second frame difference picture features, and obtain the associated picture features of the i-th picture frame. It can be understood that the associated picture characteristics include the characteristic information of the i-th picture frame itself, and also include the information between the i-1th picture frame and the i-th picture frame when switching from the i-1th picture frame to the i-th picture frame. The characteristic information brought by the picture change, and the characteristics brought by the picture change between the i-th picture frame and the i+1-th picture frame when switching from the i-th picture frame to the i+1-th picture frame information. It can be understood that the difference processing can be performed on the pixel information of the i-th picture frame and the pixel information of the i-1th picture frame to obtain the first frame difference information; the pixel information of the i+1-th picture frame and The pixel information of the i-th picture frame is subjected to difference processing to obtain the second frame difference information.
此外,对第一帧差画面特征、第i个画面帧的画面特征和第二帧差画面特征进行特征融合处理的具体方法可以是将第一帧差画面特征、第i个画面帧的画面特征与第二帧差画面特征的特征乘积作为关联画面特征。或者,也可以是将第一帧差画面特征、第i个画面帧的画面特征与第二帧差画面特征输入第二特征处理网络,由第二特征处理网络输出关联画面特征。其中,第二特征处理网络可以是循环神经网络(GRU,Gated Recurrent Unit)。In addition, a specific method for performing feature fusion processing on the first frame difference picture feature, the picture feature of the i-th picture frame, and the second frame difference picture feature may be to combine the first frame difference picture feature, the picture feature of the i-th picture frame The feature product with the second frame difference picture feature is used as the associated picture feature. Alternatively, the first frame difference picture feature, the picture feature of the i-th picture frame, and the second frame difference picture feature may be input into the second feature processing network, and the second feature processing network outputs the associated picture feature. The second feature processing network may be a recurrent neural network (GRU, Gated Recurrent Unit).
在一些实施例中,确定第i个画面帧的关联画面特征还可以是,将第i-1个画面帧的画面特征和第i个画面帧的画面特征输入第二特征处理网络,由第二特征处理网络输出关联画面特征。或者,将第i-1个画面帧的画面特征、第i个画面帧的画面特征和第i+1个画面帧的画面特征输入第二特征处理网络,由第二特征处理网络输出关联画面特征。In some embodiments, determining the associated picture characteristics of the i-th picture frame may also include inputting the picture characteristics of the i-1th picture frame and the picture characteristics of the i-th picture frame into the second feature processing network, and the second feature processing network uses The feature processing network outputs associated picture features. Or, input the picture characteristics of the i-1th picture frame, the picture characteristics of the i-th picture frame and the picture characteristics of the i+1th picture frame into the second feature processing network, and the second feature processing network outputs the associated picture features .
可选地,确定第i个画面帧的关联画面特征还可以是,根据第i个画面帧的画面特征以及第i-1个画面帧的关联画面特征确定第i个画面帧的关联画面特征。比如可以是,将第i个画面帧的画面特征以及第i-1个画面帧的关联画面特征输入第二特征处理网络,得到第i个画面帧的关联画面特征。比如,将第1个画面帧的画面特征和默认画面的画面特征输入第二特征处理网络,得到第1个画面帧的关联画面特征;将第2个画面帧的画面特征和第1个画面帧的关联画面特征输入第二特征处理网络,得到第2个画面帧的关联画面特征。Optionally, determining the associated picture characteristics of the i-th picture frame may also include determining the associated picture characteristics of the i-th picture frame based on the picture characteristics of the i-th picture frame and the associated picture characteristics of the i-1 th picture frame. For example, the picture features of the i-th picture frame and the associated picture features of the i-1th picture frame may be input into the second feature processing network to obtain the related picture features of the i-th picture frame. For example, input the picture characteristics of the first picture frame and the picture characteristics of the default picture into the second feature processing network to obtain the associated picture characteristics of the first picture frame; combine the picture characteristics of the second picture frame and the first picture frame The associated picture features are input into the second feature processing network to obtain the associated picture features of the second picture frame.
可选地,在第一特征处理网络和第二特征处理网络之间还可以增加全连接层,在通过第一特征处理网络得到的画面帧的画面特征和/或帧差画面特征可以通过全连接层进行降维处理,并将经过降维处理后的画面特征再输入第二特征处理网络中,由第二特征处理网络输出关联画面特征。Optionally, a fully connected layer can be added between the first feature processing network and the second feature processing network. The picture features and/or frame difference picture features of the picture frame obtained through the first feature processing network can be obtained through the fully connected layer. The first layer performs dimensionality reduction processing, and then inputs the dimensionally reduced picture features into the second feature processing network, and the second feature processing network outputs the associated picture features.
S203、根据关联画面特征确定画面数据的画面评估参数。S203. Determine the picture evaluation parameters of the picture data according to the associated picture characteristics.
其中,可以将每个画面帧的关联画面特征进行特征融合处理,得到画面数据的目标关联画面特征,并将目标关联画面特征输入第三特征处理网络,由第三特征处理网络输出画面数据的画面评估参数。其中,对每个画面帧的关联画面特征进行特征融合处理可以是将每个画面帧的关联画面特征进行特征拼接,并将拼接得到关联画面特征作为目标关联画面特征。或者,也可以是将每个画面特征的关联画面特征的特征乘积作为目标关联画面特征。在此对特征融合的具体方式不做限定。目标关联画面特征即包括的画面帧中的特征信息(即静态特征信息)还包括画面帧在切换时涵盖的特征信息(即动态特征信息)。该第三特征处理网络可以是多层连接的全连接层。通过第三特征处理网络可以对目标关联画面特征进行预测输出一个或多个画面评估参数。Among them, the associated picture features of each picture frame can be subjected to feature fusion processing to obtain the target associated picture features of the picture data, and the target related picture features are input into the third feature processing network, and the third feature processing network outputs the picture of the picture data Evaluation parameters. Wherein, performing feature fusion processing on the associated picture features of each picture frame may be to perform feature splicing on the associated picture features of each picture frame, and use the spliced related picture features as the target related picture features. Alternatively, the feature product of the associated image features of each image feature may be used as the target associated image feature. The specific method of feature fusion is not limited here. The target associated picture features include feature information in the picture frame (i.e., static feature information) and also include feature information covered when the picture frame is switched (ie, dynamic feature information). The third feature processing network may be a fully connected layer of multi-layer connections. Through the third feature processing network, the target-related picture features can be predicted and output one or more picture evaluation parameters.
因此,画面评估参数可以包括如下一种或多种:画面静态评估参数和画面动态评估参数。可选地,画面静态评估参数用于评估画面数据在静态维度下的质量分数,比如可以用于评估画面数据的画质维度、内容维度下的质量分数,画面静态评估参数越高,表明画面数据在画质维度下的画质越清晰,以及在内容维度上的内容越丰富。进一步地,画面静态评估参数还可以细分为画面质量评估参数和画面内容评估参数,画面质量评估参数用于评估画面数据的画质维度下的质量分数,画面内容评估参数用于评估画面数据的内容维度下的质量分数。可选地,画面动态评估参数用于评估画面数据在动态维度下的质量分数,比如可以用于评估画面数据的画面切换维度、画面时序维度下的质量分数,画面动态评估参数越高,表明画面数据在画面切换维度下的切换越协调,以及在画面时序维度上的画面变化越稳定。进一步地,画面动态评估参数还可以细分为画面切换评估参数和画面时序评估参数,画面切换评估参数用于评估画面数据的画面切换维度下的质量分数,画面时序评估参数用于评估画面数据的画面时序维度下的质量分数。因此画面评估参数可以在各种维度下评估用户在画面数据的播放时的静态观感和动态观感,从而可以预估在画质、内容、画面切换过程等维度下用户对画面数据的评分。Therefore, the picture evaluation parameters may include one or more of the following: static picture evaluation parameters and dynamic picture evaluation parameters. Optionally, the picture static evaluation parameter is used to evaluate the quality score of the picture data in the static dimension. For example, it can be used to evaluate the quality score of the picture data in the picture quality dimension and the content dimension. The higher the picture static evaluation parameter, the better the picture data is. The clearer the image quality in the image quality dimension, and the richer the content in the content dimension. Further, the picture static evaluation parameters can also be subdivided into picture quality evaluation parameters and picture content evaluation parameters. The picture quality evaluation parameters are used to evaluate the quality score under the picture quality dimension of the picture data, and the picture content evaluation parameters are used to evaluate the picture data. Quality score under the content dimension. Optionally, the picture dynamic evaluation parameter is used to evaluate the quality score of the picture data in the dynamic dimension. For example, it can be used to evaluate the quality score of the picture data in the picture switching dimension and the picture timing dimension. The higher the picture dynamic evaluation parameter, the better the picture. The more coordinated the data switching is in the screen switching dimension, and the more stable the screen changes are in the screen timing dimension. Further, the screen dynamic evaluation parameters can also be subdivided into screen switching evaluation parameters and screen timing evaluation parameters. The screen switching evaluation parameters are used to evaluate the quality score of the screen data in the screen switching dimension. The screen timing evaluation parameters are used to evaluate the screen data. Quality score in the picture timing dimension. Therefore, the picture evaluation parameters can evaluate the user's static perception and dynamic perception during the playback of the picture data in various dimensions, so that the user's score of the picture data can be estimated in dimensions such as picture quality, content, and picture switching process.
S204、根据画面评估参数生成针对画面数据的质量检测结果。S204. Generate quality detection results for the picture data according to the picture evaluation parameters.
其中,可以将一个或多个画面评估参数的参数值对应的平均值作为针对画面数据的质量检测结果。质量检测结果越高,表示画面数据的质量越好,即针对画面数据的用户评估观感越高。Wherein, the average value corresponding to the parameter values of one or more picture evaluation parameters may be used as the quality detection result for the picture data. The higher the quality test result is, the better the quality of the picture data is, that is, the higher the user evaluation of the picture data is.
例如,如图3所示,图3为本申请实施例提供的一种确定质量检测结果的场景示意图;其中,用于确定质量检测结果的目标检测模型可以包括第一特征处理网络、全连接层、第二特征处理网络和第三特征处理网络;获取画面数据,该画面数据包括多个画面帧(K1、K2、...、Kn),将每个画面帧依次输入第一特征处理网络,得到每个画面帧的画面特征,将每个画面帧的画面特征依次输入全连接层,得到每个画面帧的降维画面特征,将每个画面帧的降维画面特征以及每个画面帧的相邻画面帧的关联画面特征输入第二特征处理网络,得到每个画面帧的关联画面特征,比如可以是将第i个画面帧的降维画面特征和第i-1个画面帧经由第二特征处理网络输出的关联画面特征输入第二特征处理网络,得到第i个画面帧的关联画面特征;对每个画面帧的关联画面特征进行特征融合处理得到目标关联画面特征,将目标关联画面特征输入第三特征处理网络,由第三特征处理网络输出画面评估参数,即画面静态评估参数和画面动态评估参数。可以理解,目标检测模型通过训练样本数据训练得到,该训练样本数据为样本画面数据以及样本画面数据对应的画面评估参数标签。比如,画面评估参数标签可以包括画面静态评估参数标签、画面动态评估参数标签。又如,画面评估参数标签可以包括画面静态评估参数标签、画面动态评估参数标签,且画面静态评估参数标签具体包括画面质量评估参数标签和画面内容评估参数标签、画面动态评估参数标签具体包括画面切换评估参数标签和画面时序评估参数标签。For example, as shown in Figure 3, Figure 3 is a schematic diagram of a scenario for determining quality detection results provided by an embodiment of the present application; wherein, the target detection model used to determine the quality detection results may include a first feature processing network, a fully connected layer , the second feature processing network and the third feature processing network; obtain picture data, which includes multiple picture frames (K1, K2,..., Kn), and input each picture frame into the first feature processing network in turn, Obtain the picture features of each picture frame, input the picture features of each picture frame into the fully connected layer in turn, obtain the reduced-dimensional picture features of each picture frame, and combine the reduced-dimensional picture features of each picture frame and the The associated picture features of adjacent picture frames are input into the second feature processing network to obtain the associated picture features of each picture frame. For example, the dimensionality reduction picture features of the i-th picture frame and the i-1th picture frame can be processed through the second feature processing network. The associated picture features output by the feature processing network are input into the second feature processing network to obtain the associated picture features of the i-th picture frame; feature fusion processing is performed on the associated picture features of each picture frame to obtain the target related picture features, and the target related picture features are obtained The third feature processing network is input, and the third feature processing network outputs picture evaluation parameters, that is, picture static evaluation parameters and picture dynamic evaluation parameters. It can be understood that the target detection model is trained through training sample data, and the training sample data is sample picture data and picture evaluation parameter labels corresponding to the sample picture data. For example, the picture evaluation parameter tags may include picture static evaluation parameter tags and picture dynamic evaluation parameter tags. For another example, the picture evaluation parameter tag may include the picture static evaluation parameter label and the picture dynamic evaluation parameter label, and the picture static evaluation parameter label specifically includes the picture quality evaluation parameter label and the picture content evaluation parameter label, and the picture dynamic evaluation parameter label specifically includes the picture switching. Evaluation parameter tag and picture timing evaluation parameter tag.
本申请实施例中,可以获取待检测的画面数据,分别从画面数据包括的每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征,根据关联画面特征确定画面数据的画面评估参数;该可以提高关联画面特征涵盖的特征信息丰富度,从而得到准确的画面评估参数;根据画面评估参数生成针对画面数据的质量检测结果。该画面评估参数可以包括如下一种或多种:画面静态评估参数、画面动态评估参数。该可以通过画面评估参数综合确定出画面数据的质量检测结果,以提高质量检测结果的准确性和可靠性。In the embodiment of the present application, the picture data to be detected can be obtained, the picture characteristics of each picture frame are extracted from each picture frame included in the picture data, and each picture frame is compared based on the adjacent picture frames of each picture frame. Perform picture association processing on the picture features to obtain the associated picture features of the picture data, and determine the picture evaluation parameters of the picture data based on the related picture characteristics; this can improve the richness of feature information covered by the related picture features, thereby obtaining accurate picture evaluation parameters; Generate quality inspection results for the picture data based on the picture evaluation parameters. The picture evaluation parameters may include one or more of the following: static picture evaluation parameters and dynamic picture evaluation parameters. The quality detection results of the screen data can be comprehensively determined through the screen evaluation parameters to improve the accuracy and reliability of the quality detection results.
请参见图4,图4为本申请实施例提供的一种画面数据处理方法的流程示意图,该方法可以由上述提及的电子设备执行。如图4所示,本申请实施例中画面数据处理方法的流程可以包括如下:Please refer to FIG. 4. FIG. 4 is a schematic flowchart of a picture data processing method provided by an embodiment of the present application. This method can be executed by the above-mentioned electronic device. As shown in Figure 4, the flow of the screen data processing method in the embodiment of the present application may include the following:
S401、获取待检测的画面数据。S401. Obtain the screen data to be detected.
S402、分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征。其中,步骤S401-S402的具体实施方式可以参见上述实施例的相关描述,在此不做赘述。S402. Extract the picture features of each picture frame from each picture frame respectively, and perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain the associated picture features of the picture data. . For the specific implementation of steps S401-S402, please refer to the relevant descriptions of the above embodiments, and will not be described again here.
S403、从关联画面特征中确定出画面数据的画面静态特征和画面动态特征,根据画面静态特征确定画面静态评估参数,并根据画面动态特征确定画面动态评估参数。S403. Determine the static characteristics of the picture and the dynamic characteristics of the picture from the associated picture characteristics, determine the static evaluation parameters of the picture based on the static characteristics of the picture, and determine the dynamic evaluation parameters of the picture based on the dynamic characteristics of the picture.
其中,画面评估参数可以包括画面静态评估参数和画面动态评估参数,可以从关联画面特征中确定出用于确定画面静态评估参数的画面静态特征以及用于确定画面动态评估参数的画面动态特征(又可称为画面运动特征),以实现针对关联画面特征的特征分层。由于将与画面静态信息相关的特征以及与画面动态信息相关的特征分离开来,避免了特征之间相互干扰,可以提高所确定出的画面评估参数的准确性,同时还可以避免使用结构庞大复杂的模型,节省了计算开销,实用性更强。Among them, the picture evaluation parameters may include picture static evaluation parameters and picture dynamic evaluation parameters. The picture static characteristics used to determine the picture static evaluation parameters and the picture dynamic characteristics used to determine the picture dynamic evaluation parameters can be determined from the associated picture characteristics (also known as the picture dynamic characteristics). can be called picture motion features) to achieve feature layering for associated picture features. Since the features related to the static information of the picture and the features related to the dynamic information of the picture are separated, mutual interference between the features can be avoided, the accuracy of the determined picture evaluation parameters can be improved, and the use of large and complex structures can also be avoided. The model saves computational overhead and is more practical.
在一些实施例中,从关联画面特征中确定出画面数据的画面静态特征和画面动态特征可以是,将多个画面帧的关联画面特征依次输入残差网络,通过残差网络从多个画面帧的关联画面特征中提取出多个画面帧的画面残差特征,并将多个画面帧的画面残差特征作为画面静态特征,并将关联画面特征与画面静态特征之间的特征差作为画面动态特征。也就是说,将多个画面帧的关联画面特征输入残差网络,可以由残差网络依次输出每个画面帧中的画面内容信息相关的画面残差特征,并将每个画面帧的画面残差特征作为画面静态特征,此时可以将每个画面帧的关联画面特征与每个画面帧的画面静态特征作差,得到每个画面帧的画面动态特征。其中,残差网络比如可以是Residual Network(ResNet)等。在此对残差网络的网络结构不做限定。In some embodiments, determining the static characteristics of the picture and the dynamic characteristics of the picture from the associated picture features may be to sequentially input the associated picture features of multiple picture frames into the residual network, and use the residual network to extract the picture data from the multiple picture frames. Extract the picture residual features of multiple picture frames from the associated picture features, and use the picture residual features of multiple picture frames as picture static features, and use the feature difference between the related picture features and the picture static features as picture dynamic features feature. That is to say, by inputting the associated picture features of multiple picture frames into the residual network, the residual network can sequentially output the picture residual features related to the picture content information in each picture frame, and combine the picture residual features of each picture frame. The difference feature is used as the static feature of the picture. At this time, the associated picture feature of each picture frame can be differenced with the static picture feature of each picture frame to obtain the dynamic picture characteristics of each picture frame. Among them, the residual network can be, for example, Residual Network (ResNet), etc. The network structure of the residual network is not limited here.
在一些实施例中,从关联画面特征中确定出画面数据的画面静态特征和画面动态特征还可以是,基于每个画面帧的相邻画面帧从每个画面帧的关联画面特征中提取每个画面帧的相关性特征,将每个画面帧的相关性特征作为画面静态特征,并根据画面静态特征从关联画面特征中确定出画面动态特征。In some embodiments, determining the picture static features and picture dynamic features of the picture data from the associated picture features may also include extracting each picture frame from the associated picture features of each picture frame based on the adjacent picture frames of each picture frame. For the correlation features of the picture frame, the correlation characteristics of each picture frame are used as the static characteristics of the picture, and the dynamic characteristics of the picture are determined from the associated picture features according to the static characteristics of the picture.
其中,以第i个画面帧的相邻画面帧为第i-1个画面帧为例,提取第i个画面帧的相关性特征具体可以是,提取第i-1个画面帧和第i个画面帧之间,在第一方向上的相关性特征、在第二方向上的相关性特征和在第三方向上的相关性特征,根据在第一方向上的相关性特征、在第二方向上的相关性特征和在第三方向上的相关性特征确定第i-1个画面帧和第i个画面帧之间的相关性特征。比如可以是将在第一方向上的相关性特征、在第二方向上的相关性特征和在第三方向上的相关性特征确定第i-1个画面帧和第i个画面帧之间的相关性特征进行拼接,并将拼接得到的相关性特征作为第i-1个画面帧和第i个画面帧之间的相关性特征。Among them, taking the adjacent picture frame of the i-th picture frame as the i-1th picture frame as an example, extracting the correlation features of the i-th picture frame may specifically include extracting the i-1th picture frame and the i-th picture frame. Between picture frames, the correlation features in the first direction, the correlation features in the second direction, and the correlation features in the third direction, according to the correlation features in the first direction, the correlation features in the second direction The correlation feature and the correlation feature in the third direction determine the correlation feature between the i-1th picture frame and the i-th picture frame. For example, the correlation feature between the i-1th picture frame and the i-th picture frame can be determined by using the correlation feature in the first direction, the correlation feature in the second direction, and the correlation feature in the third direction. The correlation features are spliced together, and the correlation features obtained by splicing are used as the correlation features between the i-1th picture frame and the i-th picture frame.
因此,可以从多种维度上确定第i-1个画面帧和第i个画面帧之间的相关特征信息,并将该多种维度上的相关特征信息作为第i-1个画面帧和第i个画面帧之间的相关性特征。可以理解,若第i-1个画面帧和第i个画面帧之间的相关性特征表明第i-1个画面帧和第i个画面帧之间越相关,则表示第i-1个画面帧和第i个画面帧之间在切换时所产生的运动信息越少、静态信息越多,因此可以将第i-1个画面帧和第i个画面帧之间的相关性特征作为第i个画面帧的画面静态特征。Therefore, the relevant feature information between the i-1th picture frame and the i-th picture frame can be determined from multiple dimensions, and the relevant feature information in the multiple dimensions can be used as the i-1th picture frame and the i-th picture frame. Correlation features between i picture frames. It can be understood that if the correlation characteristics between the i-1th picture frame and the i-th picture frame indicate that the i-1th picture frame and the i-th picture frame are more correlated, it means that the i-1th picture frame The less motion information and more static information generated when switching between frames and the i-th picture frame, so the correlation features between the i-1th picture frame and the i-th picture frame can be used as the i-th picture frame The static characteristics of each picture frame.
其中,第一方向可以是坐标轴中x轴的方向(也可以称为空间方向上的x方向),第一方向上的相关性特征可以是构建针对第一方向的第一卷积网络,由第一卷积网络对第i-1个画面帧的关联画面特征和第i个画面帧的关联画面特征进行特征卷积,以输出第一方向上的相关性特征。可选地,第一卷积网络可以基于向量维度为3x3的x方向的sobel算子(索贝尔算子)构建。在此对第一卷积网络的网络结构不做限定。比如,可以是由第一卷积网络对第i-1个画面帧的关联画面特征和第i个画面帧的关联画面特征进行特征处理,得到第i-1个画面帧和第i个画面帧所对应的第一联合画面特征,并对第一联合画面特征进行特征处理得到第一方向上的相关性特征。Wherein, the first direction may be the direction of the x-axis in the coordinate axis (which may also be called the x-direction in the spatial direction), and the correlation feature in the first direction may be to construct a first convolutional network for the first direction, as follows The first convolutional network performs feature convolution on the associated picture features of the i-1th picture frame and the associated picture features of the i-th picture frame to output correlation features in the first direction. Optionally, the first convolutional network may be constructed based on the Sobel operator (Sobel operator) in the x direction with a vector dimension of 3x3. The network structure of the first convolutional network is not limited here. For example, the first convolutional network can perform feature processing on the associated picture features of the i-1th picture frame and the associated picture features of the i-th picture frame to obtain the i-1th picture frame and the i-th picture frame. corresponding first joint picture features, and feature processing is performed on the first joint picture features to obtain correlation features in the first direction.
其中,第二方向可以是坐标轴中y轴的方向(也可以称为空间方向上的y方向),第二方向上的相关性特征可以是构建针对第二方向的第二卷积网络,由第二卷积网络对第i-1个画面帧的关联画面特征和第i个画面帧的关联画面特征进行特征卷积,以输出第二方向上的相关性特征。可选地,第二卷积网络可以基于向量维度为3x3的y方向的sobel算子构建。在此对第一卷积网络的网络结构不做限定。比如,可以是由第二卷积网络对第i-1个画面帧的关联画面特征和第i个画面帧的关联画面特征进行特征处理,得到第i-1个画面帧和第i个画面帧所对应的第二联合画面特征,并对第二联合画面特征进行特征处理得到第二方向上的相关性特征。Wherein, the second direction may be the direction of the y-axis in the coordinate axis (which may also be called the y-direction in the spatial direction), and the correlation feature in the second direction may be to construct a second convolutional network for the second direction, as follows The second convolution network performs feature convolution on the associated picture features of the i-1th picture frame and the related picture features of the i-th picture frame to output correlation features in the second direction. Optionally, the second convolutional network can be constructed based on the sobel operator with a vector dimension of 3x3 in the y direction. The network structure of the first convolutional network is not limited here. For example, the second convolutional network can perform feature processing on the associated picture features of the i-1th picture frame and the associated picture features of the i-th picture frame to obtain the i-1th picture frame and the i-th picture frame. corresponding second joint picture features, and feature processing is performed on the second joint picture features to obtain correlation features in the second direction.
其中,第三方向可以是坐标轴中z轴的方向(也可以称为时间方向),第三方向上的相关性特征可以是构建针对第三方向的第三卷积网络,由第三卷积网络对第i-1个画面帧的关联画面特征和第i个画面帧的关联画面特征进行特征卷积,以输出第三方向上的相关性特征。可选地,第三卷积网络可以基于向量维度为2x1x1,值为[-1,1]的向量为卷积核的卷积网络层构建。在此对第一卷积网络的网络结构不做限定。也就是基于第三卷积网络提取两个画面帧之间的时序特征偏差。比如,可以是由第三卷积网络对第i-1个画面帧的关联画面特征和第i个画面帧的关联画面特征进行特征处理,得到第i-1个画面帧和第i个画面帧所对应的第三联合画面特征,并对第三联合画面特征进行特征处理得到第三方向上的相关性特征。Among them, the third direction can be the direction of the z-axis in the coordinate axis (can also be called the time direction), and the correlation feature in the third direction can be to construct a third convolutional network for the third direction. The third convolutional network Feature convolution is performed on the associated picture features of the i-1th picture frame and the related picture features of the i-th picture frame to output correlation features in the third direction. Optionally, the third convolutional network can be constructed based on a convolutional network layer with a vector dimension of 2x1x1 and a vector with a value of [-1,1] as the convolution kernel. The network structure of the first convolutional network is not limited here. That is, the temporal feature deviation between two picture frames is extracted based on the third convolutional network. For example, the third convolutional network can perform feature processing on the associated picture features of the i-1th picture frame and the associated picture features of the i-th picture frame to obtain the i-1th picture frame and the i-th picture frame. corresponding third joint picture features, and feature processing is performed on the third joint picture features to obtain correlation features in the third direction.
由此,可以通过上述过程得到每个画面帧的画面静态特征。相应地,若第i个画面帧的相邻画面帧为第i-1个画面帧和第i+1个画面帧时,可以按照上述过程分别确定第i-1个画面帧和第i个画面帧之间相关性特征、第i个画面帧和第i+1个画面帧之间相关性特征,并将第i-1个画面帧和第i个画面帧之间相关性特征、第i个画面帧和第i+1个画面帧之间相关性特征的特征之和作为第i个画面帧的画面静态特征。Therefore, the static characteristics of each picture frame can be obtained through the above process. Correspondingly, if the adjacent picture frames of the i-th picture frame are the i-1th picture frame and the i+1th picture frame, the i-1th picture frame and the i-th picture frame can be determined respectively according to the above process. The correlation features between frames, the correlation features between the i-th picture frame and the i+1-th picture frame, and the correlation features between the i-1th picture frame and the i-th picture frame, the i-th picture frame The sum of the features of the correlation features between the picture frame and the i+1th picture frame is used as the picture static feature of the i-th picture frame.
在一些实施例中,在基于每个画面帧的关联画面特征确定画面数据的画面静态特征后,可以根据每个画面帧的关联画面特征以及画面静态特征确定画面动态特征。具体可以是,将每个画面帧的关联画面特征通过特征映射网络映射成与画面静态特征具有相同维度的映射画面特征,分别对每个画面帧的映射画面特征与每个画面帧的画面静态特征进行特征作差,得到每个画面帧的画面动态特征,并将每个画面帧的画面动态特征作为画面数据的画面动态特征。或者,对每个画面帧的映射画面特征与每个画面帧的画面静态特征进行特征求和,得到每个画面帧的求和画面特征,并将每个画面帧的求和画面特征输入卷积神经层,由卷积神经层输出每个画面帧的画面动态特征。In some embodiments, after determining the picture static characteristics of the picture data based on the associated picture characteristics of each picture frame, the picture dynamic characteristics may be determined based on the associated picture characteristics and the picture static characteristics of each picture frame. Specifically, the associated picture features of each picture frame are mapped through the feature mapping network into mapped picture features with the same dimensions as the picture static features, and the mapped picture features of each picture frame and the picture static features of each picture frame are separately mapped. Perform feature difference to obtain the picture dynamic characteristics of each picture frame, and use the picture dynamic characteristics of each picture frame as the picture dynamic characteristics of the picture data. Or, perform feature summation on the mapped picture features of each picture frame and the picture static features of each picture frame to obtain the summed picture features of each picture frame, and input the summed picture features of each picture frame into the convolution Neural layer, the convolutional neural layer outputs the dynamic characteristics of each picture frame.
在一些实施例中,第三特征处理网络可以包括静态评估网络和动态评估网络。根据画面静态特征确定画面静态评估参数可以是将画面静态特征输入静态评估网络,由静态评估网络输出画面静态评估参数。根据画面动态特征确定画面动态评估参数可以是将画面动态特征输入动态评估网络,由动态评估网络输出画面动态评估参数。In some embodiments, the third feature processing network may include a static evaluation network and a dynamic evaluation network. Determining the static evaluation parameters of the picture based on the static characteristics of the picture may include inputting the static characteristics of the picture into a static evaluation network, and having the static evaluation network output the static evaluation parameters of the picture. Determining the picture dynamic evaluation parameters based on the picture dynamic characteristics may include inputting the picture dynamic characteristics into a dynamic evaluation network, and having the dynamic evaluation network output the picture dynamic evaluation parameters.
可选地,画面静态评估参数的参数类型还可以包括画面质量评估参数和画面内容评估参数。画面质量评估参数可以表示从画面质量维度对画面数据进行质量评估所得到的结果。画面内容评估参数可以表示从画面内容维度对画面数据进行质量评估所得到的结果。因此可以从画面静态特征分别确定出针对画面质量评估参数的画面静态特征以及针对画面内容评估参数的画面静态特征,通过相应的画面静态特征来分别确定画面质量评估参数和画面内容评估参数。Optionally, the parameter types of the picture static evaluation parameters may also include picture quality evaluation parameters and picture content evaluation parameters. The picture quality evaluation parameters can represent the results of quality evaluation of picture data from the picture quality dimension. The screen content evaluation parameter can represent the result of quality assessment of screen data from the screen content dimension. Therefore, the static characteristics of the picture for the picture quality evaluation parameters and the static characteristics of the picture for the picture content evaluation parameters can be determined respectively from the static characteristics of the picture, and the picture quality evaluation parameters and the picture content evaluation parameters are determined respectively through the corresponding static characteristics of the picture.
可选地,确定画面静态评估参数具体可以是,确定画面数据对应的第一画面区域,基于画面数据对应的第一画面区域从画面静态特征中确定局部的画面静态特征,根据局部的画面静态特征确定画面质量评估参数,对画面静态特征进行特征降维处理,得到全局的画面静态特征,根据全局的画面静态特征确定画面内容评估参数。可以理解,对于画面质量而言,画面数据的局部区域的画面质量与全局区域的画面质量相差不多,因此可以基于画面数据的第一画面区域从画面数据的画面静态特征中截取局部的画面静态特征,并利用局部的画面静态特征确定画面质量评估参数,从而可以降低网络参数的规模。对于画面内容而言,画面数据的不同区域包含的画面内容信息是不同的,因此可以对画面静态特征进行特征降维处理,得到全局的画面静态特征,并利用全局的画面静态特征确定画面内容评估参数。此时全局的画面静态特征包含了画面数据中的全部内容特征信息。Optionally, determining the picture static evaluation parameters may specifically include determining the first picture area corresponding to the picture data, determining the local picture static characteristics from the picture static characteristics based on the first picture area corresponding to the picture data, and determining the local picture static characteristics according to the local picture static characteristics. Determine the picture quality evaluation parameters, perform feature dimensionality reduction processing on the static features of the picture, obtain the global static characteristics of the picture, and determine the picture content evaluation parameters based on the global static characteristics of the picture. It can be understood that in terms of picture quality, the picture quality of the local area of the picture data is similar to the picture quality of the global area. Therefore, the local picture static features can be intercepted from the picture static features of the picture data based on the first picture area of the picture data. , and use local static characteristics of the picture to determine the picture quality evaluation parameters, thereby reducing the scale of network parameters. For screen content, different areas of screen data contain different screen content information. Therefore, feature dimensionality reduction processing can be performed on the static features of the screen to obtain global static features of the screen, and the global static features of the screen can be used to determine the assessment of screen content. parameter. At this time, the global static characteristics of the picture include all content feature information in the picture data.
其中,画面数据对应的第一画面区域可以是指的针对每个画面帧的第一画面区域。针对每个画面帧的第一画面区域相同,且可以为一个或多个。画面帧对应的第一画面区域可以是预设区域。比如画面帧的正中间的指定区域、右上角的指定区域等。或者可以是基于预设数量对画面帧进行划分得到的区域,该划分出的预设数量个画面区域可以拼接得到完整的画面帧。或者可以是对画面帧进行对象识别确定出的区域,比如对画面帧中的人物进行识别而框选出的区域、对画面帧中的文字进行识别而框选出的区域。每个画面区域的大小相同。每个画面区域可以存在重叠区域,也可以不存在重叠区域。其中,每个画面帧对应的第一画面区域相同,因此确定出一个画面帧对应的第一画面区域则表示确定出每个画面帧对应的第一画面区域。该确定一个画面帧对应的第一画面区域时可以是对画面数据包括的多个画面帧中的指定画面帧进行画面区域的确定,比如第一个画面帧、最后一个画面帧、或任意选取的画面帧等。在此不做限定。The first picture area corresponding to the picture data may refer to the first picture area for each picture frame. The first picture area for each picture frame is the same, and may be one or more. The first picture area corresponding to the picture frame may be a preset area. For example, the designated area in the middle of the picture frame, the designated area in the upper right corner, etc. Or it may be an area obtained by dividing the picture frame based on a preset number, and the divided preset number of picture areas can be spliced to obtain a complete picture frame. Or it may be an area determined by object recognition of the picture frame, such as an area selected by identifying a person in the picture frame, or an area selected by identifying a text in the picture frame. Each picture area is the same size. Each screen area may or may not have overlapping areas. The first picture area corresponding to each picture frame is the same, so determining the first picture area corresponding to one picture frame means determining the first picture area corresponding to each picture frame. When determining the first picture area corresponding to a picture frame, the picture area may be determined for a designated picture frame among multiple picture frames included in the picture data, such as the first picture frame, the last picture frame, or an arbitrarily selected picture frame. Picture frames etc. No limitation is made here.
可以理解,可以利用第一画面区域对画面静态特征进行截取以确定局部的画面静态特征。一个第一画面区域可以截取一个局部的画面静态特征。当有多个第一画面区域时,可以将截取的多个局部的画面静态特征进行叠加以作为最终的局部的画面静态特征。此外,由于画面数据的大小与画面静态特征的大小可能不同,因此可以构建画面区域与截取区域的映射关系,可以基于该映射关系确定画面区域对应的截取区域的大小,并按照该截取区域的大小来对画面静态特征进行截取。该映射关系可以基于画面数据的大小与画面静态特征的大小之间的关联关系设置。比如可以是基于画面数据的像素维数与画面静态特征的特征维数来确定映射关系。It can be understood that the first picture area can be used to intercept the static characteristics of the picture to determine the local static characteristics of the picture. A first picture area can capture a local static feature of the picture. When there are multiple first picture areas, multiple intercepted local picture static features can be superimposed to serve as the final local picture static features. In addition, since the size of the picture data may be different from the size of the static features of the picture, a mapping relationship between the picture area and the interception area can be constructed. Based on the mapping relationship, the size of the interception area corresponding to the picture area can be determined, and the size of the interception area can be determined according to the size of the interception area. to intercept the static features of the picture. The mapping relationship may be set based on the correlation between the size of the picture data and the size of the static features of the picture. For example, the mapping relationship can be determined based on the pixel dimension of the picture data and the feature dimension of the static features of the picture.
此外,静态评估网络中可以包括第一评估网络和第二评估网络,可以通过第一评估网络对画面静态特征进行截取处理得到局部的画面静态特征,并对局部的画面静态特征进行特征处理以得到画面质量评估参数。可以通过第二评估网络对画面静态特征进行降维处理得到全局的画面静态特征,并对全局的画面静态特征进行特征处理以得到画面内容评估参数。In addition, the static evaluation network may include a first evaluation network and a second evaluation network. The first evaluation network may intercept the static features of the picture to obtain local static features of the picture, and perform feature processing on the local static features of the picture to obtain Picture quality evaluation parameters. The second evaluation network can perform dimensionality reduction processing on the static characteristics of the picture to obtain the global static characteristics of the picture, and perform feature processing on the global static characteristics of the picture to obtain the picture content evaluation parameters.
相应地,画面动态评估参数的参数类型可以包括画面切换评估参数和画面时序评估参数。画面切换评估参数可以表示从画面切换维度对画面数据进行质量评估所得到的结果。该画面切换维度可以衡量画面数据的画面美学元素与拍摄技巧。画面时序评估参数可以表示从画面时序维度从画面数据进行质量评估所得到的结果。因此可以从画面动态特征分别确定出针对画面切换评估参数的画面静态特征以及针对画面时序评估参数的画面静态特征,通过相应的画面静态特征来分别确定画面切换评估参数和画面时序评估参数。Correspondingly, the parameter types of the picture dynamic evaluation parameters may include picture switching evaluation parameters and picture timing evaluation parameters. The screen switching evaluation parameter can represent the result of quality evaluation of the screen data from the screen switching dimension. This picture switching dimension can measure the picture aesthetic elements and shooting skills of the picture data. The picture timing evaluation parameter can represent the result of quality evaluation from the picture data from the picture timing dimension. Therefore, the static characteristics of the picture for the picture switching evaluation parameters and the static characteristics of the picture for the timing evaluation parameters can be determined from the dynamic characteristics of the picture, and the picture switching evaluation parameters and the picture timing evaluation parameters are determined respectively through the corresponding static characteristics of the picture.
可选地,确定画面动态评估参数具体可以是,确定画面数据对应的第二画面区域,基于画面数据对应的第二画面区域从画面动态特征中确定局部的画面动态特征,根据局部的画面动态特征确定画面时序评估参数,对画面动态特征进行特征降维处理,得到全局的画面动态特征,根据全局的画面动态特征确定画面切换评估参数。可以理解,对于画面时序而言,画面数据的局部区域的画面时序稳定性与全局区域的画面时序稳定性相差不多,因此可以基于画面数据的第二画面区域从画面数据的画面动态特征中截取局部的画面动态特征,并利用局部的画面动态特征确定画面时序评估参数,从而可以降低网络参数的规模。对于画面切换而言,画面数据的不同区域包含的画面内容信息是不同的,因此对于画面切换效果而言,也是不同的,因此可以对画面动态特征进行特征降维处理,得到全局的画面动态特征,并利用全局的画面动态特征确定画面切换评估参数。此时全局的画面动态特征包含了画面数据中的全部内容的切换特征信息。Optionally, determining the picture dynamic evaluation parameters may specifically include determining the second picture area corresponding to the picture data, determining the local picture dynamic characteristics from the picture dynamic characteristics based on the second picture area corresponding to the picture data, and determining the local picture dynamic characteristics according to the local picture dynamic characteristics. Determine the picture timing evaluation parameters, perform feature dimensionality reduction processing on the picture dynamic characteristics, obtain the global picture dynamic characteristics, and determine the picture switching evaluation parameters based on the global picture dynamic characteristics. It can be understood that for the picture timing, the picture timing stability of the local area of the picture data is similar to the picture timing stability of the global area. Therefore, the local area can be intercepted from the picture dynamic characteristics of the picture data based on the second picture area of the picture data. The dynamic characteristics of the picture are determined, and the local dynamic characteristics of the picture are used to determine the picture timing evaluation parameters, thereby reducing the scale of network parameters. For screen switching, different areas of screen data contain different screen content information, so the effect of screen switching is also different. Therefore, feature dimensionality reduction processing can be performed on the dynamic features of the screen to obtain global dynamic features of the screen. , and use the global dynamic characteristics of the picture to determine the picture switching evaluation parameters. At this time, the global dynamic characteristics of the picture include the switching characteristic information of all contents in the picture data.
可以理解,第二画面区域与第一画面区域相同。对画面动态特征进行截取得到局部的画面动态特征的过程和原理可以参照对画面静态特征进行截取得到局部的画面静态特征的过程和原理。此外,动态评估网络中可以包括第三评估网络和第四评估网络,可以通过第三评估网络对画面动态特征进行截取处理得到局部的画面动态特征,并对局部的画面动态特征进行特征处理以得到画面时序评估参数。可以通过第四评估网络对画面动态特征进行降维处理得到全局的画面动态特征,并对全局的画面动态特征进行特征处理以得到画面切换评估参数。It can be understood that the second picture area is the same as the first picture area. The process and principle of intercepting the dynamic characteristics of the picture to obtain local dynamic characteristics of the picture can be referred to the process and principle of intercepting the static characteristics of the picture to obtain local static characteristics of the picture. In addition, the dynamic evaluation network may include a third evaluation network and a fourth evaluation network. The third evaluation network may intercept and process the dynamic characteristics of the picture to obtain local dynamic characteristics of the picture, and perform feature processing on the local dynamic characteristics of the picture to obtain Screen timing evaluation parameters. The fourth evaluation network can perform dimensionality reduction processing on the dynamic characteristics of the picture to obtain the global dynamic characteristics of the picture, and perform feature processing on the dynamic characteristics of the global picture to obtain the picture switching evaluation parameters.
因此,通过上述描述,本申请实施例提供一种目标检测模型,可以通过该目标检测模型确定画面数据的画面评估参数,该目标检测模型可以包括第一特征处理网络、第二特征处理网络、特征分层网络和第三特征处理网络。其中,特征分层网络可以包括残差网络或者多个卷积网络(如上述第一卷积网络、第二卷积网络和第三卷积网络)。第三特征处理网络可以包括动态评估网络和静态评估网络,且静态评估网络可以包括上述第一评估网络和第二评估网络、动态评估网络可以包括上述第三评估网络和第四评估网络。可选地,目标检测模型中的第一特征处理网络和第二特征处理网络之间还可以包括全连接层。上述提及的目标检测模型均已预先通过训练样本数据以及训练样本数据的画面评估参数标签训练好。Therefore, through the above description, embodiments of the present application provide a target detection model through which the picture evaluation parameters of the picture data can be determined. The target detection model may include a first feature processing network, a second feature processing network, a feature processing network, and a feature processing network. Hierarchical network and tertiary feature processing network. The feature layered network may include a residual network or multiple convolutional networks (such as the first convolutional network, the second convolutional network and the third convolutional network mentioned above). The third feature processing network may include a dynamic evaluation network and a static evaluation network, and the static evaluation network may include the above-mentioned first evaluation network and the second evaluation network, and the dynamic evaluation network may include the above-mentioned third evaluation network and the fourth evaluation network. Optionally, a fully connected layer may also be included between the first feature processing network and the second feature processing network in the target detection model. The target detection models mentioned above have been trained in advance through the training sample data and the image evaluation parameter labels of the training sample data.
例如,如图5a-图5b所示,图5a-图5b为本申请实施例提供的一种确定画面评估参数的场景示意图;其中,目标检测模型如图5a所示,确定画面评估参数过程可以是:对画面数据进行分帧处理得到画面数据所包括的多个画面帧,将多个画面帧依次输入第一特征处理网络得到每个画面帧的画面特征;将每个画面帧的画面特征输入第二特征处理网络,由第二特征处理网络对每个画面帧的画面特征以及每个画面帧的相邻画面帧的关联画面特征进行特征关联处理以输出每个画面帧的关联画面特征;将每个画面帧的关联画面特征输入特征分层网络,由特征分词网络中的多个卷积网络对画面数据的关联画面特征进行特征分层处理,以得到画面数据的画面动态特征和画面静态特征;将画面静态特征输出第三特征处理网络中的第一评估网络,得到局部的画面静态特征,并基于局部的画面静态特征得到画面质量评估参数;将画面静态特征输出第三特征处理网络中的第二评估网络,得到全局的画面静态特征,并基于全局的画面静态特征得到画面内容评估参数;将画面动态特征输出第三特征处理网络中的第三评估网络,得到局部的画面动态特征,并基于局部的画面动态特征得到画面时序评估参数;将画面动态特征输出第三特征处理网络中的第四评估网络,得到全局的画面动态特征,并基于全局的画面动态特征得到画面切换评估参数。For example, as shown in Figure 5a-Figure 5b, Figure 5a-Figure 5b is a schematic diagram of a scene for determining picture evaluation parameters provided by an embodiment of the present application; wherein the target detection model is shown in Figure 5a, and the process of determining the picture evaluation parameters can be Yes: perform frame processing on the picture data to obtain multiple picture frames included in the picture data, input the multiple picture frames in sequence to the first feature processing network to obtain the picture characteristics of each picture frame; input the picture characteristics of each picture frame The second feature processing network performs feature correlation processing on the picture features of each picture frame and the associated picture features of adjacent picture frames of each picture frame to output the associated picture features of each picture frame; The associated picture features of each picture frame are input into the feature hierarchical network. Multiple convolutional networks in the feature segmentation network perform feature layering processing on the associated picture features of the picture data to obtain the picture dynamic characteristics and picture static characteristics of the picture data. ; Output the static characteristics of the picture to the first evaluation network in the third feature processing network to obtain local static characteristics of the picture, and obtain the picture quality evaluation parameters based on the local static characteristics of the picture; output the static characteristics of the picture to the third feature processing network The second evaluation network obtains the global static characteristics of the picture, and obtains the picture content evaluation parameters based on the global static characteristics of the picture; outputs the dynamic characteristics of the picture to the third evaluation network in the third feature processing network, obtains the local dynamic characteristics of the picture, and The picture timing evaluation parameters are obtained based on the local picture dynamic characteristics; the picture dynamic characteristics are output to the fourth evaluation network in the third feature processing network to obtain the global picture dynamic characteristics, and the picture switching evaluation parameters are obtained based on the global picture dynamic characteristics.
此外,确定局部的画面静态特征如图5b所示,以画面数据对应的第一画面区域包括区域1和区域2为例,基于画面帧的画面区域的大小和局部静态特征的特征大小可以确定画面区域与截取区域之间的映射关系,可以确定区域1对应的截取区域1以及区域2对应的截取区域2,基于截取区域1从画面静态特征进行截取得到局部的画面静态特征,基于截取区域1从画面静态特征进行截取得到局部的画面静态特征1、以及基于截取区域2从画面静态特征进行截取得到局部的画面静态特征2,将局部的画面静态特征1和局部的画面静态特征2进行叠加得到画面数据最终的局部的画面静态特征。可以理解,每个画面帧均按照前述过程可以得到每个画面帧对应的局部的画面静态特征。In addition, the local static features of the picture are determined as shown in Figure 5b. Taking the first picture area corresponding to the picture data including area 1 and area 2 as an example, the picture can be determined based on the size of the picture area of the picture frame and the characteristic size of the local static features. The mapping relationship between the area and the interception area can determine the interception area 1 corresponding to area 1 and the interception area 2 corresponding to area 2. Based on the interception area 1, we can intercept the static characteristics of the picture to obtain the local static characteristics of the picture. Based on the interception area 1, we can obtain the local static characteristics of the picture. The static features of the screen are intercepted to obtain the local static features of the screen 1, and the static features of the screen are intercepted based on the interception area 2 to obtain the local static features of the screen 2. The local static features of the screen 1 and the local static features of the screen 2 are superimposed to obtain the screen. The final local picture static characteristics of the data. It can be understood that for each picture frame, the local picture static features corresponding to each picture frame can be obtained according to the foregoing process.
在一些实施例中,对于不同画面数据而言,用户在观看时不一定会从所有评估维度来衡量画面数据。比如,对于风景类的画面数据,用户主要在意的可能是画面质量和画面时序。对于日常类的画面数据,用户主要在意的是可能是画面内容和画面切换。因此还可以,对画面数据进行场景检测,得到画面数据的画面场景信息,根据画面场景信息确定待获取的画面评估参数的参数信息,该参数信息指示了画面数据应该获取的画面评估参数的类型,可以根据关联画面特征确定参数信息所指示的至少一种画面评估参数的参数值。比如,参数信息指示画面质量评估参数和画面时序评估参数,则可以是根据关联画面信息确定针对画面质量评估参数的画面静态特征以及针对画面时序评估参数的画面动态特征,并根据针对画面质量评估参数的画面静态特征确定画面质量评估参数的参数值以及根据针对画面时序评估参数的画面动态特征确定画面时序评估参数的参数值。其中,画面场景信息可以表示画面数据的画面类型,比如为风景类、日常类等等。可以是通过场景检测模型对画面数据进行场景检测,以输出画面场景信息。该场景检测模型基于样本画面数据以及画面场景标签训练得到。此外,画面场景信息与参数信息之间的对应关系可以根据经验值设置。In some embodiments, for different picture data, the user may not necessarily measure the picture data from all evaluation dimensions when watching. For example, for landscape picture data, users may mainly care about picture quality and picture timing. For daily screen data, users may be mainly concerned about screen content and screen switching. Therefore, it is also possible to perform scene detection on the picture data to obtain the picture scene information of the picture data, and determine the parameter information of the picture evaluation parameters to be obtained according to the picture scene information. The parameter information indicates the type of picture evaluation parameters that should be obtained by the picture data. The parameter value of at least one picture evaluation parameter indicated by the parameter information may be determined according to the associated picture characteristics. For example, if the parameter information indicates the picture quality evaluation parameters and the picture timing evaluation parameters, the static characteristics of the picture for the picture quality evaluation parameters and the dynamic characteristics of the picture for the picture timing evaluation parameters may be determined based on the associated picture information, and the picture quality evaluation parameters may be determined based on the associated picture information. The parameter value of the picture quality evaluation parameter is determined based on the static characteristics of the picture and the parameter value of the picture timing evaluation parameter is determined based on the dynamic characteristics of the picture for the picture timing evaluation parameter. Among them, the picture scene information can represent the picture type of the picture data, such as landscape type, daily type, etc. Scene detection may be performed on the picture data through a scene detection model to output picture scene information. The scene detection model is trained based on sample picture data and picture scene labels. In addition, the correspondence between picture scene information and parameter information can be set based on experience values.
S404、根据画面静态评估参数和画面动态评估参数生成针对画面数据的质量检测结果。S404. Generate quality detection results for the picture data according to the picture static evaluation parameters and the picture dynamic evaluation parameters.
其中,可以将一种或多种画面静态评估参数的参数值与一种或多种画面动态评估参数的参数值对应的平均值作为质量检测结果。或者,也可以对画面静态评估参数的参数值与画面动态评估参数的参数值进行加权求和,得到质量检测结果。Wherein, the average value corresponding to the parameter value of one or more static picture evaluation parameters and the parameter value of one or more dynamic picture evaluation parameters may be used as the quality detection result. Alternatively, the parameter values of the static picture evaluation parameters and the parameter values of the dynamic picture evaluation parameters can also be weighted and summed to obtain the quality detection result.
可选地,画面数据可以有多个,可以分别将多个画面数据中每个画面数据的画面评估参数的参数值所对应的平均值作为针对每个画面数据的质量检测结果。该质量检测结果可以用于确定待推荐的画面数据。比如可以是,按照多个画面数据的质量检测结果从大到小的顺序,从多个画面数据依次选取出目标数量个画面数据,并将目标数量个画面数据确定为待推荐的画面数据,将待推荐的画面数据推送至目标终端。Optionally, there may be multiple picture data, and the average value corresponding to the parameter value of the picture evaluation parameter of each picture data in the plurality of picture data may be used as the quality detection result for each picture data. The quality detection results can be used to determine the picture data to be recommended. For example, a target number of screen data can be selected from the multiple screen data in descending order according to the quality detection results of the multiple screen data, and the target number of screen data can be determined as the screen data to be recommended, and the target number of screen data can be selected. The screen data to be recommended is pushed to the target terminal.
比如,可以是在画面数据的搜索场景中,基于用户的搜索词条中数据库中获取到匹配的多个画面数据,并可以基于该多个画面数据与搜索词条的匹配以及该多个画面数据的质量检测结果从多个画面数据中综合确定出待推荐给用户的画面数据。For example, in the search scenario of screen data, multiple matching screen data can be obtained from the database based on the user's search terms, and based on the matching of the multiple screen data and the search terms and the multiple screen data The quality inspection results are comprehensively determined from multiple screen data to determine the screen data to be recommended to the user.
本申请实施例中,可以获取待检测的画面数据,分别从画面数据包括的每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征,从关联画面特征中确定出画面数据的画面静态特征和画面动态特征,根据画面静态特征确定画面静态评估参数,并根据画面动态特征确定画面动态评估参数;该可以提高关联画面特征涵盖的特征信息丰富度,从而得到准确的画面评估参数,以及可以根据不同维度的特征确定不同维度的画面评估参数;根据画面静态评估参数和画面动态评估参数生成针对画面数据的质量检测结果。该可以通过画面评估参数综合确定出画面数据的质量检测结果,以提高质量检测结果的准确性和可靠性。In the embodiment of the present application, the picture data to be detected can be obtained, the picture characteristics of each picture frame are extracted from each picture frame included in the picture data, and each picture frame is compared based on the adjacent picture frames of each picture frame. Perform picture association processing on the picture characteristics to obtain the associated picture characteristics of the picture data, determine the picture static characteristics and picture dynamic characteristics of the picture data from the associated picture characteristics, determine the picture static evaluation parameters according to the picture static characteristics, and determine the picture static characteristics according to the picture dynamic characteristics Determine the picture dynamic evaluation parameters; this can improve the feature information richness covered by the associated picture features, thereby obtaining accurate picture evaluation parameters, and determine the picture evaluation parameters of different dimensions according to the characteristics of different dimensions; according to the picture static evaluation parameters and picture dynamics Evaluating parameters generates quality inspection results for screen data. The quality detection results of the screen data can be comprehensively determined through the screen evaluation parameters to improve the accuracy and reliability of the quality detection results.
请参见图6,图6为本申请提供的一种画面数据处理装置的结构示意图。需要说明的是,图6所示的画面数据处理装置,用于执行本申请图2和图4所示实施例的方法,为了便于说明,仅示出了与本申请实施例相关的部分,具体技术细节未揭示,经参照本申请图2和图4所示的实施例。该画面数据处理装置600可包括:获取模块601、处理模块602。其中:Please refer to FIG. 6 , which is a schematic structural diagram of a picture data processing device provided by the present application. It should be noted that the screen data processing device shown in Figure 6 is used to execute the methods of the embodiments shown in Figures 2 and 4 of the present application. For ease of explanation, only the parts related to the embodiments of the present application are shown. Specifically, Technical details are not disclosed, and reference is made to the embodiments shown in Figures 2 and 4 of the present application. The picture data processing device 600 may include: an acquisition module 601 and a processing module 602. in:
获取模块601,用于获取待检测的画面数据;画面数据包括按照画面数据中的时间顺序排列的多个画面帧;The acquisition module 601 is used to obtain the picture data to be detected; the picture data includes multiple picture frames arranged in time order in the picture data;
处理模块602,用于分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征;The processing module 602 is used to extract the picture features of each picture frame from each picture frame, and perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain picture data. associated picture features;
处理模块602,还用于根据关联画面特征确定画面数据的画面评估参数;画面评估参数包括如下一种或多种:画面静态评估参数、画面动态评估参数;The processing module 602 is also used to determine the picture evaluation parameters of the picture data according to the associated picture characteristics; the picture evaluation parameters include one or more of the following: static picture evaluation parameters and dynamic picture evaluation parameters;
处理模块602,还用于根据画面评估参数生成针对画面数据的质量检测结果。The processing module 602 is also used to generate quality detection results for the picture data according to the picture evaluation parameters.
在一些实施例中,处理模块602在用于基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征时,具体用于:In some embodiments, when the processing module 602 is used to perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain the associated picture characteristics of the picture data, it is specifically used to:
分别对每个画面帧的画面特征和每个画面帧的相邻画面帧的画面特征进行画面关联处理,得到每个画面帧的关联画面特征;Perform picture correlation processing on the picture characteristics of each picture frame and the picture characteristics of adjacent picture frames of each picture frame to obtain the associated picture characteristics of each picture frame;
基于每个画面帧的关联画面特征确定画面数据的关联画面特征。The associated picture characteristics of the picture data are determined based on the associated picture characteristics of each picture frame.
在一些实施例中,处理模块602还用于:In some embodiments, the processing module 602 is also used to:
对画面数据进行场景检测,得到画面数据的画面场景信息;Perform scene detection on the picture data to obtain the picture scene information of the picture data;
根据画面场景信息确定待获取的画面评估参数的参数信息;Determine the parameter information of the picture evaluation parameters to be obtained according to the picture scene information;
根据关联画面特征确定画面数据的画面评估参数,包括:Determine the picture evaluation parameters of the picture data based on the associated picture characteristics, including:
根据关联画面特征确定参数信息指示的至少一种画面评估参数的参数值。A parameter value of at least one picture evaluation parameter indicated by the parameter information is determined according to the associated picture characteristics.
在一些实施例中,画面评估参数包括画面静态评估参数和画面动态评估参数;In some embodiments, the picture evaluation parameters include picture static evaluation parameters and picture dynamic evaluation parameters;
处理模块602在用于根据关联画面特征确定画面数据的画面评估参数时,具体用于:When the processing module 602 is used to determine the picture evaluation parameters of the picture data according to the associated picture characteristics, it is specifically used to:
从关联画面特征中确定出画面数据的画面静态特征和画面动态特征;Determine the static characteristics of the picture and the dynamic characteristics of the picture from the associated picture characteristics;
根据画面静态特征确定画面静态评估参数,并根据画面动态特征确定画面动态评估参数。The static evaluation parameters of the picture are determined according to the static characteristics of the picture, and the dynamic evaluation parameters of the picture are determined according to the dynamic characteristics of the picture.
在一些实施例中,画面静态评估参数包括:画面质量评估参数、画面内容评估参数;处理模块602在用于根据画面静态特征确定画面静态评估参数时,具体用于:In some embodiments, the picture static evaluation parameters include: picture quality evaluation parameters and picture content evaluation parameters; when the processing module 602 is used to determine the picture static evaluation parameters according to the picture static characteristics, it is specifically used to:
确定画面数据对应的第一画面区域,基于画面数据对应的第一画面区域从画面静态特征中确定局部的画面静态特征;Determine the first picture area corresponding to the picture data, and determine the local picture static characteristics from the picture static characteristics based on the first picture area corresponding to the picture data;
根据局部的画面静态特征确定画面质量评估参数;Determine picture quality evaluation parameters based on local static characteristics of the picture;
对画面静态特征进行特征降维处理,得到全局的画面静态特征;Perform feature dimensionality reduction on the static features of the picture to obtain global static features of the picture;
根据全局的画面静态特征确定画面内容评估参数。Screen content evaluation parameters are determined based on global static characteristics of the screen.
在一些实施例中,画面动态评估参数包括:画面切换评估参数、画面时序评估参数;处理模块602在用于根据画面动态特征确定画面动态评估参数时,具体用于:In some embodiments, the picture dynamic evaluation parameters include: picture switching evaluation parameters and picture timing evaluation parameters; when the processing module 602 is used to determine the picture dynamic evaluation parameters according to the picture dynamic characteristics, it is specifically used to:
确定画面数据对应的第二画面区域,基于画面数据对应的第二画面区域从画面动态特征中确定局部的画面动态特征;Determine the second picture area corresponding to the picture data, and determine the local picture dynamic characteristics from the picture dynamic characteristics based on the second picture area corresponding to the picture data;
根据局部的画面动态特征确定画面时序评估参数;Determine picture timing evaluation parameters based on local picture dynamic characteristics;
对画面动态特征进行特征降维处理,得到全局的画面动态特征;Perform feature dimensionality reduction on the dynamic features of the picture to obtain global dynamic characteristics of the picture;
根据全局的画面动态特征确定画面切换评估参数。The screen switching evaluation parameters are determined according to the global dynamic characteristics of the screen.
在一些实施例中,画面数据为多个;处理模块602在用于根据画面评估参数生成针对画面数据的质量检测结果时,具体用于:In some embodiments, there is multiple picture data; when the processing module 602 is used to generate quality detection results for the picture data according to the picture evaluation parameters, it is specifically used to:
分别将多个画面数据中每个画面数据的画面评估参数的参数值所对应的平均值作为针对每个画面数据的质量检测结果;The average value corresponding to the parameter value of the picture evaluation parameter of each picture data in the plurality of picture data is used as the quality detection result for each picture data;
处理模块602还用于:The processing module 602 is also used to:
按照多个画面数据的质量检测结果从大到小的顺序,从多个画面数据依次选取出目标数量个画面数据,并将目标数量个画面数据确定为待推荐的画面数据;According to the order of the quality detection results of the multiple screen data from large to small, a target number of screen data are selected from the multiple screen data, and the target number of screen data are determined as the screen data to be recommended;
将待推荐的画面数据推送至目标终端。Push the screen data to be recommended to the target terminal.
本申请实施例中,获取模块获取待检测的画面数据;处理模块分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征;处理模块根据关联画面特征确定画面数据的画面评估参数;处理模块根据画面评估参数生成针对画面数据的质量检测结果。通过上述装置,可以提高关联画面特征涵盖的特征信息丰富度,从而得到准确的画面评估参数,可以通过画面评估参数综合确定出画面数据的质量检测结果,以提高质量检测结果的准确性和可靠性。In the embodiment of the present application, the acquisition module obtains the picture data to be detected; the processing module extracts the picture characteristics of each picture frame from each picture frame, and compares the characteristics of each picture frame based on the adjacent picture frames of each picture frame. The picture characteristics are processed to obtain picture correlation, and the associated picture characteristics of the picture data are obtained; the processing module determines the picture evaluation parameters of the picture data based on the associated picture characteristics; the processing module generates quality detection results for the picture data based on the picture evaluation parameters. Through the above device, the richness of feature information covered by the associated picture features can be improved, thereby obtaining accurate picture evaluation parameters. The quality detection results of the picture data can be comprehensively determined through the picture evaluation parameters to improve the accuracy and reliability of the quality detection results. .
请参见图7,图7为本申请实施例提供的一种电子设备的结构示意图。如图7所示,该电子设备700包括:至少一个处理器701、存储器702。可选的,该电子设备还可包括网络接口。其中,处理器701、存储器702以及网络接口之间可以交互数据,网络接口受处理器701的控制用于收发消息,存储器702用于存储计算机程序,计算机程序包括程序指令,处理器701用于执行存储器702存储的程序指令。其中,处理器701被配置用于调用程序指令执行上述方法。Please refer to FIG. 7 , which is a schematic structural diagram of an electronic device provided by an embodiment of the present application. As shown in FIG. 7 , the electronic device 700 includes: at least one processor 701 and a memory 702 . Optionally, the electronic device may also include a network interface. Among them, the processor 701, the memory 702 and the network interface can exchange data. The network interface is controlled by the processor 701 and is used to send and receive messages. The memory 702 is used to store computer programs. The computer program includes program instructions, and the processor 701 is used to execute Memory 702 stores program instructions. Wherein, the processor 701 is configured to call program instructions to execute the above method.
存储器702可以包括易失性存储器(volatile memory),例如随机存取存储器(random-access memory,RAM);存储器702也可以包括非易失性存储器(non-volatilememory),例如快闪存储器(flash memory),固态硬盘(solid-state drive,SSD)等;存储器702还可以包括上述种类的存储器的组合。The memory 702 may include volatile memory (volatile memory), such as random-access memory (RAM); the memory 702 may also include non-volatile memory (non-volatile memory), such as flash memory (flash memory). ), solid-state drive (SSD), etc.; the memory 702 may also include a combination of the above types of memory.
处理器701可以是中央处理器(central processing unit,CPU)。在一个实施例中,处理器701还可以是图形处理器(Graphics Processing Unit,GPU)。处理器701也可以是由CPU和GPU的组合。The processor 701 may be a central processing unit (CPU). In one embodiment, the processor 701 may also be a graphics processing unit (GPU). The processor 701 may also be a combination of a CPU and a GPU.
在一个可能的实施方式中,存储器702用于存储程序指令,处理器701可以调用程序指令,执行以下步骤:In a possible implementation, the memory 702 is used to store program instructions, and the processor 701 can call the program instructions to perform the following steps:
获取待检测的画面数据;画面数据包括按照画面数据中的时间顺序排列的多个画面帧;Obtain the picture data to be detected; the picture data includes multiple picture frames arranged in time order in the picture data;
分别从每个画面帧中提取每个画面帧的画面特征,并基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征;Extract the picture features of each picture frame from each picture frame respectively, and perform picture association processing on the picture features of each picture frame based on the adjacent picture frames of each picture frame to obtain the associated picture features of the picture data;
根据关联画面特征确定画面数据的画面评估参数;画面评估参数包括如下一种或多种:画面静态评估参数、画面动态评估参数;Determine the picture evaluation parameters of the picture data according to the associated picture characteristics; the picture evaluation parameters include one or more of the following: picture static evaluation parameters, picture dynamic evaluation parameters;
根据画面评估参数生成针对画面数据的质量检测结果。Generate quality inspection results for the picture data based on the picture evaluation parameters.
在一些实施例中,处理器701在用于基于每个画面帧的相邻画面帧对每个画面帧的画面特征进行得到画面关联处理,得到画面数据的关联画面特征时,具体用于:In some embodiments, when the processor 701 is used to perform picture association processing on the picture characteristics of each picture frame based on the adjacent picture frames of each picture frame to obtain the associated picture characteristics of the picture data, it is specifically used to:
分别对每个画面帧的画面特征和每个画面帧的相邻画面帧的画面特征进行画面关联处理,得到每个画面帧的关联画面特征;Perform picture correlation processing on the picture characteristics of each picture frame and the picture characteristics of adjacent picture frames of each picture frame to obtain the associated picture characteristics of each picture frame;
基于每个画面帧的关联画面特征确定画面数据的关联画面特征。The associated picture characteristics of the picture data are determined based on the associated picture characteristics of each picture frame.
在一些实施例中,处理器701还用于:In some embodiments, processor 701 is also used to:
对画面数据进行场景检测,得到画面数据的画面场景信息;Perform scene detection on the picture data to obtain the picture scene information of the picture data;
根据画面场景信息确定待获取的画面评估参数的参数信息;Determine the parameter information of the picture evaluation parameters to be obtained according to the picture scene information;
根据关联画面特征确定画面数据的画面评估参数,包括:Determine the picture evaluation parameters of the picture data based on the associated picture characteristics, including:
根据关联画面特征确定参数信息指示的至少一种画面评估参数的参数值。A parameter value of at least one picture evaluation parameter indicated by the parameter information is determined according to the associated picture characteristics.
在一些实施例中,画面评估参数包括画面静态评估参数和画面动态评估参数;In some embodiments, the picture evaluation parameters include picture static evaluation parameters and picture dynamic evaluation parameters;
处理器701在用于根据关联画面特征确定画面数据的画面评估参数时,具体用于:When the processor 701 is used to determine the picture evaluation parameters of the picture data according to the associated picture characteristics, it is specifically used to:
从关联画面特征中确定出画面数据的画面静态特征和画面动态特征;Determine the static characteristics of the picture and the dynamic characteristics of the picture from the associated picture characteristics;
根据画面静态特征确定画面静态评估参数,并根据画面动态特征确定画面动态评估参数。The static evaluation parameters of the picture are determined according to the static characteristics of the picture, and the dynamic evaluation parameters of the picture are determined according to the dynamic characteristics of the picture.
在一些实施例中,画面静态评估参数包括:画面质量评估参数、画面内容评估参数;处理器701在用于根据画面静态特征确定画面静态评估参数时,具体用于:In some embodiments, the picture static evaluation parameters include: picture quality evaluation parameters and picture content evaluation parameters; when the processor 701 is used to determine the picture static evaluation parameters according to the picture static characteristics, it is specifically used to:
确定画面数据对应的第一画面区域,基于画面数据对应的第一画面区域从画面静态特征中确定局部的画面静态特征;Determine the first picture area corresponding to the picture data, and determine the local picture static characteristics from the picture static characteristics based on the first picture area corresponding to the picture data;
根据局部的画面静态特征确定画面质量评估参数;Determine picture quality evaluation parameters based on local static characteristics of the picture;
对画面静态特征进行特征降维处理,得到全局的画面静态特征;Perform feature dimensionality reduction on the static features of the picture to obtain global static features of the picture;
根据全局的画面静态特征确定画面内容评估参数。Screen content evaluation parameters are determined based on global static characteristics of the screen.
在一些实施例中,画面动态评估参数包括:画面切换评估参数、画面时序评估参数;处理器701在用于根据画面动态特征确定画面动态评估参数时,具体用于:In some embodiments, the picture dynamic evaluation parameters include: picture switching evaluation parameters and picture timing evaluation parameters; when the processor 701 is used to determine the picture dynamic evaluation parameters according to the picture dynamic characteristics, it is specifically used to:
确定画面数据对应的第二画面区域,基于画面数据对应的第二画面区域从画面动态特征中确定局部的画面动态特征;Determine the second picture area corresponding to the picture data, and determine the local picture dynamic characteristics from the picture dynamic characteristics based on the second picture area corresponding to the picture data;
根据局部的画面动态特征确定画面时序评估参数;Determine picture timing evaluation parameters based on local picture dynamic characteristics;
对画面动态特征进行特征降维处理,得到全局的画面动态特征;Perform feature dimensionality reduction on the dynamic features of the picture to obtain global dynamic characteristics of the picture;
根据全局的画面动态特征确定画面切换评估参数。The screen switching evaluation parameters are determined according to the global dynamic characteristics of the screen.
在一些实施例中,画面数据为多个;处理器701在用于根据画面评估参数生成针对画面数据的质量检测结果时,具体用于:In some embodiments, there is multiple picture data; when the processor 701 is used to generate quality detection results for the picture data according to the picture evaluation parameters, it is specifically used to:
分别将多个画面数据中每个画面数据的画面评估参数的参数值所对应的平均值作为针对每个画面数据的质量检测结果;The average value corresponding to the parameter value of the picture evaluation parameter of each picture data in the plurality of picture data is used as the quality detection result for each picture data;
处理器701还用于:Processor 701 is also used for:
按照多个画面数据的质量检测结果从大到小的顺序,从多个画面数据依次选取出目标数量个画面数据,并将目标数量个画面数据确定为待推荐的画面数据;According to the order of the quality detection results of the multiple screen data from large to small, a target number of screen data are selected from the multiple screen data, and the target number of screen data are determined as the screen data to be recommended;
将待推荐的画面数据推送至目标终端。Push the screen data to be recommended to the target terminal.
具体实现中,本申请实施例中所描述的装置、处理器、存储器等可执行上述方法实施例所描述的实现方式,也可执行本申请实施例所描述的实现方式,在此不再赘述。In specific implementation, the devices, processors, memories, etc. described in the embodiments of this application can execute the implementation described in the above method embodiments, or can also execute the implementation described in the embodiments of this application, which will not be described again here.
本申请实施例中还提供一种计算机(可读)存储介质,计算机存储介质存储有计算机程序,计算机程序包括程序指令,程序指令被处理器执行时,使处理器可执行上述方法实施例中所执行的部分或全部步骤。可选的,该计算机存储介质可以是易失性的,也可以是非易失性的。该计算机可读存储介质可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序等;存储数据区可存储根据区块链节点的使用所创建的数据等。The embodiment of the present application also provides a computer (readable) storage medium. The computer storage medium stores a computer program. The computer program includes program instructions. When the program instructions are executed by the processor, the processor can execute the steps in the above method embodiments. Perform some or all of the steps. Optionally, the computer storage medium may be volatile or non-volatile. The computer-readable storage medium may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, at least one application required for a function, etc.; the storage data area may store data generated according to the use of the blockchain node. Created data, etc.
本申请实施例提供了一种计算机程序产品,该计算机程序产品可包括计算机程序,计算机程序被处理器执行时可实现上述方法中的部分或全部步骤,此处不赘述。Embodiments of the present application provide a computer program product. The computer program product may include a computer program. When the computer program is executed by a processor, it may implement some or all of the steps in the above method, which will not be described again here.
在本文中提及的“多个”是指两个或两个以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。The "plurality" mentioned in this article means two or more than two. "And/or" describes the relationship between related objects, indicating that there can be three relationships. For example, A and/or B can mean: A exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the related objects are in an "or" relationship.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,该计算机程序可存储于计算机存储介质中,该计算机存储介质可以为计算机可读存储介质,该程序在执行时,可包括如上述各方法的实施例的流程。其中,该存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)或随机存储记忆体(Random Access Memory,RAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be completed by instructing relevant hardware through a computer program. The computer program can be stored in a computer storage medium, and the computer storage medium can be a computer. In a readable storage medium, when the program is executed, it may include the processes of the above-mentioned method embodiments. The storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM) or a random access memory (Random Access Memory, RAM), etc.
以上所揭露的仅为本申请的部分实施例而已,当然不能以此来限定本申请之权利范围,本领域普通技术人员可以理解实现上述实施例的全部或部分流程,并依本申请权利要求所作的等同变化,仍属于本申请所涵盖的范围。What is disclosed above is only some of the embodiments of the present application. Of course, it cannot be used to limit the scope of rights of the present application. Those of ordinary skill in the art can understand all or part of the processes for implementing the above embodiments and make decisions according to the claims of the present application. Equivalent changes are still within the scope of this application.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310076423.0A CN117726963B (en) | 2023-01-17 | 2023-01-17 | Picture data processing method and device, electronic equipment and medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310076423.0A CN117726963B (en) | 2023-01-17 | 2023-01-17 | Picture data processing method and device, electronic equipment and medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117726963A true CN117726963A (en) | 2024-03-19 |
CN117726963B CN117726963B (en) | 2024-12-24 |
Family
ID=90198483
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310076423.0A Active CN117726963B (en) | 2023-01-17 | 2023-01-17 | Picture data processing method and device, electronic equipment and medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117726963B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130166591A1 (en) * | 2011-11-09 | 2013-06-27 | Styloot Inc. | Systems and Methods for Searching for Items of Fashion and other Items of Creation |
CN110111136A (en) * | 2019-04-12 | 2019-08-09 | 平安科技(深圳)有限公司 | Video data handling procedure, device, computer equipment and storage medium |
CN112529048A (en) * | 2020-11-23 | 2021-03-19 | 浙江大学 | Product display video aided design method and device based on perception experience |
US20210142069A1 (en) * | 2018-05-18 | 2021-05-13 | Cambricon Technologies Corporation Limited | Video retrieval method, and method and apparatus for generating video retrieval mapping relationship |
CN114742741A (en) * | 2020-12-23 | 2022-07-12 | 北京鸿享技术服务有限公司 | Video picture quality evaluation method, device, equipment and storage medium |
-
2023
- 2023-01-17 CN CN202310076423.0A patent/CN117726963B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130166591A1 (en) * | 2011-11-09 | 2013-06-27 | Styloot Inc. | Systems and Methods for Searching for Items of Fashion and other Items of Creation |
US20210142069A1 (en) * | 2018-05-18 | 2021-05-13 | Cambricon Technologies Corporation Limited | Video retrieval method, and method and apparatus for generating video retrieval mapping relationship |
CN110111136A (en) * | 2019-04-12 | 2019-08-09 | 平安科技(深圳)有限公司 | Video data handling procedure, device, computer equipment and storage medium |
CN112529048A (en) * | 2020-11-23 | 2021-03-19 | 浙江大学 | Product display video aided design method and device based on perception experience |
CN114742741A (en) * | 2020-12-23 | 2022-07-12 | 北京鸿享技术服务有限公司 | Video picture quality evaluation method, device, equipment and storage medium |
Non-Patent Citations (1)
Title |
---|
杨智渊等: "人工智能赋能的设计评价方法研究与应用", 《包装工程》, vol. 42, no. 18, 31 December 2021 (2021-12-31), pages 24 - 34 * |
Also Published As
Publication number | Publication date |
---|---|
CN117726963B (en) | 2024-12-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111260545B (en) | Method and device for generating image | |
US11055516B2 (en) | Behavior prediction method, behavior prediction system, and non-transitory recording medium | |
JP7537060B2 (en) | Information generation method, device, computer device, storage medium, and computer program | |
US20190362171A1 (en) | Living body detection method, electronic device and computer readable medium | |
CN110751649B (en) | Video quality evaluation method and device, electronic equipment and storage medium | |
CN111275784B (en) | Method and device for generating image | |
WO2021103731A1 (en) | Semantic segmentation method, and model training method and apparatus | |
WO2018133825A1 (en) | Method for processing video images in video call, terminal device, server, and storage medium | |
US11087140B2 (en) | Information generating method and apparatus applied to terminal device | |
CN112215171B (en) | Target detection method, device, equipment and computer readable storage medium | |
CN110287848A (en) | The generation method and device of video | |
CN114092678A (en) | Image processing method, device, electronic device and storage medium | |
US20240320807A1 (en) | Image processing method and apparatus, device, and storage medium | |
CN114003160A (en) | Data visualization display method and device, computer equipment and storage medium | |
US20240037898A1 (en) | Method for predicting reconstructabilit, computer device and storage medium | |
CN113763440A (en) | Image processing method, device, equipment and storage medium | |
CN110414335A (en) | Video frequency identifying method, device and computer readable storage medium | |
CN116721139A (en) | Generating depth images of image data | |
CN114299074B (en) | Video segmentation method, device, equipment and storage medium | |
CN111597361B (en) | Multimedia data processing method, device, storage medium and equipment | |
CN108090468A (en) | For detecting the method and apparatus of face | |
CN117197702B (en) | Training method, device, equipment and storage medium of video recognition model | |
CN112270261A (en) | Segmentation method and device for question stem and answer mixture and storage medium | |
CN117808791A (en) | Image processing method, device, computer equipment and storage medium | |
CN117726963A (en) | Picture data processing method and device, electronic equipment and medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |