CN114500983A - Video picture content integrity evaluation method and electronic equipment - Google Patents
Video picture content integrity evaluation method and electronic equipment Download PDFInfo
- Publication number
- CN114500983A CN114500983A CN202011262153.5A CN202011262153A CN114500983A CN 114500983 A CN114500983 A CN 114500983A CN 202011262153 A CN202011262153 A CN 202011262153A CN 114500983 A CN114500983 A CN 114500983A
- Authority
- CN
- China
- Prior art keywords
- element group
- video
- electronic device
- detected
- incomplete
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N17/00—Diagnosis, testing or measuring for television systems or their details
- H04N17/004—Diagnosis, testing or measuring for television systems or their details for digital television systems
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
Abstract
Description
技术领域technical field
本申请涉及多媒体技术领域,尤其涉及一种视频画面内容完整度评估方法及电子设备。The present application relates to the field of multimedia technologies, and in particular, to a method and an electronic device for evaluating the integrity of video image content.
背景技术Background technique
随着智能手机的全面普及和移动互联网日益成熟,用户视频生产量以及分享量大幅提升。视频平台用户日均视频上传量不断增加,视频数量的大幅提升对平台的视频质量审核能力提出了挑战。视频平台需要对用户上传视频画面内容完整度进行评估,主要目的是审核视频画面中是否在元素残缺(字幕,主体,人物等),导致用户观看体验差或影响用户对视频内容理解。With the full popularity of smartphones and the growing maturity of the mobile Internet, the amount of video production and sharing by users has increased significantly. The daily average video upload volume of video platform users continues to increase, and the substantial increase in the number of videos poses a challenge to the platform's video quality review capabilities. The video platform needs to evaluate the completeness of the content of the video uploaded by the user. The main purpose is to check whether there are incomplete elements (subtitles, main body, characters, etc.) in the video image, which will lead to poor user viewing experience or affect the user's understanding of the video content.
目前业界提供的视频画面内容完整度审核方式主要是人工审核。步骤为:(1)视频审核任务发放给审核人员;(2)审核人员在审核平台上观看该视频,观看过程中可以选择正常速度观看,加速观看,以及跳看;(3)审核人员发现视频画面内容不完整,影响视频浏览者的观看体验,则标记为“视频画面内容不完整”。At present, the video screen content integrity review method provided by the industry is mainly manual review. The steps are: (1) The video review task is issued to the reviewers; (2) The reviewers watch the video on the review platform, and during the viewing process, they can choose to watch at normal speed, accelerated viewing, and skip watching; (3) Reviewers find the video If the screen content is incomplete and affects the viewing experience of the video viewer, it is marked as "the video screen content is incomplete".
然而随着短视频数量的爆炸式增长,面对大批量的视频审核需求,人工费用高,且人工审核的审核效率已不能满足视频平台的审核需求。However, with the explosive growth of the number of short videos, in the face of large-scale video review requirements, labor costs are high, and the review efficiency of manual review can no longer meet the review needs of video platforms.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供了一种视频画面内容完整度评估方法及电子设备,用于自动审核用户上传的视频,提升视频平台的审核效率。Embodiments of the present application provide a method and an electronic device for evaluating the integrity of video image content, which are used to automatically review videos uploaded by users and improve the review efficiency of a video platform.
第一方面,本申请提供了一种视频画面内容评价方法,该方法包括:电子设备识别待检测视频中至少两个图像帧中的元素,该至少两个图像帧包括第一图像帧和第二图像帧,该元素为该图像帧中的基本要素;该电子设备根据识别出的元素,确定该待检测视频中的元素组,其中包括第一元素组,该第一元素组中包含该第一图像帧中的第一元素和第二图像帧中的第一元素;该电子设备确定该第一元素组的元素组属性,该第一元素组的元素组属性包括第一残缺参数值,该第一残缺参数值用于表示该第一元素组的残缺程度;该电子设备至少根据该第一元素组的元素组属性,确定该待检测视频的画面内容评估结果。In a first aspect, the present application provides a video image content evaluation method, the method comprising: an electronic device identifying elements in at least two image frames in a video to be detected, the at least two image frames including a first image frame and a second image frame An image frame, the element is the basic element in the image frame; the electronic device determines an element group in the video to be detected according to the identified element, which includes a first element group, and the first element group includes the first element group the first element in the image frame and the first element in the second image frame; the electronic device determines the element group attribute of the first element group, the element group attribute of the first element group includes the first incomplete parameter value, the first element group attribute A defect parameter value is used to indicate the defect degree of the first element group; the electronic device determines the picture content evaluation result of the video to be detected at least according to the element group attribute of the first element group.
在上述实施例中,通过自动获取待检测视频中的元素组,并确定元素组的元素组属性,可以根据元素组的元素组属性确定待检测视频的画面内容评估结果。不需要人工干预就能直接得到对待检测视频的评估结果,使得视频平台可以自动的审核待检测视频,提升了视频平台的审核效率。In the above embodiment, by automatically acquiring the element group in the video to be detected and determining the element group attribute of the element group, the picture content evaluation result of the video to be detected can be determined according to the element group attribute of the element group. The evaluation result of the video to be detected can be obtained directly without manual intervention, so that the video platform can automatically review the video to be detected, which improves the review efficiency of the video platform.
结合第一方面的一些实施例,在一些实施例中,该电子设备至少根据该第一元素组的元素组属性,确定该待检测视频的画面内容评估结果,具体包括:该电子设备根据该第一元素组的元素组属性,确定该第一元素组对该待检测视频画面内容的影响结果;该电子设备至少根据该第一元素组对该待检测视频画面内容的影响结果,确定该待检测视频的画面内容评估结果。With reference to some embodiments of the first aspect, in some embodiments, the electronic device determines the picture content evaluation result of the video to be detected at least according to the element group attribute of the first element group, which specifically includes: the electronic device according to the first element group. The element group attribute of an element group determines the effect of the first element group on the content of the video image to be detected; the electronic device determines the effect of the first element group on the content of the video image to be detected at least according to the effect of the first element group. The results of the picture content evaluation of the video.
在上述实施例中,电子设备可以先根据第一元素组的元素组属性,确定该第一元素组对该待检测视频画面内容的影响结果,再根据该影响结果确定待检测视频的画面内容评估结果,能更清楚的确定每个元素组对视频画面内容的影响情况,使得对视频画面内容的评估结果更准确。In the above embodiment, the electronic device may first determine the effect of the first element group on the content of the video to be detected according to the element group attribute of the first element group, and then determine the evaluation of the image content of the video to be detected according to the effect. As a result, the influence of each element group on the content of the video picture can be determined more clearly, so that the evaluation result of the content of the video picture is more accurate.
结合第一方面的一些实施例,在一些实施例中,该电子设备根据该第一元素组的元素组属性,确定该第一元素组对该待检测视频画面内容的影响结果,具体包括:该电子设备根据该第一元素组的元素组属性和预设的元素组影响画面完整度规则,确定该第一元素组对该待检测视频画面内容的影响结果。With reference to some embodiments of the first aspect, in some embodiments, the electronic device determines, according to the element group attribute of the first element group, the impact result of the first element group on the video image content to be detected, specifically including: the The electronic device determines the influence result of the first element group on the content of the video image to be detected according to the element group attribute of the first element group and the preset element group influence picture integrity rule.
在上述实施例中,通过预设的元素组影响画面完整度规则可以快速、准确的判断元素组对待检测视频画面内容的影响结果,通过判断元素组是否对待检测视频画面内容有影响,可以筛选出待检测视频画面有影响的元素组参与后续计算,降低了后续计算的计算量。In the above embodiment, the effect of the element group on the content of the video image to be detected can be quickly and accurately determined through the preset rules for affecting the picture integrity of the element group. The element group that has an influence on the video image to be detected participates in the subsequent calculation, which reduces the calculation amount of the subsequent calculation.
结合第一方面的一些实施例,在一些实施例中,该电子设备至少根据该第一元素组对该待检测视频画面内容的影响结果,确定该待检测视频的画面内容评估结果,具体包括:该电子设备至少根据该第一元素组对该待检测视频画面内容的影响结果,确定该待检测视频是否完整;和/或,该电子设备至少根据该第一元素组对该待检测视频画面内容的影响结果和待检测视频的分类信息,确定待检测视频的评分;和/或,该电子设备至少根据该第一元素组对该待检测视频画面内容的影响结果和该第一元素组的元素组属性,确定对待检测视频的修改建议,其中包括对第一元素组的修改建议。With reference to some embodiments of the first aspect, in some embodiments, the electronic device determines the picture content evaluation result of the video to be detected at least according to the effect of the first element group on the picture content of the video to be detected, specifically including: The electronic device determines whether the to-be-detected video is complete at least according to the effect of the first element group on the to-be-detected video picture content; and/or, the electronic device determines the to-be-detected video picture content at least according to the first element group and/or, the electronic device at least according to the first element group's influence result on the to-be-detected video picture content and the elements of the first element group The group attribute determines the modification suggestion for the video to be detected, which includes the modification suggestion for the first element group.
在上述实施例中,视频平台通过确定该检测视频是否完整,可以将是否完整的结果反馈给用户或视频生产者,完成审核;视频平台通过确定该检测视频的评分,可以将评分的结果反馈给用户或视频生产者,有助于帮助用户或视频生产者更客观、可视化的了解该待检测视频的完整度;视频平台通过确定该待检测视频的修改建议,可以将修改建议作为一种具体的评估结果反馈给用户,有助于帮助用户或视频生产者根据修改建议进行针对性的修改,提升用户体验。In the above embodiment, by determining whether the detection video is complete, the video platform can feedback the complete result to the user or the video producer to complete the review; by determining the score of the detection video, the video platform can feed back the score result to Users or video producers can help users or video producers to understand the integrity of the video to be detected more objectively and visually; the video platform can use the modification suggestion as a specific modification suggestion by determining the modification suggestion of the video to be inspected. The evaluation results are fed back to users, which helps users or video producers to make targeted modifications based on modification suggestions to improve user experience.
结合第一方面的一些实施例,在一些实施例中,该电子设备至少根据该第一元素组的元素组属性,确定该待检测视频的画面内容评估结果,具体包括:该电子设备至少根据该第一元素组的元素组属性,确定该待检测视频是否完整;和/或,该电子设备至少根据该第一元素组的元素组属性和待检测视频的分类信息,确定待检测视频的评分;和/或,该电子设备至少根据该第一元素组的元素组属性,确定待检测视频的修改建议,其中包括对第一元素组的修改建议。With reference to some embodiments of the first aspect, in some embodiments, the electronic device determines the picture content evaluation result of the video to be detected at least according to the element group attribute of the first element group, which specifically includes: the electronic device at least according to the The element group attribute of the first element group determines whether the video to be detected is complete; and/or, the electronic device determines the score of the video to be detected at least according to the element group attribute of the first element group and the classification information of the video to be detected; And/or, the electronic device determines, at least according to the element group attribute of the first element group, a modification suggestion for the video to be detected, which includes a modification suggestion for the first element group.
在上述实施例中,视频平台通过确定该检测视频是否完整,可以将是否完整的结果反馈给用户或视频生产者,完成审核;视频平台通过确定该检测视频的评分,可以将评分的结果反馈给用户或视频生产者,有助于帮助用户或视频生产者更客观、可视化的连接该待检测视频的完整度;视频平台通过确定该待检测视频的修改建议,可以将修改建议作为一种具体的评估结果反馈给用户,有助于帮助用户或视频生产者根据修改建议进行针对性的修改,提升用户体验。In the above embodiment, by determining whether the detection video is complete, the video platform can feedback the complete result to the user or the video producer to complete the review; by determining the score of the detection video, the video platform can feed back the score result to Users or video producers can help users or video producers to connect the integrity of the video to be detected more objectively and visually; the video platform can use the modification suggestion as a specific modification suggestion by determining the modification suggestion of the video to be inspected. The evaluation results are fed back to users, which helps users or video producers to make targeted modifications based on modification suggestions to improve user experience.
结合第一方面的一些实施例,在一些实施例中,该第一元素组的修改建议,包括:通过、可不处理、剪裁、涂抹、删帧、模糊中的至少一种。With reference to some embodiments of the first aspect, in some embodiments, the modification suggestion of the first element group includes at least one of: pass, can not be processed, crop, smear, delete frame, and blur.
在上述实施例中,视频平台通过确定待检测视频中的元素组的修改建议,能够将该元素组的修改建议反馈给视频生产者或用户,使得视频生产者和用户得到有针对性、简明的、傻瓜式的修改建议,并可以依据该修改建议对元素组作出对应的修改,提升视频的完整度,提升了用户体验。In the above-mentioned embodiment, by determining the modification suggestion of the element group in the video to be detected, the video platform can feedback the modification suggestion of the element group to the video producer or user, so that the video producer and user can get targeted and concise , fool-like modification suggestions, and can make corresponding modifications to the element group according to the modification suggestions, improve the integrity of the video, and improve the user experience.
结合第一方面的一些实施例,在一些实施例中,该电子设备根据识别出的元素,确定该待检测视频中的元素组,其中包括第一元素组,具体包括:该电子设备将识别出的该至少两个图像帧中表示同一个基本要素的元素,组成元素组。With reference to some embodiments of the first aspect, in some embodiments, the electronic device determines, according to the identified elements, an element group in the video to be detected, which includes the first element group, and specifically includes: the electronic device will identify The elements representing the same basic element in the at least two image frames form an element group.
在上述实施例中,视频平台通过元素确定元素组,保留了元素在待检测视频中得时空完整性,有助于提高对待检测视频画面内容评估结果的准确性。In the above embodiment, the video platform determines the element group through the elements, which preserves the temporal and spatial integrity of the elements in the video to be detected, which helps to improve the accuracy of the content evaluation result of the video to be detected.
结合第一方面的一些实施例,在一些实施例中,该电子设备将识别出的该至少两个图像帧中表示同一个基本要素的元素,组成元素组,其中包括第一元素组,具体包括:该电子设备确定该第一图像帧中的第一元素与该第二图像帧中的第一元素的第一元素相似度,元素相似度用于表示元素图像之间的相似程度;该电子设备确定该第一元素相似度大于预设相似度阈值,将该第一图像帧中的第一元素与该第二图像帧中的第一元素组成第一元素组。With reference to some embodiments of the first aspect, in some embodiments, the electronic device will form elements representing the same basic element in the at least two image frames identified to form an element group, which includes the first element group, which specifically includes : the electronic device determines the first element similarity between the first element in the first image frame and the first element in the second image frame, and the element similarity is used to indicate the similarity between the element images; the electronic device It is determined that the similarity of the first element is greater than a preset similarity threshold, and the first element in the first image frame and the first element in the second image frame form a first element group.
在上述实施例中,视频平台通过元素确定元素组,保留了元素在待检测视频中得时空完整性。在待检测视频可能受到剪辑或者拍摄手法的影响的情况下,通过元素组而非元素作为视频评价方法的基本单元,能够有效反映待检测视频的画面内容完整度。In the above-mentioned embodiment, the video platform determines the element group through the element, which preserves the temporal and spatial integrity of the element in the video to be detected. In the case that the video to be detected may be affected by editing or shooting techniques, using the element group instead of the element as the basic unit of the video evaluation method can effectively reflect the completeness of the screen content of the video to be detected.
结合第一方面的一些实施例,在一些实施例中,该电子设备根据识别出的元素,确定该待检测视频中的元素组,其中包括第一元素组的步骤之后,该方法还包括:该电子设备计算该待检测视频中的元素组的完整度,该完整度用于表示元素组是否完整;该电子设备从该待检测视频中的元素组中筛选出残缺元素组,其中包括该第一元素组,该残缺元素组的完整度为不完整。With reference to some embodiments of the first aspect, in some embodiments, the electronic device determines, according to the identified elements, an element group in the video to be detected, and after the step of including the first element group, the method further includes: the The electronic device calculates the completeness of the element group in the video to be detected, and the completeness is used to indicate whether the element group is complete; the electronic device filters out the incomplete element group from the element group in the to-be-detected video, including the first Element group, the completeness of the incomplete element group is incomplete.
在上述实施例中,通过判断元素组的完整度,筛选完整度为不完整的元素组进行后续处理,降低了后续处理中元素组的数量,能够有效降低计算量。In the above-mentioned embodiment, by judging the completeness of the element group and screening the element group whose completeness is incomplete for subsequent processing, the number of element groups in the subsequent processing can be reduced, and the calculation amount can be effectively reduced.
结合第一方面的一些实施例,在一些实施例中,该电子设备根据识别出的元素,确定该待检测视频中的元素组,其中包括第一元素组的步骤之后,该方法还包括:该电子设备从该待检测视频中的元素组中筛选出疑似残缺元素组,其中包括该第一元素组,该疑似残缺元素组为位置与画面边缘的距离小于预设第二距离阈值的元素组。With reference to some embodiments of the first aspect, in some embodiments, the electronic device determines, according to the identified elements, an element group in the video to be detected, and after the step of including the first element group, the method further includes: the The electronic device filters out a suspected incomplete element group from the element group in the video to be detected, including the first element group, and the suspected incomplete element group is an element group whose position and the distance between the edge of the screen are smaller than a preset second distance threshold.
在上述实施例中,考虑到实际情况中,影响待检测视频画面内容完整度的元素往往出现在画面边缘位置,通过判断元素组的位置筛选元素组,降低了后续处理中元素组的数量,能够有效降低计算量。In the above embodiment, considering the actual situation, the elements that affect the integrity of the video image to be detected often appear at the edge of the image. By judging the position of the element group to filter the element group, the number of element groups in the subsequent processing is reduced, and the number of element groups in the subsequent processing can be reduced. Effectively reduce the amount of calculation.
结合第一方面的一些实施例,在一些实施例中,该电子设备从该待检测视频中的元素组中筛选出疑似残缺元素组步骤之后,该方法还包括:该电子设备计算该待检测视频中的疑似残缺元素组的完整度,该完整度用于表示元素组是否完整;该电子设备从该待检测视频中的疑似残缺元素组中筛选出残缺元素组,其中包括该第一元素组,该残缺元素组的完整度为不完整。With reference to some embodiments of the first aspect, in some embodiments, after the electronic device selects the suspected incomplete element group from the element group in the video to be detected, the method further includes: the electronic device calculates the video to be detected The completeness of the suspected incomplete element group in the video, which is used to indicate whether the element group is complete; the electronic device filters out the incomplete element group from the suspected incomplete element group in the video to be detected, including the first element group, The completeness of the incomplete element group is incomplete.
在上述实施例中,通过判断元素组的完整度,并且考虑到元素组完整度为完整的元素组并不会影响待检测视频的画面内容,筛选出元素组的完整度为不完整的元素组参与后续计算,降低了后续处理中元素组的数量,能够有效降低计算量。In the above embodiment, by judging the completeness of the element group, and considering that the element group whose completeness is complete will not affect the picture content of the video to be detected, the element group whose completeness is incomplete is screened out. Participating in subsequent calculations reduces the number of element groups in subsequent processing, which can effectively reduce the amount of calculation.
结合第一方面的一些实施例,在一些实施例中,该第一元素组的元素组属性还包括第二残缺参数值,该第二残缺参数值用于表示该第一元素组的残缺类型、持续时间、重要程度、周边元素组、时段类型中的至少一个。With reference to some embodiments of the first aspect, in some embodiments, the element group attribute of the first element group further includes a second incomplete parameter value, and the second incomplete parameter value is used to indicate the incomplete type of the first element group, At least one of duration, importance, surrounding element group, and period type.
在上述实施例中,通过确定元素组的元素组属性中的第二残缺参数值,得到元素组的更多信息,进而能够更准确的确定检测视频的画面内容评估结果。In the above embodiment, by determining the value of the second incomplete parameter in the element group attribute of the element group, more information of the element group can be obtained, and then the picture content evaluation result of the detected video can be more accurately determined.
结合第一方面的一些实施例,在一些实施例中,待检测视频的分类信息包括:待检测视频的主题和/或待检测视频的标签。With reference to some embodiments of the first aspect, in some embodiments, the classification information of the video to be detected includes: the subject of the video to be detected and/or the label of the video to be detected.
在上述实施例中,视频平台获取待检测视频的分类信息,并根据分类信息区分不同主题、标签的待检测视频,进而能够更有针对性的对待检测视频进行画面内容的评估,得到更准确的评估结果。In the above embodiment, the video platform obtains the classification information of the video to be detected, and differentiates the videos to be detected with different themes and labels according to the classification information, so that the video to be detected can be more targeted to evaluate the screen content, and a more accurate picture can be obtained. evaluation result.
结合第一方面的一些实施例,在一些实施例中,该电子设备识别待检测视频中至少两个图像帧中的元素的步骤之前,该方法还包括:该电子设备对原始视频抽帧,得到该待检测视频;或,该电子设备将该原始视频切割为N个该待检测视频;或,该电子设备将该原始视频切割为N个中间视频后,对该N个中间视频抽帧,得到N个该待检测视频;该N为正整数。With reference to some embodiments of the first aspect, in some embodiments, before the step of identifying the elements in at least two image frames in the video to be detected by the electronic device, the method further includes: the electronic device extracts frames from the original video to obtain the video to be detected; or, the electronic device cuts the original video into N videos to be detected; or, after the electronic device cuts the original video into N intermediate videos, extracts frames from the N intermediate videos to obtain N the videos to be detected; the N is a positive integer.
在上述实施例中,通过对原始视频进行抽帧操作,降低了待检测视频中图像帧的数量,进而降低了参与计算的元素组数量,故降低了计算量;通过对原始视频进行切割,切割后得到至少一段待检测视频,降低了计算量,同时,通过利用并行计算,降低了计算时间。In the above embodiment, by performing frame extraction operation on the original video, the number of image frames in the video to be detected is reduced, thereby reducing the number of element groups involved in the calculation, so the calculation amount is reduced; by cutting the original video, cutting Then, at least one video to be detected is obtained, which reduces the amount of calculation, and at the same time, reduces the calculation time by using parallel computing.
第二方面,本申请实施例提供了一种电子设备,该电子设备包括:一个或多个处理器和存储器;该存储器与该一个或多个处理器耦合,该存储器用于存储计算机程序代码,该计算机程序代码包括计算机指令,该一个或多个处理器调用该计算机指令以使得该电子设备执行:识别待检测视频中至少两个图像帧中的元素,该至少两个图像帧包括第一图像帧和第二图像帧,该元素为该图像帧中的基本要素;根据识别出的元素,确定该待检测视频中的元素组,其中包括第一元素组,该第一元素组中包含该第一图像帧中的第一元素和第二图像帧中的第一元素;确定该第一元素组的元素组属性,该第一元素组的元素组属性包括第一残缺参数值,该第一残缺参数值用于表示该第一元素组的残缺程度;至少根据该第一元素组的元素组属性,确定该待检测视频的画面内容评估结果。In a second aspect, an embodiment of the present application provides an electronic device, the electronic device includes: one or more processors and a memory; the memory is coupled to the one or more processors, and the memory is used to store computer program codes, The computer program code includes computer instructions invoked by the one or more processors to cause the electronic device to perform: identify elements in at least two image frames in the video to be detected, the at least two image frames including the first image frame and the second image frame, the element is the basic element in the image frame; according to the identified element, determine the element group in the video to be detected, including the first element group, the first element group includes the first element group the first element in an image frame and the first element in the second image frame; determine the element group attribute of the first element group, the element group attribute of the first element group includes the first incomplete parameter value, the first incomplete The parameter value is used to indicate the degree of incompleteness of the first element group; at least according to the element group attribute of the first element group, the picture content evaluation result of the video to be detected is determined.
在上述实施例中,电子设备通过自动获取待检测视频中的元素组,并确定元素组的元素组属性,可以根据元素组的元素组属性确定待检测视频的画面内容评估结果。不需要人工干预就能直接得到对待检测视频的评估结果,使得视频平台可以自动的审核待检测视频,提升了视频平台的审核效率。并且电子设备可以是联网的服务器或者是用户或视频生产者的电脑,便于用户或视频生产者随时随地的评估待检测视频,提升了用户体验。In the above embodiment, the electronic device can automatically obtain the element group in the video to be detected and determine the element group attribute of the element group, and can determine the picture content evaluation result of the video to be detected according to the element group attribute of the element group. The evaluation result of the video to be detected can be obtained directly without manual intervention, so that the video platform can automatically review the video to be detected, which improves the review efficiency of the video platform. In addition, the electronic device can be a networked server or a computer of a user or a video producer, which facilitates the user or video producer to evaluate the video to be inspected anytime and anywhere, and improves the user experience.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:根据该第一元素组的元素组属性,确定该第一元素组对该待检测视频画面内容的影响结果;至少根据该第一元素组对该待检测视频画面内容的影响结果,确定该待检测视频的画面内容评估结果。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instruction to cause the electronic device to execute: determine the element group according to the element group attribute of the first element group The influence result of the first element group on the video picture content to be detected; at least according to the influence result of the first element group on the to-be-detected video picture content, determine the picture content evaluation result of the to-be-detected video.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:根据该第一元素组的元素组属性和预设的元素组影响画面完整度规则,确定该第一元素组对该待检测视频画面内容的影响结果。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: according to the element group attributes and presets of the first element group The first element group affects the picture integrity rule, and the effect of the first element group on the content of the video picture to be detected is determined.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:至少根据该第一元素组对该待检测视频画面内容的影响结果,确定该待检测视频是否完整;和/或,至少根据该第一元素组对该待检测视频画面内容的影响结果和待检测视频的分类信息,确定待检测视频的评分;和/或,至少根据该第一元素组对该待检测视频画面内容的影响结果和该第一元素组的元素组属性,确定对待检测视频的修改建议,其中包括对第一元素组的修改建议。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: at least the video frame to be detected according to the first element group The influence result of the content, to determine whether the video to be detected is complete; and/or, at least according to the influence result of the first element group on the content of the video to be detected and the classification information of the video to be detected, determine the score of the video to be detected; and /or, at least according to the effect of the first element group on the content of the video to be detected and the element group attribute of the first element group, determine the modification suggestion of the to-be-detected video, which includes the modification suggestion for the first element group.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:至少根据该第一元素组的元素组属性,确定该待检测视频是否完整;和/或,至少根据该第一元素组的元素组属性和待检测视频的分类信息,确定待检测视频的评分;和/或,至少根据该第一元素组的元素组属性,确定对待检测视频的修改建议,其中包括对第一元素组的修改建议。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instruction to cause the electronic device to execute: at least according to the element group attribute of the first element group, determine Whether the video to be detected is complete; and/or, at least according to the element group attribute of the first element group and the classification information of the video to be detected, determine the score of the video to be detected; and/or, at least according to the elements of the first element group The group attribute determines the modification suggestion for the video to be detected, which includes the modification suggestion for the first element group.
结合第二方面的一些实施例,在一些实施例中,该第一元素组的修改建议,包括:通过、可不处理、剪裁、涂抹、删帧、模糊中的至少一种。With reference to some embodiments of the second aspect, in some embodiments, the modification suggestion of the first element group includes at least one of: pass, can not be processed, crop, smear, delete frame, and blur.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:将识别出的该至少两个图像帧中表示同一个基本要素的元素,组成元素组。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: The elements of a base element that make up an element group.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:确定该第一图像帧中的第一元素与该第二图像帧中的第一元素的第一元素相似度,元素相似度用于表示元素图像之间的相似程度;确定该第一元素相似度大于预设相似度阈值,将该第一图像帧中的第一元素与该第二图像帧中的第一元素组成第一元素组。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to perform: determine the relationship between the first element in the first image frame and the The first element similarity of the first element in the second image frame, the element similarity is used to indicate the similarity between the element images; it is determined that the first element similarity is greater than the preset similarity threshold, and the first image frame The first element in and the first element in the second image frame form a first element group.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:确定第一位置与第二位置的距离小于预设第一距离阈值;该第一位置为该第一图像帧中的第一元素在该第一图像帧中的位置,该第二位置为该第二图像帧中的第一元素在该第二图像帧中的位置。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are further configured to invoke the computer instructions to cause the electronic device to execute: determine that the distance between the first position and the second position is less than a preset The first distance threshold; the first position is the position of the first element in the first image frame in the first image frame, and the second position is the first element in the second image frame in the second image position in the frame.
结合第二方面的一些实施例,在一些实施例中,其特征在于,该一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:计算该待检测视频中的元素组的完整度,该完整度用于表示元素组是否完整;从该待检测视频中的元素组中筛选出残缺元素组,其中包括该第一元素组,该残缺元素组的完整度为不完整。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are further configured to invoke the computer instruction to cause the electronic device to execute: calculate the element group in the video to be detected The completeness is used to indicate whether the element group is complete; the incomplete element group is screened from the element group in the video to be detected, including the first element group, and the completeness of the incomplete element group is incomplete.
结合第二方面的一些实施例,在一些实施例中,所述一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:从该待检测视频中的元素组中筛选出疑似残缺元素组,其中包括该第一元素组,该疑似残缺元素组为位置与画面边缘的距离小于预设第二距离阈值的元素组。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are further configured to invoke the computer instructions to cause the electronic device to perform: filter out the element group in the video to be detected The suspected incomplete element group includes the first element group, and the suspected incomplete element group is an element group whose distance between the position and the edge of the screen is less than a preset second distance threshold.
结合第二方面的一些实施例,在一些实施例中,所述一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:计算该待检测视频中的疑似残缺元素组的完整度,该完整度用于表示元素组是否完整;从该待检测视频中的疑似残缺元素组中筛选出残缺元素组,其中包括该第一元素组,该残缺元素组的完整度为不完整。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are further configured to invoke the computer instructions to cause the electronic device to perform: calculating the value of the suspected incomplete element group in the video to be detected Integrity, which is used to indicate whether the element group is complete; screen out the incomplete element group from the suspected incomplete element group in the video to be detected, including the first element group, and the integrity of the incomplete element group is incomplete .
结合第二方面的一些实施例,在一些实施例中,该第一元素组的元素组属性还包括第二残缺参数值,该第二残缺参数值用于表示该第一元素组的残缺类型、持续时间、重要程度、周边元素组、时段类型中的至少一个。With reference to some embodiments of the second aspect, in some embodiments, the element group attribute of the first element group further includes a second incompleteness parameter value, and the second incompleteness parameter value is used to represent the incompleteness type of the first element group, At least one of duration, importance, surrounding element group, and period type.
结合第二方面的一些实施例,在一些实施例中,待检测视频的分类信息包括:待检测视频的主题和/或待检测视频的标签。With reference to some embodiments of the second aspect, in some embodiments, the classification information of the video to be detected includes: the subject of the video to be detected and/or the label of the video to be detected.
结合第二方面的一些实施例,在一些实施例中,该一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:对原始视频抽帧,得到该待检测视频;或,将该原始视频切割为N个该待检测视频;或,将该原始视频切割为N个中间视频后,对该N个中间视频抽帧,得到N个该待检测视频;该N为正整数。With reference to some embodiments of the second aspect, in some embodiments, the one or more processors are further configured to invoke the computer instruction to cause the electronic device to perform: extract frames from the original video to obtain the video to be detected; or , cutting the original video into N videos to be detected; or, after cutting the original video into N intermediate videos, extracting frames from the N intermediate videos to obtain N videos to be detected; N is a positive integer .
第三方面,本申请实施例提供了一种芯片系统,该芯片系统应用于电子设备,该芯片系统包括一个或多个处理器,该处理器用于调用计算机指令以使得该电子设备执行如第一方面以及第一方面中任一可能的实现方式描述的方法。In a third aspect, an embodiment of the present application provides a chip system, the chip system is applied to an electronic device, the chip system includes one or more processors, and the processors are configured to invoke computer instructions to cause the electronic device to perform the first Aspects and methods described in any possible implementation of the first aspect.
第四方面,本申请实施例提供一种包含指令的计算机程序产品,当上述计算机程序产品在电子设备上运行时,使得上述电子设备执行如第一方面以及第一方面中任一可能的实现方式描述的方法。In a fourth aspect, an embodiment of the present application provides a computer program product containing instructions, when the computer program product is run on an electronic device, the electronic device is made to perform any of the first aspect and any possible implementation manner of the first aspect. method described.
第五方面,本申请实施例提供一种计算机可读存储介质,包括指令,当上述指令在电子设备上运行时,使得上述电子设备执行如第一方面以及第一方面中任一可能的实现方式描述的方法。In a fifth aspect, an embodiment of the present application provides a computer-readable storage medium, including instructions, when the above-mentioned instructions are executed on an electronic device, the above-mentioned electronic device is made to execute the first aspect and any possible implementation manner of the first aspect method described.
可以理解地,上述第二方面提供的电子设备、第三方面提供的芯片系统、第四方面提供的计算机程序产品和第五方面提供的计算机存储介质均用于执行本申请实施例所提供的方法。因此,其所能达到的有益效果可参考对应方法中的有益效果,此处不再赘述。It can be understood that the electronic device provided in the second aspect, the chip system provided in the third aspect, the computer program product provided in the fourth aspect, and the computer storage medium provided in the fifth aspect are all used to execute the methods provided by the embodiments of the present application. . Therefore, for the beneficial effects that can be achieved, reference may be made to the beneficial effects in the corresponding method, which will not be repeated here.
附图说明Description of drawings
图1示出了现有技术中人工审核视频的场景;Fig. 1 shows the scene of manual review video in the prior art;
图2至图6为本申请实施例中一组示例性用户界面示意图;2 to 6 are schematic diagrams of a group of exemplary user interfaces in the embodiments of the present application;
图7为本申请实施例中视频画面内容完整度评估方法的一个流程示意图;FIG. 7 is a schematic flowchart of a method for evaluating video image content integrity in an embodiment of the present application;
图8为本申请实施例中视频切割的一个示例性示意图;FIG. 8 is an exemplary schematic diagram of video cutting in an embodiment of the present application;
图9为本申请实施例中非自适应抽帧方法的一个示例性示意图;FIG. 9 is an exemplary schematic diagram of a non-adaptive frame extraction method in an embodiment of the present application;
图10为本申请实施例中自适应抽帧方法的一个示例性示意图;10 is an exemplary schematic diagram of an adaptive frame extraction method in an embodiment of the present application;
图11为本申请实施例中原始视频被切割并被抽帧的一个示例性示意图;11 is an exemplary schematic diagram of the original video being cut and framed in the embodiment of the application;
图12和图13为本申请实施例中在单个图像帧中进行元素检测的一个示例性示意图;FIG. 12 and FIG. 13 are an exemplary schematic diagram of element detection in a single image frame in an embodiment of the present application;
图14为本申请实施例中获取元素组的一个示例性场景示意图;14 is a schematic diagram of an exemplary scenario for acquiring an element group in an embodiment of the present application;
图15为本申请实施例中获取元素组的另一个示例性场景示意图;15 is a schematic diagram of another exemplary scenario for acquiring an element group in an embodiment of the present application;
图16为本申请实施例中一个元素组完整度判别结果的示例性示意图;FIG. 16 is an exemplary schematic diagram of a completeness judgment result of an element group in an embodiment of the present application;
图17为本申请实施例中元素组影响画面完整度规则的一个示例性结构示意图;17 is an exemplary schematic structural diagram of an element group affecting a picture integrity rule in an embodiment of the present application;
图18为本申请实施例中元素组对画面完整度影响结果的一个示例性示意图;FIG. 18 is an exemplary schematic diagram of the effect of element groups on picture integrity in an embodiment of the present application;
图19为本申请实施例中BP神经网络模型的一个示例性示意图;19 is an exemplary schematic diagram of a BP neural network model in an embodiment of the application;
图20为本申请实施例中修改建议生成模型的一个架构示意图;FIG. 20 is a schematic diagram of the architecture of the modification suggestion generation model in the embodiment of the present application;
图21为本申请实施例中修改建议生成模型输出的原始视频画面修改建议的一个示意图;21 is a schematic diagram of a modification suggestion of an original video picture output by a modification suggestion generation model in an embodiment of the present application;
图22为本申请实施例提供的电子设备100的一个结构示意图;FIG. 22 is a schematic structural diagram of an electronic device 100 provided by an embodiment of the present application;
图23为本发明实施例中电子设备100的一个软件结构示意框图;23 is a schematic block diagram of a software structure of the electronic device 100 in the embodiment of the present invention;
图24为本申请实施例提供的电子设备200的另一个硬件结构示意图。FIG. 24 is another schematic diagram of the hardware structure of the
具体实施方式Detailed ways
本申请以下实施例中所使用的术语只是为了描述特定实施例的目的,而并非旨在作为对本申请的限制。如在本申请的说明书和所附权利要求书中所使用的那样,单数表达形式“一个”、“一种”、“该”、“上述”、“该”和“这一”旨在也包括复数表达形式,除非其上下文中明确地有相反指示。还应当理解,本申请中使用的术语“和/或”是指并包含一个或多个所列出项目的任何或所有可能组合。The terms used in the following embodiments of the present application are only for the purpose of describing specific embodiments, and are not intended to be used as limitations of the present application. As used in the specification of this application and the appended claims, the singular expressions "a," "an," "the," "above," "the," and "the" are intended to also include Plural expressions unless the context clearly dictates otherwise. It will also be understood that, as used in this application, the term "and/or" refers to and includes any and all possible combinations of one or more of the listed items.
以下,术语“第一”、“第二”仅用于描述目的,而不能理解为暗示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征,在本申请实施例的描述中,除非另有说明,“多个”的含义是两个或两个以上。Hereinafter, the terms "first" and "second" are only used for descriptive purposes, and should not be construed as implying or implying relative importance or implying the number of indicated technical features. Therefore, the features defined as "first" and "second" may explicitly or implicitly include one or more of the features. In the description of the embodiments of the present application, unless otherwise specified, the "multiple" The meaning is two or more.
为了便于理解,下面先对本申请实施例涉及的相关术语及相关概念进行介绍:For ease of understanding, related terms and related concepts involved in the embodiments of the present application are first introduced below:
(1)元素和元素类型:(1) Elements and element types:
本申请实施例中,元素为图像帧中可承载或传递视觉信息的基本要素。图像帧为组成视频的最小单位。In this embodiment of the present application, an element is a basic element in an image frame that can carry or transmit visual information. An image frame is the smallest unit that makes up a video.
本申请的一些实施例中,元素可以包括图像帧呈现画面中的主体、陪体、文字等。其中,主体为视频生产者或用户在拍摄过程中关注的主要对象,一般情况下,主体多为单一的一个对象或一组对象,例如主体可以为人、动物、植物、甚至是抽象的对象;陪体为视频生产者或用户在拍摄过程中次要关注的对象,陪体为用于陪衬的物体,一般情况下,陪体多为多个独立的对象,例如陪体可以是生活物品、建筑、艺术品、名胜古迹等。In some embodiments of the present application, the elements may include the main body, accompanying body, text, etc. in the image frame presentation screen. Among them, the subject is the main object that the video producer or the user pays attention to during the shooting process. In general, the subject is mostly a single object or a group of objects. For example, the subject can be a person, an animal, a plant, or even an abstract object; The body is the secondary focus of the video producer or user during the shooting process, and the body is the object used as a foil. In general, the body is mostly multiple independent objects. For example, the body can be living objects, buildings, Art, places of interest, etc.
本申请实施例中,元素类型用于区分传递不同类型视觉信息的元素分类。In this embodiment of the present application, the element type is used to distinguish the classification of elements conveying different types of visual information.
本申请的一些实施例中,元素类型可以包括人(肤色识别如白种人、黄种人、黑种人)、人脸(识别男、女、老、少等)、文字、其他后期文字、字幕、版权标志、台标、水印、植物类主体(识别植物品种如蒲公英、菊花等)、动物类主体(识别具体的动物品种如狗、猫、骆驼等)、风景类主体等。In some embodiments of the present application, the element types may include people (skin color recognition such as Caucasian, yellow, black), human faces (male, female, old, young, etc.), text, other later texts, subtitles , copyright signs, station logos, watermarks, plant subjects (identifying plant species such as dandelions, chrysanthemums, etc.), animal subjects (identifying specific animal species such as dogs, cats, camels, etc.), landscape subjects, etc.
示例性的,若A元素的元素类型为文字,表示A元素为被拍摄的元素;若B元素为字幕,表示B元素为视频拍摄完成后,后期添加的文字字幕,其位置多位于视频的底部且居中;若C元素为其他后期文字,表示C元素为视频拍摄完成后,后期添加的文字,但位置与字幕位置不同。Exemplarily, if the element type of the A element is text, it means that the A element is the element to be photographed; if the B element is a subtitle, it means that the B element is a text subtitle added later after the video is shot, and its position is mostly located at the bottom of the video. And centered; if the C element is other post-processing text, it means that the C element is the text added later after the video is shot, but the position is different from the subtitle position.
具体的,在本申请实施例中,通过对图像帧进行数据处理,或者通过对图像帧呈现的画面进行图像处理,例如元素检测等,可以得到图像帧内所包含的元素。示例性的,如图12至图13所示,对图像帧呈现的画面进行图像处理(元素检测),得到该图像帧呈现的画面中包含的元素,其中检测到的元素包括:1301文字1、1302图标1、1303动物1、1304食物1、1305文字2。同时可以识别出各个检测到元素所属的元素类型,识别结果包括:1301文字1属于其他后期文字,1302图标属于版权标志,1303动物1属于动物(狗),1304食物1属于食物,1305文字2属于字幕。Specifically, in the embodiment of the present application, the elements contained in the image frame can be obtained by performing data processing on the image frame, or performing image processing on the picture presented by the image frame, such as element detection. Exemplarily, as shown in Figures 12 to 13, image processing (element detection) is performed on the picture presented by the image frame to obtain the elements contained in the picture presented by the image frame, wherein the detected elements include: 1301
在得到图像帧内所包含的元素时,可以得到元素的元素属性。元素的元素属性用于量化元素之间的关系、量化元素与视频画面的关系等。元素的元素属性可以包括很多参数,例如元素的位置、元素的残缺程度等。When the elements contained in the image frame are obtained, the element attributes of the elements can be obtained. The element attribute of an element is used to quantify the relationship between the elements, the relationship between the element and the video picture, and the like. The element attribute of an element can include many parameters, such as the position of the element, the degree of incompleteness of the element, and so on.
(2)元素组:(2) Element group:
2.1、元素组的定义:2.1. Definition of element group:
本申请实施例中,视频的多个图像帧中的相同或同一元素可以组成为一个元素组。可以理解的是,元素组的类型与元素组中元素的元素类型相同。In this embodiment of the present application, the same or the same elements in multiple image frames of the video may be formed into an element group. It is understood that the type of the element group is the same as the element type of the elements in the element group.
视频生产者或用户在拍摄视频的情况下,或视频生产者或用户对拍摄的视频进行后期处理情况下,视频受蒙太奇手法、运镜方式、转场等拍摄或剪辑手法的影响,使得仅从元素的完整度这一角度分析并不能有效反映视频的画面内容完整度。考虑到元素在视频中的时空完整度属性,定义元素组。When a video producer or user shoots a video, or when the video producer or user performs post-processing on the captured video, the video is affected by montage, mirror movement, transition and other shooting or editing techniques, so that only the The angle analysis of the completeness of elements cannot effectively reflect the completeness of the video content. Element groups are defined taking into account the spatiotemporal integrity properties of elements in the video.
其中,元素组包括:首先存在时序关系的若干图像帧,其中若干图像帧可以是连续的,也可以是不连续的;其次在上述若干图像帧中的至少两个图像帧中存在同一元素D,其中元素D可以是完整的,也可以是不完整的;最后,D-Group元素组为至少两个图像帧中D元素的集合。Wherein, the element group includes: firstly, there are several image frames in a time sequence relationship, wherein several image frames may be continuous or discontinuous; secondly, the same element D exists in at least two image frames in the above-mentioned several image frames, The element D may be complete or incomplete; finally, the D-Group element group is a set of D elements in at least two image frames.
2.2、元素组属性:2.2, element group attributes:
元素组的元素组属性中包含用于表示元素组特征的参数与相应参数的参数值的对应关系。例如,元素组的元素组属性可以包括很多参数:元素组占据画面的面积、元素组的位置、元素组的残缺比例、元素组的残缺类型、元素组的持续时间、元素组的重要程度、元素组的周边元素组、元素组的时段类型等。The element group attribute of the element group contains the corresponding relationship between the parameters used to represent the characteristics of the element group and the parameter values of the corresponding parameters. For example, the element group attribute of the element group can include many parameters: the area occupied by the element group, the position of the element group, the incomplete ratio of the element group, the incomplete type of the element group, the duration of the element group, the importance of the element group, the element group The surrounding element group of the group, the period type of the element group, etc.
示例性的,如下表1所示,为本申请实施例中元素组属性的一个示意性示例:Exemplarily, as shown in Table 1 below, it is a schematic example of an element group attribute in this embodiment of the present application:
表1Table 1
如表1所示,该第一元素组的元素组属性中包括残缺比例参数值为1/2、持续时间参数值为10秒。As shown in Table 1, the element group attribute of the first element group includes a parameter value of incompleteness ratio of 1/2 and a parameter value of duration of 10 seconds.
可以理解的是,元素组的元素组属性还可以有很多其他的表示方式,例如数组、矩阵等,此处不作限定。It can be understood that the element group attribute of the element group may also have many other representations, such as an array, a matrix, etc., which are not limited here.
元素组的元素组属性与元素组中元素的元素属性有关。元素组的元素组属性用于量化元素组之间的关系、量化元素组与视频画面的关系等。The element group attribute of an element group is related to the element attributes of the elements in the element group. The element group attribute of the element group is used to quantify the relationship between the element groups, the relationship between the element group and the video picture, and the like.
在本申请一些实施例中,元素组属性中包含的参数可以与预设的元素组影响画面完整度规则中参数相同。电子设备可以通过比对一个元素组的元素组属性中参数的参数值与该预设的元素组影响画面完整度规则中参数的阈值或参考值,来确定该元素组是否对视频画面内容完整度有影响。具体可以参考下述(4)预设的元素组影响画面完整度中的描述,此处不作赘述。In some embodiments of the present application, the parameters included in the attribute of the element group may be the same as the parameters in the preset rule for affecting the picture integrity of the element group. The electronic device can determine whether the element group contributes to the content integrity of the video image by comparing the parameter value of the parameter in the element group attribute of an element group with the threshold or reference value of the parameter in the preset element group affecting picture integrity rule. influential. For details, reference may be made to the description in (4) Preset element group affects picture integrity, which will not be repeated here.
2.3、元素组的确定:2.3. Determination of element group:
元素组的元素组类型与元素组中元素的元素类型相同,具有相同的意思表达。The element group type of the element group is the same as the element type of the elements in the element group, and has the same meaning expression.
本申请实施例中,从至少两个图像帧中获取同一元素D,并将不同图像帧中的同一元素D组成D-Group元素组的方式可以有很多种,例如利用元素相似度获取元素组、通过聚类算法获取元素组等方式。下面以使用元素相似度获取元素组为例,介绍获取元素组的方式:In the embodiment of the present application, there may be many ways to obtain the same element D from at least two image frames, and form the same element D in different image frames into a D-Group element group, for example, using element similarity to obtain the element group, Obtain element groups through clustering algorithms, etc. The following uses the element similarity to obtain the element group as an example to introduce the method of obtaining the element group:
使用元素相似度获取元素组:通过图像处理算法或数据处理算法提取所有图像帧中所有元素的图像特征。任意选取一元素作为基准元素,计算该基准元素的图像特征与被比较元素的图像特征之间的相似度。当相似度超过预设阈值时,认为该基准元素与被比较元素为同一元素。其中,元素相似度为:基准元素与被比较元素之间的图像特征之间的距离,其中该距离有多种表现形式,包括余弦距离、欧式距离、曼哈顿距离等。Obtain element groups using element similarity: Extract image features of all elements in all image frames through image processing algorithms or data processing algorithms. An element is arbitrarily selected as the reference element, and the similarity between the image feature of the reference element and the image feature of the compared element is calculated. When the similarity exceeds a preset threshold, the reference element and the compared element are considered to be the same element. Among them, the element similarity is: the distance between the image features between the reference element and the compared element, where the distance has various forms, including cosine distance, Euclidean distance, Manhattan distance, etc.
本申请实施例中,选取被比较元素的方式有很多种。例如,被比较元素的选取方式可以为:遍历选取与基准元素所在图像帧不同的图像帧中的元素作为被比较元素;或者遍历选取与基准元素所在图像帧不同的图像帧中的,并且与基准元素的元素类型相同的元素作为被比较元素;或者遍历选取与基准元素所在图像帧不同的图像帧中的、与基准元素的元素类型相同的元素和与基准元素位置相近的元素作为被比较元素。In the embodiments of the present application, there are many ways to select the elements to be compared. For example, the selection method of the compared element may be: traversing and selecting elements in an image frame different from the image frame where the reference element is located as the compared element; or traversing and selecting elements in an image frame different from the image frame where the reference element is located, The element with the same element type is used as the element to be compared; or the element with the same element type as the reference element and the element with the position close to the reference element in the image frame different from the reference element are selected by traversal as the element to be compared.
判断基准元素与被比较元素位置相近:预先设有第一距离阈值,根据元素检测的结果,可以得知基准元素的位置为第一元素位置,被比较元素的位置为第二元素位置,计算第一元素位置与第二元素的位置距离是否小于预设的第一距离阈值,若该位置距离小于预设的第一距离阈值,则认为基准元素与被比较元素位置相近。第一距离阈值的数值可以与视频的帧率等信息有关。Judging that the position of the reference element is similar to that of the element being compared: a first distance threshold is preset. According to the result of element detection, it can be known that the position of the reference element is the position of the first element, and the position of the element being compared is the position of the second element. Whether the distance between the position of an element and the second element is less than the preset first distance threshold, if the position distance is less than the preset first distance threshold, it is considered that the reference element and the compared element are close in position. The value of the first distance threshold may be related to information such as the frame rate of the video.
示例性的,如图14所示,现有三个图像帧,包括第一图像帧、第二图像帧、第三图像帧。选取第一图像帧中的元素1401A作为基准元素,选取第二图像帧和第三图像帧中的所有元素作为被比较元素,计算基准元素与每一个被比较元素的相似度。在元素相似度大于预设元素相似度阈值0.85时,认为基准元素1401A与被比较元素为同一元素,即基准元素1401A与被比较元素属于同一元素组。其中,第二图像帧的1401B元素的与基准元素1401A的元素相似度为0.95、第三图像帧的1401C元素与基准元素1401A的元素相似度为0.94,由于1401B元素和1401C元素与基准元素1401A的元素相似度大于0.85,故1401A、1401B、1401C组成元素组,该元素组可以记为1401-Group。Exemplarily, as shown in FIG. 14 , there are three image frames, including a first image frame, a second image frame, and a third image frame. The
(3)元素组的完整度:(3) The completeness of the element group:
本申请实施例中,元素组的完整度用于表示元素组是否完整。元素组的完整度的取值可以为二元取值,包括完整、不完整。In this embodiment of the present application, the completeness of the element group is used to indicate whether the element group is complete. The value of the completeness of the element group can be a binary value, including complete and incomplete.
本申请实施中,可以通过多种方式来确定元素组的完整度,例如使用比例关系,使用投影特征等,此处不作限定。In the implementation of this application, the completeness of the element group may be determined in various ways, for example, using a proportional relationship, using a projection feature, etc., which are not limited here.
示例性的,作为本申请中的一个实施例,下面对使用比例关系结合投影特征确定元素组的完整度的一种可选方式进行描述:Exemplarily, as an embodiment of the present application, an optional manner for determining the completeness of an element group using a proportional relationship combined with a projection feature is described below:
不同元素组类型的元素组可以对应不同的预设标准比例区间和预设疑似标准比例区间。其中,预设疑似标准比例区间包含该预设标准比例区间。例如,元素组D-Group的元素组类型为Type-D,其对应的预设标准比例区间为[0.8至1.2],其对应的预设疑似标准比例区间为[0.3至3]。Element groups of different element group types may correspond to different preset standard scale intervals and preset suspected standard scale intervals. Wherein, the preset suspected standard proportion interval includes the preset standard proportion interval. For example, the element group type of the element group D-Group is Type-D, the corresponding preset standard scale interval is [0.8 to 1.2], and the corresponding preset suspected standard scale interval is [0.3 to 3].
当一个元素组的宽高比或高宽比超出预设疑似标准比例区间时,可以确定该元素组的完整度为不完整。当一个元素组的宽高比或高宽比在该预设标准比例区间内时,可以确定该元素组的完整度为完整。When the aspect ratio or the aspect ratio of an element group exceeds the preset suspected standard ratio range, it may be determined that the completeness of the element group is incomplete. When the aspect ratio or the aspect ratio of an element group is within the preset standard ratio range, it can be determined that the completeness of the element group is complete.
当一个元素组的宽高比或高宽比超出预设标准比例区间,但是在预设疑似标准比例区间内时,可以计算该元素组中各元素的投影特征,并对投影特征进行统计。再根据统计结果确定该元素组的完整度。例如,若元素组D-Group的宽高比为1.5,超出了预设标准比例区间[0.8至1.2],但是在预设疑似标准比例区间[0.3至3]内,则可以计算元素组D-Group中每个元素的投影特征,并计算元素组D-Group的投影特征分布曲线。通过计算D-Group的投影特征分布曲线与预设的阈值曲线的距离,当距离大于阈值距离时,认为该元素组D-Group不完整。其中距离有多种表现形式,包括欧式距离、曼哈顿距离等。When the aspect ratio or aspect ratio of an element group exceeds the preset standard scale interval, but is within the preset suspected standard scale interval, the projection characteristics of each element in the element group can be calculated, and the projection characteristics can be counted. Then, the completeness of the element group is determined according to the statistical results. For example, if the aspect ratio of the element group D-Group is 1.5, which is outside the preset standard ratio range [0.8 to 1.2], but within the preset suspected standard ratio range [0.3 to 3], the element group D-Group can be calculated. The projected feature of each element in the Group, and the projected feature distribution curve of the element group D-Group is calculated. By calculating the distance between the projected characteristic distribution curve of the D-Group and the preset threshold curve, when the distance is greater than the threshold distance, the element group D-Group is considered to be incomplete. There are many forms of distance, including Euclidean distance, Manhattan distance, etc.
特征分布曲线可以用直方图统计的形式进行表征。预设的阈值曲线对应于为完整“几”字形,即在元素组D-Group的投影特征的直方图为完整“几”字形的情况下,则可以确定该元素组D-Group的完整度为完整;若在元素组D-Group的投影特征的直方图中“几”字形缺口超出预设统计阈值的情况下,则可以确定该元素组D-Group的完整度为不完整。The characteristic distribution curve can be characterized in the form of histogram statistics. The preset threshold curve corresponds to a complete "ji" glyph, that is, when the histogram of the projection feature of the element group D-Group is a complete "ji" glyph, the integrity of the element group D-Group can be determined as: Complete; if the “several” glyph gap in the histogram of the projected feature of the element group D-Group exceeds the preset statistical threshold, it can be determined that the completeness of the element group D-Group is incomplete.
在本申请的一些实施例中,在确定元素组的完整度的过程中,可以直接得到元素组的残缺程度的相关参数(例如残缺比例、残缺类型)。在本申请的一些实施例中,也可以单独确定元素组的残缺程度,此次不作限定。In some embodiments of the present application, in the process of determining the completeness of the element group, parameters related to the degree of incompleteness of the element group (for example, the proportion of incompleteness, the type of incompleteness) can be directly obtained. In some embodiments of the present application, the degree of incompleteness of the element group may also be determined independently, which is not limited this time.
本申请的一些实施例中,可以根据一个元素组的完整度,确定是否继续对该元素组是否影响画面完整度进行判断。若一个元素组的完整度为完整,则可以直接认为该元素组不影响画面完整度;若一个元素组的完整度为不完整,则可以继续判断该元素组是否影响画面完整度,即确定该元素组对视频画面内容的影响结果。判断该元素组是否影响画面完整度的方式具体可以参考下述(4)预设的元素组影响画面完整度规则中的描述,此处不作赘述。In some embodiments of the present application, it may be determined according to the integrity of an element group whether to continue to judge whether the element group affects the picture integrity. If the integrity of an element group is complete, it can be directly considered that the element group does not affect the integrity of the picture; if the integrity of an element group is incomplete, it can continue to judge whether the element group affects the integrity of the picture. The result of the effect of the element group on the content of the video screen. For a specific way of judging whether the element group affects the picture integrity, reference may be made to the description in the following (4) Preset rules of the element group affecting picture integrity, which will not be repeated here.
本申请的一些实施例中,可以先根据元素组在图像帧呈现的画面中的位置,将位于画面边缘的元素组筛选为疑似不完整元素组,再确定这些疑似不完整元素组的完整度。本申请的一些实施例中,可以直接根据元素组在图像帧呈现的画面中的位置,直接将位于画面边缘的元素组的完整度确定为不完整。本申请的一些实施例中,也可以直接对所有的元素组都直接确定其完整度。此处不作限定。In some embodiments of the present application, element groups located at the edge of the picture may be screened into suspected incomplete element groups according to the positions of the element groups in the picture presented by the image frame, and then the completeness of these suspected incomplete element groups may be determined. In some embodiments of the present application, the completeness of the element group located at the edge of the picture may be directly determined as incomplete according to the position of the element group in the picture presented by the image frame. In some embodiments of the present application, the completeness of all element groups may also be directly determined directly. There is no limitation here.
(4)预设的元素组影响画面完整度规则:(4) The preset element group affects the screen integrity rules:
本申请实施例中,元素组影响画面完整度规则为预先设定的,用于记录各元素组类型的元素组对视频画面内容完整度有影响的参数的阈值和/或参考值。In this embodiment of the present application, the rules for element groups affecting picture integrity are preset, and are used to record thresholds and/or reference values of parameters that affect the integrity of video picture content by element groups of each element group type.
示例性的,如图16所示,为本申请实施例中影响画面完整度规则的一个结构示意图。元素组影响画面完整度规则中可以包含元素组的多种不同元素组类型。元素组类型与元素的元素类型相同。例如元素组类型可以包括:人、人脸、文字、其他后期文字、字幕、版权标志、台标、水印、植物类主体、动物类主体、风景类主体等。各元素组类型的元素组可以分别对应多个参数,例如参数可以包括:残缺比例、残缺类型、持续时间、时段类型、重要程度、周边元素组等。Exemplarily, as shown in FIG. 16 , it is a schematic structural diagram of a rule affecting the screen integrity in this embodiment of the present application. Element Groups Affecting Picture Integrity There are many different types of element groups that can be included in an element group. The element group type is the same as the element type of the element. For example, the element group types may include: people, human faces, texts, other later texts, subtitles, copyright signs, station logos, watermarks, plant subjects, animal subjects, landscape subjects, and the like. The element groups of each element group type may correspond to multiple parameters, for example, the parameters may include: defect ratio, defect type, duration, time period type, importance, surrounding element groups, and the like.
其中,残缺比例表示元素组的残缺程度值,例如取值范围可以为0-1,数值越大表示残缺程度越大;残缺类型表示元素组的残缺方向,取值可以为横向或者纵向等;持续时间表示元素组中元素不完整的有间断的最短时间;时段类型的取值为视频时长内任一段时长,常见的取值为视频开始后若干秒以及视频结尾前若干秒;重要程度用于表示元素组在视频中的重要程度;周边元素组为元素组的周边元素组。Among them, the incompleteness ratio represents the value of the incompleteness degree of the element group, for example, the value range can be 0-1, the larger the value, the greater the degree of incompleteness; the incompleteness type represents the incompleteness direction of the element group, and the value can be horizontal or vertical, etc.; continuous Time indicates the shortest period of time when the elements in the element group are incomplete and discontinuous; the value of the period type is any period of time within the video duration, and the common values are several seconds after the start of the video and several seconds before the end of the video; the degree of importance is used to indicate The importance of the element group in the video; the surrounding element group is the surrounding element group of the element group.
在元素组影响画面完整度规则中,各元素组的元素组类型对应的不同参数下可以包含该元素组类型的元素组的该参数的阈值和/或参考值。示例性的,下表2为本申请实施例中预设的元素组影响画面完整度规则的一个示例:In the rule of element group affecting picture integrity, different parameters corresponding to the element group type of each element group may include the threshold and/or reference value of the parameter of the element group of the element group type. Exemplarily, the following table 2 is an example of the preset element group affecting the picture integrity rule in the embodiment of the application:
表2Table 2
例如,在该表2所示的预设的元素组影响画面完整度规则中,元素组类型为其他后期文字的元素组对应有残缺比例、残缺类型、持续时间、时段类型、重要程度、周边元素组这5个参数。其中,其对应的残缺比例参数的阈值为大于等于1/3,其对应的持续时间参数的阈值为大于等于15秒、其对应的残缺类型参数的参考值为横向,其对应的时段类型参数的参考值为任意,其对应的重要程度参数的参考值为大于3,其对应的周边元素组参数的参考值为无。其中,参数的阈值表示若该元素组类型的元素组的该参数的值超出该阈值,则该元素组可能被判断会影响画面内容完整度,参数的参考值可以用于限定该判断的条件。For example, in the preset rule of element group affecting picture integrity shown in Table 2, the element group whose element group type is other later texts corresponds to the incomplete ratio, incomplete type, duration, period type, importance, surrounding elements Group these 5 parameters. Among them, the threshold value of the corresponding incomplete ratio parameter is greater than or equal to 1/3, the threshold value of the corresponding duration parameter is greater than or equal to 15 seconds, the reference value of the corresponding incomplete type parameter is horizontal, and the corresponding period type parameter is The reference value is arbitrary, the reference value of the corresponding importance parameter is greater than 3, and the reference value of the corresponding surrounding element group parameter is none. The threshold of the parameter indicates that if the value of the parameter of the element group of the element group type exceeds the threshold, the element group may be judged to affect the integrity of the screen content, and the reference value of the parameter can be used to limit the judgment condition.
本申请实施例中,电子设备可以根据获取到的元素组的参数的值和预设的元素组影响画面完整度规则,确定元素组是否对视频画面内容完整度有影响,即确定元素组对视频画面的影响结果。In this embodiment of the present application, the electronic device may determine whether the element group has an impact on the integrity of the video picture content according to the acquired parameter values of the element group and the preset element group influence picture integrity rule, that is, determine whether the element group affects the video picture content integrity. screen effect.
可以理解的是,元素组对视频画面内容完整度的影响结果的表现形式可以有很多种,例如该结果可以为二元取值,该二元取值包括:有影响、无影响;该结果可以为多元取值,该多元取值包括:无影响、有一定影响、有很大影响、影响。为了便于说明,以元素组对视频画面内容完整度的影响结果为二元取值为例进行说明,其中二元取值包括:该元素组影响画面内容完整度、该元素组不影响画面完整度。It can be understood that there can be many expressions of the result of the influence of the element group on the content integrity of the video image. For example, the result can be a binary value, and the binary value includes: influential, no influence; the result can be It is a multivariate value, and the multivariate value includes: no influence, certain influence, great influence, and influence. For the convenience of explanation, the effect of the element group on the content integrity of the video picture is taken as an example of a binary value, wherein the binary value includes: the element group affects the integrity of the picture content, and the element group does not affect the integrity of the picture. .
可以理解的是,根据实际需求的不同,预设的元素组影响画面完整度规则中的参数可以更多或更少,此处不作限定。It can be understood that, according to different actual requirements, the parameters in the preset element group affecting the picture integrity rule may be more or less, which is not limited here.
示例性的,如下表3为本申请实施例中预设的元素组影响画面完整度规则的另一个示例。Exemplarily, the following Table 3 is another example of the rule that the preset element group affects the picture integrity in the embodiment of the present application.
表3table 3
该表3所示的预设的元素组影响画面完整度规则中只包含残缺比例参数的阈值。其中,元素组类型为其他后期文字的元素组对应的残缺比例参数的阈值为大于等于1/3,表示对于一个视频中元素组类型为其他后期文字的元素组,若其残缺比例超出1/3,则可以判定该元素组会影响画面内容完整度。若其残缺比例不超出1/3,则可以判定该元素组不会影响画面内容完整度。The preset element group effects shown in Table 3 only include the threshold of the incompleteness ratio parameter in the picture integrity rule. Among them, the threshold value of the incomplete ratio parameter corresponding to the element group whose element group type is other later text is greater than or equal to 1/3, which means that for an element group whose element group type is other later text in a video, if the incomplete ratio exceeds 1/3 , it can be determined that the element group will affect the integrity of the screen content. If its incomplete ratio does not exceed 1/3, it can be determined that the element group will not affect the integrity of the screen content.
示例性的,如下表4为本申请实施例中预设的元素组影响画面完整度规则的另一个示例。Exemplarily, the following Table 4 is another example of the rule that the preset element group affects the picture integrity in the embodiment of the present application.
表4Table 4
该表4所示的预设的元素组影响画面完整度规则中包含残缺比例参数的阈值和持续时间参数的阈值。其中,元素组类型为其他后期文字的元素组对应的残缺比例参数的阈值为大于等于1/3、持续时间参数的阈值为大于等于15秒。对于一个视频中元素组类型为其他后期文字的元素组,若其残缺比例超出1/3并且持续时间大于等于15秒,则可以判定该元素组会影响画面内容完整度。同理,若其残缺比例不超出1/3,不论其持续时间是多少,均可以判定该元素组不会影响视频画面内容完整度;若其持续时间小于15秒,不论其残缺比例是多少,均可以判定该元素组不会影响视频画面内容完整度。The preset element group influence picture integrity rule shown in Table 4 includes the threshold value of the incomplete ratio parameter and the threshold value of the duration parameter. Among them, the threshold value of the incomplete ratio parameter corresponding to the element group whose element group type is other late text is greater than or equal to 1/3, and the threshold value of the duration parameter is greater than or equal to 15 seconds. For an element group in a video whose element group type is other later text, if its incomplete ratio exceeds 1/3 and the duration is greater than or equal to 15 seconds, it can be determined that the element group will affect the integrity of the picture content. In the same way, if the defect ratio does not exceed 1/3, regardless of its duration, it can be determined that the element group will not affect the integrity of the video content; if its duration is less than 15 seconds, no matter what the defect ratio is, It can be determined that the element group will not affect the integrity of the video image content.
示例性的,如下表5为本申请实施例中预设的元素组影响画面完整度规则的另一个示例。Exemplarily, the following Table 5 is another example of the rule that the preset element group affects the picture integrity in the embodiment of the present application.
表5table 5
该表5所示的预设的元素组影响画面完整度规则中包含两个参数的阈值和一个参数的参考值,其中参数的阈值为残缺比例参数的阈值和持续时间参数的阈值、残缺类型参数的参考值。其中,元素组类型为其他后期文字的元素组对应的残缺比例参数的阈值为大于等于1/3、持续时间参数的阈值为大于等于15秒,残缺类型为横向。对于一个视频中类型为其他后期文字的元素组,若其残缺类型横向,且残残缺比例超出1/3、持续时间大于等于15秒,则可以判定该元素组会影响画面内容完整度;若其残缺类型不为横向,则可以判定该元素组不会影响画面内容完整度。The preset element group influence picture integrity rule shown in Table 5 includes two parameter thresholds and one parameter reference value, wherein the parameter thresholds are the threshold value of the defect ratio parameter, the threshold value of the duration parameter, the defect type parameter reference value. Among them, the threshold value of the incomplete ratio parameter corresponding to the element group type of other later texts is greater than or equal to 1/3, the threshold value of the duration parameter is greater than or equal to 15 seconds, and the incomplete type is horizontal. For an element group whose type is other later text in a video, if its defect type is horizontal, and the defect ratio exceeds 1/3, and the duration is greater than or equal to 15 seconds, it can be determined that the element group will affect the integrity of the picture content; If the defect type is not horizontal, it can be determined that the element group will not affect the integrity of the screen content.
示例性的,如下表6为本申请实施例中预设的元素组影响画面完整度规则的另一个示例。Exemplarily, the following Table 6 is another example of the rule that the preset element group affects the picture integrity in the embodiment of the present application.
该表6所示的预设的元素组影响画面完整度规则中对同一元素类型的元素组有两组规则,其中每一组规则分别包含相同种类的两个参数的阈值和一个参数的参考值,分别记为规则1和规则2。参数的阈值为残缺比例的阈值和持续时间的阈值、参数的参考值为残缺类型的参考值。There are two groups of rules for the element group of the same element type in the preset element group influence picture integrity rules shown in Table 6, wherein each group of rules respectively includes the same type of thresholds of two parameters and a reference value of one parameter , denoted as
当元素组的残缺类型参数为横向时,判定元素是否影响画面完整度需要参照规则1中的参数的阈值:残缺比例参数的阈值为大于等于1/3、持续时间参数的阈值为大于等于15秒;当残缺类型参数为纵向时,判定元素时否影响画面完整度参考规则2中的参数的阈值:残缺比例参数的阈值大于等于1/5、持续时间参数的阈值为大于等于10秒。When the defect type parameter of the element group is horizontal, to determine whether the element affects the picture integrity, it is necessary to refer to the parameter thresholds in rule 1: the threshold value of the defect ratio parameter is greater than or equal to 1/3, and the threshold value of the duration parameter is greater than or equal to 15 seconds ; When the incomplete type parameter is vertical, determine whether the element affects the picture integrity. Refer to the threshold of the parameter in rule 2: the threshold of the incomplete ratio parameter is greater than or equal to 1/5, and the threshold of the duration parameter is greater than or equal to 10 seconds.
表6Table 6
对于元素组类型为其他后期文字的元素组,当其残缺类型为横向时,其残缺比例大于等于1/3,且其持续时间大于等于15秒,则可以判定该元素组会影响画面内容完整度;当其残缺类型为横向时,其残缺比例小于1/3或其持续时间小于15秒,则可以判定该元素组不会影响画面内容完整度。For the element group whose element group type is other late text, when its incomplete type is horizontal, its incomplete ratio is greater than or equal to 1/3, and its duration is greater than or equal to 15 seconds, it can be determined that the element group will affect the integrity of the screen content ; When the defect type is horizontal, the defect ratio is less than 1/3 or the duration is less than 15 seconds, it can be determined that the element group will not affect the integrity of the screen content.
对于元素类似为其他后期文字的元素组,当其残缺类型为纵向时,其残缺比例大于等于1/3,且持续时间大于等于15秒,则可以判定该元素组会影响画面内容完整度;当其残缺类型为纵向时,其残缺比例小于1/3或其持续时间小于15秒,则可以判定该元素组不会影响画面内容完整度。For an element group whose elements are similar to other later texts, when the defect type is vertical, the defect ratio is greater than or equal to 1/3, and the duration is greater than or equal to 15 seconds, it can be determined that the element group will affect the integrity of the screen content; when When the defect type is vertical, the defect ratio is less than 1/3 or the duration is less than 15 seconds, it can be determined that the element group will not affect the integrity of the screen content.
示例性的,上表2所示的预设的元素组影响画面完整度规则中包含两个参数的参考值和多个参数的阈值,参数的阈值为残缺比例的阈值和持续时间的阈值,参考值为残缺类型的参考值、时段类型的参考值、重要程度的参考值、周边元素组的参考值。其中,当元素类型为其他后期文字的元素组的残缺类型为横向,重要程度大于3、残缺比例超出1/3、持续时间大于等于15秒,则可以判定元素组影响画面完整度;若其重要程度小于等于3,则可以判定元素组不影响画面完整度;若其残缺类型为纵向,则可以判定元素组不影响画面完整度。Exemplarily, the preset element group influence picture integrity rule shown in Table 2 above includes reference values of two parameters and thresholds of multiple parameters. The values are the reference value of the incomplete type, the reference value of the period type, the reference value of the importance degree, and the reference value of the surrounding element group. Among them, when the incomplete type of the element group whose element type is other late text is horizontal, the importance degree is greater than 3, the incomplete ratio exceeds 1/3, and the duration is greater than or equal to 15 seconds, it can be determined that the element group affects the picture integrity; if it is important If the degree is less than or equal to 3, it can be determined that the element group does not affect the integrity of the picture; if the defect type is vertical, it can be determined that the element group does not affect the integrity of the picture.
可以理解的是,预设的元素组影响画面完整度规则中还可以包含其他的与元素组相关的参数的阈值或参考值,此处不作限定。It can be understood that, the preset rule for affecting the picture integrity of the element group may further include thresholds or reference values of other parameters related to the element group, which are not limited here.
本申请涉及的元素组影响画面完整度规则在其他实施例中可以表现为函数、映射关系等。In other embodiments, the element group affecting the picture integrity rule involved in the present application may be expressed as a function, a mapping relationship, or the like.
值得说明的是,本申请涉及的元素组影响画面完整度规则中有影响的参数的阈值和/或参考值可以来源于元素组的元素组属性。It is worth noting that the threshold value and/or the reference value of the parameter affected in the element group affecting the picture integrity rule involved in the present application may be derived from the element group attribute of the element group.
(5)完整度判断模型:(5) Integrity judgment model:
本申请实施例中,完整度判断模型用于确定视频的画面内容是否完整。In this embodiment of the present application, the integrity judgment model is used to determine whether the picture content of the video is complete.
完整度判断模型的输入可以包括:元素组对视频画面内容的影响结果。完整度判断模型的输出包括:视频画面内容完整、视频画面内容不完整。The input of the integrity judgment model may include: the influence result of the element group on the content of the video picture. The output of the integrity judgment model includes: complete video image content and incomplete video image content.
完整度判断模型是利用人工智能领域中机器学习方法或深度学习方法构建出的模型。该模型基于训练数据完成的。训练数据的标签为视频画面内容完整或视频画面内容不完整。The integrity judgment model is a model constructed by using machine learning methods or deep learning methods in the field of artificial intelligence. The model is done based on the training data. The label of the training data is that the content of the video image is complete or the content of the video image is incomplete.
可以理解的是,该完整度判断模型的输入还可以包含很多其他的数据,例如在本申请的一些实施例中,该完整度判断模型的输入还可以包括元素组;在本申请的一些实施例中,该完整度判断模型的输入还可以包括视频的分类信息;在本申请的一些实施例中,完整度判断模型的输入还可以包括元素组的元素组属性。此处不作限定。It can be understood that the input of the integrity judgment model may also include many other data. For example, in some embodiments of the present application, the input of the integrity judgment model may also include element groups; in some embodiments of the present application , the input of the completeness judgment model may further include video classification information; in some embodiments of the present application, the input of the completeness judgment model may also include the element group attribute of the element group. There is no limitation here.
视频的分类信息可以有多种获取方式。例如,可以根据视频的主题获取视频的分类信息;可以根据用户或视频生产者在上传视频时选择的视频所属的栏目、分区等信息获取视频的分类信息;可以根据视频的名称获取视频的分类信息;可以根据用户或视频生产者的历史上传记录、用户画像等信息预测视频的分类信息等。在此不做限定。视频的分类信息可以是动物、数码、人物、日常、美食等。There are various ways to obtain video classification information. For example, the classification information of the video can be obtained according to the subject of the video; the classification information of the video can be obtained according to the information such as the column and division of the video selected by the user or the video producer when uploading the video; the classification information of the video can be obtained according to the name of the video ; It can predict the classification information of videos based on the historical upload records of users or video producers, user portraits and other information. This is not limited. The classification information of the video can be animals, numbers, people, daily life, food, etc.
(6)评分模型:(6) Scoring model:
本申请实施例中,评分模型用于确定视频的画面内容完整度的评分。In the embodiment of the present application, the scoring model is used to determine the scoring of the completeness of the picture content of the video.
评分模型的输入包括:元素组对视频画面内容的影响结果。评分模型的输出为视频的画面内容完整度的评分。The input of the scoring model includes: the effect of the element group on the content of the video image. The output of the scoring model is a score for the completeness of the video's picture content.
可以理解的是,该评分模型的输入还可以包含很多其他的数据,例如在本申请的一些实施例中,该完整度判断模型的输入还可以包括元素组;在本申请的一些实施例中,该完整度判断模型的输入还可以包括视频的分类信息;在本申请的一些实施例中,完整度判断模型的输入还可以包括元素组的元素组属性。此处不作限定。It can be understood that the input of the scoring model may also include many other data. For example, in some embodiments of the present application, the input of the integrity judgment model may also include element groups; in some embodiments of the present application, The input of the completeness judgment model may further include video classification information; in some embodiments of the present application, the input of the completeness judgment model may further include the element group attribute of the element group. There is no limitation here.
可以理解的是,在对视频画面内容完整度进行评分时,可以考虑视频的分类信息。例如,针对视频的分类信息为数码的情况下,数码产品、字幕、后期文字等元素组对视频画面内容完整度评分的影响更大,而针对视频的分类信息为人物的情况下,人物等元素组对视频画面内容完整度的评分影响更大。It can be understood that, when the content integrity of the video picture is scored, the classification information of the video may be considered. For example, when the classification information for the video is digital, element groups such as digital products, subtitles, and post-text texts have a greater impact on the content integrity score of the video, while when the classification information for the video is characters, elements such as characters The group has a greater impact on the score of the completeness of the video picture content.
评分模型可以是利用人工智能领域中BP神经网络模型构建出的模型。该模型基于训练数据完成的。其中训练数据为从视频中提取出的元素组的第一残缺参数组、元素组属性等,训练数据的标签为视频画面内容完整度的评分。其中,BP神经网络的模型的示意图如图18所示。The scoring model may be a model constructed by using the BP neural network model in the field of artificial intelligence. The model is done based on the training data. The training data is the first incomplete parameter group, element group attributes, etc. of the element group extracted from the video, and the label of the training data is the score of the content integrity of the video image. The schematic diagram of the model of the BP neural network is shown in Figure 18.
本申请实施例中,评分模型可以是利用人工智能领域中其它机器学习方法或其它深度学习方法构建出的模型。In this embodiment of the present application, the scoring model may be a model constructed by using other machine learning methods or other deep learning methods in the field of artificial intelligence.
(7)修改建议生成模型:(7) Modify the proposed generation model:
本申请实施例中,修改建议生成模型用于确定视频的画面内容修改建议。In the embodiment of the present application, the modification suggestion generation model is used to determine the modification suggestion of the picture content of the video.
修改建议生成模型的输入包括:元素组是否影响画面完整度、元素组的元素组属性。评分模型的输出为视频的画面内容完整度的修改建议。其中,修改建议包括对元素组的修改建议。其中,修改建议可以以审核报告的方式呈现,审核报告包括:元素组的修改建议。审核报告还可以包括:元素组的元素组属性。The input for modifying the proposed generation model includes: whether the element group affects the screen integrity, and the element group attribute of the element group. The output of the scoring model is a suggestion for modifying the completeness of the video's picture content. Among them, the modification proposal includes the modification proposal for the element group. Wherein, the modification suggestion can be presented in the form of an audit report, and the audit report includes: the modification suggestion of the element group. The audit report can also include: Element group attributes for the element group.
可以理解的是,该修改建议生成模型的输入还可以包含很多其他的数据,例如在本申请的一些实施例中,该完整度判断模型的输入还可以包括元素组;在本申请的一些实施例中,该完整度判断模型的输入还可以包括视频的分类信息。此处不作限定。It can be understood that the input of the modification suggestion generation model may also include many other data. For example, in some embodiments of the present application, the input of the integrity judgment model may also include element groups; in some embodiments of the present application , the input of the integrity judgment model may also include video classification information. There is no limitation here.
其中针对元素组的修改建议包括:无建议(Suggest)、可不处理(Pass)、剪裁(Cut)、涂抹(Cover)、删帧(Delete)、模糊(Blur)等。其中,可不处理(Pass)表示该元素组不会影响视频画面内容完整度;剪裁(Cut)表示可以根据元素组边界框高或宽裁剪视频边缘;涂抹(Cover)表示可以通过其他图标以覆盖的方式遮盖该元素组;删帧(Delete)表示可以通过删除该图像帧提高视频画面内容完整度;模糊(Blur)表示可以通过马赛克、高斯模糊方式模糊该元素组以提高视频画面内容完整度。The modification suggestions for the element group include: no suggestion (Suggest), optional processing (Pass), clipping (Cut), smearing (Cover), frame deletion (Delete), blur (Blur) and so on. Among them, can not process (Pass) means that the element group will not affect the integrity of the video picture content; cut (Cut) means that the video edge can be cropped according to the height or width of the element group bounding box; smear (Cover) means that it can be covered by other icons. cover the element group; Delete means that the content integrity of the video picture can be improved by deleting the image frame; Blur means that the element group can be blurred by mosaic and Gaussian blur to improve the integrity of the video picture content.
修改建议生成模型可以是利用人工智能领域中决策树算法构建出的模型。其中,决策树算法的示意图如图18所示。如图18所示,决策树的结点属性可以来源于元素组属性,决策树的结点属性还可以来源于元素组影响画面完整度规则的参考值和/或阈值。决策树的结点属性还可以来源于视频的分类信息。The modification suggestion generation model may be a model constructed by using a decision tree algorithm in the field of artificial intelligence. The schematic diagram of the decision tree algorithm is shown in Figure 18. As shown in FIG. 18 , the node attribute of the decision tree can be derived from the element group attribute, and the node attribute of the decision tree can also be derived from the reference value and/or the threshold value of the element group affecting the picture integrity rule. The node attributes of the decision tree can also be derived from the classification information of the video.
本申请实施例中,修改建议生成模型可以是利用人工智能领域中其它机器学习方法或其它深度学习方法构建出的模型。In this embodiment of the present application, the modification suggestion generation model may be a model constructed by using other machine learning methods or other deep learning methods in the field of artificial intelligence.
在本申请的一些实施例中,上述(5)完整度判断模型、(6)评分模型、(7)修改建议生成模型的输入可以包括元素组的元素组属性,而不包括元素组对视频画面内容的影响结果。电子设备可以直接根据元素组的元素组属性确定对视频的画面内容的评估结果:例如是否完整或评分,而不需要先确定元素组对视频画面内容的影响结果。此处不作限定。In some embodiments of the present application, the input of the above (5) completeness judgment model, (6) scoring model, and (7) modification suggestion generation model may include the element group attribute of the element group, but not the element group to the video picture. Content affects results. The electronic device can directly determine the evaluation result of the picture content of the video according to the element group attribute of the element group: for example, whether it is complete or scored, without first determining the influence result of the element group on the video picture content. There is no limitation here.
下面首先介绍现有技术中人工审核视频的场景。The following first introduces a scenario of manually reviewing a video in the prior art.
图1示出了现有技术中人工审核视频的场景。Fig. 1 shows a scene of manual review of videos in the prior art.
如图1所示,目前业界提供的视频画面内容完整度审核方式主要是人工审核。视频审核人员在审核视频时,多采用倍速浏览视频、跳帧浏览视频等方法,容易遗漏、忽略不符合视频审核规范的视频图像帧。其次,视频审核人员在审核视频时,由于需要人工浏览视频的内容,而受限于工作时长与视频时长,导致人工审核效率远远不能满足视频审核的需求。再次,不同视频审核人员在审核视频时具体的评判标准不同,而且不能给出针对性的修改的指导意见,降低了视频审核的有效性,不利于维护视频原创内容的生态圈维护。As shown in Figure 1, the current industry-provided way of reviewing the content integrity of video images is mainly manual review. When reviewing videos, video reviewers often use methods such as double-speed browsing, skipping frames, etc., which are easy to miss or ignore video image frames that do not meet the video review specifications. Secondly, when video reviewers are reviewing videos, they need to manually browse the content of the video, which is limited by the working time and video duration, resulting in the efficiency of manual review being far from meeting the needs of video review. Third, different video reviewers have different specific evaluation criteria when reviewing videos, and they cannot give guidance for targeted revisions, which reduces the effectiveness of video review and is not conducive to maintaining the ecosystem of original video content.
针对上述目前视频审核领域存在的问题,本申请提供了一种视频画面内容完整度评估方法及电子设备。本申请提供的视频画面内容完整度评估方法及电子设备,能够自动审核视频生产者上传的视频,给出视频画面内容是否完整的评估结果,提升视频审核效率;进一步的,本申请提供的视频画面内容完整度评估方法及电子设备能够给出视频画面完整度的评分;再进一步的,本申请提供的视频画面内容完整度评估方法及电子设备能够给出该视频针对性的修改的指导意见。In view of the above-mentioned problems existing in the field of current video review, the present application provides a method and electronic device for evaluating the content integrity of a video image. The video image content integrity evaluation method and electronic device provided in this application can automatically review videos uploaded by video producers, give an evaluation result of whether the video image content is complete, and improve the efficiency of video review; further, the video images provided in this application The content integrity evaluation method and the electronic device can give a score of the video image integrity; further, the video image content integrity evaluation method and electronic device provided by the present application can provide guidance for targeted modification of the video.
可以理解的是,本申请实施例中的电子设备可以通过运行视频画面内容完整度评估系统来实现本申请实施例中的视频画面内容完整度评估方法。It can be understood that, the electronic device in the embodiment of the present application may implement the method for evaluating the integrity of the video image content in the embodiment of the present application by running the system for evaluating the integrity of the video image content.
图2至图6为本申请实施例中一组示例性用户界面示意图。2 to 6 are schematic diagrams of a group of exemplary user interfaces in the embodiments of the present application.
如图2所示,用户或视频生产者可以将待审核的原始视频上传至视频画面内容完整度评估系统中。可选的,用户或视频生产者在上传原始视频时,可以同时向视频画面内容完整度评估系统上传原始视频的分类信息,分类信息可以包括:原始视频的主题、标签、分类等信息。As shown in Figure 2, the user or video producer can upload the original video to be reviewed to the video image content integrity evaluation system. Optionally, when uploading the original video, the user or video producer can upload the classification information of the original video to the video image content integrity evaluation system at the same time.
如图3和图4所示,用户或视频生产者在将待审核的原始视频上传到电子设备中的视频画面内容完整度评估系统后,电子设备会显示对于该原始视频的审核结果。审核结果包括:视频画面内容完整或视频画面内容不完整。其中,本申请实施例中的电子设备对原始视频的视频画面内容完整度评估方法如图7所示。As shown in FIG. 3 and FIG. 4 , after the user or video producer uploads the original video to be reviewed to the video image content integrity evaluation system in the electronic device, the electronic device will display the review result of the original video. The review results include: the content of the video screen is complete or the content of the video screen is incomplete. The method for evaluating the integrity of the video image content of the original video by the electronic device in the embodiment of the present application is shown in FIG. 7 .
如图5所示,可选的,电子设备在审核原始视频后,会针对原始视频的画面内容完整度给出完整度评分,完整度评分用于评价原始视频画面内容完整度。视频生产者或用户可以通过该完整度评分更具象的了解原始视频的完整度,有助于帮助视频生产者或用户提高创作的原始视频的质量。As shown in FIG. 5 , optionally, after reviewing the original video, the electronic device will give a completeness score for the completeness of the image content of the original video, and the completeness score is used to evaluate the completeness of the original video image content. The video producer or user can understand the integrity of the original video more accurately through the integrity score, which helps the video producer or user to improve the quality of the original video created.
如图6所示,可选的,本系统在审核原始视频后,会针对影响原始视频的画面内容完整度的元素组给出审核报告。审核报告包括:影响原始视频的画面内容的元素组、该元素组在原始视频的位置、该元素组在原始视频中的持续时间和首次出现时间、该元素组在原始视频中的重要程度、该元素组的修改推荐建议等。其中,修改建议包括:通过、剪裁、遮盖、涂抹、模糊、删帧等。本系统通过对原始视频进行审核,并针对原始视频的画面内容完整度,给出对应的修改意见,其中,修改意见以审核报告的形式呈现出来,能够有效的帮助视频生产者或用户修改原始视频,以提高原始视频的画面内容完整度。As shown in FIG. 6 , optionally, after reviewing the original video, the system will issue a review report for the element group that affects the integrity of the picture content of the original video. The audit report includes: the element group that affects the picture content of the original video, the position of the element group in the original video, the duration and first appearance time of the element group in the original video, the importance of the element group in the original video, the Recommendations for modification of element groups, etc. Among them, the modification suggestions include: passing, clipping, masking, smearing, blurring, deleting frames, etc. This system reviews the original video and gives corresponding revision opinions according to the completeness of the original video's screen content. The revision opinions are presented in the form of review reports, which can effectively help video producers or users to modify the original video. , to improve the picture content integrity of the original video.
下面介绍本申请提供的视频画面内容完整度评估方法及电子设备。The following introduces the method and electronic device for evaluating the integrity of video image content provided by the present application.
图7为本申请实施例中视频画面内容完整度评估方法的一个流程示意图。FIG. 7 is a schematic flowchart of a method for evaluating the content integrity of a video image according to an embodiment of the present application.
值得说明的是,待检测视频可以为本申请中的原始视频、第一视频或第二视频。It should be noted that the video to be detected may be the original video, the first video or the second video in this application.
S701、S7021为对视频的处理,可以得到图像帧。S7022、S7023为获取元素组的步骤。S7024、S7025,筛选得到残缺元素组。S7026为计算元素组的元素组属性。S7027为计算元素组是否影响画面完整度。S703为生成视频画面内容完整度评估的结果。S701 and S7021 are for video processing, and image frames can be obtained. S7022 and S7023 are the steps of acquiring the element group. In S7024 and S7025, the incomplete element group is obtained by screening. S7026 is to calculate the element group attribute of the element group. S7027 is to calculate whether the element group affects the picture integrity. S703 is to generate a result of evaluating the integrity of the video image content.
S701:对原始视频进行切割,得到N段第一视频。S701: Cut the original video to obtain N segments of the first video.
电子设备在接收到来自视频生成者或用户上传的等待完整度审核的视频后,电子设备将该视频作为原始视频,并对原始视频进行处理,其中,处理过程包括:切割。切割操作具体包括:依据原始视频的时间长度、视频转场点等信息,将原始视频切分为N段视频,N为大于等于1的正整数。当N等于1时,此时并不对原始视频进行切割,第一视频即是原始视频。当N大于1时,此时对原始视频进行切割,切割后得到N段第一视频。After the electronic device receives the video uploaded from the video generator or the user and is awaiting completeness review, the electronic device regards the video as the original video and processes the original video, wherein the processing process includes: cutting. The cutting operation specifically includes: dividing the original video into N video segments according to the time length of the original video, video transition points and other information, where N is a positive integer greater than or equal to 1. When N is equal to 1, the original video is not cut at this time, and the first video is the original video. When N is greater than 1, the original video is cut at this time, and N segments of the first video are obtained after cutting.
可以理解的是,一方面,将原始视频切割为N段第一视频,并分别对N段第一视频进行后续的完整度评估处理,可以充分利用电子设备的并行计算能力,提高计算效率;另一方面,依据视频的转场点等信息对原始视频进行切割时,充分考虑了蒙太奇手法、运镜方式、转场等拍摄或剪辑手法对元素组(元素)时空完整度的影响,有效提高了电子设备对视频画面内容完整度评估的准确性。It can be understood that, on the one hand, the original video is cut into N segments of the first video, and the subsequent integrity evaluation processing is performed on the N segments of the first video, which can make full use of the parallel computing capability of the electronic device and improve the computing efficiency; On the one hand, when cutting the original video according to the transition points and other information of the video, the influence of montage, mirror movement, transition and other shooting or editing techniques on the temporal and spatial integrity of the element group (element) is fully considered, which effectively improves the performance of the original video. The accuracy of the electronic device's assessment of the integrity of the video image content.
在一些实施例中,电子设备在接收来自视频生成者或用户上传的等待完整度审核的视频时,同时接收视频生产者或者用户上传的视频分类信息,其中分类信息包括该视频的主题名称、该视频的标签等信息。In some embodiments, the electronic device receives video classification information uploaded by the video producer or user when receiving the video uploaded from the video producer or the user awaiting completeness review, where the classification information includes the subject name of the video, the Video tags, etc.
在一些实施例中,电子设备在接收来自视频生成者或用户上传的等待完整度审核的视频时,可以通过图像处理算法获得该视频的分类信息。In some embodiments, when the electronic device receives a video uploaded from a video generator or a user and is awaiting completeness review, the electronic device may obtain classification information of the video through an image processing algorithm.
图8为本申请实施例中视频切割的一个示例性示意图。FIG. 8 is an exemplary schematic diagram of video cutting in an embodiment of the present application.
如图8所示,原始视频被切割为三段视频,分别为第一视频1、第一视频2、第一视频3。As shown in FIG. 8 , the original video is cut into three video segments, which are a
在一些实施例中,当原始视频中没有转场点信息时,可以根据视频长度等时长切割原始视频,或者不切割视频(N等于1的情况)。例如,原始视频时长10分钟30秒,且原始视频中不存在转场点信息,可以将原始视频等时长切割为三段第一视频,其中每段第一视频时长为210秒。In some embodiments, when there is no transition point information in the original video, the original video may be cut according to the length of the video, or the video may not be cut (in the case where N is equal to 1). For example, if the original video is 10 minutes and 30 seconds long, and there is no transition point information in the original video, the original video can be cut into three first videos of equal duration, where each first video is 210 seconds long.
在一些实施例中,可以预先规定对原始视频切割后第一视频的时长或时长范围。例如规定第一视频时长为3分钟,当原始视频时长为10分钟30秒时,则可以将原始视频切分为四段,第一段第一视频为0到3分钟,第二段第一视频为3到6分钟,第三段第一视频为6到9分钟,第四段第一视频为9分钟到10分钟30秒。In some embodiments, the duration or duration range of the first video after cutting the original video may be pre-specified. For example, the length of the first video is set to be 3 minutes. When the length of the original video is 10 minutes and 30 seconds, the original video can be divided into four segments. The first segment of the first video is 0 to 3 minutes, and the second segment of the
在一些实施例中,切割原始视频后的多段第一视频之间可以用重叠的部分。In some embodiments, overlapping parts may be used between the multiple segments of the first video after cutting the original video.
在一些实施例中,当原始视频中存在转场点信息时,可以优先依据转场点信息进行切割视频。进一步的,根据转场点信息初步切割视频后得到至少一段中间视频,可以根据中间视频的视频时长,再次切割中间视频得到第一视频。In some embodiments, when there is transition point information in the original video, the video may be preferentially cut according to the transition point information. Further, at least a piece of intermediate video is obtained after preliminarily cutting the video according to the transition point information, and the first video can be obtained by cutting the intermediate video again according to the video duration of the intermediate video.
电子设备在得到N段第一视频后,可以对所有N段第一视频或者N段第一视频中部分第一视频执行步骤S702。After obtaining the N first videos, the electronic device may perform step S702 on all the N first videos or a part of the N first videos.
S702:包括顺序执行的步骤S7021、S7022、S7023、S7024、S7025、S7026、S7027。其中步骤S7021、S7024为不影响方案完整性的可选步骤。S702: including sequentially executed steps S7021, S7022, S7023, S7024, S7025, S7026, and S7027. Steps S7021 and S7024 are optional steps that do not affect the integrity of the solution.
其中S7021:对第一视频抽帧,得到第二视频。S7021: extracting frames from the first video to obtain the second video.
电子设备在获得第一视频后,对第一视频进行抽帧,抽帧后的视频数据记为第二视频。After obtaining the first video, the electronic device extracts frames from the first video, and the video data after the frame extraction is recorded as the second video.
其中抽帧操作具体包括:非自适应的抽帧和自适应的抽帧,抽帧的对象为第一视频;非自适应的抽帧具体包括:相隔固定的时间间隔或者帧间隔对抽帧的对象抽取一图像帧;自适应抽帧具体包括:依据当前图像帧与下一被抽取图像帧的数据差异(画面差异),动态调整抽取的下一图像帧和当前图像帧之间的时间间隔或帧间隔。The frame extraction operation specifically includes: non-adaptive frame extraction and adaptive frame extraction, and the object of frame extraction is the first video; non-adaptive frame extraction specifically includes: a fixed time interval or frame interval for the extraction of frames. The object extracts an image frame; the adaptive frame extraction specifically includes: dynamically adjusting the time interval or time interval between the extracted next image frame and the current image frame according to the data difference (picture difference) between the current image frame and the next extracted image frame frame interval.
可以理解的是,通过对第一视频抽帧,降低了后续步骤需要处理的第二视频中的图像帧数量,降低了电子设备的计算量。It can be understood that, by extracting frames from the first video, the number of image frames in the second video that needs to be processed in subsequent steps is reduced, and the calculation amount of the electronic device is reduced.
图9为本申请实施例中非自适应抽帧方法的一个示例性示意图。FIG. 9 is an exemplary schematic diagram of a non-adaptive frame extraction method in an embodiment of the present application.
如图9所示,若从第一视频的第L1帧作为被抽帧的第一帧,设置抽帧的帧间隔为L2,则第I个抽帧的帧为第一视频的第L1+(I-1)*(L2+1)帧。例如,在L1=1、L2=2的情况下,被抽帧的帧数为1、4、7…。根据原始视频帧率,通过设置抽帧间隔,可以实现非自适应抽帧。As shown in Figure 9, if the L1 frame of the first video is used as the first frame of the frame to be extracted, and the frame interval of the frame extraction is set to L2, then the frame of the first frame is the L1+(Ith frame of the first video. -1)*(L2+1) frames. For example, in the case of L1=1, L2=2, the number of frames to be extracted is 1, 4, 7, . . . According to the original video frame rate, non-adaptive frame extraction can be achieved by setting the frame extraction interval.
在一些实施例中,抽帧的频率可以为1帧每秒。In some embodiments, the frame rate may be 1 frame per second.
图10为本申请实施例中自适应抽帧方法的一个示例性示意图。FIG. 10 is an exemplary schematic diagram of an adaptive frame extraction method in an embodiment of the present application.
如图10所示,自适应的抽帧包括:根据第一视频的图像帧之间的数据差距(画面差距),自适应的调整抽帧的间隔。例如选取第一视频的第一帧作为被抽帧的第1帧,并以该帧作为基准帧,用于计算后续帧与基准帧的数据差距。当比较第2帧至第4帧时,数据差距没有超过阈值;当比较到第5帧时,第5帧的图像帧与第1帧的图像帧的数据差距大于预设的阈值,选则第5帧的图像帧作为被抽帧的第2帧。此时,选取第5帧作为基准帧,用于计算后续帧与基准帧的数据差距,进而选取第7帧作为被抽帧的第三帧。故,电子设备通过自适应的抽帧,选取了第1帧、第5帧、第7帧图像帧作为第二视频。As shown in FIG. 10 , the adaptive frame extraction includes: adaptively adjusting the frame extraction interval according to the data gap (picture gap) between the image frames of the first video. For example, the first frame of the first video is selected as the first frame of the extracted frame, and the frame is used as the reference frame to calculate the data gap between the subsequent frame and the reference frame. When comparing the second frame to the fourth frame, the data gap does not exceed the threshold; when comparing the fifth frame, the data gap between the image frame of the fifth frame and the image frame of the first frame is greater than the preset threshold, select the first The image frame of 5 frames is used as the second frame of the extracted frame. At this time, the fifth frame is selected as the reference frame, which is used to calculate the data gap between the subsequent frame and the reference frame, and then the seventh frame is selected as the third frame of the extracted frame. Therefore, the electronic device selects the first, fifth, and seventh image frames as the second video through adaptive frame extraction.
自适应抽帧方法中数据差距为两帧的图像帧之间的欧氏距离、绝对值距离、切比雪夫距离等;自适应抽帧方法中的数据差距为两帧的图像帧呈现的画面内容(图像内容)之间的差距,其中,该差距可以通过类图像匹配算法获得。In the adaptive frame sampling method, the data gap is the Euclidean distance, absolute value distance, Chebyshev distance, etc. between the image frames of the two frames; the data gap in the adaptive frame sampling method is the picture content presented by the two image frames. (image content), where this gap can be obtained by an image-like matching algorithm.
图11为本申请实施例中原始视频被切割并被抽帧的一个示例性示意图。FIG. 11 is an exemplary schematic diagram of an original video being cut and framed in an embodiment of the present application.
如图11所示,原始视频经过切割和抽帧后,得到第二视频1、第二视频2、第二视频3,分别用于后续处理。As shown in FIG. 11 , after the original video is cut and framed, a
在一些实施例中,原始视频可以不经过切割和/或抽帧,直接作为第二视频进行后续处理。In some embodiments, the original video may be directly used as the second video for subsequent processing without cutting and/or frame extraction.
在一些实施例中,原始视频可以仅经过切割操作后作为第二视频进行后续处理。In some embodiments, the original video may only undergo a cutting operation for subsequent processing as the second video.
在一些实施例中,原始视频可以仅经过抽帧操作后作为第二视频进行后续处理。In some embodiments, the original video may be processed only as the second video after frame extraction.
值得说明的是,可以不执行步骤S7021,此时第二视频即为第一视频。It should be noted that step S7021 may not be executed, and the second video is the first video at this time.
其中S7022:检测第二视频中每一图像帧中的元素。Wherein S7022: Detect elements in each image frame in the second video.
遍历第二视频数据中每一图像帧,在遍历到任一图像帧时,对该图像帧进行元素检测。对图像帧进行元素检测后,得到该图像帧包含的元素、元素的元素类型以及元素的元素属性等。其中元素的元素属性包括:元素的位置等。Traverse each image frame in the second video data, and when any image frame is traversed, perform element detection on the image frame. After the element detection is performed on the image frame, the elements contained in the image frame, the element type of the element, and the element attribute of the element are obtained. The element attributes of the element include: the position of the element, etc.
其中,元素和元素类型的定义可以参考上述术语解释(1)元素和元素类型的内容,此处不再赘述。For the definition of elements and element types, reference may be made to the above term explanation (1) content of elements and element types, and details are not repeated here.
元素检测包括:文字检测、人脸检测、主体检测、图标检测、水印检测、二维码检测等。主体检测包括:菜品识别、果蔬识别、动物识别、植物识别、车辆识别等。在一些实施例中,元素占据画面的位置可以用一个矩形选框的坐标表征,其中,元素的范围可以以矩形选框框选的范围为准。Element detection includes: text detection, face detection, subject detection, icon detection, watermark detection, QR code detection, etc. Subject detection includes: dish identification, fruit and vegetable identification, animal identification, plant identification, vehicle identification, etc. In some embodiments, the position of the element occupying the screen may be represented by the coordinates of a rectangular marquee, wherein the range of the element may be based on the range selected by the rectangular marquee.
可以理解的是,通过单个图像帧内的元素的完整度去判别原始视频画面内容完整度有一定的局限性。元素在原始视频内具有时空完整度属性,当视频生产者或用户通过蒙太奇、运镜、切换动画等拍摄剪辑手法制作原始视频的情况下,通过单图像帧内元素组完整度判别原始视频画面内容的完整度的效果较差。It can be understood that there are certain limitations in judging the content integrity of the original video picture by the integrity of the elements in a single image frame. Elements have spatiotemporal integrity attributes in the original video. When the video producer or user creates the original video by shooting and editing methods such as montage, mirror movement, switching animation, etc., the content of the original video is judged by the integrity of the element group within a single image frame. The integrity of the effect is poor.
图12和图13为本申请实施例中在单个图像帧中进行元素检测的一个示例性示意图。FIG. 12 and FIG. 13 are an exemplary schematic diagram of element detection in a single image frame in an embodiment of the present application.
例如,第二视频中共有多于1帧图像帧,通过遍历,对所有的图像帧分别进行元素检测,获得图像帧对应的画面内容中出现的元素、元素的属性等。如图12所示,选取一图像帧进行元素检测,包括分别进行文字检测、二维码检测、主体检测等。对图12所示的图像帧进行元素检测,其中,使用文字检测,检测到元素包括:1301文字1-“原创视频1”、1305文字2-“狗是人类的好朋友”;使用二维码检测,没有检测到元素;使用主体检测,检测到元素包括:1303动物1-狗、1304食物1-狗粮;使用图标检测,检测到的元素包括:1302图标-“华为HUAWE”。For example, there are more than one image frame in the second video. Through traversal, element detection is performed on all the image frames to obtain the elements appearing in the picture content corresponding to the image frames, the attributes of the elements, and the like. As shown in FIG. 12 , an image frame is selected for element detection, including character detection, two-dimensional code detection, and subject detection, respectively. Element detection is performed on the image frame shown in Figure 12, wherein, using text detection, the detected elements include: 1301 text 1 - "
进一步得到元素所属的元素类型,1301文字1-“原创视频1”属于其他后期文字,1302图标-“华为HUAWE”属于版权标志,1303动物1属于动物,1304食物1属于食物,1305文字2-“狗是人类的好朋友”属于字幕。Further get the element type to which the element belongs, 1301 text 1-"
对图像帧进行文字检测、二维码检测、主体检测、图标检测后,该图像帧的检测结果如图13所示。在该图像帧中,所有的元素可以用矩形选框框选出来,并得知元素的属性和元素所属的元素类型等。After text detection, two-dimensional code detection, subject detection, and icon detection are performed on the image frame, the detection result of the image frame is shown in FIG. 13 . In this image frame, all elements can be selected with a rectangular marquee box, and the attributes of the elements and the element type to which the element belongs are known.
其中S7023:选取第二视频中的至少两个图像帧,获取元素组。S7023: Select at least two image frames in the second video to obtain an element group.
选取第二视频数据中的至少两图像帧,通过元素相似度、聚类算法等方式获取元素组。在获取元素组时,也可以同时获取元素组的部分元素组属性如元素组位置等。At least two image frames in the second video data are selected, and element groups are obtained by means of element similarity, clustering algorithm, or the like. When acquiring the element group, you can also acquire some element group attributes of the element group, such as the element group position, etc. at the same time.
元素组、选取的方式、元素相似度定义、计算元素相似度的方法可以参考上述术语解释(2)元素组的内容,此处不再赘述。For the element group, the selection method, the definition of element similarity, and the method for calculating element similarity, reference may be made to the content of the above term explanation (2) element group, which will not be repeated here.
其中,通过聚类算法获取元素组包括:将所选取的帧的图像帧叠加后,通过对叠加后的图像帧使用聚类算法,聚类的结果即为元素组的分组结果。Wherein, obtaining the element group by using the clustering algorithm includes: after superimposing the image frames of the selected frame, by using the clustering algorithm on the superimposed image frame, the result of the clustering is the grouping result of the element group.
在一些实施例中,使用聚类算法如K均值算法时,可以根据步骤S7022的元素检测结果,以及选取帧的图像帧,预先决定聚类算法中的簇数,即元素组的个数。In some embodiments, when using a clustering algorithm such as the K-means algorithm, the number of clusters in the clustering algorithm, that is, the number of element groups, may be pre-determined according to the element detection result in step S7022 and the image frame of the selected frame.
可以理解的是,元素组保留了元素在视频中的时空完整性,可以作为判断视频画面内容完整度的基础单元。It can be understood that the element group retains the spatiotemporal integrity of the elements in the video, and can be used as a basic unit for judging the integrity of the video image content.
图14为本申请实施例中获取元素组的一个示例性场景示意图。FIG. 14 is a schematic diagram of an exemplary scenario for acquiring an element group in an embodiment of the present application.
如图14所示,在得到第二视频数据后,其中第二视频数据中包含三个图像帧。其中,每一图像帧中经过S7022的元素检测后,可以用虚线矩形框选出每一个图像帧中检测到的元素。As shown in FIG. 14 , after the second video data is obtained, the second video data includes three image frames. Wherein, after the element detection in S7022 is performed in each image frame, the detected element in each image frame can be selected by a dotted rectangle.
现有三个图像帧,包括第一图像帧、第二图像帧、第三图像帧,选取第一图像帧中的元素1401A作为基准元素,选取第二图像帧和第三图像帧中的所有元素作为被比较元素,计算基准元素与每一个被比较元素的相似度。在元素相似度大于预设元素相似度阈值0.85时,认为基准元素1401A与被比较元素为同一元素,即基准元素1401A与被比较元素属于同一元素组。其中,第二图像帧的1401B元素的与基准元素1401A的元素相似度为0.95、第三图像帧的1401C元素与基准元素1401A的元素相似度为0.94,由于1401B元素和1401C元素与基准元素1401A的元素相似度大于0.85,故1401A、1401B、1401C组成元素组,该元素组可以记为1401-Group-“原创视频1”。同理,可以得到元素组1402-Group-“华为HUAWE”、1403-Group-狗、1404-Group-狗粮、1405-Group-“狗是人类的好朋友”,其获取元素组的过程不再赘述。There are three existing image frames, including the first image frame, the second image frame, and the third image frame, and the
图15为本申请实施例中获取元素组的另一个示例性场景示意图。FIG. 15 is a schematic diagram of another exemplary scenario for acquiring an element group in an embodiment of the present application.
如图15所示,在得到第二视频数据后,其中第二视频数据中包含三个图像帧。其中,每一图像帧中经过步骤S7022的元素检测后,可以用虚线矩形框选出每一个图像帧中检测到的元素。As shown in FIG. 15 , after the second video data is obtained, the second video data includes three image frames. Wherein, after the element detection in step S7022 is performed in each image frame, the detected element in each image frame can be selected by a dotted rectangle.
将三帧图像帧进行叠加,叠加后使用预决定簇数的聚类算法,指定簇数为5,得到元素组。元素组包括:1401-Group-“原创视频1”、1402-Group-“华为HUAWE”、1403-Group-狗、1404-Group-狗粮、1405-Group-“狗是人类的好朋友”。The three image frames are superimposed, and after superposition, the clustering algorithm that determines the number of clusters is used, and the number of clusters is specified as 5, and the element group is obtained. Element groups include: 1401-Group-"
其中S7024:根据元素组位置,获取疑似残缺元素组。Wherein S7024: Obtain a suspected incomplete element group according to the element group position.
当元素组的位置靠近、覆盖视频的画面边缘时,可以认为该元素组为疑似残缺元素组。When the position of the element group is close to and covers the edge of the video, the element group can be considered as a suspected incomplete element group.
元素组的位置与视频的画面的边界之间的距离小于预设的第二距离阈值,认为元素组的位置靠近、覆盖画面边缘。If the distance between the position of the element group and the border of the video frame is smaller than the preset second distance threshold, it is considered that the position of the element group is close to and covers the edge of the frame.
根据步骤S7022中获取到的元素的位置,以及步骤S7023确定的该元素所属的元素组,可以得到元素组的位置。元素组的位置为所有属于该元素组的元素并集所占据的范围。元素组的位置可以有多种表现形式,例如,其中范围可以以矩形选框框选的范围为准,具体表现为一组四维坐标(x,y,h,w),其中(x,y)为矩形选框左上角的坐标,该坐标以画面的左下角作为原点,x轴方向为由左向右、y轴方向为由下向上,h为矩形选框的高,w为矩形选框的宽。根据x和h可以计算在x轴(横向)方向上,元素组是否靠近、覆盖视频的画面边缘;根据y和w可以计算在y轴(纵向)方向上,元素组是否靠近、覆盖视频的画面边缘。According to the position of the element obtained in step S7022 and the element group to which the element belongs determined in step S7023, the position of the element group can be obtained. The position of an element group is the range occupied by the union of all elements belonging to the element group. The position of the element group can be expressed in various forms. For example, the range can be based on the range selected by the rectangular marquee, which is specifically expressed as a set of four-dimensional coordinates (x, y, h, w), where (x, y) is The coordinates of the upper left corner of the rectangular marquee, the coordinates take the lower left corner of the screen as the origin, the x-axis direction is from left to right, the y-axis direction is from bottom to top, h is the height of the rectangular marquee, and w is the width of the rectangular marquee . According to x and h, it can be calculated whether the element group is close to and covers the edge of the video in the x-axis (horizontal) direction; according to y and w, it can be calculated whether the element group is close to and covers the video screen in the y-axis (vertical) direction edge.
示例性的,例如,第二视频的画面为矩形,则其高为H,宽为W,预设的第二距离阈值为threshold1。若存在一元素组,且该元素组的坐标为(x1,y1,h1,w1),则该元素组与视频画面的边界为threshold2=min{min{abs(y1-h1),abs(H-y1)},min{abs(W-x1-w1),x1}},其中min表示取最小值,abs表示取绝对值。当threshold2小于等于threshold1时,可以认为该元素组靠近、覆盖视频的画面边缘,则该元素组可以被认为是疑似残缺元素组;当threshold2大于threshold1时,则该元素组可以不被认为是疑似残缺元素组。Exemplarily, for example, if the picture of the second video is a rectangle, its height is H, its width is W, and the preset second distance threshold is threshold1. If there is an element group, and the coordinates of the element group are (x1, y1, h1, w1), the boundary between the element group and the video picture is threshold2=min{min{abs(y1-h1),abs(H- y1)}, min{abs(W-x1-w1),x1}}, where min represents the minimum value, and abs represents the absolute value. When threshold2 is less than or equal to threshold1, it can be considered that the element group is close to and covers the edge of the video, and the element group can be considered as a suspected incomplete element group; when threshold2 is greater than threshold1, the element group can not be considered as a suspected incomplete element group element group.
如图14以及图15所示,疑似残缺元素组包括:1401-Group-“原创视频1”、1402-Group-“华为HUAWE”、1405-Group-“狗是人类的好朋友”。As shown in Figure 14 and Figure 15, the suspected incomplete element groups include: 1401-Group-"
值得说明的是,可以不执行步骤S7024,即可以不对元素组进行筛选,此时疑似残缺元素组即为所有的元素组。It should be noted that step S7024 may not be performed, that is, the element group may not be filtered, and at this time, the suspected incomplete element group is all the element groups.
可以理解的是,通过筛选疑似残缺元素组,进一步降低了后续步骤中被处理元素组的数量,降低了计算量。It can be understood that, by screening suspected incomplete element groups, the number of processed element groups in subsequent steps is further reduced, and the amount of calculation is reduced.
S7025:判断疑似残缺元素组的完整度,获取残缺元素组S7025: Judge the completeness of the suspected incomplete element group, and obtain the incomplete element group
判断疑似残缺元素组的完整度的过程可以参考上述术语解释(3)元素组的完整度中的内容,此处不再赘述。For the process of judging the completeness of the suspected incomplete element group, reference may be made to the content in the above term explanation (3) Integrity of the element group, which will not be repeated here.
当疑似残缺元素组的完整度为完整的情况下,该疑似残缺元素组为完整元素组;当疑似残缺元素组的完整度为不完整的情况下,该疑似残缺元素组为残缺元素组。When the completeness of the suspected incomplete element group is complete, the suspected incomplete element group is a complete element group; when the completeness of the suspected incomplete element group is incomplete, the suspected incomplete element group is a incomplete element group.
值得说明的是,可以不执行步骤S7025,在该情况下,所有的疑似残缺元素组为残缺元素组。It is worth noting that step S7025 may not be executed, in this case, all suspected defective element groups are defective element groups.
可以理解是,元素组的完整度与元素的完整与否不同,元素组的完整度是在考虑元素组时空完整性的情况下,对元素组是否影响视频画面内容完整度的一种衡量参数。即完整元素组为不影响视频画面内容完整度的元素组,而残缺元素组为有可能影响视频画面内容完整度的元素组。It can be understood that the integrity of the element group is different from the integrity of the element. The integrity of the element group is a measurement parameter for whether the element group affects the integrity of the video image content under the consideration of the temporal and spatial integrity of the element group. That is, the complete element group is an element group that does not affect the integrity of the video picture content, and the incomplete element group is an element group that may affect the integrity of the video picture content.
在一些实施例中,在计算疑似残缺元素组的完整度过程中,可以计算得到元素组的残缺比例参数。例如,根据元素组的比例关系与预设的比例关系阈值的距离可以计算得到元素组的残缺程度;根据元素组的比例关系、投影特征与预设的比例关系、投影特征阈值的距离可以结算得到元素组的残缺程度。其中,残缺程度包括残缺比例等。In some embodiments, in the process of calculating the completeness of the suspected incomplete element group, the incompleteness ratio parameter of the element group may be calculated. For example, the degree of incompleteness of the element group can be calculated according to the distance between the proportional relationship of the element group and the preset proportional relationship threshold; the distance between the proportional relationship of the element group, the projected feature and the preset proportional relationship, and the projected feature threshold can be calculated and obtained. The degree of fragmentation of the element group. Among them, the degree of disability includes the proportion of disability and so on.
图16为本申请实施例中一个元素组完整度判别结果的示例性示意图。FIG. 16 is an exemplary schematic diagram of a completeness determination result of an element group in an embodiment of the present application.
如图16所示,在得到元素组的所属元素组类型后,通过投影特征、比例关系等进一步判断残缺程度得到的结果包括:403-Group-狗元素组完整、1404-Group-狗粮元素组完整、1405-Group-“狗是人类的好朋友”元素组不完整、1401-Group-“原创视频1”元素组不完整、1402-Group-“华为HUAWE”元素组不完整。As shown in Figure 16, after obtaining the element group type to which the element group belongs, the results obtained by further judging the degree of incompleteness through projection features, proportional relationships, etc. include: 403-Group-dog element group complete, 1404-Group-dog food element group Complete, 1405-Group-"Dogs are good friends of human beings" element group is incomplete, 1401-Group-"
其中S7026:计算残缺元素组的元素组属性。Wherein S7026: Calculate the element group attribute of the incomplete element group.
根据残缺元素组所属的元素组类型,分别确定各残缺元素组的元素组属性,其中,残缺元素组的元素组属性中包括至少一个第一残缺参数值。第一残缺参数值用于表示元素组的残缺比例。残缺元素组的元素组属性还可以包括用于表示残缺类型、持续时间、时段类型、重要程度、周边元素组等的第二残缺参数值、第三残残缺参数值等。According to the element group type to which the incomplete element group belongs, the element group attribute of each incomplete element group is determined respectively, wherein the element group attribute of the incomplete element group includes at least one first incomplete parameter value. The first incomplete parameter value is used to represent the incomplete ratio of the element group. The element group attribute of the incomplete element group may further include a second incomplete parameter value, a third incomplete parameter value, etc., which are used to indicate the type of the defect, the duration, the period type, the degree of importance, the surrounding element group, and the like.
第一残缺参数值有多种表现形式,第一残缺参数值可以是一个数值,第一残缺参数值可以是一个向量或矩阵中一行或矩阵中的一列等,此处不作限定。The first incomplete parameter value can be represented in various forms, the first incomplete parameter value may be a numerical value, and the first incomplete parameter value may be a vector or a row in a matrix or a column in a matrix, etc., which are not limited here.
示例性的,当第一残缺参数值时一矩阵中的一列时,第一残缺参数值用于表示该元素组中任一元素在该元素所对应的的图像帧中的残缺程度。残缺程度计算过程包括:使用比例关系、投影特征等计算元素的残缺程度。进一步的,通过该第一残缺参数可以确定元素组的残缺比例。Exemplarily, when the first incompleteness parameter value is a column in a matrix, the first incompleteness parameter value is used to indicate the degree of incompleteness of any element in the element group in the image frame corresponding to the element. The process of calculating the degree of incompleteness includes: calculating the degree of incompleteness of elements using proportional relations, projection features, etc. Further, the defect ratio of the element group can be determined through the first defect parameter.
例如,当第二视频为M帧图像帧的视频,且在获取元素组时选取了全部帧的图像帧,在该情况下,元素组D-Group的属性可以表示为M列的矩阵。其中,该矩阵中至少有一列数据中包含有残缺元素组中元素的残缺程度,即为元素组D-Group的第一残缺参数值。当该矩阵中的第三列数据表示元素组D-Group的第一残缺参数值,此时第三列第I行的数据表示元素组D-Group中第二视频中第I图像帧的元素残缺程度。根据D-Group的第一残缺参数值得到D-Group的残缺比例。For example, when the second video is a video of M image frames, and image frames of all frames are selected when acquiring the element group, in this case, the attribute of the element group D-Group can be represented as a matrix of M columns. Wherein, at least one column of data in the matrix contains the degree of incompleteness of elements in the incomplete element group, which is the value of the first incompleteness parameter of the element group D-Group. When the data in the third column of the matrix represents the value of the first incomplete parameter of the element group D-Group, the data in the third column and the first row at this time represents the incomplete element of the first image frame in the second video in the element group D-Group degree. The incomplete ratio of D-Group is obtained according to the value of the first incomplete parameter of D-Group.
示例性的,当第一残缺参数的表现形式为一个数值时,第一残缺参数值为元素组的残缺比例。Exemplarily, when the expression form of the first incompleteness parameter is a numerical value, the value of the first incompleteness parameter is the proportion of the incompleteness of the element group.
根据残缺元素组中元素在第二视频中各图像帧上的完整度,分别确定各残缺元素组的第二残缺参数组中的第二残缺参数值,其中第二残缺参数值可以包括:持续时间、时段类型、周边元素组、残缺类型、重要程度等。According to the completeness of the elements in the incomplete element group on each image frame in the second video, the second incomplete parameter value in the second incomplete parameter group of each incomplete element group is respectively determined, wherein the second incomplete parameter value may include: duration , period type, surrounding element group, incomplete type, importance, etc.
以持续时间为例,持续时间为残缺元素组在第二视频中元素出现不完整的有间断的最短时间段。对于任一残缺元素组来说,可以使用编码1表示残缺,0表示完整,-1表示未出现,元素出现各帧(一秒一帧)0中完整度可表示为:(共30位)000001100001111100001111110-1-1-1那么元素组出现时间为30秒,有间断最短不完整的时间为21秒,则该残缺元素组的第一残缺参数组中第二残缺参数值中的持续时间的值为21秒。其中,可以有多种方法界定元素组中的元素组在出现帧中是残缺、完整、还是未出现,例如,当元素组的第一残缺参数组中第一残缺参数中的某一帧的残缺程度值大于预设残缺程度阈值时,认为元素在该帧中不完整;若某一帧的残缺程度值为默认值时,认为元素在该帧未出现;若某一帧的残缺程度值小于预设残缺程度阈值时,认为元素在该图像帧完整。或者,在步骤S7022中,电子设备对图像帧进行元素检测时,在得到元素时,同时检测得到元素的是否出现、以及在出现的情况下是否完整。Taking the duration as an example, the duration is the shortest period of time with discontinuities in which the elements of the incomplete element group appear incomplete in the second video. For any incomplete element group, you can use the
重要程度与元素组所属的元素组类型、元素组占据的面积有关;残缺类型与元素组的比例关系有关,残缺类型的取值包括:横向残缺、纵向残缺;周边元素组与所有元素组的位置有关;时段类型为预设的值,时段类型的取值可以为视频开始后(视频结尾前)20秒,转场点后(转场点前)20秒等。The degree of importance is related to the element group type to which the element group belongs and the area occupied by the element group; the defect type is related to the proportional relationship of the element group. The values of the defect type include: horizontal defect and vertical defect; the positions of surrounding element groups and all element groups Relevant; the period type is a preset value, and the value of the period type can be 20 seconds after the video starts (before the end of the video), 20 seconds after the transition point (before the transition point), etc.
其中S7027:计算残缺元素组对视频画面内容的影响结果。S7027: Calculate the result of the influence of the incomplete element group on the content of the video picture.
根据S7026中获得的残缺元素组的元素组属性以及预设的元素组影响画面完整度规则,可以确定该元素组对视频画面内容的影响结果,其中结果包括:残缺元素组影响视频画面内容的完整度、残缺元素组不影响视频画面的完整度。According to the element group attribute of the incomplete element group obtained in S7026 and the preset element group influencing picture integrity rule, the influence result of the element group on the video picture content can be determined, wherein the result includes: the incomplete element group affects the integrity of the video picture content. The degree and incomplete element group do not affect the integrity of the video picture.
元素组影响画面完整度规则、确定残缺元素组对视频画面内容的影响结果可以参考上述术语解释(4)预设的元素组影响画面完整度规则,此处不再赘述。For the rules of element group affecting picture integrity and determining the impact result of incomplete element group on video picture content, please refer to the above term explanation (4) Preset rules of element group affecting picture integrity, which will not be repeated here.
图17为本申请实施例中元素组影响画面完整度规则的一个示例性结构示意图。如图17所示,不同元素组类型的元素组,对应于至少一个元素组影响画面完整度规则。FIG. 17 is a schematic structural diagram of an exemplary structure of an element group affecting a picture integrity rule in an embodiment of the present application. As shown in FIG. 17 , element groups of different element group types, corresponding to at least one element group, affect the picture integrity rule.
元素组影响画面完整度规则还可以有其他的表现形式,例如元素组影响画面完整度规则可以通过评价函数f1呈现。元素组影响画面完整度规则中的参考值和阈值表现为评价函数f1的输入与输出的映射关系。评价函数f1的输入为残缺元素组的元素组属性,输出为该元素组影响视频画面内容完整度或该元素组不影响视频画面内容完整度。The rule that the element group affects the picture integrity may also have other expressions, for example, the rule that the element group affects the picture integrity may be presented through the evaluation function f1. The reference value and the threshold value in the rules of the element group affecting the picture integrity are represented as the mapping relationship between the input and the output of the evaluation function f1. The input of the evaluation function f1 is the element group attribute of the incomplete element group, and the output is that the element group affects the content integrity of the video picture or the element group does not affect the video picture content integrity.
图18为本申请实施例中元素组对画面完整度影响结果的一个示例性示意图。如图18所示,以其他后期文字1401-Group-“原创视频1”元素组为例,其残缺程度为0.15,其持续时间为30s,其重要程度为5。故其他后期文字1401-Group-“原创视频1”元素组的完整度评价函数f1的输出为:该元素组影响原始视频的画面内容完整度,同理,元素组1402-Group-“华为HUAWE”的完整度评价函数f1的输出为:该元素组影响原始视频的画面内容完整度;元素组1403-Group-的完整度评价函数f1的输出为:该元素组不影响原始视频的画面内容完整度;元素组1404-Group-狗粮的完整度评价函数f1的输出为:该元素组不影响原始视频的画面内容完整度;元素组1405-Group-“狗是人类的好朋友”的完整度评价函数f1的输出为:该元素组不影响原始视频的画面内容完整度。FIG. 18 is an exemplary schematic diagram of an effect result of an element group on picture integrity in an embodiment of the present application. As shown in FIG. 18 , taking the element group of other later text 1401-Group-“
S703:得到原始视频画面内容完整度评估结果。S703: Obtain the content integrity evaluation result of the original video image.
可以理解的是,当步骤S701执行后,原始视频被切割为N段第一视频时,此时步骤S702执行后会输出N组数据,其中每一组数据内包括:残缺元素组、残缺元素组对视频画面内容的影响结果、残缺元素组的元素组属性。It can be understood that, when the original video is cut into N segments of the first video after step S701 is executed, N groups of data will be output after step S702 is executed, wherein each group of data includes: incomplete element group, incomplete element group. Influence results on the content of the video screen, element group attributes of the incomplete element group.
本申请实施例中,可以根据N组数据得到原始视频画面内容完整度评估结果,或者根据一组数据得到第二视频的画面内容完整度评估结果,并根据N组画面内容完整度评估结果确定原始视频画面内容完整度评估结果,在此不作限定。In this embodiment of the present application, the content integrity evaluation result of the original video image may be obtained according to N groups of data, or the image content integrity evaluation result of the second video may be obtained according to a group of data, and the original video content integrity evaluation result may be determined according to the N groups of image content integrity evaluation results. The content integrity evaluation result of the video image is not limited here.
本申请实施例中,根据残缺元素组对视频画面内容的影响结果,得到原始视频画面内容完整度评估结果的具体方式可以有很多种:In the embodiment of the present application, according to the result of the influence of the incomplete element group on the content of the video picture, there can be many specific ways to obtain the evaluation result of the integrity of the original video picture content:
(1)根据残缺元素组对视频画面内容的影响结果,得到原始视频画面内容是否完整;(1) According to the effect of the incomplete element group on the content of the video picture, obtain whether the content of the original video picture is complete;
可以通过完整度判断模型判断原始视频画面内容是否完整,并将结果通知给用户或视频生产者。The integrity judgment model can be used to judge whether the content of the original video is complete, and notify the user or video producer of the result.
完整度判断模型的定义、完整度模型的构建和训练可以参考上述术语解释(5)完整度判断模型,此处不再赘述。For the definition of the integrity judgment model, the construction and training of the integrity model, reference may be made to the above term explanation (5) Integrity judgment model, which will not be repeated here.
本申请不对完整度判断模型的输入参数进行限制,在本领域技术人员不付出创造性劳动的情况下,完整度判断模型的输入可以为其他图像特征、数据特征。如对元素组属性中参数的组合、线性叠加、非线性叠加等,或是用于表达元素组残缺程度的其他参数,或是用于表达元素组位置、元素组占据面积等的其他参数。This application does not limit the input parameters of the integrity judgment model, and the input of the integrity judgment model can be other image features and data features without creative efforts by those skilled in the art. Such as the combination, linear superposition, non-linear superposition, etc. of the parameters in the element group attributes, or other parameters used to express the degree of incompleteness of the element group, or other parameters used to express the position of the element group, the area occupied by the element group, etc.
完整度判断模型的表现形式也可以函数或映射关系。以函数f2为例,画面完整度评价函数f2是一种多对二的函数关系,输入为残缺元素组对视频画面内容的影响结果、残缺元素组,输出包括:原始视频画面内容完整、原始视频画面内容不完整。函数f2的输入至输出的映射关系可以是:输入中有至少一个残缺元素组影响视频画面内容的完整度,则输出为原始视频画面内容不完整;输入中有50%及以上比例(影响视频画面内容的完整度的残缺元素组个数/元素组个数)的残缺元素组影响视频画面内容的完整度,则输出为原始视频的画面内容不完整。The representation of the integrity judgment model can also be a function or a mapping relationship. Taking the function f2 as an example, the picture integrity evaluation function f2 is a many-to-two function relationship. The input is the effect of the incomplete element group on the video picture content, and the incomplete element group. The output includes: the original video picture content is complete, the original video Screen content is incomplete. The mapping relationship from the input to the output of the function f2 can be: there is at least one incomplete element group in the input that affects the integrity of the video picture content, then the output is that the original video picture content is incomplete; 50% or more of the input (affecting the video picture content) The incomplete element group of the content integrity (number of incomplete element groups/number of element groups) affects the integrity of the video picture content, and the output is the incomplete picture content of the original video.
可以理解的是,通过完整度判断模型可以快速判断原始视频画面内容是否完整,实现自动的机器审核,提高了审核效率。It is understandable that the integrity judgment model can quickly judge whether the content of the original video is complete, realize automatic machine review, and improve the review efficiency.
(2)根据残缺元素组对视频画面内容的影响结果和原始视频的分类信息,得到原始视频画面内容完整度的评分;(2) According to the influence result of the incomplete element group on the video picture content and the classification information of the original video, obtain the score of the integrity of the original video picture content;
可以通过评分模型确定原始视频画面内容完整度评分,并将结果通知给用户或视频生产者。The content integrity score of the original video picture can be determined through the scoring model, and the result is notified to the user or the video producer.
评分模型的定义、评分模型的构建和训练可以参考上述术语解释(6)评分模型,此处不再赘述。评分模型可以是利用人工智能领域中BP神经网络模型构建出的模型。For the definition of the scoring model, the construction and training of the scoring model, reference may be made to the above term explanation (6) Scoring model, which will not be repeated here. The scoring model may be a model constructed by using the BP neural network model in the field of artificial intelligence.
图19为本申请实施例中BP神经网络模型的一个示例性示意图。FIG. 19 is an exemplary schematic diagram of a BP neural network model in an embodiment of the present application.
如图19所示,BP神经网络模型包括:输入层、隐含层、输出层。As shown in Figure 19, the BP neural network model includes: an input layer, a hidden layer, and an output layer.
本申请不对评分模型的输入参数进行限制,在本领域技术人员不付出创造性劳动的情况下,评分模型的输入可以为其他图像特征、数据特征。如对元素组属性中参数的组合、线性叠加、非线性叠加等,或是用于表达元素组残缺程度的其他参数,或是用于表达元素组位置、元素组位置、元素组占据面积等的其他参数。The present application does not limit the input parameters of the scoring model, and the input of the scoring model can be other image features and data features without creative efforts by those skilled in the art. Such as the combination, linear superposition, non-linear superposition, etc. of the parameters in the attribute of the element group, or other parameters used to express the degree of incompleteness of the element group, or used to express the position of the element group, the position of the element group, the area occupied by the element group, etc. Other parameters.
可以理解的是,通过评分模型可以量化原始视频画面内容完整度,更有效的区分了不同完整度的原始视频,有利于视频生产者或用户更全面的了解原始视频的完整度情况。It is understandable that the scoring model can quantify the content integrity of the original video, which can more effectively distinguish original videos of different integrity, which is helpful for video producers or users to have a more comprehensive understanding of the integrity of the original video.
(3)根据残缺元素组对视频画面内容的影响结果和残缺元素组的元素组属性,得到原始视频画面内容完整度的修改建议。(3) According to the influence result of the incomplete element group on the content of the video picture and the element group attribute of the incomplete element group, the modification suggestion of the content integrity of the original video picture is obtained.
修改建议生成模型的定义、修改建议生成模型的构建和训练可以参考上述术语解释(7)修改建议生成模型,此处不再赘述。修改建议生成模型可以是利用决策树算法构建出的模型。For the definition of the modification suggestion generation model, the construction and training of the modification suggestion generation model, you can refer to the above term explanation (7) Modification suggestion generation model, which will not be repeated here. The modification suggestion generation model may be a model constructed by using a decision tree algorithm.
决策树算法中节点属性的选择与元素组的元素组属性中的参数有关。或者,决策树算法中节点属性的选择与预设的元素组影响画面完整度规则中的阈值和/或参考值有关。The selection of node attributes in the decision tree algorithm is related to the parameters in the element group attribute of the element group. Or, the selection of node attributes in the decision tree algorithm is related to the threshold and/or reference value in the preset element group affecting the picture integrity rule.
元素组的元素组属性中的参数、元素组影响画面完整度规则中的阈值和/或参考值为增益高、纯度高的节点属性,能够有效的提高决策树算法的鲁棒性、准确度、精度等。The parameters in the element group attribute of the element group, the threshold value and/or the reference value in the element group influence picture integrity rule are the node attributes with high gain and high purity, which can effectively improve the robustness, accuracy, and reliability of the decision tree algorithm. accuracy, etc.
可以理解的是,当视频生产者或用户收到电子设备提供的修改建议,可以根据修改建议的内容,有针对性的修改原始视频,提高原始视频的画面内容完整性。其中,修改建议的表现形式可以为审核报告。It is understandable that, when the video producer or user receives the modification suggestion provided by the electronic device, they can modify the original video in a targeted manner according to the content of the modification suggestion, so as to improve the picture content integrity of the original video. Among them, the form of revision suggestion can be an audit report.
本申请不对修改建议生成模型的输入参数或节点参数进行限制,在本领域技术人员不付出创造性劳动的情况下,修改建议生成模型的输入或决策树算法中节点参数的选择可以为其他图像特征、数据特征。如对元素组属性中参数的组合、线性叠加、非线性叠加等,或是用于表达元素组残缺程度的其他参数,或是用于表达元素组位置、元素组位置、元素组占据面积等的其他参数。This application does not limit the input parameters or node parameters of the modification suggestion generation model. Without creative work by those skilled in the art, the input of the modification suggestion generation model or the selection of node parameters in the decision tree algorithm can be other image features, data characteristics. Such as the combination, linear superposition, non-linear superposition, etc. of the parameters in the attribute of the element group, or other parameters used to express the degree of incompleteness of the element group, or used to express the position of the element group, the position of the element group, the area occupied by the element group, etc. Other parameters.
图20为本申请实施例中修改建议生成模型的一个架构示意图。FIG. 20 is a schematic structural diagram of a modification suggestion generation model in an embodiment of the present application.
如图20所示,可以选择残缺元素组占据的面积、持续时间、重要程度、残缺程度等参数作为节点属性,通过决策树算法构建并训练修改建议生成模型。As shown in Figure 20, parameters such as the area occupied by the incomplete element group, duration, importance, degree of incompleteness, etc. can be selected as node attributes, and a decision tree algorithm is used to construct and train a modification suggestion generation model.
示例性的,当任一残缺元素组被输入到利用决策树算法构建并训练的修改建议生成模型后:当该元素组的面积小于视频画面的5%,且该元素组的残缺程度小于1/3,该修改建议生成模型对该残缺元素组的输出为:可不处理(Pass);当该元素组的面积小于视频画面的5%,且该元素组的残缺程度大于1/3,且该元素组的持续时间小于15秒,该修改建议生成模型的对该残缺元素组的输出为:可不处理(Pass);当该元素组的面积小于视频画面的5%,且该元素组的残缺程度大于1/3,且该元素组的持续时间大于15秒且小于30秒,该修改建议生成模型的对该残缺元素组的输出为:裁剪(Cut);当该元素组的面积小于视频画面的5%,且该元素组的残缺程度大于1/3,且该元素组的持续时间大于30秒,该修改建议生成模型的对该残缺元素组的输出为:涂抹(Cover);当该元素组的面积介于视频画面的5%至视频画面的50%,且该持续时间小于1秒,该修改建议生成模型的对该残缺元素组的输出为:可不处理(Pass);当该元素组的面积介于视频画面的5%至视频画面的50%,且该持续时间小于1秒,且重要程度小于等于3,该修改建议生成模型的对该残缺元素组的输出为:涂抹(Cover);当该元素组的面积介于视频画面的5%至视频画面的50%之间,且该持续时间小于1秒,且重要程度大于3,该修改建议生成模型的对该残缺元素组的输出为:剪裁(Cut);当该元素组的面积大于视频画面的50%,该修改建议生成模型的对该残缺元素组的输出为:剪裁(Cut)。Exemplarily, when any incomplete element group is input into the modification suggestion generation model constructed and trained by using the decision tree algorithm: when the area of the element group is less than 5% of the video frame, and the degree of incompleteness of the element group is less than 1/ 3. The output of the modified proposed generation model for the incomplete element group is: can not be processed (Pass); when the area of the element group is less than 5% of the video screen, and the degree of incompleteness of the element group is greater than 1/3, and the element If the duration of the group is less than 15 seconds, this modification suggests that the output of the defective element group of the generated model is: Pass); when the area of the element group is less than 5% of the video screen, and the degree of incompleteness of the element group is greater than 1/3, and the duration of the element group is greater than 15 seconds and less than 30 seconds, this modification suggests that the output of the incomplete element group of the generated model is: Cut; when the area of the element group is less than 5 of the video screen %, and the incompleteness of the element group is greater than 1/3, and the duration of the element group is greater than 30 seconds, this modification suggests that the output of the incomplete element group of the generated model is: smear (Cover); The area is between 5% of the video frame and 50% of the video frame, and the duration is less than 1 second. This modification suggests that the output of the incomplete element group of the generated model is: Pass); when the area of the element group is Between 5% of the video frame and 50% of the video frame, and the duration is less than 1 second, and the degree of importance is less than or equal to 3, this modification suggests that the output of the incomplete element group of the generated model is: smear (Cover); when The area of the element group is between 5% of the video frame and 50% of the video frame, and the duration is less than 1 second, and the degree of importance is greater than 3. This modification suggests that the output of the incomplete element group of the generation model is: Cut (Cut); when the area of the element group is greater than 50% of the video frame, the modification suggests that the output of the incomplete element group of the generation model is: Cut (Cut).
图21为本申请实施例中修改建议生成模型输出的原始视频画面修改建议的一个示意图。FIG. 21 is a schematic diagram of an original video picture modification suggestion output by a modification suggestion generation model in an embodiment of the present application.
如图21所示,原始视频的画面修改建议包括:元素组、元素组的元素组类型、元素组的起始时间、元素组的持续时间、元素组的位置、元素组的修改建议。例如,1401-Group-“原创视频1”元素组的修改建议为剪裁(Cut);1402-Group-“华为HUAWE”元素组的修改建议为涂抹(Cover);1403-Group-狗元素组的修改建议为可不处理(Pass);1404-Group-狗粮元素组的修改建议为可不处理(Pass);1405-Group-“狗是人类的好朋友”元素组的修改建议为剪裁(Cut)。As shown in Figure 21, the picture modification suggestion of the original video includes: element group, element group type of the element group, start time of the element group, duration of the element group, position of the element group, and modification suggestion of the element group. For example, the modification suggestion for 1401-Group-“
值得说明的是,本申请涉及的一种视频画面内容完整度评估方法可以运行在离线的电子设备上,也可以运行在云端的电子设备上;其中,离线的电子设备可以是视频生产者或用户的本地联网或不联网的电子设备上,如电脑、移动终端等;云端的电子设备可以是视频审核系统的云端服务器。It is worth noting that a method for evaluating the integrity of video screen content involved in this application can be run on an offline electronic device or an electronic device in the cloud; wherein, the offline electronic device can be a video producer or user. On the local networked or non-networked electronic devices, such as computers, mobile terminals, etc.; the electronic devices in the cloud can be the cloud server of the video review system.
本申请实施例中电子设备可以为移动电子设备,也可以为PC,此处不作限定。示例性的,图22为本申请实施例提供的电子设备100的一个结构示意图。The electronic device in this embodiment of the present application may be a mobile electronic device or a PC, which is not limited here. Exemplarily, FIG. 22 is a schematic structural diagram of an electronic device 100 provided by an embodiment of the present application.
下面以电子设备100为例对实施例进行具体说明。应该理解的是,电子设备100可以具有比图中所示的更多的或者更少的部件,可以组合两个或多个的部件,或者可以具有不同的部件配置。图中所示出的各种部件可以在包括一个或多个信号处理和/或专用集成电路在内的硬件、软件、或硬件和软件的组合中实现。The embodiment will be described in detail below by taking the electronic device 100 as an example. It should be understood that the electronic device 100 may have more or fewer components than those shown in the figures, may combine two or more components, or may have different component configurations. The various components shown in the figures may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing and/or application specific integrated circuits.
电子设备100可以包括:处理器110,外部存储器接口120,内部存储器121,通用串行总线(universal serial bus,USB)接口130,充电管理模块140,电源管理模块141,电池142,天线1,天线2,移动通信模块150,无线通信模块160,音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,传感器模块180,按键190,马达191,指示器192,摄像头193,显示屏194以及用户标识模块(subscriber identification module,SIM)卡接口195等。其中传感器模块180可以包括压力传感器180A,陀螺仪传感器180B,气压传感器180C,磁传感器180D,加速度传感器180E,距离传感器180F,接近光传感器180G,指纹传感器180H,温度传感器180J,触摸传感器180K,环境光传感器180L,骨传导传感器180M等。The electronic device 100 may include: a
可以理解的是,本发明实施例示意的结构并不构成对电子设备100的具体限定。在本申请另一些实施例中,电子设备100可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。It can be understood that, the structures illustrated in the embodiments of the present invention do not constitute a specific limitation on the electronic device 100 . In other embodiments of the present application, the electronic device 100 may include more or less components than shown, or combine some components, or separate some components, or arrange different components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
处理器110可以包括一个或多个处理单元,例如:处理器110可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processingunit,GPU),图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The
其中,控制器可以是电子设备100的神经中枢和指挥中心。控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。The controller may be the nerve center and command center of the electronic device 100 . The controller can generate an operation control signal according to the instruction operation code and timing signal, and complete the control of fetching and executing instructions.
处理器110中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器110中的存储器为高速缓冲存储器。该存储器可以保存处理器110刚用过或循环使用的指令或数据。如果处理器110需要再次使用该指令或数据,可从所述存储器中直接调用。避免了重复存取,减少了处理器110的等待时间,因而提高了系统的效率。A memory may also be provided in the
在一些实施例中,处理器110可以包括一个或多个接口。接口可以包括集成电路(inter-integrated circuit,I2C)接口,集成电路内置音频(inter-integrated circuitsound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purposeinput/output,GPIO)接口,用户标识模块(subscriber identity module,SIM)接口,和/或通用串行总线(universal serial bus,USB)接口等。In some embodiments, the
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDA)和一根串行时钟线(derail clock line,SCL)。在一些实施例中,处理器110可以包含多组I2C总线。处理器110可以通过不同的I2C总线接口分别耦合触摸传感器180K,充电器,闪光灯,摄像头193等。例如:处理器110可以通过I2C接口耦合触摸传感器180K,使处理器110与触摸传感器180K通过I2C总线接口通信,实现电子设备100的触摸功能。The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL). In some embodiments, the
I2S接口可以用于音频通信。在一些实施例中,处理器110可以包含多组I2S总线。处理器110可以通过I2S总线与音频模块170耦合,实现处理器110与音频模块170之间的通信。在一些实施例中,音频模块170可以通过I2S接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, the
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块170与无线通信模块160可以通过PCM总线接口耦合。在一些实施例中,音频模块170也可以通过PCM接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信。The PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals. In some embodiments, the
UART接口是一种通用串行数据总线,用于异步通信。该总线可以为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器110与无线通信模块160。例如:处理器110通过UART接口与无线通信模块160中的蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块170可以通过UART接口向无线通信模块160传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus used for asynchronous communication. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is typically used to connect the
MIPI接口可以被用于连接处理器110与显示屏194,摄像头193等外围器件。MIPI接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(displayserial interface,DSI)等。在一些实施例中,处理器110和摄像头193通过CSI接口通信,实现电子设备100的拍摄功能。处理器110和显示屏194通过DSI接口通信,实现电子设备100的显示功能。The MIPI interface can be used to connect the
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器110与摄像头193,显示屏194,无线通信模块160,音频模块170,传感器模块180等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured by software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface may be used to connect the
SIM接口可以被用于与SIM卡接口195通信,实现传送数据到SIM卡或读取SIM卡中数据的功能。The SIM interface can be used to communicate with the
USB接口130是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口130可以用于连接充电器为电子设备100充电,也可以用于电子设备100与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如AR设备等。The USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like. The USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through the headphones. The interface can also be used to connect other electronic devices, such as AR devices.
可以理解的是,本发明实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备100的结构限定。在本申请另一些实施例中,电子设备100也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。It can be understood that the interface connection relationship between the modules illustrated in the embodiment of the present invention is only a schematic illustration, and does not constitute a structural limitation of the electronic device 100 . In other embodiments of the present application, the electronic device 100 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
充电管理模块140用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。The
电源管理模块141用于连接电池142,充电管理模块140与处理器110。电源管理模块141接收电池142和/或充电管理模块140的输入,为处理器110,内部存储器121,外部存储器,显示屏194,摄像头193,和无线通信模块160等供电。The power management module 141 is used for connecting the battery 142 , the
电子设备100的无线通信功能可以通过天线1,天线2,移动通信模块150,无线通信模块160,调制解调处理器以及基带处理器等实现。The wireless communication function of the electronic device 100 may be implemented by the
天线1和天线2用于发射和接收电磁波信号。电子设备100中的每个天线可用于覆盖单个或多个通信频带。不同的天线还可以复用,以提高天线的利用率。例如:可以将天线1复用为无线局域网的分集天线。在另外一些实施例中,天线可以和调谐开关结合使用。
移动通信模块150可以提供应用在电子设备100上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块150可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。移动通信模块150可以由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调处理器进行解调。移动通信模块150还可以对经调制解调处理器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,移动通信模块150的至少部分功能模块可以被设置于处理器110中。在一些实施例中,移动通信模块150的至少部分功能模块可以与处理器110的至少部分模块被设置在同一个器件中。The
调制解调处理器可以包括调制器和解调器。其中,调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器170A,受话器170B等)输出声音信号,或通过显示屏194显示图像或视频。在一些实施例中,调制解调处理器可以是独立的器件。在另一些实施例中,调制解调处理器可以独立于处理器110,与移动通信模块150或其他功能模块设置在同一个器件中。The modem processor may include a modulator and a demodulator. Wherein, the modulator is used to modulate the low frequency baseband signal to be sent into a medium and high frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and passed to the application processor. The application processor outputs sound signals through audio devices (not limited to the speaker 170A, the
无线通信模块160可以提供应用在电子设备100上的包括无线局域网(wirelesslocal area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(bluetooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案。无线通信模块160可以是集成至少一个通信处理模块的一个或多个器件。无线通信模块160经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器110。无线通信模块160还可以从处理器110接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。The wireless communication module 160 can provide wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), bluetooth (BT), and global navigation satellite systems applied on the electronic device 100 . (global navigation satellite system, GNSS), frequency modulation (frequency modulation, FM), near field communication technology (near field communication, NFC), infrared technology (infrared, IR) and other wireless communication solutions. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the
在一些实施例中,电子设备100的天线1和移动通信模块150耦合,天线2和无线通信模块160耦合,使得电子设备100可以通过无线通信技术与网络以及其他设备通信。所述无线通信技术可以包括全球移动通讯系统(global system for mobile communications,GSM),通用分组无线服务(general packet radio service,GPRS),码分多址接入(codedivision multiple access,CDMA),宽带码分多址(wideband code division multipleaccess,WCDMA),时分码分多址(time-division code division multiple access,TD-SCDMA),长期演进(long term evolution,LTE),BT,GNSS,WLAN,NFC,FM,和/或IR技术等。所述GNSS可以包括全球卫星定位系统(global positioning system,GPS),全球导航卫星系统(global navigation satellite system,GLONASS),北斗卫星导航系统(beidounavigation satellite system,BDS),准天顶卫星系统(quasi-zenith satellitesystem,QZSS)和/或星基增强系统(satellite based augmentation systems,SBAS)。In some embodiments, the
电子设备100通过GPU,显示屏194,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏194和应用处理器。GPU用于执行数学和几何计算,用于图形渲染。处理器110可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。The electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like. The GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering.
显示屏194用于显示图像,视频等。显示屏194包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emittingdiode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrixorganic light emitting diode,AMOLED),柔性发光二极管(flex light-emittingdiode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot lightemitting diodes,QLED)等。在一些实施例中,电子设备100可以包括1个或N个显示屏194,N为大于1的正整数。Display screen 194 is used to display images, videos, and the like. Display screen 194 includes a display panel. The display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode or an active-matrix organic light-emitting diode (active-matrix organic light-emitting diode). , AMOLED), flexible light-emitting diodes (flex light-emitting diodes, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light-emitting diodes (quantum dot light emitting diodes, QLED) and so on. In some embodiments, the electronic device 100 may include one or N display screens 194 , where N is a positive integer greater than one.
电子设备100可以通过ISP,摄像头193,视频编解码器,GPU,显示屏194以及应用处理器等实现拍摄功能。The electronic device 100 may implement a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
ISP用于处理摄像头193反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给ISP处理,转化为肉眼可见的图像。ISP还可以对图像的噪点,亮度,肤色进行算法优化。ISP还可以对拍摄场景的曝光,色温等参数优化。在一些实施例中,ISP可以设置在摄像头193中。The ISP is used to process the data fed back by the camera 193 . For example, when taking a photo, the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye. ISP can also perform algorithm optimization on image noise, brightness, and skin tone. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, the ISP may be provided in the camera 193 .
摄像头193用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,电子设备100可以包括1个或N个摄像头193,N为大于1的正整数。Camera 193 is used to capture still images or video. The object is projected through the lens to generate an optical image onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. DSP converts digital image signals into standard RGB, YUV and other formats of image signals. In some embodiments, the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1.
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号。例如,当电子设备100在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。A digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy and so on.
视频编解码器用于对数字视频压缩或解压缩。电子设备100可以支持一种或多种视频编解码器。这样,电子设备100可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts group,MPEG)1,MPEG2,MPEG3,MPEG4等。Video codecs are used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos in various encoding formats, for example, moving picture experts group (MPEG) 1, MPEG2, MPEG3, MPEG4, and so on.
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现电子设备100的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, such as the transfer mode between neurons in the human brain, it can quickly process the input information, and can continuously learn by itself. Applications such as intelligent cognition of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
内部存储器121可以包括一个或多个随机存取存储器(random access memory,RAM)和一个或多个非易失性存储器(non-volatile memory,NVM)。The internal memory 121 may include one or more random access memories (RAM) and one or more non-volatile memories (NVM).
随机存取存储器可以包括静态随机存储器(static random-access memory,SRAM)、动态随机存储器(dynamic random access memory,DRAM)、同步动态随机存储器(synchronous dynamic random access memory,SDRAM)、双倍资料率同步动态随机存取存储器(double data rate synchronous dynamic random access memory,DDR SDRAM,例如第五代DDR SDRAM一般称为DDR5 SDRAM)等;Random access memory may include static random-access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronous Dynamic random access memory (double data rate synchronous dynamic random access memory, DDR SDRAM, such as fifth-generation DDR SDRAM is generally called DDR5 SDRAM), etc.;
非易失性存储器可以包括磁盘存储器件、快闪存储器(flash memory)。Non-volatile memory may include magnetic disk storage devices, flash memory.
快闪存储器按照运作原理划分可以包括NOR FLASH、NAND FLASH、3D NAND FLASH等,按照存储单元电位阶数划分可以包括单阶存储单元(single-level cell,SLC)、多阶存储单元(multi-level cell,MLC)、三阶储存单元(triple-level cell,TLC)、四阶储存单元(quad-level cell,QLC)等,按照存储规范划分可以包括通用闪存存储(英文:universalflash storage,UFS)、嵌入式多媒体存储卡(embedded multi media Card,eMMC)等。Flash memory can be divided into NOR FLASH, NAND FLASH, 3D NAND FLASH, etc. according to the operation principle, and can include single-level memory cell (SLC), multi-level memory cell (multi-level memory cell, SLC) according to the level of storage cell potential. cell, MLC), triple-level cell (TLC), quad-level cell (QLC), etc., according to storage specifications, it can include universal flash storage (English: universal flash storage, UFS), Embedded multimedia memory card (embedded multi media Card, eMMC) and so on.
随机存取存储器可以由处理器110直接进行读写,可以用于存储操作系统或其他正在运行中的程序的可执行程序(例如机器指令),还可以用于存储用户及应用程序的数据等。The random access memory can be directly read and written by the
非易失性存储器也可以存储可执行程序和存储用户及应用程序的数据等,可以提前加载到随机存取存储器中,用于处理器110直接进行读写。The non-volatile memory can also store executable programs and store data of user and application programs, etc., and can be loaded into the random access memory in advance for the
外部存储器接口120可以用于连接外部的非易失性存储器,实现扩展电子设备100的存储能力。外部的非易失性存储器通过外部存储器接口120与处理器110通信,实现数据存储功能。例如将音乐,视频等文件保存在外部的非易失性存储器中。The
电子设备100可以通过音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,以及应用处理器等实现音频功能。例如音乐播放,录音等。The electronic device 100 may implement audio functions through an
音频模块170用于将数字音频信息转换成模拟音频信号输出,也用于将模拟音频输入转换为数字音频信号。音频模块170还可以用于对音频信号编码和解码。在一些实施例中,音频模块170可以设置于处理器110中,或将音频模块170的部分功能模块设置于处理器110中。The
扬声器170A,也称“喇叭”,用于将音频电信号转换为声音信号。电子设备100可以通过扬声器170A收听音乐,或收听免提通话。Speaker 170A, also referred to as a "speaker", is used to convert audio electrical signals into sound signals. The electronic device 100 can listen to music through the speaker 170A, or listen to a hands-free call.
受话器170B,也称“听筒”,用于将音频电信号转换成声音信号。当电子设备100接听电话或语音信息时,可以通过将受话器170B靠近人耳接听语音。The
麦克风170C,也称“话筒”,“传声器”,用于将声音信号转换为电信号。当拨打电话或发送语音信息时,用户可以通过人嘴靠近麦克风170C发声,将声音信号输入到麦克风170C。电子设备100可以设置至少一个麦克风170C。在另一些实施例中,电子设备100可以设置两个麦克风170C,除了采集声音信号,还可以实现降噪功能。在另一些实施例中,电子设备100还可以设置三个,四个或更多麦克风170C,实现采集声音信号,降噪,还可以识别声音来源,实现定向录音功能等。The
耳机接口170D用于连接有线耳机。耳机接口170D可以是USB接口130,也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform,OMTP)标准接口,美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口。The earphone jack 170D is used to connect wired earphones. The earphone port 170D may be the USB port 130 or a 3.5mm open mobile terminal platform (OMTP) standard port, a cellular telecommunications industry association of the USA (CTIA) standard port.
压力传感器180A用于感受压力信号,可以将压力信号转换成电信号。在一些实施例中,压力传感器180A可以设置于显示屏194。压力传感器180A的种类很多,如电阻式压力传感器,电感式压力传感器,电容式压力传感器等。电容式压力传感器可以是包括至少两个具有导电材料的平行板。当有力作用于压力传感器180A,电极之间的电容改变。电子设备100根据电容的变化确定压力的强度。当有触摸操作作用于显示屏194,电子设备100根据压力传感器180A检测所述触摸操作强度。电子设备100也可以根据压力传感器180A的检测信号计算触摸的位置。在一些实施例中,作用于相同触摸位置,但不同触摸操作强度的触摸操作,可以对应不同的操作指令。例如:当有触摸操作强度小于第一压力阈值的触摸操作作用于短消息应用图标时,执行查看短消息的指令。当有触摸操作强度大于或等于第一压力阈值的触摸操作作用于短消息应用图标时,执行新建短消息的指令。The pressure sensor 180A is used to sense pressure signals, and can convert the pressure signals into electrical signals. In some embodiments, the pressure sensor 180A may be provided on the display screen 194 . There are many types of pressure sensors 180A, such as resistive pressure sensors, inductive pressure sensors, capacitive pressure sensors, and the like. The capacitive pressure sensor may be comprised of at least two parallel plates of conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes. The electronic device 100 determines the intensity of the pressure according to the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A. The electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A. In some embodiments, touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, the instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, the instruction to create a new short message is executed.
陀螺仪传感器180B可以用于确定电子设备100的运动姿态。在一些实施例中,可以通过陀螺仪传感器180B确定电子设备100围绕三个轴(即,x,y和z轴)的角速度。陀螺仪传感器180B可以用于拍摄防抖。示例性的,当按下快门,陀螺仪传感器180B检测电子设备100抖动的角度,根据角度计算出镜头模组需要补偿的距离,让镜头通过反向运动抵消电子设备100的抖动,实现防抖。陀螺仪传感器180B还可以用于导航,体感游戏场景。The gyro sensor 180B may be used to determine the motion attitude of the electronic device 100 . In some embodiments, the angular velocity of electronic device 100 about three axes (ie, x, y, and z axes) may be determined by gyro sensor 180B. The gyro sensor 180B can be used for image stabilization. Exemplarily, when the shutter is pressed, the gyro sensor 180B detects the shaking angle of the electronic device 100, calculates the distance that the lens module needs to compensate according to the angle, and allows the lens to offset the shaking of the electronic device 100 through reverse motion to achieve anti-shake. The gyro sensor 180B can also be used for navigation and somatosensory game scenarios.
气压传感器180C用于测量气压。在一些实施例中,电子设备100通过气压传感器180C测得的气压值计算海拔高度,辅助定位和导航。The air pressure sensor 180C is used to measure air pressure. In some embodiments, the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist in positioning and navigation.
磁传感器180D包括霍尔传感器。电子设备100可以利用磁传感器180D检测翻盖皮套的开合。在一些实施例中,当电子设备100是翻盖机时,电子设备100可以根据磁传感器180D检测翻盖的开合。进而根据检测到的皮套的开合状态或翻盖的开合状态,设置翻盖自动解锁等特性。The magnetic sensor 180D includes a Hall sensor. The electronic device 100 can detect the opening and closing of the flip holster using the magnetic sensor 180D. In some embodiments, when the electronic device 100 is a flip machine, the electronic device 100 can detect the opening and closing of the flip according to the magnetic sensor 180D. Further, according to the detected opening and closing state of the leather case or the opening and closing state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
加速度传感器180E可检测电子设备100在各个方向上(一般为三轴)加速度的大小。当电子设备100静止时可检测出重力的大小及方向。还可以用于识别电子设备姿态,应用于横竖屏切换,计步器等应用。The acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes). The magnitude and direction of gravity can be detected when the electronic device 100 is stationary. It can also be used to identify the posture of electronic devices, and can be used in applications such as horizontal and vertical screen switching, pedometers, etc.
距离传感器180F,用于测量距离。电子设备100可以通过红外或激光测量距离。在一些实施例中,拍摄场景,电子设备100可以利用距离传感器180F测距以实现快速对焦。Distance sensor 180F for measuring distance. The electronic device 100 can measure the distance through infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 can use the distance sensor 180F to measure the distance to achieve fast focusing.
接近光传感器180G可以包括例如发光二极管(LED)和光检测器,例如光电二极管。发光二极管可以是红外发光二极管。电子设备100通过发光二极管向外发射红外光。电子设备100使用光电二极管检测来自附近物体的红外反射光。当检测到充分的反射光时,可以确定电子设备100附近有物体。当检测到不充分的反射光时,电子设备100可以确定电子设备100附近没有物体。电子设备100可以利用接近光传感器180G检测用户手持电子设备100贴近耳朵通话,以便自动熄灭屏幕达到省电的目的。接近光传感器180G也可用于皮套模式,口袋模式自动解锁与锁屏。Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes. The light emitting diodes may be infrared light emitting diodes. The electronic device 100 emits infrared light to the outside through the light emitting diode. Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 . The electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power. Proximity light sensor 180G can also be used in holster mode, pocket mode automatically unlocks and locks the screen.
环境光传感器180L用于感知环境光亮度。电子设备100可以根据感知的环境光亮度自适应调节显示屏194亮度。环境光传感器180L也可用于拍照时自动调节白平衡。环境光传感器180L还可以与接近光传感器180G配合,检测电子设备100是否在口袋里,以防误触。The ambient light sensor 180L is used to sense ambient light brightness. The electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness. The ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures. The ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket, so as to prevent accidental touch.
指纹传感器180H用于采集指纹。电子设备100可以利用采集的指纹特性实现指纹解锁,访问应用锁,指纹拍照,指纹接听来电等。The fingerprint sensor 180H is used to collect fingerprints. The electronic device 100 can use the collected fingerprint characteristics to realize fingerprint unlocking, accessing application locks, taking pictures with fingerprints, answering incoming calls with fingerprints, and the like.
温度传感器180J用于检测温度。在一些实施例中,电子设备100利用温度传感器180J检测的温度,执行温度处理策略。例如,当温度传感器180J上报的温度超过阈值,电子设备100执行降低位于温度传感器180J附近的处理器的性能,以便降低功耗实施热保护。在另一些实施例中,当温度低于另一阈值时,电子设备100对电池142加热,以避免低温导致电子设备100异常关机。在其他一些实施例中,当温度低于又一阈值时,电子设备100对电池142的输出电压执行升压,以避免低温导致的异常关机。The temperature sensor 180J is used to detect the temperature. In some embodiments, the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold value, the electronic device 100 reduces the performance of the processor located near the temperature sensor 180J in order to reduce power consumption and implement thermal protection. In other embodiments, when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to avoid abnormal shutdown of the electronic device 100 caused by the low temperature. In some other embodiments, when the temperature is lower than another threshold, the electronic device 100 boosts the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperature.
触摸传感器180K,也称“触控面板”。触摸传感器180K可以设置于显示屏194,由触摸传感器180K与显示屏194组成触摸屏,也称“触控屏”。触摸传感器180K用于检测作用于其上或附近的触摸操作。触摸传感器可以将检测到的触摸操作传递给应用处理器,以确定触摸事件类型。可以通过显示屏194提供与触摸操作相关的视觉输出。在另一些实施例中,触摸传感器180K也可以设置于电子设备100的表面,与显示屏194所处的位置不同。Touch sensor 180K, also called "touch panel". The touch sensor 180K may be disposed on the display screen 194 , and the touch sensor 180K and the display screen 194 form a touch screen, also called a “touch screen”. The touch sensor 180K is used to detect a touch operation on or near it. The touch sensor can pass the detected touch operation to the application processor to determine the type of touch event. Visual output related to touch operations may be provided through display screen 194 . In other embodiments, the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the location where the display screen 194 is located.
按键190包括开机键,音量键等。按键190可以是机械按键。也可以是触摸式按键。电子设备100可以接收按键输入,产生与电子设备100的用户设置以及功能控制有关的键信号输入。The keys 190 include a power-on key, a volume key, and the like. Keys 190 may be mechanical keys. It can also be a touch key. The electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
马达191可以产生振动提示。马达191可以用于来电振动提示,也可以用于触摸振动反馈。例如,作用于不同应用(例如拍照,音频播放等)的触摸操作,可以对应不同的振动反馈效果。作用于显示屏194不同区域的触摸操作,马达191也可对应不同的振动反馈效果。不同的应用场景(例如:时间提醒,接收信息,闹钟,游戏等)也可以对应不同的振动反馈效果。触摸振动反馈效果还可以支持自定义。Motor 191 can generate vibrating cues. The motor 191 can be used for vibrating alerts for incoming calls, and can also be used for touch vibration feedback. For example, touch operations acting on different applications (such as taking pictures, playing audio, etc.) can correspond to different vibration feedback effects. The motor 191 can also correspond to different vibration feedback effects for touch operations on different areas of the display screen 194 . Different application scenarios (for example: time reminder, receiving information, alarm clock, games, etc.) can also correspond to different vibration feedback effects. The touch vibration feedback effect can also support customization.
指示器192可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。The
SIM卡接口195用于连接SIM卡。SIM卡可以通过插入SIM卡接口195,或从SIM卡接口195拔出,实现和电子设备100的接触和分离。电子设备100可以支持1个或N个SIM卡接口,N为大于1的正整数。SIM卡接口195可以支持Nano SIM卡,Micro SIM卡,SIM卡等。同一个SIM卡接口195可以同时插入多张卡。所述多张卡的类型可以相同,也可以不同。SIM卡接口195也可以兼容不同类型的SIM卡。SIM卡接口195也可以兼容外部存储卡。电子设备100通过SIM卡和网络交互,实现通话以及数据通信等功能。The
本申请实施例中,该处理器110可以通过调用该内部存储器121中存储的计算机指令,以使得该电子设备100执行本申请实施例中的视频画面内容完整度评估方法。In the embodiment of the present application, the
图23为本发明实施例中电子设备100的一个软件结构示意框图。FIG. 23 is a schematic block diagram of a software structure of the electronic device 100 in the embodiment of the present invention.
分层架构将软件分成若干个层,每一层都有清晰的角色和分工。层与层之间通过软件接口通信。在一些实施例中,将系统分为四层,从上至下分别为应用程序层,应用程序框架层,运行时和系统库,以及内核层。The layered architecture divides the software into several layers, and each layer has a clear role and division of labor. Layers communicate with each other through software interfaces. In some embodiments, the system is divided into four layers, from top to bottom, the application layer, the application framework layer, the runtime and system libraries, and the kernel layer.
应用程序层可以包括一系列应用程序包。The application layer can include a series of application packages.
如图23所示,应用程序包可以包括相机,图库,日历,通话,地图,导航,WLAN,蓝牙,音乐,视频,短信息等应用程序(也可以称为应用)。As shown in FIG. 23 , the application package may include camera, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, short message and other applications (also referred to as applications).
本申请实施例中,该应用程序层还可以包含有视频评估模块。In this embodiment of the present application, the application layer may further include a video evaluation module.
该视频评估模块,可以用于执行本申请实施例中的视频画面内容完整度评估方法。The video evaluation module may be used to execute the method for evaluating the integrity of video image content in the embodiment of the present application.
应用程序框架层为应用程序层的应用程序提供应用编程接口(applicationprogramming interface,API)和编程框架。应用程序框架层包括一些预先定义的函数。The application framework layer provides an application programming interface (application programming interface, API) and a programming framework for the applications of the application layer. The application framework layer includes some predefined functions.
如图23所示,应用程序框架层可以包括窗口管理器,内容提供器,视图系统,电话管理器,资源管理器,通知管理器,本地Profile管理助手(Local Profile Assistant,LPA)等。As shown in FIG. 23 , the application framework layer may include a window manager, a content provider, a view system, a phone manager, a resource manager, a notification manager, a Local Profile Assistant (LPA), and the like.
窗口管理器用于管理窗口程序。窗口管理器可以获取显示屏大小,判断是否有状态栏,锁定屏幕,截取屏幕等。A window manager is used to manage window programs. The window manager can get the size of the display screen, determine whether there is a status bar, lock the screen, take screenshots, etc.
内容提供器用来存放和获取数据,并使这些数据可以被应用程序访问。所述数据可以包括视频,图像,音频,拨打和接听的电话,浏览历史和书签,电话簿等。Content providers are used to store and retrieve data and make these data accessible to applications. The data may include video, images, audio, calls made and received, browsing history and bookmarks, phone book, etc.
视图系统包括可视控件,例如显示文字的控件,显示图片的控件等。视图系统可用于构建应用程序。显示界面可以由一个或多个视图组成的。例如,包括短信通知图标的显示界面,可以包括显示文字的视图以及显示图片的视图。The view system includes visual controls, such as controls for displaying text, controls for displaying pictures, and so on. View systems can be used to build applications. A display interface can consist of one or more views. For example, the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures.
电话管理器用于提供电子设备100的通信功能。例如通话状态的管理(包括接通,挂断等)。The phone manager is used to provide the communication function of the electronic device 100 . For example, the management of call status (including connecting, hanging up, etc.).
资源管理器为应用程序提供各种资源,比如本地化字符串,图标,图片,布局文件,视频文件等等。The resource manager provides various resources for the application, such as localization strings, icons, pictures, layout files, video files and so on.
通知管理器使应用程序可以在状态栏中显示通知信息,可以用于传达告知类型的消息,可以短暂停留后自动消失,无需用户交互。比如通知管理器被用于告知下载完成,消息提醒等。通知管理器还可以是以图表或者滚动条文本形式出现在系统顶部状态栏的通知,例如后台运行的应用程序的通知,还可以是以对话界面形式出现在屏幕上的通知。例如在状态栏提示文本信息,发出提示音,电子设备振动,指示灯闪烁等。The notification manager enables applications to display notification information in the status bar, which can be used to convey notification-type messages, and can disappear automatically after a brief pause without user interaction. For example, the notification manager is used to notify download completion, message reminders, etc. The notification manager can also display notifications in the status bar at the top of the system in the form of graphs or scroll bar text, such as notifications from applications running in the background, and can also display notifications on the screen in the form of a dialog interface. For example, text information is prompted in the status bar, a prompt sound is issued, the electronic device vibrates, and the indicator light flashes.
运行时包括核心库和虚拟机。运行时负责安卓系统的调度和管理。The runtime includes core libraries and virtual machines. The runtime is responsible for the scheduling and management of the Android system.
核心库包含两部分:一部分是java语言需要调用的功能函数,另一部分是核心库。The core library consists of two parts: one is the functional functions that the java language needs to call, and the other is the core library.
应用程序层和应用程序框架层运行在虚拟机中。虚拟机将应用程序层和应用程序框架层的java文件执行为二进制文件。虚拟机用于执行对象生命周期的管理,堆栈管理,线程管理,安全和异常的管理,以及垃圾回收等功能。The application layer and the application framework layer run in virtual machines. The virtual machine executes the java files of the application layer and the application framework layer as binary files. The virtual machine is used to perform functions such as object lifecycle management, stack management, thread management, safety and exception management, and garbage collection.
系统库可以包括多个功能模块。例如:表面管理器(surface manager),媒体库(Media Libraries),三维图形处理库(例如:OpenGL ES),二维图形引擎(例如:SGL)等。A system library can include multiple functional modules. For example: surface manager (surface manager), media library (Media Libraries), 3D graphics processing library (eg: OpenGL ES), 2D graphics engine (eg: SGL), etc.
表面管理器用于对显示子系统进行管理,并且为多个应用程序提供了二维(2-Dimensional,2D)和三维(3-Dimensional,3D)图层的融合。The Surface Manager is used to manage the display subsystem and provides a fusion of two-dimensional (2-Dimensional, 2D) and three-dimensional (3-Dimensional, 3D) layers for multiple applications.
媒体库支持多种常用的音频,视频格式回放和录制,以及静态图像文件等。媒体库可以支持多种音视频编码格式,例如:MPEG4,H.264,MP3,AAC,AMR,JPG,PNG等。The media library supports playback and recording of a variety of commonly used audio and video formats, as well as still image files. The media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
三维图形处理库用于实现3D图形绘图,图像渲染,合成,和图层处理等。The 3D graphics processing library is used to implement 3D graphics drawing, image rendering, compositing, and layer processing.
2D图形引擎是2D绘图的绘图引擎。2D graphics engine is a drawing engine for 2D drawing.
内核层是硬件和软件之间的层。内核层至少包含显示驱动,摄像头驱动,音频驱动,传感器驱动,虚拟卡驱动。The kernel layer is the layer between hardware and software. The kernel layer contains at least display drivers, camera drivers, audio drivers, sensor drivers, and virtual card drivers.
示例性的,图24为本申请实施例提供的电子设备200的一个硬件结构示意图。Exemplarily, FIG. 24 is a schematic diagram of a hardware structure of an
该电子设备200包括:The
输入装置201、输出装置202、处理器203和存储器204(其中电子设备200中的处理器203的数量可以一个或多个,图24中以一个处理器203为例)。在本申请的一些实施例中,输入装置201、输出装置202、处理器203和存储器204可通过总线或其它方式连接,其中,图24中以通过总线连接为例。An
其中,处理器203通过调用存储器204存储的操作指令以使得电子设备200执行本申请实施例中的视频画面内容完整度评估方法。Wherein, the
上述实施例中所用,根据上下文,术语“当…时”可以被解释为意思是“如果…”或“在…后”或“响应于确定…”或“响应于检测到…”。类似地,根据上下文,短语“在确定…时”或“如果检测到(所陈述的条件或事件)”可以被解释为意思是“如果确定…”或“响应于确定…”或“在检测到(所陈述的条件或事件)时”或“响应于检测到(所陈述的条件或事件)”。As used in the above embodiments, the term "when" may be interpreted to mean "if" or "after" or "in response to determining..." or "in response to detecting..." depending on the context. Similarly, depending on the context, the phrases "in determining..." or "if detecting (the stated condition or event)" can be interpreted to mean "if determining..." or "in response to determining..." or "on detecting (the stated condition or event)" or "in response to the detection of (the stated condition or event)".
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机程序指令时,全部或部分地产生按照本申请实施例该的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线)或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如DVD)、或者半导体介质(例如固态硬盘)等。In the above-mentioned embodiments, it may be implemented in whole or in part by software, hardware, firmware or any combination thereof. When implemented in software, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions according to the embodiments of the present application are generated. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions may be stored on or transmitted from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions may be transmitted over a wire from a website site, computer, server or data center (eg coaxial cable, optical fiber, digital subscriber line) or wireless (eg infrared, wireless, microwave, etc.) to another website site, computer, server or data center. The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes one or more available media integrated. The available media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, DVDs), or semiconductor media (eg, solid state drives), and the like.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,该流程可以由计算机程序来指令相关的硬件完成,该程序可存储于计算机可读取存储介质中,该程序在执行时,可包括如上述各方法实施例的流程。而前述的存储介质包括:ROM或随机存储记忆体RAM、磁碟或者光盘等各种可存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented. The process can be completed by instructing the relevant hardware by a computer program, and the program can be stored in a computer-readable storage medium. When the program is executed , which may include the processes of the foregoing method embodiments. The aforementioned storage medium includes: ROM or random storage memory RAM, magnetic disk or optical disk and other mediums that can store program codes.
Claims (29)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011262153.5A CN114500983A (en) | 2020-11-12 | 2020-11-12 | Video picture content integrity evaluation method and electronic equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011262153.5A CN114500983A (en) | 2020-11-12 | 2020-11-12 | Video picture content integrity evaluation method and electronic equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114500983A true CN114500983A (en) | 2022-05-13 |
Family
ID=81490051
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011262153.5A Pending CN114500983A (en) | 2020-11-12 | 2020-11-12 | Video picture content integrity evaluation method and electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114500983A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170180722A1 (en) * | 2015-12-22 | 2017-06-22 | Mti Film, Llc | Automatic Defective Digital Motion Picture Pixel Detection |
US20170302901A1 (en) * | 2015-01-06 | 2017-10-19 | Huawei Technologies Co., Ltd. | Image Processing Method and Apparatus |
CN110324665A (en) * | 2019-07-25 | 2019-10-11 | 深圳创维-Rgb电子有限公司 | A kind of method, terminal and the storage medium of the automatic review of a film by the censor |
CN111062926A (en) * | 2019-12-18 | 2020-04-24 | 腾讯科技(深圳)有限公司 | Video data processing method and device and storage medium |
-
2020
- 2020-11-12 CN CN202011262153.5A patent/CN114500983A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170302901A1 (en) * | 2015-01-06 | 2017-10-19 | Huawei Technologies Co., Ltd. | Image Processing Method and Apparatus |
US20170180722A1 (en) * | 2015-12-22 | 2017-06-22 | Mti Film, Llc | Automatic Defective Digital Motion Picture Pixel Detection |
CN110324665A (en) * | 2019-07-25 | 2019-10-11 | 深圳创维-Rgb电子有限公司 | A kind of method, terminal and the storage medium of the automatic review of a film by the censor |
CN111062926A (en) * | 2019-12-18 | 2020-04-24 | 腾讯科技(深圳)有限公司 | Video data processing method and device and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113194242B (en) | A shooting method and mobile terminal in a telephoto scene | |
CN112580400B (en) | Image optimization method and electronic device | |
CN112262563B (en) | Image processing method and electronic device | |
CN113973173B (en) | Image synthesis method and electronic equipment | |
CN110471606B (en) | Input method and electronic equipment | |
CN114140365B (en) | Event frame-based feature point matching method and electronic equipment | |
CN112529645B (en) | Picture layout method and electronic device | |
CN111881315A (en) | Image information input method, electronic device, and computer-readable storage medium | |
CN113810603B (en) | Point light source image detection method and electronic device | |
CN113542580B (en) | Method and device for removing light spots of glasses and electronic equipment | |
WO2021031862A1 (en) | Data processing method and apparatus thereof | |
CN112541861A (en) | Image processing method, device, equipment and computer storage medium | |
CN112150499A (en) | Image processing method and related device | |
CN113489895B (en) | Method and electronic device for determining recommended scene | |
CN114866860A (en) | Video playing method and electronic equipment | |
WO2024082914A1 (en) | Video question answering method and electronic device | |
CN116193275B (en) | Video processing method and related equipment | |
CN113536834B (en) | Eye bag detection method and device | |
WO2023016014A1 (en) | Video editing method and electronic device | |
CN115268742A (en) | A method and electronic device for generating a cover | |
CN114979457A (en) | Image processing method and related device | |
CN117131213B (en) | Image processing method and related equipment | |
CN117499797A (en) | Image processing method and related equipment | |
CN114500983A (en) | Video picture content integrity evaluation method and electronic equipment | |
CN114942741B (en) | Data transmission method and electronic device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |