CN118764686A - A method and device for generating a media file - Google Patents
- Publication number
- CN118764686A (application CN202411252157.3A)
- Authority
- CN
- China
- Prior art keywords
- user
- script
- server
- media file
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/442—Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
- H04N21/44213—Monitoring of end-user related data
- H04N21/44222—Analytics of user selections, e.g. selection of programs or purchase activity
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/239—Interfacing the upstream path of the transmission network, e.g. prioritizing client content requests
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/25—Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
- H04N21/251—Learning process for intelligent management, e.g. learning user preferences for recommending movies
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Social Psychology (AREA)
- Computer Security & Cryptography (AREA)
- Computing Systems (AREA)
- Human Computer Interaction (AREA)
- Computer Networks & Wireless Communication (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The embodiments of the present application disclose a method and device for generating a media file, relating to the field of Internet application technology. The main technical solution includes: displaying one or more recommended content items on an interactive interface; in response to a user selecting target recommended content from the one or more recommended content items, sending the target recommended content to a server side so that the server side generates a script using the target recommended content, the script including control variable information for generating a media file; in response to the user inputting material information on the interactive interface, sending the material information to the server side; and in response to receiving a media file sent by the server side, the media file having been generated using the script and the material information, displaying the media file on the interactive interface. The present application can simplify user operations, improve user experience, and help users steer the direction in which media files are generated.
Description
Technical Field
The present application relates to the field of Internet application technology, and in particular to a method and device for generating a media file.
Background Art
With the spread of the Internet and computer technology, people increasingly use applications of all kinds to play media files. In particular, with the rapid growth of self-media, large numbers of media files are distributed to the public over the network. For producers of media files, generating media files conveniently and quickly has become a problem that major service providers urgently need to solve.
Summary of the Invention
In view of this, the present application provides a method and device for generating a media file, to help users generate media files conveniently and quickly.
The present application provides the following solutions:
In a first aspect, a method for generating a media file is provided, applied to a client, the method including:
displaying one or more recommended content items on an interactive interface;
in response to a user selecting target recommended content from the one or more recommended content items, sending the target recommended content to a server side so that the server side generates a script using the target recommended content, the script including control variable information for generating a media file;
in response to the user inputting material information through the interactive interface, sending the material information to the server side; and
in response to receiving a media file sent by the server side, the media file having been generated using the script and the material information, displaying the media file on the interactive interface.
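The four client-side steps above can be sketched as a small event-driven class. This is a minimal illustration under stated assumptions, not the implementation described in the application: the `Client` class, the `send` transport, the message fields, and the stub server with its `theme` and `duration_s` control variables are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Client:
    """Client-side flow: each method reacts to one user or server event."""
    send: Callable[[dict], dict]  # hypothetical transport: request -> reply
    displayed: list = field(default_factory=list)

    def show_recommendations(self, items: list) -> None:
        # Step 1: display one or more recommended content items.
        self.displayed = list(items)

    def on_select(self, index: int) -> dict:
        # Step 2: send the selected target so the server side can build a script.
        return self.send({"type": "target", "content": self.displayed[index]})

    def on_material(self, material: str) -> dict:
        # Step 3: send the material information entered by the user.
        reply = self.send({"type": "material", "content": material})
        # Step 4: display the media file returned by the server side.
        self.displayed = [reply["media"]]
        return reply["media"]


def make_stub_server() -> Callable[[dict], dict]:
    """In-memory stand-in for the server side (illustrative only)."""
    state: dict = {}

    def handle(request: dict) -> dict:
        if request["type"] == "target":
            # Assumed control variables; the text does not enumerate them here.
            state["script"] = {"theme": request["content"], "duration_s": 15}
            return {"script": state["script"]}
        media = {"script": state["script"], "material": request["content"]}
        return {"media": media}

    return handle


client = Client(send=make_stub_server())
client.show_recommendations(["travel vlog", "product demo"])
client.on_select(1)
media = client.on_material("photo_of_product.jpg")
```

The user performs only two actions in this sketch, a selection and a material upload, which matches the simplification of user operations that the application claims.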
Optionally, the recommended content includes a recommended media file, a media template, a recommended script theme, or a recommended script template.
Optionally, the recommended content is obtained based on user account information.
Optionally, the interactive interface includes a first interface element;
displaying the one or more recommended content items on the interactive interface includes:
in response to the user inputting requirement information through the first interface element, sending the requirement information to the server side so that the server side obtains one or more recommended content items using the requirement information; and
displaying the one or more recommended content items on the interactive interface.
Optionally, the interactive interface further includes a second interface element, and the method further includes:
in response to the user inputting requirement information through the second interface element, sending the requirement information to the server side so that the server side generates the script using the requirement information.
Optionally, the requirement information is in natural-language form.
Furthermore, the interactive interface further includes a third interface element, and the method further includes:
in response to the user triggering the third interface element, refreshing the one or more recommended content items displayed on the interactive interface.
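One way to picture the refresh behavior is a panel that pulls the next batch from a recommendation source each time the third interface element is triggered. The text does not say how refreshed recommendations are obtained, so the iterator source, the `RecommendationPanel` class, and the batch size below are assumptions made purely for illustration.

```python
from itertools import islice
from typing import Iterator


class RecommendationPanel:
    """Holds the recommended content items currently shown to the user."""

    def __init__(self, source: Iterator[str], page_size: int = 3):
        self._source = source        # hypothetical stream of recommendations
        self._page_size = page_size
        self.displayed: list[str] = []
        self.refresh()               # initial display

    def refresh(self) -> list[str]:
        # Called when the third interface element (e.g. a refresh button) fires.
        batch = list(islice(self._source, self._page_size))
        if batch:  # keep the current items if nothing new is available
            self.displayed = batch
        return self.displayed


panel = RecommendationPanel(
    iter(["travel vlog", "product demo", "recipe short", "unboxing", "tutorial"]),
    page_size=2,
)
first = list(panel.displayed)
second = list(panel.refresh())
third = list(panel.refresh())
fourth = list(panel.refresh())  # source exhausted: display is unchanged
```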
Furthermore, the method further includes:
obtaining the script; and
displaying the script using a fourth interface element of the interactive interface.
Optionally, there are multiple scripts;
the method further includes: displaying the multiple scripts using a fifth interface element of the interactive interface;
displaying the script using the fourth interface element of the interactive interface includes: in response to the user selecting a script from the multiple scripts, displaying the selected script using the fourth interface element; and
sending the material information to the server side includes: sending the selected script and the material information to the server side.
Optionally, the fourth interface element is a text editing box, and the fifth interface element is a card component.
Furthermore, in response to the user editing the script using the fourth interface element,
the edited script is sent to the server side.
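The script selection and editing steps above can be sketched as a small state holder: cards list the candidate scripts, selecting a card copies it into the text editing box, and whatever the box contains at submission time is what goes to the server side. The `ScriptPicker` class, its field names, and the `outbox` list standing in for the server side are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ScriptPicker:
    cards: list[str]        # fifth interface element: one card per script
    editor_text: str = ""   # fourth interface element: text editing box
    outbox: list = field(default_factory=list)  # stands in for the server side

    def select(self, index: int) -> None:
        # Selecting a card shows that script in the text editing box.
        self.editor_text = self.cards[index]

    def edit(self, new_text: str) -> None:
        # The user may adjust the control variables by editing the text.
        self.editor_text = new_text

    def submit(self, material: str) -> dict:
        # The selected (possibly edited) script is sent with the material.
        if not self.editor_text:
            raise ValueError("select a script before submitting")
        message = {"script": self.editor_text, "material": material}
        self.outbox.append(message)
        return message


picker = ScriptPicker(cards=["style: calm, 30s", "style: upbeat, 15s"])
picker.select(1)
picker.edit("style: upbeat, 20s")
sent = picker.submit("demo.jpg")
```

Sending the editor contents rather than the original card text means the edits reach the server side without a separate synchronization step. This is one possible design, not one stated in the text.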
Optionally, in response to the user inputting material information through the interactive interface, sending the material information to the server side includes:
in response to a sixth interface element on the interactive interface being triggered, acquiring the material information input by the user; and
in response to a seventh interface element on the interactive interface being triggered, sending the material information to the server side.
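The two-element pattern above separates collecting material from sending it. A minimal sketch, assuming a hypothetical `MaterialInput` class and a `send` callable that delivers to the server side, might look like this:

```python
from typing import Callable


class MaterialInput:
    """Stages material locally until the user confirms sending."""

    def __init__(self, send: Callable[[list], None]):
        self._send = send            # hypothetical delivery to the server side
        self._staged: list = []

    def on_input_triggered(self, material: str) -> None:
        # Sixth interface element: acquire material, but do not send it yet.
        self._staged.append(material)

    def on_send_triggered(self) -> bool:
        # Seventh interface element: send everything staged so far.
        if not self._staged:
            return False             # nothing to send
        self._send(list(self._staged))
        self._staged.clear()
        return True


received: list = []
form = MaterialInput(send=received.append)
empty_result = form.on_send_triggered()
form.on_input_triggered("demo.jpg")
form.on_input_triggered("intro.mp4")
sent_result = form.on_send_triggered()
```

Because nothing leaves the client until the second trigger, the user can add or correct material first, which is the reliability benefit the application attributes to this arrangement.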
Optionally, there are multiple media files, and displaying the media file on the interactive interface includes:
displaying the multiple media files on the interactive interface.
Furthermore, the method further includes:
in response to the user selecting a target media file from the multiple media files, playing the selected target media file, or storing the selected target media file, or sending information about the selected target media file to the server side.
In a second aspect, a method for generating a media file is provided, applied to a server side, the method including:
sending one or more recommended content items to a client;
generating a script according to target recommended content selected by a user from the one or more recommended content items, the script including control variable information for generating a media file;
generating a media file using the script and material information input by the user; and
sending the media file to the client.
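The server-side steps above form a short pipeline: recommend, generate a script, then generate media. The sketch below is illustrative only. The `Script` fields, the fixed defaults used in place of a generative model, and the dictionary that stands in for a media file are assumptions, not details taken from the application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Script:
    """Control variable information for generating a media file (fields assumed)."""
    theme: str
    style: str
    duration_s: int


def recommend(catalog: list[str], limit: int = 3) -> list[str]:
    # Step 1: pick one or more recommended content items to send to the client.
    return catalog[:limit]


def generate_script(target: str) -> Script:
    # Step 2: derive control variables from the target the user selected.
    # A real system might call a generative model; fixed defaults are used here.
    return Script(theme=target, style="upbeat", duration_s=15)


def generate_media(script: Script, materials: list[str]) -> dict:
    # Step 3: combine the script with the user's material information.
    if not materials:
        raise ValueError("at least one material item is required")
    return {
        "title": f"{script.theme} ({script.style}, {script.duration_s}s)",
        "clips": list(materials),
    }


catalog = ["travel vlog", "product demo", "recipe short", "unboxing"]
options = recommend(catalog)
script = generate_script(options[1])
media = generate_media(script, ["demo.jpg", "logo.png"])  # Step 4: send to client
```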
In a third aspect, a device for generating a media file is provided, arranged at a client, the device including:
an interface display unit, configured to display one or more recommended content items on an interactive interface;
a service-side interaction unit, configured to: in response to a user selecting target recommended content from the one or more recommended content items, send the target recommended content to a server side so that the server side generates a script using the target recommended content, the script including control variable information for generating a media file; in response to the user inputting material information through the interactive interface, send the material information to the server side; and receive a media file sent by the server side, the media file having been generated using the script and the material information;
the interface display unit being further configured to display the media file on the interactive interface.
In a fourth aspect, a device for generating a media file is provided, arranged at a server side, the device including:
a user-side interaction unit, configured to send one or more recommended content items to a client;
a script generation unit, configured to generate a script according to target recommended content selected by a user from the one or more recommended content items, the script including control variable information for generating a media file;
the user-side interaction unit being further configured to obtain, from the client, material information input by the user;
a media generation unit, configured to generate a media file using the script and the material information input by the user;
the user-side interaction unit being further configured to send the media file to the client.
In a fifth aspect, a computer-readable storage medium is provided, on which a computer program is stored; when the program is executed by a processor, the steps of the method described in the first aspect or the second aspect are implemented.
In a sixth aspect, an electronic device is provided, including:
one or more processors; and
a memory associated with the one or more processors, the memory being used to store program instructions which, when read and executed by the one or more processors, perform the steps of the method described in the first aspect or the second aspect.
In a seventh aspect, a computer program product is provided, including a computer program which, when executed by a processor, implements the steps of the method described in the first aspect.
According to the specific embodiments provided in the present application, the present application discloses the following technical effects:
1) With the technical solution provided by the present application, a user only needs to select target recommended content from the one or more recommended content items displayed on the interactive interface and input material information in order to generate a media file conveniently and quickly, which simplifies user operations and improves user experience.
In addition, the present application first displays one or more recommended content items on the interactive interface as a reference, so that the user can select one of them as the target recommended content to guide script generation and thereby control generation of the media file. This helps the user better steer the direction in which the media file is generated, improves the generation result, and spares the user from repeatedly re-entering requirements, which improves operating efficiency and, in turn, user experience.
2) In the present application, recommended media files, media templates, recommended script themes, recommended script templates, and other intuitive forms can be displayed on the interactive interface as references, so that the user can conveniently choose among them to guide script generation.
3) In the present application, the recommended content can be obtained based on user account information, so that one or more recommended content items can be shown to the user for selection without any user operation, and the user can select among them to control the direction in which the media file is generated.
4) In the present application, a first interface element may also be provided on the interactive interface. The user inputs requirement information through the first interface element, and the one or more recommended content items are obtained from the requirement information the user actually entered. This offers the user more precise choices and also gives the user a flexible way to express requirements precisely.
5) In the present application, a second interface element may also be provided on the interactive interface. The user inputs requirement information through the second interface element, and the requirement information is sent to the server side, which uses it as guidance to generate the script more precisely, thereby improving control over the generated media content.
6) In the present application, the user can input requirement information on the interactive interface in natural language. Compared with the conventional approach, in which the user must fill in requirement information field by field as a form dictates, the user can express requirements more naturally, which is more flexible and gives a better user experience.
7) In the present application, if the user is not satisfied with the recommended content, the user can use the third interface element on the interactive interface to refresh the one or more recommended content items displayed. The operation is simple and lets the user find recommended content that meets his or her needs, further improving user experience.
8) In the present application, the generated script can also be displayed on the interactive interface, so that the user can see directly whether the control variables for generating the media file are accurate, further improving user experience.
9) In the present application, multiple scripts may be generated and displayed, and the user can select the script that best meets his or her needs to generate the media file. This gives the user a finer-grained choice of script and thus finer control over the direction in which the media file is generated.
10) In the present application, card components can be used to display the multiple scripts and a text box can be used to display the script selected by the user, so that scripts are displayed intuitively and distinguished clearly, which facilitates user operation and improves user experience.
11) In the present application, the user can edit the script using the fourth interface element and thereby adjust the control variables for generating the media file at a finer granularity, which improves the controllability of generation, makes the generated media file more accurate, and improves user experience.
12) In the present application, the user can input material information using the sixth interface element on the interactive interface and trigger the seventh interface element to send the material information to the server side. The user can thus input material information flexibly and send it to the server side only after confirming it, which ensures both the convenience and flexibility of material input and its reliability.
13) In the present application, multiple media files may be generated, that is, media files can be produced in batches, which is more efficient for the user and offers more choice.
14) In the present application, the user can select a target media file from multiple media files and then play it, store it, or send its information to the server side for subsequent use. The user only needs to make a selection, so the operation is simple and more visual.
Of course, a product implementing the present application does not necessarily need to achieve all of the above advantages at the same time.
Brief Description of the Drawings
To illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, the drawings needed for the embodiments are briefly introduced below. The drawings described below are only some embodiments of the present application; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a diagram of a system architecture to which embodiments of the present application are applicable.
FIG. 2 is a flowchart of an information display method executed by a client, provided in Embodiment 1 of the present application.
FIG. 3 is an interaction flowchart for generating a media file, provided in Embodiment 2 of the present application.
FIGS. 4a, 4b, 4c, 4d, 4e and 4f are example diagrams of the interactive interface at different stages, provided in Embodiment 2 of the present application.
FIG. 5 is another interaction flowchart for generating a media file, provided in Embodiment 3 of the present application.
FIGS. 6a, 6b, 6c, 6d, 6e and 6f are example diagrams of the interactive interface at different stages, provided in Embodiment 3 of the present application.
FIG. 7 is a flowchart of a method for generating a media file executed by a server side, provided in Embodiment 4 of the present application.
FIG. 8 is a schematic block diagram of a device for generating a media file arranged at a client, provided in Embodiment 5 of the present application.
FIG. 9 is a schematic block diagram of a device for generating a media file arranged at a server side, provided in Embodiment 6 of the present application.
FIG. 10 is a schematic block diagram of an electronic device provided in an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings in the embodiments. The described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present application fall within the scope of protection of the present application.
The terms used in the embodiments of the present invention are only for the purpose of describing specific embodiments and are not intended to limit the present invention. The singular forms "a", "said" and "the" used in the embodiments of the present invention and the appended claims are also intended to include plural forms, unless the context clearly indicates otherwise.
It should be understood that the term "and/or" used herein merely describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" can mean: A alone, both A and B, or B alone. In addition, the character "/" herein generally indicates an "or" relationship between the objects before and after it.
Depending on the context, the word "if" as used herein may be interpreted as "at the time of", "when", "in response to determining", or "in response to detecting". Similarly, depending on the context, the phrases "if it is determined" or "if (a stated condition or event) is detected" may be interpreted as "when it is determined", "in response to determining", "when (the stated condition or event) is detected", or "in response to detecting (the stated condition or event)".
With the continuous development of AIGC (Artificial Intelligence Generated Content) technology, generative models can already generate media files for users automatically based on given requirement information. Some conventional methods therefore provide users with an interface in which the user enters requirement information such as industry, product name, and features in a form and uploads materials; a generative model then uses the requirement information to generate a media file such as a video, and the generated media file is associated with the user's account and published. However, this conventional method has the following defects:
1) AIGC is highly random. Even after entering requirement information, the user can hardly steer the direction in which the media file is generated, so controllability is poor. The generated media files therefore often fail to meet the user's needs, and the user has to keep re-entering requirement information to regenerate them, which is inefficient and gives a poor user experience.
2) The user must enter requirement information item by item as the form requires, which is cumbersome and inefficient and gives a poor user experience.
有鉴于此,本申请提供了一种新的思路。为了方便对本申请的理解,首先对本申请所基于的系统架构进行描述。图1示出了可以应用本申请实施例的示例性系统架构,如图1中所示,该系统架构可以包括:内容生产方的终端设备以及运行于该终端设备的用户端、内容消费方的终端设备以及运行于该终端设备的用户端、服务器端、服务器以及媒体库。In view of this, the present application provides a new idea. In order to facilitate the understanding of the present application, the system architecture on which the present application is based is first described. Figure 1 shows an exemplary system architecture to which an embodiment of the present application can be applied. As shown in Figure 1, the system architecture may include: a terminal device of a content producer and a user terminal running on the terminal device, a terminal device of a content consumer and a user terminal running on the terminal device, a server terminal, a server, and a media library.
The user end involved in the embodiments of the present application may be a client application running on a terminal device, a mini program, a Web application running in a browser, or the like.
Terminal devices may include, but are not limited to, smart mobile terminals, wearable devices, PCs (Personal Computers), smart home devices, and so on. Smart mobile terminals may include mobile phones, tablet computers, laptops, PDAs (Personal Digital Assistants), Internet-connected vehicle terminals, and so on. Wearable devices may include smart watches, smart glasses, smart bracelets, VR (Virtual Reality) devices, AR (Augmented Reality) devices, mixed reality devices (i.e., devices that support both virtual reality and augmented reality), and so on. Smart home devices may include smart TVs, smart refrigerators with display screens, and so on.
The server side is deployed on the server. It may be an application running on the server, or a functional unit such as a plug-in or a software development kit (SDK) embedded in an application.
The server provides media services and may be a single server, a server group consisting of multiple servers, or a cloud server. A cloud server, also called a cloud computing server or cloud host, is a host product in the cloud computing service system that addresses the difficult management and weak service scalability of traditional physical hosts and virtual private server (VPS) services.
In the embodiments of the present application, a user acting as a content producer can use the user end on a terminal device to interact with the server side to generate media files. The media library can be used to store and maintain the generated media files.
A user acting as a content consumer can use the user end on a terminal device to interact with the server side to request, obtain and play media files.
It should be understood that the numbers of user ends, terminal devices, server sides, servers and media libraries in FIG. 1 are merely illustrative. There may be any number of user ends, terminal devices, server sides, servers and media libraries, as required by the implementation.
Embodiment 1
FIG. 2 is a flowchart of a method for generating a media file, executed by a user end, according to an embodiment of the present application. The method can be executed by the user end of the content producer in the system shown in FIG. 1. As shown in FIG. 2, the method may include the following steps:
Step 201: Display one or more pieces of recommended content on the interactive interface.
In the embodiments of the present application, after a user acting as a content producer logs in to the service, the user end can display the interactive interface to the user. Alternatively, the user accesses the interactive interface through a specified link, and the user end displays it. As a further alternative, the user end displays the interactive interface by default once the user opens the user end.
The interface (which may also be a page) referred to in the embodiments of the present application is a collection of visual elements presented to the user. Because the user can exchange information through the interface, it is called an interactive interface. The elements on the interface are interface elements and may include fonts, icons, components, and so on. A component on the interface is an object obtained by simply encapsulating data and methods, and may also be called a control. Components may include buttons, input boxes, drop-down menus, dialog boxes, navigation bars, cards, and so on.
The recommended content involved in the embodiments of the present application represents the general "direction" in which a media file is generated. This "direction" is usually reflected in aspects such as style, theme and effect. The user can select one piece of recommended content as the reference for subsequently generating a script and, from it, a media file. The recommended content may include, but is not limited to: recommended media files, media templates, recommended script themes, recommended script templates, and so on.
A recommended media file is an existing media file instance recommended to the user as recommended content, so that the user can take a concrete media file instance as a reference. A media template is a pre-designed template of media layout and style; a template usually provides a structured framework for generating media files. The script involved in the embodiments of the present application is the key text that guides the generation of a media file and mainly includes control variable information for generating the media file. Accordingly, a script theme is the main information expressed by a script, that is, a description of the control variable information contained in the script. A script template is a structured framework, namely a pre-designed template of control variable information that embodies the media layout and style.
Step 203: In response to the user selecting target recommended content from the one or more pieces of recommended content, send the target recommended content to the server side, so that the server side generates a script using the target recommended content, the script including control variable information for generating a media file.
Step 205: In response to the user inputting material information through the interactive interface, send the material information to the server side.
Material refers to the raw material used to generate a media file. It is usually unorganized, unprocessed and scattered, but it forms the basis and main content of the generated media file.
The media files involved in the embodiments of the present application may be videos, animations, pictures, audio, and so on. For example, if the media file is a video, the material information may be text, pictures, audio, and so on. As another example, if the media file is audio, the material information may be audio clips, text, and so on. As yet another example, if the media file is a picture, the material information may be icons, text, graphic elements, and so on. For ease of description, the subsequent embodiments all take video generation as an example; other forms of media files are implemented in a similar way.
Step 207: In response to receiving a media file sent by the server side, display the media file on the interactive interface, where the media file is generated using the script and the above material information.
As can be seen from the above flow, with the technical solution provided by the present application, the user only needs to select target recommended content from the one or more pieces of recommended content displayed on the interactive interface and input material information in order to generate a media file conveniently and quickly, which simplifies user operation and improves the user experience.
In addition, the present application first displays one or more pieces of recommended content on the interactive interface as a reference, so that the user can select one of them as the target recommended content to guide the generation of the script and thereby control the generation of the media file. This helps the user better steer the direction in which the media file is generated, improves the quality of the generated media file, and spares the user from repeatedly re-entering requirements, which improves the user's operating efficiency and, in turn, the user experience.
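The four steps of Embodiment 1 can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: `StubServerSide`, `generate_script`, `generate_media` and `run_user_end_flow` are hypothetical names that do not appear in the application, and the stub replaces the generative model with a trivial dictionary.

```python
from typing import Dict, List


class StubServerSide:
    """Hypothetical in-memory stand-in for the server side."""

    def __init__(self) -> None:
        self.script: Dict[str, object] = {}

    def generate_script(self, target_recommendation: str) -> Dict[str, object]:
        # A real server side would derive the control variables with a model;
        # this stub only records which recommendation the script is based on.
        self.script = {"based_on": target_recommendation, "duration_s": 15}
        return self.script

    def generate_media(self, materials: List[str]) -> Dict[str, object]:
        # The media file is generated from the script plus the material information.
        return {"script": dict(self.script), "materials": list(materials)}


def run_user_end_flow(
    server: StubServerSide,
    recommendations: List[str],
    choice_index: int,
    materials: List[str],
) -> Dict[str, object]:
    displayed = list(recommendations)         # step 201: display recommended content
    target = displayed[choice_index]          # step 203: the user selects the target
    server.generate_script(target)            # step 203: target sent to the server side
    media = server.generate_media(materials)  # step 205: material information sent
    return media                              # step 207: media file to be displayed
```

Calling `run_user_end_flow(StubServerSide(), ["instance A", "instance B"], 1, ["logo.png"])` returns a media record whose script is based on "instance B", mirroring how the selected recommendation, rather than a filled-in form, determines the script.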
The above steps are described in detail below with reference to embodiments. It should be noted that qualifiers such as "first" and "second" in the present application imply no limitation in terms of size, order or quantity and are used only to distinguish names. For example, "first interface element" and "second interface element" merely distinguish the interface elements by name.
Embodiment 2
To facilitate understanding of the entire interaction process, a preferred interaction process provided by an embodiment of the present application is described below, taking the interaction flow shown in FIG. 3 as an example. As shown in FIG. 3, the method may include the following process:
In step 301, the user logs in to the service through the user end.
In the embodiments of the present application, the user can log in to the service with an account, and the server side obtains the user account information.
In step 302, the server side obtains the user account information, generates one or more video instances according to the user account information, and sends the one or more video instances to the user end.
After the user logs in, the server side obtains the user account information. The user account information may be attribute information such as the user account, the industry to which the user belongs, and the user's product information.
It should be noted that the user information (including but not limited to user device information and user personal information) and data (including but not limited to data used for analysis, stored data and displayed data) involved in the present application are all information and data authorized by the user or fully authorized by all parties. The collection, use and processing of such data must comply with the applicable laws, regulations and standards of the relevant countries and regions, and a corresponding operation entry is provided for the user to grant or refuse authorization.
The server side determines the recommended content according to the user account information, so one or more pieces of recommended content can be displayed to the user without any user operation. This embodiment takes video instances as an example. A video instance is a video that already exists in the video library and is offered as an example for the user to consult and select. The video instances recommended by the server side should reflect different video generation "directions" as far as possible; videos with different generation "directions" usually differ considerably in their control variable information.
In step 303, the user end displays the one or more video instances on the interactive interface.
As shown in FIG. 4a, the interactive interface may display a total of 10 video instances: video instance 1, video instance 2, ..., video instance 10. The number of video instances shown in the figure is illustrative, and the present application imposes no particular limitation on it.
In addition to generating the recommended content (video instances in this embodiment) according to the user account information, the user may also input demand information on their own initiative, and the server side uses the demand information to generate video instances and sends them to the user end. For example, as shown in FIG. 4a, the interactive interface may include a first interface element. The first interface element shown in the figure is a text input box; the user can enter demand information in it and click "Send" to trigger the user end to send the demand information to the server side. Besides a text input box, the first interface element may take other forms, such as a voice input component or an option component.
The demand information input by the user may be a video theme, style, object, industry, and so on. In the text input box shown in FIG. 4a, the user can enter demand information flexibly in natural language. The server side can use a large language model to understand the natural-language demand information and generate recommended video instances.
In this embodiment, the user inputs demand information through the first interface element, so that one or more pieces of recommended content can be obtained according to the demand information the user actually entered. This gives the user a flexible way to express needs precisely. Moreover, compared with the conventional approach in which the user must enter demand information item by item as required by a form, the user can express needs in a more natural way, which is more flexible and provides a better user experience.
In step 304, in response to the user selecting a target video instance from the one or more video instances, the user end sends the target video instance to the server side.
The user can play a video instance, for example by clicking its play component, to check whether it meets their needs. If a video instance meets the user's needs, the user can select it as the target video instance.
Furthermore, if the user is not satisfied with any of the video instances on the interactive interface, the user can refresh the displayed video instances through a third interface element on the interactive interface. In response to the user triggering the third interface element shown in FIG. 4a, the user end can send a refresh request to the server side. The server side regenerates one or more video instances and sends them to the user end, which refreshes the displayed video instances.
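The refresh interaction can be illustrated with a small sketch. It assumes, purely for illustration, that the server side serves successive batches from a fixed pool of candidates; the application only states that the server side regenerates video instances, so the paging strategy and the names `RecommendationFeed`, `next_batch` and `on_refresh_triggered` are hypothetical.

```python
from typing import List


class RecommendationFeed:
    """Hypothetical server-side helper that serves recommendations in batches."""

    def __init__(self, pool: List[str], batch_size: int) -> None:
        if batch_size <= 0:
            raise ValueError("batch_size must be positive")
        self._pool = list(pool)
        self._batch_size = batch_size
        self._offset = 0

    def next_batch(self) -> List[str]:
        """Return the next batch, wrapping around when the pool is exhausted."""
        if not self._pool:
            return []
        size = min(self._batch_size, len(self._pool))
        batch = [self._pool[(self._offset + i) % len(self._pool)] for i in range(size)]
        self._offset = (self._offset + size) % len(self._pool)
        return batch


def on_refresh_triggered(feed: RecommendationFeed) -> List[str]:
    """User-end handler for the third interface element: request a new batch."""
    return feed.next_batch()
```

Each trigger of the third interface element replaces the displayed batch with the next one, so the user sees different video instances without entering any new demand information.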
In step 305, the server side generates a script using the target video instance and returns it to the user end. The script includes control variable information for generating a media file.
In the embodiments of the present application, the script mainly includes control variable information for generating a media file. The control variable information may include, but is not limited to, content-related variables, technology-related variables, effect-related variables, interaction-related variables, audio-related variables, and so on. Content-related variables may include scene information, story structure, material positions, arrangement style, and so on. Technology-related variables may include frame rate, frame size, duration, and so on. Effect-related variables may include brightness, contrast, saturation, transition effects, special effects, and so on. Interaction-related variables may include variables that embody interactive effects, such as hotspot elements and icons. Audio-related variables may include background music, voice-over, and so on.
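One way to picture such a script is as a structured record with one group of fields per variable category. The sketch below is an assumption for illustration only: the application does not prescribe a script format, and the class `Script`, its field names and defaults, and the JSON serialization are all hypothetical.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List, Tuple


@dataclass
class Script:
    """Hypothetical container for the control variable information of a script."""

    # Content-related variables
    scenes: List[str] = field(default_factory=list)
    story_structure: str = ""
    arrangement_style: str = ""
    # Technology-related variables
    frame_rate: int = 30
    frame_size: Tuple[int, int] = (1920, 1080)
    duration_s: float = 15.0
    # Effect-related variables
    brightness: float = 1.0
    contrast: float = 1.0
    saturation: float = 1.0
    transition: str = "none"
    # Interaction-related variables
    hotspots: List[str] = field(default_factory=list)
    # Audio-related variables
    background_music: str = ""
    voice_over: str = ""


def script_to_text(script: Script) -> str:
    """Serialize the script to JSON text, e.g. for display in a text editing box."""
    return json.dumps(asdict(script), ensure_ascii=False, indent=2)


def script_from_text(text: str) -> Script:
    """Parse (possibly user-edited) JSON text back into a Script."""
    data = json.loads(text)
    if "frame_size" in data:
        # JSON has no tuple type, so restore the tuple after parsing.
        data["frame_size"] = tuple(data["frame_size"])
    return Script(**data)
```

A textual form of this kind would let the same script be shown to the user, edited, and sent back to the server side, although any structured or free-text representation could serve the same purpose.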
When generating the script from the target video instance, the server side may use, for example, a script generation model to extract the script from the target video instance. The script generation model may be implemented in various ways, such as a deep learning model or a large model, and the present application imposes no limitation on this.
In this embodiment, the generated script is sent to the user end for display. Alternatively, the script need not be sent to the user end.
In step 306, the user end displays the script on the interactive interface.
In this embodiment, the generated script can also be displayed on the interactive interface, so that the user can see directly whether the control variables for generating the media file are accurate, which further improves the user experience. As shown in FIG. 4b, the interactive interface may include a fourth interface element, which is used to display the script. The fourth interface element in the figure is a text editing box; other forms of interface element may also be used. The "XXXXXX……" in the fourth interface element represents the content of the script.
In this embodiment, if the user is not satisfied with the content of the script, the user can edit the script through the fourth interface element. In response to the user editing the script through the fourth interface element, the user end sends the edited script to the server side. For example, the user can click the "Edit" component shown in FIG. 4b and add, modify or delete script content in the text editing box. In this way, the user can adjust the control variables for generating the media file at a finer granularity, which improves the controllability of generation, makes the generated media file more accurate, and improves the user experience.
In step 307, in response to the user inputting material information on the interactive interface, the user end sends the material information to the server side.
A sixth interface element may be provided to the user on the interactive interface, as shown in FIG. 4b. In response to the sixth interface element being triggered, the user end obtains the material information input by the user. As shown in FIG. 4c, after the user triggers the sixth interface element, a window may pop up on the interactive interface. Through this window the user can select materials to input from, for example, local storage or a server-side material library, and click the "OK" component to complete the input and return to the interactive interface shown in FIG. 4d.
The interactive interface may further include a seventh interface element. In response to the seventh interface element being triggered, the user end sends the material information to the server side. As shown in FIG. 4d, when the user triggers the seventh interface element, i.e., the "Generate Video" component, the input material information is sent to the server side.
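The division of labor between the sixth and seventh interface elements (collect materials first, send them only on "Generate Video") can be sketched as a pair of event handlers. The class `InteractiveInterface`, the element keys `"sixth"` and `"seventh"`, and the `send_to_server` callback are hypothetical names introduced only for this illustration.

```python
from typing import Callable, Dict, List


class InteractiveInterface:
    """Hypothetical user-end model that maps interface elements to handlers."""

    def __init__(self, send_to_server: Callable[[List[str]], None]) -> None:
        self._send_to_server = send_to_server
        self.pending_materials: List[str] = []
        self._handlers: Dict[str, Callable[..., None]] = {
            "sixth": self._on_add_material,
            "seventh": self._on_generate,
        }

    def trigger(self, element: str, *args) -> None:
        """Dispatch a trigger of the named interface element to its handler."""
        self._handlers[element](*args)

    def _on_add_material(self, selected: List[str]) -> None:
        # Materials chosen in the pop-up window are only collected locally.
        self.pending_materials.extend(selected)

    def _on_generate(self) -> None:
        # "Generate Video": send everything collected so far to the server side.
        self._send_to_server(list(self.pending_materials))
```

With this split, nothing reaches the server side until the user explicitly triggers the seventh interface element, so materials can be added in several rounds before generation starts.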
In step 308, the server side generates one or more video files using the script and the material information, and sends the generated video files to the user end.
In this step, the server side may use AIGC technology to generate video files from the script and the material information. One video file or multiple video files may be generated. In addition, because AIGC technology needs some time to generate a video file, after the user clicks the "Generate Video" component shown in FIG. 4d, the interactive interface shown in FIG. 4e may be displayed with a prompt such as "Video is being generated" to ask the user to wait.
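The waiting behavior can be sketched as a simple polling loop on the user end. Polling is an assumption made for this sketch; the application does not specify how the user end learns that generation has finished. The function `wait_for_videos` and its parameters are hypothetical.

```python
import time
from typing import Callable, List


def wait_for_videos(
    poll: Callable[[], List[str]],
    show_prompt: Callable[[str], None],
    interval_s: float = 1.0,
    max_polls: int = 60,
) -> List[str]:
    """Show a waiting prompt, then poll until at least one video is available."""
    show_prompt("Video is being generated")
    for _ in range(max_polls):
        videos = poll()  # hypothetical call that asks the server side for results
        if videos:
            return videos
        time.sleep(interval_s)
    raise TimeoutError("video generation did not finish in time")
```

Here `poll` stands for whatever request the user end uses to ask the server side for finished videos, and `show_prompt` stands for rendering the prompt on the interactive interface; a push-based notification from the server side would work equally well.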
In step 309, the user end displays the one or more video files on the interactive interface.
As shown in FIG. 4f, a total of five generated video files (video 1, video 2, ..., video 5) are displayed on the interactive interface.
In step 310, in response to the user selecting a target video file from the one or more video files, the user end sends the information of the target video file to the server side.
For the one or more video files displayed on the interactive interface, the user can select a target video file for different kinds of subsequent processing. For example, the user can select the play component corresponding to the target video file to play it. As another example, the user can trigger a storage component to store the target video file. As yet another example, the information of the selected target video file can be sent to the server side.
As shown in FIG. 4f, the user can select video 4 as the target video file and then choose "Publish to Home Page", whereupon the target video file is carried in a publish request and sent to the server side.
In step 311, the target video file is associated with the user account and then published.
For example, it may be published to the home page of the media service, to the user's home page, to a page under a specific subject, and so on.
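Associating a published video with the user account can be sketched as a small registry keyed by account. The names `PublishRequest`, `PublishingRegistry` and the destination strings are hypothetical; they only mirror the examples given in the text and are not part of the application.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical destination names mirroring the examples in the text.
DESTINATIONS = {"service_home", "user_home", "subject_page"}


@dataclass(frozen=True)
class PublishRequest:
    """Hypothetical publish request sent from the user end to the server side."""

    account_id: str
    video_id: str
    destination: str = "user_home"


class PublishingRegistry:
    """Hypothetical server-side record of which account published which video."""

    def __init__(self) -> None:
        self._by_account: Dict[str, List[PublishRequest]] = {}

    def publish(self, request: PublishRequest) -> None:
        if request.destination not in DESTINATIONS:
            raise ValueError(f"unknown destination: {request.destination}")
        # Associate the video with the user account, then record the publication.
        self._by_account.setdefault(request.account_id, []).append(request)

    def published_by(self, account_id: str) -> List[PublishRequest]:
        return list(self._by_account.get(account_id, []))
```

For instance, publishing `PublishRequest("acct-1", "video 4")` records video 4 under account `acct-1` with the default destination of the user's home page.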
The flow shown in FIG. 3 in effect gives the user the ability to "generate a video in the same style": an existing video instance is taken as a reference, and a video in the same style as the video instance selected by the user is generated on the basis of the material information provided by the user.
Embodiment 3
Another preferred interaction process provided by an embodiment of the present application is described below, taking the interaction flow shown in FIG. 5 as an example. As shown in FIG. 5, the method may include the following process:
In step 501, the user logs in to the service through the user end.
In the embodiments of the present application, the user can log in to the service with an account, and the server side obtains the user account information.
In step 502, the server side obtains the user account information, generates one or more script themes according to the user account information, and sends the one or more script themes to the user end.
After the user logs in, the server side obtains the user account information. The user account information may be attribute information such as the user account, the industry to which the user belongs, and the user's product information.
It should be noted that the user information (including but not limited to user device information and user personal information) and data (including but not limited to data used for analysis, stored data and displayed data) involved in the present application are all information and data authorized by the user or fully authorized by all parties. The collection, use and processing of such data must comply with the applicable laws, regulations and standards of the relevant countries and regions, and a corresponding operation entry is provided for the user to grant or refuse authorization.
The server side determines the recommended content according to the user account information, so one or more pieces of recommended content can be displayed to the user without any user operation. This embodiment takes script themes as an example. A script theme is the main information expressed by a script, that is, a description of the control variable information contained in the script. The script themes recommended by the server side should reflect different video generation "directions" as far as possible; videos with different generation "directions" usually differ considerably in their control variable information.
In step 503, the user end displays the one or more script themes on the interactive interface.
As shown in FIG. 6a, the interactive interface may display script theme 1, script theme 2, script theme 3 and script theme 4. The field, keywords and other details of each script theme may be displayed for the user to review and select. The number of script themes shown in the figure is illustrative, and the present application imposes no particular limitation on it.
In addition to generating the recommended content (script themes in this embodiment) according to the user account information, the user may also input demand information on their own initiative, and the server side uses the demand information to generate script themes and sends them to the user end. This part is similar to Embodiment 2 and is not repeated here.
The demand information input by the user may be a video theme, style, object, industry, and so on. The server side can use a large language model to understand the natural-language demand information and generate recommended script themes.
In step 504, in response to the user selecting a target script theme from the one or more script themes, the user end sends the target script theme to the server side.
In step 505, the server side generates multiple scripts using the target script theme and returns them to the user end. Each script includes control variable information for generating a media file.
In the embodiments of the present application, the script mainly includes control variable information for generating a media file. The control variable information may include, but is not limited to, content-related variables, technology-related variables, effect-related variables, interaction-related variables, audio-related variables, and so on. Content-related variables may include scene information, story structure, material positions, arrangement style, and so on. Technology-related variables may include frame rate, frame size, duration, and so on. Effect-related variables may include brightness, contrast, saturation, transition effects, special effects, and so on. Interaction-related variables may include variables that embody interactive effects, such as hotspot elements and icons. Audio-related variables may include background music, voice-over, and so on.
When generating scripts from the target script theme, the server side may use, for example, a script generation model to produce the scripts. The script generation model may be implemented in various ways, such as a deep learning model or a large model, and the present application imposes no limitation on this.
In this embodiment, generating multiple scripts is taken as an example; alternatively, only one script may be generated. In addition, the scripts generated in this embodiment need not be sent to the user end for display.
In step 506, the user end displays the multiple scripts on the interactive interface.
A fifth interface element may be used to display the multiple scripts on the interactive interface. As shown in FIG. 6b, the fifth interface element may take the form of, for example, card components that display script 1, script 2 and script 3. Besides card components, other forms of interface element, such as text boxes, may also be used.
If the user is not satisfied with the generated scripts, the user can input demand information on their own initiative through a second interface element on the interactive interface. In response to the user inputting demand information through the second interface element, the user end sends the demand information to the server side, so that the server side generates scripts using the demand information and sends the newly generated scripts to the user end for display.
用户输入的需求信息可以是视频主题、风格、对象、行业等等,甚至可以是具体的控制变量信息。在图6b中所示的文本输入框中,用户可以采用自然语言的形式灵活的输入需求信息。服务器端可以使用大语言模型来理解自然语言形式的需求信息,并生成脚本。这种方式能够将用户输入的需求信息作为指导,更加精准地生成脚本信息,进而提高生成的媒体内容的效果。The demand information input by the user can be the video theme, style, object, industry, etc., or even specific control variable information. In the text input box shown in FIG6b, the user can flexibly input the demand information in the form of natural language. The server can use a large language model to understand the demand information in the form of natural language and generate a script. This method can use the demand information input by the user as a guide to generate script information more accurately, thereby improving the effect of the generated media content.
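The path from natural-language requirement information to a script can be sketched as follows. This is an illustration under stated assumptions: the prompt wording, the JSON reply format, and the `llm` callable (standing in for whatever large language model the server uses) are all hypothetical and not taken from the application.

```python
import json
from typing import Callable

# Variable groups the reply is expected to contain (assumed key names).
REQUIRED_GROUPS = ("content", "technical", "effect", "interaction", "audio")

PROMPT_TEMPLATE = (
    "You write scripts that control video generation.\n"
    "User requirement: {requirement}\n"
    "Reply with one JSON object whose keys are: " + ", ".join(REQUIRED_GROUPS) + "."
)


def generate_script(requirement: str, llm: Callable[[str], str]) -> dict:
    """Turn free-form requirement text into a script dict via an injected model."""
    prompt = PROMPT_TEMPLATE.format(requirement=requirement.strip())
    script = json.loads(llm(prompt))
    if not isinstance(script, dict):
        raise ValueError("the model reply is not a JSON object")
    missing = [group for group in REQUIRED_GROUPS if group not in script]
    if missing:
        raise ValueError(f"script is missing variable groups: {missing}")
    return script


def fake_llm(prompt: str) -> str:
    # Stand-in model for demonstration; a real deployment would call an actual LLM.
    return json.dumps({
        "content": {"scene": "coffee shop"},
        "technical": {"duration_s": 15},
        "effect": {},
        "interaction": {},
        "audio": {"background_music": "jazz"},
    })


script = generate_script("a cozy 15-second ad for a coffee shop", fake_llm)
```

Injecting the model as a callable keeps the sketch independent of any particular model API and makes the validation step easy to exercise.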
In step 507, in response to the user selecting one of the scripts, the client displays that script in editable form on the interactive interface.
In this embodiment, if the user selects one of the multiple scripts, the selected script may be displayed in editable form using the fourth interface element on the interactive interface. For example, if the user selects script 2 in FIG. 6b, the interactive interface displays the content of that script in a text editing box, as shown in FIG. 6c.
If the user is not satisfied with the content of the script, the user may edit it using the fourth interface element. In response to the edits the user makes through the fourth interface element, the client sends the edited script to the server. For example, the user may click the "Edit" component shown in FIG. 6c and add, modify, or delete script content in the text editing box. In this way the user can adjust the control variables for generating the media file at a finer granularity, which makes generation more controllable, makes the generated media file more accurate, and improves the user experience.
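The add, modify, and delete edits described above can be sketched as operations on a script held as a nested dictionary. The dotted-path addressing, the `DELETE` marker, and the function name are assumptions made for illustration; the application only states that the edited script is sent to the server.

```python
import copy

# Hypothetical marker meaning "remove this variable from the script".
DELETE = object()


def apply_edits(script: dict, edits: dict) -> dict:
    """Return an edited copy of the script; the original is left untouched."""
    edited = copy.deepcopy(script)
    for path, value in edits.items():
        *parents, leaf = path.split(".")
        node = edited
        for key in parents:
            # Create intermediate groups on demand so new variables can be added.
            node = node.setdefault(key, {})
        if value is DELETE:
            node.pop(leaf, None)
        else:
            node[leaf] = value
    return edited


original = {
    "technical": {"duration_s": 15, "frame_rate": 30},
    "audio": {"background_music": "a.mp3", "dubbing": "voice-1"},
}
edited = apply_edits(original, {
    "technical.duration_s": 30,   # modify an existing variable
    "effect.transition": "fade",  # add a new variable
    "audio.dubbing": DELETE,      # delete a variable
})
```

Working on a copy means the client can discard the edits and still show the script as the server generated it.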
In step 508, in response to the user inputting material information on the interactive interface, the client sends the material information and the selected script information to the server.
A sixth interface element may be provided to the user on the interactive interface, as shown in FIG. 6c. In response to the sixth interface element being triggered, the material information input by the user is obtained. As shown in FIG. 6d, after the user triggers the sixth interface element, a window may pop up on the interactive interface, through which the user can select material from, for example, local storage or a server-side material library. Clicking the "OK" component completes the material input and leads to the interactive interface shown in FIG. 6e.
The interactive interface may further include a seventh interface element. In response to the seventh interface element being triggered, the material information and the script information selected by the user are sent to the server. As shown in FIG. 6e, when the user triggers the seventh interface element, namely the "Generate Video" component, the input material information and the selected script information are sent to the server.
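The two triggers in step 508 can be sketched as handlers on a small client-side form object. The class name, the payload layout, and the rule that at least one material is required are assumptions for illustration; `send` stands in for whatever transport the client uses to reach the server.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class GenerationForm:
    """Hypothetical client-side state behind the interface of FIG. 6c to 6e."""
    selected_script: Optional[dict] = None
    materials: list = field(default_factory=list)

    def on_materials_confirmed(self, chosen: list) -> None:
        # Sixth interface element: the user picked materials and clicked "OK".
        self.materials = list(chosen)

    def on_generate_clicked(self, send: Callable[[dict], None]) -> None:
        # Seventh interface element ("Generate Video"): send script and materials together.
        if self.selected_script is None or not self.materials:
            raise ValueError("a script and at least one material are required")
        send({"script": self.selected_script, "materials": self.materials})


sent = []
form = GenerationForm(selected_script={"id": 2})
form.on_materials_confirmed(["local://clip.mp4"])
form.on_generate_clicked(sent.append)
```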
In step 509, the server generates one or more video files using the script selected by the user and the material information, and sends the generated video files to the client.
In this step, the server may use AIGC (AI-generated content) technology to generate video files from the script and the material information. One video file or multiple video files may be generated. Because AIGC-based generation takes some time, after the user clicks the "Generate Video" component shown in FIG. 6e, a prompt such as "Video is being generated" may be displayed on the interactive interface to ask the user to wait.
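One way to show the waiting prompt while generation is in progress is a polling loop on the client. This is a sketch under assumptions: the status dictionary with `state` and `videos` keys and the `poll` and `show` callables are hypothetical, and a real client would normally pause between polls.

```python
from typing import Callable


def wait_for_videos(poll: Callable[[], dict],
                    show: Callable[[str], None],
                    max_polls: int = 10) -> list:
    """Poll the server until the videos are ready, prompting the user meanwhile."""
    for _ in range(max_polls):
        status = poll()
        if status["state"] == "done":
            return status["videos"]
        # Generation is still running: ask the user to wait.
        show("Video is being generated")
    raise TimeoutError("video generation did not finish in time")


# Simulated server replies: two pending polls, then the finished result.
responses = iter([
    {"state": "pending"},
    {"state": "pending"},
    {"state": "done", "videos": ["video 1", "video 2"]},
])
shown = []
videos = wait_for_videos(lambda: next(responses), shown.append)
```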
In step 510, the client displays the one or more video files on the interactive interface.
As shown in FIG. 6f, five generated video files, video 1, video 2, ..., video 5, are displayed on the interactive interface.
In step 511, in response to the user selecting a target video file from the one or more video files, information about the target video file is sent to the server.
For the one or more video files displayed on the interactive interface, the user may select a target video file for different kinds of subsequent processing. For example, the user may select the playback component corresponding to the target video file to play it. As another example, the user may trigger a storage component to save the target video file. As yet another example, information about the selected target video file may be sent to the server.
As shown in FIG. 6f, the user may select video 4 as the target video file and then choose "Publish to Home Page", whereupon the target video file is carried in a publishing request that is sent to the server.
In step 512, the target video file is associated with the user account and then published.
For example, it may be published to the home page of the media service, to the user's home page, or to a page under a specific subject. Because the target video file is associated with the user account, the user can publish it with a simple selection operation, which keeps operation simple and improves the user experience.
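The association between the target video file and the user account in step 512 can be sketched as a simple record. The destination names, the `Publication` record, and the in-memory list are illustrative assumptions; the application does not specify how the association is stored.

```python
from dataclasses import dataclass

# Assumed identifiers for the three publishing targets mentioned in the text.
DESTINATIONS = {"service_home", "user_home", "subject_page"}


@dataclass(frozen=True)
class Publication:
    account_id: str
    video_id: str
    destination: str


def publish(video_id: str, account_id: str, destination: str, published: list) -> Publication:
    """Associate the video with the account, then record it as published."""
    if destination not in DESTINATIONS:
        raise ValueError(f"unknown destination: {destination}")
    record = Publication(account_id=account_id, video_id=video_id, destination=destination)
    published.append(record)
    return record


published = []
record = publish("video 4", "user-123", "user_home", published)
```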
Embodiment 4
FIG. 7 is a flowchart of a method for generating a media file, performed by the server, provided in an embodiment of this application. As shown in FIG. 7, the method may include the following steps:
Step 701: Send one or more items of recommended content to the client.
The recommended content may include recommended media files, media templates, recommended script themes, or recommended script templates.
A user can log in to the service with an account, and the server obtains the login account information. From it, the server can retrieve the user account information associated with that account. In one possible implementation, the recommended content is obtained based on the user account information. The server determines the recommended content from the user account information, so one or more items of recommended content can be shown to the user without any user operation.
In another possible implementation, the recommended content may be obtained based on requirement information input by the user. For example, the user may input requirement information using the first interface element on the interactive interface, and the client sends it to the server. The server obtains one or more items of recommended content based on the requirement information.
In one possible implementation, the server may use a large language model when obtaining recommended content from the user account information or the requirement information. A generative model built on a large language model has strong language understanding and generation capabilities, so the requirement information may be in natural language. Users can thus express their needs more naturally, which makes the implementation more flexible and the user experience better.
If the user is not satisfied with any of the recommended content, the user may use the third interface element on the interactive interface to refresh the displayed recommended content, and the client sends a refresh instruction to the server. In response to the refresh instruction sent by the client, the server regenerates one or more items of recommended content and sends them to the client.
Step 703: Generate a script based on the target recommended content selected by the user from the one or more items of recommended content, the script including control variable information for generating a media file.
In the embodiments of this application, the script mainly comprises control variable information for generating the media file. The control variable information may include, but is not limited to, content-related, technical, effect-related, interaction-related, and audio-related variables. Content-related variables may include scene information, story structure, material positions, arrangement style, and the like. Technical variables may include frame rate, frame size, duration, and the like. Effect-related variables may include brightness, contrast, saturation, transition effects, special effects, and the like. Interaction-related variables may include variables that embody interactive effects, such as hot-zone elements and icons. Audio-related variables may include background music, dubbing, and the like.
When generating a script from the target video instance, the server may use, for example, a script generation model to extract the script from the target video instance. The script generation model may be implemented in various ways, such as with a deep learning model or a large model; this application places no limitation on this.
In the embodiments of this application, the server may send the generated script to the client for display, or may not send the script to the client.
In addition, if the user inputs requirement information using the second interface element on the interactive interface, the client sends the requirement information to the server. In response, the server may also generate a script using the requirement information and send the script information to the client.
The server may also generate multiple scripts and send them to the client; the user selects one of them, and the client sends the selected script to the server.
Step 705: Generate a media file using the script and the material information input by the user.
In this step, the server may use AIGC technology to generate video files from the script and the material information. One video file or multiple video files may be generated.
Step 707: Send the media file to the client.
Furthermore, if the user selects a target media file from multiple media files and the client sends information about the target media file to the server, the server may associate the target media file with the user account and publish it, for example to the home page of the media service, to the user's home page, or to a page under a specific subject.
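Steps 701 to 707 can be sketched as a server object whose models are injected as plain callables. The class name, the method names, and the single-session state are assumptions for illustration; the recommendation, script generation, and media generation models are replaced here by trivial stand-ins.

```python
from typing import Callable, Optional


class MediaFileServer:
    """Sketch of the server-side flow of FIG. 7 for a single user session."""

    def __init__(self,
                 recommend: Callable[[dict], list],
                 script_from: Callable[[str], dict],
                 media_from: Callable[[dict, list], list]):
        self._recommend = recommend
        self._script_from = script_from
        self._media_from = media_from
        self._script: Optional[dict] = None

    def recommendations(self, account_info: dict) -> list:
        # Step 701: produce one or more items of recommended content for the client.
        return self._recommend(account_info)

    def select(self, target: str) -> dict:
        # Step 703: generate a script from the selected recommended content.
        self._script = self._script_from(target)
        return self._script

    def generate(self, materials: list) -> list:
        # Steps 705 and 707: generate the media file(s) and return them to the client.
        if self._script is None:
            raise RuntimeError("no script has been generated yet")
        return self._media_from(self._script, materials)


server = MediaFileServer(
    recommend=lambda account: ["template A", "template B"],
    script_from=lambda target: {"source": target, "technical": {"duration_s": 15}},
    media_from=lambda script, materials: [f"{script['source']} + {m}" for m in materials],
)
items = server.recommendations({"account": "demo"})
server.select(items[0])
videos = server.generate(["clip.mp4"])
```

Because the three models are injected, the same flow works whether the script comes from a deep learning model, a large model, or a test stub.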
Specific embodiments of this specification have been described above. Other embodiments fall within the scope of the appended claims. In some cases, the actions or steps recited in the claims may be performed in an order different from that in the embodiments and still achieve the desired results. In addition, the processes depicted in the drawings do not necessarily require the particular order shown, or sequential order, to achieve the desired results. In some implementations, multitasking and parallel processing are also possible or may be advantageous.
Embodiment 5
FIG. 8 is a schematic block diagram of a device for generating a media file, arranged at the client, provided in an embodiment of this application. The device may be a functional unit arranged at the client, such as a program, a plug-in, or an SDK (Software Development Kit). As shown in FIG. 8, the device 800 may include an interface display unit 801 and a service-side interaction unit 802. The main functions of the units are as follows:
The interface display unit 801 is configured to display one or more items of recommended content on the interactive interface.
The service-side interaction unit 802 is configured to: in response to the user selecting target recommended content from the one or more items of recommended content, send the target recommended content to the server so that the server generates a script from it, the script including control variable information for generating a media file; in response to the user inputting material information through the interactive interface, send the material information to the server; and receive the media file sent by the server, the media file having been generated using the script and the material information.
The interface display unit 801 is further configured to display the media file on the interactive interface.
The recommended content may include recommended media files, media templates, recommended script themes, or recommended script templates.
In one possible implementation, the recommended content is obtained based on user account information.
In another possible implementation, the interactive interface may include a first interface element. When displaying one or more items of recommended content on the interactive interface, the interface display unit 801 may be specifically configured to: in response to the user inputting requirement information using the first interface element, send the requirement information to the server so that the server obtains one or more items of recommended content from the requirement information; and display the one or more items of recommended content on the interactive interface.
Furthermore, the interactive interface may also include a second interface element, and the service-side interaction unit 802 may be further configured to: in response to the user inputting requirement information using the second interface element, send the requirement information to the server so that the server generates a script from the requirement information.
In a preferred implementation, the requirement information may be in natural language.
Furthermore, the interactive interface may also include a third interface element, and the interface display unit 801 may be further configured to: in response to the user triggering the third interface element, refresh the one or more items of recommended content displayed on the interactive interface. Specifically, in response to the user triggering the third interface element, the interface display unit 801 may trigger the service-side interaction unit 802 to send a refresh request to the server. After receiving one or more new items of recommended content, the service-side interaction unit 802 provides them to the interface display unit 801, which refreshes the displayed recommended content.
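The cooperation between units 801 and 802 during a refresh can be sketched as two small classes. The class and method names are illustrative assumptions, and `server_refresh` stands in for the round trip to the server.

```python
from typing import Callable


class ServiceSideInteractionUnit:
    """Sketch of unit 802: talks to the server on behalf of the interface."""

    def __init__(self, server_refresh: Callable[[], list]):
        self._server_refresh = server_refresh

    def request_refresh(self) -> list:
        # Send the refresh request and return the newly received recommended content.
        return self._server_refresh()


class InterfaceDisplayUnit:
    """Sketch of unit 801: owns what is shown on the interactive interface."""

    def __init__(self, service_unit: ServiceSideInteractionUnit):
        self._service_unit = service_unit
        self.displayed: list = []

    def show(self, items: list) -> None:
        self.displayed = list(items)

    def on_third_element_triggered(self) -> None:
        # The user asked for a refresh: delegate to unit 802, then redraw.
        self.show(self._service_unit.request_refresh())


# Simulated server that returns a different batch on each refresh.
batches = iter([["theme C", "theme D"], ["theme E"]])
display = InterfaceDisplayUnit(ServiceSideInteractionUnit(lambda: next(batches)))
display.show(["theme A", "theme B"])
display.on_third_element_triggered()
```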
Furthermore, the service-side interaction unit 802 is further configured to obtain the script. Correspondingly, the interface display unit 801 is further configured to display the script using the fourth interface element of the interactive interface.
In one possible implementation, there are multiple scripts. The interface display unit 801 is further configured to: display the multiple scripts using the fifth interface element of the interactive interface; and, in response to the user selecting a script from the multiple scripts, display the selected script using the fourth interface element. The service-side interaction unit 802 then sends the selected script and the material information to the server.
In a preferred implementation, the fourth interface element may be a text editing box and the fifth interface element may be a card component.
Furthermore, the service-side interaction unit 802 may also be configured to: in response to the user editing the script using the fourth interface element, send the edited script to the server.
In one possible implementation, when sending the material information to the server in response to the user inputting material information on the interactive interface, the service-side interaction unit 802 may be specifically configured to: in response to the sixth interface element on the interactive interface being triggered, obtain the material information input by the user; and, in response to the seventh interface element on the interactive interface being triggered, send the material information to the server.
In a preferred implementation, there are multiple media files, and the interface display unit 801 displays the multiple media files on the interactive interface.
Embodiment 6
FIG. 9 is a schematic block diagram of a device for generating a media file, arranged at the server, provided in an embodiment of this application. The device may be a functional unit arranged at the server, such as a program, a plug-in, or an SDK (Software Development Kit). As shown in FIG. 9, the device 900 may include a user-side interaction unit 901, a script generation unit 902, and a media generation unit 903, and may further include a recommended content generation unit 904. The main functions of the units are as follows:
The user-side interaction unit 901 is configured to send one or more items of recommended content to the client.
The script generation unit 902 is configured to generate a script based on the target recommended content selected by the user from the one or more items of recommended content, the script including control variable information for generating a media file.
The user-side interaction unit 901 is further configured to obtain, from the client, the material information input by the user.
The media generation unit 903 is configured to generate a media file using the script and the material information input by the user.
The user-side interaction unit 901 is further configured to send the media file to the client.
In one possible implementation, the recommended content generation unit 904 may obtain the recommended content based on user account information. The server determines the recommended content from the user account information, so one or more items of recommended content can be shown to the user without any user operation.
In another possible implementation, the recommended content generation unit 904 may obtain one or more items of recommended content based on requirement information input by the user.
In one possible implementation, the recommended content generation unit 904 may use a large language model when obtaining recommended content from the user account information or the requirement information.
Furthermore, in response to the user-side interaction unit 901 receiving a refresh instruction from the client, the recommended content generation unit 904 regenerates one or more items of recommended content and provides them to the user-side interaction unit 901, which sends them to the client.
The embodiments in this specification are described in a progressive manner. For parts that are the same or similar across embodiments, the embodiments may be referred to one another, and each embodiment focuses on how it differs from the others. In particular, because the device embodiments are substantially similar to the method embodiments, they are described relatively briefly; for the relevant details, refer to the corresponding description of the method embodiments. The device embodiments described above are merely illustrative. Units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment. A person of ordinary skill in the art can understand and implement this without creative effort.
In addition, an embodiment of this application further provides a computer-readable storage medium storing a computer program that, when executed by a processor, implements the steps of the method described in any one of the foregoing method embodiments.
An electronic device is also provided, comprising:
one or more processors; and
a memory associated with the one or more processors, the memory being configured to store program instructions that, when read and executed by the one or more processors, perform the steps of the method described in any one of the foregoing method embodiments.
This application further provides a computer program product comprising a computer program that, when executed by a processor, implements the steps of the method described in any one of the foregoing method embodiments.
FIG. 10 shows an example architecture of the electronic device, which may specifically include a processor 1010, a video display adapter 1011, a disk drive 1012, an input/output interface 1013, a network interface 1014, and a memory 1020. The processor 1010, the video display adapter 1011, the disk drive 1012, the input/output interface 1013, the network interface 1014, and the memory 1020 may be communicatively connected through a communication bus 1030.
The processor 1010 may be implemented as a general-purpose CPU, a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits, and is used to execute the relevant programs to implement the technical solutions provided in this application.
The memory 1020 may be implemented as ROM (Read Only Memory), RAM (Random Access Memory), a static storage device, a dynamic storage device, or the like. The memory 1020 may store an operating system 1021 for controlling the operation of the electronic device 1000 and a basic input/output system (BIOS) 1022 for controlling the low-level operations of the electronic device 1000. It may also store a web browser 1023, a data storage management system 1024, a media file generation device 1025, and so on. The media file generation device 1025 may be the application program that implements the operations of the foregoing steps in the embodiments of this application. In short, when the technical solutions provided in this application are implemented in software or firmware, the relevant program code is stored in the memory 1020 and is called and executed by the processor 1010.
The input/output interface 1013 is used to connect an input/output module to implement information input and output. The input/output module may be configured in the device as a component (not shown in the figure) or may be connected externally to the device to provide the corresponding functions. Input devices may include a keyboard, a mouse, a touch screen, a microphone, and various sensors; output devices may include a display, a speaker, a vibrator, and indicator lights.
The network interface 1014 is used to connect a communication module (not shown in the figure) to implement communication between this device and other devices. The communication module may communicate in a wired manner (for example, USB or a network cable) or in a wireless manner (for example, a mobile network, Wi-Fi, or Bluetooth).
The bus 1030 includes a path that transfers information between the components of the device (for example, the processor 1010, the video display adapter 1011, the disk drive 1012, the input/output interface 1013, the network interface 1014, and the memory 1020).
It should be noted that although only the processor 1010, the video display adapter 1011, the disk drive 1012, the input/output interface 1013, the network interface 1014, the memory 1020, and the bus 1030 are shown for the above device, in a specific implementation the device may also include other components necessary for normal operation. In addition, those skilled in the art will understand that the device may include only the components necessary to implement the solution of this application, and need not include all the components shown in the figure.
From the description of the above implementations, those skilled in the art can clearly understand that this application can be implemented by means of software plus a necessary general-purpose hardware platform. Based on this understanding, the technical solution of this application, in essence or in the part that contributes to the prior art, may be embodied in the form of a computer program product. The computer program product may be stored in a storage medium, such as ROM/RAM, a magnetic disk, or an optical disc, and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform the methods described in the embodiments of this application or in certain parts of the embodiments.
The technical solution provided by this application has been described in detail above. Specific examples are used herein to explain the principles and implementations of this application, and the description of the above embodiments is intended only to help in understanding the method of this application and its core idea. At the same time, a person of ordinary skill in the art may, based on the idea of this application, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting this application.
Claims (20)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411252157.3A CN118764686B (en) | 2024-09-09 | 2024-09-09 | Method and device for generating media file |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411252157.3A CN118764686B (en) | 2024-09-09 | 2024-09-09 | Method and device for generating media file |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN118764686A (en) | 2024-10-11 |
| CN118764686B (en) | 2024-12-10 |
Family
ID=92944098
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411252157.3A Active CN118764686B (en) | 2024-09-09 | 2024-09-09 | Method and device for generating media file |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN118764686B (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119597943A (en) * | 2024-11-11 | 2025-03-11 | 百度在线网络技术(北京)有限公司 | Multimodal content generation method and related device |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107770626A (en) * | 2017-11-06 | 2018-03-06 | 腾讯科技(深圳)有限公司 | Processing method, image synthesizing method, device and the storage medium of video material |
| CN110825912A (en) * | 2019-10-30 | 2020-02-21 | 北京达佳互联信息技术有限公司 | Video generation method and device, electronic equipment and storage medium |
| CN116668786A (en) * | 2023-05-31 | 2023-08-29 | 北京字跳网络技术有限公司 | Video generation method, device, computer equipment and storage medium |
| CN116744071A (en) * | 2023-06-13 | 2023-09-12 | 北京字跳网络技术有限公司 | Media content processing methods, devices, equipment, readable storage media and products |
| CN118466811A (en) * | 2023-02-09 | 2024-08-09 | 广州开得联软件技术有限公司 | Media file editing method and device, electronic equipment and storage medium |
| US20240290358A1 (en) * | 2023-02-28 | 2024-08-29 | Lemon Inc. | Video processing method and apparatus, and non-transitory computer-readable storage medium |
- 2024-09-09: CN application CN202411252157.3A (patent CN118764686B), status Active
Also Published As
| Publication number | Publication date |
|---|---|
| CN118764686B (en) | 2024-12-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111294663B (en) | | Bullet screen processing method and device, electronic equipment and computer readable storage medium |
| TWI776066B (en) | | Picture generating method, device, terminal, server and storage medium |
| CN112558824B (en) | | Page display method and device and computer storage medium |
| CN111343074B (en) | | Video processing method, device and equipment and storage medium |
| CN112230909B (en) | | Method, device, equipment and storage medium for binding data of applet |
| US8117281B2 (en) | | Using internet content as a means to establish live social networks by linking internet users to each other who are simultaneously engaged in the same and/or similar content |
| CN113518160B (en) | | Video generation method, device, equipment and storage medium |
| KR101772361B1 (en) | | Method, system and recording medium for providing content in messenger |
| CN111949908A (en) | | Media information processing method, device, electronic device and storage medium |
| CN113778419A (en) | | Multimedia data generation method and device, readable medium and electronic equipment |
| CN113779469A (en) | | Website page modification method and device, electronic device and storage medium |
| JP6986590B2 (en) | | Voice skill creation method, voice skill creation device, electronic device and storage medium |
| WO2020216310A1 (en) | | Method used for generating application, terminal device, and computer readable medium |
| CN118764686A (en) | | A method and device for generating a media file |
| CN117041674A (en) | | Method and device for generating special effects, electronic equipment and storage medium |
| CN111625740B (en) | | Image display method, image display device and electronic device |
| CN114968025B (en) | | Method, device, storage medium and electronic device for publishing comment information |
| CN118175400A (en) | | Video editing method and device, and electronic device |
| CN115665109A (en) | | Multimedia data processing method, device, equipment and medium |
| CN115934974A (en) | | A processing method, device, equipment and medium for multimedia data |
| CN116017015A (en) | | Release method, device, equipment, storage medium and product of barrage |
| CN119396504A (en) | | Information interaction method, device, storage medium and electronic device |
| CN118625972A (en) | | Page display method and device, electronic device, storage medium, computer program |
| CN120547365A (en) | | Live broadcast processing method and device for resource service |
| CN121387137A (en) | | Interaction method and device for virtual pets |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |