CN114708374A

CN114708374A - Virtual image generation method, device, electronic device and storage medium

Info

Publication number: CN114708374A
Application number: CN202210315812.XA
Authority: CN
Inventors: 李�杰; 赵晨
Original assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Current assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Priority date: 2022-03-28
Filing date: 2022-03-28
Publication date: 2022-07-05

Abstract

The present disclosure provides a virtual image generation method, which relates to the technical field of artificial intelligence, and in particular, to the technical field of computer vision and virtual/augmented reality technology. The specific implementation scheme is: determining the style feature map of the target object in the target image; determining a plurality of neural radiation field vectors according to the style feature map; and generating a virtual image corresponding to the target object according to the plurality of neural radiation field vectors. The present disclosure also provides an apparatus for generating a virtual image, an electronic device and a storage medium.

Description

Virtual image generation method, device, electronic device and storage medium

技术领域technical field

本公开涉及人工智能技术领域，尤其涉及计算机视觉技术领域和虚拟 /增强现实技术领域，可应用于图像处理场景下。更具体地，本公开提供了一种虚拟形象生成方法、装置、电子设备和存储介质。The present disclosure relates to the field of artificial intelligence technology, in particular to the field of computer vision technology and virtual/augmented reality technology, and can be applied to image processing scenarios. More specifically, the present disclosure provides an avatar generation method, apparatus, electronic device and storage medium.

背景技术Background technique

随着人工智能技术的发展，在例如虚拟/增强现实等领域中，深度学习模型广泛地应用于图像处理或图像生成。此外，虚拟形象在社交、直播或游戏等场景中具有广泛的应用。可以基于人工的方式生成虚拟形象。With the development of artificial intelligence technology, in fields such as virtual/augmented reality, deep learning models are widely used in image processing or image generation. In addition, avatars have a wide range of applications in social, live or gaming scenarios. The avatar can be generated in a human-based manner.

发明内容SUMMARY OF THE INVENTION

本公开提供了一种虚拟形象生成方法、装置、设备以及存储介质。The present disclosure provides a virtual image generation method, apparatus, device and storage medium.

根据本公开的一方面，提供了一种虚拟形象生成方法，该方法包括：确定目标图像中目标对象的风格特征图；根据所述风格特征图，确定多个神经辐射场矢量；以及根据所述多个神经辐射场矢量，生成与所述目标对象相对应的虚拟形象。According to an aspect of the present disclosure, there is provided a method for generating an avatar, the method comprising: determining a style feature map of a target object in a target image; determining a plurality of neural radiation field vectors according to the style feature map; and according to the A plurality of neural radiation field vectors are used to generate a virtual image corresponding to the target object.

根据本公开的另一方面，提供了一种虚拟形象生成装置，该装置包括：第一确定模块，用于确定目标图像中目标对象的风格特征图；第二确定模块，用于根据所述风格特征图，确定多个神经辐射场矢量；以及生成模块，用于根据所述多个神经辐射场矢量，生成与所述目标对象相对应的虚拟形象。According to another aspect of the present disclosure, there is provided an apparatus for generating an avatar, the apparatus comprising: a first determining module for determining a style feature map of a target object in a target image; a second determining module for determining according to the style a feature map, for determining a plurality of neural radiation field vectors; and a generating module for generating a virtual image corresponding to the target object according to the plurality of neural radiation field vectors.

根据本公开的另一方面，提供了一种电子设备，包括：至少一个处理器；以及与至少一个处理器通信连接的存储器；其中，存储器存储有可被至少一个处理器执行的指令，指令被至少一个处理器执行，以使至少一个处理器能够执行根据本公开提供的方法。According to another aspect of the present disclosure, there is provided an electronic device comprising: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor. The at least one processor executes to enable the at least one processor to perform the methods provided in accordance with the present disclosure.

根据本公开的另一方面，提供了一种存储有计算机指令的非瞬时计算机可读存储介质，该计算机指令用于使计算机执行根据本公开提供的方法。According to another aspect of the present disclosure, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform a method provided in accordance with the present disclosure.

根据本公开的另一方面，提供了一种计算机程序产品，包括计算机程序，所述计算机程序在被处理器执行时实现根据本公开提供的方法。According to another aspect of the present disclosure, there is provided a computer program product comprising a computer program which, when executed by a processor, implements the method provided according to the present disclosure.

应当理解，本部分所描述的内容并非旨在标识本公开的实施例的关键或重要特征，也不用于限制本公开的范围。本公开的其它特征将通过以下的说明书而变得容易理解。It should be understood that what is described in this section is not intended to identify key or critical features of embodiments of the disclosure, nor is it intended to limit the scope of the disclosure. Other features of the present disclosure will become readily understood from the following description.

附图说明Description of drawings

附图用于更好地理解本方案，不构成对本公开的限定。其中：The accompanying drawings are used for better understanding of the present solution, and do not constitute a limitation to the present disclosure. in:

图1是根据本公开的一个实施例的可以应用虚拟形象生成方法和装置的示例性系统架构示意图；FIG. 1 is a schematic diagram of an exemplary system architecture to which the avatar generation method and apparatus can be applied according to an embodiment of the present disclosure;

图2是根据本公开的一个实施例的虚拟形象生成方法的流程图；2 is a flowchart of a method for generating an avatar according to an embodiment of the present disclosure;

图3是根据本公开的另一个实施例的虚拟形象生成方法的流程图；3 is a flowchart of a method for generating an avatar according to another embodiment of the present disclosure;

图4是根据本公开的一个实施例的虚拟形象生成方法的原理图；4 is a schematic diagram of a method for generating an avatar according to an embodiment of the present disclosure;

图5是根据本公开的一个实施例的虚拟形象生成方法的原理图；5 is a schematic diagram of a method for generating an avatar according to an embodiment of the present disclosure;

图6是根据本公开的一个实施例的虚拟形象生成装置的框图；以及FIG. 6 is a block diagram of an avatar generating apparatus according to an embodiment of the present disclosure; and

图7是根据本公开的一个实施例的可以应用虚拟形象生成方法的电子设备的框图。FIG. 7 is a block diagram of an electronic device to which the avatar generation method can be applied, according to one embodiment of the present disclosure.

具体实施方式Detailed ways

以下结合附图对本公开的示范性实施例做出说明，其中包括本公开实施例的各种细节以助于理解，应当将它们认为仅仅是示范性的。因此，本领域普通技术人员应当认识到，可以对这里描述的实施例做出各种改变和修改，而不会背离本公开的范围和精神。同样，为了清楚和简明，以下的描述中省略了对公知功能和结构的描述。Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding and should be considered as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.

虚拟形象可以包括虚拟的身体。可以基于人工的方式对一个虚拟形象进行设计、生成和优化，但需要较高的时间成本和人力成本。不同设计人员对风格的理解具有主观性，导致基于同一风格可能完全不同的多个虚拟形象。此外，基于人工的方式生成的虚拟形象的面部和头发之间可能会不匹配。The avatar may include a virtual body. A virtual image can be designed, generated and optimized in a manual way, but it requires high time and labor costs. The interpretation of styles by different designers is subjective, resulting in multiple avatars that may be completely different based on the same style. Also, there may be a mismatch between the face and hair of the artificially generated avatar.

图1是根据本公开一个实施例的可以应用虚拟形象生成方法和装置的示例性系统架构示意图。需要注意的是，图1所示仅为可以应用本公开实施例的系统架构的示例，以帮助本领域技术人员理解本公开的技术内容，但并不意味着本公开实施例不可以用于其他设备、系统、环境或场景。FIG. 1 is a schematic diagram of an exemplary system architecture to which a method and apparatus for generating an avatar can be applied according to an embodiment of the present disclosure. It should be noted that FIG. 1 is only an example of a system architecture to which the embodiments of the present disclosure can be applied, so as to help those skilled in the art to understand the technical content of the present disclosure, but it does not mean that the embodiments of the present disclosure cannot be used for other A device, system, environment or scene.

如图1所示，根据该实施例的系统架构100可以包括终端设备101、 102、103，网络104和服务器105。网络104用以在终端设备101、102、 103和服务器105之间提供通信链路的介质。网络104可以包括各种连接类型，例如有线和/或无线通信链路等等。As shown in FIG. 1 , the system architecture 100 according to this embodiment may include terminal devices 101 , 102 , and 103 , a network 104 and a server 105 . The network 104 is a medium used to provide a communication link between the terminal devices 101 , 102 , 103 and the server 105 . The network 104 may include various connection types, such as wired and/or wireless communication links, and the like.

用户可以使用终端设备101、102、103通过网络104与服务器105交互，以接收或发送消息等。终端设备101、102、103可以是具有显示屏并且支持网页浏览的各种电子设备，包括但不限于智能手机、平板电脑、膝上型便携计算机和台式计算机等等。The user can use the terminal devices 101, 102, 103 to interact with the server 105 through the network 104 to receive or send messages and the like. The terminal devices 101, 102, 103 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, laptop computers, desktop computers, and the like.

服务器105可以是提供各种服务的服务器，例如对用户利用终端设备 101、102、103所浏览的网站提供支持的后台管理服务器(仅为示例)。后台管理服务器可以对接收到的用户请求等数据进行分析等处理，并将处理结果(例如根据用户请求获取或生成的网页、信息、或数据等)反馈给终端设备。The server 105 may be a server that provides various services, such as a background management server (just an example) that provides support for websites browsed by users using the terminal devices 101, 102, 103. The background management server can analyze and process the received user requests and other data, and feed back the processing results (such as web pages, information, or data obtained or generated according to user requests) to the terminal device.

需要说明的是，本公开实施例所提供的虚拟形象生成方法一般可以由服务器105执行。相应地，本公开实施例所提供的虚拟形象生成装置一般可以设置于服务器105中。本公开实施例所提供的虚拟形象生成方法也可以由不同于服务器105且能够与终端设备101、102、103和/或服务器105 通信的服务器或服务器集群执行。相应地，本公开实施例所提供的虚拟形象生成装置也可以设置于不同于服务器105且能够与终端设备101、102、 103和/或服务器105通信的服务器或服务器集群中。It should be noted that, the virtual image generation method provided by the embodiment of the present disclosure may generally be executed by the server 105 . Correspondingly, the virtual image generating apparatus provided by the embodiment of the present disclosure may generally be provided in the server 105 . The avatar generation method provided by the embodiment of the present disclosure may also be executed by a server or server cluster different from the server 105 and capable of communicating with the terminal devices 101 , 102 , 103 and/or the server 105 . Correspondingly, the avatar generating apparatus provided by the embodiment of the present disclosure may also be set in a server or server cluster that is different from the server 105 and can communicate with the terminal devices 101 , 102 , 103 and/or the server 105 .

图2是根据本公开的一个实施例的虚拟形象生成方法的流程图。FIG. 2 is a flowchart of a method for generating an avatar according to an embodiment of the present disclosure.

如图2所示，该方法200可以包括操作S210至操作S230。As shown in FIG. 2, the method 200 may include operations S210 to S230.

在操作S210，确定目标图像中目标对象的风格特征图。In operation S210, a style feature map of the target object in the target image is determined.

例如，目标图像中可以包含目标对象和图像背景。可以对目标图像进行处理，去除图像背景，得到处理后的目标图像。在一个示例中，可以利用语义分割模型确定图像背景所处的区域，进而可以去除图像背景。For example, the target image can contain the target object and the image background. The target image can be processed to remove the background of the image to obtain the processed target image. In one example, a semantic segmentation model can be used to determine the region where the image background is located, and then the image background can be removed.

又例如，可以利用多个深度学习模型对处理后的目标图像进行处理，得到风格特征图。在一个示例中，多个深度学习模型例如可以包括U-Net (U型网络)模型和StyleGAN2(风格化对抗生成网络)模型。For another example, multiple deep learning models may be used to process the processed target image to obtain a style feature map. In one example, the plurality of deep learning models may include, for example, a U-Net (U-shaped network) model and a StyleGAN2 (Styled Adversarial Generative Network) model.

在操作S220，根据风格特征图，确定多个神经辐射场矢量。In operation S220, a plurality of neural radiation field vectors are determined according to the style feature map.

例如，可以利用NERF(NEural Radiance Field，神经辐射场)模型处理风格特征图，得到多个矢量，作为多个神经辐射场矢量。For example, a NERF (NEural Radiance Field, neural radiation field) model can be used to process the style feature map to obtain multiple vectors as multiple neural radiation field vectors.

在操作S230，根据多个神经辐射场矢量，生成与目标对象相对应的虚拟形象。In operation S230, an avatar corresponding to the target object is generated according to the plurality of neural radiation field vectors.

例如，可以根据多个神经辐射场矢量，进行图像渲染，以得到与目标对象对应的虚拟形象。For example, image rendering can be performed according to a plurality of neural radiation field vectors to obtain an avatar corresponding to the target object.

通过本公开实施例，可以根据用户提供的图像生成具有相应风格的虚拟形象。With the embodiments of the present disclosure, an avatar with a corresponding style can be generated according to an image provided by a user.

图3是根据本公开的另一个实施例的虚拟形象生成方法的流程图。FIG. 3 is a flowchart of a method for generating an avatar according to another embodiment of the present disclosure.

如图3所示，该方法300可以包括操作S311至操作S312、操作S321 至操作S324，以及操作S331至操作S332。As shown in FIG. 3 , the method 300 may include operations S311 to S312 , operations S321 to S324 , and operations S331 to S332 .

在本公开实施例中，方法300可以确定目标图像中目标对象的风格特征图，下面将结合操作S311至操作S312进行详细说明。In this embodiment of the present disclosure, the method 300 may determine the style feature map of the target object in the target image, which will be described in detail below with reference to operations S311 to S312.

在操作S311，确定目标对象的风格编码数据。In operation S311, style encoding data of the target object is determined.

例如，目标对象包括头部。For example, the target object includes a head.

例如，如上文所述，可以去除目标图像中的图像背景，得到处理后的目标图像。接下来，可以利用U-Net模型对处理后的目标图像进行再次处理，得到目标对象的风格编码数据。For example, as described above, the image background in the target image can be removed to obtain the processed target image. Next, the processed target image can be processed again by using the U-Net model to obtain the style encoding data of the target object.

在操作S312，根据风格编码数据，确定风格特征图。In operation S312, a style feature map is determined according to the style encoding data.

例如，可以利用StyleGAN2模型对风格编码数据进行处理，确定风格特征图。For example, the StyleGAN2 model can be used to process style-encoded data to determine style feature maps.

在本公开实施例中，方法300也可以根据风格特征图，确定多个神经辐射场矢量。下面将结合操作S321至操作S324。In the embodiment of the present disclosure, the method 300 may also determine a plurality of neural radiation field vectors according to the style feature map. Operations S321 to S324 will be combined below.

在操作S321，根据风格特征图，生成风格图像。In operation S321, a style image is generated according to the style feature map.

例如，可以利用CNN(Convolutional Neural Network，卷积神经网络) 处理上文所述的风格特征图，以生成风格图像。在一个示例中，可以选用相应的卷积核，以根据风格特征图生成风格图像。For example, a CNN (Convolutional Neural Network, Convolutional Neural Network) can be used to process the style feature map described above to generate a style image. In one example, corresponding convolution kernels can be selected to generate style images from style feature maps.

在操作S322，根据风格图像，确定重建参数。In operation S322, reconstruction parameters are determined according to the style image.

例如，可以利用3DMM(3D Morphable Models，3D可变模型)根据上文所述的风格图像，确定重建参数。在一个示例中，重建参数例如可以包括shape(形状)参数、expression(表情)参数、texture(纹理)参数、相机内参和相机外参等等。For example, 3DMM (3D Morphable Models, 3D Morphable Models) can be used to determine the reconstruction parameters according to the style image described above. In one example, the reconstruction parameters may include, for example, shape parameters, expression parameters, texture parameters, camera intrinsic parameters, camera extrinsic parameters, and the like.

在操作S323，将风格特征图与重建参数相融合，得到融合特征图。In operation S323, the style feature map is fused with the reconstruction parameters to obtain a fused feature map.

例如，可以利用第1个FC(Fully Connected Neural Network，全连接神经网络)模型处理风格特征图，得到一个风格特征信息。For example, the first FC (Fully Connected Neural Network) model can be used to process the style feature map to obtain a style feature information.

又例如，可以利用第2个FC模型处理重建参数，得到一个重建特征信息。For another example, the reconstruction parameters may be processed by using the second FC model to obtain reconstruction feature information.

接下来，可以将风格特征信息和重建特征信息融合，得到融合特征图。Next, the style feature information and the reconstructed feature information can be fused to obtain a fused feature map.

在操作S324，根据融合特征图，确定多个神经辐射场矢量。In operation S324, a plurality of neural radiation field vectors are determined according to the fusion feature map.

例如，可以利用NERF模型处理融合特征图，以确定多个神经辐射场矢量。在一个示例中，NERF模型包括多个全连接层。For example, the fused feature maps can be processed using the NERF model to determine multiple neural radiation field vectors. In one example, the NERF model includes multiple fully connected layers.

在本公开实施例中，方法300还可以根据多个神经辐射场矢量，进行图像渲染，生成与目标图像对应的虚拟形象。下面将结合操作S331至操作S332进行详细说明。In the embodiment of the present disclosure, the method 300 may further perform image rendering according to a plurality of neural radiation field vectors to generate a virtual image corresponding to the target image. A detailed description will be made below in conjunction with operations S331 to S332.

在操作S331，根据多个神经辐射场矢量，得到目标对象的显式三维表达。In operation S331, an explicit three-dimensional representation of the target object is obtained according to the plurality of neural radiation field vectors.

例如，可以基于MC(Marching Cub)算法，根据多个神经辐射场矢量，得到目标对象的显式三维表达。在一个示例中，利用本公开实施例的 NERF模型确定的多个神经辐射场矢量可以视为一种隐式三维表达。基于 MC算法，可以得到一种显式三维表达。For example, based on the MC (Marching Cub) algorithm, an explicit three-dimensional representation of the target object can be obtained according to multiple neural radiation field vectors. In one example, the plurality of neural radiation field vectors determined using the NERF model of an embodiment of the present disclosure can be regarded as an implicit three-dimensional representation. Based on the MC algorithm, an explicit three-dimensional representation can be obtained.

在操作S332，将显式三维表达进行渲染，生成与目标对象相对应的虚拟形象。In operation S332, the explicit three-dimensional representation is rendered to generate an avatar corresponding to the target object.

例如，可以利用上文所述的Pytorch3D渲染器将显式三维表达进行渲染，得到虚拟形象。For example, an explicit three-dimensional representation can be rendered using the Pytorch3D renderer described above to obtain an avatar.

通过本公开实施例，利用3DMM可以准确地确定符合目标对象风格的重建参数。此外，将重建参数与风格特征图融合，可以充分利用U-Net 模型输出的信息和3DMM输出的信息。Through the embodiments of the present disclosure, the reconstruction parameters conforming to the style of the target object can be accurately determined by using the 3DMM. In addition, by fusing the reconstruction parameters with the style feature map, the information output by the U-Net model and the information output by the 3DMM can be fully utilized.

通过本公开实施例，可以在风格化、个性化、需求差异化等方面为用户提供很大的自由度，提高了用户体验。本公开实施例可以生成面部皮肤和皮肤区域以外的三维毛发。该三维头发可以与面部风格匹配。With the embodiments of the present disclosure, a great degree of freedom can be provided for users in terms of stylization, personalization, and demand differentiation, and the user experience is improved. Embodiments of the present disclosure can generate three-dimensional hair outside of facial skin and skin areas. The three-dimensional hair can be matched to the facial style.

需要说明的是，本公开实施例提供的虚拟形象生成方法可以应用于元宇宙场景，也可以应用于其他场景。通过本公开实施例，可以为用户提供端到端完整的虚拟形象生成方法。It should be noted that the virtual image generation method provided by the embodiment of the present disclosure can be applied to a metaverse scene, and can also be applied to other scenes. Through the embodiments of the present disclosure, a complete end-to-end virtual image generation method can be provided for users.

在一些实施例中，NERF模型可以包括一个FC模型。根据风格特征图或融合特征图，可以确定体素的位置信息和视角方向信息，作为NERF 模型的输入矢量。NERF模型输出的神经辐射场矢量例如可以包括颜色信息和体素密度信息。体素密度信息与体素所处的位置相关。颜色信息与体素所处的位置和视角方向有关。In some embodiments, the NERF model may include an FC model. According to the style feature map or the fusion feature map, the position information and view direction information of the voxel can be determined as the input vector of the NERF model. The neural radiation field vector output by the NERF model may include, for example, color information and voxel density information. The voxel density information is related to where the voxels are located. The color information is related to the position of the voxel and the viewing direction.

在一些实施例中，MC算法可以逐个遍历输入的数据场中各个体素，确定该体素是否与等值面相交。在体素与等值面相交的情况下，计算等值面与一个立方体的棱边的交点，用这些交点连接成三角面片以逼近表示该体素内的等值面。在所有的体素被遍历完之后，可以得到整个数据场的等值面。在一个示例中，输入的数据场可以根据上文所述的多个神经辐射场矢量得到。In some embodiments, the MC algorithm may traverse each voxel in the input data field one by one to determine whether the voxel intersects the isosurface. In the case where a voxel intersects an isosurface, calculate the intersections of the isosurface with the edges of a cube, and use these intersections to form triangular patches to approximate the isosurface within the voxel. After all voxels have been traversed, the isosurface of the entire data field can be obtained. In one example, the input data field may be derived from the plurality of neural radiation field vectors described above.

在一些实施例中，3DMM可以将面部形状和面部纹理作为约束，同时考虑中面部姿态和外部光照的影响，可以确定高精度的重建参数。3DMM 可以基于风格图，确定初始化参数，以便确定一个3D的初始化虚拟形象。根据该初始化虚拟形象，可以确定一个2D的投影图像。计算投影图像和风格图之间的差异值，根据该差异值调整上文所述的初始化参数。在差异值小于预设差异阈值之后，可以确定重建参数。In some embodiments, the 3DMM can use facial shape and facial texture as constraints, while taking into account the influence of mid-face pose and external lighting, can determine high-precision reconstruction parameters. The 3DMM can determine initialization parameters based on the style map in order to determine a 3D initialization avatar. From the initialized avatar, a 2D projection image can be determined. Calculate the difference between the projected image and the style map, and adjust the initialization parameters described above according to the difference. After the difference value is smaller than the preset difference threshold, the reconstruction parameters may be determined.

在一些实施例中，Pytorch3D渲染器是一种可以为3D计算机视觉研究提供高效、可复用功能的组件。Pytorch3D渲染器可以存储和操作三角形网格的数据结构，也可以基于三角形网格的数据结构进行高效运算(例如投影变换、图卷积等等)。此外，Pytorch3D渲染器是一种可微分的网格渲染器。In some embodiments, the Pytorch3D renderer is a component that provides efficient, reusable functionality for 3D computer vision research. The Pytorch3D renderer can store and manipulate the data structure of the triangle mesh, and can also perform efficient operations (such as projection transformation, graph convolution, etc.) based on the data structure of the triangle mesh. Also, the Pytorch3D renderer is a differentiable mesh renderer.

图4是根据本公开的另一个实施例的虚拟形象生成方法的原理图。FIG. 4 is a schematic diagram of a method for generating an avatar according to another embodiment of the present disclosure.

如图4所示，可以利用虚拟形象生成模型400根据目标图像401生成虚拟形象402。As shown in FIG. 4 , an avatar 402 can be generated according to the target image 401 by using the avatar generation model 400 .

例如，虚拟形象生成模型400包括U-Net模型410、StyleGAN2模型 420、CNN模型430、3DMM 440、FC模型450、FC模型460、NERF模型470和Pytorch3D渲染器480。For example, the avatar generation model 400 includes a U-Net model 410 , a StyleGAN2 model 420 , a CNN model 430 , a 3DMM 440 , an FC model 450 , an FC model 460 , a NERF model 470 , and a Pytorch3D renderer 480 .

例如，目标图像401中的目标对象包括头部。在一个示例中，可以去除目标图像401中的图像背景，得到处理后的目标图像。接下来，U-Net 模型410可以对处理后的目标图像进行再次处理，得到目标对象的风格编码数据D_s。For example, the target object in the target image 401 includes a head. In one example, the image background in the target image 401 may be removed to obtain the processed target image. Next, the U-Net model 410 can process the processed target image again to obtain the style encoding data D_s of the target object.

StyleGAN2模型420可以对风格编码数据D_s进行处理，确定风格特征图FM_s。The StyleGAN2 model 420 may process the style encoding data D_s to determine the style feature map FM_s.

CNN模型430可以处理风格特征图FM_s，生成风格图像Img_s。The CNN model 430 can process the style feature map FM_s to generate the style image Img_s.

3DMM 440可以根据风格图像Img_s，确定重建参数Para。在一个示例中，重建参数Para例如可以包括shape参数、expression参数、texture 参数、相机内参和相机外参等等。The 3DMM 440 may determine the reconstruction parameter Para according to the style image Img_s. In an example, the reconstruction parameter Para may include, for example, a shape parameter, an expression parameter, a texture parameter, an internal camera parameter, an external camera parameter, and the like.

FC模型450可以处理风格特征图FM_s，而FC模型460可以处理重建参数Para。The FC model 450 can process the style feature map FM_s, and the FC model 460 can process the reconstruction parameter Para.

可以对FC模型450的输出和FC模型460的输出进行融合，得到融合特征图FM_m。The output of the FC model 450 and the output of the FC model 460 may be fused to obtain a fused feature map FM_m.

NERF模型470可以处理融合特征图，确定多个神经辐射场矢量。在一个示例中，NERF模型包括多个全连接层。The NERF model 470 can process the fused feature maps to determine multiple neural radiation field vectors. In one example, the NERF model includes multiple fully connected layers.

接下来，可以基于MC(Marching Cub)算法，根据多个神经辐射场矢量，得到目标对象的显式三维表达。Next, based on the MC (Marching Cub) algorithm, an explicit three-dimensional representation of the target object can be obtained according to multiple neural radiation field vectors.

Pytorch3D渲染器480可以将显式三维表达进行渲染，得到虚拟形象 402。The Pytorch3D renderer 480 can render the explicit three-dimensional representation, resulting in the avatar 402.

在一些实施例中，上文所述的方法还包括：将样本图像输入虚拟形象生成模型，得到与样本图像中样本对象对应的虚拟形象；根据样本图像的标签和虚拟形象，训练虚拟形象生成模型。下面将结合图5进行详细说明。In some embodiments, the method described above further includes: inputting the sample image into the avatar generation model to obtain an avatar corresponding to the sample object in the sample image; training the avatar generation model according to the label of the sample image and the avatar . A detailed description will be given below with reference to FIG. 5 .

图5是根据本公开的另一个实施例的虚拟形象生成方法的原理图。FIG. 5 is a schematic diagram of a method for generating an avatar according to another embodiment of the present disclosure.

如图5所示，可以利用虚拟形象生成模型500根据样本图像501生成虚拟形象502。As shown in FIG. 5 , an avatar 502 can be generated from a sample image 501 using an avatar generation model 500 .

例如，虚拟形象生成模型500包括U-Net模型510、StyleGAN2模型 520、CNN模型530、3DMM 540、FC模型550、FC模型560、NERF模型570和Pytorch3D渲染器580。可以理解，上文对U-Net模型410、 StyleGAN2模型420、CNN模型430、3DMM 440、FC模型450、FC模型 460、NERF模型470和Pytorch3D渲染器480的详细描述同样适用于本实施例，本公开再次不在赘述。For example, avatar generation model 500 includes U-Net model 510, StyleGAN2 model 520, CNN model 530, 3DMM 540, FC model 550, FC model 560, NERF model 570, and Pytorch3D renderer 580. It can be understood that the above detailed descriptions of the U-Net model 410, the StyleGAN2 model 420, the CNN model 430, the 3DMM 440, the FC model 450, the FC model 460, the NERF model 470 and the Pytorch3D renderer 480 are also applicable to this embodiment. The disclosure will not be repeated again.

在Pytorch3D渲染器580生成与样本图像501中样本对象对应的虚拟形象502之后，可以根据虚拟形象502和标签503，确定损失值504。根据损失值504可以调整虚拟形象生成模型500的参数。例如，可以根据损失值504调整U-Net模型510、StyleGAN2模型520、CNN模型530、3DMM 540、FC模型550、FC模型560、NERF模型570和Pytorch3D渲染器580 中一个或多个模型的参数，使得损失值504小于或等于预设损失阈值。在一个示例中，可以在损失值小于或等于预设损失阈值之后，可以得到经训练的虚拟形象生成模型，作为上文所述的虚拟形象生成模型400。After the Pytorch3D renderer 580 generates the avatar 502 corresponding to the sample object in the sample image 501 , the loss value 504 can be determined according to the avatar 502 and the label 503 . The parameters of the avatar generation model 500 may be adjusted according to the loss value 504 . For example, the parameters of one or more of the U-Net model 510, StyleGAN2 model 520, CNN model 530, 3DMM 540, FC model 550, FC model 560, NERF model 570 and Pytorch3D renderer 580 may be adjusted according to the loss value 504, Make the loss value 504 less than or equal to the preset loss threshold. In one example, after the loss value is less than or equal to a preset loss threshold, a trained avatar generation model can be obtained as the avatar generation model 400 described above.

例如，样本图像501的标签可以是人工确定的。可以由多个不同的虚拟形象设计人员根据样本图像中的样本对象设计多个备选的虚拟形象。再由多个设计人员分别对多个备选的虚拟形象进行评价，将评价最高的虚拟形象作为样本图像501的标签。For example, the label of the sample image 501 may be manually determined. Multiple alternative avatars may be designed by multiple different avatar designers according to the sample objects in the sample image. Then, multiple designers evaluate multiple candidate avatars respectively, and use the avatar with the highest evaluation as the label of the sample image 501 .

例如，可以利用L1损失函数或L2损失函数确定损失值504。For example, the loss value 504 may be determined using an L1 loss function or an L2 loss function.

图6是根据本公开的一个实施例的虚拟形象生成装置的框图。FIG. 6 is a block diagram of an avatar generating apparatus according to an embodiment of the present disclosure.

如图6所示，该装置600可以包括第一确定模块610、第二确定模块620和生成模块630。As shown in FIG. 6 , the apparatus 600 may include a first determining module 610 , a second determining module 620 and a generating module 630 .

第一确定模块610，用于确定目标图像中目标对象的风格特征图。在一个示例中，第一确定模块610可以用于执行操作S210，本公开在此不在赘述。The first determining module 610 is configured to determine the style feature map of the target object in the target image. In an example, the first determining module 610 may be configured to perform operation S210, which will not be repeated in the present disclosure.

第二确定模块620，用于根据所述风格特征图，确定多个神经辐射场矢量。在一个示例中，第二确定模块620可以用于执行操作S220，本公开在此不在赘述。The second determining module 620 is configured to determine a plurality of neural radiation field vectors according to the style feature map. In one example, the second determining module 620 may be configured to perform operation S220, which will not be repeated in the present disclosure.

生成模块630，用于根据所述多个神经辐射场矢量，生成与所述目标对象相对应的虚拟形象。在一个示例中，生成模块630可以用于执行操作 S230，本公开在此不在赘述。The generating module 630 is configured to generate a virtual image corresponding to the target object according to the plurality of neural radiation field vectors. In one example, the generating module 630 may be configured to perform operation S230, which will not be repeated in the present disclosure.

在一些实施例中，所述第一确定模块包括：第一确定单元，用于确定所述目标对象的风格编码数据；以及第二确定单元，用于根据所述风格编码数据，确定所述风格特征图。在一个示例中，第一确定单元可以用于执行操作S311。在一个示例中，第二确定单元可以用于执行操作S312，本公开在此不在赘述。In some embodiments, the first determining module includes: a first determining unit for determining style encoding data of the target object; and a second determining unit for determining the style according to the style encoding data feature map. In one example, the first determination unit may be configured to perform operation S311. In one example, the second determining unit may be configured to perform operation S312, which will not be repeated in the present disclosure.

在一些实施例中，所述第二确定模块包括：第一生成单元，用于根据所述风格特征图，生成风格图像；第三确定单元，用于根据所述风格图像，确定重建参数；融合单元，用于将所述风格特征图与所述重建参数相融合，得到融合特征图；以及第四确定单元，用于根据所述融合特征图，确定多个神经辐射场矢量。在一个示例中，第一生成单元可以用于执行操作S321。在一个示例中，第三确定单元可以用于执行操作S322。在一个示例中，融合单元可以用于执行操作S323。在一个示例中，第四确定单元可以用于执行操作S324，本公开在此不在赘述。In some embodiments, the second determining module includes: a first generating unit for generating a style image according to the style feature map; a third determining unit for determining reconstruction parameters according to the style image; fusion a unit for merging the style feature map with the reconstruction parameters to obtain a fused feature map; and a fourth determining unit for determining a plurality of neural radiation field vectors according to the fused feature map. In one example, the first generating unit may be configured to perform operation S321. In one example, the third determination unit may be configured to perform operation S322. In one example, the fusion unit may be used to perform operation S323. In one example, the fourth determining unit may be configured to perform operation S324, which will not be repeated in the present disclosure.

在一些实施例中，所述生成模块包括：获得单元，用于根据所述多个神经辐射场矢量，得到所述目标对象的显式三维表达；以及第二生成单元，用于将所述显式三维表达进行渲染，生成与所述目标对象相对应的虚拟形象。在一个示例中，获得单元可以用于执行操作S331。在一个示例中，第二生成单元可以用于执行操作S332，本公开在此不在赘述。In some embodiments, the generating module includes: an obtaining unit for obtaining an explicit three-dimensional representation of the target object according to the plurality of neural radiation field vectors; and a second generating unit for converting the explicit three-dimensional representation of the target object. The three-dimensional expression is rendered in the form of a three-dimensional representation, and an avatar corresponding to the target object is generated. In one example, the obtaining unit may be used to perform operation S331. In one example, the second generating unit may be configured to perform operation S332, which will not be repeated in this disclosure.

在一些实施例中，所述目标对象包括头部。In some embodiments, the target object includes a head.

本公开的技术方案中，所涉及的用户个人信息的收集、存储、使用、加工、传输、提供和公开等处理，均符合相关法律法规的规定，且不违背公序良俗。In the technical solutions of the present disclosure, the collection, storage, use, processing, transmission, provision, and disclosure of the user's personal information involved are all in compliance with relevant laws and regulations, and do not violate public order and good customs.

根据本公开的实施例，本公开还提供了一种电子设备、一种可读存储介质和一种计算机程序产品。According to embodiments of the present disclosure, the present disclosure also provides an electronic device, a readable storage medium, and a computer program product.

在本公开实施例中，提供了一种电子设备，包括：至少一个处理器；以及与所述至少一个处理器通信连接的存储器；其中，所述存储器存储有可被所述至少一个处理器执行的指令，所述指令被所述至少一个处理器执行，以使所述至少一个处理器能够执行本公开提供的方法。In an embodiment of the present disclosure, an electronic device is provided, comprising: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores data executable by the at least one processor The instructions are executed by the at least one processor to enable the at least one processor to perform the methods provided by the present disclosure.

在本公开实施例中，提供了一种存储有计算机指令的非瞬时计算机可读存储介质，其中，所述计算机指令用于使所述计算机执行本公开提供的方法。In an embodiment of the present disclosure, a non-transitory computer-readable storage medium storing computer instructions is provided, wherein the computer instructions are used to cause the computer to perform the method provided by the present disclosure.

本公开实施例中，提供了一种计算机程序产品，包括计算机程序，所述计算机程序在被处理器执行时实现本公开提供的方法。In an embodiment of the present disclosure, a computer program product is provided, including a computer program, and the computer program implements the method provided by the present disclosure when executed by a processor.

图7示出了可以用来实施本公开的实施例的示例电子设备700的示意性框图。电子设备旨在表示各种形式的数字计算机，诸如，膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。电子设备还可以表示各种形式的移动装置，诸如，个人数字处理、蜂窝电话、智能电话、可穿戴设备和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例，并且不意在限制本文中描述的和/或者要求的本公开的实现。FIG. 7 shows a schematic block diagram of an example electronic device 700 that may be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital processors, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions are by way of example only, and are not intended to limit implementations of the disclosure described and/or claimed herein.

如图7所示，设备700包括计算单元701，其可以根据存储在只读存储器(ROM)702中的计算机程序或者从存储单元708加载到随机访问存储器(RAM)703中的计算机程序，来执行各种适当的动作和处理。在 RAM 703中，还可存储设备700操作所需的各种程序和数据。计算单元 701、ROM 702以及RAM 703通过总线704彼此相连。输入/输出(I/O) 接口705也连接至总线704。As shown in FIG. 7 , the device 700 includes a computing unit 701 that can be executed according to a computer program stored in a read only memory (ROM) 702 or loaded into a random access memory (RAM) 703 from a storage unit 708 Various appropriate actions and handling. In the RAM 703, various programs and data necessary for the operation of the device 700 can also be stored. The computing unit 701, the ROM 702, and the RAM 703 are connected to each other through a bus 704. An input/output (I/O) interface 705 is also connected to bus 704 .

设备700中的多个部件连接至I/O接口705，包括：输入单元706，例如键盘、鼠标等；输出单元707，例如各种类型的显示器、扬声器等；存储单元708，例如磁盘、光盘等；以及通信单元709，例如网卡、调制解调器、无线通信收发机等。通信单元709允许设备700通过诸如因特网的计算机网络和/或各种电信网络与其他设备交换信息/数据。Various components in the device 700 are connected to the I/O interface 705, including: an input unit 706, such as a keyboard, mouse, etc.; an output unit 707, such as various types of displays, speakers, etc.; a storage unit 708, such as a magnetic disk, an optical disk, etc. ; and a communication unit 709, such as a network card, a modem, a wireless communication transceiver, and the like. The communication unit 709 allows the device 700 to exchange information/data with other devices through a computer network such as the Internet and/or various telecommunication networks.

计算单元701可以是各种具有处理和计算能力的通用和/或专用处理组件。计算单元701的一些示例包括但不限于中央处理单元(CPU)、图形处理单元(GPU)、各种专用的人工智能(AI)计算芯片、各种运行机器学习模型算法的计算单元、数字信号处理器(DSP)、以及任何适当的处理器、控制器、微控制器等。计算单元701执行上文所描述的各个方法和处理，例如虚拟形象生成方法。例如，在一些实施例中，虚拟形象生成方法可被实现为计算机软件程序，其被有形地包含于机器可读介质，例如存储单元708。在一些实施例中，计算机程序的部分或者全部可以经由 ROM 702和/或通信单元709而被载入和/或安装到设备700上。当计算机程序加载到RAM 703并由计算单元701执行时，可以执行上文描述的虚拟形象生成方法的一个或多个步骤。备选地，在其他实施例中，计算单元 701可以通过其他任何适当的方式(例如，借助于固件)而被配置为执行虚拟形象生成方法。Computing unit 701 may be various general-purpose and/or special-purpose processing components with processing and computing capabilities. Some examples of computing units 701 include, but are not limited to, central processing units (CPUs), graphics processing units (GPUs), various specialized artificial intelligence (AI) computing chips, various computing units that run machine learning model algorithms, digital signal processing processor (DSP), and any suitable processor, controller, microcontroller, etc. The computing unit 701 executes the various methods and processes described above, such as the avatar generation method. For example, in some embodiments, the avatar generation method may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as storage unit 708 . In some embodiments, part or all of the computer program may be loaded and/or installed on device 700 via ROM 702 and/or communication unit 709 . When the computer program is loaded into RAM 703 and executed by computing unit 701, one or more steps of the avatar generation method described above may be performed. Alternatively, in other embodiments, the computing unit 701 may be configured to perform the avatar generation method by any other suitable means (eg, by means of firmware).

本文中以上描述的系统和技术的各种实施方式可以在数字电子电路系统、集成电路系统、场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、芯片上系统的系统(SOC)、复杂可编程逻辑设备(CPLD)、计算机硬件、固件、软件、和/或它们的组合中实现。这些各种实施方式可以包括：实施在一个或者多个计算机程序中，该一个或者多个计算机程序可在包括至少一个可编程处理器的可编程系统上执行和/ 或解释，该可编程处理器可以是专用或者通用可编程处理器，可以从存储系统、至少一个输入装置、和至少一个输出装置接收数据和指令，并且将数据和指令传输至该存储系统、该至少一个输入装置、和该至少一个输出装置。Various implementations of the systems and techniques described herein above may be implemented in digital electronic circuitry, integrated circuit systems, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chips system (SOC), complex programmable logic device (CPLD), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs executable and/or interpretable on a programmable system including at least one programmable processor The processor, which may be a special purpose or general-purpose programmable processor, may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device an output device.

用于实施本公开的方法的程序代码可以采用一个或多个编程语言的任何组合来编写。这些程序代码可以提供给通用计算机、专用计算机或其他可编程数据处理装置的处理器或控制器，使得程序代码当由处理器或控制器执行时使流程图和/或框图中所规定的功能/操作被实施。程序代码可以完全在机器上执行、部分地在机器上执行，作为独立软件包部分地在机器上执行且部分地在远程机器上执行或完全在远程机器或服务器上执行。Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer or other programmable data processing apparatus, such that the program code, when executed by the processor or controller, performs the functions/functions specified in the flowcharts and/or block diagrams. Action is implemented. The program code may execute entirely on the machine, partly on the machine, partly on the machine and partly on a remote machine as a stand-alone software package or entirely on the remote machine or server.

在本公开的上下文中，机器可读介质可以是有形的介质，其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备，或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.

为了提供与用户的交互，可以在计算机上实施此处描述的系统和技术，该计算机具有：用于向用户显示信息的显示装置(例如，CRT(阴极射线管)或者LCD(液晶显示器)监视器)；以及键盘和指向装置(例如，鼠标或者轨迹球)，用户可以通过该键盘和该指向装置来将输入提供给计算机。其它种类的装置还可以用于提供与用户的交互；例如，提供给用户的反馈可以是任何形式的传感反馈(例如，视觉反馈、听觉反馈、或者触觉反馈)；并且可以用任何形式(包括声输入、语音输入或者、触觉输入) 来接收来自用户的输入。To provide interaction with a user, the systems and techniques described herein may be implemented on a computer having a display device (eg, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user ); and a keyboard and pointing device (eg, a mouse or trackball) through which a user can provide input to the computer. Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (eg, visual feedback, auditory feedback, or tactile feedback); and can be in any form (including audio input, voice input, or tactile input) to receive input from the user.

可以将此处描述的系统和技术实施在包括后台部件的计算系统(例如，作为数据服务器)、或者包括中间件部件的计算系统(例如，应用服务器)、或者包括前端部件的计算系统(例如，具有图形用户界面或者网络浏览器的用户计算机，用户可以通过该图形用户界面或者该网络浏览器来与此处描述的系统和技术的实施方式交互)、或者包括这种后台部件、中间件部件、或者前端部件的任何组合的计算系统中。可以通过任何形式或者介质的数字数据通信(例如，通信网络)来将系统的部件相互连接。通信网络的示例包括：局域网(LAN)、广域网(WAN)和互联网。The systems and techniques described herein may be implemented on a computing system that includes back-end components (eg, as a data server), or a computing system that includes middleware components (eg, an application server), or a computing system that includes front-end components (eg, a user's computer having a graphical user interface or web browser through which a user may interact with implementations of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system. The components of the system may be interconnected by any form or medium of digital data communication (eg, a communication network). Examples of communication networks include: Local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.

计算机系统可以包括客户端和服务器。客户端和服务器一般远离彼此并且通常通过通信网络进行交互。通过在相应的计算机上运行并且彼此具有客户端-服务器关系的计算机程序来产生客户端和服务器的关系。A computer system can include clients and servers. Clients and servers are generally remote from each other and usually interact through a communication network. The relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other.

应该理解，可以使用上面所示的各种形式的流程，重新排序、增加或删除步骤。例如，本发公开中记载的各步骤可以并行地执行也可以顺序地执行也可以不同的次序执行，只要能够实现本公开公开的技术方案所期望的结果，本文在此不进行限制。It should be understood that steps may be reordered, added or deleted using the various forms of flow shown above. For example, the steps described in the present disclosure can be executed in parallel, sequentially, or in different orders. As long as the desired results of the technical solutions disclosed in the present disclosure can be achieved, no limitation is imposed herein.

上述具体实施方式，并不构成对本公开保护范围的限制。本领域技术人员应该明白的是，根据设计要求和其他因素，可以进行各种修改、组合、子组合和替代。任何在本公开的精神和原则之内所作的修改、等同替换和改进等，均应包含在本公开保护范围之内。The above-mentioned specific embodiments do not constitute a limitation on the protection scope of the present disclosure. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may occur depending on design requirements and other factors. Any modifications, equivalent replacements, and improvements made within the spirit and principles of the present disclosure should be included within the protection scope of the present disclosure.

Claims

1. An avatar generation method, comprising:

determining a style characteristic diagram of a target object in a target image;

determining a plurality of nerve radiation field vectors according to the style characteristic diagram; and

and generating an avatar corresponding to the target object according to the plurality of nerve radiation field vectors.

2. The method of claim 1, wherein the determining a style profile of a target object in a target image comprises:

determining style encoding data of the target object; and

and determining the style characteristic graph according to the style encoding data.

3. The method of claim 1, wherein the determining a plurality of nerve radiation field vectors from the style feature map comprises:

generating a style image according to the style characteristic diagram;

determining reconstruction parameters according to the style image;

fusing the style characteristic diagram with the reconstruction parameters to obtain a fused characteristic diagram; and

and determining a plurality of nerve radiation field vectors according to the fused feature map.

4. The method of claim 3, wherein the generating an avatar corresponding to the target object from the plurality of nerve radiation field vectors comprises:

obtaining an explicit three-dimensional expression of the target object according to the plurality of nerve radiation field vectors; and

rendering the explicit three-dimensional representation to generate an avatar corresponding to the target object.

5. The method of any of claims 1 to 4, wherein the target object comprises a head.

6. An avatar generation apparatus comprising:

the first determining module is used for determining a style characteristic diagram of a target object in a target image;

the second determining module is used for determining a plurality of nerve radiation field vectors according to the style characteristic diagram; and

and the generating module is used for generating an avatar corresponding to the target object according to the plurality of nerve radiation field vectors.

7. The apparatus of claim 6, wherein the first determining means comprises:

a first determining unit for determining style encoding data of the target object; and

and the second determining unit is used for determining the style characteristic diagram according to the style encoding data.

8. The apparatus of claim 6, wherein the second determining means comprises:

the first generating unit is used for generating a style image according to the style characteristic diagram;

a third determining unit, configured to determine a reconstruction parameter according to the style image;

the fusion unit is used for fusing the style characteristic diagram with the reconstruction parameters to obtain a fusion characteristic diagram; and

and the fourth determining unit is used for determining a plurality of nerve radiation field vectors according to the fusion characteristic map.

9. The apparatus of claim 8, wherein the generating means comprises:

the obtaining unit is used for obtaining the explicit three-dimensional expression of the target object according to the plurality of nerve radiation field vectors; and

and the second generation unit is used for rendering the explicit three-dimensional expression and generating an avatar corresponding to the target object.

10. The apparatus of any of claims 6 to 9, wherein the target object comprises a head.

11. An electronic device, comprising:

at least one processor; and

a memory communicatively coupled to the at least one processor; wherein,

the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1 to 5.

12. A non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of claims 1 to 5.

13. A computer program product comprising a computer program which, when executed by a processor, implements the method of any one of claims 1 to 5.