Background
Three-dimensional reconstruction is currently a focus of both scientific research and industry in the field of computer vision. Models obtained through three-dimensional reconstruction have high research and practical value in fields such as video games, architecture, and basic industry.
However, three-dimensional reconstruction of the human body together with its clothing remains a major challenge in the field of three-dimensional reconstruction, because human motions and clothing materials are so varied. Most existing human body reconstruction techniques reconstruct the body and the clothes as a single complete model, so the clothes cannot be physically simulated and modeled realistically.
Disclosure of Invention
The present invention is directed to solving, at least to some extent, one of the technical problems in the related art.
Therefore, one objective of the present invention is to provide a video human body three-dimensional reconstruction method based on garment modeling and simulation, which enables the joint motion of the human body and the clothes to be well reconstructed through physical simulation, and which is applicable to reconstructing the human body and clothes from human motion captured by a single RGB camera.
Another objective of the present invention is to provide a video human body three-dimensional reconstruction apparatus based on garment modeling and simulation.
In order to achieve the above objective, an embodiment of the present invention provides a video human body three-dimensional reconstruction method based on garment modeling and simulation, including: collecting human motion data with a single RGB camera, performing foreground and background segmentation on the data, and jointly estimating the human pose and body shape over multiple frames by a single-RGB human template estimation method; modeling the initial pose of the human body with a human template according to the estimated body shape, and simulating the sewing and dressing of the garments' initial two-dimensional cloth patterns onto the body in the initial pose; adjusting the garment parameters according to the collision information between the garments and the body under external forces, so that the simulated three-dimensional garments satisfy the fit condition; transitioning the person's pose to the pose of the first frame of the video while jointly physically simulating the three-dimensional garments; segmenting the different garments in the current frame of the video and fitting the parametrically modeled garments to the garment segmentation maps, so that the simulated three-dimensional garments match the garment edges of the current frame, and, for k key frames of the video, optimizing and solving the garment parameters from the multi-frame information; performing joint simulation modeling of the human pose and the garments for every frame of the video according to the garment parameters and the human pose; and computing and mapping textures for the modeled garments through the camera projection relation and the RGB information of the original video, and re-rendering the whole motion sequence with relighting to obtain the three-dimensional reconstruction result.
With the video human body three-dimensional reconstruction method based on garment modeling and simulation, the dressing and motion of the garments on the person can be simulated from the garments' two-dimensional cloth parameter information and fitted to the real images captured in the video, so that the person and the garments in a single RGB video can be modeled and simulated simultaneously; the joint motion of the human body and the clothes can thus be well reconstructed through physical simulation, and the method is applicable to reconstructing the human body and clothes from human motion captured by a single RGB camera.
In addition, the video human body three-dimensional reconstruction method based on garment modeling and simulation according to the above embodiment of the present invention may further have the following additional technical features:
Further, in an embodiment of the present invention, the segmentation map of each garment is a binary image with the same resolution as the original video.
Further, in one embodiment of the present invention, the garments and the human body are jointly simulated throughout the parameter adjustment and optimization process.
Further, in an embodiment of the present invention, optimally solving the garment parameters from multi-frame information includes: performing a distance-transform operation on the binary image, computing the two-norm distance from each pixel to the boundary of the binary image to form an image, and thresholding the resulting image.
Further, in one embodiment of the invention, the optimization term is the deviation between the rendered image and the segmentation image:
E(x_i) = || T(R_i(x_i)) - T(S_i) ||^2
where R_i(x_i) is the rendered image of the i-th garment under the parameter x_i, S_i is the segmentation image, and T denotes the distance-transform-and-threshold operation;
the optimization parameters are computed from the finite-difference gradient estimate:
dE/dx_i ≈ ( E(x_i + Δx_i) - E(x_i) ) / Δx_i
where Δx_i is the individual variation applied to each parameter.
In order to achieve the above objective, an embodiment of the present invention provides a video human body three-dimensional reconstruction apparatus based on garment modeling and simulation, including: an acquisition module for collecting human motion data with a single RGB camera, performing foreground and background segmentation on the data, and jointly estimating the human pose and body shape over multiple frames by a single-RGB human template estimation method; a first modeling module for modeling the initial pose of the human body with a human template according to the estimated body shape, and simulating the sewing and dressing of the garments' initial two-dimensional cloth patterns onto the body in the initial pose; an adjusting module for adjusting the garment parameters according to the collision information between the garments and the body under external forces, so that the simulated three-dimensional garments satisfy the fit condition; a transformation module for transitioning the person's pose to the pose of the first frame of the video while jointly physically simulating the three-dimensional garments; a solving module for segmenting the different garments in the current frame of the video and fitting the parametrically modeled garments to the garment segmentation maps, so that the simulated three-dimensional garments match the garment edges of the current frame, and, for k key frames of the video, optimizing and solving the garment parameters from the multi-frame information; a second modeling module for performing joint simulation modeling of the human pose and the garments for every frame of the video according to the garment parameters and the human pose; and a reconstruction module for computing and mapping textures for the modeled garments through the camera projection relation and the RGB information of the original video, and re-rendering the whole motion sequence with relighting to obtain the three-dimensional reconstruction result.
With the video human body three-dimensional reconstruction apparatus based on garment modeling and simulation, the dressing and motion of the garments on the person can be simulated from the garments' two-dimensional cloth parameter information and fitted to the real images captured in the video, so that the person and the garments in a single RGB video can be modeled and simulated simultaneously; the joint motion of the human body and the clothes can thus be well reconstructed through physical simulation, and the apparatus is applicable to reconstructing the human body and clothes from human motion captured by a single RGB camera.
In addition, the video human body three-dimensional reconstruction device based on garment modeling and simulation according to the above embodiment of the invention may further have the following additional technical features:
Further, in an embodiment of the present invention, the segmentation map of each garment is a binary image with the same resolution as the original video.
Further, in one embodiment of the present invention, the garments and the human body are jointly simulated throughout the parameter adjustment and optimization process.
Further, in an embodiment of the present invention, the solving module is further configured to perform a distance-transform operation on the binary image, compute the two-norm distance from each pixel to the boundary of the binary image to form an image, and threshold the resulting image.
Further, in one embodiment of the invention, the optimization term is the deviation between the rendered image and the segmentation image:
E(x_i) = || T(R_i(x_i)) - T(S_i) ||^2
where R_i(x_i) is the rendered image of the i-th garment under the parameter x_i, S_i is the segmentation image, and T denotes the distance-transform-and-threshold operation;
the optimization parameters are computed from the finite-difference gradient estimate:
dE/dx_i ≈ ( E(x_i + Δx_i) - E(x_i) ) / Δx_i
where Δx_i is the individual variation applied to each parameter.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
The following describes a video human body three-dimensional reconstruction method and device based on garment modeling and simulation according to an embodiment of the present invention with reference to the accompanying drawings, and first, a video human body three-dimensional reconstruction method based on garment modeling and simulation according to an embodiment of the present invention will be described with reference to the accompanying drawings.
Fig. 1 is a flowchart of a video human body three-dimensional reconstruction method based on garment modeling and simulation according to an embodiment of the present invention.
As shown in fig. 1, the video human body three-dimensional reconstruction method based on garment modeling and simulation comprises the following steps:
In step S101, human motion data is collected with a single RGB camera, foreground and background segmentation is performed on the data, and the human pose and body shape are jointly estimated over multiple frames by a single-RGB human template estimation method.
It can be understood that the embodiment of the invention collects human motion data with a single RGB camera, performs foreground and background segmentation on the collected data, and jointly estimates the pose and body shape of the person over multiple frames using an existing method for estimating a human template from a single RGB image.
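Because the body shape is shared across the whole sequence, the multi-frame joint estimate can be illustrated with a minimal sketch. The median-fusion strategy and the function name here are illustrative assumptions, not the patent's actual joint optimization:

```python
import numpy as np

def joint_shape_estimate(per_frame_shapes):
    """Fuse per-frame body-shape estimates into one shared body type S.

    The body type does not change across the video, so a robust
    per-dimension median over all frames suppresses frames where the
    single-image estimator failed. (Illustrative stand-in for a true
    multi-frame joint optimization.)
    """
    return np.median(np.stack(per_frame_shapes), axis=0)
```

A joint optimization over all frames would weight frames by estimator confidence; the median is simply a robust zeroth-order approximation of that idea.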
In step S102, the initial pose of the human body is modeled with a human template according to the estimated body shape, and the sewing and dressing of the garments' initial two-dimensional cloth patterns onto the body in the initial pose is simulated.
It can be understood that the embodiment of the invention models the initial pose of the human body with a human template according to the estimated body shape, and simulates sewing the garments' initial two-dimensional cloth patterns and dressing them onto the body in the initial pose.
In step S103, the garment parameters are adjusted according to the collision information between the garments and the human body under external forces, so that the simulated three-dimensional garments satisfy the fit condition.
It can be understood that the embodiment of the invention reasonably adjusts the garment parameters according to the collision information between the garments and the human body under external forces, making the simulated three-dimensional garments fit better.
In step S104, the person's pose is transitioned to the pose of the first frame of the video while the three-dimensional garments are jointly physically simulated.
It can be understood that the embodiment of the invention transitions the person's pose to the pose of the first frame of the video, so that the three-dimensional garments can be jointly physically simulated throughout this process.
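The pose transition can be sketched as a sequence of intermediate poses fed to the simulator one by one. Linear interpolation of the pose parameters is a simplifying assumption here (a real system would interpolate joint rotations with slerp):

```python
import numpy as np

def pose_transition(p_init, p_target, steps):
    """Interpolate pose parameters from the template's initial pose to
    the frame-1 pose, yielding `steps` intermediate poses so the garment
    simulation can follow the body gradually instead of jumping.
    """
    p_init = np.asarray(p_init, dtype=float)
    p_target = np.asarray(p_target, dtype=float)
    return [p_init + (p_target - p_init) * t / steps for t in range(1, steps + 1)]
```

Each intermediate pose is simulated for a few time steps before advancing, so the cloth never sees a discontinuous body motion.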
In step S105, the different garments in the current frame of the video are segmented, and the parametrically modeled garments are fitted to the garment segmentation maps so that the simulated three-dimensional garments satisfy the matching condition with the garment edges of the current frame; for k key frames of the video, the garment parameters are optimized and solved from the multi-frame information.
The matching condition may be understood as requiring the simulated three-dimensional garments to match the garment edge portions of the current frame in the video as closely as possible; of course, a person skilled in the art may set a specific matching condition according to the actual situation, which is not specifically limited herein.
Further, in an embodiment of the present invention, wherein the segmentation map of each piece of clothing is a binary image with the same size as the resolution of the original video.
Specifically, (1) for a frame of the video, the different garments are segmented by an existing method (such as Look Into Person), and the segmentation map of each garment is a binary image with the same resolution as the original video.
(2) The parametrically modeled garments are fitted to the garment segmentation maps, so that the simulated three-dimensional garments match the garment edge portions of that frame as closely as possible. Throughout the parameter adjustment and optimization process, the garments and the human body are jointly simulated.
(3) Steps (1) and (2) are repeated for k key frames of the video, and the garment parameters are optimally solved from the multi-frame information.
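The keyframe loop of steps (1)-(3) can be written as a small skeleton; `segment_garments` and `refine_parameters` are placeholders standing in for the learning-based segmenter and the parameter fit described above, not APIs from the patent:

```python
def optimize_over_keyframes(keyframes, segment_garments, refine_parameters, params):
    """Skeleton of the keyframe loop: for each of the k key frames,
    segment the garments (step (1)) and refine the cloth parameters
    against that frame's segmentation (step (2)), carrying the refined
    parameters forward so each key frame improves on the last.
    """
    for frame in keyframes:
        masks = segment_garments(frame)
        params = refine_parameters(params, masks)
    return params
```

Carrying `params` across key frames is what makes this a multi-frame optimization rather than k independent single-frame fits.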
In step S106, human body posture and clothes combined simulation modeling is carried out on each frame in the video according to the clothes parameters and the human body posture.
It can be understood that, according to the embodiment of the present invention, the human body posture and the clothes combined simulation modeling is performed on each frame in the video through the solved clothes parameters and the human body posture extracted from each video frame in step S101.
In step S107, texture computation and mapping are performed on the modeled garments according to the camera projection relation and the RGB information of the original video, and the entire motion sequence is re-rendered with relighting to obtain the three-dimensional reconstruction result.
In summary, the embodiment of the invention performs video-based human body three-dimensional reconstruction with garment modeling and simulation: the person and the garments are matched using the RGB and image-segmentation information, and the human body and garments are then three-dimensionally reconstructed from the solved model parameters.
The video human body three-dimensional reconstruction method based on garment modeling and simulation is further explained by the specific embodiment, and the specific steps are as follows:
In step S1, an RGB camera is used to capture the motion sequence of a single person, ensuring that every part of the person (e.g. the back) is captured at some point in the video.
In step S2, for each frame I_i of the RGB sequence, foreground and background segmentation is performed with a deep-learning-based semantic segmentation method, yielding the person segmentation map M_i.
In step S3, taking the person segmentation map M_i and the original image I_i as input, an existing deep-learning-based method for estimating human pose and body shape from a single RGB image is used to obtain the pose P_i of the skeleton-skinned human template and the body-shape information S.
In step S4, the clothing worn by the person (e.g. trousers/shorts/skirt) is roughly classified, and each garment is modeled with the corresponding parametric cloth information.
In step S5, the person is modeled and rendered with the body shape S and the template's initial pose. With the human pose held fixed, each modeled garment is split into a front part and a back part, and under the physical simulation system an attractive force is applied at the seam positions of the two parts so that the garment is fully dressed onto the body.
In step S6, gravity, collision forces against the human body, and internal constraint forces are applied to the garments, so that the simulated garments come closer to reality.
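The particle-based garment simulation with gravity and internal (spring) constraints can be sketched in a few lines. This is a minimal explicit-Euler mass-spring step; collision handling against the body and the specific force model are omitted and the interface is an illustrative assumption:

```python
import numpy as np

def simulate_step(pos, vel, springs, rest, k, mass, dt, g=9.8):
    """One explicit-Euler step of a toy mass-spring cloth model.

    pos, vel: (n, 3) particle positions/velocities; springs: list of
    (i, j) index pairs with rest lengths `rest`; k: spring stiffness.
    Gravity acts along -z. Collision with the human body is omitted
    for brevity.
    """
    force = np.zeros_like(pos)
    force[:, 2] -= mass * g                       # gravity on every particle
    for (i, j), r0 in zip(springs, rest):
        d = pos[j] - pos[i]
        length = np.linalg.norm(d)
        f = k * (length - r0) * d / length        # Hooke spring force
        force[i] += f
        force[j] -= f
    vel = vel + dt * force / mass
    return pos + dt * vel, vel
```

Production cloth solvers use implicit integration for stability at large stiffness; explicit Euler is used here only to keep the sketch short.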
In step S7, the collision information between the simulated garments and the human body is monitored; if a garment is detected to be gathered too tightly against some body part, i.e. the collision response there is too strong, the corresponding cloth parameters are relaxed.
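The relaxation rule can be sketched as a simple heuristic. The per-body-part pressure measure, the part names, the threshold, and the enlargement rate are all illustrative assumptions:

```python
def relax_cloth_parameters(sizes, collision_pressure, threshold=1.0, rate=0.05):
    """Heuristic relaxation: if the measured cloth-body collision
    pressure on a body part exceeds a threshold (the garment is too
    tight there), enlarge the corresponding 2D pattern size parameter
    by a small rate; otherwise leave it unchanged.
    """
    return {
        part: size * (1.0 + rate) if collision_pressure.get(part, 0.0) > threshold else size
        for part, size in sizes.items()
    }
```

Repeating this after each simulation round gives a gradual loosening until the fit condition is met.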
In step S8, the person's pose is transitioned to the pose P_1 estimated for the first frame of the video; during this and all subsequent steps, the garment simulation keeps running.
In step S9, for frame F_1 of the video, the different garments are segmented with a deep-learning-based human parsing method; the segmentation image of each garment is a binary image with the same resolution as the original video. Denote the segmentation image of the i-th garment by S_i (i = 1, ..., N), where N is the number of garments worn by the person.
In step S10, the cloth parameters are solved. Suppose that under the parameter x_i the rendered image (binary, foreground against background) of the i-th garment is R_i(x_i) and its segmentation image is S_i; the goal is to fit the boundaries of the two images. The specific form is as follows: first, a distance-transform operation is performed on a binary image C, computing the two-norm distance from each pixel to the boundary of C to form an image D, and D is then thresholded; denote this combined operation by T.
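The distance transform and thresholding just described can be sketched as follows. This brute-force version is quadratic-time and for illustration only; the boundary definition (foreground pixel with a background 4-neighbour) is an assumption about the intended boundary:

```python
import numpy as np

def boundary_distance_map(mask):
    """Two-norm distance from every pixel to the nearest boundary pixel
    of the binary mask, where a boundary pixel is a foreground pixel
    with at least one background 4-neighbour. Production code would use
    a linear-time distance transform instead of this O(n^2) scan.
    """
    h, w = mask.shape
    fg = mask.astype(bool)
    padded = np.pad(fg, 1, constant_values=False)
    boundary = fg & (
        ~padded[:-2, 1:-1] | ~padded[2:, 1:-1] |
        ~padded[1:-1, :-2] | ~padded[1:-1, 2:]
    )
    ys, xs = np.nonzero(boundary)
    if ys.size == 0:
        return np.full((h, w), np.inf)
    gy, gx = np.mgrid[0:h, 0:w]
    d2 = (gy[..., None] - ys) ** 2 + (gx[..., None] - xs) ** 2
    return np.sqrt(d2.min(axis=-1))

def threshold_distance(dist_map, tau):
    """Thresholding step: clamp distances at tau, so that only a band
    around the boundary carries gradient information."""
    return np.minimum(dist_map, tau)
```

Applying both functions to a binary image implements the operation T used in the optimization term.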
The required optimization term is then the deviation between the rendered image and the segmentation image under this operation:
E(x_i) = || T(R_i(x_i)) - T(S_i) ||^2
Since R_i(x_i) has no explicit expression, when estimating the gradient we let x_i ← x_i + Δx_i and obtain, by re-running the simulation,
dE/dx_i ≈ ( E(x_i + Δx_i) - E(x_i) ) / Δx_i
Taking Δx_i as the individual variation of each parameter, the optimized parameter x_i can be solved iteratively with the Gauss-Newton method.
In step S11, for each key frame F_j in the video, the operations of steps S9 and S10 are repeated, so that the garment parameters are progressively optimized from the garment segmentation information of each frame.
In step S12, the person and garments of every frame are simulated and reconstructed using the optimized parameters and the per-frame human pose estimated earlier.
In step S13, the projection position of each vertex on each garment is computed from the camera projection relation and the RGB information of the original video, the texture information of the vertices and faces is obtained from the RGB values, and the texture is progressively updated and refined with the information of each frame.
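The per-vertex texture lookup can be sketched with a pinhole camera model. The pinhole intrinsics and nearest-neighbour sampling are assumptions; the patent only states that the camera projection relation and the original RGB information are used:

```python
import numpy as np

def project_vertex(vertex_cam, K):
    """Project a 3D garment vertex (already in camera coordinates) to
    pixel coordinates with a pinhole intrinsic matrix K."""
    p = K @ np.asarray(vertex_cam, dtype=float)
    return p[:2] / p[2]

def sample_vertex_color(image, uv):
    """Nearest-neighbour lookup of the RGB value at the projected pixel,
    giving this frame's texture sample for the vertex (clamped to the
    image bounds)."""
    h, w = image.shape[:2]
    x = min(max(int(round(uv[0])), 0), w - 1)
    y = min(max(int(round(uv[1])), 0), h - 1)
    return image[y, x]
```

Accumulating these samples across frames, weighted by visibility, is one way to realize the progressive texture update the step describes.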
In step S14, the garments are texture-mapped, relighting is applied in the renderer, and the whole sequence is re-rendered, completing the reconstruction of the person and the garments.
In summary, the method of the embodiment of the invention captures human motion with a camera, extracts the person's pose and body-shape information from the images together with the segmentation information of the different garments, and thereby models the human body and the garments on its surface, physically simulating the garments and their textures according to the body's motion. The human body reconstruction is based on template matching, solving the human pose and body shape with an existing method for estimating a human model from a single RGB image; the garment simulation mainly uses a particle-based method, performing physical simulation and modeling by adding external forces and internal-force constraints.
With the video human body three-dimensional reconstruction method based on garment modeling and simulation according to the embodiment of the invention, the dressing and motion of the garments on the person can be simulated from the garments' two-dimensional cloth parameter information and fitted to the real images captured in the video, so that the person and the garments in a single RGB video can be modeled and simulated simultaneously; the joint motion of the human body and the clothes can thus be well reconstructed through physical simulation, and the method is applicable to reconstructing the human body and clothes from human motion captured by a single RGB camera.
Next, a video human body three-dimensional reconstruction device based on garment modeling and simulation according to an embodiment of the present invention will be described with reference to the drawings.
Fig. 2 is a schematic structural diagram of a video human body three-dimensional reconstruction device based on garment modeling and simulation according to an embodiment of the invention.
As shown in fig. 2, the video human body three-dimensional reconstruction apparatus 10 based on garment modeling and simulation includes: an acquisition module 100, a first modeling module 200, an adjustment module 300, a transformation module 400, a solving module 500, a second modeling module 600, and a reconstruction module 700.
The acquisition module 100 is configured to collect human motion data with a single RGB camera, perform foreground and background segmentation on the data, and jointly estimate the human pose and body shape over multiple frames by a single-RGB human template estimation method. The first modeling module 200 is configured to model the initial pose of the human body with a human template according to the estimated body shape, and to simulate the sewing and dressing of the garments' initial two-dimensional cloth patterns onto the body in the initial pose. The adjusting module 300 is configured to adjust the garment parameters according to the collision information between the garments and the body under external forces, so that the simulated three-dimensional garments satisfy the fit condition. The transformation module 400 is configured to transition the person's pose to the pose of the first frame of the video while jointly physically simulating the three-dimensional garments. The solving module 500 is configured to segment the different garments in the current frame of the video and fit the parametrically modeled garments to the garment segmentation maps, so that the simulated three-dimensional garments match the garment edges of the current frame, and, for k key frames of the video, to optimize and solve the garment parameters from the multi-frame information. The second modeling module 600 is configured to perform joint simulation modeling of the human pose and the garments for every frame of the video according to the garment parameters and the human pose.
The reconstruction module 700 is configured to perform texture calculation and mapping on the modeled clothing through the camera projection relationship and RGB information in the original video, and re-render the entire motion sequence through relighting to obtain a three-dimensional reconstruction result. The device 10 of the embodiment of the invention can enable the combined motion of the human body and the clothes to be well reconstructed by a physical simulation method, and can be suitable for reconstructing the human body and the clothes of the single RGB collected human body motion.
Further, in an embodiment of the present invention, the segmentation map of each garment is a binary image with the same resolution as the original video.
Further, in one embodiment of the present invention, the garments and the human body are jointly simulated throughout the parameter adjustment and optimization process.
Further, in an embodiment of the present invention, the solving module is further configured to perform a distance-transform operation on the binary image, compute the two-norm distance from each pixel to the boundary of the binary image to form an image, and threshold the resulting image.
Further, in one embodiment of the invention, the optimization term is the deviation between the rendered image and the segmentation image:
E(x_i) = || T(R_i(x_i)) - T(S_i) ||^2
where R_i(x_i) is the rendered image of the i-th garment under the parameter x_i, S_i is the segmentation image, and T denotes the distance-transform-and-threshold operation;
the optimization parameters are computed from the finite-difference gradient estimate:
dE/dx_i ≈ ( E(x_i + Δx_i) - E(x_i) ) / Δx_i
where Δx_i is the individual variation applied to each parameter.
It should be noted that the foregoing explanation of the embodiment of the video three-dimensional human body reconstruction method based on garment modeling and simulation is also applicable to the video three-dimensional human body reconstruction apparatus based on garment modeling and simulation of this embodiment, and details are not repeated here.
With the video human body three-dimensional reconstruction apparatus based on garment modeling and simulation according to the embodiment of the invention, the dressing and motion of the garments on the person can be simulated from the garments' two-dimensional cloth parameter information and fitted to the real images captured in the video, so that the person and the garments in a single RGB video can be modeled and simulated simultaneously; the joint motion of the human body and the clothes can thus be well reconstructed through physical simulation, and the apparatus is applicable to reconstructing the human body and clothes from human motion captured by a single RGB camera.
Furthermore, the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.