CN113052825A

CN113052825A - Real-time three-dimensional deformation measurement method based on GPU parallel acceleration

Info

Publication number: CN113052825A
Application number: CN202110333748.3A
Authority: CN
Inventors: 董守斌; 林傲宇; 蒋震宇
Original assignee: South China University of Technology SCUT
Current assignee: South China University of Technology SCUT
Priority date: 2021-03-29
Filing date: 2021-03-29
Publication date: 2021-06-29
Anticipated expiration: 2041-03-29
Also published as: CN113052825B

Abstract

本发明公开了一种基于GPU并行加速的实时三维变形测量方法，包括以下步骤：1)通过立体标定获得投影矩阵；2)拍摄物体变形前后的图像；3)选择兴趣区域和兴趣点；4)传输投影矩阵、图像和兴趣点到GPU；5)计算兴趣点变形前的三维坐标；6)计算兴趣点在变形中各个时刻的三维变形；7)将三维变形数据传输回CPU。该方法通过将变形前左相机图像作为所有匹配中的参考图像，从而兴趣点对应的Hessian矩阵等IC‑GN预计算的数据得以复用；基于CUDA异构计算平台开发的GPU加速变形测量程序可以发挥GPU硬件设备的计算性能，针对GPU程序的访存等优化技术使得三维变形测量的计算速度大大提高，满足了实时三维变形测量的需求。The invention discloses a real-time three-dimensional deformation measurement method based on GPU parallel acceleration. Transfer the projection matrix, the image and the interest point to the GPU; 5) Calculate the three-dimensional coordinates of the interest point before deformation; 6) Calculate the three-dimensional deformation of the interest point at each moment in the deformation; 7) Transfer the three-dimensional deformation data back to the CPU. This method uses the left camera image before deformation as the reference image in all matching, so that the IC-GN pre-computed data such as the Hessian matrix corresponding to the interest point can be reused; the GPU-accelerated deformation measurement program developed based on the CUDA heterogeneous computing platform can Taking advantage of the computing performance of GPU hardware devices and optimizing the memory access of GPU programs, the calculation speed of 3D deformation measurement is greatly improved, which meets the needs of real-time 3D deformation measurement.

Description

Real-time three-dimensional deformation measurement method based on GPU parallel acceleration

Technical Field

The invention relates to the technical field of optical measurement, in particular to a real-time three-dimensional deformation measurement method based on GPU parallel acceleration.

Background

In the fields of science and engineering, the three-dimensional digital image correlation method is widely applied to three-dimensional deformation measurement due to the advantages of simple device, non-contact type and the like. The three-dimensional digital image correlation method can measure the three-dimensional appearance and the full-field three-dimensional deformation of a curved surface object, and has very rich application scenes. However, since it directly processes a high-resolution digital image, the amount of calculation thereof is also considerable, with a problem that the calculation takes a long time. With the development of digital image acquisition technology, the image resolution and sampling rate are improved, and the problem is more prominent, so that the application of a three-dimensional digital image correlation method in some real-time monitoring scenes and the like is limited. Secondly, as a measuring method, it is also important to maintain its high accuracy.

In recent years, researchers have made great efforts to improve the computational efficiency of three-dimensional digital image correlation methods. However, the results of these studies are not satisfactory and always give a trade-off between accuracy and efficiency. There are a number of existing schemes that improve computational efficiency by optimizing correlation algorithms, reducing redundant computations, and using multi-threading techniques to speed up programs. However, because the computing capability of the multi-core processor is limited and the adopted computing strategy is simple, the highest computing speed reported at present is only 50000POI/s, and the requirement of real-time processing is far from being met; the speed is achieved on the premise of only using a simple first-order shape function, and the precision is not as good as that of a scheme using a second-order shape function.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provides a real-time three-dimensional deformation measuring method based on GPU parallel acceleration. In all matching, the left camera image L before deformation is used₀As a reference image, the pre-calculation data of the IC-GN algorithm can be repeatedly used, so that a large amount of calculation is saved. The method has the advantages that the algorithm is accelerated by developing a GPU program based on CUDA, the computing capability of hardware is fully exerted, real-time three-dimensional deformation measurement is achieved, the computing speed can exceed 40 frames per second when the number of interest points is about 10000, and the problems that the computing speed of an existing three-dimensional deformation measurement method is low and real-time measurement requirements cannot be met are solved.

In order to achieve the purpose, the technical scheme provided by the invention is as follows: a real-time three-dimensional deformation measurement method based on GPU parallel acceleration comprises the following steps:

1) using Zhangyingyou scaling method to make stereo scaling on left and right fixed cameras to obtain projection matrix M of left and right cameras_L、M_R；

2) Synchronously shooting left and right images L of the surface of the target object before deformation by using the two cameras₀、 R₀And the left and right images L at the ith moment in the deformation process_i、R_iWherein i ═ 1,2,3, …, n;

3) in the image L₀Selecting an interest area, and taking a batch of interest points P at equal intervals in the interest area_L0；

4) Copying a projection matrix of the camera, all shot images and interest points to a GPU;

5) on the GPU, for each interest point P_L0Firstly, an IC-GN algorithm is used for searching the image R₀Corresponding point P on_R0Then using P_L0、P_R0The coordinates of the two points are calculated by a triangulation method to obtain the three-dimensional coordinates P of the interest point before deformation_W0；

6) On the GPU, the following operations are carried out on each time i in the deformation process: first, for each point of interest P_L0Use the IC-GN algorithm to find it in the image L_i、R_iCorresponding point P on_LiAnd P_RiThen using P_Li、P_RiThe coordinates of the two points are calculated by triangulation to obtain the three-dimensional coordinates P of the interest point at the moment i_wiFinally with P_WiMinus P_W0Obtaining three-dimensional deformation data D of the interest point at the moment i_Wi；

7) And copying the three-dimensional deformation data of all the interest points at each moment back to the CPU, so as to obtain the three-dimensional deformation of the surface of the object at each moment.

In step 1), the projection matrix has a size of 3 × 4, which represents the relationship between the three-dimensional space coordinates and the two-dimensional coordinates on the camera image.

In step 3), the interest area refers to an area which needs to be measured and is designated by a user, and the interest point interval in the area is also selected according to the measurement requirement.

In step 4), the shot image is firstly stored in a page-locked memory allocated by using a runtime function cudamallocost in the CUDA, and then the data is copied to the GPU by using a runtime function cudaMemcpy in the CUDA; the camera parameters are copied to the constant memory of the GPU, enabling it to be accessed at high speed.

In step 5) and step 6), the IC-GN algorithm is used, wherein the input is a batch of interest points, reference images and target images, and a CUDA thread block is used to process a computation task corresponding to an interest point, including the following:

a. carrying out precalculation: for each interest point, calculating a corresponding Hessian matrix and storing data; using a CUDA thread block to complete the calculation task of each interest point, wherein the Hessian matrix is

Representing the accumulation of all pixel positions within a reference sub-area, which is a 33 x 33 sub-image centered at the point of interest; ψ is the coordinate of the point of interest, # is the local coordinate in the reference sub-area, # R (ψ + ζ) is the gradient of the reference image,

is a Jacobian matrix, and T represents the transposition of the matrix;

b. and (c) for each interest point, setting the coordinate of the interest point as (x, y), and estimating the initial value p of the deformation vector of the interest point by using an image feature assisted method (u, u)_x,u_y,u_xx,u_xy,u_yy,v,v_x,v_y,v_xx,v_xy,v_yy) (ii) a Wherein u, v are translation amounts; u. of_x,u_y,v_x,v_yIs the first order gradient component; u. of_xx,u_xy,u_yy,v_xx,v_xy,v_yyIs the second order gradient component; here, a CUDA thread is used to accomplish a programCalculating the interest points;

c. for each interest point, iteratively updating the corresponding deformation vector p according to the following steps:

c1, calculating the deformation vector increment delta p:

wherein H-¹The inverse of the Hessian matrix is represented,

and

normalized coefficients for the reference and target sub-regions respectively,

representing the target sub-area after subtraction of the mean value of the gray levels,

representing the reference subarea after subtracting the gray mean value, and W (zeta; p) represents a transformation function from the local coordinate of the reference subarea to the local coordinate of the target subarea;

c2, calculating new transformation function W (ζ; p') as W (W) by using Δ p^-1(ζ; Δ p); p) wherein W^-1(ζ; Δ p) is the inverse of W (ζ; Δ p); then extracting a new deformation vector p 'from the new transformation function W (zeta; p');

c3, updating p, namely making p equal to p';

c4, repeating c1 to c3 until | | | Δ p | | < 0.001, and | | · | | | represents the modular length of the vector;

d. and (3) calculating the coordinates of the interest point at the corresponding point of the target map as (x ', y'), wherein x 'is x + u, and y' is y + v.

In step 6), the IC-GN algorithm used does not need to be pre-calculated, and the data stored after the IC-GN algorithm is pre-calculated in step 5) is directly used.

In step 5) and step 6), triangulation methods are used, including the following:

a. let the coordinate of the interest point on the left view be (x)_L,y_L) The point coordinate on the right view is (x)_R,y_R) The corresponding three-dimensional coordinate is (X)_W,Y_W,Z_W)；

b. (X) is calculated in the following manner_W,Y_W,Z_W) Each CUDA thread calculates a three-dimensional coordinate;

wherein A is a coefficient matrix, and

b is a matrix of constant terms, an

m^L _rcRepresenting a projection matrix M_LM in the r-th row and c-th column^R _rcRepresenting a projection matrix M_RWhere r is 1,2,3, and c is 1,2,3, 4.

Compared with the prior art, the invention has the following advantages and beneficial effects:

1. in all matching of three-dimensional deformation measurement, the invention leads the left camera image L before deformation to be₀As a reference image, data obtained by IC-GN pre-calculation such as Hessian matrix and the like corresponding to the interest points can be repeatedly utilized in the processing of images at all times in the three-dimensional deformation process, so that a large amount of calculation is saved, the processing speed is accelerated, and the calculation time is shortened.

2. The GPU program is developed based on the CUDA platform, so that development cost can be saved, development difficulty is reduced, meanwhile, the special CUDA program and corresponding program optimization can furthest exert the calculation performance of GPU hardware, and the requirement of measuring three-dimensional deformation in real time is met.

Detailed Description

The present invention will be described in further detail with reference to examples, but the embodiments of the present invention are not limited thereto.

The real-time three-dimensional deformation measurement method based on GPU parallel acceleration provided by the embodiment comprises the following steps:

1) using a Zhangyingyou calibration method to carry out three-dimensional calibration on the left and right fixed cameras to obtain the projection matrixes M of the left and right cameras_L、M_R。

2) Synchronously shooting left and right images L of the surface of the target object before deformation by using the two cameras₀、 R₀And the left and right images L at the ith moment in the deformation process_i、R_iWhere i is 1,2,3, …, n.

3) In the image L₀Selecting an interest area, and taking a batch of interest points P at equal intervals in the interest area_L0(ii) a The interest area refers to an area which is specified by a user and needs to be measured, and the interest point interval in the area is also selected according to the measurement requirement.

4) Copying a projection matrix of the camera, all shot images and interest points to a GPU; the method comprises the steps that a shot image is stored in a page locking memory distributed by using a runtime function cudaMallocost in a CUDA (compute unified device architecture), and then data are copied to a GPU (graphics processing unit) by using a runtime function cudaMemcpy function in the CUDA; the projection matrix of the camera is copied to the constant memory of the GPU, allowing it to be accessed at high speed.

5) On the GPU, for each interest point P_L0Firstly, an IC-GN algorithm is used for searching the image R₀Corresponding point P on_R0Then using P_L0、P_R0The coordinates of the two points are calculated by a triangulation method to obtain the three-dimensional coordinates P of the interest point before deformation_W0。

6) On the GPU, the following operations are carried out on each time i in the deformation process: first, for each point of interest P_L0Use the IC-GN algorithm to find it in the image L_i、R_iCorresponding point P on_LiAnd P_RiThen using P_Li、P_RiThe coordinates of the two points are calculated by triangulation to obtain the three-dimensional coordinate of the interest point at the time iMark P_wiFinally with P_WiMinus P_W0Obtaining three-dimensional deformation data D of the interest point at the moment i_Wi(ii) a The IC-GN algorithm used does not need to be pre-calculated, and the data stored after the IC-GN algorithm is pre-calculated in the step 5) is directly utilized.

is a Jacobian matrix, and T represents the transposition of the matrix;

b. and (c) for each interest point, setting the coordinate of the interest point as (x, y), and estimating the initial value p of the deformation vector of the interest point by using an image feature assisted method (u, u)_x,u_y,u_xx,u_xy,u_yy,v,v_x,v_y,v_xx,v_xy,v_yy) (ii) a Wherein u, v are translation amounts; u. of_x,u_y,v_x,v_yIs the first order gradient component; u. of_xx,u_xy,u_yy,v_xx,v_xy,v_yyIs twoAn order gradient component; here, a CUDA thread is used to perform a point of interest computing task;

c1, calculating the deformation vector increment delta p:

wherein H-¹The inverse of the Hessian matrix is represented,

and

normalized coefficients for the reference and target sub-regions respectively,

c3, updating p, namely making p equal to p';

a. setting the point of interest atThe coordinates on the left view are (x)_L,y_L) The point coordinate on the right view is (x)_R,y_R) The corresponding three-dimensional coordinate is (X)_W,Y_W,Z_W)；

wherein A is a coefficient matrix, and

b is a matrix of constant terms, an

The above embodiments are preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims

1. a real-time three-dimensional deformation measurement method based on GPU parallel acceleration, is characterized in that, comprises the following steps:

1) Use Zhang Zhengyou's calibration method to perform stereo calibration on the left and right fixed cameras to obtain the projection matrices _ML and _MR of the left and right cameras;

2) Use the above two cameras to simultaneously capture the left and right images L ₀ , R ₀ of the surface of the target object before deformation and the left and right images Li and R _i at the _ith moment during the deformation process, where i=1,2 ,3,…,n;

3) Select a region of interest on the image L ₀ , and take a batch of interest points P _L0 at medium intervals in the region;

4) Copy the camera's projection matrix, all captured images and points of interest to the GPU;

5) On the GPU, for each point of interest P _L0 , first use the IC-GN algorithm to find its corresponding point P _R0 on the image R ₀ , and then use the coordinates of the two points P _L0 and P _R0 to measure by triangulation The three-dimensional coordinate P _W0 of the interest point before the deformation is obtained by calculating the method;

6) On the GPU, perform the following operations on each moment _i in the deformation process: First, use the IC-GN algorithm for each interest point P _L0 to find its corresponding points P _Li and P on the images Li and Ri _. _Ri , then use the coordinates of the two points P _Li and P _Ri to calculate the three-dimensional coordinate P _wi of the point of interest at time i by triangulation, and finally subtract P _W0 from P _Wi to obtain the three-dimensional deformation data of the point of interest at time i D _Wi ;

7) Copy the three-dimensional deformation data of all interest points at each moment back to the CPU, and then the three-dimensional deformation of the object surface at each moment can be obtained.

2. a kind of real-time three-dimensional deformation measurement method based on GPU parallel acceleration according to claim 1, is characterized in that: in step 1), the projection matrix size is 3 × 4, represents two-dimensional coordinates on three-dimensional space coordinates and camera image relationship between coordinates.

3. a kind of real-time three-dimensional deformation measurement method based on GPU parallel acceleration according to claim 1, is characterized in that: in step 3) in, the area of interest refers to the area that needs to be measured that the user specifies, the interest point interval in the area It is also selected according to the measurement needs.

4. a kind of real-time three-dimensional deformation measurement method based on GPU parallel acceleration according to claim 1, is characterized in that: in step 4), the image of taking is first stored in the lock that the runtime function cudaMallocHost in use CUDA distributes In the page memory, and then use the runtime function cudaMemcpy function in CUDA to copy the data to the GPU; the camera's projection matrix is copied to the constant memory of the GPU, so that it can be accessed at high speed.

5. a kind of real-time three-dimensional deformation measurement method based on GPU parallel acceleration according to claim 1, is characterized in that: in step 5) and step 6), in the IC-GN algorithm of use, input is a batch of interest points , the reference image and the target image, using a CUDA thread block to process the computation tasks corresponding to a point of interest, including the following:

a. Perform pre-calculation: For each interest point, calculate the corresponding Hessian matrix and store the data; use a CUDA thread block to complete the calculation task of each interest point, where the Hessian matrix is

Represents the accumulation of all pixel positions in the reference sub-region. The reference sub-region is a sub-image with a size of 33 × 33 centered on the point of interest; ψ is the coordinate of the point of interest, ζ is the local coordinate in the reference sub-region,

is the gradient of the reference image,

is the Jacobian matrix, and T represents the transpose of the matrix;

b. For each interest point, set its coordinates as (x, y), and use the image feature-assisted method to estimate the initial value of its deformation vector p=(u, u _x , u _y , u _xx , u _xy , u _yy ,v,v _x ,v _y ,v _xx ,v _xy ,v _yy ); where u,v are translations; u _x ,u _y ,v _x ,v _y are first-order gradient components; u _xx ,u _xy , u _yy , v _xx , v _xy , v _yy are the second-order gradient components; here a CUDA thread is used to complete the calculation task of a point of interest;

c. For each interest point, iteratively update its corresponding deformation vector p according to the following steps:

c1. Calculate the deformation vector increment Δp:

where H ^-1 represents the inverse of the Hessian matrix,

and

are the normalization coefficients of the reference sub-region and the target sub-region, respectively,

represents the target sub-area after subtracting the gray mean value,

represents the reference sub-region after subtracting the gray mean value, W(ζ; p) represents the transformation function from the local coordinates of the reference sub-region to the local coordinates of the target sub-region;

c2. Use Δp to calculate a new transformation function W(ζ; p')=W(W ^-1 (ζ; Δp); p), where W ^-1 (ζ; Δp) is the inverse transformation of W(ζ; Δp) ; then extract a new deformation vector p' from the new transformation function W(ζ; p');

c3, update p, that is, let p=p';

c4. Repeat c1 to c3 until ||Δp||<0.001, and ||·|| represents the modulo length of the vector;

d. Calculate the coordinates of the corresponding point of the interest point in the target image as (x', y'), where x'=x+u, y'=y+v.

6. a kind of real-time three-dimensional deformation measurement method based on GPU parallel acceleration according to claim 1, is characterized in that: in step 6), the IC-GN algorithm of use need not carry out pre-calculation, directly utilizes step 5) IC-GN The data stored after precomputing the GN algorithm.

7. a kind of real-time three-dimensional deformation measurement method based on GPU parallel acceleration according to claim 1, is characterized in that: in step 5) and step 6), the triangulation method of using, comprises the following content:

a. Let the coordinates of the point of interest on the left view be (x _L , y _L ), the coordinates of the point on the right view be (x _R , y _R ), and the corresponding three-dimensional coordinates are (X _W , Y _W , Z _W );

b. Calculate (X _W , Y _W , Z _W ) as follows, and each CUDA thread calculates a three-dimensional coordinate;

where A is the coefficient matrix, and

b is a constant term matrix, and

m ^L _rc represents the element in the rth row and cth column of the projection matrix _ML , m ^R _rc represents the element in the rth row and the cth column of the projection matrix _MR , where r=1, 2, 3, c=1 ,2,3,4.