CN113095229A

CN113095229A - Unsupervised domain self-adaptive pedestrian re-identification system and method

Info

Publication number: CN113095229A
Application number: CN202110399589.7A
Authority: CN
Inventors: 李佳函; 李云龙; 程德强; 寇旗旗; 张皓翔; 韩成功; 徐进洋; 江曼; 刘瑞航
Original assignee: Jiangsu Huatu Mining Technology Co ltd; China University of Mining and Technology CUMT
Current assignee: China University of Mining and Technology CUMT
Priority date: 2021-04-14
Filing date: 2021-04-14
Publication date: 2021-07-09
Anticipated expiration: 2041-04-14
Also published as: CN113095229B

Abstract

The invention relates to an unsupervised domain self-adaptive pedestrian re-identification system and method, belongs to the technical field of pedestrian re-identification, and solves the problems of high difficulty and low identification accuracy rate of the existing unsupervised domain self-adaptive pedestrian re-identification. The system comprises a data acquisition module, a data acquisition module and a data processing module, wherein the data acquisition module is used for acquiring a plurality of source domain sample subsets and a plurality of target domain sample subsets; the network model training module is used for obtaining a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset to obtain a layering loss function, and further performing iterative optimization on the pedestrian re-recognition network model; and the re-identification module is used for re-identifying the pedestrians by utilizing the optimized pedestrian re-identification network model to obtain the images which are the same as or similar to the images of the pedestrians to be identified. The system can reduce the migration loss of the network and improve the accuracy of pedestrian re-identification.

Description

Unsupervised domain self-adaptive pedestrian re-identification system and method

Technical Field

The invention relates to the technical field of pedestrian re-identification, in particular to an unsupervised domain self-adaptive pedestrian re-identification system and method.

Background

In conventional unsupervised domain adaptive learning, most methods are used in a closed set scenario where sample classes are consistent in the source domain as well as in the target domain. However, the conventional unsupervised domain adaptive algorithm cannot be used for unsupervised domain adaptive pedestrian re-recognition, because due to image acquisition, the pedestrian categories of the source domain and the target domain are almost different, and a large migration loss is generated when a network trained by using source domain images migrates to the target domain for recognition. The large migration loss caused by network migration makes unsupervised domain adaptive pedestrian re-identification more challenging than most unsupervised learning.

In the prior art, unsupervised domain self-adaptive pedestrian re-identification mainly achieves the purpose of distinguishing label-free data sets through self-clustering of target domains. In the prior art, three invariance losses based on a target domain are proposed to reduce the migration loss of the network, but marked data are not fully utilized, and deep semantic information of a non-tag picture is not fully mined, so that the migration loss of the network cannot be efficiently reduced, and the accuracy of pedestrian re-identification needs to be improved.

In the prior art, at least the following defects exist, the source domain picture information and the target domain picture information cannot be fully utilized, and the network distinguishing learning cannot be carried out on negative samples with different similarities according to the similarities between the source domain picture and the target domain picture, so that the migration loss of a pedestrian re-identification network is large, and the accuracy of the result of the pedestrian re-identification is low.

Disclosure of Invention

In view of the foregoing analysis, embodiments of the present invention provide an unsupervised domain adaptive pedestrian re-identification system and method, so as to solve the problems of large migration loss of a pedestrian re-identification network and low accuracy of a result of pedestrian re-identification in the prior art.

In one aspect, the present invention provides an unsupervised domain adaptive pedestrian re-identification system, including:

a data acquisition module for acquiring a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;

the network model training module is used for obtaining a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;

and the re-identification module is used for identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the pedestrian image to be identified.

Further, the pedestrian re-identification network model comprises a residual error network structure, a full connection layer and a Softmax normalization layer which are connected in sequence and correspond to the classification loss function, a target domain memory and an L, wherein the target domain memory and the L are connected in sequence and correspond to the sample invariance loss function₂The system comprises a normalization layer, a source domain memory and a similarity measurement axis network structure which are sequentially connected and correspond to a hierarchical loss function, wherein the residual error network structure is respectively connected with the full connection layer, the target domain memory, the source domain memory and the similarity measurement axis network structure.

Further, the network model training module obtains the hierarchical loss function specifically by:

respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;

based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;

obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:

wherein L is_SLRepresenting the hierarchical loss function, n_tRepresenting the number of pedestrian pictures, w, in the target field sample subset_t,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, x_t,iRepresenting target domain sample subset input pedestrian re-recognitionInputting a pedestrian picture with the sequence of i, f (x) in the network model_t,i) Picture x representing pedestrian_t,iP (m | x)_t,i) Picture x representing pedestrian_t,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R₁[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples₁[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, N_sAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.

Further, the layer weights of the three layers of pictures are set as follows:

where b denotes the number of first layer pictures, k₁Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture x_t,iProbability of class i pedestrian pictures in all target domain sample subsets, R₁[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, N_tRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.

Further, the network model training module obtains a classification loss function specifically by the following means:

inputting the source domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the source domain sample subset;

sequentially inputting the characteristics of each pedestrian picture into a full connection layer and a softmax regression layer of a pedestrian re-identification network model, and performing characteristic dimension conversion and characteristic normalization;

classifying a loss function by adopting the following formula based on the characteristics of each pedestrian picture after dimension conversion and normalization;

wherein L is_srcRepresenting the classification loss function, n_sRepresenting the number of pedestrian pictures, x, in a subset of source domain samples_s,mRepresents the m-th pedestrian picture in the source domain sample subset, f (x)_s,m) Picture x representing pedestrian_s,mCharacteristic of (a), y_s,mRepresenting a pedestrian picture x in a subset of source domain samples_s,mClass label of p (y)_s,m|x_s,m) Picture x representing pedestrian_s,mBelong to the category y_s,mProbability of (a), f (x)_s,j) Representing features of class j pedestrian pictures in the source domain sample subset, N_sRepresenting the number of categories of pedestrian pictures in all the stored subsets of the active domain samples.

Further, the network model training module obtains a sample invariance loss function specifically by the following means:

inputting the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the target domain sample subset, and performing L on the features of each pedestrian picture₂Carrying out standardization treatment;

obtaining a sample invariance loss function according to the characteristics of each pedestrian picture after the normalization processing through the following formula:

wherein L is_TRepresenting the sample invariance loss function, n_tRepresenting the number, x, of pictures of a person in a sample subset of the target field_t,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)_t,i) Picture x representing pedestrian_t,iIs characterized by w_i,lPicture x representing pedestrian_t,iWeight of pedestrian pictures belonging to class I, p (l | x)_t,i) Picture x representing pedestrian_t,iProbability of a pedestrian belonging to class I picture, N_tAnd the stored category number of the pedestrian pictures in all the target domain sample subsets is represented, and beta is a temperature coefficient.

Further, the network model training module performs iterative optimization on the pedestrian re-recognition network model specifically by the following means:

obtaining a total loss function according to the classification loss function, the sample invariance loss function and the layering loss function:

L＝λ₁L_src+λ₂L_T+λ₃L_SL，

wherein L represents the total loss function, L_srcRepresenting the classification loss function, λ₁Weight representing the classification loss function, L_TRepresenting the sample invariance loss function, λ₂Weight, L, representing the sample invariance loss function_SLRepresenting the hierarchical loss function, λ₃A weight representing a hierarchical loss function;

and traversing each source domain sample subset and each target domain sample subset, and iteratively updating the total loss function until the variable quantity of the total loss function value is smaller than a preset value so as to complete the optimization of the pedestrian re-identification network model.

On the other hand, the invention provides an unsupervised domain self-adaptive pedestrian re-identification method, which comprises the following steps:

obtaining a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;

respectively obtaining a classification loss function and a sample invariance loss function of a pedestrian re-identification network model based on the source domain sample subset and the target domain sample subset, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity of each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;

and identifying the image of the pedestrian to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the image of the pedestrian to be identified.

Further, the hierarchical loss function is obtained by specifically:

wherein L is_SLRepresenting the hierarchical loss function, n_tRepresenting the number of pedestrian pictures, w, in the target field sample subset_t,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, x_t,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)_t,i) Picture x representing pedestrian_t,iP (m | x)_t,i) Picture x representing pedestrian_t,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R₁[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples₁[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, N_sAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.

Further, the layer weights of the three layers of pictures are set as follows:

Compared with the prior art, the invention can realize at least one of the following beneficial effects:

1. the unsupervised domain self-adaptive pedestrian re-identification system and the unsupervised domain self-adaptive pedestrian re-identification method effectively combine the supervised pedestrian re-identification with the unsupervised pedestrian re-identification through the combination of the source domain sample with the label and the target domain sample without the label, and firstly propose to layer the target domain sample through the similarity between the source domain sample and the target domain sample, so that the discrimination of the pedestrian re-identification network on the target domain sample is improved, the migration loss of the network is reduced, and the accuracy of the pedestrian re-identification result of the pedestrian re-identification network is further improved.

2. The unsupervised domain self-adaptive pedestrian re-recognition system and the unsupervised domain self-adaptive pedestrian re-recognition method, provided by the invention, utilize the similarity of the characteristics of the source domain sample and the target domain sample to carry out layering and weighting on the negative sample to obtain a layered loss function, the loss function enables the network to be self-adaptive in the training of a pedestrian re-recognition network model so as to reduce the learning of the characteristics of the negative sample, thereby reducing the migration loss of the network, and on the basis, the classification loss function and the sample invariance loss function are combined to carry out iterative optimization on the pedestrian re-recognition network, so that the accuracy of the pedestrian re-recognition result of the pedestrian re-recognition network.

In the invention, the technical schemes can be combined with each other to realize more preferable combination schemes. Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and drawings.

Drawings

The drawings are only for purposes of illustrating particular embodiments and are not to be construed as limiting the invention, wherein like reference numerals are used to designate like parts throughout.

FIG. 1 is a schematic diagram of an unsupervised domain adaptive pedestrian re-identification system according to an embodiment of the present invention;

FIG. 2 is a diagram illustrating a pedestrian re-identification network model according to an embodiment of the present invention;

fig. 3 is a flowchart of an unsupervised domain adaptive pedestrian re-identification method according to an embodiment of the present invention.

Reference numerals:

110-a data acquisition module; 120-a network model training module; 130-re-identification module.

Detailed Description

The accompanying drawings, which are incorporated in and constitute a part of this application, illustrate preferred embodiments of the invention and together with the description, serve to explain the principles of the invention and not to limit the scope of the invention.

System embodiment

The invention discloses an unsupervised domain self-adaptive pedestrian re-identification system. As shown in fig. 1, the system includes:

a data acquisition module 110 for acquiring a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets. Specifically, a large number of pictures of pedestrians are randomly acquired from different angles, each picture contains a pedestrian, the acquired pictures are divided into a training set and a testing set, the training set is used for training the pedestrian re-recognition network model, and the testing set is used for testing the trained pedestrian re-recognition network model so as to ensure the recognition accuracy of the pedestrian re-recognition network model. The method comprises the steps of randomly selecting partial pictures in a training set, adding a label to each picture, specifically, adding the same label to a plurality of pictures of the same pedestrian, wherein different labels represent different pedestrians, dividing the pictures with the labels added into a plurality of source domain sample subsets, taking other pictures without the labels in the training set as a target domain sample set, adding a number to each picture, and randomly dividing the pictures with the numbers added into a plurality of target domain sample subsets.

The network model training module 120 is configured to obtain a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, rank and layer pedestrian pictures in the target domain sample subset according to a similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtain a layer loss function based on layered pedestrian picture features and corresponding layer weights. And traversing each source domain sample subset and each target domain sample subset based on a classification loss function, a sample invariance loss function and a hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model.

And the re-identification module is used for identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the pedestrian image to be identified. Specifically, a to-be-identified pedestrian picture is input into a trained pedestrian re-identification network model, the model can output a picture most similar to the to-be-identified pedestrian picture, exemplarily, the model is set to output the number of the first three pictures with the highest similarity to the to-be-identified pedestrian picture, the first three pictures are pictures in a target domain sample set, and then the to-be-identified pedestrian picture and the first three pictures are manually compared to determine the category of the to-be-identified pedestrian picture.

Preferably, as shown in fig. 2, the pedestrian re-identification network model includes a residual network structure, a fully-connected layer and a Softmax normalization layer connected in sequence corresponding to the classification loss function, a target domain memory connected in sequence corresponding to the sample invariance loss function, and an L₂The system comprises a normalization layer, a source domain memory and a similarity measurement axis network structure which are connected in sequence and correspond to a hierarchical loss function, wherein the residual error network structure is connected with a full connection layer, a target domain memory, the source domain memory and the similarity measurement axis network structure respectively. Specifically, the target domain memory and the source domain memory are both in a key value storage structure, the key is used for storing the characteristics of the pedestrian picture, the value is used for storing a number or a label corresponding to the pedestrian picture, and the residual network structure is ResNet 50.

Preferably, the network model training module obtains the classification loss function, the sample invariance loss function and the hierarchical loss function in the following way:

step 1, inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model respectively to extract image features so as to obtain the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset.

And storing the characteristics of each pedestrian picture in the target domain sample subset in a target domain memory, and storing the characteristics of each pedestrian picture in the source domain sample subset in a source domain memory.

Step 2, specifically, obtaining a classification loss function in the following way:

and sequentially inputting the characteristics of each pedestrian picture in the source domain sample subset into a full connection layer and a softmax regression layer of the pedestrian re-identification network model, and performing characteristic dimension conversion and characteristic normalization to strengthen the nonlinearity of the pedestrian re-identification network model in learning.

Step 3, specifically, obtaining a sample invariance loss function through the following method:

l is carried out on the characteristics of each pedestrian picture in the current target domain sample subset stored in the target domain memory₂And (6) carrying out standardization processing.

wherein L is_TRepresenting the sample invariance loss function, n_tRepresenting the number, x, of pictures of a person in the current target field sample subset_t,iRepresenting the pedestrian picture with the input sequence i when the current target domain sample subset is input into the pedestrian re-identification network model, f (x)_t,i) Picture x representing pedestrian_t,iIs characterized by w_i,lPicture x representing pedestrian_t,iWeight of pedestrian pictures belonging to class I, p (l | x)_t,i) Picture x representing pedestrian_t,iProbability of a pedestrian belonging to class I picture, N_tAnd the stored category number of the pedestrian pictures in all the target domain sample subsets is represented, and beta is a temperature coefficient.

Step 4, obtaining a layering loss function by the following method:

multiplying the characteristics of each pedestrian picture in the current source domain sample subset and the characteristics of each pedestrian picture in the current target domain sample subset based on the similarity measurement axis network structure to obtain corresponding similarity;

and sequencing the pedestrian pictures in the target domain sample subset in a descending order based on the similarity, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures. Illustratively, the number of first layer pictures is 3, and the number of second layer pictures is 147.

Preferably, the layer weights of the three layers of pictures are set as follows:

where b denotes the number of first layer pictures, k₁Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture x_t,iProbability of class i pedestrian pictures in all target domain sample subsets, R₁[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, N_tRepresenting pedestrians in all stored subsets of target domain samplesNumber of categories of pictures.

Specifically, the steps 2 to 4 are not sequentially performed, and can be performed simultaneously.

Step 5, specifically, the pedestrian re-identification network model is subjected to iterative optimization in the following mode:

L＝λ₁L_src+λ₂L_T+λ₃L_SL，

wherein L represents the total loss function, L_srcRepresenting the classification loss function, λ₁Weight representing the classification loss function, L_TRepresenting the sample invariance loss function, λ₂Weight, L, representing the sample invariance loss function_SLRepresenting the hierarchical loss function, λ₃Weights representing a hierarchical loss function, exemplary, λ₁A value of 0.7, λ₂A value of 0.3, λ₃Is 0.2.

And traversing each source domain sample subset and each target domain sample subset, and repeating the steps 1 to 5 to update the total loss function in an iterative manner until the variable quantity of the total loss function value is smaller than a preset value so as to complete the optimization of the pedestrian re-identification network model.

Specifically, in the iterative update process, the pedestrian image features of the target domain sample subset stored in the target domain memory are updated in real time in the following manner:

x_t,irepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)_t,i) Picture x representing pedestrian_t,iWarp L₂Normalized characteristic, R_iImage x representing pedestrian in target domain stored in target domain memory_t,iIs characterized in that it is a mixture of two or more of the above-mentioned components,

is a hyper-parameter that controls the speed of feature updates.

Method embodiment

The embodiment of the method is based on the same principle as the system embodiment, so that the method is not repeated herein, and repeated parts can refer to the system embodiment on the market.

Specifically, as shown in fig. 3, the method includes:

s110, obtaining a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets.

S120, respectively obtaining a classification loss function and a sample invariance loss function of the pedestrian re-identification network model based on the source domain sample subset and the target domain sample subset, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity of each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; and traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model.

S130, identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model, and obtaining the image which is the same as or similar to the pedestrian image to be identified.

Preferably, the stratification loss function is obtained in particular by:

Compared with the prior art, the unsupervised domain self-adaptive pedestrian re-identification system and the unsupervised domain self-adaptive pedestrian re-identification method provided by the embodiment have the advantages that on one hand, the combination of the source domain sample with the label and the target domain sample without the label effectively combines the supervised pedestrian re-identification with the unsupervised pedestrian re-identification, namely, the target domain sample is layered through the similarity between the source domain sample and the target domain sample for the first time, so that the discrimination of the pedestrian re-identification network on the target domain sample is improved, the migration loss of the network is reduced, and the accuracy of the pedestrian re-identification result by the pedestrian re-identification network is improved; on the other hand, the negative samples are layered and weighted by utilizing the similarity of the characteristics of the source domain samples and the target domain samples to obtain a layered loss function, the loss function enables the network to adaptively reduce the learning of the characteristics of the negative samples in the training of the pedestrian re-recognition network model, so that the migration loss of the network is reduced, on the basis, the classification loss function and the sample invariance loss function are combined to perform iterative optimization on the pedestrian re-recognition network, and the accuracy of the pedestrian re-recognition network on the pedestrian re-recognition result can be greatly improved.

Those skilled in the art will appreciate that all or part of the flow of the method implementing the above embodiments may be implemented by a computer program, which is stored in a computer readable storage medium, to instruct related hardware. The computer readable storage medium is a magnetic disk, an optical disk, a read-only memory or a random access memory.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.

Claims

1. an unsupervised domain adaptive pedestrian re-identification system, is characterized in that, comprises:

a data acquisition module for acquiring a source domain sample set including multiple source domain sample subsets and a target domain sample set including multiple target domain sample subsets;

The network model training module is used to obtain the classification loss function and sample invariance loss function of the pedestrian re-identification network model, and according to the similarity between each pedestrian image in the source domain sample subset and each pedestrian image in the target domain sample subset Sort and stratify the pedestrian pictures in the target domain sample subset, and obtain a hierarchical loss function based on the stratified pedestrian picture features and corresponding layer weights; based on the classification loss function, the sample invariance loss function and the hierarchical loss function The loss function traverses each source domain sample subset and target domain sample subset to iteratively optimize the pedestrian re-identification network model;

The re-identification module is used for recognizing the image of the pedestrian to be recognized by using the optimized pedestrian re-recognition network model to obtain an image identical or similar to the image of the pedestrian to be recognized.

2. The unsupervised domain adaptive pedestrian re-identification method according to claim 1, wherein the pedestrian re-identification network model comprises a residual network structure, a sequentially connected fully connected layer corresponding to a classification loss function and a Softmax normalization layer, sequentially connected target domain memory corresponding to sample invariance loss function, _L2 normalization layer, and sequentially connected source domain memory and similarity measure axis network structure corresponding to hierarchical loss function, said The residual network structure is respectively connected with the fully connected layer, the target domain memory, the source domain memory and the similarity measure axis network structure.

3. The unsupervised domain adaptive pedestrian re-identification method according to claim 1 or 2, wherein the network model training module specifically obtains a layered loss function in the following manner:

The source domain sample subset and the target domain sample subset are respectively input into the residual network structure of the person re-identification network model to extract image features to obtain the features of each pedestrian image in the source domain sample subset and each row in the target domain sample subset. The features of the person pictures are stored separately, and the features of each pedestrian picture in the source domain sample subset are multiplied by the characteristics of each pedestrian picture in the target domain sample subset to obtain the corresponding similarity;

Based on the similarity, the pedestrian pictures in the target domain sample subset are sorted in descending order, and the first preset number of pictures are selected as the first layer pictures in sequence, and the second preset number of pictures are selected as the second layer pictures, The rest of the pictures are used as the third-layer pictures, and the layer weights of the three-layer pictures are set respectively;

The hierarchical loss function is obtained based on the characteristics of each pedestrian image in the target domain sample subset and the layer weight corresponding to the layer to which it belongs:

Among them, L _SL represents the layered loss function, n _t represents the number of pedestrian images in the target domain sample subset, w _{t, r} represents the layer weight, r represents the sequence number of the pedestrian images sorted by similarity in the target domain sample subset, x _t,i represents the target domain sample subset inputting the pedestrian image in the order of i when inputting the pedestrian re-identification network model, f(x _{t, i} ) represents the feature of the pedestrian image x _{t, i} , p(m|x _{t, i} ) represents the similarity probability between the pedestrian image x _t,i and the stored pedestrian images of all categories in all source domain sample subsets, R ₁ [m] represents the stored features of all categories of pedestrian images in all source domain sample subsets, R ₁ [j] represents the characteristics of the j-th pedestrian image in all the source domain sample subsets stored, N _s represents the number of categories of pedestrian images in all the source domain sample subsets stored, and β is the temperature coefficient.

4. The unsupervised domain adaptive pedestrian re-identification method according to claim 3, wherein the layer weights of the three-layer pictures set are:

Among them, b represents the number of pictures in the first layer, k ₁ represents the sum of the number of pictures in the first layer and the pictures in the second layer, p(l|i) represents the pedestrian picture x _{t, i} belongs to all the target domain sample subsets stored The probability of the l-th type of pedestrian image, R ₁ [l] represents the characteristics of the l-th type of pedestrian image in all the target domain sample subsets stored, and N _t represents the number of categories of pedestrian images in all the stored target domain sample subsets.

5. The unsupervised domain adaptive pedestrian re-identification method according to claim 3, wherein the network model training module obtains a classification loss function in the following manner:

Inputting the source domain sample subset into the residual network structure of the pedestrian re-identification network model to extract image features to obtain the features of each pedestrian picture in the source domain sample subset;

The features of each pedestrian image are input into the fully connected layer and softmax regression layer of the pedestrian re-identification network model in turn to perform feature dimension transformation and feature normalization;

The following formula is used to classify the loss function based on the features of each pedestrian image after dimension transformation and normalization;

Among them, L _src represents the classification loss function, _ns represents the number of pedestrian images in the sample subset of the source domain, x _s,m represents the mth pedestrian image in the sample subset of the source domain, f(x _s,m ) represents the pedestrian image x The features of s, _m , y _s,m represents the category label of the pedestrian image x _s,m in the sample subset of the source domain, p(y _s,m | x _s,m ) represents the pedestrian image x _s,m belongs to the category y _s, The probability of _m , f(x _s,j ) represents the feature of the j-th pedestrian image in the sample subset of the source domain, and N _s represents the number of categories of pedestrian images in all the stored sample subsets of the source domain.

6. The unsupervised domain adaptive pedestrian re-identification method according to claim 3, wherein the network model training module obtains a sample invariance loss function in the following manner:

Input the target domain sample subset into the residual network structure of the pedestrian re-identification network model to extract image features to obtain the characteristics of each pedestrian picture in the target domain sample subset, and perform L ₂ standardized processing;

According to the characteristics of each pedestrian image after normalization, the sample invariance loss function is obtained by the following formula:

Among them, L _T represents the sample invariance loss function, n _t represents the number of pedestrian images in the target domain sample subset, x _{t, i} represents the pedestrian image in the input order of i when the target domain sample subset is input to the pedestrian re-identification network model, f (x _t,i ) represents the feature of the pedestrian image x _t,i , w _i,l represents the weight of the pedestrian image x _t,i belonging to the l-th category of pedestrian images, p(l|x _t,i ) represents the pedestrian image x _{t , i} belongs to the probability of the l-th pedestrian image, N _t represents the number of categories of pedestrian images in all target domain sample subsets stored, and β is the temperature coefficient.

7. The unsupervised domain self-adaptive pedestrian re-identification method according to claim 2, wherein the network model training module specifically performs iterative optimization on the pedestrian re-identification network model in the following manner:

The total loss function is obtained from the classification loss function, sample invariance loss function and hierarchical loss function:

L=λ ₁ L _src +λ ₂ L _T +λ ₃ L _SL ,

where L represents the total loss function, L _src represents the classification loss function, λ ₁ represents the weight of the classification loss function, L _T represents the sample invariance loss function, λ ₂ represents the weight of the sample invariant loss function, and L _SL represents the hierarchical loss function, λ ₃ represents the weight of the hierarchical loss function;

Traverse each source domain sample subset and each target domain sample subset, and iteratively update the total loss function until the change in the total loss function value is less than a preset value, so as to complete the optimization of the pedestrian re-identification network model.

8. An unsupervised domain adaptive pedestrian re-identification method, characterized in that, comprising:

obtaining a source domain sample set including multiple source domain sample subsets and a target domain sample set including multiple target domain sample subsets;

The classification loss function and the sample invariance loss function of the person re-identification network model are obtained based on the source domain sample subset and the target domain sample subset, respectively, and each pedestrian image in the source domain sample subset and the target domain sample subset are obtained according to Set the similarity of each pedestrian image to sort and layer the pedestrian images in the target domain sample subset, and obtain a layered loss function based on the layered pedestrian image features and the corresponding layer weights; based on the classification loss function, The sample invariance loss function and the hierarchical loss function traverse each source domain sample subset and target domain sample subset to iteratively optimize the pedestrian re-identification network model;

The optimized pedestrian re-identification network model is used to recognize the image of the pedestrian to be recognized, and an image identical or similar to the image of the pedestrian to be recognized is obtained.

9. The unsupervised domain adaptive pedestrian re-identification method according to claim 8, wherein the layered loss function is obtained in the following manner:

10. The unsupervised domain adaptive pedestrian re-identification method according to claim 9, wherein the set layer weights of the three-layer pictures are: