Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a power field operation safety monitoring method based on video image analysis, which comprises the following steps:
(1) absolute violation state analysis based on static image
Acquiring a static image of a power field operation monitoring video and preprocessing the static image;
acquiring an operator target area in the image through a clustering algorithm;
acquiring the positions of key points of a human body skeleton in a target region through a key point detection model, and acquiring a regional image of a human body;
judging the behavior type of the operator by combining the regional image, the target regional image of the operator and the whole static image through a convolutional neural network model to obtain a violation behavior analysis result;
(2) track violation analysis based on dynamic video
Fitting the relation among the environmental parameters of the construction field area, the work types of workers, the qualification grade of the workers and the predicted value of the danger grade of the construction area based on the training data;
acquiring a monitoring video image of a construction area to be analyzed, performing face recognition on the appearing operators to acquire identity information, and acquiring the work types and qualification grades of the operators based on the face recognition result;
predicting the danger level of the construction area relative to the workers according to the fitting relation based on the environmental parameters of the construction field area, the labor and the types of the workers and the qualification of the workers;
and sending out alarm information when the predicted danger level is greater than a second preset value.
As a further optimization of the scheme, the absolute violation phenomenon comprises live working without wearing insulating gloves, insecure ground wire hanging, on-site standard dressing and typical violation behaviors, and the track violation comprises intrusion into a warning area.
As a further optimization of the above scheme, the obtaining of the target region in the image through the clustering algorithm includes:
(31) acquiring the density of points in the static image according to a preset first calculation method;
(32) taking the point with the maximum density as a first clustering center, and performing density reduction on the point with the first clustering center as the origin and within a preset radius range according to a preset second calculation method;
(33) obtaining a point with the maximum density from all points of the non-clustering centers, judging whether the density value of the point with the maximum density is larger than a preset first threshold value, if so, taking the point with the maximum density from all points of the non-clustering centers as a next clustering center, performing density reduction on the point with the next clustering center as an original point and within a preset radius range according to a preset second calculation method, and repeating the step (33); otherwise, entering a step (34);
(34) and finishing the acquisition of all target areas, and acquiring a clustering area formed by a plurality of clustering centers as a target area.
As a further optimization of the above scheme, the determining the behavior type of the operator through the convolutional neural network model to obtain the analysis result of the violation behavior includes:
inputting the human body regional image and the target region of the operator into a first convolution neural network, extracting features through the first convolution neural network, inputting the extracted features into a feature fusion layer network for feature fusion, and inputting the extracted features into a first classification layer network based on fusion features to obtain an image classification result;
inputting the whole static image into a second convolutional neural network, extracting features through a second convolutional layer network, and inputting the extracted features into a second classification layer network to obtain an image classification result;
and the output results of the first classification layer network and the second classification layer network are input into the classification fusion layer network to obtain the probability of the behavior types of the operators.
As a further optimization of the above scheme, the first convolutional neural network and the second convolutional neural network are trained by using different model parameters as initialization model parameters during training, and the model parameters are corrected by back propagation through calculating a loss function value of an output result after the first classification layer and the second classification layer, respectively.
As a further optimization of the above scheme, the method for acquiring the human body subregion image includes:
training a key point detection model based on the human skeleton key point image data set;
inputting the static image into a key point detection model to obtain key point position detection and classification;
the method comprises the steps of obtaining a region image which contains all key points of a single person and is the smallest in area, and dividing the region image into at least one sub region according to the type of the key points of the human body.
As a further optimization of the above scheme, the method for detecting the key point by the key point detection model comprises:
carrying out feature extraction on an input image through a high-resolution network to obtain a plurality of feature maps with different resolutions;
selecting one input at least two cavity convolution layers with different expansion rates from the feature map output by the high-resolution network to obtain feature maps with different scales, wherein output channels of the cavity convolution layers are 256;
fusing the feature maps of multiple scales to obtain a fused feature map;
and calculating the probability of each point on the image as a key point based on the fusion feature map, and acquiring the point with the maximum probability as the key point.
As further optimization of the scheme, the relation between the environmental parameters of the construction site area, the labor types of workers, the qualification grade of the workers and the risk grade predicted value of the construction area is fitted based on the training data, and a neural network is adopted for fitting.
The invention also provides a power field operation safety monitoring device based on video image analysis, which comprises:
the absolute violation state analysis module is used for carrying out absolute violation state analysis based on the static image and comprises:
the static image preprocessing unit is used for acquiring and preprocessing a static image of the power field operation monitoring video;
the target area extraction unit is used for acquiring the target area of the operator in the image through a clustering algorithm;
the regional image acquisition unit is used for acquiring the positions of key points of the human skeleton in the target region through the key point detection model and acquiring a regional image of the human body;
the behavior type analysis unit is used for judging the behavior type of the operating personnel through the convolutional neural network model by combining the subarea image, the operating personnel target area image and the static image overall image to obtain a violation behavior analysis result;
the track violation analysis module is used for carrying out track violation analysis based on the dynamic video, and comprises the following steps:
the relevant parameter fitting unit is used for fitting the relation among the environmental parameters of the construction site area, the work types of workers, the qualification grade of the workers and the predicted value of the danger grade of the construction area based on the training data;
the system comprises an operator information acquisition unit, a construction area monitoring unit and a monitoring unit, wherein the operator information acquisition unit is used for acquiring a monitoring video image of a construction area to be analyzed, performing face recognition on the appearing operators to acquire identity information, and acquiring the work types and the qualification grades of the operators based on the face recognition result;
and the operator track violation analysis unit is used for predicting the danger level of the construction area relative to the workers according to the fitting relation based on the environmental parameters, the worker types and the worker qualifications of the construction field area, and sending alarm information when the predicted danger level is greater than a second preset value.
As a further optimization of the scheme, the absolute violation phenomenon comprises live working without wearing insulating gloves, insecure ground wire hanging, on-site standard dressing and typical violation behaviors, and the track violation comprises intrusion into a warning area.
The invention discloses a method and a device for monitoring the safety of electric power field operation based on video image analysis, which have the following beneficial effects:
by combining the regional images, the target regional images of the operating personnel and the whole static images, the method realizes the fusion of the interactive characteristics of the human body and the environment and improves the accuracy of the classification judgment of the behavior types of the operating personnel based on the characteristic fusion of various image regions, particularly the combination of the key region of the human body and the whole images containing the environment images and the human body images, and the regional images formed by the key point position information of the human body skeleton are fused, so that the behavior type analysis is realized through the efficient characteristic representation of the key region, and the calculated amount brought by redundant image information is reduced.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The method for monitoring the safety of the electric power field operation based on the video image analysis comprises the steps of monitoring an absolute violation state and a track violation behavior, wherein the specific absolute violation state comprises the conditions that live-wire operation does not wear insulating gloves, grounding wires are not firmly hung, the field is normally worn and the track violation behavior comprises the conditions that a warning area breaks into the live-wire operation, and the track violation behavior comprises the conditions that the live-wire operation does not wear the insulating gloves, the grounding wires are not firmly hung and the track.
For monitoring and analyzing the five types of violation behaviors, the method provided by the embodiment of the application comprises the following steps:
(1) absolute violation state analysis based on static image
The method comprises the steps of obtaining static images of power field operation monitoring videos and preprocessing the static images, wherein the optimized power field monitoring video images comprise three-dimensional image data so as to provide more characteristics for subsequent image analysis;
acquiring an operator target area in the image through a clustering algorithm;
acquiring the positions of key points of a human body skeleton in a target region through a key point detection model, and acquiring a regional image of a human body;
judging the behavior type of the operator by combining the regional image, the target regional image of the operator and the whole static image through a convolutional neural network model to obtain a violation behavior analysis result;
in the embodiment, by combining the subarea image, the target area image of the operator and the whole static image, based on the feature fusion of various image areas, especially the combination of the human body key area and the whole image containing the environment image and the human body image, the fusion of the human body and the environment interaction feature is realized, the accuracy of the behavior type classification judgment of the operator is improved, and the subarea image formed by fusing the position information of the human body skeleton key point is realized, so that the behavior type analysis is realized through the efficient feature representation of the key area, and the calculated amount brought by redundant image information is reduced.
(2) Track violation analysis based on dynamic video
Fitting the relation among the environmental parameters of the construction field area, the work types of workers, the qualification grade of the workers and the predicted value of the danger grade of the construction area based on the training data;
acquiring a monitoring video image of a construction area to be analyzed, performing face recognition on the appearing operators to acquire identity information, and acquiring the work types and qualification grades of the operators based on the face recognition result;
predicting the danger level of the construction area relative to the workers according to the fitting relation based on the environmental parameters of the construction field area, the labor and the types of the workers and the qualification of the workers;
and sending out alarm information when the predicted danger level is greater than a second preset value.
In the embodiment, considering that the factors affecting the safety of the operators on the electric power site include subjective factors such as the qualification of the operators, namely, the safety capability condition of the operators, objective factors such as the environmental parameters of the construction site area, and the risk levels of the operators with different safety capabilities in the same working area are different, a method for determining different dangerous areas aiming at different operators is adopted, wherein the qualification of the operators is used for carrying out safety comprehensive capability analysis by combining the personnel information with information fusion such as training records, safety examination information, participation engineering project information, violation records and the like. Wherein the environmental parameters include a safety state of the power device and a safety state of the work tool.
In the embodiment, a convolutional neural network is adopted to fit the relationship among the environmental parameters, the labor types and the qualification grade information of the construction site area and the predicted value of the danger grade of the construction area, the depth characteristic extraction is carried out through a plurality of layers of convolutional layers based on the input characteristic data of the environmental parameters, the labor types and the qualification grade information of the workers serving as the network of the construction site area, the probabilities of different danger grades of the workers entering the construction site are output through softmax on a classification layer, the danger grade with the maximum probability serves as a prediction result, and the workers are considered to enter the warning area when the danger grade exceeds a preset range value and belong to track violation.
In this embodiment, the obtaining of the target region in the image through the clustering algorithm includes the following steps:
(31) acquiring the density of points in a static image according to a preset first calculation method, taking the static image as an example of adopting three-dimensional image data, wherein the first calculation method comprises the following steps:
wherein r is
ax,r
ay,r
azRespectively, as a point (x) in the image
i,y
i,z
i) For the length range value of the central point in the three-dimensional direction, the length range in the three-dimensional direction forms more than one point (x)
i,y
i,z
i) A cuboid spatial range which is a central point;
(32) taking the point with the maximum density as a first clustering center, and performing density reduction on the points with the first clustering center as an origin and within a preset radius range according to a preset second calculation method, wherein the second calculation method comprises the following steps:
wherein (x)
c,y
c,z
c) As cluster center coordinates, D
cIs a point (x)
c,y
c,z
c) Density value of r
bx,r
by,r
bzThe preferred r is a length range value in the three-dimensional direction with the cluster center as the center, i.e., a range of points to be density-reduced
bx,r
by,r
bzIs respectively r
ax,r
ay,r
az1.5 times of;
(33) obtaining a point with the maximum density from all points of the non-clustering centers, judging whether the density value of the point with the maximum density is larger than a preset first threshold value, if so, taking the point with the maximum density from all points of the non-clustering centers as a next clustering center, performing density reduction on the point with the next clustering center as an original point and within a preset radius range according to a preset second calculation method, and repeating the step (33); otherwise, entering a step (34);
(34) and finishing the acquisition of all target areas, and acquiring a clustering area formed by a plurality of clustering centers as a target area.
The method comprises the steps of collecting three-dimensional image data and three-dimensional point cloud density subtraction clustering based on depth camera equipment, accurately extracting a clustering region of a single operator in an image, namely a target region, and effectively reducing redundant image data of the target region.
In the embodiment, two convolutional neural network models are adopted, and image feature extraction and behavior type judgment classification are respectively performed on the human body regional image, the operator target region and the static image overall image, and the specific method comprises the following steps:
inputting the human body regional image and the target region of the operator into a first convolution neural network, extracting features through the first convolution neural network, inputting the extracted features into a feature fusion layer network for feature fusion, and inputting the extracted features into a first classification layer network based on fusion features to obtain an image classification result;
inputting the whole static image into a second convolutional neural network, extracting features through a second convolutional layer network, and inputting the extracted features into a second classification layer network to obtain an image classification result;
and inputting the output results of the first classification layer network and the second classification layer network into a classification fusion layer network to obtain the probability of the behavior types of the operators.
In this embodiment, the first convolutional neural network and the second convolutional neural network are trained by using different model parameters as initialization model parameters, and the model parameters are corrected by back propagation through calculating a loss function value of an output result after the first classification layer and the second classification layer, preferably, the initialization parameters of the second convolutional neural network model are the model parameters of the key point detection model in this embodiment, that is, the second convolutional network model is trained by fusing key point feature information helpful for behavior classification determination, so as to improve the behavior classification accuracy of the second convolutional network model, and simultaneously, in the first convolutional network, a human body region image and an operator target region are obtained based on the positions of key points of a human body skeleton, and the operator target region image provides the contour features of the operator, the image analysis range is reduced, and the model training speed and the identification accuracy of the behavior types of the operators are improved.
The method for acquiring the human body subarea image comprises the following steps:
training a key point detection model based on the human skeleton key point image data set;
the method for inputting the static image into the key point detection model to obtain the key point position detection and classification and detecting the key points on the model based on the input image comprises the following steps:
the method comprises the steps of extracting features of an input image through a high-resolution network to obtain a plurality of feature maps with different resolutions, obtaining a plurality of feature maps with different resolutions in parallel from an original input image in the high-resolution network, fusing a plurality of resolution features, fusing different resolution feature information into each output feature image with different resolutions, and fusing the plurality of different resolution feature maps to obtain a final fused feature map through the following steps;
selecting one input at least two cavity convolution layers with different expansion rates from the feature maps output by the high-resolution network to obtain feature maps with different scales, wherein output channels of the cavity convolution layers are 256, and preferably, selecting one input at least two cavity convolution layers with different expansion rates from the feature maps output by the high-resolution network;
fusing the feature maps of multiple scales to obtain a fused feature map;
calculating the probability of each point on the image as a key point based on the fusion feature map, and acquiring the point with the maximum probability as the position of the key point;
acquiring a region image which contains all the single key points and has the minimum area, wherein all the single key points comprise: the method includes the steps that a left eye, a right eye, a left ear, a right ear, a left hand, a right hand and the like are arranged in a region image, the region image is divided into at least one subarea according to the key point category of a human body, for example, all key points belonging to the head are divided into one category as one subarea, and when a plurality of operators are arranged in one image, all key points of a single person are contained, the key points possibly contained in the region image with the minimum area are not the key points of the same person, so that the area size of the subarea of the classified key points is limited, and when the area is smaller than a preset value, the key points contained in the region are judged to be the key points of the non-same person, and it can be understood that the region image containing all key points of the single person and having the.
The embodiment also provides a power field operation safety monitoring device based on video image analysis, which is used for monitoring an absolute violation state and a track violation behavior, wherein the absolute violation state comprises that the live-wire operation does not wear insulating gloves, the grounding wire is not firmly hooked, the field is normally worn and the track violation behavior comprises that an alert area intrudes, and the power field operation safety monitoring device of the embodiment comprises:
the absolute violation state analysis module is used for carrying out absolute violation state analysis based on the static image and comprises:
the static image preprocessing unit is used for acquiring and preprocessing a static image of the power field operation monitoring video;
the target area extraction unit is used for acquiring the target area of the operator in the image through a clustering algorithm;
the regional image acquisition unit is used for acquiring the positions of key points of the human skeleton in the target region through the key point detection model and acquiring a regional image of the human body;
the behavior type analysis unit is used for judging the behavior type of the operating personnel through the convolutional neural network model by combining the subarea image, the operating personnel target area image and the static image overall image to obtain a violation behavior analysis result;
the track violation analysis module is used for carrying out track violation analysis based on the dynamic video, and comprises the following steps:
the relevant parameter fitting unit is used for fitting the relation among the environmental parameters of the construction site area, the work types of workers, the qualification grade of the workers and the predicted value of the danger grade of the construction area based on the training data;
the system comprises an operator information acquisition unit, a construction area monitoring unit and a monitoring unit, wherein the operator information acquisition unit is used for acquiring a monitoring video image of a construction area to be analyzed, performing face recognition on the appearing operators to acquire identity information, and acquiring the work types and the qualification grades of the operators based on the face recognition result;
the operator track violation analysis unit is used for predicting the danger level of the construction area relative to the workers according to the fitting relation based on the environmental parameters of the construction field area, the work types of the workers and the qualification of the workers;
and sending out alarm information when the predicted danger level is greater than a second preset value.
For specific limitations of the electric field operation safety monitoring device, reference may be made to the above limitations of the electric field operation safety monitoring method, which will not be described herein again. All or part of each unit in the electric field operation safety monitoring device can be realized by software, hardware and a combination thereof. The units can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the units.
The present invention is not limited to the above-described embodiments, and those skilled in the art will be able to make various modifications without creative efforts from the above-described conception, and fall within the scope of the present invention.