CN112347916A

CN112347916A - Power field operation safety monitoring method and device based on video image analysis

Info

Publication number: CN112347916A
Application number: CN202011224459.1A
Authority: CN
Inventors: 徐海青; 陈是同; 陶俊; 赵云龙; 吴小华; 毛舒乐; 林胜; 张天奇; 浦正国; 杨彬彬; 李小威; 宋杰; 石锋
Original assignee: State Grid Information and Telecommunication Co Ltd; Anhui Jiyuan Software Co Ltd
Current assignee: State Grid Information and Telecommunication Co Ltd; Anhui Jiyuan Software Co Ltd
Priority date: 2020-11-05
Filing date: 2020-11-05
Publication date: 2021-02-09
Anticipated expiration: 2040-11-05
Also published as: CN112347916B

Abstract

The invention discloses a method and device for safety monitoring of electric field operations based on video image analysis. the target area of the operator; obtain the position of the key points of the human skeleton in the target area, and obtain the sub-regional image of the human body; combine the sub-regional image, the image of the operator's target area and the overall image of the static image to determine the type of operator behavior through the convolutional neural network model Obtain the analysis result of illegal behavior; the present invention realizes the fusion of human body and environment interaction features through feature fusion based on various image regions, and simultaneously integrates the position features of key points of human skeleton, so as to improve the accuracy of behavior type identification and judgment. Efficient feature representation of key regions performs behavioral category analysis and reduces the amount of computation caused by redundant image information.

Description

Power field operation safety monitoring method and device based on video image analysis

Technical Field

The invention relates to the technical field of electric power safety, in particular to a method and a device for monitoring electric power field operation safety based on video image analysis.

Background

In recent years, in order to effectively guarantee the safety of workers on the electric power operation site and the continuity and stability of power supply to users, the national power grid puts forward higher requirements on the safe production and management of electric power. However, due to improper supervision of related supervision departments, management of managers is lost and power operation constructors do not comply or understand the regulations in place, which easily causes large safety accidents.

The behavior problem of finding whether the site operator has potential risks in the operation site in time is mainly solved by adopting manual design and extracting features and then performing behavior recognition and classification, so that the complexity problem of the manually designed features exists, the robustness and the popularization are poor, for behavior recognition by adopting a deep learning method, compared with the traditional behavior analysis based on the manual feature method, a model adopting the deep learning method can automatically obtain meaningful hierarchical feature representation, however, a video segment obtained from the power site is more complex, and how to extract effective features from a video image is still the core work of numerous researchers.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a power field operation safety monitoring method based on video image analysis, which comprises the following steps:

(1) absolute violation state analysis based on static image

Acquiring a static image of a power field operation monitoring video and preprocessing the static image;

acquiring an operator target area in the image through a clustering algorithm;

acquiring the positions of key points of a human body skeleton in a target region through a key point detection model, and acquiring a regional image of a human body;

judging the behavior type of the operator by combining the regional image, the target regional image of the operator and the whole static image through a convolutional neural network model to obtain a violation behavior analysis result;

(2) track violation analysis based on dynamic video

Fitting the relation among the environmental parameters of the construction field area, the work types of workers, the qualification grade of the workers and the predicted value of the danger grade of the construction area based on the training data;

acquiring a monitoring video image of a construction area to be analyzed, performing face recognition on the appearing operators to acquire identity information, and acquiring the work types and qualification grades of the operators based on the face recognition result;

predicting the danger level of the construction area relative to the workers according to the fitting relation based on the environmental parameters of the construction field area, the labor and the types of the workers and the qualification of the workers;

and sending out alarm information when the predicted danger level is greater than a second preset value.

As a further optimization of the scheme, the absolute violation phenomenon comprises live working without wearing insulating gloves, insecure ground wire hanging, on-site standard dressing and typical violation behaviors, and the track violation comprises intrusion into a warning area.

As a further optimization of the above scheme, the obtaining of the target region in the image through the clustering algorithm includes:

(31) acquiring the density of points in the static image according to a preset first calculation method;

(32) taking the point with the maximum density as a first clustering center, and performing density reduction on the point with the first clustering center as the origin and within a preset radius range according to a preset second calculation method;

(33) obtaining a point with the maximum density from all points of the non-clustering centers, judging whether the density value of the point with the maximum density is larger than a preset first threshold value, if so, taking the point with the maximum density from all points of the non-clustering centers as a next clustering center, performing density reduction on the point with the next clustering center as an original point and within a preset radius range according to a preset second calculation method, and repeating the step (33); otherwise, entering a step (34);

(34) and finishing the acquisition of all target areas, and acquiring a clustering area formed by a plurality of clustering centers as a target area.

As a further optimization of the above scheme, the determining the behavior type of the operator through the convolutional neural network model to obtain the analysis result of the violation behavior includes:

inputting the human body regional image and the target region of the operator into a first convolution neural network, extracting features through the first convolution neural network, inputting the extracted features into a feature fusion layer network for feature fusion, and inputting the extracted features into a first classification layer network based on fusion features to obtain an image classification result;

inputting the whole static image into a second convolutional neural network, extracting features through a second convolutional layer network, and inputting the extracted features into a second classification layer network to obtain an image classification result;

and the output results of the first classification layer network and the second classification layer network are input into the classification fusion layer network to obtain the probability of the behavior types of the operators.

As a further optimization of the above scheme, the first convolutional neural network and the second convolutional neural network are trained by using different model parameters as initialization model parameters during training, and the model parameters are corrected by back propagation through calculating a loss function value of an output result after the first classification layer and the second classification layer, respectively.

As a further optimization of the above scheme, the method for acquiring the human body subregion image includes:

training a key point detection model based on the human skeleton key point image data set;

inputting the static image into a key point detection model to obtain key point position detection and classification;

the method comprises the steps of obtaining a region image which contains all key points of a single person and is the smallest in area, and dividing the region image into at least one sub region according to the type of the key points of the human body.

As a further optimization of the above scheme, the method for detecting the key point by the key point detection model comprises:

carrying out feature extraction on an input image through a high-resolution network to obtain a plurality of feature maps with different resolutions;

selecting one input at least two cavity convolution layers with different expansion rates from the feature map output by the high-resolution network to obtain feature maps with different scales, wherein output channels of the cavity convolution layers are 256;

fusing the feature maps of multiple scales to obtain a fused feature map;

and calculating the probability of each point on the image as a key point based on the fusion feature map, and acquiring the point with the maximum probability as the key point.

As further optimization of the scheme, the relation between the environmental parameters of the construction site area, the labor types of workers, the qualification grade of the workers and the risk grade predicted value of the construction area is fitted based on the training data, and a neural network is adopted for fitting.

The invention also provides a power field operation safety monitoring device based on video image analysis, which comprises:

the absolute violation state analysis module is used for carrying out absolute violation state analysis based on the static image and comprises:

the static image preprocessing unit is used for acquiring and preprocessing a static image of the power field operation monitoring video;

the target area extraction unit is used for acquiring the target area of the operator in the image through a clustering algorithm;

the regional image acquisition unit is used for acquiring the positions of key points of the human skeleton in the target region through the key point detection model and acquiring a regional image of the human body;

the behavior type analysis unit is used for judging the behavior type of the operating personnel through the convolutional neural network model by combining the subarea image, the operating personnel target area image and the static image overall image to obtain a violation behavior analysis result;

the track violation analysis module is used for carrying out track violation analysis based on the dynamic video, and comprises the following steps:

the relevant parameter fitting unit is used for fitting the relation among the environmental parameters of the construction site area, the work types of workers, the qualification grade of the workers and the predicted value of the danger grade of the construction area based on the training data;

the system comprises an operator information acquisition unit, a construction area monitoring unit and a monitoring unit, wherein the operator information acquisition unit is used for acquiring a monitoring video image of a construction area to be analyzed, performing face recognition on the appearing operators to acquire identity information, and acquiring the work types and the qualification grades of the operators based on the face recognition result;

and the operator track violation analysis unit is used for predicting the danger level of the construction area relative to the workers according to the fitting relation based on the environmental parameters, the worker types and the worker qualifications of the construction field area, and sending alarm information when the predicted danger level is greater than a second preset value.

The invention discloses a method and a device for monitoring the safety of electric power field operation based on video image analysis, which have the following beneficial effects:

by combining the regional images, the target regional images of the operating personnel and the whole static images, the method realizes the fusion of the interactive characteristics of the human body and the environment and improves the accuracy of the classification judgment of the behavior types of the operating personnel based on the characteristic fusion of various image regions, particularly the combination of the key region of the human body and the whole images containing the environment images and the human body images, and the regional images formed by the key point position information of the human body skeleton are fused, so that the behavior type analysis is realized through the efficient characteristic representation of the key region, and the calculated amount brought by redundant image information is reduced.

Drawings

Fig. 1 is an overall flow chart of a power field operation safety monitoring method based on video image analysis according to an embodiment of the present application;

FIG. 2 is a block diagram of the convolutional neural network model of FIG. 1;

FIG. 3 is a block diagram of a flow chart of a method for judging the behavior type of an operator to obtain an analysis result of violation behaviors by the convolutional neural network model in FIG. 1;

FIG. 4 is a block flow diagram of a method of obtaining the operator target area and the zone images of FIG. 1;

fig. 5 is a block diagram of a power field operation safety monitoring device based on video image analysis according to an embodiment of the present application.

Detailed Description

In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

The method for monitoring the safety of the electric power field operation based on the video image analysis comprises the steps of monitoring an absolute violation state and a track violation behavior, wherein the specific absolute violation state comprises the conditions that live-wire operation does not wear insulating gloves, grounding wires are not firmly hung, the field is normally worn and the track violation behavior comprises the conditions that a warning area breaks into the live-wire operation, and the track violation behavior comprises the conditions that the live-wire operation does not wear the insulating gloves, the grounding wires are not firmly hung and the track.

For monitoring and analyzing the five types of violation behaviors, the method provided by the embodiment of the application comprises the following steps:

(1) absolute violation state analysis based on static image

The method comprises the steps of obtaining static images of power field operation monitoring videos and preprocessing the static images, wherein the optimized power field monitoring video images comprise three-dimensional image data so as to provide more characteristics for subsequent image analysis;

acquiring an operator target area in the image through a clustering algorithm;

in the embodiment, by combining the subarea image, the target area image of the operator and the whole static image, based on the feature fusion of various image areas, especially the combination of the human body key area and the whole image containing the environment image and the human body image, the fusion of the human body and the environment interaction feature is realized, the accuracy of the behavior type classification judgment of the operator is improved, and the subarea image formed by fusing the position information of the human body skeleton key point is realized, so that the behavior type analysis is realized through the efficient feature representation of the key area, and the calculated amount brought by redundant image information is reduced.

(2) Track violation analysis based on dynamic video

In the embodiment, considering that the factors affecting the safety of the operators on the electric power site include subjective factors such as the qualification of the operators, namely, the safety capability condition of the operators, objective factors such as the environmental parameters of the construction site area, and the risk levels of the operators with different safety capabilities in the same working area are different, a method for determining different dangerous areas aiming at different operators is adopted, wherein the qualification of the operators is used for carrying out safety comprehensive capability analysis by combining the personnel information with information fusion such as training records, safety examination information, participation engineering project information, violation records and the like. Wherein the environmental parameters include a safety state of the power device and a safety state of the work tool.

In the embodiment, a convolutional neural network is adopted to fit the relationship among the environmental parameters, the labor types and the qualification grade information of the construction site area and the predicted value of the danger grade of the construction area, the depth characteristic extraction is carried out through a plurality of layers of convolutional layers based on the input characteristic data of the environmental parameters, the labor types and the qualification grade information of the workers serving as the network of the construction site area, the probabilities of different danger grades of the workers entering the construction site are output through softmax on a classification layer, the danger grade with the maximum probability serves as a prediction result, and the workers are considered to enter the warning area when the danger grade exceeds a preset range value and belong to track violation.

In this embodiment, the obtaining of the target region in the image through the clustering algorithm includes the following steps:

(31) acquiring the density of points in a static image according to a preset first calculation method, taking the static image as an example of adopting three-dimensional image data, wherein the first calculation method comprises the following steps:

wherein r is_ax,r_ay,r_azRespectively, as a point (x) in the image_i,y_i,z_i) For the length range value of the central point in the three-dimensional direction, the length range in the three-dimensional direction forms more than one point (x)_i,y_i,z_i) A cuboid spatial range which is a central point;

(32) taking the point with the maximum density as a first clustering center, and performing density reduction on the points with the first clustering center as an origin and within a preset radius range according to a preset second calculation method, wherein the second calculation method comprises the following steps:

wherein (x)_c,y_c,z_c) As cluster center coordinates, D_cIs a point (x)_c,y_c,z_c) Density value of r_bx,r_by,r_bzThe preferred r is a length range value in the three-dimensional direction with the cluster center as the center, i.e., a range of points to be density-reduced_bx,r_by,r_bzIs respectively r_ax,r_ay,r_az1.5 times of;

The method comprises the steps of collecting three-dimensional image data and three-dimensional point cloud density subtraction clustering based on depth camera equipment, accurately extracting a clustering region of a single operator in an image, namely a target region, and effectively reducing redundant image data of the target region.

In the embodiment, two convolutional neural network models are adopted, and image feature extraction and behavior type judgment classification are respectively performed on the human body regional image, the operator target region and the static image overall image, and the specific method comprises the following steps:

and inputting the output results of the first classification layer network and the second classification layer network into a classification fusion layer network to obtain the probability of the behavior types of the operators.

In this embodiment, the first convolutional neural network and the second convolutional neural network are trained by using different model parameters as initialization model parameters, and the model parameters are corrected by back propagation through calculating a loss function value of an output result after the first classification layer and the second classification layer, preferably, the initialization parameters of the second convolutional neural network model are the model parameters of the key point detection model in this embodiment, that is, the second convolutional network model is trained by fusing key point feature information helpful for behavior classification determination, so as to improve the behavior classification accuracy of the second convolutional network model, and simultaneously, in the first convolutional network, a human body region image and an operator target region are obtained based on the positions of key points of a human body skeleton, and the operator target region image provides the contour features of the operator, the image analysis range is reduced, and the model training speed and the identification accuracy of the behavior types of the operators are improved.

The method for acquiring the human body subarea image comprises the following steps:

the method for inputting the static image into the key point detection model to obtain the key point position detection and classification and detecting the key points on the model based on the input image comprises the following steps:

the method comprises the steps of extracting features of an input image through a high-resolution network to obtain a plurality of feature maps with different resolutions, obtaining a plurality of feature maps with different resolutions in parallel from an original input image in the high-resolution network, fusing a plurality of resolution features, fusing different resolution feature information into each output feature image with different resolutions, and fusing the plurality of different resolution feature maps to obtain a final fused feature map through the following steps;

selecting one input at least two cavity convolution layers with different expansion rates from the feature maps output by the high-resolution network to obtain feature maps with different scales, wherein output channels of the cavity convolution layers are 256, and preferably, selecting one input at least two cavity convolution layers with different expansion rates from the feature maps output by the high-resolution network;

fusing the feature maps of multiple scales to obtain a fused feature map;

calculating the probability of each point on the image as a key point based on the fusion feature map, and acquiring the point with the maximum probability as the position of the key point;

acquiring a region image which contains all the single key points and has the minimum area, wherein all the single key points comprise: the method includes the steps that a left eye, a right eye, a left ear, a right ear, a left hand, a right hand and the like are arranged in a region image, the region image is divided into at least one subarea according to the key point category of a human body, for example, all key points belonging to the head are divided into one category as one subarea, and when a plurality of operators are arranged in one image, all key points of a single person are contained, the key points possibly contained in the region image with the minimum area are not the key points of the same person, so that the area size of the subarea of the classified key points is limited, and when the area is smaller than a preset value, the key points contained in the region are judged to be the key points of the non-same person, and it can be understood that the region image containing all key points of the single person and having the.

The embodiment also provides a power field operation safety monitoring device based on video image analysis, which is used for monitoring an absolute violation state and a track violation behavior, wherein the absolute violation state comprises that the live-wire operation does not wear insulating gloves, the grounding wire is not firmly hooked, the field is normally worn and the track violation behavior comprises that an alert area intrudes, and the power field operation safety monitoring device of the embodiment comprises:

the operator track violation analysis unit is used for predicting the danger level of the construction area relative to the workers according to the fitting relation based on the environmental parameters of the construction field area, the work types of the workers and the qualification of the workers;

For specific limitations of the electric field operation safety monitoring device, reference may be made to the above limitations of the electric field operation safety monitoring method, which will not be described herein again. All or part of each unit in the electric field operation safety monitoring device can be realized by software, hardware and a combination thereof. The units can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the units.

The present invention is not limited to the above-described embodiments, and those skilled in the art will be able to make various modifications without creative efforts from the above-described conception, and fall within the scope of the present invention.

Claims

1. the power field operation safety monitoring method based on video image analysis, is characterized in that, comprises the steps:

(1) Absolute violation state analysis based on static images

Obtaining static images of monitoring video of power field operations and preprocessing;

Obtain the target area of the operator in the image through the clustering algorithm;

Obtain the position of the key points of the human skeleton in the target area through the key point detection model, and obtain the sub-regional images of the human body;

Combining the sub-area image, the operator's target area image and the overall image of the static image, the convolutional neural network model is used to determine the type of operator's behavior to obtain the analysis results of illegal behavior;

(2) Trajectory violation analysis based on dynamic video

Fitting the relationship between the regional environmental parameters of the construction site, the type of workers, the qualification level of the workers and the predicted value of the risk level of the construction area based on the training data;

Obtain the surveillance video images of the construction area to be analyzed, perform face recognition on the workers that appear to obtain identity information, and obtain the type of workers and the qualification level of the workers based on the results of face recognition;

Based on the regional environmental parameters of the construction site, the types of workers, and the qualifications of the workers, according to the fitting relationship, predict the danger level of the construction area relative to the workers;

When the predicted danger level is greater than the second preset value, an alarm message is issued.

2. The method for monitoring power field work safety based on video image analysis according to claim 1, wherein the absolute violations include not wearing insulating gloves during live work, unreliable grounding connection, on-site standard dress, typical Violations, the track violations include intrusion into warning areas.

3. The power field operation safety monitoring method based on video image analysis according to claim 1, wherein the acquisition of the target area in the image by a clustering algorithm comprises:

(31) obtaining the density of the point in the static image according to the preset first calculation method;

(32) taking the point with the largest density as the first cluster center, and performing density reduction on the point with the first cluster center as the origin preset radius range according to the preset second calculation method;

(33) Obtain the point with the highest density among all the points of the non-cluster center, determine whether the density value of the point with the highest density is greater than the preset first threshold, and if so, select the point with the highest density among all the points of the non-cluster center The point is used as the next cluster center, and the density reduction is performed on the point with the next cluster center as the origin preset radius range according to the preset second calculation method, and step (33) is repeated; otherwise, step (34) is entered;

(34) Acquiring all target regions is completed, and a cluster region formed by a plurality of cluster centers is obtained as a target region.

4. the power field operation safety monitoring method based on video image analysis according to claim 3, is characterized in that, described by the convolutional neural network model to judge the behavior type of the operator to obtain the analysis result of illegal behavior, comprising:

Input the human body sub-region image and the target area of the operator into the first convolutional neural network, perform feature extraction through the first convolutional layer network, and input the extracted features into the feature fusion layer network for feature fusion. A classification layer network obtains image classification results;

Inputting the static image as a whole into the second convolutional neural network, performing feature extraction through the second convolutional layer network, and inputting the extracted features into the second classification layer network to obtain image classification results;

The output results of the first classification layer network and the second classification layer network are input into the classification and fusion layer network to obtain the probability of the operator's behavior type.

5. The power field operation safety monitoring method based on video image analysis according to claim 4, wherein the first convolutional neural network and the second convolutional neural network use different model parameters as initialization during training The model parameters are trained, and after the first classification layer and the second classification layer, respectively, the model parameters are corrected by backpropagation by calculating the loss function value of the output result.

6. The power field operation safety monitoring method based on video image analysis according to claim 4, wherein the acquisition method of the sub-regional images of the human body comprises:

The keypoint detection model is trained based on the human skeleton keypoint image dataset;

Inputting the static image into a keypoint detection model to obtain keypoint position detection and classification;

Obtain an area image containing all key points of a single person with the smallest area, and divide the area image into at least one sub-area according to the human key point category.

7. The power field operation safety monitoring method based on video image analysis according to claim 6, is characterized in that, the method that described key point detection model carries out key point detection is:

Perform feature extraction on the input image through a high-resolution network to obtain feature maps with different resolutions;

From the feature map output by the high-resolution network, select an input at least two dilated convolutional layers with different expansion rates to obtain feature maps of different scales, and the output channels of the dilated convolutional layer are all 256;

The feature maps of multiple scales are fused to obtain a fusion feature map;

Based on the fusion feature map, the probability that each point on the image is a key point is calculated, and the point with the highest probability is obtained as a key point.

8 . The method for monitoring power field operation safety based on video image analysis according to claim 1 , wherein the fitting of construction site regional environmental parameters, worker types, worker qualification levels and construction area hazards based on training data. 9 . The relationship between the predicted values of the grades is fitted by a neural network.

9. The electric field operation safety monitoring device based on video image analysis is characterized in that, comprising:

The absolute violation status analysis module is used for absolute violation status analysis based on static images, including:

The static image preprocessing unit is used to obtain and preprocess the static image of the monitoring video of the power field operation;

The target area extraction unit is used to obtain the target area of the operator in the image through the clustering algorithm;

The sub-region image acquisition unit is used to obtain the position of the key points of the human skeleton in the target area through the key point detection model, and obtain the sub-region images of the human body;

The behavior type analysis unit is used to combine the sub-regional image, the image of the target area of the operator and the overall image of the static image to determine the type of behavior of the operator through the convolutional neural network model to obtain the analysis result of illegal behavior;

Track violation analysis module, used for tracking violation analysis based on dynamic video, including:

The relevant parameter fitting unit is used to fit the relationship between the regional environmental parameters of the construction site, the type of staff, the qualification level of the staff and the predicted value of the risk level of the construction area based on the training data;

The operator information acquisition unit is used to acquire the surveillance video image of the construction area to be analyzed, perform face recognition on the operator to obtain identity information, and obtain the type of worker and the qualification level of the worker based on the face recognition result;

The operator trajectory violation analysis unit is used to predict the danger level of the construction area relative to the staff based on the fitting relationship based on the environmental parameters of the construction site area, the type of staff, and the qualifications of the staff. When the predicted danger level is greater than the second preset value A warning message is issued.

10. The power field operation safety monitoring device based on video image analysis according to claim 9, wherein the absolute violations include not wearing insulating gloves during live work, unreliable ground wire hooking, on-site standard dress, typical Violations, the track violations include intrusion into warning areas.