[go: up one dir, main page]

CN115293481B - A method and system for checking the relationship between household transformers in substations based on ensemble learning LSCP algorithm - Google Patents

A method and system for checking the relationship between household transformers in substations based on ensemble learning LSCP algorithm Download PDF

Info

Publication number
CN115293481B
CN115293481B CN202210600394.9A CN202210600394A CN115293481B CN 115293481 B CN115293481 B CN 115293481B CN 202210600394 A CN202210600394 A CN 202210600394A CN 115293481 B CN115293481 B CN 115293481B
Authority
CN
China
Prior art keywords
household
users
area
relationship
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210600394.9A
Other languages
Chinese (zh)
Other versions
CN115293481A (en
Inventor
龚逊东
凌佳凯
胡金峰
秦军
沈海平
许广琦
朱振
戴健
刘伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuxi Power Supply Co of State Grid Jiangsu Electric Power Co Ltd
Original Assignee
Wuxi Power Supply Co of State Grid Jiangsu Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuxi Power Supply Co of State Grid Jiangsu Electric Power Co Ltd filed Critical Wuxi Power Supply Co of State Grid Jiangsu Electric Power Co Ltd
Priority to CN202210600394.9A priority Critical patent/CN115293481B/en
Publication of CN115293481A publication Critical patent/CN115293481A/en
Application granted granted Critical
Publication of CN115293481B publication Critical patent/CN115293481B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Operations Research (AREA)
  • Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Primary Health Care (AREA)
  • General Health & Medical Sciences (AREA)
  • Water Supply & Treatment (AREA)
  • Data Mining & Analysis (AREA)
  • Public Health (AREA)
  • Game Theory and Decision Science (AREA)
  • Alarm Systems (AREA)
  • Remote Monitoring And Control Of Power-Distribution Networks (AREA)

Abstract

一种基于集成学习LSCP算法的台区户变关系纠查方法及系统,包括:步骤1,采集多个台区N天内的台区运行数据;步骤2,基于多个基异常检测器和台区运行数据计算台区内用户的异常分数以构建户变关系纠查模型;步骤3,户变关系纠查模型筛选台区内户变关系正常用户并根据其运行数据计算台区线损率以确定该模型的最佳输入参数;步骤4,在所述户变关系纠查模型中输入最佳输入参数,得到台区内户变关系错误用户。本发明能够有效识别出户变关系档案错误用户,查全率和查准率都有较大提升,可以节约人工排查的成本,实现了基于集成学习的综合判断方法,而且与与单个异常检测模型相比,集成方法结合了多个异常检测模型,识别结果也更稳定和可靠。

A method and system for checking the relationship between household transformers in a substation area based on an integrated learning LSCP algorithm, comprising: step 1, collecting substation operation data within N days of multiple substations; step 2, calculating the abnormal scores of users in the substation area based on multiple base anomaly detectors and substation operation data to build a household transformer relationship checking model; step 3, the household transformer relationship checking model screens users with normal household transformer relationships in the substation area and calculates the substation line loss rate according to their operation data to determine the optimal input parameters of the model; step 4, inputting the optimal input parameters into the household transformer relationship checking model to obtain users with incorrect household transformer relationships in the substation area. The present invention can effectively identify incorrect users in household transformer relationship files, greatly improves both the recall rate and the precision rate, saves the cost of manual investigation, realizes a comprehensive judgment method based on integrated learning, and compared with a single anomaly detection model, the integrated method combines multiple anomaly detection models, and the recognition result is more stable and reliable.

Description

Method and system for checking district user change relation based on integrated learning LSCP algorithm
Technical Field
The invention belongs to the field of power distribution, relates to user-to-user relationship identification, and particularly relates to a method for checking a transformer substation relationship of a platform area based on an integrated learning LSCP algorithm.
Background
The improvement and upgrading of the distribution line enable the household change relation of the distribution area to change frequently, if the household change information of the distribution area of the distribution system is updated not timely or is input in error during manual update, the household change relation of the distribution area in the system is wrong, the operation, maintenance and overhaul of a power grid are affected, the total table of the distribution area in the marketing system is inconsistent with metering data of a user meter, the line loss of the distribution area is abnormal, and the lean management of the distribution area is affected.
In order to solve the problems of abnormal line loss of the area caused by the user change relation error, the area needs to be subjected to user change relation correction, and users with wrong user change relation are identified and corrected. The traditional engineering methods comprise an instantaneous power failure method, a pulse current method and the like, manual field checking is needed, the accuracy and the efficiency are low, and the intelligent development goal of the distribution network is not met. Therefore, a new family transformation relation checking method is needed to be provided, the requirements of accuracy and intelligence are met, and the digital transformation of the power distribution network is improved.
The prior art document 1 (CN 114021430A) proposes a household change relation identification method based on electric quantity and voltage information fusion, a platform electricity consumption parameterization model is built according to the total electric quantity of a platform area and the conservation of electric quantity of a user, a solving process of a global optimal solution of the platform electricity consumption parameterization model is built by utilizing a random gradient iteration and learning rate self-adjustment method, membership relations between the user and the platform area are primarily identified, a plurality of judgment methods based on sliding time windows are adopted, platform historical data are divided into a plurality of time windows with different initial dates, a plurality of solutions are carried out, a union set is obtained from the plurality of solving results, a household change membership relation is obtained, namely a primary household change identification result, a training sample is formed by voltage data of users with normal household change relation, a method of a support vector machine is used, the voltage characteristics of the users with normal household change relation are learned, a household change relation identification model is built, all users in a current file of the platform area are subjected to household change identification, and users with wrong household change relation files are obtained, and correct identification of the household change relation is realized. The prior art document 1 has the defect that when voltage data are processed, only one identification method based on a support vector machine is used, so that the user-variable relation identification accuracy is low.
Disclosure of Invention
In order to solve the defects in the prior art, the invention aims to provide a method and a system for checking the change relation of a platform region based on an integrated learning LSCP (Local Selective Combination IN PARALLEL Outlier Ensembles) algorithm, which use a parallel integrated anomaly detection algorithm LSCP of local selection combination, and use 4 classical anomaly detection algorithms as basic anomaly detectors to accurately detect the difference of user voltage curves in the platform region so as to check the change relation of the platform region. Meanwhile, the key input parameters in the LSCP algorithm are determined by calculating the line loss rate, so that the practicability of the algorithm is improved.
The invention adopts the following technical scheme.
A method for checking the change relation of a station area user based on an integrated learning LSCP algorithm comprises the following steps:
step 1, collecting platform region operation data in a plurality of platform regions for N days;
step 2, calculating the anomaly scores of users in the platform area based on a plurality of base anomaly detectors and platform area operation data to construct a user transformation relation correction model;
Step 3, screening normal user of user change relation in the station area by the user change relation correction model, and calculating the line loss rate of the station area according to the operation data of the normal user of the user change relation to determine the optimal input parameters of the model;
and step 4, inputting the optimal input parameters into the user-change relation correction model to obtain the user with wrong user-change relation in the platform area.
The operation data of the transformer area comprises daily electricity quantity and voltage readings of all users in the transformer area and daily electricity quantity and voltage readings of a power supply transformer in the transformer area.
The user change relation correction model comprises a plurality of base anomaly detectors, wherein the base anomaly detectors are used for judging the user change relation of users in the platform region.
The construction process of the user-variable relation correction model comprises the following steps:
S1, inputting voltage readings of all users in a platform area, and calculating abnormal scores of the voltage readings of all the users by using a plurality of base anomaly detectors;
s2, the user with the highest abnormal score is judged to be the user with the wrong user-to-user relationship, and the other users are judged to be the users with normal user-to-user relationship.
S1, training the base anomaly detectors, namely respectively inputting the voltage readings of all users in the platform area into R base anomaly detectors to calculate to obtain the local anomaly scores of the R user voltage readings, establishing a local anomaly score matrix, respectively calculating the correlation between the data in the matrix and the maximum value of the local anomaly score, and screening out x base anomaly detectors with the maximum correlation.
The construction process further comprises the steps of calculating x anomaly scores of the users x i in the ith platform area by using the screened x base anomaly detectors, and taking the average value of the x anomaly scores as the anomaly score of the users.
The abnormal scores of all users in the platform region form an abnormal score set S, the first Y data with high scores in the abnormal score set S are regarded as the number of users with wrong user-changing relations in the platform region, wherein Y=n×c, n is the total number of users in the platform region, c is the abnormal data proportion, and the number Y of users with wrong user-changing relations in the platform region is controlled through the abnormal data proportion c.
The screened basic anomaly detectors comprise isolated forests, OC-SVMs, COPODs and outlier factor detection algorithms LOF;
and calculating the correlation between the data in the matrix and the maximum value of the local anomaly score by using the pearson correlation coefficient.
Step 3 further comprises:
Step 301, traversing the abnormal data proportion in intervals (0,0.1) every 1/n to obtain a plurality of abnormal data proportion values, wherein n is the total number of users in the station area;
Step 302, inputting the abnormal data proportion value into a user change relation correction model to obtain a user with normal user change relation in the platform area and a user with wrong user change relation in the platform area;
Step 303, selecting the sum of the daily electricity consumption of the normal users of the user change relation in the transformer area, calculating the line loss rate of the transformer area, and setting the threshold value of the lowest line loss rate;
Step 304, judging whether the line loss rate of the station area is lower than the lowest line loss threshold, if not, considering that the user with the user-to-user relationship error is missed and executing step 305, if so, taking the next abnormal data proportion value and circularly executing step 302;
Step 305, determining whether the line loss rate of the station area is the minimum line loss rate of the station area, if not, taking the next abnormal data proportion value and executing step 302 in a circulating manner, and if so, taking the abnormal data proportion value corresponding to the line loss rate of the station area as the optimal input parameter.
The calculation mode of the line loss rate of the station area is as follows:
The lowest line loss threshold value is within the range of [ -2%,5% ].
After obtaining the user-change relation error user in the platform area, the user-change relation error user is manually verified, and the system user-change relation file is updated.
A district user change relation correction method based on an integrated learning LSCP algorithm comprises the following steps:
The data acquisition module is used for acquiring the running data of the transformer area, namely, the daily electricity quantity and voltage readings of all users in the transformer area and the daily electricity quantity and voltage readings of a power supply transformer in the transformer area;
The model building module is used for building a user-variable relation correction model according to various anomaly detection algorithms and platform region operation data;
the data calculation module is used for calculating the local anomaly score, the line loss rate of the transformer area and the correlation between the data in the matrix and the maximum value of the local anomaly score of the user voltage reading in the transformer area;
The algorithm screening module is used for screening the detector for the basis abnormality according to the calculation result of the data calculation module;
And the output module is used for obtaining the user change relation error user in the platform area according to the output result of the model.
Compared with the prior art, the invention combines a plurality of abnormal detection models, realizes detection of different angles of data, strengthens learning of data characteristics, obtains better performance and reliability, strengthens stability and accuracy of an error correction result, and applies an abnormal detection method with an integrated framework to identification and error correction of user-variable relations. The invention can utilize a parallel integrated anomaly detection algorithm based on local selection combination to carry out user change relation check based on the actual platform region operation data of the existing electricity consumption system, marketing system and the like, calculate users with wrong user change relation in a certain platform region and clearly check the direction on site at the system level.
The beneficial effects of the invention also include:
(1) According to the invention, key input parameters of the algorithm are optimized and determined from the line loss rate, so that the influence of manually setting parameters on the user-variable relationship correction result is avoided, meanwhile, the reliability and accuracy of the result are increased, and the practicability of the user-variable relationship correction algorithm provided by the invention is further improved.
(2) Based on the method in the invention, the number of abnormal data, namely the number of users with wrong user-to-user relationship can be accurately judged by controlling the abnormal data proportion for a large number of areas, and then specific wrong users can be checked based on the user-by-user check of actual engineering maintenance personnel, thereby effectively reducing the work load of field check, improving the efficiency of field check and saving the check cost.
Drawings
Fig. 1 is a schematic flow chart of a method for checking a change relation of a station area user based on an ensemble learning LSCP algorithm in the present invention;
FIG. 2 is a general framework of the integrated anomaly detection algorithm LSCP of the present invention;
FIG. 3 is a flow chart of the determination of the optimal input parameters of the user-variable relationship correction model of the present invention.
Detailed Description
The application is further described below with reference to the accompanying drawings. The following examples are only for more clearly illustrating the technical aspects of the present application, and are not intended to limit the scope of the present application.
Fig. 1 is a schematic flow chart of a method for checking a change relation of a station area user based on an ensemble learning LSCP algorithm in the present invention. The method comprises the following steps as shown in fig. 1:
And step 1, collecting platform region operation data in N days of a plurality of platform regions, wherein the platform region operation data comprise daily electricity quantity and voltage readings of all users in the platform region and daily electricity quantity and voltage readings of a power supply transformer in the platform region.
Step 2, calculating the anomaly scores of users in the platform area based on a plurality of base anomaly detectors and platform area operation data to construct a user transformation relation correction model;
The user change relation correction model comprises a plurality of base anomaly detectors, wherein the base anomaly detectors are used for judging the user change relation of users in a platform region.
The construction process comprises the following steps:
S1, inputting voltage readings of all users in a platform area, and calculating abnormal scores of the voltage readings of all the users by using a plurality of base anomaly detectors;
s2, the user with the highest abnormal score is judged to be the user with the wrong user-to-user relationship, and the other users are judged to be the users with normal user-to-user relationship.
The method comprises the steps of training base anomaly detectors, respectively inputting voltage readings of all users in a platform area into R base anomaly detectors to calculate local anomaly scores of the R user voltage readings, establishing a local anomaly score matrix, respectively calculating correlation between data in the matrix and the maximum value of the local anomaly scores, and screening out x base anomaly detectors with maximum correlation.
And calculating x anomaly scores of the user x i in the ith platform area by using the screened x base anomaly detectors, and taking the average value of the x anomaly scores as the anomaly score of the user. The screened base anomaly detector comprises an isolated forest, an OC-SVM, a COPOD and an outlier factor detection algorithm LOF, and the pearson correlation coefficient is adopted to calculate the correlation between the data in the matrix and the maximum value of the local anomaly score.
The abnormal scores of all users in the platform region form an abnormal score set S, the first Y data with high scores in the abnormal score set S are regarded as the number of users with wrong user-changing relations in the platform region, wherein Y=n×c, n is the total number of users in the platform region, c is the abnormal data proportion, and the number Y of users with wrong user-changing relations in the platform region is controlled through the abnormal data proportion c.
Step 3, screening normal user of user-to-user relation in the transformer area by using the user-to-user relation correction model, and calculating the line loss rate of the transformer area according to the operation data of the normal user of the user-to-user relation to determine the optimal input parameters of the model, wherein the method specifically comprises the following steps:
Step 301, traversing the abnormal data proportion in intervals (0,0.1) every 1/n to obtain a plurality of abnormal data proportion values, wherein n is the total number of users in the station area;
Step 302, inputting the abnormal data proportion value into a user change relation correction model to obtain a user with normal user change relation in the platform area and a user with wrong user change relation in the platform area;
Step 303, selecting the sum of the daily electricity consumption of the normal users of the user change relation in the transformer area, calculating the line loss rate of the transformer area, and setting the threshold value of the lowest line loss rate;
Step 304, judging whether the line loss rate of the station area is lower than the lowest line loss threshold, if not, considering that the user with the user-to-user relationship error is missed and executing step 305, if so, taking the next abnormal data proportion value and circularly executing step 202;
Step 305, determining whether the line loss rate of the station area is the minimum line loss rate of the station area, if not, taking the next abnormal data proportion value and executing step 202 in a circulating manner, and if so, taking the abnormal data proportion value corresponding to the line loss rate of the station area as the optimal input parameter. The calculation mode of the line loss rate of the station area is as follows:
The lowest line loss threshold value is within the range of [ -2%,5% ].
And step 4, inputting the optimal input parameters into the user-change relation correction model to obtain the user with wrong user-change relation in the platform area.
After obtaining the user-change relation error user in the platform area, the user-change relation error user is manually verified, and the system user-change relation file is updated.
Fig. 2 is an overall framework of the integrated anomaly detection algorithm LSCP. LSCP is an anomaly detection algorithm that is integrated in parallel from multiple anomaly detection algorithms, completely unsupervised. The LSCP algorithm is characterized in that ① multiple base anomaly detection models are independently calculated and judged in parallel, ② dynamically selects a combined base model according to the performance of the base anomaly detection models, ③ randomly samples subsequences of different dimensions of data to construct a characteristic sample subspace to emphasize local anomalies of the data, ④ supports the combined heterogeneous base anomaly detection models and also supports the combined isomorphic base anomaly detection models. The heterogeneous models can be combined to enable the models to have diversity, and the isomorphic models can be combined to realize diversity by setting different super-parameters. The specific flow of the LSCP algorithm for detecting abnormal data comprises the following steps.
Inputting a data set X epsilon R n×d to be detected, wherein the data set to be detected is the voltage readings of all users in a platform area, a base anomaly detector is C= { C 1,C2,…CR }, and training R base anomaly detectors to obtain an anomaly score matrix O (X) epsilon R n×R.
O(X)=[C1(X),C2(X),…,CR(X)] (1)
For each data x i∈Xxi∈R1×d, random sampling is performed to constructThe feature sub-sequence X i' of the dimension results in a set of feature subspaces of the dataset X.
In the feature subspace, the Euclidean distance between x i and x i 'is calculated, and K nearest neighbor samples with the distance of x i' are obtained by using a K nearest neighbor method (KNN). And (3) carrying out t groups of random sampling to obtain t groups of characteristic subspaces, and taking samples with the occurrence times exceeding t/2 as local space psi i of x i in neighbor samples of the characteristic subspaces of the data x i.
The radix b of the local space of each data x i is different, i.e. the number of samples contained therein is different.
After obtaining the local space of the data x i, using a base anomaly detector to obtain the local anomaly score O (ψ i)∈Rb ×R) of x i, assuming that the local space of x i contains 10 data samples, calculating the anomaly score of the local space by using R base anomaly detectors to obtain a local anomaly score matrix O (ψ i) of 10×R dimensions.
Taking the maximum value of the R base anomaly detector scores as the pseudo target of the data sample, the pseudo target of data x i is the target (x i), which is calculated as follows:
Target(xi)=f(O(ψi))max (3)
The pearson correlation coefficient (Pearson Correlation Coefficient) between the local anomaly score matrix O (psi i) and the pseudo-target (psi i) of the data x i is calculated, x local anomaly scores most similar to the pseudo-target in the local anomaly score matrix O (psi i) are selected, and a base anomaly detector for which the x local anomaly scores are calculated is used as a base anomaly detector for a user change relation correction model.
Calculating anomaly scores of data x i using the selected x base anomaly detectors to obtain an anomaly score matrix O (x i)∈Rx×1 and taking the average of the x anomaly scores as anomaly score s of x i, wherein
s=favg(O(xi)) (4)
After the anomaly scores of all the data in the data set X to be detected are calculated by the method, and an anomaly score set S (S epsilon R n×1) is obtained, the number Y of the anomaly data in the data set is controlled by inputting the anomaly data proportion c, wherein Y is the number of users with wrong user-to-user variation relations in the platform region, and the first Y data with high scores in the anomaly score S are regarded as the anomaly data.
Y=n×c (5)
Wherein n is the number of samples in the data set, i.e. the total number of users in the area, and c is the abnormal data proportion.
In order to improve the reliability and stability of the integrated model, the LSCP algorithm is combined with the heterogeneous base anomaly detection model, four classical anomaly detection algorithms are used as base anomaly detectors, detection of different angles of data anomalies is realized for the isolated forest (Isolation Forest)、OC-SVM(One-Class Support Vector Machine)、COPOD(Copula-Based Outlier Detection)、LOF(Local Outlier Factor). respectively, and the study of the data difference characteristics is enhanced.
The isolated forest is based on the idea of division to realize the abnormality detection of data, and the data set is divided, wherein the smaller the number of times of sample division is, the more easily it is isolated, and the higher the abnormality degree of the sample is. The OC-SVM is an abnormal data detection algorithm provided for unbalanced samples, normal data and abnormal data are obviously different by mapping original data to a high-dimensional space through a kernel function, and a hyperplane is constructed to separate the normal data and the abnormal data. COPOD is to implement anomaly detection based on a statistical method, calculate tail end probability of each point of data for multidimensional data and diversity of data distribution, calculate bias of distribution to correct tail end probability of data, and implement estimation of anomaly degree of data. The LOF is based on the density idea, the anomaly detection is carried out by comparing the density of each point with the density of the neighborhood point, and the LOF algorithm calculates the density through the K neighborhood of the point, so that the anomaly erroneous judgment with different scattering conditions of the data density can not be caused.
In order to avoid the influence of random electricity utilization behavior of users on user change relation correction and the difference amplification between normal user and wrong user voltage curves, the application selects voltage data of users in a transformer area for two days as input.
The voltage data among the users in different areas have different trends and characteristics, the user ratio of the user change relation errors in the areas is small, and the area user change relation check problem can be defined as abnormal data detection of unbalanced samples. For the accuracy and reliability of the results, the user-variable relationship is checked by adopting an LSCP anomaly detection algorithm with an integrated framework.
The principle of the method for correcting the district user change relation based on the LSCP algorithm is that voltage data of district users are input, abnormal trends of the voltage data are accurately detected by the LSCP algorithm, the abnormal degree of the voltage data is judged, and abnormal scores of the voltage data of each user are obtained. And judging the individual user with the highest data anomaly score as the user with the user change relation error, and outputting the user with the highest data anomaly score as 1. The remaining users are judged as normal users with user change relation, and the output is 0. When the LSCP algorithm is used for user change relation correction, the adjustable parameters are shown in the following table 1:
Table 1LSCP algorithm parameter specification
The number of the KNN neighbor samples, the random sampling times, the random sampling minimum dimension, the random sampling maximum dimension and the characteristic space sample times threshold are adjusted, and the local space of the data can be changed.
The number of abnormal data output by the abnormal data proportion control, namely the number of users with wrong user-to-user relationship, is a key input parameter of the user-to-user relationship correction algorithm. When the change relation of the station area user is checked, the proportion of users with wrong change relation of the station area user is unknown. For a mass area, if the abnormal data proportion is set to be a fixed value, the accuracy and efficiency of a calculation result are affected. The optimization determination of the key input parameters of the algorithm is tried from the line loss rate angle, the influence of the manual setting parameters on the user-variable relation correction result is avoided, meanwhile, the reliability and the accuracy of the result are also improved, and the practicability of the user-variable relation correction algorithm is further improved.
The electric quantity relation between the station area and the user is theoretically:
Wherein P is total power consumption of the area, P (i) is power consumption of the user i, and n is the number of users of the area.
In fact, due to the online loss of the cell memory:
According to the regulations of the power supply company, the line loss rate of the transformer area fluctuates within a normal range. If there is a user with wrong user-changing relationship in the system file of the platform, the total power consumption platform of the platform in the marketing metering system is smaller than the sum of the power consumption of the users, as shown in the formula (8).
Therefore, in this embodiment, the best input parameters of the user-variable relationship correction model are determined by using the line loss rate of the transformer area, and the determining process is as shown in fig. 3:
Step 301, traversing the abnormal data proportion in intervals (0,0.1) every 1/n to obtain a plurality of abnormal data proportion values, wherein n is the total number of users in the station area;
Step 302, inputting the abnormal data proportion value into a user change relation correction model to obtain a user with normal user change relation in the platform area and a user with wrong user change relation in the platform area;
Step 303, selecting the sum of the daily electricity consumption of the normal users of the user change relation in the transformer area, calculating the line loss rate of the transformer area, and setting the threshold value of the lowest line loss rate;
Step 304, judging whether the line loss rate of the station area is lower than the lowest line loss threshold, if not, considering that the user with the user-to-user relationship error is missed and executing step 305, if so, taking the next abnormal data proportion value and circularly executing step 202;
Step 305, determining whether the line loss rate of the station area is the minimum line loss rate of the station area, if not, taking the next abnormal data proportion value and executing step 202 in a circulating manner, and if so, taking the abnormal data proportion value corresponding to the line loss rate of the station area as the optimal input parameter. The calculation mode of the line loss rate of the station area is as follows:
The lowest line loss threshold value is within the range of [ -2%,5% ].
Different abnormal data proportion values are input, the LSCP algorithm is utilized to calculate, the number of the user with wrong user-to-user relationship is obtained, and the user with normal user-to-user relationship of the station area is determined. In the embodiment, the abnormal data proportion is traversed to take values every 0.01 in the interval (0,0.1), because the number of users in the transformer area in the simulation scene is about 100, and when the interval is 0.01, the number of users with wrong output user-to-user relationship is increased by about 1 each time.
Considering that the line loss exists in the normal operation of the station area, setting a minimum line loss rate threshold, wherein the minimum line loss rate threshold can be set according to the historical line loss record of the system or the characteristics of the actual station area.
And if the line loss rate obtained by calculation according to different input parameters is lower than the lowest line loss threshold line loss, the user is considered to have the missing of the user with the user-changing relation error. When the proportion value of the input abnormal data is increased, the number of users with wrong output user-to-user relationship is increased, and the user-to-user relationship is possibly misjudged as the wrong user, so that the sum of the power consumption of the users in the area is further reduced, and the line loss rate of the area is increased. Therefore, on the basis of the minimum line loss threshold value, an input value corresponding to the minimum line loss rate is selected as an optimal parameter.
The specific implementation steps of the platform-region user-change relation correction method based on the ensemble learning LSCP algorithm in this embodiment may be summarized as follows:
And collecting the operation data of the platform region from the mining system, wherein the operation data comprise voltage data and daily total power consumption data. Traversing the abnormal data proportion every 0.01 in the interval (0,0.1), performing user-change relation correction by using an LSCP algorithm to obtain users with normal user-change relation, calculating the line loss rate of the station area, determining the value meeting the condition to obtain the best input parameters, inputting the best parameters, performing user-change relation correction by using the LSCP algorithm to obtain users with wrong user-change relation in the station area, checking the user-change relation in situ by using staff, and updating the system user-change relation file in time.
The embodiment also provides a system for checking the district user changing relation based on the LSCP algorithm based on the district user changing relation checking method, which specifically comprises the following steps:
The data acquisition module is used for acquiring the running data of the transformer area, namely, the daily electricity quantity and voltage readings of all users in the transformer area and the daily electricity quantity and voltage readings of a power supply transformer in the transformer area;
the model building module is used for building a user-variable relation correction model according to various anomaly detection algorithms and platform region operation data;
The data calculation module is used for calculating the correlation among the local anomaly score, the line loss rate of the transformer area and the maximum value of the data and the local anomaly score in the matrix of the user voltage readings in the transformer area;
The algorithm screening module is used for screening the detector for the basis abnormality according to the calculation result of the data calculation module;
and the output module is used for obtaining the user change relation error user in the platform area according to the output result of the model.
In order to verify the universality and reliability of the method, under a simulation scene, the user change relation of the areas is checked by utilizing two-day voltage data for 6 areas, and the test results are shown in table 2.
Table 2 test results in simulation scenario
In table 2, the recall ratio of the area 5 is 80%, the precision ratio is 98.71%, the users with 1 user change relation error are missed, and the recall ratio and the precision ratio of the remaining 5 areas are 100%. However, if the voltage data of more than 2 days is used for calculation for the area 5, the recall and precision can still reach 100%. In practical application, for higher accuracy and reliability, voltage data over 2 days can be used for calculation under the condition that the voltage data of the transformer area are sufficient. Therefore, the method has high accuracy and applicability in the identification and correction of the change relation of the platform region, and can be well suitable for the practical application of the correction of the change relation of the platform region.
Compared with the prior art, the invention combines a plurality of abnormal detection models, realizes detection of different angles of data, strengthens learning of data characteristics, obtains better performance and reliability, strengthens stability and accuracy of an error correction result, and applies an abnormal detection method with an integrated framework to identification and error correction of user-variable relations. The invention can utilize a parallel integrated anomaly detection algorithm based on local selection combination to carry out user change relation check based on the actual platform region operation data of the existing electricity consumption system, marketing system and the like, calculate users with wrong user change relation in a certain platform region and clearly check the direction on site at the system level.
The beneficial effects of the invention also include:
(1) According to the invention, key input parameters of the algorithm are optimized and determined from the line loss rate, so that the influence of manually setting parameters on the user-variable relationship correction result is avoided, meanwhile, the reliability and accuracy of the result are increased, and the practicability of the user-variable relationship correction algorithm provided by the invention is further improved.
(2) Based on the method in the invention, the number of abnormal data, namely the number of users with wrong user-to-user relationship can be accurately judged by controlling the abnormal data proportion for a large number of areas, and then specific wrong users can be checked based on the user-by-user check of actual engineering maintenance personnel, thereby effectively reducing the work load of field check, improving the efficiency of field check and saving the check cost. While the applicant has described and illustrated the embodiments of the present invention in detail with reference to the drawings, it should be understood by those skilled in the art that the above embodiments are only preferred embodiments of the present invention, and the detailed description is only for the purpose of helping the reader to better understand the spirit of the present invention, and not to limit the scope of the present invention, but any improvements or modifications based on the spirit of the present invention should fall within the scope of the present invention.

Claims (11)

1.一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于,包括以下步骤:1. A method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm, characterized in that it comprises the following steps: 步骤1,采集多个台区N天内的台区运行数据;Step 1, collecting operation data of multiple substations within N days; 步骤2,基于多个基异常检测器和台区运行数据计算台区内用户的异常分数以构建户变关系纠查模型;Step 2, calculating the anomaly scores of users in the area based on multiple base anomaly detectors and area operation data to build a household change relationship correction model; 步骤3,户变关系纠查模型筛选台区内户变关系正常用户并根据户变关系正常用户的运行数据计算台区线损率以确定该模型的最佳输入参数,包括:Step 3: The household transformer relationship correction model selects users with normal household transformer relationship in the transformer area and calculates the transformer area line loss rate based on the operation data of users with normal household transformer relationship to determine the optimal input parameters of the model, including: 步骤301,将异常数据比例在区间(0,0.1]每隔1/n进行遍历取值,得到多个异常数据比例值,其中n为台区内用户总数;Step 301, traverse and take values of the abnormal data ratio every 1/n in the interval (0, 0.1] to obtain multiple abnormal data ratio values, where n is the total number of users in the station area; 步骤302,将异常数据比例值输入至户变关系纠查模型以得到台区内户变关系正常用户和台区内户变关系错误用户;Step 302, inputting the abnormal data ratio value into the household change relationship correction model to obtain the users with normal household change relationship in the station area and the users with wrong household change relationship in the station area; 步骤303,选用台区内户变关系正常用户的日用电量之和计算台区线损率,并设置最低线损率阈值;Step 303, the sum of daily power consumption of users with normal household transformer relationships in the transformer area is selected to calculate the transformer area line loss rate, and a minimum line loss rate threshold is set; 步骤304,判断台区线损率是否低于最低线损率阈值,如果否,则认为存在户变关系错误用户的漏查并执行步骤305,如果是则取值下一个异常数据比例值并循环执行步骤302;Step 304, determine whether the line loss rate of the station area is lower than the minimum line loss rate threshold. If not, it is considered that there are missed users with wrong household change relationships and step 305 is executed. If yes, the next abnormal data ratio value is obtained and step 302 is executed repeatedly; 步骤305,判断所述台区线损率是否为最小台区线损率,如果否,则取值下一个异常数据比例值并循环执行步骤302,如果是,则该台区线损率所对应的异常数据比例值为最佳输入参数,台区线损率的计算方式如下:Step 305, determine whether the area line loss rate is the minimum area line loss rate, if not, then take the next abnormal data ratio value and loop through step 302, if yes, then the abnormal data ratio value corresponding to the area line loss rate is the best input parameter, the area line loss rate is calculated as follows: 步骤4,在所述户变关系纠查模型中输入最佳输入参数,得到台区内户变关系错误用户。Step 4, inputting optimal input parameters into the household change relationship correction model to obtain the household change relationship error users in the station area. 2.根据权利要求1中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:2. According to the method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm as described in claim 1, it is characterized in that: 所述台区运行数据包括:台区内所有用户的日用电量和电压读数、台区内供电变压器日用电量和电压读数。The substation operation data includes: daily power consumption and voltage readings of all users in the substation, and daily power consumption and voltage readings of power supply transformers in the substation. 3.根据权利要求1中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:3. According to the method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm as described in claim 1, it is characterized in that: 所述户变关系纠查模型包括多个基异常检测器,其中基异常检测器用于判断台区内用户的户变关系。The household change relationship correction model includes a plurality of base anomaly detectors, wherein the base anomaly detectors are used to determine the household change relationship of users in a station area. 4.根据权利要求3中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:4. A method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm according to claim 3, characterized in that: 所述户变关系纠查模型的构建过程包括:The construction process of the household change relationship investigation model includes: S1、输入台区内所有用户的电压读数,并利用多个基异常检测器计算各用户电压读数的异常分数;S1, input the voltage readings of all users in the substation area, and use multiple base anomaly detectors to calculate the anomaly score of each user's voltage reading; S2、将异常分数最高的用户判定为户变关系错误用户,其余用户被判定为户变关系正常用户。S2. The user with the highest anomaly score is determined as a user with incorrect household change relationship, and the remaining users are determined as users with normal household change relationship. 5.根据权利要求4中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:5. A method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm according to claim 4, characterized in that: 所述S1还包括训练基异常检测器:将台区内所有用户的电压读数分别输入R个基异常检测器计算得到R个该用户电压读数的局部异常分数,并建立局部异常分数矩阵,分别计算该矩阵内数据与局部异常分数的最大值之间的相关性,筛选出相关性最大的x个基异常检测器。The S1 also includes a training base anomaly detector: the voltage readings of all users in the substation area are respectively input into R base anomaly detectors to calculate R local anomaly scores of the voltage readings of the users, and a local anomaly score matrix is established, and the correlation between the data in the matrix and the maximum value of the local anomaly score is respectively calculated, and x base anomaly detectors with the largest correlation are screened out. 6.根据权利要求5中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:6. A method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm according to claim 5, characterized in that: 所述构建过程还包括:利用筛选出的x个基异常检测器,计算得到第i个台区内用户的x个异常分数,将x个异常分数的平均值作为该用户的异常分数。The construction process also includes: using the selected x base anomaly detectors to calculate the number of users in the i-th station area The average of the x anomaly scores is taken as the anomaly score of the user. 7.根据权利要求4中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:7. A method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm according to claim 4, characterized in that: 台区内所有用户的异常分数构成异常分数集,异常分数集S中分数高的前Y个数据视为台区内户变关系错误用户的数据,其中:,式中,n为台区内用户总数,c为异常数据比例,通过异常数据比例c,控制台区内户变关系错误用户的个数Y。The anomaly scores of all users in the area constitute the anomaly score set , the first Y data with high scores in the abnormal score set S are regarded as the data of users with wrong household transformer relationships in the station area, where: , where n is the total number of users in the station area, c is the abnormal data ratio, and the number of users with incorrect household-to-household relationship errors in the station area, Y, is controlled by the abnormal data ratio c. 8.根据权利要求5中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:8. A method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm according to claim 5, characterized in that: 筛选出的相关性最大的x个基异常检测器包括孤立森林、OC-SVM、COPOD和离群因子检测算法LOF;The x-based anomaly detectors with the greatest correlation screened out include Isolation Forest, OC-SVM, COPOD, and the outlier factor detection algorithm LOF; 采用皮尔逊相关系数计算所述矩阵内数据与局部异常分数的最大值之间的相关性。The Pearson correlation coefficient is used to calculate the correlation between the data in the matrix and the maximum value of the local anomaly score. 9.根据权利要求1中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:9. The method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm according to claim 1, characterized in that: 所述最低线损率阈值的取值范围是[-2%,5%]。The value range of the minimum line loss rate threshold is [-2%, 5%]. 10.根据权利要求1中所述的一种基于集成学习LSCP算法的台区户变关系纠查方法,其特征在于:10. The method for checking the relationship between household transformers in a transformer area based on an integrated learning LSCP algorithm according to claim 1, characterized in that: 得到台区内户变关系错误用户后,还人工核实户变关系错误用户,更新系统户变关系档案。After obtaining the users with incorrect household change relationships in the area, the users with incorrect household change relationships are manually verified and the system household change relationship files are updated. 11.基于权利要求1-9中任一项所述的一种基于集成学习LSCP算法的台区户变关系纠查方法的一种基于集成学习LSCP算法的台区户变关系纠查系统,其特征在于,包括:11. A method for checking the relationship between household changes in a substation area based on an integrated learning LSCP algorithm according to any one of claims 1 to 9, a system for checking the relationship between household changes in a substation area based on an integrated learning LSCP algorithm, characterized in that it comprises: 数据采集模块,用于采集台区运行数据:台区内所有用户的日用电量和电压读数、台区内供电变压器日用电量和电压读数;Data collection module, used to collect substation operation data: daily power consumption and voltage readings of all users in the substation, daily power consumption and voltage readings of power supply transformers in the substation; 模型搭建模块:用于根据多种异常检测算法和台区运行数据构建户变关系纠查模型;Model building module: used to build a household-to-household relationship correction model based on multiple anomaly detection algorithms and substation operation data; 数据计算模块:用于计算台区内用户电压读数的局部异常分数、异常分数、台区线损率和矩阵内数据与局部异常分数的最大值之间的相关性;其中,通过输入台区内所有用户的电压读数,并利用多个基异常检测器计算各用户电压读数的异常分数,将台区内所有用户的电压读数分别输入R个基异常检测器计算得到R个该用户电压读数的局部异常分数,并建立局部异常分数矩阵,分别计算该矩阵内数据与局部异常分数的最大值之间的相关性,筛选出相关性最大的x个基异常检测器;所述矩阵内数据与局部异常分数的最大值之间的相关性采用皮尔逊相关系数计算;所述台区线损率的计算方式如下:Data calculation module: used to calculate the local anomaly score, anomaly score, line loss rate of the area and the correlation between the maximum value of the local anomaly score and the data in the matrix; wherein, by inputting the voltage readings of all users in the area and using multiple base anomaly detectors to calculate the anomaly score of each user's voltage reading, the voltage readings of all users in the area are respectively input into R base anomaly detectors to calculate R local anomaly scores of the user's voltage readings, and a local anomaly score matrix is established, and the correlation between the data in the matrix and the maximum value of the local anomaly score is respectively calculated, and x base anomaly detectors with the largest correlation are selected; the correlation between the data in the matrix and the maximum value of the local anomaly score is calculated using the Pearson correlation coefficient; the calculation method of the line loss rate of the area is as follows: ; 算法筛选模块:用于根据数据计算模块的计算结果筛选出于基异常检测器;Algorithm screening module: used to screen out base anomaly detectors according to the calculation results of the data calculation module; 输出模块:用于根据模型的输出结果得到台区内户变关系错误用户。Output module: used to obtain users with household transformer relationship errors in the substation area based on the output results of the model.
CN202210600394.9A 2022-05-30 2022-05-30 A method and system for checking the relationship between household transformers in substations based on ensemble learning LSCP algorithm Active CN115293481B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210600394.9A CN115293481B (en) 2022-05-30 2022-05-30 A method and system for checking the relationship between household transformers in substations based on ensemble learning LSCP algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210600394.9A CN115293481B (en) 2022-05-30 2022-05-30 A method and system for checking the relationship between household transformers in substations based on ensemble learning LSCP algorithm

Publications (2)

Publication Number Publication Date
CN115293481A CN115293481A (en) 2022-11-04
CN115293481B true CN115293481B (en) 2024-12-10

Family

ID=83820952

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210600394.9A Active CN115293481B (en) 2022-05-30 2022-05-30 A method and system for checking the relationship between household transformers in substations based on ensemble learning LSCP algorithm

Country Status (1)

Country Link
CN (1) CN115293481B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118211157B (en) * 2024-02-06 2025-02-07 国网山东省电力公司日照供电公司 A method, device and electronic equipment for locating line loss in a transformer area

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110516912A (en) * 2019-07-24 2019-11-29 长沙恒电聚能电子科技有限公司 A kind of power distribution station family becomes the recognition methods of relationship
CN113011997A (en) * 2021-02-20 2021-06-22 上海电机学院 Power grid user electricity utilization abnormal behavior detection method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109919425A (en) * 2019-01-23 2019-06-21 国网浙江省电力有限公司 An error correction method based on correlation analysis for user relationship in station area
EP3968040A4 (en) * 2019-06-21 2022-12-21 Siemens Aktiengesellschaft Abnormal electricity use recognition method and device, and computer readable storage medium
CN111505433B (en) * 2020-04-10 2022-06-28 国网浙江余姚市供电有限公司 A method for error correction and phase identification of household-change relationship in low-voltage station area
CN113723844B (en) * 2021-09-06 2024-04-19 东南大学 Low-voltage station theoretical line loss calculation method based on ensemble learning
CN114021430A (en) * 2021-10-20 2022-02-08 国网江苏省电力有限公司南京供电分公司 Household variable relation identification method based on electric quantity and voltage information fusion

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110516912A (en) * 2019-07-24 2019-11-29 长沙恒电聚能电子科技有限公司 A kind of power distribution station family becomes the recognition methods of relationship
CN113011997A (en) * 2021-02-20 2021-06-22 上海电机学院 Power grid user electricity utilization abnormal behavior detection method

Also Published As

Publication number Publication date
CN115293481A (en) 2022-11-04

Similar Documents

Publication Publication Date Title
CN110336534B (en) Fault diagnosis method based on photovoltaic array electrical parameter time series feature extraction
CN113011481B (en) Electric energy meter function abnormality assessment method and system based on decision tree algorithm
CN111505433B (en) A method for error correction and phase identification of household-change relationship in low-voltage station area
Ma et al. Topology identification of distribution networks using a split-EM based data-driven approach
CN112035544B (en) Power distribution network data anomaly monitoring and diagnosis method
CN111461921B (en) Load modeling typical user database updating method based on machine learning
CN110503153B (en) Photovoltaic system fault diagnosis method based on differential evolution algorithm and support vector machine
CN109034461A (en) A kind of voltage dip Stochastic prediction method based on actual electric network monitoring information
CN114528949A (en) Parameter optimization-based electric energy metering abnormal data identification and compensation method
CN113392877B (en) Daily load curve clustering method based on ant colony algorithm and C-K algorithm
CN108182257A (en) A kind of GSA bad data detection and identification methods based on the optimization of areal concentration statistical method
CN117611190A (en) Regional power system carbon emission measuring and calculating method, device and medium
CN116821832A (en) Abnormal data identification and correction method for high-voltage industrial and commercial user power load
CN115293481B (en) A method and system for checking the relationship between household transformers in substations based on ensemble learning LSCP algorithm
CN115033591B (en) Intelligent detection method, system, storage medium and computer equipment for electric charge data abnormality
CN119831618A (en) Carbon emission prediction system and method
CN111626468B (en) A Photovoltaic Interval Prediction Method Based on Skew Convex Loss Function
CN118602396A (en) Combustion optimization method and device for multi-pass circulating fluidized bed boiler
CN111861256A (en) An active distribution network reconfiguration decision-making method and system
CN113743460B (en) Method, device and computer equipment for determining fault cause of power transmission line
CN106844908B (en) A kind of power station equipment adaptive modeling method based on Sample Similarity weighting
CN116109324A (en) Combined electricity larceny detection method based on network characteristics and user behaviors
CN115101136A (en) Large-scale aluminum electrolysis cell global anode effect prediction method
Ma et al. Identification Method for Users-Transformer Relationship in Station Area Based on Local Selective Combination in Parallel Outlier Ensembles Algorithm.
CN116796899A (en) Novel power system decision optimization method and system based on project success evaluation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant