[go: up one dir, main page]

CN114297186B - Power consumption data preprocessing method and system based on deviation coefficient - Google Patents

Power consumption data preprocessing method and system based on deviation coefficient Download PDF

Info

Publication number
CN114297186B
CN114297186B CN202111651714.5A CN202111651714A CN114297186B CN 114297186 B CN114297186 B CN 114297186B CN 202111651714 A CN202111651714 A CN 202111651714A CN 114297186 B CN114297186 B CN 114297186B
Authority
CN
China
Prior art keywords
data
electricity consumption
module
consumption data
equal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111651714.5A
Other languages
Chinese (zh)
Other versions
CN114297186A (en
Inventor
杨舟
周政雷
陈珏羽
李刚
蒋雯倩
江革力
陈俊
张智勇
徐植
唐利涛
邓戈锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangxi Power Grid Co Ltd
Original Assignee
Guangxi Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangxi Power Grid Co Ltd filed Critical Guangxi Power Grid Co Ltd
Priority to CN202111651714.5A priority Critical patent/CN114297186B/en
Publication of CN114297186A publication Critical patent/CN114297186A/en
Application granted granted Critical
Publication of CN114297186B publication Critical patent/CN114297186B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本发明公开一种基于偏离系数的用电数据预处理方法及系统,依据用电量偏离系数τ对用电数据进行修正,然后在对修正后的用电数据进行数据清洗。本发明依据用电数据偏离情况对用电数据进行修正,减少异常因素对用电数据的干扰,消除假的异常数据的影响,为用电数据监测提供较为连续且可靠的参考数据。

The present invention discloses a method and system for preprocessing electricity consumption data based on a deviation coefficient, which corrects the electricity consumption data according to the electricity consumption deviation coefficient τ, and then performs data cleaning on the corrected electricity consumption data. The present invention corrects the electricity consumption data according to the deviation of the electricity consumption data, reduces the interference of abnormal factors on the electricity consumption data, eliminates the influence of false abnormal data, and provides relatively continuous and reliable reference data for electricity consumption data monitoring.

Description

一种基于偏离系数的用电数据预处理方法及系统A method and system for preprocessing electricity consumption data based on deviation coefficient

技术领域Technical Field

本发明涉及数据处理技术领域。具体地说是一种基于偏离系数的用电数据预处理方法及系统。The present invention relates to the technical field of data processing, and more particularly to a method and system for preprocessing electricity consumption data based on a deviation coefficient.

背景技术Background technique

随着智能电表的使用以及行业智能化的发展,通过物联网实现用户用电监测已经是行业趋势。而通过物联网实现对用户用电情况的远程监测,需要对用户用电异常情况进行识别和判断,这就需要对用户的历史用电数据进行分析处理,并将处理后的数据作为对电网运行状况进行预判的依据。在电网实际运行过程中,存在一些使得用电数据偏离实际值的影响因素,如线路接头较多带来的线损增加以及电压波动比较大带来的损耗等,这些影响因素会使一些好的数据表现出来为异常数据,这就导致了异常数据增多,减少了数据分析中所需要参考数据,从而会影响数据分析准确度,进而影响对电网运行状况的预判。With the use of smart meters and the development of industry intelligence, it has become an industry trend to monitor user electricity consumption through the Internet of Things. To remotely monitor user electricity consumption through the Internet of Things, it is necessary to identify and judge abnormal user electricity consumption, which requires analyzing and processing the user's historical electricity consumption data, and using the processed data as the basis for predicting the operation of the power grid. In the actual operation of the power grid, there are some factors that cause the electricity consumption data to deviate from the actual value, such as the increase in line loss caused by more line joints and the loss caused by large voltage fluctuations. These factors will make some good data appear as abnormal data, which leads to an increase in abnormal data and reduces the reference data required for data analysis, which will affect the accuracy of data analysis and further affect the prediction of the operation of the power grid.

发明内容Summary of the invention

为此,本发明所要解决的技术问题在于提供一种基于偏离系数的用电数据预处理方法及系统,依据用电数据偏离情况对用电数据进行修正,减少异常因素对用电数据的干扰,消除假的异常数据,为用电数据监测提供较为连续且可靠的参考数据。To this end, the technical problem to be solved by the present invention is to provide a method and system for preprocessing electricity consumption data based on a deviation coefficient, correct the electricity consumption data according to the deviation of the electricity consumption data, reduce the interference of abnormal factors on the electricity consumption data, eliminate false abnormal data, and provide relatively continuous and reliable reference data for electricity consumption data monitoring.

为解决上述技术问题,本发明提供如下技术方案:In order to solve the above technical problems, the present invention provides the following technical solutions:

一种基于偏离系数的用电数据预处理方法,包括如下步骤:A method for preprocessing electricity consumption data based on a deviation coefficient comprises the following steps:

a)依据用电量偏离系数τ利用下述公式对用电数据进行修正:a) According to the power consumption deviation coefficient τ, the power consumption data is corrected using the following formula:

τ=Q/ΔQτ= Qinitial /ΔQ

式中:α为修正系数;Where: α is the correction coefficient;

Q为用电量统计时间点Tn时采集到的用电量,n≥1;Q initial is the power consumption collected at the power consumption statistical time point T n , n ≥ 1;

Q修正为修正后的用电量; Qcorrected is the corrected electricity consumption;

ΔQ为用电数据统计期T内单位时间平均用电量;ΔQ is the average power consumption per unit time during the power consumption data statistical period T;

b)对修正后的数据进行清洗。b) Clean the corrected data.

上述基于偏离系数的用电数据预处理方法,在步骤a)中,修正系数α通过如下方法进行确定:In the above-mentioned method for preprocessing electricity consumption data based on deviation coefficient, in step a), the correction coefficient α is determined by the following method:

以时间长度Δt对用电数据统计期T进行分割,计算每个时间段的总体标准差,选取总体标准差最小的时间段内用电量平均值Δq作为修正参考用电量,然后通过下述公式确定修正系数α:The statistical period T of electricity consumption data is divided into time lengths Δt, the overall standard deviation of each time period is calculated, and the average power consumption Δq in the time period with the smallest overall standard deviation is selected as the corrected reference power consumption. Then, the correction coefficient α is determined by the following formula:

α=(Δq/ΔQ)(2Δt/T)α=(Δq/ΔQ) (2Δt/T) .

上述基于偏离系数的用电数据预处理方法,Δt/T大于或等于0.05。In the above-mentioned electricity consumption data preprocessing method based on deviation coefficient, Δt/T is greater than or equal to 0.05.

上述基于偏离系数的用电数据预处理方法,Δt/T小于或等于0.5。In the above-mentioned electricity consumption data preprocessing method based on deviation coefficient, Δt/T is less than or equal to 0.5.

上述基于偏离系数的用电数据预处理方法,在步骤a)中,在对用电数据进行修正前,用波动比ζ对用电数据波动性进行评估,其中,ζ=|(Qn+1-Qn)/Qn|,n大于或等于1;当ζ大于或等于0.25且自第(n+1)个采集点起连续有大于或等于3个采集点采集到的用电数据波动比ζ小于或等于0.1时,则在对用电数据进行修正时,则将自第(n+1)至第(n+m)个采集点的用电数据使用独立的修正系数进行修正,其中,m大于或等于3。In the above-mentioned electricity consumption data preprocessing method based on deviation coefficient, in step a), before correcting the electricity consumption data, the volatility of the electricity consumption data is evaluated using the fluctuation ratio ζ, wherein ζ=|(Qn +1 - Qn )/ Qn |, and n is greater than or equal to 1; when ζ is greater than or equal to 0.25 and the fluctuation ratio ζ of the electricity consumption data collected by greater than or equal to 3 consecutive collection points starting from the (n+1)th collection point is less than or equal to 0.1, then when correcting the electricity consumption data, the electricity consumption data from the (n+1)th to the (n+m)th collection points are corrected using independent correction coefficients, wherein m is greater than or equal to 3.

上述基于偏离系数的用电数据预处理方法,当ζ大于或等于0.25时,在自第(n+1)个采集点起连续有少于3个采集点采集到的用电数据波动比ζ小于或等于0.1的情况下,自第(n+1)个采集点采集到的数据仍采用原有的修正系数进行修正。In the above-mentioned electricity consumption data preprocessing method based on the deviation coefficient, when ζ is greater than or equal to 0.25, when there are less than 3 consecutive collection points since the (n+1)th collection point whose electricity consumption data fluctuation ratio ζ is less than or equal to 0.1, the data collected from the (n+1)th collection point is still corrected using the original correction coefficient.

上述基于偏离系数的用电数据预处理方法,当ζ大于或等于0.25时,在自第(n+1)至第(n+m)个采集点中有大于或等于[0.8m]个采集点采集到的用电数据与第(n+1)个采集点采集到的用电数据波动比ζ小于或等于0.1,则在对用电数据进行修正时,将自第(n+1)至第(n+m)个采集点的用电数据使用独立的修正系数进行修正,其中,m大于或等于5。In the above-mentioned electricity consumption data preprocessing method based on the deviation coefficient, when ζ is greater than or equal to 0.25, the fluctuation ratio ζ of the electricity consumption data collected by greater than or equal to [0.8m] collection points from the (n+1)th to the (n+m)th collection points and the electricity consumption data collected by the (n+1)th collection point is less than or equal to 0.1, then when correcting the electricity consumption data, the electricity consumption data from the (n+1)th to the (n+m)th collection points are corrected using an independent correction coefficient, wherein m is greater than or equal to 5.

利用上述基于偏离系数的用电数据预处理方法进行用电数据预处理的系统,包括:A system for preprocessing electricity consumption data using the above-mentioned electricity consumption data preprocessing method based on deviation coefficient comprises:

数据采集模块,用以对用电数据进行采集和分组;A data collection module, used to collect and group electricity consumption data;

数据偏离分析模块,用以对用电数据进行偏离分析;Data deviation analysis module, used to perform deviation analysis on electricity consumption data;

数据波动分析模块,用以对相邻两个采集点的用电数据进行波动分析;Data fluctuation analysis module, used to perform fluctuation analysis on the power consumption data of two adjacent collection points;

数据修正模块,用以对用电数据进行修正;A data correction module, used to correct the electricity consumption data;

数据清洗模块,用以对修正后的用电数据进行清洗;A data cleaning module, used to clean the corrected electricity consumption data;

数据采集模块分别与数据偏离分析模块和数据波动分析模块通信连接,数据偏离分析模块和数据波动分析模块分别与数据修正模块通信连接,数据修正模块与数据清洗模块通信连接。The data acquisition module is respectively connected to the data deviation analysis module and the data fluctuation analysis module in communication, the data deviation analysis module and the data fluctuation analysis module are respectively connected to the data correction module in communication, and the data correction module is connected to the data cleaning module in communication.

上述系统,还包括数据整理模块,数据偏离分析模块和数据波动分析模块分别与数据整理模块通信连接,数据整理模块与数据修正模块通信连接。The above system also includes a data sorting module, the data deviation analysis module and the data fluctuation analysis module are respectively connected to the data sorting module in communication, and the data sorting module is connected to the data correction module in communication.

上述系统,还包括数据存储模块,数据整理模块与数据存储模块通信连接,数据存储模块与数据修正模块通信连接。The above system also includes a data storage module, the data sorting module is communicatively connected to the data storage module, and the data storage module is communicatively connected to the data correction module.

本发明的技术方案取得了如下有益的技术效果:The technical solution of the present invention achieves the following beneficial technical effects:

1.本发明根据用电数据偏离情况对用电数据进行修正,降低了假的异常数据被清洗的可能性,为利用历史用电数据对电网用电数据监测提供的样本分析用数据。1. The present invention corrects the electricity consumption data according to the deviation of the electricity consumption data, reduces the possibility of false abnormal data being cleaned, and provides sample analysis data for monitoring the electricity consumption data of the power grid using historical electricity consumption data.

2.根据用电数据波动情况对用电数据采用不同修正方式,避免对用电数据波动较大时间段内的用电数据修正不足或者过度。2. Use different correction methods for electricity consumption data according to the fluctuation of electricity consumption data to avoid under- or over-correction of electricity consumption data during periods of large fluctuations.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1本发明中基于偏离系数的用电数据预处理系统的工作原理图;FIG1 is a working principle diagram of the power consumption data preprocessing system based on the deviation coefficient in the present invention;

图2修正前的用电数据折线图;Figure 2 Line chart of electricity consumption data before correction;

图3修正后的用电数据折线图。Figure 3: Corrected line chart of electricity consumption data.

具体实施方式Detailed ways

如图1所示,本发明中利用基于偏离系数的用电数据预处理方法进行用电数据预处理的系统,包括数据采集模块、数据偏离分析模块、数据波动分析模块、数据修正模块、数据清洗模块、数据整理模块和数据存储模块,数据采集模块分别与数据偏离分析模块和数据波动分析模块通信连接,数据偏离分析模块和数据波动分析模块分别与数据整理模块通信连接,数据整理模块与数据存储模块通信连接,数据存储模块与数据修正模块通信连接,数据修正模块与数据清洗模块通信连接。As shown in Figure 1, the system for preprocessing electricity data using the electricity data preprocessing method based on the deviation coefficient in the present invention includes a data acquisition module, a data deviation analysis module, a data fluctuation analysis module, a data correction module, a data cleaning module, a data sorting module and a data storage module. The data acquisition module is communicated with the data deviation analysis module and the data fluctuation analysis module respectively, the data deviation analysis module and the data fluctuation analysis module are communicated with the data sorting module respectively, the data sorting module is communicated with the data storage module, the data storage module is communicated with the data correction module, and the data correction module is communicated with the data cleaning module.

其中,数据采集模块,用以对用电数据进行采集和分组;数据偏离分析模块,用以对用电数据进行偏离分析;数据波动分析模块,用以对相邻两个采集点的用电数据进行波动分析;数据修正模块,用以对用电数据进行修正;数据清洗模块,用以对修正后的用电数据进行清洗。Among them, the data acquisition module is used to collect and group electricity consumption data; the data deviation analysis module is used to perform deviation analysis on electricity consumption data; the data fluctuation analysis module is used to perform fluctuation analysis on electricity consumption data of two adjacent collection points; the data correction module is used to correct electricity consumption data; and the data cleaning module is used to clean the corrected electricity consumption data.

在实际用电过程中,有很多因素会导致智能电表采集到的用电数据与实际用电数据出现偏离现象,在对用电数据进行修正以前,需要对用电数据进行偏离分析,这样可以避免将正确的数据调整成错误的数据或者将错误的数据调整成更离谱的数据。其中,首先要对用电数据进行偏离分析,设定偏离阈值,本实施例中设定的偏离阈值为(0.85,1.3],然后根据用电数据偏离程度对用电数据进行修正,最后在对修正后的用电数据进行清洗。即利用用电数据预处理系统对历史用电数据进行预处理,具体步骤为:In the actual electricity consumption process, there are many factors that will cause the electricity consumption data collected by the smart meter to deviate from the actual electricity consumption data. Before correcting the electricity consumption data, it is necessary to perform a deviation analysis on the electricity consumption data to avoid adjusting the correct data to wrong data or adjusting the wrong data to more outrageous data. First, the electricity consumption data must be analyzed for deviation and a deviation threshold must be set. The deviation threshold set in this embodiment is (0.85, 1.3], and then the electricity consumption data must be corrected according to the degree of deviation of the electricity consumption data. Finally, the corrected electricity consumption data must be cleaned. That is, the electricity consumption data preprocessing system is used to preprocess the historical electricity consumption data. The specific steps are as follows:

a)依据用电量偏离系数τ利用下述公式对用电数据进行修正:a) According to the power consumption deviation coefficient τ, the power consumption data is corrected using the following formula:

τ=Q/ΔQτ= Qinitial /ΔQ

式中:α为修正系数;Where: α is the correction coefficient;

Q为用电量统计时间点Tn时采集到的用电量,n≥1;Q initial is the power consumption collected at the power consumption statistical time point T n , n ≥ 1;

Q修正为修正后的用电量; Qcorrected is the corrected electricity consumption;

ΔQ为用电数据统计期T内单位时间平均用电量;ΔQ is the average power consumption per unit time during the power consumption data statistical period T;

b)对修正后的数据进行清洗。b) Clean the corrected data.

在步骤a)中,修正系数α通过如下方法进行确定:In step a), the correction coefficient α is determined by the following method:

以时间长度Δt对用电数据统计期T进行分割,计算每个时间段的总体标准差,选取总体标准差最小的时间段内用电量平均值Δq作为修正参考用电量,然后通过下述公式确定修正系数α:The statistical period T of electricity consumption data is divided into time lengths Δt, the overall standard deviation of each time period is calculated, and the average power consumption Δq in the time period with the smallest overall standard deviation is selected as the corrected reference power consumption. Then, the correction coefficient α is determined by the following formula:

α=(Δq/ΔQ)(2Δt/T)α=(Δq/ΔQ) (2Δt/T) .

其中,修正系数α是基于用电数据规律性波动提出的,即在一定的时间内用电数据是相近的,波动幅度较小,具有相对明显的规律性,譬如耗电较大的设备使用时间点和时间长短都具有一定的规律性,因此选取总体标准差最小的时间段内用电量平均值Δq作为修正参考用电量,将之与用电数据统计期T内单位时间平均用电量ΔQ以及时间长度Δt和用电数据统计期T进行关联推导,得出修正系数α的计算公式。Among them, the correction coefficient α is proposed based on the regular fluctuation of electricity consumption data, that is, the electricity consumption data are similar within a certain period of time, the fluctuation range is small, and it has relatively obvious regularity. For example, the time points and duration of use of equipment with high power consumption have certain regularity. Therefore, the average power consumption Δq in the time period with the smallest overall standard deviation is selected as the corrected reference power consumption, and it is correlated with the average power consumption per unit time ΔQ in the electricity consumption data statistical period T, as well as the time length Δt and the electricity consumption data statistical period T to derive the calculation formula of the correction coefficient α.

为了降低用电数据预处理难度,提高用电数据预处理效率,本实施例中,Δt/T大于或等于0.05且小于或等于0.5,即当Δt/T小于0.05时,就调整Δt的大小。In order to reduce the difficulty of electricity consumption data preprocessing and improve the efficiency of electricity consumption data preprocessing, in this embodiment, Δt/T is greater than or equal to 0.05 and less than or equal to 0.5, that is, when Δt/T is less than 0.05, the size of Δt is adjusted.

鉴于不同用电时段的用电数据会存在一定的差异,对于差异较大的用电数据用同一个修正系数进行修正就会存在修正过度,即出现修正后的数据出现失真现象,因而,在步骤a)中,在对用电数据进行修正前,用波动比ζ对用电数据波动性进行评估,其中,ζ=|(Qn+1-Qn)/Qn|,n大于或等于1;当ζ大于或等于0.25且自第(n+1)个采集点起连续有大于或等于3个采集点采集到的用电数据波动比ζ小于或等于0.1时,则在对用电数据进行修正时,将自第(n+1)至第(n+m)个采集点的用电数据使用独立的修正系数进行修正,其中,m大于或等于3,而当ζ大于或等于0.25时,在自第(n+1)个采集点起连续有少于3个采集点采集到的用电数据波动比ζ小于或等于0.1的情况下,自第(n+1)个采集点采集到的数据仍采用原有的修正系数进行修正。鉴于某种因素可能会导致用电数据出现较为频繁的波动,因此,当ζ大于或等于0.25时,在自第(n+1)至第(n+m)个采集点中有大于或等于[0.8m]个采集点采集到的用电数据与第(n+1)个采集点采集到的用电数据波动比ζ小于或等于0.1,则在对用电数据进行修正时,则将自第(n+1)至第(n+m)个采集点的用电数据使用独立的修正系数进行修正,其中,m大于或等于5。In view of the fact that there are certain differences in the power consumption data in different power consumption periods, using the same correction coefficient to correct the power consumption data with large differences will result in over-correction, that is, the corrected data will be distorted. Therefore, in step a), before correcting the power consumption data, the volatility of the power consumption data is evaluated using the fluctuation ratio ζ, where ζ = |(Q n+1 -Q n )/Q n |, n is greater than or equal to 1; when ζ is greater than or equal to 0.25 and there are more than or equal to 3 consecutive collection points since the (n+1)th collection point whose power consumption data fluctuation ratio is less than or equal to ζ, then when correcting the power consumption data, the power consumption data from the (n+1)th to the (n+m)th collection points are corrected using an independent correction coefficient, wherein m is greater than or equal to 3, and when ζ is greater than or equal to 0.25, when there are less than 3 consecutive collection points since the (n+1)th collection point whose power consumption data fluctuation ratio is less than or equal to ζ, the data collected from the (n+1)th collection point is still corrected using the original correction coefficient. In view of the fact that certain factors may cause more frequent fluctuations in electricity consumption data, when ζ is greater than or equal to 0.25, and the fluctuation ratio ζ of the electricity consumption data collected by greater than or equal to [0.8m] collection points from the (n+1)th to the (n+m)th collection points and the electricity consumption data collected by the (n+1)th collection point is less than or equal to 0.1, when correcting the electricity consumption data, the electricity consumption data from the (n+1)th to the (n+m)th collection points are corrected using an independent correction coefficient, where m is greater than or equal to 5.

使用本发明中的用电数据预处理方法对用户A在2020年10月3日至2020年11月11日用电数据进行处理,其处理结果如表1所示。The electricity consumption data preprocessing method in the present invention is used to process the electricity consumption data of user A from October 3, 2020 to November 11, 2020, and the processing results are shown in Table 1.

表1用户A的历史用电数据及其预处理结果Table 1 Historical electricity consumption data of user A and its preprocessing results

日期date 用电量energy used 偏离系数τDeviation coefficient τ 修正后用电数据Corrected electricity consumption data 2020年10月3日October 3, 2020 2.142.14 1.1961.196 2.142.14 2020年10月4日October 4, 2020 1.981.98 1.1061.106 1.981.98 2020年10月5日October 5, 2020 1.721.72 0.9610.961 1.721.72 2020年10月6日October 6, 2020 1.481.48 0.8230.823 1.491.49 2020年10月7日October 7, 2020 2.142.14 1.1961.196 2.142.14 2020年10月8日October 8, 2020 1.581.58 0.8830.883 1.581.58 2020年10月9日October 9, 2020 2.002.00 1.1171.117 2.002.00 2020年10月10日October 10, 2020 1.511.51 0.8440.844 1.521.52 2020年10月11日October 11, 2020 2.102.10 1.1731.173 2.102.10 2020年10月12日October 12, 2020 1.521.52 0.8490.849 1.531.53 2020年10月13日October 13, 2020 2.102.10 1.1731.173 2.102.10 2020年10月14日October 14, 2020 1.601.60 0.8940.894 1.601.60 2020年10月15日October 15, 2020 1.751.75 0.9780.978 1.751.75 2020年10月16日October 16, 2020 2.092.09 1.1681.168 2.092.09 2020年10月17日October 17, 2020 1.851.85 1.0341.034 1.851.85 2020年10月18日October 18, 2020 1.801.80 1.0061.006 1.801.80 2020年10月19日October 19, 2020 1.731.73 0.9660.966 1.731.73 2020年10月20日October 20, 2020 1.471.47 0.8210.821 1.481.48 2020年10月21日October 21, 2020 1.991.99 1.1121.112 1.991.99 2020年10月22日October 22, 2020 1.511.51 0.8440.844 1.521.52 2020年10月23日October 23, 2020 1.601.60 0.8940.894 1.601.60 2020年10月24日October 24, 2020 1.431.43 0.7990.799 1.441.44 2020年10月25日October 25, 2020 1.621.62 0.9050.905 1.621.62 2020年10月26日October 26, 2020 2.182.18 1.2181.218 2.182.18 2020年10月27日October 27, 2020 1.641.64 0.9160.916 1.641.64 2020年10月28日October 28, 2020 2.002.00 1.1171.117 2.002.00 2020年10月29日October 29, 2020 2.022.02 1.1281.128 2.022.02 2020年10月30日October 30, 2020 1.731.73 0.9660.966 1.731.73 2020年10月31日October 31, 2020 1.521.52 0.8490.849 1.531.53 2020年11月1日November 1, 2020 1.761.76 0.9830.983 1.761.76 2020年11月2日November 2, 2020 1.501.50 0.8380.838 1.511.51 2020年11月3日November 3, 2020 2.252.25 1.2571.257 2.252.25 2020年11月4日November 4, 2020 1.421.42 0.7930.793 1.431.43 2020年11月5日November 5, 2020 1.451.45 0.8100.810 1.461.46 2020年11月6日November 6, 2020 1.551.55 0.8660.866 1.551.55 2020年11月7日November 7, 2020 1.411.41 0.7880.788 1.421.42 2020年11月8日November 8, 2020 2.252.25 1.2571.257 2.252.25 2020年11月9日November 9, 2020 1.611.61 0.8990.899 1.611.61 2020年11月10日November 10, 2020 2.132.13 1.1901.190 2.132.13 2020年11月11日November 11, 2020 2.062.06 1.1511.151 2.062.06 2020年11月12日November 12, 2020 1.721.72 0.9610.961 1.721.72 2020年11月13日November 13, 2020 1.331.33 0.7430.743 1.341.34 2020年11月14日November 14, 2020 1.461.46 0.8160.816 1.471.47

其中,T=43,Δt=5,Δq为2020年10月15日至2020年10月19日的平均用电量。将修正前后的用电量数据做折线图,如图2和图3所示,修正后的用电量数据折线图与修正前的用电量数据折线图变化并不明显,但是偏离较大的用电数据修正后趋近于用电数据的基本变化数据和变化趋势数据复合而成的用电量变化趋势,其中,用电数据的基本变化数据指的是仅随使用时间长短发生变化而不随季节变换而发生变化的用电数据,如照明和其他日常用电,用电数据的变化趋势数据指的是随季节变换而发生变化的用电数据,如空调制冷或取暖用电。Among them, T = 43, Δt = 5, and Δq is the average electricity consumption from October 15, 2020 to October 19, 2020. The electricity consumption data before and after correction are plotted as line graphs, as shown in Figures 2 and 3. The line graph of the electricity consumption data after correction does not change significantly from the line graph of the electricity consumption data before correction, but the electricity consumption data with a large deviation tends to be close to the electricity consumption change trend composed of the basic change data and change trend data of the electricity consumption data after correction. Among them, the basic change data of the electricity consumption data refers to the electricity consumption data that only changes with the length of use time but not with the change of seasons, such as lighting and other daily electricity consumption, and the change trend data of the electricity consumption data refers to the electricity consumption data that changes with the change of seasons, such as air conditioning cooling or heating electricity.

显然,上述实施例仅仅是为清楚地说明所作的举例,而并非对实施方式的限定。对于所属领域的普通技术人员来说,在上述说明的基础上还可以做出其它不同形式的变化或变动。这里无需也无法对所有的实施方式予以穷举。而由此所引伸出的显而易见的变化或变动仍处于本专利申请权利要求的保护范围之中。Obviously, the above embodiments are merely examples for the purpose of clear explanation, and are not intended to limit the implementation methods. For those skilled in the art, other different forms of changes or modifications can be made based on the above description. It is not necessary and impossible to list all the implementation methods here. The obvious changes or modifications derived therefrom are still within the scope of protection of the claims of this patent application.

Claims (6)

1. The power consumption data preprocessing method based on the deviation coefficient is characterized by comprising the following steps of:
a) And correcting the electricity consumption data according to the electricity consumption deviation coefficient tau by using the following formula:
τ=Q Initially, the method comprises /ΔQ
Wherein: alpha is a correction coefficient;
Q Initially, the method comprises is the electricity consumption acquired at the electricity consumption statistics time point T n, and n is more than or equal to 1;
Q correction is the corrected electricity consumption;
Delta Q is average electricity consumption per unit time in the electricity consumption data statistics period T;
wherein, the correction coefficient alpha is determined by the following method:
Dividing the electricity consumption data statistics period T by using the time length delta T, calculating the total standard deviation of each time period, selecting the average value delta q of the electricity consumption in the time period with the minimum total standard deviation as the correction reference electricity consumption, and then determining the correction coefficient alpha by the following formula:
α=(Δq/ΔQ)(2Δt/T)
before correcting the power consumption data, the fluctuation of the power consumption data is evaluated by using a fluctuation ratio ζ, wherein ζ= | (Q n+1-Qn)/Qn |, n is larger than or equal to 1), when ζ is larger than or equal to 0.25 and the fluctuation ratio ζ of the power consumption data collected from the (n+1) th collection point is larger than or equal to 0.1 continuously, the power consumption data from the (n+1) th collection point to the (n+m) th collection point is corrected by using an independent correction coefficient when ζ is larger than or equal to 0.25, and when ζ is larger than or equal to 0.25, the data collected from the (n+1) th collection point is still corrected by using the original correction coefficient when ζ is larger than or equal to 0.25 and the data collected from the (n+1) th collection point to the (n+1) th collection point is still corrected by using the independent correction coefficient when ζ is larger than or equal to 0.25, and when ζ is larger than or equal to 0.1) th collection point, the data collected from the (n+1) th collection point to the (n+1) th collection point is continuously less than 3 collection point is still corrected by using the original correction coefficient when ζ is larger than or equal to 0.1;
b) And cleaning the corrected data.
2. The method for preprocessing electricity consumption data based on a deviation coefficient according to claim 1, wherein Δt/T is greater than or equal to 0.05.
3. The power consumption data preprocessing method based on the deviation coefficient according to claim 2, wherein Δt/T is less than or equal to 0.5.
4. A system for preprocessing electricity data using the electricity data preprocessing method based on the deviation coefficient according to any one of claims 1 to 3, characterized by comprising:
the data acquisition module is used for acquiring and grouping power consumption data;
The data deviation analysis module is used for performing deviation analysis on the electricity consumption data;
The data fluctuation analysis module is used for carrying out fluctuation analysis on the electricity utilization data of two adjacent acquisition points;
the data correction module is used for correcting the electricity consumption data;
The data cleaning module is used for cleaning the corrected electricity consumption data;
The data acquisition module is respectively in communication connection with the data deviation analysis module and the data fluctuation analysis module, the data deviation analysis module and the data fluctuation analysis module are respectively in communication connection with the data correction module, and the data correction module is in communication connection with the data cleaning module.
5. The system of claim 4, further comprising a data sort module, wherein the data skew analysis module and the data surge analysis module are each in communication with the data sort module, and wherein the data sort module is in communication with the data correction module.
6. The system of claim 5, further comprising a data storage module, the data collation module being in communication with the data storage module, the data storage module being in communication with the data correction module.
CN202111651714.5A 2021-12-30 2021-12-30 Power consumption data preprocessing method and system based on deviation coefficient Active CN114297186B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111651714.5A CN114297186B (en) 2021-12-30 2021-12-30 Power consumption data preprocessing method and system based on deviation coefficient

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111651714.5A CN114297186B (en) 2021-12-30 2021-12-30 Power consumption data preprocessing method and system based on deviation coefficient

Publications (2)

Publication Number Publication Date
CN114297186A CN114297186A (en) 2022-04-08
CN114297186B true CN114297186B (en) 2024-04-26

Family

ID=80972818

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111651714.5A Active CN114297186B (en) 2021-12-30 2021-12-30 Power consumption data preprocessing method and system based on deviation coefficient

Country Status (1)

Country Link
CN (1) CN114297186B (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106327359A (en) * 2016-09-22 2017-01-11 国网上海市电力公司 A Judgment Method for Collected Data Abnormality Based on Power Consumption Pattern Analysis
CN106503880A (en) * 2016-09-22 2017-03-15 国网上海市电力公司 A kind of INTELLIGENT IDENTIFICATION method of low-voltage collecting meter reading system zero power user
CN106505635A (en) * 2016-09-20 2017-03-15 北京恒泰实达科技股份有限公司 Abandon the minimum active power dispatch model of wind and scheduling system
CN107292482A (en) * 2017-04-28 2017-10-24 国网浙江省电力公司 Electricity charge method of adjustment based on load fluctuation
CN107292428A (en) * 2017-06-07 2017-10-24 国网浙江省电力公司物资分公司 A kind of distribution Power Material procurement demand forecasting system
CN108647275A (en) * 2018-04-28 2018-10-12 国网北京市电力公司 The recognition methods of isolated island detecting state and device, storage medium, processor
CN108695806A (en) * 2018-08-24 2018-10-23 四川大学 Embedded heating material is from ice-melt conducting wire anti-icing control method online
CN108882872A (en) * 2016-04-15 2018-11-23 欧姆龙株式会社 Biont information analytical equipment, Biont information analysis system, program and Biont information analysis method
CN108898251A (en) * 2018-06-29 2018-11-27 上海电力学院 Consider the marine wind electric field power forecasting method of meteorological similitude and power swing
CN110611333A (en) * 2019-11-11 2019-12-24 燕山大学 A Distributed Coordinated Control Method of Island Microgrid Considering Communication Problems

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108882872A (en) * 2016-04-15 2018-11-23 欧姆龙株式会社 Biont information analytical equipment, Biont information analysis system, program and Biont information analysis method
CN106505635A (en) * 2016-09-20 2017-03-15 北京恒泰实达科技股份有限公司 Abandon the minimum active power dispatch model of wind and scheduling system
CN106327359A (en) * 2016-09-22 2017-01-11 国网上海市电力公司 A Judgment Method for Collected Data Abnormality Based on Power Consumption Pattern Analysis
CN106503880A (en) * 2016-09-22 2017-03-15 国网上海市电力公司 A kind of INTELLIGENT IDENTIFICATION method of low-voltage collecting meter reading system zero power user
CN107292482A (en) * 2017-04-28 2017-10-24 国网浙江省电力公司 Electricity charge method of adjustment based on load fluctuation
CN107292428A (en) * 2017-06-07 2017-10-24 国网浙江省电力公司物资分公司 A kind of distribution Power Material procurement demand forecasting system
CN108647275A (en) * 2018-04-28 2018-10-12 国网北京市电力公司 The recognition methods of isolated island detecting state and device, storage medium, processor
CN108898251A (en) * 2018-06-29 2018-11-27 上海电力学院 Consider the marine wind electric field power forecasting method of meteorological similitude and power swing
CN108695806A (en) * 2018-08-24 2018-10-23 四川大学 Embedded heating material is from ice-melt conducting wire anti-icing control method online
CN110611333A (en) * 2019-11-11 2019-12-24 燕山大学 A Distributed Coordinated Control Method of Island Microgrid Considering Communication Problems

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于分时段多模型的短期电价预测;胡峰 等;华东电力;20080624;第36卷(第06期);20-23 *
考虑季节特性的改进小波包-RBF用电量预测方法;覃泓皓 等;电力科学与技术学报;20160628;第31卷(第02期);37-44 *

Also Published As

Publication number Publication date
CN114297186A (en) 2022-04-08

Similar Documents

Publication Publication Date Title
CN108763729B (en) Coupling state evaluation method of electromechanical system in process industry based on network structure entropy
CN109740648B (en) Method and device for identifying abnormal data of power load and computer equipment
CN102339288A (en) Method and device for detecting abnormal data of data warehouse
CN108088916B (en) Method and system for improving data quality of online monitoring of dissolved gas in oil
CN106204332B (en) A Method for Predicting Efficiency Attenuation of Photovoltaic Power Stations
CN110046152A (en) A method of processing electricity consumption data missing values
CN112232366A (en) Electrical equipment fault early warning method and system based on RFID monitoring
CN117459073B (en) Intelligent management method for heat pump system operation data
CN107733737A (en) A kind of abnormal method of monitoring traffic in network
CN109634944A (en) A kind of network loss data cleaning method based on various dimensions space-time analysis
CN117387172B (en) Terminal air conditioner energy saving method and system based on accurate recommended equipment control parameters
CN116303391A (en) Processing method for remote transmission of electric energy time sequence data by intelligent ammeter
CN114297186B (en) Power consumption data preprocessing method and system based on deviation coefficient
CN114826543B (en) AIOT-based steam ejector pump parameter transmission system and method
WO2020119012A1 (en) Dynamic system static gain estimation method based on historical data ramp response
CN118364417B (en) Phase identification method and device for single-phase meter user
CN113159138B (en) Gas boiler fault diagnosis method and device based on data fusion
CN118149149B (en) Intelligent control method for valve running state detection based on Internet of things
CN114491984A (en) Method for extracting thermal fatigue load in aeroengine load spectrum and determining distribution rule
CN113810792A (en) Edge data acquisition and analysis system based on cloud computing
CN116776128B (en) Intelligent household data acquisition system based on Internet of things
CN114756722B (en) Multi-data centralized management control system and method
CN115828439B (en) Method and device for identifying abnormal loss of wind generating set
CN115579876A (en) Method and system for identifying fan power characteristic curve
CN115081840A (en) Daily electric quantity abnormal value detection and correction system based on first-order difference and ARIMA method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant