[go: up one dir, main page]

CN111382789B - Power load identification method and system based on machine learning - Google Patents

Power load identification method and system based on machine learning Download PDF

Info

Publication number
CN111382789B
CN111382789B CN202010152529.0A CN202010152529A CN111382789B CN 111382789 B CN111382789 B CN 111382789B CN 202010152529 A CN202010152529 A CN 202010152529A CN 111382789 B CN111382789 B CN 111382789B
Authority
CN
China
Prior art keywords
data
electrical appliance
power
electrical
power load
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010152529.0A
Other languages
Chinese (zh)
Other versions
CN111382789A (en
Inventor
李波
周年荣
曹敏
张林山
王浩
罗永睦
轩辕哲
邹京希
朱全聪
利佳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electric Power Research Institute of Yunnan Power Grid Co Ltd
Original Assignee
Electric Power Research Institute of Yunnan Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electric Power Research Institute of Yunnan Power Grid Co Ltd filed Critical Electric Power Research Institute of Yunnan Power Grid Co Ltd
Priority to CN202010152529.0A priority Critical patent/CN111382789B/en
Publication of CN111382789A publication Critical patent/CN111382789A/en
Application granted granted Critical
Publication of CN111382789B publication Critical patent/CN111382789B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • G06F18/24137Distances to cluster centroïds
    • G06F18/2414Smoothing the distance, e.g. radial basis function networks [RBFN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J3/00Circuit arrangements for AC mains or AC distribution networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Power Engineering (AREA)
  • Supply And Distribution Of Alternating Current (AREA)

Abstract

The power load identification method and the system based on machine learning provided by the application are characterized in that the basis of actually measured electrical parameter data including current, voltage, power and the like is in a unified format, and on the basis of long-time extraction, collection, analysis, induction and training of power load characteristics, the type of an electrical appliance in use can be accurately identified under the condition that a plurality of pieces of power load overall basis electrical parameter data for a period of time including voltage, current, active power, reactive power and the like are known. Therefore, the machine learning model training method and system for power load identification provided by the application do not need to manually adjust parameters, and compared with the traditional method, such as time domain waveform matching, feature point matching, spectrum analysis and the like, the method and system have high matching accuracy, can automatically learn and automatically acquire the feature parameters required by power load identification, thereby improving the application range of the model and improving the accuracy of power load identification.

Description

基于机器学习的电力负荷识别方法及系统Power load identification method and system based on machine learning

技术领域Technical field

本申请涉及电力负荷检测技术领域,尤其涉及一种基于机器学习的电力负荷识别方法及系统。The present application relates to the technical field of electric load detection, and in particular to a method and system for electric load identification based on machine learning.

背景技术Background technique

电力负荷特征是电力负荷从电力系统的电源吸取的有功功率和无功功率随负荷端点的电压及系统频率变化而改变的规律;电力负荷特征是电力系统的重要组成部分;通过电力负荷特征识别用电设备对于智能电网技术的发展有重要作用。The power load characteristics are the rules in which the active power and reactive power absorbed by the power load from the power supply of the power system change with the voltage at the load end point and the system frequency. The power load characteristics are an important part of the power system. The power load characteristics are used to identify Electrical equipment plays an important role in the development of smart grid technology.

电力负荷识别最为常用的方法为侵入式和非侵入式识别方法。其中,侵入式识别方法需要建立监测系统把传感器安装至各负荷处,这种方法虽然可以直接获得负荷的测量数据,但是安装成本高、安装过程复杂且维护也相对困难;而非侵入式识别方法只需要在电力供给的总入口处安装监测设备即可以对整个系统内的各负荷分解、监测和识别。具体地,非侵入式识别方法是基于电器负荷印记特征提取和识别的;其中,电器负荷印记特征能反映用电设备在运行中的独特的信息,如电压、有功的波形、启动电流等;在设备运行中这些负荷印记特征会重复出现,基于此,我们就可以把用电设备识别出来。The most commonly used methods for power load identification are intrusive and non-intrusive identification methods. Among them, the intrusive identification method requires the establishment of a monitoring system and the installation of sensors at each load. Although this method can directly obtain the measurement data of the load, the installation cost is high, the installation process is complicated, and the maintenance is relatively difficult; the non-intrusive identification method Only by installing monitoring equipment at the main entrance of the power supply can the loads in the entire system be decomposed, monitored and identified. Specifically, the non-intrusive identification method is based on the extraction and identification of electrical load imprint features; among them, the electrical load imprint features can reflect the unique information of the electrical equipment in operation, such as voltage, active power waveform, starting current, etc.; in These load imprint characteristics will appear repeatedly during equipment operation. Based on this, we can identify the electrical equipment.

其中,负荷印记特征的设计和提取是整个方法的主要难点;特征设计一般采用较为简单的电流、电压、有功功率和无功功率的稳/暂态特征及其组合。但是,人工设计的信号特征需要人为手动调节参数,存在复杂度和维度较低的问题,同时,传统的匹配算法例如时域的波形匹配,特征点匹配以及谱分析等方法匹配准确率不高,进而用电负荷识别准确率不高,实际应用效果不甚理想;且数据建模是一个非常必要和重要的工作。针对运行状态比较稳定的用电设备,例如电视机、电水壶、电脑等,负荷识别难度相对较低,如果对于工作状态较多,例如全自动洗衣机,因为工作中用电情况变化比较多,负荷识别的难度也非常大。针对以上问题,基于稳态特征的提取技术不能有效应对一些识别难度较高的场景。Among them, the design and extraction of load imprint features is the main difficulty of the entire method; feature design generally uses relatively simple steady/transient characteristics of current, voltage, active power and reactive power and their combinations. However, artificially designed signal features require manual adjustment of parameters, which has the problem of low complexity and low dimensionality. At the same time, traditional matching algorithms such as time domain waveform matching, feature point matching, and spectral analysis have low matching accuracy. Furthermore, the accuracy of electrical load identification is not high, and the practical application effect is not ideal; and data modeling is a very necessary and important task. For electrical equipment with relatively stable operating conditions, such as televisions, electric kettles, computers, etc., load identification is relatively difficult. If there are many working conditions, such as fully automatic washing machines, the power consumption changes a lot during work, and the load It is also very difficult to identify. In response to the above problems, extraction technology based on steady-state features cannot effectively deal with some scenes with high recognition difficulty.

发明内容Contents of the invention

本申请提供了一种基于机器学习的电力负荷识别方法及系统,以解决现有方法中人工设计的信号特征需要人为手动调节参数从而导致用电负荷识别准确率低的技术问题。This application provides an electric load identification method and system based on machine learning to solve the technical problem in the existing method that artificially designed signal characteristics require manual adjustment of parameters, resulting in low accuracy of electric load identification.

为了解决上述技术问题,本申请实施例公开了如下技术方案:In order to solve the above technical problems, the embodiments of this application disclose the following technical solutions:

第一方面,本申请实施例公开了一种基于机器学习的电力负荷识别方法,所述方法包括:In a first aspect, embodiments of the present application disclose a method for identifying electric power loads based on machine learning. The method includes:

获取各个电器的历史电气参数数据集;Obtain the historical electrical parameter data set of each electrical appliance;

清洗各个电器的所述历史电气参数数据集;Cleaning the historical electrical parameter data set of each electrical appliance;

将单个电器的清洗后的历史电气参数数据集按照比例划分成原始训练集、验证集及测试集;Divide the cleaned historical electrical parameter data set of a single electrical appliance into the original training set, verification set and test set in proportion;

沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集;Intercept data fragments from the original training set, verification set and test set along the timeline to generate a training set, verification set and test set composed of data fragments;

对每个目标电器分别建立基于降噪自编码器的卷积神经网络模型;Establish a convolutional neural network model based on denoising autoencoder for each target electrical appliance;

对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型;For each target electrical appliance, train the convolutional neural network model based on the denoising autoencoder based on the training set, verification set and test set composed of data segments to obtain an optimized model for each target electrical appliance;

采集用户电力负荷的当前数据,并将所述当前数据输入每个目标电器的优化模型中,分离出电器的工作状态,输出电力负荷的类别结果。Collect the current data of the user's electric load, input the current data into the optimization model of each target electrical appliance, isolate the working status of the electrical appliance, and output the category results of the electric load.

可选的,所述获取各个电器的历史电气参数数据集,包括:Optionally, obtaining the historical electrical parameter data set of each electrical appliance includes:

整合与汇总当今公共数据集得到第一数据集;Integrate and summarize today’s public data sets to obtain the first data set;

在用户总的进线端安装电能表获取一个或多个空间内的总体和单个用电负荷的电气参数得到第二数据集;Install an electric energy meter at the user's general incoming line end to obtain the electrical parameters of the overall and individual electrical loads in one or more spaces to obtain a second data set;

根据所述第一数据集和所述第二数据集汇总得到各个电器的历史电气参数数据集。A historical electrical parameter data set of each electrical appliance is obtained based on the first data set and the second data set.

可选的,所述沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集,包括:Optionally, the original training set, verification set and test set are intercepted along the timeline to generate a training set, verification set and test set composed of data fragments, including:

使用长度为n,移动步长为1的滑动窗口沿时间轴方向对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集。Use a sliding window with a length of n and a moving step of 1 to intercept data fragments from the original training set, verification set and test set along the time axis direction to generate a training set, verification set and test set composed of data fragments.

可选的,所述清洗各个电器的所述历史电气参数数据集,包括:数据格式的统一、下采样到指定频率、电压归一化。Optionally, the cleaning of the historical electrical parameter data set of each electrical appliance includes: unification of data format, down-sampling to a specified frequency, and voltage normalization.

可选的,所述数据格式的统一,包括:Optional, the unification of the data format includes:

根据将有功功率转化为[0,1]之间的数值,其中:according to Convert active power into a value between [0, 1], where:

S[i]表示采样值即瞬时有功功率,C为电力负荷类型,sa为样本数据,sc为电力负荷c的有功功率。S[i] represents the sampling value, which is the instantaneous active power, C is the electric load type, sa is the sample data, and s c is the active power of the electric load c.

可选的,所述下采样到指定频率,包括:Optionally, the downsampling to a specified frequency includes:

若采样率低于1Hz,则按原有采样率进行记录;If the sampling rate is lower than 1Hz, it will be recorded at the original sampling rate;

若采样率高于1Hz,则将采样率降采样至1Hz;If the sampling rate is higher than 1Hz, downsample the sampling rate to 1Hz;

其中,所述将采样率降采样至1Hz包括:Among them, downsampling the sampling rate to 1Hz includes:

使用每1秒间隔采样点的数值,抛弃所有1秒内的其他采样值;Use the value of the sampling point every 1 second and discard all other sampling values within 1 second;

计算相邻1秒钟内原始采样点的平均值作为1秒钟边界数据值;Calculate the average of the original sampling points within adjacent 1 second as the 1 second boundary data value;

计算相邻1秒钟内原始采样点的中值作为1秒钟边界数据值。Calculate the median value of the original sampling points within adjacent 1 second as the 1 second boundary data value.

可选的,所述电压归一化,包括:Optionally, the voltage normalization includes:

根据将电压归一至同一波动范围内,其中:according to Normalize the voltage to the same fluctuation range, where:

Powernormalised表示归一化功率值,Power表示测量功率值,Voltagenominal表示名义电压值Voltageobserved表示测量电压值。Power normalized represents the normalized power value, Power represents the measured power value, and Voltagenominal represents the nominal voltage value. Voltageobserved represents the measured voltage value.

可选的,所述清洗各个电器的所述历史电气参数数据集,还包括检测间隙、正常运行时间及识别能耗排名在前K位的电力负荷,其中k为可调整参数。Optionally, the cleaning of the historical electrical parameter data set of each electrical appliance also includes detecting gaps, normal operating times, and identifying the top K power loads with energy consumption, where k is an adjustable parameter.

可选的,所述对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型,包括:Optionally, the convolutional neural network model based on the denoising autoencoder is trained for each target electrical appliance based on the training set, verification set and test set composed of data segments to obtain an optimized model for each target electrical appliance, include:

使用目标电器数据片段组成的训练集训练所述基于降噪自编码器的卷积神经网络模型的参数;Use a training set composed of target electrical appliance data segments to train the parameters of the convolutional neural network model based on the denoising autoencoder;

将不同训练阶段所得到的不同模型在数据片段组成的验证集上验证测试,直至效果最好作为目标电器的对应模型;Verify and test the different models obtained in different training stages on the verification set composed of data fragments until the best effect is used as the corresponding model of the target appliance;

利用数据片段组成的测试集对目标电器的对应模型进行性能测试,直至性能最优得到目标电器的优化模型。Use the test set composed of data fragments to perform performance testing on the corresponding model of the target electrical appliance until the performance is optimal to obtain the optimized model of the target electrical appliance.

第二方面,本申请基于上述的基于机器学习的电力负荷识别方法,本申请还提供了一种基于机器学习的电力负荷识别系统,所述系统包括:In the second aspect, this application is based on the above-mentioned machine learning-based power load identification method. This application also provides a machine learning-based power load identification system. The system includes:

数据集获取模块,用于获取各个电器的历史电气参数数据集;The data set acquisition module is used to obtain the historical electrical parameter data set of each electrical appliance;

数据集清洗模块,用于清洗各个电器的所述历史电气参数数据集;A data set cleaning module, used to clean the historical electrical parameter data set of each electrical appliance;

数据集划分模块,用于将单个电器的清洗后的历史电气参数数据集按照比例划分成原始训练集、验证集及测试集;The data set division module is used to divide the cleaned historical electrical parameter data set of a single electrical appliance into the original training set, verification set and test set in proportion;

数据集片段截取模块,用于沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集;A data set fragment interception module is used to intercept data fragments from the original training set, verification set and test set along the timeline, and generate a training set, verification set and test set composed of data fragments;

模型建立模块,用于对每个目标电器分别建立基于降噪自编码器的卷积神经网络模型;The model building module is used to establish a convolutional neural network model based on the denoising autoencoder for each target electrical appliance;

模型优化模块,用于对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型;A model optimization module, used to train the convolutional neural network model based on the denoising autoencoder based on the training set, verification set and test set composed of data fragments for each target appliance to obtain an optimization model for each target appliance;

电力负荷类别输出模块,用于采集用户电力负荷的当前数据,并将所述当前数据输入每个目标电器的优化模型中,分离出电器的工作状态,输出电力负荷的类别结果。The electric load category output module is used to collect the current data of the user's electric load, input the current data into the optimization model of each target electrical appliance, isolate the working status of the electrical appliance, and output the category result of the electric load.

与现有技术相比,本申请的有益效果为:Compared with the existing technology, the beneficial effects of this application are:

由上述技术方案可见,本实施例提供的基于机器学习的电力负荷识别方法及系统具体地以实测的电气参数数据包括电流、电压、有功功率和无功功率等为基础,将基础电气参数数据统一格式,在我们长时间针对电力负荷特征提取、采集、分析、归纳和训练的基础上,可以在已知一段时间的若干用电负荷总体基础电气参数数据包括电压、电流、有功功率、无功功率等的情况下,正确识别出正在使用的电器种类。因此本申请提供的用于电力负荷识别的机器学习模型训练方法及系统不需要人为进行手动调节参数,较传统方法相比如时域的波形匹配,特征点匹配以及谱分析等匹配准确率高,本申请可以自主学习并且自动获得识别电力负荷所需要的特征参数,从而提高了模型的适用范围,提高了电力负荷识别的准确率。It can be seen from the above technical solution that the machine learning-based power load identification method and system provided by this embodiment are specifically based on the measured electrical parameter data including current, voltage, active power and reactive power, etc., and unify the basic electrical parameter data Format, based on our long-term extraction, collection, analysis, induction and training of electric load characteristics, we can know the overall basic electrical parameter data of several electric loads for a period of time, including voltage, current, active power, and reactive power. etc., correctly identify the type of electrical appliance being used. Therefore, the machine learning model training method and system for power load identification provided by this application does not require manual adjustment of parameters. Compared with traditional methods such as time domain waveform matching, feature point matching and spectrum analysis, the matching accuracy is higher. The application can learn independently and automatically obtain the characteristic parameters needed to identify electric loads, thereby improving the applicable scope of the model and improving the accuracy of electric load identification.

应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本申请。It should be understood that the above general description and the following detailed description are only exemplary and explanatory, and do not limit the present application.

附图说明Description of the drawings

为了更清楚地说明本申请的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,对于本领域普通技术人员而言,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the present application more clearly, the drawings required to be used in the embodiments will be briefly introduced below. Obviously, for those of ordinary skill in the art, without exerting creative efforts, Additional drawings can be obtained from these drawings.

附图1为本申请实施例提供的一种基于机器学习的电力负荷识别方法的流程示意图;Figure 1 is a schematic flow chart of a machine learning-based power load identification method provided by an embodiment of the present application;

附图2为本申请实施例提供的一种基于机器学习的电力负荷识别系统的结构示意图;Figure 2 is a schematic structural diagram of a power load identification system based on machine learning provided by an embodiment of the present application;

附图3为本申请实施例提供的DSP单相多功能表对总体基础电气参数数据监测的示意图。Figure 3 is a schematic diagram of overall basic electrical parameter data monitoring by a DSP single-phase multi-function meter provided by an embodiment of the present application.

具体实施方式Detailed ways

为了使本技术领域的人员更好地理解本申请中的技术方案,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本申请保护的范围。In order to enable those in the technical field to better understand the technical solutions in the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described The embodiments are only some of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative efforts should fall within the scope of protection of this application.

电力负荷印记特征能反映一个用电设备在运行中所体现的独特的反映用电状态的信息,比如电压、有功功率的波形、电流等,在设备运行过程中,这些负荷印记会重复出现,基于此,我们就可以把各个用电设备识别出来。Electric load imprint characteristics can reflect the unique information reflecting the power consumption status of an electrical equipment during operation, such as voltage, active power waveform, current, etc. During the operation of the equipment, these load imprints will appear repeatedly. Based on In this way, we can identify each electrical equipment.

第一方面,本申请实施例公开了一种基于机器学习的电力负荷识别方法,如图1所示,所述方法包括:In the first aspect, an embodiment of the present application discloses a method for identifying electric power loads based on machine learning, as shown in Figure 1. The method includes:

S110:获取各个电器的历史电气参数数据集。S110: Obtain the historical electrical parameter data set of each electrical appliance.

所述获取各个电器的历史电气参数数据集,包括:The acquisition of historical electrical parameter data sets of each electrical appliance includes:

整合与汇总当今公共数据集得到第一数据集;Integrate and summarize today’s public data sets to obtain the first data set;

在用户总的进线端安装电能表获取一个或多个空间内的总体和单个用电负荷的电气参数得到第二数据集;Install an electric energy meter at the user's general incoming line end to obtain the electrical parameters of the overall and individual electrical loads in one or more spaces to obtain a second data set;

根据所述第一数据集和所述第二数据集汇总得到各个电器的历史电气参数数据集。A historical electrical parameter data set of each electrical appliance is obtained based on the first data set and the second data set.

本申请实施例中,除了汇总当今公共数据集外,还在用户总的进线端安装电能表获取电气参数数。In the embodiment of this application, in addition to summarizing today's public data sets, an electric energy meter is also installed at the user's total incoming line end to obtain electrical parameters.

由于在数据测量获得总负荷的暂态和稳态信号中,会存在测量误差的以下几种情况:一是测量装置的不一致性,即对于同一个用电设备,不同的测量装置有不同的测量值;二是传感器由于在压缩、传输原始数据的过程中,会造成数据丢失。正是因为数据采集与传输会造成数据偏差或丢失,所以有必要对数据进行处理以提高负荷识别方法的抗噪能力;三是要研究数据采样的周期对负荷识别的影响,探讨数据采样开销与系统建模复杂度的平衡点。Because in the data measurement to obtain the transient and steady-state signals of the total load, there will be the following measurement errors: First, the inconsistency of the measuring devices, that is, for the same electrical equipment, different measuring devices have different measurements. Second, the sensor will cause data loss during the process of compressing and transmitting raw data. Precisely because data collection and transmission will cause data deviation or loss, it is necessary to process the data to improve the noise immunity of the load identification method; third, it is necessary to study the impact of the data sampling cycle on load identification, and explore the relationship between data sampling overhead and A balance of system modeling complexity.

我们采集的数据集是能源消耗的记录,在一定时间范围内使用多套监控仪器设备监测房间或空间内的各项用电器,采取低频和高频两种数据采集模式。低频采集模式采集频率为1Hz,而高频采集可以达到10kHz至100kHz。低频信号主要针对负荷稳态特征进行提取,而高频信号则可以获得负荷暂态特征和高频谐波特性。一般来说,高频信号可以包括更多的负荷用电特征,利于模型的训练和精确度的提升,但也对数据的采集、传输、压缩和处理能力提出了更高的要求,提高了系统的复杂性。在整个研究过程中,我们需要根据实际情况对系统的复杂性和准确性进行取舍。The data set we collect is a record of energy consumption. We use multiple sets of monitoring instruments and equipment to monitor various electrical appliances in a room or space within a certain time range, using two data collection modes: low frequency and high frequency. The acquisition frequency of low-frequency acquisition mode is 1Hz, while the high-frequency acquisition can reach 10kHz to 100kHz. Low-frequency signals are mainly extracted for load steady-state characteristics, while high-frequency signals can obtain load transient characteristics and high-frequency harmonic characteristics. Generally speaking, high-frequency signals can include more load power characteristics, which is beneficial to model training and accuracy improvement, but it also puts forward higher requirements for data collection, transmission, compression and processing capabilities, improving the system complexity. Throughout the research process, we need to make trade-offs between the complexity and accuracy of the system based on the actual situation.

此外,我们使用DSP单相多功能电能表进行总体基础电气参数数据(包括电压、电流、有功功率、无功功率等)的监测,如附图3所示,附图3为附图3为本申请实施例提供的DSP单相多功能表对总体基础电气参数数据监测的示意图,该表采用RS485远程链接监控控制面板上,数据采集每分钟查询一次并且及时链接数据实时采集服务器,此设备的优点是:DIN35MM导轨安装,具有装卸方便的特点;通讯速率可达9600bps,传输速率快;采用六路开关量输入及输出,满足对测量数据输入及输出的要求;DSP芯片可根据实际需求进行重构和开发,满足实验环境的要求。In addition, we use DSP single-phase multi-function electric energy meters to monitor the overall basic electrical parameter data (including voltage, current, active power, reactive power, etc.), as shown in Figure 3. Figure 3 is based on Figure 3. Schematic diagram of the overall basic electrical parameter data monitoring provided by the DSP single-phase multi-function meter provided in the application embodiment. The meter uses RS485 remote link to the monitoring control panel. The data collection is queried once every minute and is linked to the real-time data collection server in time. The advantages of this equipment Yes: DIN35MM guide rail installation, easy to load and unload; communication rate can reach 9600bps, fast transmission rate; adopts six-way switch input and output to meet the requirements for measurement data input and output; DSP chip can be reconstructed and processed according to actual needs Developed to meet the requirements of the experimental environment.

比如:我们使用DSP单相多功能电能表进行总体基础电气参数数据(包括电压、电流、有功功率、无功功率等)的采集,同一参数内部的采样数据按照时间顺序排列。以有功功率为例,pi表示电能表在第i个采样时刻点所检测到的电压度数,原始采集的数据按时间顺序排列为有功功率序列Pactive={p1,p2,…pi,…},形成电力负荷特征数据采样模块的输出,同时作为电力负荷数据清洗模块的输入。For example: we use a DSP single-phase multi-function electric energy meter to collect overall basic electrical parameter data (including voltage, current, active power, reactive power, etc.), and the sampling data within the same parameter are arranged in chronological order. Taking active power as an example, p i represents the voltage degree detected by the electric energy meter at the i-th sampling time point. The original collected data is arranged in time order as the active power sequence P active = {p 1 , p 2 ,…p i ,…}, forms the output of the power load characteristic data sampling module and serves as the input of the power load data cleaning module.

S120:清洗各个电器的所述历史电气参数数据集。S120: Clean the historical electrical parameter data set of each electrical appliance.

在得到电力负荷数据的信息以后,电力负荷数据清洗将是本套系统和方法的核心技术,主要包括自动筛选与异常数据清理,实现噪声的辨认与分离,下采样,丢弃率,数据归一化,缺失数据补偿及处理,Top-k,错误数据剔除等等。After obtaining the information of electric load data, electric load data cleaning will be the core technology of this system and method, which mainly includes automatic screening and abnormal data cleaning, realizing noise identification and separation, down-sampling, discard rate, and data normalization. , missing data compensation and processing, Top-k, erroneous data elimination, etc.

本系统需要创建数据CSV文件,通过采取删除不完整的数据来完成清理数据集的工作(有部分仪器设备数据因不同时间戳存在数据不完整或丢失的情况)。当创建完数据集并生成CSV文件导入后,数据会驻留在我们的内存数据结构中,该结构可以在整个训练过程中使用(磁盘保存或加载数据)。为了解决不同数据集数据格式不统一的问题,我们需要进行几种预处理工作。This system needs to create data CSV files and complete the work of cleaning the data set by deleting incomplete data (some instrument and equipment data have incomplete or lost data due to different timestamps). Once the dataset is created and the CSV file is generated for import, the data resides in our in-memory data structure, which can be used throughout the training process (saving or loading data to disk). In order to solve the problem of inconsistent data formats in different data sets, we need to perform several preprocessing tasks.

我们使用DSP单相多功能电能表进行总体基础电气参数数据(包括电压、电流、有功功率、无功功率等)的采集,同一参数内部的采样数据按照时间顺序排列。以有功功率为例,pi表示电能表在第i个采样时刻点所检测到的电压度数,原始采集的数据按时间顺序排列为有功功率序列Pactive={p1,p2,…pi,…},形成电力负荷特征数据采样模块的输出,同时作为电力负荷数据清洗模块的输入。We use a DSP single-phase multi-function electric energy meter to collect overall basic electrical parameter data (including voltage, current, active power, reactive power, etc.). The sampling data within the same parameter are arranged in chronological order. Taking active power as an example, p i represents the voltage degree detected by the electric energy meter at the i-th sampling time point. The original collected data is arranged in time order as the active power sequence P active = {p 1 , p 2 ,…p i ,…}, forms the output of the power load characteristic data sampling module and serves as the input of the power load data cleaning module.

在数据清洗模块中,我们根据实际的应用需求将电气时序数据下采样到指定频率f,假设频率f在原采样数据中对应5个采样点的间隔周期,则下采样后的有功功率数据序列为P'active={p1,p6,p11,…p5i+1,…}={q1,q2,…qi,…}。再对所得到的相对低频时序数据进行归一化等其他预处理操作,生成能够被模型直接用作训练的连续时序数据。由于电力负荷特征识别模型同时接受多个电气参数输入,但对某一具体电气参数的输入维度有限且为固定长度n,无法一次性将此电气参数的所有连续时序数据全部输入,故采用长度为n的滑动窗口(sliding window)方式以步长l不断滑动,每次滑动后截取此电气参数序列中固定长度为n的数据子序列,例如有功功率子序列Pi={qin+1,qin+2,…qin+n},以此作为电力负荷特征识别模块输入。In the data cleaning module, we down-sample the electrical timing data to the specified frequency f according to the actual application requirements. Assuming that the frequency f corresponds to the interval period of 5 sampling points in the original sampling data, the down-sampled active power data sequence is P ' active = {p 1 , p 6 , p 11 ,...p 5i+1 ,...}={q 1 , q 2 ,...q i ,...}. The obtained relatively low-frequency time series data is then normalized and other preprocessing operations are performed to generate continuous time series data that can be directly used by the model for training. Since the power load characteristic identification model accepts multiple electrical parameter inputs at the same time, but the input dimension of a specific electrical parameter is limited and has a fixed length n, it is impossible to input all the continuous time series data of this electrical parameter at one time, so the length is The sliding window method of n continuously slides with a step size l. After each sliding, a data subsequence of fixed length n in the electrical parameter sequence is intercepted, for example, the active power subsequence P i ={q in+1 ,q in+2 ,…q in+n }, which is used as the input of the power load characteristic identification module.

不同电器在负荷特征识别模块中有单独对应的深度神经网络模型,数据子序列会同时输入不同电器对应的深度神经网络模型,每个电气参数在同一时间段内的n个采样数值对应到深度神经网络输入层相应的n个节点,经过网络正向传播运算,输出此数据子序列中第m个目标电器工作状态序列Sm,i={sm,in+1,sm,in+2,…sm,in+n}。状态序列长度为n但不限于n,由模型架构决定。基于每个电器的工作状态可以判断此段序列中所有在使用的电器种类。Different electrical appliances have separate corresponding deep neural network models in the load characteristic identification module. The data subsequences are simultaneously input into the deep neural network models corresponding to different electrical appliances. The n sampled values of each electrical parameter in the same time period correspond to the deep neural network models. The corresponding n nodes of the network input layer, through the forward propagation operation of the network, output the mth target electrical appliance operating status sequence S m,i in this data subsequence = {s m,in+1 ,s m,in+2 , …s m,in+n }. The length of the state sequence is n but is not limited to n and is determined by the model architecture. Based on the working status of each electrical appliance, the types of all electrical appliances in use in this sequence can be determined.

具体包括:Specifically include:

(1)统一格式(1)Unified format

由于原始数据集格式并不统一,本方法需要提取每个数据集的特征进行评估,为了避免由于不同电器的消耗功率相差太大,对判断产生很大的干扰,需要将数据进行清洗,同时也是归一化的操作,即转化为[0,1]之间的值,S[i]表示采样值即瞬时有功功率,C为电力负荷类型,sa为样本数据,sc为电力负荷c的有功功率,公式如下:Since the format of the original data set is not uniform, this method needs to extract the characteristics of each data set for evaluation. In order to avoid great interference in the judgment due to the large difference in power consumption of different electrical appliances, the data needs to be cleaned, and it is also The normalization operation is to convert it into a value between [0, 1], S[i] represents the sampled value, which is the instantaneous active power, C is the electric load type, sa is the sample data, s c is the active power of the electric load c Power, the formula is as follows:

(2)下采样(2) Downsampling

设备监视器的采样率在数据集中在0.008Hz到16kHz之间,所以,本系统会使用诸如平均值、模式和中值的聚合函数将数据集下采样到指定频率。The sampling rate of the device monitor in the data set is between 0.008Hz and 16kHz, so this system will use aggregation functions such as mean, mode, and median to downsample the data set to the specified frequency.

若采样率低于1Hz,则按原有采样率进行记录。If the sampling rate is lower than 1Hz, it will be recorded at the original sampling rate.

若采样率高于1Hz,则将采样率降采样至1Hz。具体降采样方法包括:If the sampling rate is higher than 1Hz, the sampling rate is downsampled to 1Hz. Specific downsampling methods include:

1.使用每1秒间隔采样点的数值,抛弃所有1秒内的其他采样值;1. Use the value of the sampling point every 1 second and discard all other sampling values within 1 second;

2.计算相邻1秒钟内原始采样点的平均值作为1秒钟边界数据值;2. Calculate the average of the original sampling points within adjacent 1 second as the 1 second boundary data value;

3.计算相邻1秒钟内原始采样点的中值作为1秒钟边界数据值;3. Calculate the median value of the original sampling points within adjacent 1 second as the 1 second boundary data value;

(3)电压归一化(3)Voltage normalization

由于采集的电压存在波动的情况,例如,同一数据集中显示电压从180-250V变化,而另一个数据集中的电压在118-123V范围内变化。本系统必须考虑这些电压波动带来的影响,因为它们会明显影响功耗。Because the collected voltage fluctuates, for example, the same data set shows the voltage changes from 180-250V, while the voltage in another data set changes in the range of 118-123V. The system must take into account the effects of these voltage fluctuations as they can significantly affect power consumption.

根据将电压归一至同一波动范围内,其中:according to Normalize the voltage to the same fluctuation range, where:

Powernormalised表示归一化功率值,Power表示测量功率值,Voltagenominal表示名义电压值,Voltageobserved表示测量电压值。Power normalized represents the normalized power value, Power represents the measured power value, Voltagenominal represents the nominal voltage value, and Voltageobserved represents the measured voltage value.

(4)Top-k(4)Top-k

一般来说,我们的识别系统针对排名在前k位(其中k为可调整参数)的大能耗设备而不是所有设备,原因有以下三点,首先,前k位耗电设备已经能对整体电力消耗情况提供大部分的参考信息;其次,这些设备具有最显着的特征,可以认为其余设备仅产生噪声;第三,针对较大占比耗电设备的建模和识别会大大提高数据的可靠性。Generally speaking, our identification system targets the large energy-consuming devices ranked in the top k (where k is an adjustable parameter) rather than all devices. There are three reasons. First, the top k power-consuming devices can already detect the overall Power consumption provides most of the reference information; secondly, these devices have the most significant characteristics, and the rest of the devices can be considered to only generate noise; thirdly, modeling and identifying a larger proportion of power-consuming devices will greatly improve the accuracy of the data. reliability.

在数据清洗的过程中,本系统也会解决数据集的其他常见问题,例如:设备传感器未报告读数、小数据缺失、去除异常数值如观察到的电压超过额定电压的两倍、主电源数据丢失数据等等。During the data cleaning process, this system will also address other common issues with the data set, such as: equipment sensors not reporting readings, small data missing, removal of abnormal values such as observed voltages exceeding twice the rated voltage, and loss of main power data Data and more.

(5)检测间隙(5)Detection gap

当今很多算法都假设每个数据采集装置的通讯是连续的,然而,实际情况是,有时候会发生数据采集装置断开或发生故障等情况,如果我们设定一个参数值,当发生断开或故障的时间大于设定的参数值,那么,可以认为一个连续电力数据样本中会存在间隙。例如,我们计算相邻采样点时间戳之间的差值,若大于某一个参数如10秒,则认为该数据集中存在间隙。对于存在间隙的数据序列,不可以直接用于系统训练集和测试集,所有的训练数据和测试数据序列必须选取中间无间隙的数据序列。Many algorithms today assume that the communication of each data acquisition device is continuous. However, the actual situation is that sometimes the data acquisition device is disconnected or malfunctions. If we set a parameter value, when a disconnection or failure occurs, If the fault time is greater than the set parameter value, it can be considered that there will be a gap in a continuous power data sample. For example, we calculate the difference between the timestamps of adjacent sampling points. If it is greater than a certain parameter such as 10 seconds, it is considered that there is a gap in the data set. Data sequences with gaps cannot be directly used in the system training set and test set. All training data and test data sequences must be data sequences without gaps in the middle.

(6)正常运行时间(6)Uptime

正常运行时间是传感器记录的总时间。它是最后一个时间戳,减去第一个时间戳,减去所有存在的间隙后所得到的持续时间。Uptime is the total time recorded by the sensor. It is the duration of the last timestamp, minus the first timestamp, minus any existing gaps.

S130:将单个电器的清洗后的历史电气参数数据集按照比例划分成原始训练集、验证集及测试集。S130: Divide the cleaned historical electrical parameter data set of a single electrical appliance into an original training set, a verification set and a test set in proportion.

S140:沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集。S140: Intercept data fragments from the original training set, verification set and test set along the timeline, and generate a training set, verification set and test set composed of data fragments.

比如,在数据清洗模块中,我们根据实际的应用需求将电气时序数据下采样到指定频率f,假设频率f在原采样数据中对应5个采样点的间隔周期,则下采样后的有功功率数据序列为P'active={p1,p6,p11,…p5i+1,…}={q1,q2,…qi,…}。再对所得到的相对低频时序数据进行归一化等其他预处理操作,生成能够被模型直接用作训练的连续时序数据。由于电力负荷特征识别模型同时接受多个电气参数输入,但对某一具体电气参数的输入维度有限且为固定长度n,无法一次性将此电气参数的所有连续时序数据全部输入,故采用长度为n的滑动窗口(sliding window)方式以步长l不断滑动,每次滑动后截取此电气参数序列中固定长度为n的数据子序列,例如有功功率子序列Pi={qin+1,qin+2,…qin+n},以此作为电力负荷特征识别模块输入。For example, in the data cleaning module, we down-sample the electrical timing data to the specified frequency f according to the actual application requirements. Assuming that the frequency f corresponds to the interval period of 5 sampling points in the original sampling data, then the down-sampled active power data sequence It is P' active = {p 1 , p 6 , p 11 ,…p 5i+1 ,…}={q 1 , q 2 ,…q i ,…}. The obtained relatively low-frequency time series data is then normalized and other preprocessing operations are performed to generate continuous time series data that can be directly used by the model for training. Since the power load characteristic identification model accepts multiple electrical parameter inputs at the same time, but the input dimension of a specific electrical parameter is limited and has a fixed length n, it is impossible to input all the continuous time series data of this electrical parameter at one time, so the length is The sliding window method of n continuously slides with a step size l. After each sliding, a data subsequence of fixed length n in the electrical parameter sequence is intercepted, for example, the active power subsequence P i ={q in+1 ,q in+2 ,…q in+n }, which is used as the input of the power load characteristic identification module.

S150:对每个目标电器分别建立基于降噪自编码器的卷积神经网络模型。S150: Establish a convolutional neural network model based on the denoising autoencoder for each target electrical appliance.

在本发明实施例中,通过我们设计的电力负荷特征识别模块,可以在未知分线路的电力负荷特征的情况下,通过监测总线的电力负荷数据,来识别正在使用的用电器。In the embodiment of the present invention, through the power load characteristic identification module designed by us, the electrical appliances in use can be identified by monitoring the power load data of the bus when the power load characteristics of the branch lines are unknown.

电力负荷特征识别模块的设计基于深度神经网络,不需要复杂的人为设计的特征参数,即可自动获得识别电力负荷所需要的特征参数,本专利中,数据采集与数据清洗是智能配用电信息负荷特征识别的准备工作,电力负荷特征识别模块则是本专利的关键技术。The design of the power load feature identification module is based on a deep neural network. It does not require complex artificially designed feature parameters to automatically obtain the feature parameters required to identify the power load. In this patent, data acquisition and data cleaning are intelligent distribution and consumption information. Preparatory work for load feature identification, the power load feature identification module is the key technology of this patent.

将电力负荷分解看作“降噪”任务,其中目标电器的功率相当于“干净”的主信号,而同时间其他电器所产生的功率则是背景“噪音”,任务目的则是将目标电器功率从背景“噪音”功率中分离出来。Think of the power load decomposition as a "noise reduction" task, in which the power of the target appliance is equivalent to the "clean" main signal, while the power generated by other appliances at the same time is the background "noise". The purpose of the task is to reduce the power of the target appliance to Separate from background "noise" power.

基于降噪自编码器(Denoising AutoEncoder,DAE)概念搭建深度神经网络,对第一层和最后一层网络采用卷积层,利用其位置、缩放和扭曲不变性,提取有用的低级特征,比如1000瓦的阶跃变化特征无论发生在功率片段的什么位置都很可能是有用的特征。Build a deep neural network based on the concept of Denoising AutoEncoder (DAE), use convolutional layers for the first and last layer of the network, and use its position, scaling and distortion invariance to extract useful low-level features, such as 1000 The characteristic of a step change in watts is likely to be a useful feature regardless of where it occurs in the power segment.

基于降噪自编码器的卷积神经网络模型的架构及详细参数如下表所示:The architecture and detailed parameters of the convolutional neural network model based on denoising autoencoders are shown in the following table:

表一基于降噪自编码器的卷积神经网络模型的架构及参数Table 1 Architecture and parameters of the convolutional neural network model based on denoising autoencoder

在电力负荷特征识别模块中,例如学习率,batch size,反向传播的优化模型,dropout函数,激活函数的选择等等,不同的参数组合会带来不同的机器学习效果,我们的系统支持不同的参数组合优化,不需要人为设计电力负荷的特征参数,即可在电力负荷特征识别模块中准确识别出用电设备的种类。In the power load feature identification module, such as learning rate, batch size, back propagation optimization model, dropout function, activation function selection, etc., different parameter combinations will bring different machine learning effects, and our system supports different By optimizing the parameter combination, there is no need to manually design the characteristic parameters of the electric load, and the type of electrical equipment can be accurately identified in the electric load characteristic identification module.

S160:对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型。S160: For each target electrical appliance, train the convolutional neural network model based on the denoising autoencoder based on the training set, verification set and test set composed of data segments to obtain an optimized model for each target electrical appliance.

S170:采集用户电力负荷的当前数据,并将所述当前数据输入每个目标电器的优化模型中,分离出电器的工作状态,输出电力负荷的类别结果。S170: Collect the current data of the user's electric load, input the current data into the optimization model of each target electrical appliance, isolate the working status of the electrical appliance, and output the category result of the electric load.

不同电器在负荷特征识别模块中有单独对应的深度神经网络模型,数据子序列会同时输入不同电器对应的深度神经网络模型,每个电气参数在同一时间段内的n个采样数值对应到深度神经网络输入层相应的n个节点,经过网络正向传播运算,输出此数据子序列中第m个目标电器工作状态序列Sm,i={sm,in+1,sm,in+2,…sm,in+n}。状态序列长度为n但不限于n,由模型架构决定。基于每个电器的工作状态可以判断此段序列中所有在使用的电器种类。Different electrical appliances have separate corresponding deep neural network models in the load characteristic identification module. The data subsequences are simultaneously input into the deep neural network models corresponding to different electrical appliances. The n sampled values of each electrical parameter in the same time period correspond to the deep neural network models. The corresponding n nodes of the network input layer, through the forward propagation operation of the network, output the mth target electrical appliance operating status sequence S m,i in this data subsequence = {s m,in+1 ,s m,in+2 , …s m,in+n }. The length of the state sequence is n but is not limited to n and is determined by the model architecture. Based on the working status of each electrical appliance, the types of all electrical appliances in use in this sequence can be determined.

我们向电力负荷识别模块中输入由前述电力负荷采集和清洗模块处理过的基础电气参数数据(包括电流、电压、有功功率、无功功率)作为输入数据,目标是通过电气数据推断出每个用电设备的设备类别。We input the basic electrical parameter data (including current, voltage, active power, reactive power) processed by the aforementioned power load acquisition and cleaning module into the power load identification module as input data. The goal is to infer each user through the electrical data. Equipment category for electrical equipment.

所选用的数据和识别方法具体如下:The data and identification methods selected are as follows:

选取总电路的数据基础电气参数数据(包括电流、电压、有功功率、无功功率);Select the basic electrical parameter data of the total circuit (including current, voltage, active power, and reactive power);

选取1分钟至5分钟的上述数据,以采样率为1Hz为例,则有60至300个采样点;Select the above data from 1 minute to 5 minutes, taking the sampling rate of 1Hz as an example, there will be 60 to 300 sampling points;

将上述数据处理成16个数据点作为一个序列的若干序列,其中剔除有间隙的序列;Process the above data into several sequences of 16 data points as one sequence, and eliminate sequences with gaps;

载入提前训练好的用于电力负荷识别的机器学习模型程序,可以用任何现有的机器学习或神经网络编程框架实现,如tensor flow,keras等等);Load the pre-trained machine learning model program for power load identification, which can be implemented using any existing machine learning or neural network programming framework, such as tensor flow, keras, etc.);

将数据序列输入机器学习模型,识别模块输出对应每个时刻采样点的当前设备类别数据(用整数表示,例如1=空调,2=烧水壶,3=洗衣机,4=电灯,等等)。Input the data sequence into the machine learning model, and the identification module outputs the current equipment category data corresponding to the sampling point at each moment (expressed by integers, such as 1 = air conditioner, 2 = kettle, 3 = washing machine, 4 = light, etc.).

我们向电力负荷识别模块中输入由前述电力负荷采集和清洗模块处理过的基础电气参数数据包括电流、电压、有功功率、无功功率作为输入数据,目标是通过电气数据推断出每个用电设备的设备类别。We input the basic electrical parameter data processed by the aforementioned power load collection and cleaning module into the power load identification module, including current, voltage, active power, and reactive power as input data. The goal is to infer each electrical equipment through the electrical data. device category.

所选用的数据和识别方法具体如下:The data and identification methods selected are as follows:

选取总电路的数据基础电气参数数据如包括电流、电压、有功功率、无功功率;Select the basic electrical parameter data of the total circuit, such as current, voltage, active power, and reactive power;

选取1分钟至5分钟的上述数据,以采样率为1Hz为例,则有60至300个采样点;Select the above data from 1 minute to 5 minutes, taking the sampling rate of 1Hz as an example, there will be 60 to 300 sampling points;

将上述数据处理成16个数据点作为一个序列的若干序列,其中剔除有间隙的序列;Process the above data into several sequences of 16 data points as one sequence, and eliminate sequences with gaps;

载入提前训练好的用于电力负荷识别的机器学习模型程序,可以用任何现有的机器学习或神经网络编程框架实现,如tensor flow,keras等等;Load the pre-trained machine learning model program for power load identification, which can be implemented using any existing machine learning or neural network programming framework, such as tensor flow, keras, etc.;

将数据序列输入机器学习模型,识别模块输出对应每个时刻采样点的当前设备类别数据(用整数表示,例如1=空调,2=烧水壶,3=洗衣机,4=电灯,等等)。Input the data sequence into the machine learning model, and the identification module outputs the current equipment category data corresponding to the sampling point at each moment (expressed by integers, such as 1 = air conditioner, 2 = kettle, 3 = washing machine, 4 = light, etc.).

本实施例提供的基于机器学习的电力负荷识别方法及系统具体地以实测的电气参数数据包括电流、电压、有功功率和无功功率等为基础,将基础电气参数数据统一格式,在我们长时间针对电力负荷特征提取、采集、分析、归纳和训练的基础上,可以在已知一段时间的若干用电负荷总体基础电气参数数据包括电压、电流、有功功率、无功功率等的情况下,正确识别出正在使用的电器种类。因此本申请提供的用于电力负荷识别的机器学习模型训练方法及系统不需要人为进行手动调节参数,较传统方法相比如时域的波形匹配,特征点匹配以及谱分析等匹配准确率高,本申请可以自主学习并且自动获得识别电力负荷所需要的特征参数,从而提高了模型的适用范围,提高了电力负荷识别的准确率。The power load identification method and system based on machine learning provided in this embodiment is specifically based on the measured electrical parameter data including current, voltage, active power and reactive power, etc., and unifies the basic electrical parameter data into a unified format. In our long-term Based on the extraction, collection, analysis, induction and training of electric load characteristics, it is possible to accurately determine the overall basic electrical parameter data of several electric loads for a period of time, including voltage, current, active power, reactive power, etc. Identify the type of appliance being used. Therefore, the machine learning model training method and system for power load identification provided by this application does not require manual adjustment of parameters. Compared with traditional methods such as time domain waveform matching, feature point matching and spectrum analysis, the matching accuracy is higher. The application can learn independently and automatically obtain the characteristic parameters needed to identify electric loads, thereby improving the applicable scope of the model and improving the accuracy of electric load identification.

第二方面,本申请基于上述的基于机器学习的电力负荷识别方法,本申请还提供了一种基于机器学习的电力负荷识别系统,如图2所示,所述系统包括:In the second aspect, this application is based on the above-mentioned machine learning-based power load identification method. This application also provides a machine learning-based power load identification system, as shown in Figure 2. The system includes:

数据集获取模块,用于获取各个电器的历史电气参数数据集;The data set acquisition module is used to obtain the historical electrical parameter data set of each electrical appliance;

数据集清洗模块,用于清洗各个电器的所述历史电气参数数据集;A data set cleaning module, used to clean the historical electrical parameter data set of each electrical appliance;

数据集划分模块,用于将单个电器的清洗后的历史电气参数数据集按照比例划分成原始训练集、验证集及测试集;The data set division module is used to divide the cleaned historical electrical parameter data set of a single electrical appliance into the original training set, verification set and test set in proportion;

数据集片段截取模块,用于沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集;A data set fragment interception module is used to intercept data fragments from the original training set, verification set and test set along the timeline, and generate a training set, verification set and test set composed of data fragments;

模型建立模块,用于对每个目标电器分别建立基于降噪自编码器的卷积神经网络模型;The model building module is used to establish a convolutional neural network model based on the denoising autoencoder for each target electrical appliance;

模型优化模块,用于对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型;A model optimization module, used to train the convolutional neural network model based on the denoising autoencoder based on the training set, verification set and test set composed of data fragments for each target appliance to obtain an optimization model for each target appliance;

电力负荷类别输出模块,用于采集用户电力负荷的当前数据,并将所述当前数据输入每个目标电器的优化模型中,分离出电器的工作状态,输出电力负荷的类别结果。The electric load category output module is used to collect the current data of the user's electric load, input the current data into the optimization model of each target electrical appliance, isolate the working status of the electrical appliance, and output the category result of the electric load.

由于以上实施方式均是在其他方式之上引用结合进行说明,不同实施例之间均具有相同的部分,本说明书中各个实施例之间相同、相似的部分互相参见即可。在此不再详细阐述。Since the above embodiments are described by reference and combination with other methods, different embodiments all have the same parts, and the same and similar parts between the various embodiments in this specification can be referred to each other. No further details will be given here.

本领域技术人员在考虑说明书及实践这里发明的公开后,将容易想到本申请的其他实施方案。本申请旨在涵盖本发明的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本申请未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本申请的真正范围和精神由权利要求的内容指出。Other embodiments of the present application will be readily apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the present invention that follow the general principles of this application and include common knowledge or customary technical means in the technical field that are not disclosed in this application. . It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the application being indicated by the following claims.

以上所述的本申请实施方式并不构成对本申请保护范围的限定。The above-described embodiments of the present application do not limit the scope of protection of the present application.

Claims (8)

1.一种基于机器学习的电力负荷识别方法,其特征在于,所述方法包括:1. A power load identification method based on machine learning, characterized in that the method includes: 获取各个电器的历史电气参数数据集;Obtain the historical electrical parameter data set of each electrical appliance; 清洗各个电器的所述历史电气参数数据集;Cleaning the historical electrical parameter data set of each electrical appliance; 将单个电器的清洗后的历史电气参数数据集按照比例划分成原始训练集、验证集及测试集;Divide the cleaned historical electrical parameter data set of a single electrical appliance into the original training set, verification set and test set in proportion; 沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集;Intercept data fragments from the original training set, verification set and test set along the timeline to generate a training set, verification set and test set composed of data fragments; 对每个目标电器分别建立基于降噪自编码器的卷积神经网络模型;Establish a convolutional neural network model based on denoising autoencoder for each target electrical appliance; 对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型;For each target electrical appliance, train the convolutional neural network model based on the denoising autoencoder based on the training set, verification set and test set composed of data segments to obtain an optimized model for each target electrical appliance; 采集用户电力负荷的当前数据,并将所述当前数据输入每个目标电器的优化模型中,分离出电器的工作状态,输出电力负荷的类别结果;Collect the current data of the user's electric load, input the current data into the optimization model of each target electrical appliance, isolate the working status of the electrical appliance, and output the category results of the electric load; 所述清洗各个电器的所述历史电气参数数据集包括:自动筛选与异常数据清理,实现噪声的辨认与分离,下采样,丢弃率,数据归一化,缺失数据补偿及处理,Top-k以及错误数据剔除;The cleaning of the historical electrical parameter data set of each electrical appliance includes: automatic screening and abnormal data cleaning, noise identification and separation, down-sampling, discard rate, data normalization, missing data compensation and processing, Top-k and Error data removal; 所述获取各个电器的历史电气参数数据集,包括:The acquisition of historical electrical parameter data sets of each electrical appliance includes: 整合与汇总当今公共数据集得到第一数据集;Integrate and summarize today’s public data sets to obtain the first data set; 在用户总的进线端安装电能表获取一个或多个空间内的总体和单个用电负荷的电气参数得到第二数据集;Install an electric energy meter at the user's general incoming line end to obtain the electrical parameters of the overall and individual electrical loads in one or more spaces to obtain a second data set; 根据所述第一数据集和所述第二数据集汇总得到各个电器的历史电气参数数据集。A historical electrical parameter data set of each electrical appliance is obtained based on the first data set and the second data set. 2.根据权利要求1所述的基于机器学习的电力负荷识别方法,其特征在于,所述沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集,包括:2. The power load identification method based on machine learning according to claim 1, characterized in that, the original training set, the verification set and the test set are intercepted along the time axis to generate data fragments. The training set, validation set and test set include: 使用长度为n,移动步长为1的滑动窗口沿时间轴方向对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集。Use a sliding window with a length of n and a moving step of 1 to intercept data fragments from the original training set, verification set and test set along the time axis direction to generate a training set, verification set and test set composed of data fragments. 3.根据权利要求1所述的基于机器学习的电力负荷识别方法,其特征在于,还包括数据格式的统一:3. The power load identification method based on machine learning according to claim 1, characterized in that it also includes the unification of data formats: 根据将有功功率转化为[0,1]之间的数值,其中:according to Convert active power into a value between [0, 1], where: S[i]表示采样值即瞬时有功功率,C为电力负荷类型,为样本数据,s c 为电力负荷c的有功功率。S[i] represents the sampled value, which is the instantaneous active power, C is the electric load type, is the sample data, s c is the active power of the electric load c. 4.根据权利要求1所述的基于机器学习的电力负荷识别方法,其特征在于,还包括下采样到指定频率:4. The power load identification method based on machine learning according to claim 1, further comprising down-sampling to a specified frequency: 若采样率低于1Hz,则按原有采样率进行记录;If the sampling rate is lower than 1Hz, it will be recorded at the original sampling rate; 若采样率高于1Hz,则将采样率降采样至1Hz;If the sampling rate is higher than 1Hz, downsample the sampling rate to 1Hz; 其中,所述将采样率降采样至1Hz包括:Among them, downsampling the sampling rate to 1Hz includes: 使用每1秒间隔采样点的数值,抛弃所有1秒内的其他采样值;Use the value of the sampling point every 1 second and discard all other sampling values within 1 second; 计算相邻1秒钟内原始采样点的平均值作为1秒钟边界数据值;Calculate the average of the original sampling points within adjacent 1 second as the 1 second boundary data value; 计算相邻1秒钟内原始采样点的中值作为1秒钟边界数据值。Calculate the median value of the original sampling points within adjacent 1 second as the 1 second boundary data value. 5.根据权利要求1所述的基于机器学习的电力负荷识别方法,其特征在于,还包括电压归一化:5. The power load identification method based on machine learning according to claim 1, characterized in that it also includes voltage normalization: 根据Power=/>Power将电压归一至同一波动范围内,其中:According to Power =/> Power normalizes the voltage to the same fluctuation range, where: Power表示归一化功率值,Power表示测量功率值,Voltagenominal表示名义电压值Voltageobserved表示测量电压值。Power Represents the normalized power value, Power represents the measured power value, Voltage nominal represents the nominal voltage value, and Voltage observed represents the measured voltage value. 6.根据权利要求1所述的基于机器学习的电力负荷识别方法,其特征在于,所述清洗各个电器的所述历史电气参数数据集,还包括检测间隙、正常运行时间及识别能耗排名在前K位的电力负荷,其中k为可调整参数。6. The power load identification method based on machine learning according to claim 1, characterized in that the cleaning of the historical electrical parameter data set of each electrical appliance further includes detecting gaps, normal operating time and identifying energy consumption rankings. The electric load of the first K positions, where k is an adjustable parameter. 7.根据权利要求1所述的基于机器学习的电力负荷识别方法,其特征在于,所述对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型,包括:7. The power load identification method based on machine learning according to claim 1, characterized in that, for each target electrical appliance, the training set, verification set and test set composed of data fragments are used to identify the power load based on noise reduction auto-encoding. The convolutional neural network model of the device is trained to obtain the optimization model of each target electrical appliance, including: 使用目标电器数据片段组成的训练集训练所基于降噪自编码器的卷积神经网络模型的参数;Parameters of the convolutional neural network model based on the denoising autoencoder for training using a training set composed of target electrical appliance data segments; 将不同训练阶段所得到的不同模型在数据片段组成的验证集上验证测试,直至效果最好作为目标电器的对应模型;Verify and test the different models obtained in different training stages on the verification set composed of data fragments until the best effect is used as the corresponding model of the target appliance; 利用数据片段组成的测试集对目标电器的对应模型进行性能测试,直至性能最优得到目标电器的优化模型。Use the test set composed of data fragments to perform performance testing on the corresponding model of the target electrical appliance until the performance is optimal to obtain the optimized model of the target electrical appliance. 8.一种基于机器学习的电力负荷识别系统,其特征在于,所述系统包括:8. A power load identification system based on machine learning, characterized in that the system includes: 数据集获取模块,用于获取各个电器的历史电气参数数据集;所述获取各个电器的历史电气参数数据集,包括:整合与汇总当今公共数据集得到第一数据集;在用户总的进线端安装电能表获取一个或多个空间内的总体和单个用电负荷的电气参数得到第二数据集;根据所述第一数据集和所述第二数据集汇总得到各个电器的历史电气参数数据集;The data set acquisition module is used to obtain the historical electrical parameter data set of each electrical appliance; the acquisition of the historical electrical parameter data set of each electrical appliance includes: integrating and summarizing current public data sets to obtain the first data set; in the user's total incoming line The end-installed electric energy meter obtains the electrical parameters of the overall and individual electrical loads in one or more spaces to obtain a second data set; and obtains historical electrical parameter data of each electrical appliance based on the first data set and the second data set. set; 数据集清洗模块,用于清洗各个电器的所述历史电气参数数据集;所述清洗各个电器的所述历史电气参数数据集包括:自动筛选与异常数据清理,实现噪声的辨认与分离,下采样,丢弃率,数据归一化,缺失数据补偿及处理,Top-k以及错误数据剔除;The data set cleaning module is used to clean the historical electrical parameter data set of each electrical appliance; the cleaning of the historical electrical parameter data set of each electrical appliance includes: automatic screening and abnormal data cleaning, realizing the identification and separation of noise, and down-sampling , discard rate, data normalization, missing data compensation and processing, Top-k and erroneous data elimination; 数据集划分模块,用于将单个电器的清洗后的历史电气参数数据集按照比例划分成原始训练集、验证集及测试集;The data set division module is used to divide the cleaned historical electrical parameter data set of a single electrical appliance into the original training set, verification set and test set in proportion; 数据集片段截取模块,用于沿时间轴对所述原始训练集、验证集及测试集进行数据片段的截取,生成由数据片段组成的训练集、验证集和测试集;A data set fragment interception module is used to intercept data fragments from the original training set, verification set and test set along the timeline, and generate a training set, verification set and test set composed of data fragments; 模型建立模块,用于对每个目标电器分别建立基于降噪自编码器的卷积神经网络模型;The model building module is used to establish a convolutional neural network model based on the denoising autoencoder for each target electrical appliance; 模型优化模块,用于对每个目标电器根据数据片段组成的训练集、验证集和测试集对所述基于降噪自编码器的卷积神经网络模型进行训练得到每个目标电器的优化模型;A model optimization module, used to train the convolutional neural network model based on the denoising autoencoder based on the training set, verification set and test set composed of data fragments for each target appliance to obtain an optimization model for each target appliance; 电力负荷类别输出模块,用于采集用户电力负荷的当前数据,并将所述当前数据输入每个目标电器的优化模型中,分离出电器的工作状态,输出电力负荷的类别结果。The electric load category output module is used to collect the current data of the user's electric load, input the current data into the optimization model of each target electrical appliance, isolate the working status of the electrical appliance, and output the category result of the electric load.
CN202010152529.0A 2020-03-06 2020-03-06 Power load identification method and system based on machine learning Active CN111382789B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010152529.0A CN111382789B (en) 2020-03-06 2020-03-06 Power load identification method and system based on machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010152529.0A CN111382789B (en) 2020-03-06 2020-03-06 Power load identification method and system based on machine learning

Publications (2)

Publication Number Publication Date
CN111382789A CN111382789A (en) 2020-07-07
CN111382789B true CN111382789B (en) 2023-11-14

Family

ID=71217025

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010152529.0A Active CN111382789B (en) 2020-03-06 2020-03-06 Power load identification method and system based on machine learning

Country Status (1)

Country Link
CN (1) CN111382789B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111917114B (en) * 2020-08-20 2022-02-08 国网湖南省电力有限公司 Power load event detection method suitable for embedded platform
CN112149346B (en) * 2020-09-07 2024-04-26 华中科技大学 A wind farm equivalent modeling method, device, electronic device and storage medium
WO2022141330A1 (en) * 2020-12-31 2022-07-07 Guizhou Power Grid Company Limited A non-intrusive load identification method based on fingerprint characteristics of load power
CN112801115B (en) * 2021-01-26 2022-09-06 广西电网有限责任公司电力科学研究院 Power fluctuation control method and application based on microgrid source-load comprehensive characteristic image
CN114970648A (en) * 2021-02-20 2022-08-30 青岛特来电新能源科技有限公司 New energy equipment identification method, device and medium
SE2150281A1 (en) * 2021-03-11 2022-09-12 Ctek Sweden Ab Method for detecting performance deterioration of components
CN113033633B (en) * 2021-03-12 2022-12-09 贵州电网有限责任公司 Equipment type identification method combining power fingerprint knowledge and neural network
CN113191600A (en) * 2021-04-13 2021-07-30 清科优能(深圳)技术有限公司 Intelligent house non-invasive load identification method based on data mining
CN117556203B (en) * 2023-11-13 2024-11-26 中国铁塔股份有限公司重庆市分公司 A building energy consumption sensing method and system
CN118584232A (en) * 2024-08-06 2024-09-03 青岛鼎信通讯股份有限公司 Method, device, equipment and medium for identifying electric equipment
CN119619686B (en) * 2025-02-11 2025-07-15 广州疆海科技有限公司 Non-intrusive load monitoring method, device, computer equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108416695A (en) * 2018-02-24 2018-08-17 合肥工业大学 Power load probability density prediction method, system and medium based on deep learning
CN108616120A (en) * 2018-04-28 2018-10-02 西安理工大学 A kind of non-intrusive electrical load decomposition method based on RBF neural
CN109116100A (en) * 2018-07-09 2019-01-01 清华大学 It is a kind of based on coding-decoding structure electric load electricity consumption decomposition method
CN109145949A (en) * 2018-07-19 2019-01-04 山东师范大学 Non-intrusive electrical load monitoring and decomposition method and system based on integrated study
CN110376457A (en) * 2019-06-28 2019-10-25 同济大学 Non-intrusion type load monitoring method and device based on semi-supervised learning algorithm
CN110536237A (en) * 2019-09-04 2019-12-03 国网四川省电力公司电力科学研究院 Location information acquisition method based on UWB

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100305889A1 (en) * 2009-05-27 2010-12-02 General Electric Company Non-intrusive appliance load identification using cascaded cognitive learning

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108416695A (en) * 2018-02-24 2018-08-17 合肥工业大学 Power load probability density prediction method, system and medium based on deep learning
CN108616120A (en) * 2018-04-28 2018-10-02 西安理工大学 A kind of non-intrusive electrical load decomposition method based on RBF neural
CN109116100A (en) * 2018-07-09 2019-01-01 清华大学 It is a kind of based on coding-decoding structure electric load electricity consumption decomposition method
CN109145949A (en) * 2018-07-19 2019-01-04 山东师范大学 Non-intrusive electrical load monitoring and decomposition method and system based on integrated study
CN110376457A (en) * 2019-06-28 2019-10-25 同济大学 Non-intrusion type load monitoring method and device based on semi-supervised learning algorithm
CN110536237A (en) * 2019-09-04 2019-12-03 国网四川省电力公司电力科学研究院 Location information acquisition method based on UWB

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于大数据Bayes分类的家电设备识别算法;晋远等;《建筑科学》;20170415;第33卷(第04期);全文 *
胡忠义著."实验结果与分析".《基于计算智能技术的电力负荷预测理论及应用》.2019,58-60. *

Also Published As

Publication number Publication date
CN111382789A (en) 2020-07-07

Similar Documents

Publication Publication Date Title
CN111382789B (en) Power load identification method and system based on machine learning
CN111242391B (en) Machine learning model training method and system for power load identification
US9817045B2 (en) Methods and system for nonintrusive load monitoring
Luo et al. Monitoring HVAC equipment electrical loads from a centralized location-methods and field test results
Yu et al. Nonintrusive appliance load monitoring for smart homes: Recent advances and future issues
CN108459295B (en) CVT on-line monitoring system and method based on distributed data acquisition processing
WO2015008645A1 (en) Monitoring apparatus, monitoring method, and program
CN110133393A (en) An electricity monitoring system and method based on non-intrusive monitoring technology
CN113075512B (en) Transformer discharge fault diagnosis method and system based on acoustic detection
CN106908671A (en) A kind of non-intrusion type household loads intelligent detecting method and system
WO2025086414A1 (en) Power distribution network fault detection method applicable to distributed power supply
CN109581268B (en) Fault diagnosis method and device for optical fiber current transformer
CN118114019A (en) Automatic identification method and system for power distribution network topology based on data analysis
CN117630573A (en) Intelligent fault detection method for embedded cable
CN117607784A (en) A real-time monitoring system for electric meter operating errors
Liu et al. Adaptive multitimescale event detection in nonintrusive load monitoring based on minimum description length principle
CN117951633B (en) Photovoltaic power generation equipment fault diagnosis method and system
CN202720307U (en) A traveling wave location device for power line faults
CN118839132A (en) Power consumption monitoring and analyzing method for electric energy meter
CN203672481U (en) Electric equipment noise detection device
CN118228024A (en) Method and system for analyzing and storing high-frequency data of electronic transformer
CN116881773A (en) Equipment fault recognition method and system based on deep learning
CN115932699A (en) Multi-dimensional diagnosis method and system for signal sampling channel
CN108594035A (en) A kind of load testing method and system
CN111682643A (en) An intelligent circuit breaker that realizes the identification of electric load characteristics

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant