[go: up one dir, main page]

CN118669281A - Online state evaluation method, device and equipment of wind motor and storage medium - Google Patents

Online state evaluation method, device and equipment of wind motor and storage medium Download PDF

Info

Publication number
CN118669281A
CN118669281A CN202410878613.9A CN202410878613A CN118669281A CN 118669281 A CN118669281 A CN 118669281A CN 202410878613 A CN202410878613 A CN 202410878613A CN 118669281 A CN118669281 A CN 118669281A
Authority
CN
China
Prior art keywords
data
wind turbine
target state
features
online
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202410878613.9A
Other languages
Chinese (zh)
Inventor
李宏祥
王良红
王飞
范贤旗
杨盼盼
李王文
刘斌
高庆鹏
朱兴翔
杨云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunnan Dianneng Intelligent Energy Co ltd
Original Assignee
Yunnan Dianneng Intelligent Energy Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yunnan Dianneng Intelligent Energy Co ltd filed Critical Yunnan Dianneng Intelligent Energy Co ltd
Priority to CN202410878613.9A priority Critical patent/CN118669281A/en
Publication of CN118669281A publication Critical patent/CN118669281A/en
Pending legal-status Critical Current

Links

Classifications

    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F03MACHINES OR ENGINES FOR LIQUIDS; WIND, SPRING, OR WEIGHT MOTORS; PRODUCING MECHANICAL POWER OR A REACTIVE PROPULSIVE THRUST, NOT OTHERWISE PROVIDED FOR
    • F03DWIND MOTORS
    • F03D17/00Monitoring or testing of wind motors, e.g. diagnostics
    • F03D17/005Monitoring or testing of wind motors, e.g. diagnostics using computation methods, e.g. neural networks
    • F03D17/006Estimation methods
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F03MACHINES OR ENGINES FOR LIQUIDS; WIND, SPRING, OR WEIGHT MOTORS; PRODUCING MECHANICAL POWER OR A REACTIVE PROPULSIVE THRUST, NOT OTHERWISE PROVIDED FOR
    • F03DWIND MOTORS
    • F03D17/00Monitoring or testing of wind motors, e.g. diagnostics
    • F03D17/027Monitoring or testing of wind motors, e.g. diagnostics characterised by the component being monitored or tested
    • F03D17/036Generators

Landscapes

  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Sustainable Development (AREA)
  • Sustainable Energy (AREA)
  • Chemical & Material Sciences (AREA)
  • Combustion & Propulsion (AREA)
  • Mechanical Engineering (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Theoretical Computer Science (AREA)
  • Wind Motors (AREA)

Abstract

本发明涉及新能源设备状态评估技术领域,公开了一种风电机的在线状态评估方法、装置、设备及存储介质,方法包括通过设备,实时采集风电机的运行数据,对采集的运行数据进行异常值检测处理;通过缓解算法Relief,从异常值检测处理后的运行数据中提取目标状态特征,根据特征的相关性指标,对提取到的目标状态特征进行选择;利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;利用建立好的在线状态评估模型,对实时采集的数据进行预测和判断,评估风电机的在线状态;装置包括数据处理模块、模型构建模块、状态评估模块。本发明提高风电机在线状态评估的准确性和可靠性,实现对风电机运行状态的实时监测和预测。

The present invention relates to the technical field of state assessment of new energy equipment, and discloses an online state assessment method, device, equipment and storage medium for a wind turbine. The method includes collecting the operation data of the wind turbine in real time through the equipment, and performing abnormal value detection processing on the collected operation data; extracting target state features from the operation data after abnormal value detection processing through a relief algorithm Relief, and selecting the extracted target state features according to the correlation index of the features; using the operation data containing the target state features for training to establish an online state assessment model for the wind turbine; using the established online state assessment model to predict and judge the real-time collected data, and evaluate the online state of the wind turbine; the device includes a data processing module, a model building module, and a state assessment module. The present invention improves the accuracy and reliability of online state assessment of the wind turbine, and realizes real-time monitoring and prediction of the operation state of the wind turbine.

Description

一种风电机的在线状态评估方法、装置、设备及存储介质A method, device, equipment and storage medium for online status assessment of wind turbine

技术领域Technical Field

本发明涉及新能源设备状态评估技术领域,尤其涉及一种风电机的在线状态评估方法、装置、设备及存储介质。The present invention relates to the technical field of new energy equipment status assessment, and in particular to an online status assessment method, device, equipment and storage medium for a wind turbine.

背景技术Background Art

风能作为技术成熟、绿色无污染的可再生能源,已经成为解决能源污染问题不可或缺的力量。风电行业的快速发展以及风电机组的广泛安装给风电机组安全稳定运行带来了巨大的挑战。例如,由于受风力资源分布的影响,风电机组大多安装在山区、海口、海岛等风口处,长期工作在严寒酷暑、极端温差、沙尘暴、雷暴雨以及无规律变向的风力等恶劣环境下,导致风电机组故障频发,致使风电机组安全稳定运行受到严重影响;再者风电机组是一个复杂、非线性、强耦合的控制系统,机组的各个部件之间相互耦合,任何部件出现微小的故障,在相对滞后的精准检测技术下将会持续恶化,最终造成致命性的故障,给风电机组的维护和检修工作提高了难度;此外,风电机组大多数位于偏远地区,从发生故障到恢复正常运行往往需要支付巨额运营成本还需要承担停机造成的经济损失;据文献指出,我国针对风电机组研发能力以及运营维护技术均有不足,现有运行的风电机组将逐步达到质保期,促使风电机组的运营成本在总支出中占据较高的比例。风电机组控制系统故障率最高,其次是叶片、变桨系统等,而平均排查故障耗时最多的是叶片,其次是齿轮箱和发电机。因此针对故障排查时间长且故障率高的风机叶片进行故障检测和预警研究,检测出故障的发生并发出预警,使运营维护人员能够及时地做出反应,避免带来更大的经济损失。但是风电机组叶片故障检测与预警面临着以下几个难点:1)由于风电机组在正常工作状况下SCADA系统的监测数据中蕴含着大量的噪声信息并且存在数据的部分丢失,SCADA系统采集的数据具有不连续性,对故障检测结果造成重要影响;2)风电机组是一个复杂、非线性、强耦合的控制系统,各个状态参数之间相互耦合,具有极其复杂的相关性,存在大量与故障无关的特征。若不加以特征选取,会影响模型的泛化能力,进一步影响故障检测与预警的准确性;3)在实际运行中,由于环境干扰和运行方式的变化,风电机组故障模式也会发生变化,因此叶片开裂故障检测模型需要具有一定的泛化能力。因此,如何利用风电机组SCADA系统历史数据本身价值,挖掘各个状态参数与风电机组叶片开裂故障之间的关系,实现风电机组运行过程中叶片开裂故障检测与预警,对于优化风电机组运维策略、促进电力系统安全稳定运行以及提升风力发电的经济效应具有重要影响。As a green, pollution-free, and renewable energy source with mature technology, wind energy has become an indispensable force in solving energy pollution problems. The rapid development of the wind power industry and the widespread installation of wind turbines have brought huge challenges to the safe and stable operation of wind turbines. For example, due to the influence of wind resource distribution, wind turbines are mostly installed in windy places such as mountainous areas, seaports, and islands. They work in harsh environments such as severe cold and heat, extreme temperature differences, sandstorms, thunderstorms, and irregular wind changes for a long time, resulting in frequent failures of wind turbines, which seriously affects the safe and stable operation of wind turbines. In addition, wind turbines are complex, nonlinear, and strongly coupled control systems. The various components of the unit are coupled with each other. If any component has a minor failure, it will continue to deteriorate under the relatively lagging precise detection technology, and eventually cause a fatal failure, which increases the difficulty of wind turbine maintenance and overhaul. In addition, most wind turbines are located in remote areas, and it is often necessary to pay huge operating costs from the occurrence of failure to the restoration of normal operation, and also to bear the economic losses caused by downtime. According to the literature, my country's research and development capabilities and operation and maintenance technologies for wind turbines are insufficient, and the existing wind turbines in operation will gradually reach the warranty period, causing the operating costs of wind turbines to account for a higher proportion of total expenditures. The wind turbine control system has the highest failure rate, followed by blades, pitch systems, etc., and the average troubleshooting time is the longest for blades, followed by gearboxes and generators. Therefore, fault detection and early warning research is carried out for wind turbine blades with long troubleshooting time and high failure rate, to detect the occurrence of faults and issue early warnings, so that operation and maintenance personnel can respond in time to avoid greater economic losses. However, wind turbine blade fault detection and early warning face the following difficulties: 1) Since the monitoring data of the SCADA system of the wind turbine under normal working conditions contains a lot of noise information and there is partial data loss, the data collected by the SCADA system is discontinuous, which has an important impact on the fault detection results; 2) The wind turbine is a complex, nonlinear, and strongly coupled control system. The various state parameters are coupled with each other, have extremely complex correlations, and have a large number of features that are not related to the fault. If feature selection is not performed, the generalization ability of the model will be affected, and the accuracy of fault detection and early warning will be further affected; 3) In actual operation, due to environmental interference and changes in operation mode, the fault mode of the wind turbine will also change, so the blade cracking fault detection model needs to have a certain generalization ability. Therefore, how to utilize the value of the historical data of the wind turbine SCADA system itself, explore the relationship between various state parameters and wind turbine blade cracking failures, and realize blade cracking fault detection and early warning during wind turbine operation has an important impact on optimizing wind turbine operation and maintenance strategies, promoting safe and stable operation of power systems, and improving the economic effects of wind power generation.

现有技术一,申请号:CN202211548233.6公开了阈值风电机组的性能评估与能效诊断方法及系统,方法包括:基于待诊断的目标风电机组的有功功率,对目标风电机组的发电性能进行在线评估;结合在线评估的结果构建目标风电机组的能效状态指标体系,并确定每个能效状态指标的基准值;针对能效状态指标体系构建风电机组能效诊断本体知识库;获取目标风电机组的实时运行数据,根据实时运行数据和所述基准值进行能效异常识别,并基于能效异常识别的结果通过所述能效诊断本体知识库进行能效故障诊断。虽然能够对风电机组的发电性能进行实时评估,并能准确诊断出能效故障模式和能效故障原因,提高了能效诊断的准确性和针对性。但是构建知识库需要的数据较多,一定程度上增加了数据处理时间,使得故障诊断的时效性较差。Prior art 1, application number: CN202211548233.6 discloses a performance evaluation and energy efficiency diagnosis method and system for a threshold wind turbine, the method comprising: based on the active power of the target wind turbine to be diagnosed, online evaluation of the power generation performance of the target wind turbine; combining the results of the online evaluation to construct an energy efficiency status index system for the target wind turbine, and determining the benchmark value of each energy efficiency status index; constructing a wind turbine energy efficiency diagnosis ontology knowledge base for the energy efficiency status index system; obtaining real-time operation data of the target wind turbine, identifying energy efficiency anomalies according to the real-time operation data and the benchmark value, and performing energy efficiency fault diagnosis based on the energy efficiency anomaly identification result through the energy efficiency diagnosis ontology knowledge base. Although it is possible to evaluate the power generation performance of the wind turbine in real time, and accurately diagnose the energy efficiency fault mode and energy efficiency fault cause, the accuracy and pertinence of energy efficiency diagnosis are improved. However, the construction of the knowledge base requires a lot of data, which increases the data processing time to a certain extent, making the timeliness of fault diagnosis poor.

现有技术二,申请号:CN202111313388.7公开了一种面向大型风电机组实时出力性能的评估方法,基于风电机组正常运行状态下数据采集与监视控制系统记录数据集,设计利用滑窗技术的离群点检测方法,并设计带有时间信息提取的特征提取模型,选取XGBoost模型拟合风能利用系数与输入特征的等效数学模型,并统计训练集中超限程度,用于在线应用时出力性能的实时评估。在实时出力性能评估方法中,对每个数据点能够进行多次判断,并统计其离群程度进行离群点检测,从而获取更鲁棒、更灵活的检测结果;在特征提取中考虑变量时间依赖性,虽然解决了机器学习XGBoost模型没有利用时间信息的问题,基于训练集超限程度作为在线实时评估依据,避免了人为主观因素的干扰,保证了评估的准确性。但是功能较为单一,仅能对实时处理进行评估,无法对风电机的在线状态进行全方位的评估,导致评估结果不准确。Prior art 2, application number: CN202111313388.7 discloses a method for evaluating the real-time output performance of large wind turbines. Based on the data set recorded by the data acquisition and monitoring control system under normal operation of the wind turbine, an outlier detection method using sliding window technology is designed, and a feature extraction model with time information extraction is designed. The XGBoost model is selected to fit the equivalent mathematical model of the wind energy utilization coefficient and the input feature, and the degree of overrun in the training set is statistically analyzed for real-time evaluation of output performance during online application. In the real-time output performance evaluation method, each data point can be judged multiple times, and its degree of outlier is statistically analyzed for outlier detection, so as to obtain more robust and flexible detection results; the time dependency of variables is considered in feature extraction, although the problem that the machine learning XGBoost model does not use time information is solved, the degree of overrun in the training set is used as the basis for online real-time evaluation, which avoids the interference of human subjective factors and ensures the accuracy of the evaluation. However, the function is relatively single, and it can only evaluate real-time processing, and cannot comprehensively evaluate the online status of the wind turbine, resulting in inaccurate evaluation results.

现有技术三,申请号:CN202111589047.2公开了一种基于改进栈式自编码的风电机组发电机健康评估方法,包括以下步骤:获取训练数据集:对风电机组发电机运行数据进行清洗后再进行线性归一化处理,得到有效的训练数据和测试数据,然后对栈式自编码模型进行训练、测试;构建多个栈式自编码器模型;对每个栈式自编码器模型进行训练;集成提取训练数据集的深度特征;将训练好的基准模型作为发电机在线状态检测器,并将册数数据输入到基准模型中,得到每个时间段内风电机组发电机的健康度并输出。虽然保证健康评估结果客观性的前提下提高了准确性,更加灵敏直观、能够再故障发生前检测出故障趋势。但是依赖于所有数据点,使得模型对于噪声和异常值的影响较大。Prior art three, application number: CN202111589047.2 discloses a health assessment method for wind turbine generators based on improved stacked autoencoders, including the following steps: obtaining a training data set: cleaning the wind turbine generator operating data and then performing linear normalization processing to obtain effective training data and test data, and then training and testing the stacked autoencoder model; constructing multiple stacked autoencoder models; training each stacked autoencoder model; integrating and extracting the deep features of the training data set; using the trained benchmark model as the generator online status detector, and inputting the volume data into the benchmark model to obtain and output the health of the wind turbine generator in each time period. Although the accuracy is improved under the premise of ensuring the objectivity of the health assessment results, it is more sensitive and intuitive, and can detect fault trends before the fault occurs. However, it depends on all data points, which makes the model more affected by noise and outliers.

目前现有技术一、现有技术二及现有技术三存在评估系统的智能化程度较低,对基础数据采集精度较差的问题。因而,本发明提供一种风电机的在线状态评估方法、装置、设备及存储介质,支持风机实时信息采集,提高设备运行可靠性。At present, the existing technologies 1, 2 and 3 have the problem that the intelligence level of the evaluation system is low and the basic data collection accuracy is poor. Therefore, the present invention provides an online status evaluation method, device, equipment and storage medium for a wind turbine, which supports real-time information collection of the wind turbine and improves the reliability of equipment operation.

发明内容Summary of the invention

本发明的主要目的在于提供一种风电机的在线状态评估方法、装置、设备及存储介质,以解决现有技术中评估系统的智能化程度较低,对基础数据采集精度较差的问题。The main purpose of the present invention is to provide a method, device, equipment and storage medium for online status evaluation of a wind turbine, so as to solve the problems in the prior art that the evaluation system has a low intelligence level and poor basic data collection accuracy.

为实现上述目的,本发明提供如下技术方案:To achieve the above object, the present invention provides the following technical solutions:

一种风电机的在线状态评估方法,所述风电机的在线状态评估方法包括:A method for evaluating the online status of a wind turbine, comprising:

通过传感器设备,实时采集风电机的振动数据、温度数据、电流数据及功率数据运行数据,对采集到的运行数据进行异常值检测处理;Through sensor equipment, the vibration data, temperature data, current data and power data of the wind turbine are collected in real time, and abnormal value detection and processing are performed on the collected operation data;

从异常值检测处理后的运行数据中提取目标状态特征,目标状态特征用于描述风电机的运行状态;对提取到的目标状态特征进行选择;利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;Extract target state features from the operating data after outlier detection processing, the target state features are used to describe the operating state of the wind turbine; select the extracted target state features; use the operating data containing the target state features for training to establish an online state assessment model for the wind turbine;

通过交叉验证,对建立的在线状态评估模型进行评估,利用建立好的在线状态评估模型,对实时采集的数据进行预测和判断,评估风电机的在线状态;将评估结果可视化展示,并根据评估结果进行相应的操作和维护。Through cross-validation, the established online status assessment model is evaluated. The established online status assessment model is used to predict and judge the real-time collected data to evaluate the online status of the wind turbine. The assessment results are visualized and corresponding operations and maintenance are performed based on the assessment results.

作为本发明的进一步改进,对采集到的运行数据进行异常值检测处理的过程,包括:As a further improvement of the present invention, the process of performing outlier detection processing on the collected operating data includes:

确定运行数据窗口的大小,从运行数据序列的开头开始,依次将窗口内的数据点放入一个数组中,对窗口内的数据点进行排序,并取其中间位置的值作为中位数;如果窗口大小为奇数,则中位数为排序后的中间值;如果窗口大小为偶数,则中位数为排序后中间两个值的平均值;Determine the size of the running data window, start from the beginning of the running data sequence, put the data points in the window into an array, sort the data points in the window, and take the value in the middle position as the median; if the window size is an odd number, the median is the middle value after sorting; if the window size is an even number, the median is the average of the two middle values after sorting;

将当前窗口的中位数作为当前数据点的值,替换原始运行数据序列中的对应位置的值;将窗口向后移动一个位置,直到遍历完所有数据点,得到去噪后的运行数据;The median of the current window is used as the value of the current data point to replace the value of the corresponding position in the original running data sequence; the window is moved backward by one position until all data points are traversed to obtain the denoised running data;

使用统计学方法,对去噪后的运行数据进行描述性统计分析,检测是否存在与其他运行数据明显偏离的异常值;使用箱线图,通过判断数据点与其它数据点的偏离程度,识别异常值,将被识别为异常值的数据点从数据集中删除。Use statistical methods to perform descriptive statistical analysis on the denoised operating data to detect whether there are outliers that deviate significantly from other operating data; use box plots to identify outliers by judging the degree of deviation between data points and other data points, and delete data points identified as outliers from the data set.

作为本发明的进一步改进,识别异常值的过程,包括:As a further improvement of the present invention, the process of identifying outliers includes:

计算去噪后的运行数据的均值和标准差,分别表示数据的中心趋势和离散程度;绘制箱线图显示运行数据的分布情况,箱线图包括上边缘、上四分位数、中位数、下四分位数和下边缘,异常值定义为超出上下四分位数1.5倍的四分位距的数据点;Calculate the mean and standard deviation of the denoised running data, which represent the central tendency and dispersion of the data respectively; draw a box plot to show the distribution of the running data. The box plot includes the upper edge, upper quartile, median, lower quartile and lower edge. The outlier is defined as a data point that exceeds the interquartile range of 1.5 times the upper and lower quartiles.

计算每个数据点的Z-score值,表示该数据点与平均值之间的偏离程度,公式为:Z=(X-μ)/σ,其中X为数据点,μ为均值,σ为标准差,Z-score大于3或小于-3的数据点被视为异常值;Calculate the Z-score value of each data point, which indicates the degree of deviation between the data point and the mean value. The formula is: Z = (X-μ)/σ, where X is the data point, μ is the mean, and σ is the standard deviation. Data points with a Z-score greater than 3 or less than -3 are considered outliers.

进行Grubbs’Test识别单个异常值,通过计算每个数据点与均值之间的差异,找出与其他数据点明显偏离的数据点。Grubbs’ Test is performed to identify single outliers. The difference between each data point and the mean is calculated to find data points that deviate significantly from other data points.

作为本发明的进一步改进,在线状态评估模型的构建过程,包括:As a further improvement of the present invention, the construction process of the online status evaluation model includes:

使用缓解算法Relief从经过异常值检测处理的运行数据中提取目标状态特征,计算特征与目标状态之间的相关性来评估特征的重要性;Use the relief algorithm Relief to extract target state features from the running data processed by outlier detection, and calculate the correlation between the features and the target state to evaluate the importance of the features;

根据相关性指标对提取到的目标状态特征进行选择,相关性指标确定哪些特征对于描述风电机的运行状态最为重要;The extracted target state features are selected according to the correlation index, which determines which features are most important for describing the operating state of the wind turbine;

利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;在训练过程中,使用机器学习算法构建模型,以根据目标状态特征对风电机的在线状态进行评估。The online state assessment model of the wind turbine is established by training with the operating data containing the target state characteristics. During the training process, the model is constructed using the machine learning algorithm to assess the online state of the wind turbine according to the target state characteristics.

作为本发明的进一步改进,缓解算法Relief地过程,包括:As a further improvement of the present invention, the process of the relief algorithm Relief includes:

初始化特征权重,对于每个特征,初始化其权重为0;随机选择一个样本作为参考样本;根据欧氏距离计算其他样本与参考样本之间的相似度,并根据相似度对其他样本进行排序;Initialize feature weights. For each feature, initialize its weight to 0. Randomly select a sample as a reference sample. Calculate the similarity between other samples and the reference sample based on the Euclidean distance, and sort other samples according to the similarity.

对于每个特征,计算参考样本与最近邻样本之间的差异值。如果最近邻样本属于目标状态类别,则增加特征权重;如果最近邻样本不属于目标状态类别,则减少特征权重;For each feature, calculate the difference between the reference sample and the nearest neighbor sample. If the nearest neighbor sample belongs to the target state category, increase the feature weight; if the nearest neighbor sample does not belong to the target state category, decrease the feature weight;

根据差异值和样本权重更新特征权重,重复直到遍历所有样本;根据特征权重进行特征选择,选择达到权重阈值的特征作为目标状态特征。Update the feature weight according to the difference value and sample weight, and repeat until all samples are traversed; perform feature selection based on the feature weight, and select the features that reach the weight threshold as the target state features.

作为本发明的进一步改进,根据相关性指标对提取到的目标状态特征进行选择的过程,包括:As a further improvement of the present invention, the process of selecting the extracted target state features according to the correlation index includes:

计算特征与目标状态之间的相关性,对于每个特征,通过互信息计算其与目标状态之间的相关性指标;Calculate the correlation between the features and the target state. For each feature, calculate the correlation index between it and the target state through mutual information;

根据互信息的值,确定哪些特征对于描述风电机的运行状态最为重要,设置一个互信息的阈值,只选择互信息值超过阈值的特征作为目标状态特征;互信息值越大表示特征与目标状态之间的相关性越高,被认为是重要特征;According to the mutual information value, determine which features are most important for describing the operating status of the wind turbine, set a mutual information threshold, and only select features whose mutual information value exceeds the threshold as target state features; the larger the mutual information value, the higher the correlation between the feature and the target state, and it is considered to be an important feature;

根据相关性指标,按照相关性指标的大小进行排序,选择排名靠前的特征作为目标状态特征;According to the relevance index, sort according to the size of the relevance index and select the top-ranked features as the target state features;

互信息的计算过程,包括:The calculation process of mutual information includes:

计算特征和目标状态各自的概率分布,即特征的概率分布和目标状态的概率分布;计算特征和目标状态的联合概率分布,即特征和目标状态同时发生的概率;Calculate the probability distribution of each feature and target state, that is, the probability distribution of the feature and the probability distribution of the target state; calculate the joint probability distribution of the feature and the target state, that is, the probability of the feature and the target state occurring at the same time;

步根据边缘概率分布和联合概率分布,计算特征与目标状态之间的互信息,互信息的计算公式如下:Step 2: According to the marginal probability distribution and the joint probability distribution, the mutual information between the feature and the target state is calculated. The calculation formula of the mutual information is as follows:

I(X,Y)=∑P(x,y)*log(P(x,y)/(P(x)*P(y)))I(X,Y)=∑P(x,y)*log(P(x,y)/(P(x)*P(y)))

其中,X表示特征,Y表示目标状态,P(x,y)表示特征和目标状态同时发生的概率,P(x)和P(y)分别表示特征和目标状态的边缘概率分布;Where X represents the feature, Y represents the target state, P(x,y) represents the probability of the feature and the target state occurring at the same time, and P(x) and P(y) represent the marginal probability distribution of the feature and the target state respectively;

将互信息的计算结果与互信息的阈值进行对比,选择处互信息值超过阈值的特征作为目标状态特征。The calculated result of mutual information is compared with the threshold of mutual information, and the features whose mutual information value exceeds the threshold are selected as the target state features.

作为本发明的进一步改进,使用机器学习算法构建模型的过程,包括:As a further improvement of the present invention, the process of building a model using a machine learning algorithm includes:

对包含目标状态特征的运行数据进行归一化预处理,选择随机森林的机器学习算法进行建模;Normalize and preprocess the running data containing target state features, and select the random forest machine learning algorithm for modeling;

将预处理后的数据集划分为训练集和测试集,训练集用于在线状态评估模型的训练和参数调优,测试集用于评估在线状态评估模型的性能;The preprocessed data set is divided into a training set and a test set. The training set is used for training and parameter tuning of the online state assessment model, and the test set is used to evaluate the performance of the online state assessment model.

使用训练集对选定的机器学习算法进行训练,调整在线状态评估模型的参数;将训练集输入到机器学习算法中进行训练,训练集包含经过预处理的运行数据和对应的目标状态;Using the training set to train the selected machine learning algorithm, and adjusting the parameters of the online state assessment model; inputting the training set into the machine learning algorithm for training, the training set including the preprocessed operating data and the corresponding target state;

在训练过程中,机器学习算法根据输入的特征和目标状态之间的关系来学习在线状态评估模型的参数;根据训练数据的特征和目标状态的对应关系调整在线状态评估模型的权重和偏置;During the training process, the machine learning algorithm learns the parameters of the online state assessment model based on the relationship between the input features and the target state; the weights and biases of the online state assessment model are adjusted based on the corresponding relationship between the features of the training data and the target state;

训练过程中,进行调整决策树的深度、随机森林中树的数量等参数调优,训练完成后,得到了训练好的在线状态评估模型;During the training process, the depth of the decision tree, the number of trees in the random forest and other parameters are adjusted. After the training is completed, a trained online state evaluation model is obtained;

其中,通过尝试不同的参数取值,计算相应的损失函数值,并选择使得损失函数最小的参数组合作为最终的模型参数设置。Among them, by trying different parameter values, the corresponding loss function value is calculated, and the parameter combination that minimizes the loss function is selected as the final model parameter setting.

为实现上述目的,本发明还提供了如下技术方案:To achieve the above object, the present invention also provides the following technical solutions:

一种风电机的在线状态评估装置,其应用于所述的风电机的在线状态评估方法,所述风电机的在线状态评估装置包括:An online state assessment device for a wind turbine, which is applied to the online state assessment method for a wind turbine, comprises:

数据处理模块,用于通过传感器等设备,实时采集风电机的振动数据、温度数据、电流数据及功率数据运行数据,对采集到的运行数据进行异常值检测处理;The data processing module is used to collect the vibration data, temperature data, current data and power data of the wind turbine in real time through sensors and other equipment, and perform abnormal value detection and processing on the collected operating data;

模型构建模块,用于通过缓解算法Relief,从异常值检测处理后的运行数据中提取目标状态特征,目标状态特征用于描述风电机的运行状态;根据特征的相关性指标,对提取到的目标状态特征进行选择;The model building module is used to extract target state features from the operating data after outlier detection processing through the relief algorithm Relief. The target state features are used to describe the operating state of the wind turbine. The extracted target state features are selected according to the correlation index of the features.

状态评估模块,用于利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;通过交叉验证,对建立的在线状态评估模型进行评估,利用建立好的在线状态评估模型,对实时采集的数据进行预测和判断,评估风电机的在线状态;将评估结果可视化展示,并根据评估结果进行相应的操作和维护。The status assessment module is used to use the operating data containing the target status characteristics for training to establish an online status assessment model for the wind turbine; evaluate the established online status assessment model through cross-validation, use the established online status assessment model to predict and judge the real-time collected data, and evaluate the online status of the wind turbine; visualize the evaluation results, and perform corresponding operations and maintenance based on the evaluation results.

为实现上述目的,本发明还提供了如下技术方案:To achieve the above object, the present invention also provides the following technical solutions:

一种电子设备,其特征在于,包括处理器、以及与所述处理器耦接的存储器,所述存储器存储有可被所述处理器执行的程序指令;所述处理器执行所述存储器存储的所述程序指令时实现所述的风电机的在线状态评估方法。An electronic device, characterized in that it includes a processor and a memory coupled to the processor, wherein the memory stores program instructions executable by the processor; when the processor executes the program instructions stored in the memory, the online status assessment method of the wind turbine is implemented.

为实现上述目的,本发明还提供了如下技术方案:To achieve the above object, the present invention also provides the following technical solutions:

一种存储介质,其特征在于,所述存储介质内存储有程序指令,所述程序指令被处理器执行时实现能够实现所述的风电机的在线状态评估方法。A storage medium, characterized in that program instructions are stored in the storage medium, and when the program instructions are executed by a processor, the online status assessment method of the wind turbine can be implemented.

本发明通过实时采集风电机的振动数据、温度数据、电流数据及功率数据等运行数据,并进行异常值检测处理,可以排除异常数据的干扰,确保后续分析和评估的准确性,可以提高风电机在线状态评估的可靠性和精度。通过缓解算法Relief,从异常值检测处理后的运行数据中提取目标状态特征,可以识别出对风电机运行状态具有重要影响的特征;通过特征的相关性指标选择出最具代表性的特征,可以减少冗余信息并提高模型训练的效率;建立风电机的在线状态评估模型,可以实现对风电机运行状态的准确预测和判断。通过交叉验证对建立的在线状态评估模型进行评估,可以验证模型的性能和准确度,利用建立好的在线状态评估模型对实时采集的数据进行预测和判断,可以实时监测风电机的运行状态,及时发现潜在问题和故障,进行相应的操作和维护;将评估结果可视化展示,可以直观地了解风电机的运行状态,提供决策依据和指导,提高风电机的运维效率和可靠性。The present invention collects the vibration data, temperature data, current data, power data and other operating data of the wind turbine in real time, and performs abnormal value detection processing, so as to eliminate the interference of abnormal data, ensure the accuracy of subsequent analysis and evaluation, and improve the reliability and accuracy of the online state evaluation of the wind turbine. Through the relief algorithm Relief, the target state features are extracted from the operating data after the abnormal value detection processing, and the features that have an important impact on the operating state of the wind turbine can be identified; the most representative features are selected through the correlation index of the features, which can reduce redundant information and improve the efficiency of model training; the online state evaluation model of the wind turbine is established, and the accurate prediction and judgment of the operating state of the wind turbine can be achieved. The established online state evaluation model is evaluated through cross-validation, and the performance and accuracy of the model can be verified. The established online state evaluation model is used to predict and judge the real-time collected data, and the operating state of the wind turbine can be monitored in real time, potential problems and faults can be discovered in time, and corresponding operations and maintenance can be performed; the evaluation results are visualized, and the operating state of the wind turbine can be intuitively understood, and a decision-making basis and guidance can be provided, so as to improve the operation and maintenance efficiency and reliability of the wind turbine.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本发明风电机的在线状态评估方法一个实施例的步骤流程示意图;FIG1 is a schematic diagram of a step flow chart of an embodiment of an online state assessment method for a wind turbine according to the present invention;

图2为本发明风电机的在线状态评估方法一个实施例对采集到的运行数据进行异常值检测处理的步骤流程示意图;FIG2 is a schematic flow chart of the steps of performing abnormal value detection processing on the collected operating data according to an embodiment of the online state assessment method of a wind turbine according to the present invention;

图3为本发明风电机的在线状态评估方法一个实施例在线状态评估模型的构建的步骤流程示意图;3 is a schematic flow chart of steps for constructing an online status assessment model of an embodiment of an online status assessment method for a wind turbine according to the present invention;

图4为本发明风电机的在线状态评估方法一个实施例将评估结果可视化展示的步骤流程示意图;FIG4 is a schematic flow chart of steps for visually displaying evaluation results according to an embodiment of an online status evaluation method for a wind turbine according to the present invention;

图5为本发明风电机的在线状态评估装置一个实施例的功能模块示意图;FIG5 is a schematic diagram of functional modules of an embodiment of an online status assessment device for a wind turbine according to the present invention;

图6为本发明电子设备一个实施例的结构示意图;FIG6 is a schematic structural diagram of an electronic device according to an embodiment of the present invention;

图7为本发明存储介质一个实施例的结构示意图。FIG. 7 is a schematic diagram of the structure of a storage medium according to an embodiment of the present invention.

具体实施方式DETAILED DESCRIPTION

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅是本发明的一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following will be combined with the drawings in the embodiments of the present invention to clearly and completely describe the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.

本发明中的术语“第一”“第二”“第三”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”“第二”“第三”的特征可以明示或者隐含地包括至少一个该特征。本发明的描述中,“多个”的含义是至少两个,例如两个,三个等,除非另有明确具体的限定。本发明实施例中所有方向性指示(诸如上、下、左、右、前、后……)仅用于解释在某一特定姿态(如附图所示)下各部件之间的相对位置关系、运动情况等,如果该特定姿态发生改变时,则该方向性指示也相应地随之改变。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second" and "third" in the present invention are only used for descriptive purposes and cannot be understood as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Therefore, the features defined as "first", "second" and "third" can explicitly or implicitly include at least one of the features. In the description of the present invention, the meaning of "multiple" is at least two, such as two, three, etc., unless otherwise clearly and specifically defined. All directional indications (such as up, down, left, right, front, back...) in the embodiments of the present invention are only used to explain the relative position relationship, movement, etc. between the components under a certain specific posture (as shown in the accompanying drawings). If the specific posture changes, the directional indication also changes accordingly. In addition, the terms "including" and "having" and any of their variations are intended to cover non-exclusive inclusions. For example, a process, method, system, product or device that includes a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units that are not listed, or optionally also includes other steps or units inherent to these processes, methods, products or devices.

在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本发明的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其他实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其他实施例相结合。Reference to "embodiments" herein means that a particular feature, structure, or characteristic described in conjunction with the embodiments may be included in at least one embodiment of the present invention. The appearance of the phrase in various places in the specification does not necessarily refer to the same embodiment, nor is it an independent or alternative embodiment that is mutually exclusive with other embodiments. It is explicitly and implicitly understood by those skilled in the art that the embodiments described herein may be combined with other embodiments.

如图1所示,本实施例提供了风电机的在线状态评估方法的一个实施例,在本实施例中,该风电机的在线状态评估方法具体包括以下步骤:As shown in FIG. 1 , this embodiment provides an embodiment of an online state assessment method for a wind turbine. In this embodiment, the online state assessment method for a wind turbine specifically includes the following steps:

步骤S1:通过传感器等设备,实时采集风电机的振动数据、温度数据、电流数据及功率数据等运行数据,对采集到的运行数据进行异常值检测处理;Step S1: using sensors and other equipment to collect vibration data, temperature data, current data, power data and other operating data of the wind turbine in real time, and performing abnormal value detection processing on the collected operating data;

步骤S2:从异常值检测处理后的运行数据中提取目标状态特征,目标状态特征用于描述风电机的运行状态;对提取到的目标状态特征进行选择;利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;Step S2: extracting target state features from the operating data after outlier detection processing, the target state features are used to describe the operating state of the wind turbine; selecting the extracted target state features; using the operating data containing the target state features for training to establish an online state assessment model for the wind turbine;

步骤S3:通过交叉验证,对建立的在线状态评估模型进行评估,利用建立好的在线状态评估模型,对实时采集的数据进行预测和判断,评估风电机的在线状态;将评估结果可视化展示,并根据评估结果进行相应的操作和维护。Step S3: Evaluate the established online status evaluation model through cross-validation, use the established online status evaluation model to predict and judge the real-time collected data, and evaluate the online status of the wind turbine; visualize the evaluation results, and perform corresponding operations and maintenance based on the evaluation results.

优选地,本实施例的步骤S1通过实时采集风电机的振动数据、温度数据、电流数据及功率数据等运行数据,并进行异常值检测处理,可以排除异常数据的干扰,确保后续分析和评估的准确性,可以提高风电机在线状态评估的可靠性和精度。步骤S2通过缓解算法Relief,从异常值检测处理后的运行数据中提取目标状态特征,可以识别出对风电机运行状态具有重要影响的特征;通过特征的相关性指标选择出最具代表性的特征,可以减少冗余信息并提高模型训练的效率;建立风电机的在线状态评估模型,可以实现对风电机运行状态的准确预测和判断。步骤S3通过交叉验证对建立的在线状态评估模型进行评估,可以验证模型的性能和准确度,利用建立好的在线状态评估模型对实时采集的数据进行预测和判断,可以实时监测风电机的运行状态,及时发现潜在问题和故障,进行相应的操作和维护;将评估结果可视化展示,可以直观地了解风电机的运行状态,提供决策依据和指导,提高风电机的运维效率和可靠性。Preferably, step S1 of this embodiment collects the vibration data, temperature data, current data, power data and other operating data of the wind turbine in real time, and performs abnormal value detection processing, which can eliminate the interference of abnormal data, ensure the accuracy of subsequent analysis and evaluation, and improve the reliability and accuracy of the online state evaluation of the wind turbine. Step S2 extracts the target state features from the operating data after abnormal value detection processing through the relief algorithm Relief, and can identify the features that have an important impact on the operating state of the wind turbine; selects the most representative features through the correlation index of the features, which can reduce redundant information and improve the efficiency of model training; establishes an online state evaluation model for the wind turbine, and can realize accurate prediction and judgment of the operating state of the wind turbine. Step S3 evaluates the established online state evaluation model through cross-validation, which can verify the performance and accuracy of the model, and uses the established online state evaluation model to predict and judge the real-time collected data, so as to monitor the operating state of the wind turbine in real time, timely discover potential problems and faults, and perform corresponding operations and maintenance; the evaluation results are visualized, so as to intuitively understand the operating state of the wind turbine, provide decision-making basis and guidance, and improve the operation and maintenance efficiency and reliability of the wind turbine.

综上所述,本实施例提高风电机在线状态评估的准确性和可靠性,实现对风电机运行状态的实时监测和预测,及时发现和解决潜在问题和故障,提高风电机的运维效率和可靠性;保障风电机的安全运行,降低故障率和维修成本,提高风电发电效率,推动可再生能源的可持续发展。In summary, this embodiment improves the accuracy and reliability of online status assessment of wind turbines, realizes real-time monitoring and prediction of the operating status of wind turbines, promptly discovers and resolves potential problems and faults, and improves the operation and maintenance efficiency and reliability of wind turbines; ensures the safe operation of wind turbines, reduces failure rates and maintenance costs, improves wind power generation efficiency, and promotes the sustainable development of renewable energy.

进一步地,如图2所示,步骤S1中对采集到的运行数据进行异常值检测处理的过程具体包括以下步骤:Further, as shown in FIG2 , the process of performing outlier detection processing on the collected operating data in step S1 specifically includes the following steps:

步骤S11:确定运行数据窗口的大小,从运行数据序列的开头开始,依次将窗口内的数据点放入一个数组中,对窗口内的数据点进行排序,并取其中间位置的值作为中位数;如果窗口大小为奇数,则中位数为排序后的中间值;如果窗口大小为偶数,则中位数为排序后中间两个值的平均值;Step S11: Determine the size of the running data window, start from the beginning of the running data sequence, put the data points in the window into an array in turn, sort the data points in the window, and take the value in the middle position as the median; if the window size is an odd number, the median is the middle value after sorting; if the window size is an even number, the median is the average of the two middle values after sorting;

步骤S12:将当前窗口的中位数作为当前数据点的值,替换原始运行数据序列中的对应位置的值;将窗口向后移动一个位置,直到遍历完所有数据点,得到去噪后的运行数据;Step S12: taking the median of the current window as the value of the current data point, replacing the value of the corresponding position in the original running data sequence; moving the window backward by one position until all data points are traversed to obtain the denoised running data;

步骤S13:使用统计学方法,对去噪后的运行数据进行描述性统计分析,检测是否存在与其他运行数据明显偏离的异常值;使用箱线图,通过判断数据点与其它数据点的偏离程度,识别异常值,将被识别为异常值的数据点从数据集中删除。Step S13: Use statistical methods to perform descriptive statistical analysis on the denoised operating data to detect whether there are outliers that are significantly deviated from other operating data; use box plots to identify outliers by judging the degree of deviation between data points and other data points, and delete data points identified as outliers from the data set.

优选地,本实施例的步骤S11确定运行数据窗口的大小,并对窗口内的数据点进行排序,取中位数;通过中位数的计算,平滑数据并消除噪声;减少数据中的随机噪声对后续分析的影响,使数据更加可靠和稳定。步骤S12将当前窗口的中位数作为当前数据点的值,替换原始运行数据序列中的对应位置的值;将去噪后的中位数值应用于原始数据,进一步平滑数据并减少噪声的影响,得到更加平滑和可靠的运行数据,为后续的异常值检测提供更好的数据基础。步骤S13使用统计学方法,如箱线图,对去噪后的运行数据进行描述性统计分析,并检测是否存在与其他数据明显偏离的异常值;通过统计分析方法检测异常值,即识别与其他数据明显不同的数据点,识别和排除异常值,以保证后续的数据分析和决策的准确性和可靠性;通过删除异常值,可以减少异常数据对整体数据分布和分析结果的影响,提高数据的质量和准确性。Preferably, step S11 of this embodiment determines the size of the operation data window, sorts the data points in the window, and takes the median; by calculating the median, smoothes the data and eliminates noise; reduces the impact of random noise in the data on subsequent analysis, making the data more reliable and stable. Step S12 uses the median of the current window as the value of the current data point to replace the value of the corresponding position in the original operation data sequence; applies the denoised median value to the original data, further smoothes the data and reduces the impact of noise, obtains smoother and more reliable operation data, and provides a better data basis for subsequent outlier detection. Step S13 uses statistical methods, such as box plots, to perform descriptive statistical analysis on the denoised operation data, and detects whether there are outliers that are significantly deviated from other data; detects outliers by statistical analysis methods, that is, identifies data points that are significantly different from other data, identifies and excludes outliers to ensure the accuracy and reliability of subsequent data analysis and decision-making; by deleting outliers, the impact of abnormal data on the overall data distribution and analysis results can be reduced, and the quality and accuracy of the data can be improved.

进一步地,步骤S13中识别异常值的过程具体包括以下步骤:Furthermore, the process of identifying outliers in step S13 specifically includes the following steps:

步骤S131:计算去噪后的运行数据的均值和标准差,分别表示数据的中心趋势和离散程度;绘制箱线图显示运行数据的分布情况,箱线图包括上边缘、上四分位数、中位数、下四分位数和下边缘,异常值定义为超出上下四分位数1.5倍的四分位距的数据点;Step S131: Calculate the mean and standard deviation of the denoised operating data, which respectively represent the central tendency and dispersion of the data; draw a box plot to display the distribution of the operating data, where the box plot includes the upper edge, upper quartile, median, lower quartile and lower edge, and an outlier is defined as a data point that exceeds the interquartile range of 1.5 times the upper and lower quartiles;

步骤S132:计算每个数据点的Z-score值,表示该数据点与平均值之间的偏离程度,公式为:Z=(X-μ)/σ,其中X为数据点,μ为均值,σ为标准差,Z-score大于3或小于-3的数据点被视为异常值;Step S132: Calculate the Z-score value of each data point, which indicates the degree of deviation between the data point and the mean value. The formula is: Z = (X-μ)/σ, where X is the data point, μ is the mean value, and σ is the standard deviation. Data points with a Z-score greater than 3 or less than -3 are considered outliers.

步骤S133:进行Grubbs’Test识别单个异常值,通过计算每个数据点与均值之间的差异,找出与其他数据点明显偏离的数据点。Step S133: Perform Grubbs’ Test to identify single outliers, and find data points that deviate significantly from other data points by calculating the difference between each data point and the mean.

优选地,本实施例的步骤S131描述数据的中心趋势和离散程度,并直观地展示数据的分布情况;通过箱线图,可以识别出数据点是否偏离了正常的范围,超出了上下四分位数1.5倍的四分位距的数据点被定义为异常值;对数据进行可视化分析,以便直观地发现异常值。步骤S132计算Z-score。通过计算每个数据点与均值之间的偏离程度,用Z-score来衡量,如果Z-score大于3或小于-3,表示该数据点与平均值之间的偏差非常大,被视为异常值;通过统计方法识别出与平均值明显偏离的异常数据点。步骤S133使用Grubbs’Test识别单个异常值,通过计算每个数据点与均值之间的差异,找出与其他数据点明显偏离的数据点;通过进行统计检验,可以确定哪些数据点是显著偏离的异常值;通过统计学方法识别出单个的异常数据点。Preferably, step S131 of this embodiment describes the central tendency and dispersion of the data, and intuitively displays the distribution of the data; through the box plot, it can be identified whether the data point deviates from the normal range, and the data point that exceeds the interquartile range of 1.5 times the upper and lower quartiles is defined as an outlier; the data is visualized for analysis so as to intuitively find outliers. Step S132 calculates the Z-score. By calculating the degree of deviation between each data point and the mean, it is measured by the Z-score. If the Z-score is greater than 3 or less than -3, it means that the deviation between the data point and the mean is very large and is regarded as an outlier; the abnormal data points that deviate significantly from the mean are identified by statistical methods. Step S133 uses Grubbs'Test to identify single outliers, and by calculating the difference between each data point and the mean, finds out the data points that deviate significantly from other data points; by performing statistical tests, it can be determined which data points are significantly deviated outliers; and single abnormal data points are identified by statistical methods.

综上所述,本实施例对采集到的运行数据进行异常值检测和处理,通过识别和删除异常值,可以提高数据的质量和准确性,使得后续的数据分析和建模工作更加可靠和有效。In summary, this embodiment performs outlier detection and processing on the collected operating data. By identifying and deleting outliers, the quality and accuracy of the data can be improved, making subsequent data analysis and modeling work more reliable and effective.

进一步的,如图3所示,步骤S2中在线状态评估模型的构建过程具体包括以下步骤:Further, as shown in FIG3 , the process of constructing the online status evaluation model in step S2 specifically includes the following steps:

步骤S21:使用缓解算法Relief从经过异常值检测处理的运行数据中提取目标状态特征,计算特征与目标状态之间的相关性来评估特征的重要性;Step S21: extracting target state features from the operating data processed by outlier detection using the relief algorithm Relief, and calculating the correlation between the features and the target state to evaluate the importance of the features;

步骤S22:根据相关性指标对提取到的目标状态特征进行选择,相关性指标确定哪些特征对于描述风电机的运行状态最为重要;Step S22: selecting the extracted target state features according to the correlation index, the correlation index determining which features are most important for describing the operating state of the wind turbine;

步骤S23:利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;在训练过程中,使用机器学习算法构建模型,以根据目标状态特征对风电机的在线状态进行评估。Step S23: using the operating data containing the target state characteristics for training to establish an online state assessment model for the wind turbine; during the training process, using a machine learning algorithm to construct a model to assess the online state of the wind turbine according to the target state characteristics.

优选地,本实施例的步骤S21使用缓解算法Relief从经过异常值检测处理的运行数据中提取目标状态特征,计算特征与目标状态之间的相关性来评估特征的重要性;通过数据分析和特征提取,得到与风电机运行状态相关的特征,从而提高模型的准确性和可靠性;为后续的特征选择和模型建立提供重要的数据基础。步骤S22根据相关性指标对提取到的目标状态特征进行选择,相关性指标确定哪些特征对于描述风电机的运行状态最为重要;通过相关性分析,筛选出对风电机运行状态具有重要影响的特征,减少特征的维度,提高模型的简洁性和可解释性;提高模型的效率和准确性,同时降低建模的复杂度。步骤S23利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型,在训练过程中,使用机器学习算法构建模型,以根据目标状态特征对风电机的在线状态进行评估;通过模型训练,学习特征与目标状态之间的关联规律,从而实现对风电机在线状态的准确评估;提供一种快速、准确的方法来评估风电机的状态,及时发现潜在问题并采取相应的操作和维护措施,提高风电机的稳定性和可靠性。Preferably, step S21 of this embodiment uses the relief algorithm Relief to extract target state features from the operating data processed by outlier detection, and calculates the correlation between the features and the target state to evaluate the importance of the features; through data analysis and feature extraction, features related to the operating state of the wind turbine are obtained, thereby improving the accuracy and reliability of the model; providing an important data basis for subsequent feature selection and model establishment. Step S22 selects the extracted target state features according to the correlation index, and the correlation index determines which features are most important for describing the operating state of the wind turbine; through correlation analysis, features that have an important impact on the operating state of the wind turbine are screened out, the dimension of the features is reduced, and the simplicity and interpretability of the model are improved; the efficiency and accuracy of the model are improved, while reducing the complexity of modeling. Step S23 uses the operating data containing the target state characteristics for training to establish an online state evaluation model for the wind turbine. During the training process, a machine learning algorithm is used to build a model to evaluate the online state of the wind turbine according to the target state characteristics; through model training, the association between the characteristics and the target state is learned, so as to achieve accurate evaluation of the online state of the wind turbine; a fast and accurate method is provided to evaluate the state of the wind turbine, timely discover potential problems and take corresponding operation and maintenance measures, so as to improve the stability and reliability of the wind turbine.

进一步地,步骤S21中缓解算法Relief地过程具体包括以下步骤:Furthermore, the process of the relief algorithm Relief in step S21 specifically includes the following steps:

步骤S211:初始化特征权重,对于每个特征,初始化其权重为0;随机选择一个样本作为参考样本;根据欧氏距离计算其他样本与参考样本之间的相似度,并根据相似度对其他样本进行排序;Step S211: Initialize feature weights. For each feature, initialize its weight to 0; randomly select a sample as a reference sample; calculate the similarity between other samples and the reference sample according to the Euclidean distance, and sort other samples according to the similarity;

步骤S212:对于每个特征,计算参考样本与最近邻样本之间的差异值。如果最近邻样本属于目标状态类别,则增加特征权重;如果最近邻样本不属于目标状态类别,则减少特征权重;Step S212: For each feature, calculate the difference between the reference sample and the nearest neighbor sample. If the nearest neighbor sample belongs to the target state category, increase the feature weight; if the nearest neighbor sample does not belong to the target state category, reduce the feature weight;

步骤S213:根据差异值和样本权重更新特征权重,重复直到遍历所有样本;根据特征权重进行特征选择,选择达到权重阈值的特征作为目标状态特征。Step S213: Update the feature weight according to the difference value and the sample weight, and repeat until all samples are traversed; perform feature selection according to the feature weight, and select the feature that reaches the weight threshold as the target state feature.

优选地,本实施例的步骤S211初始化特征权重并选择参考样本,计算其他样本与参考样本之间的相似度;通过计算相似度,可以了解每个样本与参考样本的相似程度,为后续步骤提供基础数据。步骤S212计算参考样本与最近邻样本之间的差异值,并根据最近邻样本的类别对特征权重进行调整;通过计算差异值和调整权重,可以评估每个特征对目标状态的重要性,并根据最近邻样本的类别调整权重,从而区分对目标状态的贡献。步骤S213根据差异值和样本权重更新特征权重;通过迭代更新特征权重,可以逐渐提高对目标状态的判别能力,以更准确地选择目标状态特征。Preferably, step S211 of this embodiment initializes the feature weights and selects a reference sample, and calculates the similarity between other samples and the reference sample; by calculating the similarity, the degree of similarity between each sample and the reference sample can be understood, providing basic data for subsequent steps. Step S212 calculates the difference value between the reference sample and the nearest neighbor sample, and adjusts the feature weight according to the category of the nearest neighbor sample; by calculating the difference value and adjusting the weight, the importance of each feature to the target state can be evaluated, and the weight can be adjusted according to the category of the nearest neighbor sample to distinguish the contribution to the target state. Step S213 updates the feature weight according to the difference value and sample weight; by iteratively updating the feature weight, the ability to discriminate the target state can be gradually improved to more accurately select the target state feature.

综上所述,本实施例的缓解算法Relief通过评估特征与目标状态之间的相关性,选择重要的特征,从而提高风电机在线状态评估模型的准确性和可靠性。通过准确评估风电机的在线状态,可以及时检测潜在问题,提高风电机的运行效率和安全性。In summary, the mitigation algorithm Relief of this embodiment improves the accuracy and reliability of the wind turbine online status assessment model by evaluating the correlation between the features and the target state. By accurately evaluating the online status of the wind turbine, potential problems can be detected in time, improving the operating efficiency and safety of the wind turbine.

进一步地,步骤S22中根据相关性指标对提取到的目标状态特征进行选择的过程具体包括以下步骤:Furthermore, the process of selecting the extracted target state features according to the correlation index in step S22 specifically includes the following steps:

步骤S221:计算特征与目标状态之间的相关性,对于每个特征,通过互信息计算其与目标状态之间的相关性指标;Step S221: Calculate the correlation between the feature and the target state. For each feature, calculate the correlation index between it and the target state through mutual information.

步骤S222:根据互信息的值,确定哪些特征对于描述风电机的运行状态最为重要,设置一个互信息的阈值,只选择互信息值超过阈值的特征作为目标状态特征;互信息值越大表示特征与目标状态之间的相关性越高,被认为是重要特征;Step S222: according to the mutual information value, determine which features are most important for describing the operating state of the wind turbine, set a mutual information threshold, and only select features whose mutual information value exceeds the threshold as target state features; the larger the mutual information value, the higher the correlation between the feature and the target state, and is considered to be an important feature;

步骤S223:根据相关性指标,按照相关性指标的大小进行排序,选择排名靠前的特征作为目标状态特征。Step S223: sort according to the relevance index and select the top-ranked features as the target state features.

优选地,本实施例的步骤S221计算特征与目标状态之间的相关性,通过互信息计算其与目标状态之间的相关性指标,量化特征与目标状态之间的相互依赖程度,通过互信息衡量特征对目标状态的贡献程度;计算得到的相关性指标可以用来评估特征与目标状态之间的相关性。步骤S222根据互信息的值确定哪些特征对于描述目标状态最为重要,通过设置一个互信息的阈值,只选择互信息值超过阈值的特征作为目标状态特征;互信息值越大表示特征与目标状态之间的相关性越高,被认为是重要特征;筛选出对目标状态具有较高相关性的特征,以保留对目标状态预测有用的特征。步骤S223根据相关性指标,按照相关性指标的大小进行排序,选择排名靠前的特征作为目标状态特征;根据相关性指标的重要性排序,选择具有最高相关性的特征作为目标状态特征;通过选择排名靠前的特征,可以保留对目标状态预测更有贡献的特征。Preferably, step S221 of this embodiment calculates the correlation between the feature and the target state, calculates the correlation index between it and the target state through mutual information, quantifies the degree of mutual dependence between the feature and the target state, and measures the contribution of the feature to the target state through mutual information; the calculated correlation index can be used to evaluate the correlation between the feature and the target state. Step S222 determines which features are most important for describing the target state according to the value of the mutual information, and only selects features whose mutual information values exceed the threshold as target state features by setting a threshold of the mutual information; the larger the mutual information value, the higher the correlation between the feature and the target state, which is considered to be an important feature; screen out features with higher correlation to the target state to retain features useful for target state prediction. Step S223 sorts according to the correlation index according to the size of the correlation index, and selects the top-ranked features as target state features; sorts according to the importance of the correlation index, and selects the features with the highest correlation as target state features; by selecting the top-ranked features, features that contribute more to target state prediction can be retained.

综上所述,本实施例筛选出与目标状态相关性较高的特征,从而提高对目标状态的预测准确性和可解释性;通过特征选择可以降低数据维度,减少冗余特征的影响,提高模型的泛化能力和效率。同时,特征选择还可以降低模型的复杂度和计算成本,使得模型更易于理解和解释。In summary, this embodiment selects features with high correlation with the target state, thereby improving the prediction accuracy and interpretability of the target state; feature selection can reduce data dimensions, reduce the impact of redundant features, and improve the generalization ability and efficiency of the model. At the same time, feature selection can also reduce the complexity and computational cost of the model, making the model easier to understand and explain.

进一步地,步骤S221中互信息的计算过程具体包括以下步骤:Furthermore, the mutual information calculation process in step S221 specifically includes the following steps:

步骤S2211:计算特征和目标状态各自的概率分布,即特征的概率分布和目标状态的概率分布;计算特征和目标状态的联合概率分布,即特征和目标状态同时发生的概率;Step S2211: Calculate the probability distribution of each feature and target state, that is, the probability distribution of the feature and the probability distribution of the target state; calculate the joint probability distribution of the feature and the target state, that is, the probability of the feature and the target state occurring at the same time;

步骤S2212:根据边缘概率分布和联合概率分布,计算特征与目标状态之间的互信息,互信息的计算公式如下:Step S2212: Calculate the mutual information between the feature and the target state according to the marginal probability distribution and the joint probability distribution. The calculation formula of the mutual information is as follows:

I(X,Y)=∑P(x,y)*log(P(x,y)/(P(x)*P(y)))I(X,Y)=∑P(x,y)*log(P(x,y)/(P(x)*P(y)))

其中,X表示特征,Y表示目标状态,P(x,y)表示特征和目标状态同时发生的概率,P(x)和P(y)分别表示特征和目标状态的边缘概率分布;Where X represents the feature, Y represents the target state, P(x,y) represents the probability of the feature and the target state occurring at the same time, and P(x) and P(y) represent the marginal probability distribution of the feature and the target state respectively;

步骤S2213:将互信息的计算结果与互信息的阈值进行对比,选择处互信息值超过阈值的特征作为目标状态特征。Step S2213: Compare the calculation result of the mutual information with the threshold of the mutual information, and select the feature whose mutual information value exceeds the threshold as the target state feature.

优选地,本实施例的步骤S2211计算特征和目标状态的概率分布,可以帮助了解特征和目标状态的分布情况,为后续计算互信息提供基础。步骤S2212通过计算互信息,可以衡量特征和目标状态之间的相关性,互信息值越大表示特征与目标状态之间的相关性越高,被认为是重要特征。步骤S2213根据互信息的阈值,筛选出互信息值超过阈值的特征作为目标状态特征,通过设置阈值来控制选择的特征数量,只选择与目标状态相关性较高的特征,减少了冗余信息。Preferably, step S2211 of this embodiment calculates the probability distribution of features and target states, which can help understand the distribution of features and target states and provide a basis for the subsequent calculation of mutual information. Step S2212 can measure the correlation between features and target states by calculating mutual information. The larger the mutual information value, the higher the correlation between the feature and the target state, which is considered to be an important feature. Step S2213 selects features whose mutual information values exceed the threshold as target state features based on the threshold of the mutual information. By setting the threshold to control the number of selected features, only features with high correlation with the target state are selected, reducing redundant information.

综上所述,本实施例根据特征与目标状态之间的相关性选择出最能描述目标状态的特征;对于分析和预测目标状态具有重要意义,可以帮助深入理解目标状态的变化规律,从而进行有效的决策和控制。In summary, this embodiment selects the feature that best describes the target state based on the correlation between the feature and the target state; it is of great significance for analyzing and predicting the target state, and can help to deeply understand the changing law of the target state, thereby making effective decisions and controls.

进一步地,步骤S23中使用机器学习算法构建模型的过程具体包括以下步骤:Furthermore, the process of using the machine learning algorithm to build a model in step S23 specifically includes the following steps:

步骤S231:对包含目标状态特征的运行数据进行归一化预处理,选择随机森林的机器学习算法进行建模;Step S231: normalize and preprocess the operation data containing the target state characteristics, and select the random forest machine learning algorithm for modeling;

步骤S232:将预处理后的数据集划分为训练集和测试集,训练集用于在线状态评估模型的训练和参数调优,测试集用于评估在线状态评估模型的性能;Step S232: Divide the preprocessed data set into a training set and a test set, the training set is used for training and parameter tuning of the online state assessment model, and the test set is used for evaluating the performance of the online state assessment model;

步骤S233:使用训练集对选定的机器学习算法进行训练,调整在线状态评估模型的参数。Step S233: Use the training set to train the selected machine learning algorithm and adjust the parameters of the online state assessment model.

优选地,本实施例的步骤S231归一化预处理能够将不同特征的取值范围统一,避免某些特征对模型训练的影响过大;选择随机森林算法进行建模是因为随机森林具有较好的特征选择能力和鲁棒性。步骤S232将数据集划分为训练集和测试集可以用于模型的训练和评估,训练集用于训练模型,优化模型的参数和算法,使其能够充分学习目标状态的特征,测试集用于评估模型的性能,验证模型的泛化能力和预测准确度。步骤S233通过训练模型和调整参数,可以优化模型的性能,提高模型的预测准确度,通过优化在线状态评估模型,可以提高风电机的在线状态评估的准确度和可靠性,为风电机的运维管理提供支持,减少故障风险和维修成本。Preferably, the normalization preprocessing in step S231 of this embodiment can unify the value ranges of different features and avoid excessive influence of certain features on model training; the random forest algorithm is selected for modeling because random forest has good feature selection ability and robustness. Step S232 divides the data set into a training set and a test set, which can be used for model training and evaluation. The training set is used to train the model, optimize the parameters and algorithms of the model so that it can fully learn the characteristics of the target state, and the test set is used to evaluate the performance of the model and verify the generalization ability and prediction accuracy of the model. Step S233 can optimize the performance of the model and improve the prediction accuracy of the model by training the model and adjusting the parameters. By optimizing the online status evaluation model, the accuracy and reliability of the online status evaluation of the wind turbine can be improved, providing support for the operation and maintenance management of the wind turbine and reducing the risk of failure and maintenance costs.

进一步地,步骤S233中使用训练集对选定的机器学习算法进行训练的过程具体包括以下步骤:Furthermore, the process of using the training set to train the selected machine learning algorithm in step S233 specifically includes the following steps:

步骤S2331:将训练集输入到机器学习算法中进行训练,训练集包含经过预处理的运行数据和对应的目标状态;Step S2331: inputting a training set into a machine learning algorithm for training, wherein the training set includes preprocessed operating data and corresponding target states;

步骤S2332:在训练过程中,机器学习算法根据输入的特征和目标状态之间的关系来学习在线状态评估模型的参数;根据训练数据的特征和目标状态的对应关系调整在线状态评估模型的权重和偏置;Step S2332: During the training process, the machine learning algorithm learns the parameters of the online state assessment model according to the relationship between the input features and the target state; and adjusts the weights and biases of the online state assessment model according to the corresponding relationship between the features of the training data and the target state;

其中,对在线状态评估模型的参数进行初始化,初始值是随机值;使用训练集中的数据样本输入在线状态评估模型,计算在线状态评估模型的预测结果,并与真实的目标状态进行比较;根据比较结果,计算损失函数的值,用来衡量在线状态评估模型预测结果与目标状态之间的差距;对损失函数进行求导,得到关于在线状态评估模型参数的梯度,梯度表示了损失函数在当前参数取值处的变化率,指示了参数更新的方向;根据梯度的方向和大小,使用梯度下降来更新在线状态评估模型的参数,更新的过程是将当前参数值减去学习率乘以梯度,学习率决定了每次参数更新的步长;重复迭代,直到达到指定的迭代次数或者损失函数的变化足够小,达到了预设的收敛条件;The parameters of the online state evaluation model are initialized, and the initial values are random values; the data samples in the training set are used to input the online state evaluation model, and the prediction results of the online state evaluation model are calculated and compared with the actual target state; according to the comparison results, the value of the loss function is calculated to measure the gap between the prediction results of the online state evaluation model and the target state; the loss function is derived to obtain the gradient of the parameters of the online state evaluation model. The gradient represents the rate of change of the loss function at the current parameter value and indicates the direction of parameter update; according to the direction and size of the gradient, the parameters of the online state evaluation model are updated using gradient descent. The updating process is to subtract the learning rate from the current parameter value and multiply it by the gradient. The learning rate determines the step size of each parameter update; the iteration is repeated until the specified number of iterations is reached or the change in the loss function is small enough to meet the preset convergence condition;

步骤S2333:训练过程中,进行调整决策树的深度、随机森林中树的数量等参数调优,训练完成后,得到了训练好的在线状态评估模型;Step S2333: During the training process, parameters such as the depth of the decision tree and the number of trees in the random forest are adjusted. After the training is completed, a trained online state evaluation model is obtained;

其中,通过尝试不同的参数取值,计算相应的损失函数值,并选择使得损失函数最小的参数组合作为最终的模型参数设置。Among them, by trying different parameter values, the corresponding loss function value is calculated, and the parameter combination that minimizes the loss function is selected as the final model parameter setting.

优选地,本实施例的步骤S2331将训练集输入到机器学习算法中进行训练,通过训练集中的数据样本来训练模型,包含了经过预处理的运行数据和对应的目标状态;为了让机器学习算法能够学习到输入特征和目标状态之间的关系,从而能够进行准确的预测和分类。步骤S2332在训练过程中,机器学习算法根据输入的特征和目标状态之间的关系来学习在线状态评估模型的参数,通过调整模型的权重和偏置,使得模型能够更好地拟合训练数据;为了最小化损失函数,即使模型的预测结果与目标状态尽可能地接近。步骤S2333训练过程中,进行调整决策树的深度、随机森林中树的数量等参数调优,通过尝试不同的参数取值,计算相应的损失函数值,并选择使得损失函数最小的参数组合作为最终的模型参数设置,目的是为了优化模型的性能,提高模型的准确度和泛化能力。Preferably, step S2331 of this embodiment inputs the training set into the machine learning algorithm for training, and trains the model through the data samples in the training set, including the preprocessed operation data and the corresponding target state; in order to enable the machine learning algorithm to learn the relationship between the input features and the target state, so as to be able to make accurate predictions and classifications. Step S2332 During the training process, the machine learning algorithm learns the parameters of the online state evaluation model according to the relationship between the input features and the target state, and adjusts the weights and biases of the model so that the model can better fit the training data; in order to minimize the loss function, the prediction results of the model are as close to the target state as possible. Step S2333 During the training process, the depth of the decision tree, the number of trees in the random forest and other parameters are adjusted, and the corresponding loss function values are calculated by trying different parameter values, and the parameter combination that minimizes the loss function is selected as the final model parameter setting, in order to optimize the performance of the model and improve the accuracy and generalization ability of the model.

综上所述,本实施例通过训练集来训练模型,使得模型能够学习到输入特征和目标状态之间的关系,并通过调整模型参数来最小化损失函数,使得模型的预测结果与目标状态尽可能地接近;最终达到的意义是得到一个训练好的在线状态评估模型,该模型能够对新的输入特征进行预测和分类,从而为后续的应用提供准确的结果。同时,通过参数调优,还可以提高模型的性能和泛化能力,使得模型具有更好的适应性和推广能力。In summary, this embodiment trains the model through the training set so that the model can learn the relationship between the input features and the target state, and minimizes the loss function by adjusting the model parameters so that the prediction results of the model are as close as possible to the target state; the ultimate goal is to obtain a trained online state evaluation model that can predict and classify new input features, thereby providing accurate results for subsequent applications. At the same time, through parameter tuning, the performance and generalization ability of the model can also be improved, so that the model has better adaptability and promotion ability.

进一步地,如图4所示,步骤S3中将评估结果可视化展示的过程具体包括以下步骤:Further, as shown in FIG4 , the process of visually displaying the evaluation results in step S3 specifically includes the following steps:

步骤S31:将在线状态评估模型的预测结果与实际的目标状态进行比较,计算评估指标,评估在线状态评估模型的性能;Step S31: Compare the prediction result of the online state evaluation model with the actual target state, calculate the evaluation index, and evaluate the performance of the online state evaluation model;

步骤S32:将实时采集的数据输入到建立好的在线状态评估模型中,获取模型的预测结果;Step S32: input the real-time collected data into the established online status assessment model to obtain the prediction result of the model;

步骤S33:将在线状态评估模型的预测结果以图表形式展示出来,根据在线状态评估的预测结果,及时采取措施进行维护。Step S33: Display the prediction results of the online status evaluation model in the form of a chart, and take timely maintenance measures based on the prediction results of the online status evaluation.

优选地,本实施例的步骤S31通过比较在线状态评估模型的预测结果与实际的目标状态,可以计算评估指标,如准确率、召回率、F1值等,以评估在线状态评估模型的性能;帮助判断模型的准确性和可靠性,同时也为后续的模型改进提供参考。步骤S32将实时采集的数据输入到在线状态评估模型中,获取模型的预测结果;利用已建立的模型对实时数据进行预测,以得到当前风电机的在线状态。该预测结果可以用于判断风电机是否正常运行或存在异常情况。步骤S33将在线状态评估模型的预测结果以图表形式展示出来,可以直观地展示风电机的在线状态。通过可视化展示,操作人员可以更快速地了解风电机的运行情况,及时采取措施进行维护和修复;有助于提高风电机的运行效率和可靠性,减少停机时间和维修成本,提高风电发电的稳定性和经济性。Preferably, step S31 of this embodiment can calculate evaluation indicators such as accuracy, recall rate, F1 value, etc. by comparing the prediction results of the online state evaluation model with the actual target state to evaluate the performance of the online state evaluation model; help judge the accuracy and reliability of the model, and also provide a reference for subsequent model improvement. Step S32 inputs the real-time collected data into the online state evaluation model to obtain the prediction results of the model; uses the established model to predict the real-time data to obtain the current online state of the wind turbine. The prediction result can be used to determine whether the wind turbine is operating normally or there is an abnormality. Step S33 displays the prediction results of the online state evaluation model in the form of a chart, which can intuitively display the online state of the wind turbine. Through visual display, operators can understand the operation of the wind turbine more quickly and take timely measures for maintenance and repair; it helps to improve the operating efficiency and reliability of the wind turbine, reduce downtime and maintenance costs, and improve the stability and economy of wind power generation.

综上所述,本实施例通过以上步骤的实施,可以对风电机的在线状态进行评估,并及时采取相应的操作和维护措施,从而保证风电机的正常运行,提高风电发电系统的可靠性和效益。In summary, by implementing the above steps, this embodiment can evaluate the online status of the wind turbine and take corresponding operation and maintenance measures in time, thereby ensuring the normal operation of the wind turbine and improving the reliability and efficiency of the wind power generation system.

进一步地,步骤S33中预测结果以图表形式展示出来的过程具体包括以下步骤:Furthermore, the process of displaying the prediction results in the form of a chart in step S33 specifically includes the following steps:

步骤S331:将预测结果转换成时间序列数据;Step S331: converting the prediction results into time series data;

步骤S332:选择折线图展示预测结果,横轴为时间轴,表示时间的连续性,按照时间的顺序排列数据点;纵轴伟预测结果的值,是状态的评估分数;标题是在线状态评估结果的变化趋势,横轴标签是时间,纵轴标签是评估结果;Step S332: Select a line chart to display the prediction results. The horizontal axis is the time axis, indicating the continuity of time, and the data points are arranged in chronological order. The vertical axis is the value of the prediction result, which is the evaluation score of the state. The title is the change trend of the online state evaluation result. The horizontal axis label is time, and the vertical axis label is the evaluation result.

步骤S333:利用数据可视化工具,将预测结果以图表形式呈现出来。Step S333: Use data visualization tools to present the prediction results in the form of charts.

优选地,本实施例的步骤S331将预测结果转换成时间序列数据是将预测结果按照时间的顺序进行排列,方便后续的可视化展示;为了将预测结果与时间关联起来,可以更清晰地观察预测结果的变化趋势。步骤S332选择折线图展示预测结果使用折线图来展示预测结果的变化趋势,横轴表示时间的连续性,纵轴表示评估结果的值,标题和标签的设置可以提供对图表内容的描述和解释;通过可视化展示,直观地观察预测结果的变化趋势,帮助用户理解和分析评估模型的性能。步骤S333利用数据可视化工具将预测结果以图表形式呈现出来,通过数据可视化工具将处理好的数据转化为图表,例如使用Python中的Matplotlib、Seaborn等库;将数据转化为直观的图表形式,提供更直观、易懂的展示方式,使用户更容易理解和分析预测结果,及时采取相应的维护措施。Preferably, step S331 of this embodiment converts the prediction results into time series data by arranging the prediction results in chronological order to facilitate subsequent visual display; in order to associate the prediction results with time, the changing trend of the prediction results can be observed more clearly. Step S332 selects a line chart to display the prediction results. A line chart is used to display the changing trend of the prediction results. The horizontal axis represents the continuity of time, and the vertical axis represents the value of the evaluation result. The setting of the title and label can provide a description and explanation of the content of the chart; through visual display, the changing trend of the prediction results can be intuitively observed to help users understand and analyze the performance of the evaluation model. Step S333 uses a data visualization tool to present the prediction results in the form of a chart, and converts the processed data into a chart through a data visualization tool, such as using Matplotlib, Seaborn and other libraries in Python; converts the data into an intuitive chart form, providing a more intuitive and easy-to-understand display method, so that users can more easily understand and analyze the prediction results and take corresponding maintenance measures in time.

如图5所示,本实施例还提供了风电机的在线状态评估装置的一个实施例,在本实施例中,风电机的在线状态评估装置应用于如上述实施例中的风电机的在线状态评估方法,该风电机的在线状态评估装置包括依次电性连接的数据处理模块1、模型构建模块2、状态评估模块3;As shown in FIG5 , this embodiment further provides an embodiment of an online state assessment device for a wind turbine. In this embodiment, the online state assessment device for a wind turbine is applied to the online state assessment method for a wind turbine in the above embodiment. The online state assessment device for a wind turbine comprises a data processing module 1, a model building module 2, and a state assessment module 3 which are electrically connected in sequence;

其中,数据处理模块1用于通过传感器等设备,实时采集风电机的振动数据、温度数据、电流数据及功率数据等运行数据,对采集到的运行数据进行异常值检测处理;模型构建模块2用于通过缓解算法Relief,从异常值检测处理后的运行数据中提取目标状态特征,目标状态特征用于描述风电机的运行状态;根据特征的相关性指标,对提取到的目标状态特征进行选择;状态评估模块3用于利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;通过交叉验证,对建立的在线状态评估模型进行评估,利用建立好的在线状态评估模型,对实时采集的数据进行预测和判断,评估风电机的在线状态;将评估结果可视化展示,并根据评估结果进行相应的操作和维护。Among them, the data processing module 1 is used to collect the vibration data, temperature data, current data, power data and other operating data of the wind turbine in real time through sensors and other equipment, and perform outlier detection processing on the collected operating data; the model building module 2 is used to extract the target state characteristics from the operating data after outlier detection processing through the relief algorithm Relief, and the target state characteristics are used to describe the operating state of the wind turbine; according to the correlation index of the characteristics, the extracted target state characteristics are selected; the state evaluation module 3 is used to use the operating data containing the target state characteristics for training to establish an online state evaluation model of the wind turbine; through cross-validation, the established online state evaluation model is evaluated, and the real-time collected data is predicted and judged using the established online state evaluation model to evaluate the online state of the wind turbine; the evaluation results are visualized and corresponding operations and maintenance are performed according to the evaluation results.

优选地,本实施例的数据处理模块1通过对实时采集的风电机运行数据进行异常值检测处理,可以识别和剔除可能存在的异常数据,保证后续模型构建和状态评估的准确性和可靠性。模型构建模块2使用缓解算法Relief从异常值检测处理后的运行数据中提取目标状态特征,这些特征能够有效地描述风电机的运行状态,通过相关性指标选择特征,可以提高模型的质量和效果。状态评估模块3利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型,通过交叉验证对模型进行评估,可以评估模型的准确性和泛化能力;利用建立的在线状态评估模型对实时采集的数据进行预测和判断,可以及时发现风电机的异常状态,为操作和维护提供依据。Preferably, the data processing module 1 of this embodiment can identify and eliminate possible abnormal data by performing abnormal value detection processing on the real-time collected wind turbine operation data, thereby ensuring the accuracy and reliability of subsequent model construction and state evaluation. The model construction module 2 uses the relief algorithm Relief to extract target state features from the operation data after abnormal value detection processing. These features can effectively describe the operation state of the wind turbine. The quality and effect of the model can be improved by selecting features through correlation indicators. The state evaluation module 3 uses the operation data containing the target state features for training to establish an online state evaluation model for the wind turbine. The model is evaluated through cross-validation to evaluate the accuracy and generalization ability of the model; the established online state evaluation model is used to predict and judge the real-time collected data, so that the abnormal state of the wind turbine can be discovered in time, providing a basis for operation and maintenance.

综上所述,本实施例包括提高数据的可靠性和准确性、提取有效的特征、建立准确的状态评估模型、及时发现和处理异常情况;提高风电机的运行效率和可靠性,减少故障和停机时间,降低维护成本,提高发电效益。In summary, this embodiment includes improving the reliability and accuracy of data, extracting effective features, establishing an accurate status assessment model, and timely discovering and handling abnormal situations; improving the operating efficiency and reliability of wind turbines, reducing failures and downtime, reducing maintenance costs, and improving power generation efficiency.

进一步地,数据处理模块1具体包括:Furthermore, the data processing module 1 specifically includes:

数据排序子模块,用于确定运行数据窗口的大小,从运行数据序列的开头开始,依次将窗口内的数据点放入一个数组中,对窗口内的数据点进行排序,并取其中间位置的值作为中位数;如果窗口大小为奇数,则中位数为排序后的中间值;如果窗口大小为偶数,则中位数为排序后中间两个值的平均值;The data sorting submodule is used to determine the size of the running data window, starting from the beginning of the running data sequence, and sequentially put the data points in the window into an array, sort the data points in the window, and take the value in the middle position as the median; if the window size is an odd number, the median is the middle value after sorting; if the window size is an even number, the median is the average of the two middle values after sorting;

数据去噪子模块,用于将当前窗口的中位数作为当前数据点的值,替换原始运行数据序列中的对应位置的值;将窗口向后移动一个位置,直到遍历完所有数据点,得到去噪后的运行数据;The data denoising submodule is used to use the median of the current window as the value of the current data point to replace the value of the corresponding position in the original running data sequence; the window is moved backward by one position until all data points are traversed to obtain the denoised running data;

异常值判断子模块,用于使用统计学方法,对去噪后的运行数据进行描述性统计分析,检测是否存在与其他运行数据明显偏离的异常值;使用箱线图,通过判断数据点与其它数据点的偏离程度,识别异常值,将被识别为异常值的数据点从数据集中删除。The outlier judgment submodule is used to use statistical methods to perform descriptive statistical analysis on the denoised operating data to detect whether there are outliers that are significantly deviated from other operating data; use box plots to identify outliers by judging the degree of deviation between data points and other data points, and delete data points identified as outliers from the data set.

优选地,本实施例的数据排序子模块将窗口内的数据点进行排序,并取中位数作为窗口的代表值;确定窗口的大小,并选择合适的窗口代表值来平衡数据的变化。数据去噪子模块将当前窗口的中位数作为当前数据点的值,替换原始数据序列中的对应位置的值;通过使用中位数来去除原始数据中的噪声,从而得到更平滑的数据序列。异常值判断子模块使用统计学方法和箱线图来检测去噪后的数据中是否存在明显偏离的异常值;根据数据的分布情况和离群点的定义,识别出异常值并将其从数据集中删除;确保数据的准确性和可靠性,避免异常值对结果的影响。Preferably, the data sorting submodule of this embodiment sorts the data points in the window and takes the median as the representative value of the window; determines the size of the window, and selects a suitable window representative value to balance the change of the data. The data denoising submodule uses the median of the current window as the value of the current data point, replacing the value of the corresponding position in the original data sequence; removes the noise in the original data by using the median, thereby obtaining a smoother data sequence. The outlier judgment submodule uses statistical methods and box-and-whisker plots to detect whether there are obviously deviated outliers in the denoised data; identifies outliers and deletes them from the data set based on the distribution of the data and the definition of outliers; ensures the accuracy and reliability of the data and avoids the influence of outliers on the results.

进一步的,模型构建模块2具体包括:Furthermore, the model building module 2 specifically includes:

特征评估子模块,用于使用缓解算法Relief从经过异常值检测处理的运行数据中提取目标状态特征,计算特征与目标状态之间的相关性来评估特征的重要性;The feature evaluation submodule is used to extract the target state features from the running data processed by outlier detection using the relief algorithm Relief, and calculate the correlation between the features and the target state to evaluate the importance of the features;

特征确定子模块,用于根据相关性指标对提取到的目标状态特征进行选择,相关性指标确定哪些特征对于描述风电机的运行状态最为重要;A feature determination submodule is used to select the extracted target state features according to a correlation index, which determines which features are most important for describing the operating state of the wind turbine;

模型训练子模块,利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;在训练过程中,使用机器学习算法构建模型,以根据目标状态特征对风电机的在线状态进行评估。The model training submodule uses the operating data containing the target state characteristics for training to establish an online state assessment model for the wind turbine. During the training process, a machine learning algorithm is used to build a model to evaluate the online state of the wind turbine based on the target state characteristics.

优选地,本实施例的特征评估子模块使用Relief算法从经过异常值检测处理的运行数据中提取目标状态特征,并评估特征与目标状态之间的相关性;确定哪些特征对于描述风电机的运行状态最为重要,从而提供有价值的特征信息给后续的特征选择和模型训练。特征确定子模块根据相关性指标对提取到的目标状态特征进行选择;通过筛选出与目标状态相关性较高的特征,提高模型的准确性和效果,减少特征维度,提高模型的可解释性和计算效率。模型训练子模块利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;通过机器学习算法构建模型,利用目标状态特征对风电机的在线状态进行评估,提供对风电机状态的实时预测和监测,帮助提高风电机的运行效率和可靠性。Preferably, the feature evaluation submodule of this embodiment uses the Relief algorithm to extract target state features from the operating data processed by outlier detection, and evaluates the correlation between the features and the target state; determines which features are most important for describing the operating state of the wind turbine, thereby providing valuable feature information for subsequent feature selection and model training. The feature determination submodule selects the extracted target state features according to the correlation index; by screening out features with high correlation with the target state, the accuracy and effect of the model are improved, the feature dimension is reduced, and the interpretability and computational efficiency of the model are improved. The model training submodule uses the operating data containing the target state features for training to establish an online state evaluation model for the wind turbine; constructs a model through a machine learning algorithm, uses the target state features to evaluate the online state of the wind turbine, provides real-time prediction and monitoring of the wind turbine state, and helps improve the operating efficiency and reliability of the wind turbine.

进一步地,状态评估模块3具体包括:Furthermore, the status assessment module 3 specifically includes:

性能评估子模块,用于将在线状态评估模型的预测结果与实际的目标状态进行比较,计算评估指标,评估在线状态评估模型的性能;The performance evaluation submodule is used to compare the prediction results of the online state evaluation model with the actual target state, calculate the evaluation index, and evaluate the performance of the online state evaluation model;

结果输出子模块,用于将实时采集的数据输入到建立好的在线状态评估模型中,获取模型的预测结果;The result output submodule is used to input the real-time collected data into the established online status evaluation model to obtain the prediction results of the model;

结果展示子模块,用于将在线状态评估模型的预测结果以图表形式展示出来,根据在线状态评估的预测结果,及时采取措施进行维护。The result display submodule is used to display the prediction results of the online status evaluation model in the form of charts and graphs, and take timely maintenance measures based on the prediction results of the online status evaluation.

优选地,本实施例的性能评估子模块通过比较在线状态评估模型的预测结果与实际目标状态,计算评估指标,如准确率、精确率、召回率等来评估模型的性能;帮助评估在线状态评估模型的准确性和可靠性,了解模型的预测能力,为模型改进和优化提供依据。结果输出子模块将实时采集的数据输入到在线状态评估模型中,获取模型的预测结果;实现实时的状态评估,根据模型的预测结果判断风电机的运行状态,为及时采取措施进行维护和调整提供依据。结果展示子模块将在线状态评估模型的预测结果以图表形式展示出来,如曲线图、散点图等;通过直观的图表展示,帮助用户了解风电机的运行状态,及时发现异常情况或趋势,进行预警和决策,提高风电机的运行效率和可靠性。Preferably, the performance evaluation submodule of this embodiment evaluates the performance of the model by comparing the prediction results of the online state evaluation model with the actual target state and calculating evaluation indicators such as accuracy, precision, recall, etc.; helps evaluate the accuracy and reliability of the online state evaluation model, understands the prediction ability of the model, and provides a basis for model improvement and optimization. The result output submodule inputs the real-time collected data into the online state evaluation model to obtain the prediction results of the model; realizes real-time state evaluation, judges the operating state of the wind turbine according to the prediction results of the model, and provides a basis for taking timely measures for maintenance and adjustment. The result display submodule displays the prediction results of the online state evaluation model in the form of charts, such as curve charts, scatter plots, etc.; through intuitive chart display, it helps users understand the operating state of the wind turbine, discover abnormal conditions or trends in time, make early warnings and decisions, and improve the operating efficiency and reliability of the wind turbine.

综上所述,本实施例整个状态评估模块的目的是通过建立模型、评估模型性能和展示结果,实现对风电机运行状态的实时监测和评估,以提高风电机的运行效率、降低故障率,并为维护和调整提供科学依据。In summary, the purpose of the entire status assessment module of this embodiment is to achieve real-time monitoring and assessment of the operating status of the wind turbine by establishing a model, evaluating model performance and displaying results, so as to improve the operating efficiency of the wind turbine, reduce the failure rate, and provide a scientific basis for maintenance and adjustment.

如图6所示,本实施例提供了电子设备的一个实施例,在本实施例中,该电子设备4包括处理器41及和处理器41耦接的存储器42。As shown in FIG. 6 , this embodiment provides an embodiment of an electronic device. In this embodiment, the electronic device 4 includes a processor 41 and a memory 42 coupled to the processor 41 .

存储器42存储有用于实现上述任一实施例的风电机的在线状态评估方法的程序指令。The memory 42 stores program instructions for implementing the online state assessment method for a wind turbine according to any of the above embodiments.

处理器41用于执行存储器42存储的程序指令以进行风电机的在线状态评估。The processor 41 is used to execute program instructions stored in the memory 42 to perform online status assessment of the wind turbine.

其中,处理器41还可以称为CPU(Central Processing Unit,中央处理单元)。处理器41可能是一种集成电路芯片,具有信号的处理能力。处理器41还可以是通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 41 may also be referred to as a CPU (Central Processing Unit). The processor 41 may be an integrated circuit chip having signal processing capabilities. The processor 41 may also be a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. The general-purpose processor may be a microprocessor or the processor may also be any conventional processor, etc.

进一步地,图7为本申请一实施例的存储介质的结构示意图,本申请实施例的存储介质5存储有能够实现上述所有方法的程序指令51,其中,该程序指令51可以以软件产品的形式存储在上述存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器(processor)执行本申请各个实施方式所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-OnlyMemory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质,或者是计算机、服务器、手机、平板等终端设备。Further, FIG. 7 is a schematic diagram of the structure of a storage medium of an embodiment of the present application, wherein the storage medium 5 of the embodiment of the present application stores program instructions 51 capable of implementing all the above methods, wherein the program instructions 51 can be stored in the above storage medium in the form of a software product, including several instructions for enabling a computer device (which can be a personal computer, a server, or a network device, etc.) or a processor to execute all or part of the steps of the methods described in each embodiment of the present application. The aforementioned storage medium includes: various media that can store program codes, such as a USB flash drive, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, or terminal devices such as a computer, a server, a mobile phone, and a tablet.

在本发明所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其他的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其他的形式。In the several embodiments provided by the present invention, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the device embodiments described above are only schematic, for example, the division of units is only a logical function division, and there may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be an indirect coupling or communication connection through some interfaces, devices or units, which can be electrical, mechanical or other forms.

另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。以上仅为本发明的实施方式,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所做的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit may be implemented in the form of hardware or in the form of software functional units. The above is only an implementation mode of the present invention, and does not limit the patent scope of the present invention. Any equivalent structure or equivalent process transformation made by using the contents of the specification and drawings of the present invention, or directly or indirectly used in other related technical fields, is also included in the patent protection scope of the present invention.

以上对发明的具体实施方式进行了详细说明,但其只作为范例,本发明并不限制于以上描述的具体实施方式。对于本领域的技术人员而言,任何对该发明进行的等同修改或替代也都在本发明的范畴之中,因此,在不脱离本发明的精神和原则范围下所作的均等变换和修改、改进等,都应涵盖在本发明的范围内。The specific implementation methods of the invention are described in detail above, but they are only examples, and the invention is not limited to the specific implementation methods described above. For those skilled in the art, any equivalent modification or substitution of the invention is also within the scope of the invention, and therefore, the equalization, modification, improvement, etc. made without departing from the spirit and principle of the invention should be included in the scope of the invention.

Claims (10)

1.一种风电机的在线状态评估方法,其特征在于,所述风电机的在线状态评估方法包括:1. A method for online status assessment of a wind turbine, characterized in that the method for online status assessment of a wind turbine comprises: 通过传感器设备,实时采集风电机的振动数据、温度数据、电流数据及功率数据运行数据,对采集到的运行数据进行异常值检测处理;Through sensor equipment, the vibration data, temperature data, current data and power data of the wind turbine are collected in real time, and abnormal value detection and processing are performed on the collected operation data; 从异常值检测处理后的运行数据中提取目标状态特征,目标状态特征用于描述风电机的运行状态;对提取到的目标状态特征进行选择;利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;Extract target state features from the operating data after outlier detection processing, the target state features are used to describe the operating state of the wind turbine; select the extracted target state features; use the operating data containing the target state features for training to establish an online state assessment model for the wind turbine; 通过交叉验证,对建立的在线状态评估模型进行评估,利用建立好的在线状态评估模型,对实时采集的数据进行预测和判断,评估风电机的在线状态;将评估结果可视化展示,并根据评估结果进行相应的操作和维护。Through cross-validation, the established online status assessment model is evaluated. The established online status assessment model is used to predict and judge the real-time collected data to evaluate the online status of the wind turbine. The assessment results are visualized and corresponding operations and maintenance are performed based on the assessment results. 2.根据权利要求1所述的风电机的在线状态评估方法,其特征在于,对采集到的运行数据进行异常值检测处理的过程,包括:2. The online state assessment method of a wind turbine according to claim 1 is characterized in that the process of performing abnormal value detection processing on the collected operating data comprises: 确定运行数据窗口的大小,从运行数据序列的开头开始,依次将窗口内的数据点放入一个数组中,对窗口内的数据点进行排序,并取其中间位置的值作为中位数;如果窗口大小为奇数,则中位数为排序后的中间值;如果窗口大小为偶数,则中位数为排序后中间两个值的平均值;Determine the size of the running data window, start from the beginning of the running data sequence, put the data points in the window into an array, sort the data points in the window, and take the value in the middle position as the median; if the window size is an odd number, the median is the middle value after sorting; if the window size is an even number, the median is the average of the two middle values after sorting; 将当前窗口的中位数作为当前数据点的值,替换原始运行数据序列中的对应位置的值;将窗口向后移动一个位置,直到遍历完所有数据点,得到去噪后的运行数据;The median of the current window is used as the value of the current data point to replace the value of the corresponding position in the original running data sequence; the window is moved backward by one position until all data points are traversed to obtain the denoised running data; 使用统计学方法,对去噪后的运行数据进行描述性统计分析,检测是否存在与其他运行数据明显偏离的异常值;使用箱线图,通过判断数据点与其它数据点的偏离程度,识别异常值,将被识别为异常值的数据点从数据集中删除。Use statistical methods to perform descriptive statistical analysis on the denoised operating data to detect whether there are outliers that deviate significantly from other operating data; use box plots to identify outliers by judging the degree of deviation between data points and other data points, and delete data points identified as outliers from the data set. 3.根据权利要求2所述的风电机的在线状态评估方法,其特征在于,识别异常值的过程,包括:3. The online state assessment method of a wind turbine according to claim 2, characterized in that the process of identifying abnormal values comprises: 计算去噪后的运行数据的均值和标准差,分别表示数据的中心趋势和离散程度;绘制箱线图显示运行数据的分布情况,箱线图包括上边缘、上四分位数、中位数、下四分位数和下边缘,异常值定义为超出上下四分位数1.5倍的四分位距的数据点;Calculate the mean and standard deviation of the denoised running data, which represent the central tendency and dispersion of the data respectively; draw a box plot to show the distribution of the running data. The box plot includes the upper edge, upper quartile, median, lower quartile and lower edge. The outlier is defined as a data point that exceeds the interquartile range of 1.5 times the upper and lower quartiles. 计算每个数据点的Z-score值,表示该数据点与平均值之间的偏离程度,公式为:Z=(X-μ)/σ,其中X为数据点,μ为均值,σ为标准差,Z-score大于3或小于-3的数据点被视为异常值;Calculate the Z-score value of each data point, which indicates the degree of deviation between the data point and the mean value. The formula is: Z = (X-μ)/σ, where X is the data point, μ is the mean, and σ is the standard deviation. Data points with a Z-score greater than 3 or less than -3 are considered outliers. 进行Grubbs’Test识别单个异常值,通过计算每个数据点与均值之间的差异,找出与其他数据点明显偏离的数据点。Grubbs’ Test is performed to identify single outliers. The difference between each data point and the mean is calculated to find data points that deviate significantly from other data points. 4.根据权利要求1所述的风电机的在线状态评估方法,其特征在于,在线状态评估模型的构建过程,包括:4. The online state assessment method of a wind turbine according to claim 1, characterized in that the process of constructing an online state assessment model comprises: 使用缓解算法Relief从经过异常值检测处理的运行数据中提取目标状态特征,计算特征与目标状态之间的相关性来评估特征的重要性;Use the relief algorithm Relief to extract target state features from the running data processed by outlier detection, and calculate the correlation between the features and the target state to evaluate the importance of the features; 根据相关性指标对提取到的目标状态特征进行选择,相关性指标确定哪些特征对于描述风电机的运行状态最为重要;The extracted target state features are selected according to the correlation index, which determines which features are most important for describing the operating state of the wind turbine; 利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;在训练过程中,使用机器学习算法构建模型,以根据目标状态特征对风电机的在线状态进行评估。The online state assessment model of the wind turbine is established by training with the operating data containing the target state characteristics. During the training process, the model is constructed using the machine learning algorithm to assess the online state of the wind turbine according to the target state characteristics. 5.根据权利要求4所述的风电机的在线状态评估方法,其特征在于,缓解算法Relief地过程,包括:5. The online state assessment method of a wind turbine according to claim 4, characterized in that the relief algorithm relief process comprises: 初始化特征权重,对于每个特征,初始化其权重为0;随机选择一个样本作为参考样本;根据欧氏距离计算其他样本与参考样本之间的相似度,并根据相似度对其他样本进行排序;Initialize feature weights. For each feature, initialize its weight to 0. Randomly select a sample as a reference sample. Calculate the similarity between other samples and the reference sample based on the Euclidean distance, and sort other samples according to the similarity. 对于每个特征,计算参考样本与最近邻样本之间的差异值;如果最近邻样本属于目标状态类别,则增加特征权重;如果最近邻样本不属于目标状态类别,则减少特征权重;For each feature, calculate the difference between the reference sample and the nearest neighbor sample; if the nearest neighbor sample belongs to the target state category, increase the feature weight; if the nearest neighbor sample does not belong to the target state category, reduce the feature weight; 根据差异值和样本权重更新特征权重,重复直到遍历所有样本;根据特征权重进行特征选择,选择达到权重阈值的特征作为目标状态特征。Update the feature weight according to the difference value and sample weight, and repeat until all samples are traversed; perform feature selection based on the feature weight, and select the features that reach the weight threshold as the target state features. 6.根据权利要求4所述的风电机的在线状态评估方法,其特征在于,根据相关性指标对提取到的目标状态特征进行选择的过程,包括:6. The online state assessment method of a wind turbine according to claim 4 is characterized in that the process of selecting the extracted target state features according to the correlation index comprises: 计算特征与目标状态之间的相关性,对于每个特征,通过互信息计算其与目标状态之间的相关性指标;Calculate the correlation between the features and the target state. For each feature, calculate the correlation index between it and the target state through mutual information; 根据互信息的值,确定哪些特征对于描述风电机的运行状态最为重要,设置一个互信息的阈值,只选择互信息值超过阈值的特征作为目标状态特征;互信息值越大表示特征与目标状态之间的相关性越高,被认为是重要特征;According to the mutual information value, determine which features are most important for describing the operating status of the wind turbine, set a mutual information threshold, and only select features whose mutual information value exceeds the threshold as target state features; the larger the mutual information value, the higher the correlation between the feature and the target state, and it is considered to be an important feature; 根据相关性指标,按照相关性指标的大小进行排序,选择排名靠前的特征作为目标状态特征;According to the relevance index, sort according to the size of the relevance index and select the top-ranked features as the target state features; 互信息的计算过程,包括:The calculation process of mutual information includes: 计算特征和目标状态各自的概率分布,即特征的概率分布和目标状态的概率分布;计算特征和目标状态的联合概率分布,即特征和目标状态同时发生的概率;Calculate the probability distribution of each feature and target state, that is, the probability distribution of the feature and the probability distribution of the target state; calculate the joint probability distribution of the feature and the target state, that is, the probability of the feature and the target state occurring at the same time; 步根据边缘概率分布和联合概率分布,计算特征与目标状态之间的互信息,互信息的计算公式如下:Step 2: According to the marginal probability distribution and the joint probability distribution, the mutual information between the feature and the target state is calculated. The calculation formula of the mutual information is as follows: I(X,Y)=∑P(x,y)*log(P(x,y)/(P(x)*P(y)))I(X,Y)=∑P(x,y)*log(P(x,y)/(P(x)*P(y))) 其中,X表示特征,Y表示目标状态,P(x,y)表示特征和目标状态同时发生的概率,P(x)和P(y)分别表示特征和目标状态的边缘概率分布;Where X represents the feature, Y represents the target state, P(x,y) represents the probability of the feature and the target state occurring at the same time, and P(x) and P(y) represent the marginal probability distribution of the feature and the target state respectively; 将互信息的计算结果与互信息的阈值进行对比,选择处互信息值超过阈值的特征作为目标状态特征。The calculated result of mutual information is compared with the threshold of mutual information, and the features whose mutual information value exceeds the threshold are selected as the target state features. 7.根据权利要求4所述的风电机的在线状态评估方法,其特征在于,使用机器学习算法构建模型的过程,包括:7. The online state assessment method of a wind turbine according to claim 4, characterized in that the process of constructing a model using a machine learning algorithm comprises: 对包含目标状态特征的运行数据进行归一化预处理,选择随机森林的机器学习算法进行建模;Normalize and preprocess the running data containing target state features, and select the random forest machine learning algorithm for modeling; 将预处理后的数据集划分为训练集和测试集,训练集用于在线状态评估模型的训练和参数调优,测试集用于评估在线状态评估模型的性能;The preprocessed data set is divided into a training set and a test set. The training set is used for training and parameter tuning of the online state assessment model, and the test set is used to evaluate the performance of the online state assessment model. 使用训练集对选定的机器学习算法进行训练,调整在线状态评估模型的参数;将训练集输入到机器学习算法中进行训练,训练集包含经过预处理的运行数据和对应的目标状态;Using the training set to train the selected machine learning algorithm, and adjusting the parameters of the online state assessment model; inputting the training set into the machine learning algorithm for training, the training set including the preprocessed operating data and the corresponding target state; 在训练过程中,机器学习算法根据输入的特征和目标状态之间的关系来学习在线状态评估模型的参数;根据训练数据的特征和目标状态的对应关系调整在线状态评估模型的权重和偏置;During the training process, the machine learning algorithm learns the parameters of the online state assessment model based on the relationship between the input features and the target state; the weights and biases of the online state assessment model are adjusted based on the corresponding relationship between the features of the training data and the target state; 训练过程中,进行调整决策树的深度、随机森林中树的数量参数调优,训练完成后,得到了训练好的在线状态评估模型;During the training process, the depth of the decision tree and the number of trees in the random forest are adjusted. After the training is completed, a trained online state evaluation model is obtained; 其中,通过尝试不同的参数取值,计算相应的损失函数值,并选择使得损失函数最小的参数组合作为最终的模型参数设置。Among them, by trying different parameter values, the corresponding loss function value is calculated, and the parameter combination that minimizes the loss function is selected as the final model parameter setting. 8.一种风电机的在线状态评估装置,其应用于如权利要求1至7之一所述的风电机的在线状态评估方法,其特征在于,所述风电机的在线状态评估装置包括:8. An online status assessment device for a wind turbine, applied to the online status assessment method for a wind turbine according to any one of claims 1 to 7, characterized in that the online status assessment device for a wind turbine comprises: 数据处理模块,用于通过传感器设备,实时采集风电机的振动数据、温度数据、电流数据及功率数据运行数据,对采集到的运行数据进行异常值检测处理;The data processing module is used to collect the vibration data, temperature data, current data and power data of the wind turbine in real time through sensor equipment, and perform abnormal value detection and processing on the collected operating data; 模型构建模块,用于通过缓解算法Relief,从异常值检测处理后的运行数据中提取目标状态特征,目标状态特征用于描述风电机的运行状态;根据特征的相关性指标,对提取到的目标状态特征进行选择;The model building module is used to extract target state features from the operating data after outlier detection processing through the relief algorithm Relief. The target state features are used to describe the operating state of the wind turbine. The extracted target state features are selected according to the correlation index of the features. 状态评估模块,用于利用包含目标状态特征的运行数据进行训练,建立风电机的在线状态评估模型;通过交叉验证,对建立的在线状态评估模型进行评估,利用建立好的在线状态评估模型,对实时采集的数据进行预测和判断,评估风电机的在线状态;将评估结果可视化展示,并根据评估结果进行相应的操作和维护。The status assessment module is used to use the operating data containing the target status characteristics for training to establish an online status assessment model for the wind turbine; evaluate the established online status assessment model through cross-validation, use the established online status assessment model to predict and judge the real-time collected data, and evaluate the online status of the wind turbine; visualize the evaluation results, and perform corresponding operations and maintenance based on the evaluation results. 9.一种电子设备,其特征在于,包括处理器、以及与所述处理器耦接的存储器,所述存储器存储有可被所述处理器执行的程序指令;所述处理器执行所述存储器存储的所述程序指令时实现如权利要求1至7中任一项所述的风电机的在线状态评估方法。9. An electronic device, characterized in that it comprises a processor and a memory coupled to the processor, the memory storing program instructions executable by the processor; when the processor executes the program instructions stored in the memory, the online status assessment method of the wind turbine as described in any one of claims 1 to 7 is implemented. 10.一种存储介质,其特征在于,所述存储介质内存储有程序指令,所述程序指令被处理器执行时实现能够实现如权利要求1至7中任一项所述的风电机的在线状态评估方法。10. A storage medium, characterized in that program instructions are stored in the storage medium, and when the program instructions are executed by a processor, the online state assessment method of a wind turbine according to any one of claims 1 to 7 can be implemented.
CN202410878613.9A 2024-07-02 2024-07-02 Online state evaluation method, device and equipment of wind motor and storage medium Pending CN118669281A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410878613.9A CN118669281A (en) 2024-07-02 2024-07-02 Online state evaluation method, device and equipment of wind motor and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410878613.9A CN118669281A (en) 2024-07-02 2024-07-02 Online state evaluation method, device and equipment of wind motor and storage medium

Publications (1)

Publication Number Publication Date
CN118669281A true CN118669281A (en) 2024-09-20

Family

ID=92722457

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410878613.9A Pending CN118669281A (en) 2024-07-02 2024-07-02 Online state evaluation method, device and equipment of wind motor and storage medium

Country Status (1)

Country Link
CN (1) CN118669281A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119167240A (en) * 2024-11-21 2024-12-20 云南电网有限责任公司 A fault cause analysis system based on machine learning

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119167240A (en) * 2024-11-21 2024-12-20 云南电网有限责任公司 A fault cause analysis system based on machine learning

Similar Documents

Publication Publication Date Title
CN118504794B (en) Intelligent fault diagnosis system of electric spark forming machine
CN118094438B (en) Distribution cable operation analysis method and system based on big data
CN118011138A (en) A data-driven power grid fault diagnosis method and system
CN118261584B (en) Transformer state evaluation method and system based on multi-parameter data
CN118536048B (en) Switchgear insulation status monitoring and management system
CN118075791A (en) Wireless communication network fault prediction method and system
CN118364381B (en) Direct-current power supply early warning operation and maintenance method and system based on gradual change type data statistical analysis
CN118245917A (en) Fault detection method and system for servo inverter
CN117134503B (en) State monitoring method and system for large-scale power supply device
CN116629627A (en) Intelligent detection system of power transmission on-line monitoring device
CN118554636A (en) Unmanned inspection method and unmanned inspection device for electric power facilities of power storage station
CN119204709B (en) An AI-based intelligent risk assessment system and method for scientific and technological equipment
CN118669281A (en) Online state evaluation method, device and equipment of wind motor and storage medium
CN119231760A (en) Electrical distribution cabinet and control method thereof
CN118779643A (en) Ink coating distribution monitoring system and method based on big data
CN117574180B (en) Fuel production and emission system data correlation control management system
CN114117887A (en) Real-time evaluation method, system and medium for online power generation performance of wind turbines
CN117390556A (en) Fault monitoring method and device for diesel generator set
CN119807713B (en) Station electric equipment steady-state feature extraction system based on load perception
CN119675591B (en) PID detection method and device for photovoltaic module and electronic equipment
CN118747330A (en) A device monitoring method and system based on chromaticity spectrum mapping
CN119716362A (en) Station power consumption system safety energy-efficiency monitoring system based on multisource information fusion
CN119716407A (en) Online monitoring platform for cable insulation status in distribution network based on equivalent electrical characteristics
CN118966813A (en) A method, device, electronic equipment and storage medium for automatically locating high-loss transmission area
CN118860716A (en) Mainboard fault detection method and computer equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination