[go: up one dir, main page]

CN110427974A - A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine - Google Patents

A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine Download PDF

Info

Publication number
CN110427974A
CN110427974A CN201910614733.7A CN201910614733A CN110427974A CN 110427974 A CN110427974 A CN 110427974A CN 201910614733 A CN201910614733 A CN 201910614733A CN 110427974 A CN110427974 A CN 110427974A
Authority
CN
China
Prior art keywords
support vector
data
vector machine
health status
generalized support
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910614733.7A
Other languages
Chinese (zh)
Inventor
吴军
郭鹏飞
陈作懿
严喆
程一伟
邓超
朱海平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong University of Science and Technology
Original Assignee
Huazhong University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong University of Science and Technology filed Critical Huazhong University of Science and Technology
Priority to CN201910614733.7A priority Critical patent/CN110427974A/en
Publication of CN110427974A publication Critical patent/CN110427974A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

本发明属于设备状态监测相关技术领域,其公开了一种基于广义支持向量机的液压部件健康状态检测方法,该方法包括以下步骤:(1)获取多组样本液压部件处于不同健康状态下的传感器信号以作为样本数据,且同时获得对应的的健康状态数据;(2)计算所得到的特征数据与所述健康状态数据之间的相关性,并选取训练特征;(3)采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了广义支持向量机检测模型;(4)将得到的待测液压部件的实时传感器信号数据输入到所述广义支持向量机检测模型,以进行健康状态检测。本发明提高了检测精度及灵活性,适用性较强。

The invention belongs to the related technical field of equipment state monitoring, and discloses a method for detecting the health status of hydraulic components based on generalized support vector machines. The signal is used as sample data, and the corresponding health status data is obtained at the same time; (2) the correlation between the obtained feature data and the health status data is calculated, and the training features are selected; (3) the stacking integration learning method is adopted Build a plurality of generalized support vector machine models, and use the data output in the generalized support vector machine model training process to train to obtain the random forest model, thus obtained the generalized support vector machine detection model; (4) will obtain The real-time sensor signal data of the measured hydraulic components is input to the generalized support vector machine detection model for health status detection. The invention improves detection precision and flexibility, and has strong applicability.

Description

一种基于广义支持向量机的液压部件健康状态检测方法A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine

技术领域technical field

本发明属于设备状态监测相关技术领域,更具体地,涉及一种基于广义支持向量机的液压部件健康状态检测方法。The invention belongs to the related technical field of equipment state monitoring, and more specifically relates to a method for detecting the health state of hydraulic components based on a generalized support vector machine.

背景技术Background technique

液压系统是工业机电设备中的关键组成部分,其运行状态直接影响整个机电设备的稳定性与可靠性。若液压系统出现突发故障,将会导致设备停机,影响企业生产效率,给企业带来严重经济损失。因此,对液压系统进行健康状态感知是非常必要的。The hydraulic system is a key component of industrial electromechanical equipment, and its operating status directly affects the stability and reliability of the entire electromechanical equipment. If the hydraulic system fails suddenly, it will cause the equipment to stop, affect the production efficiency of the enterprise, and bring serious economic losses to the enterprise. Therefore, it is very necessary to perceive the health status of the hydraulic system.

液压系统健康状态并不能简单地划分为正常运行与故障损坏,其损坏过程是一个长期的、不断加深的过程。健康状态感知是对其损坏过程中的任意时间节点进行感知,得到的故障状态可能会有多种,工作人员需要根据对液压系统部件的健康状态感知得到的不同的故障状态做出相应的维修处理措施,以防止事故的发生,避免经济损失及人员伤害。The health status of the hydraulic system cannot be simply divided into normal operation and fault damage, and the damage process is a long-term and deepening process. Health status perception is to perceive any time node in the damage process, and the obtained fault states may be various. The staff need to make corresponding repairs according to the different fault states obtained from the health state perception of hydraulic system components Measures to prevent accidents, avoid economic losses and personal injuries.

支持向量机(Support vector machine,SVM)是一种有监督的机器学习方法,可以实现对损坏与非损坏的特征值分类,具有样本量需求小,诊断速度快等优点。然而,经典支持向量机只适用于二分类的诊断问题,常用的帮助支持向量机进行多分类的方法OVO、OVA等又会导致诊断效果下降,且其他分类方法的样本需求量较大,诊断速度较慢。相应地,本领域存在着发展一种精度较好的基于广义支持向量机的液压部件健康状态检测方法的技术需求,以实现机电设备液压系统的健康状态的准确感知。Support vector machine (Support vector machine, SVM) is a supervised machine learning method, which can realize the classification of damaged and non-damaged eigenvalues, and has the advantages of small sample size requirements and fast diagnosis speed. However, the classic support vector machine is only suitable for the diagnosis of two classifications. The commonly used methods to help the support vector machine to perform multi-classification, such as OVO and OVA, will lead to a decline in the diagnosis effect, and the demand for samples of other classification methods is large, and the diagnosis speed slower. Correspondingly, there is a technical requirement in this field to develop a method for detecting the health status of hydraulic components based on generalized support vector machines with better accuracy, so as to realize accurate perception of the health status of the hydraulic system of electromechanical equipment.

发明内容Contents of the invention

针对现有技术的以上缺陷或改进需求,本发明提供了一种基于广义支持向量机的液压部件健康状态检测方法,其基于现有液压系统的健康状态监测特点,研究及设计了一种精度较好的基于广义支持向量机的液压部件健康状态检测方法。所述检测方法采用了基于不同核函数的广义支持向量机对液压系统的各种液压部件的多种健康状态进行感知,并使用集成学习方法将基于广义支持向量机的健康状态感知结果进行集成以得到最终的健康状态感知结果,由此实现了液压部件的健康状态的感知,提高了检测精度及灵活性,适用性较强。Aiming at the above defects or improvement needs of the prior art, the present invention provides a generalized support vector machine-based method for detecting the health status of hydraulic components. A good health detection method for hydraulic components based on generalized support vector machines. The detection method uses a generalized support vector machine based on different kernel functions to perceive multiple health states of various hydraulic components of the hydraulic system, and uses an integrated learning method to integrate the health state perception results based on the generalized support vector machine. The final health status perception result is obtained, thereby realizing the health status perception of hydraulic components, improving detection accuracy and flexibility, and having strong applicability.

为实现上述目的,本发明提供了一种基于广义支持向量机的液压部件健康状态检测方法,所述检测方法包括以下步骤:In order to achieve the above object, the present invention provides a method for detecting the state of health of hydraulic components based on generalized support vector machines, the detection method comprising the following steps:

(1)获取多组样本液压部件处于不同健康状态下的传感器信号以作为样本数据,且同时获得每个样本数据对应的样本液压部件所处的健康状态数据;(1) Obtain multiple sets of sensor signals of sample hydraulic components in different health states as sample data, and simultaneously obtain the health state data of the sample hydraulic components corresponding to each sample data;

(2)对所述样本数据进行数据清洗,并对清洗后的所述样本数据提取统计特征;(2) performing data cleaning on the sample data, and extracting statistical features from the cleaned sample data;

(3)基于皮尔森相关系数计算所得到的特征数据与所述健康状态数据之间的相关性,并将所述特征数据按照对应的相关性自大到小进行排序,选取排在前预定个数的特征作为训练特征;(3) Calculate the correlation between the characteristic data obtained based on the Pearson correlation coefficient and the health status data, and sort the characteristic data according to the corresponding correlation from large to small, and select the first predetermined one The features of numbers are used as training features;

(4)基于所述训练特征及所述训练特征所对应的健康状态数据,采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了由堆叠集成学习方法集成的广义支持向量机检测模型;(4) Based on the health status data corresponding to the training features and the training features, a plurality of generalized support vector machine models are constructed by stacking integrated learning methods, and the data output during the training process of the generalized support vector machine models are used To train the random forest model, and thus obtain the generalized support vector machine detection model integrated by the stacked ensemble learning method;

(5)在线实时获取待测液压部件的传感器信号数据,并将得到的传感器信号数据进行处理后输入到所述广义支持向量机检测模型,进而所述广义支持向量机检测模型对待测液压部件进行健康状态检测。(5) Obtain the sensor signal data of the hydraulic component to be tested in real time online, and input the obtained sensor signal data into the generalized support vector machine detection model after processing, and then perform the generalized support vector machine detection model on the hydraulic component to be measured Health status detection.

进一步地,所述传感器信号包括温度、流量、压力及功率。Further, the sensor signal includes temperature, flow, pressure and power.

进一步地,所述统计特征包括绝对均值、有效值、方根幅值、歪度、峭度、峭度指标、歪度指标及波形指标。Further, the statistical features include absolute mean value, effective value, square root amplitude, skewness, kurtosis, kurtosis index, skewness index and waveform index.

进一步地,采用中值滤波对所述样本数据进行清洗。Further, median filtering is used to clean the sample data.

进一步地,采用皮尔森相关系数进行特征选择时,样本数据与健康状态标签数据的相关性系数的计算公式为:Further, when using the Pearson correlation coefficient for feature selection, the calculation formula of the correlation coefficient between the sample data and the health status label data is:

式中,cov(X,Y)表示协方差;σX表示所提取的特征数据集X的标准差;σY表示Y的标准差;X[i]表示第i种特征的样本数据;Y表示样本数据对应的健康状态标签数据。In the formula, cov(X, Y) represents the covariance; σ X represents the standard deviation of the extracted feature data set X; σ Y represents the standard deviation of Y; X[i] represents the sample data of the i-th feature; The health status label data corresponding to the sample data.

进一步地,所述预定个数为3个。Further, the predetermined number is 3.

进一步地,所述广义支持向量机模型包括多个不同核函数。Further, the generalized support vector machine model includes multiple different kernel functions.

进一步地,获取不同健康状态下的样本液压部件所关联的两个传感器的信号作为样本数据。Further, the signals of two sensors associated with the sample hydraulic components in different health states are acquired as sample data.

总体而言,通过本发明所构思的以上技术方案与现有技术相比,本发明提供的基于广义支持向量机的液压部件健康状态检测方法主要具有以下有益效果:Generally speaking, compared with the prior art through the above technical solutions conceived by the present invention, the generalized support vector machine-based hydraulic component health status detection method provided by the present invention mainly has the following beneficial effects:

1.基于皮尔森相关系数计算所得到的特征数据与所述健康状态数据之间的相关性,使用皮尔森相关性系数,可以直观反映提取特征值与部件健康状态的相关程度,以用来选择相关性最高,效果最好的特征值数据,进而提高最终分类效果的精度,进而提高检测精度。1. Based on the correlation between the characteristic data calculated by the Pearson correlation coefficient and the health status data, using the Pearson correlation coefficient can intuitively reflect the degree of correlation between the extracted feature value and the health status of the component for selection The eigenvalue data with the highest correlation and the best effect can improve the accuracy of the final classification effect, thereby improving the detection accuracy.

2.采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了由堆叠集成学习方法集成的广义支持向量机检测模型,使用Stacking集成学习方法,可以将多种分类器集成起来,提高了准确率,其中Stacking方法中还包括K折交叉验证,此方法可以有效降低分类器训练时的过拟合。2. Use the stacked ensemble learning method to construct multiple generalized support vector machine models, and use the data output in the generalized support vector machine model training process to train the random forest model, thus obtaining the integrated by the stacked ensemble learning method The generalized support vector machine detection model uses the Stacking integrated learning method to integrate multiple classifiers to improve the accuracy rate. The Stacking method also includes K-fold cross-validation, which can effectively reduce the overfitting of the classifier training. combine.

3.使用广义支持向量机方法,可以弥补传统SVM只能二分类的问题,而且对比其他的多分类方法,广义支持向量机训练时间短,分类效果更好。3. Using the generalized support vector machine method can make up for the problem that the traditional SVM can only be divided into two categories. Compared with other multi-classification methods, the generalized support vector machine has a shorter training time and better classification effect.

4.所述检测方法能够及时感知液压部件的健康状态,并提高了液压部件健康状态感知的准确性和有效性,且还可以用于液压系统健康状态的感知,为解决液压系统健康状态感知问题提供了一种新思路。4. The detection method can sense the health state of hydraulic components in time, and improves the accuracy and effectiveness of hydraulic component health state perception, and can also be used for the perception of hydraulic system health state, in order to solve the problem of hydraulic system health state perception A new way of thinking is provided.

附图说明Description of drawings

图1是本发明提供的基于广义支持向量机的液压部件健康状态检测方法的流程示意图;Fig. 1 is the schematic flow chart of the method for detecting the state of health of a hydraulic component based on a generalized support vector machine provided by the present invention;

图2是图1中的基于广义支持向量机的液压部件健康状态检测方法涉及的堆叠集成广义多分类支持向量机的模型构建与在线样本测试的流程示意图。Fig. 2 is a schematic flowchart of model building and online sample testing of the stacked integrated generalized multi-classification support vector machine involved in the generalized support vector machine-based hydraulic component health state detection method in Fig. 1 .

具体实施方式Detailed ways

为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。此外,下面所描述的本发明各个实施方式中所涉及到的技术特征只要彼此之间未构成冲突就可以相互组合。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

请参阅图1及图2,本发明提供的基于广义支持向量机的液压部件健康状态检测方法,所述检测方法能够实现机电设备健康状态诊断的精确化,其主要包括以下步骤:Please refer to Fig. 1 and Fig. 2, the generalized support vector machine based hydraulic component health status detection method provided by the present invention, the detection method can realize the precision of the health status diagnosis of electromechanical equipment, and it mainly includes the following steps:

步骤一,获取多组样本液压部件处于不同健康状态下的传感器信号以作为样本数据,且同时获得每个样本数据对应的样本液压部件所处的健康状态数据。Step 1: Obtain multiple sets of sensor signals of the sample hydraulic components in different health states as sample data, and simultaneously obtain the health state data of the sample hydraulic components corresponding to each sample data.

具体地,获取不同健康状态下的样本液压部件所关联的两个传感器的信号,如温度、流量、压力、功率等传感器信号作为样本数据,并且得到每个传感器信号所对应的样本液压部件的健康状态数据。本实施方式中,获取样本液压部件在Y种健康状态下的2个关联传感器信号,而每个信号样本对应的状态也是可以获取的;当然,当对象为液压系统时,可以针对液压系统的多个液压部件分别进行健康状态检测,进而综合得到的多个所述健康状态检测结果来确定所述液压系统的健康状态,这里关联的传感器是由液压系统来确定的,不同的液压系统里的某个液压部件对应的传感器不同,选择的传感器数量也就不同。Specifically, the signals of two sensors associated with the sample hydraulic components in different health states, such as temperature, flow, pressure, power and other sensor signals, are obtained as sample data, and the health of the sample hydraulic components corresponding to each sensor signal is obtained status data. In this embodiment, the two associated sensor signals of the sample hydraulic components in Y health states are obtained, and the state corresponding to each signal sample can also be obtained; of course, when the object is a hydraulic system, multiple Each hydraulic component performs health status detection respectively, and then the health status of the hydraulic system is determined by combining multiple health status detection results obtained. Here, the associated sensor is determined by the hydraulic system. A certain hydraulic system in different hydraulic systems The sensors corresponding to each hydraulic component are different, and the number of sensors selected is also different.

步骤二,对所述样本数据进行数据清洗,并对清洗后的所述样本数据提取统计特征。Step 2, performing data cleaning on the sample data, and extracting statistical features from the cleaned sample data.

具体地,对得到的所述样本数据进行数据清洗以去除奇异值,并对清洗后的所述样本数据提取统计特征,所述统计特征包括绝对均值、有效值、方根幅值、歪度、峭度、峭度指标、歪度指标及波形指标。Specifically, data cleaning is performed on the obtained sample data to remove singular values, and statistical features are extracted from the cleaned sample data, and the statistical features include absolute mean value, effective value, square root amplitude, skewness, Kurtosis, kurtosis index, skewness index and waveform index.

对所述样本数据进行清洗时涉及到中值滤波,具体包括以下步骤:The median filtering is involved in cleaning the sample data, which specifically includes the following steps:

(21)基于所述样本数据选择所对应的传感器中的一个传感器所采集的传感器信号数据的组数为m,每组包含数据点数为n,即传感器采集了m组数据,每组数据具有n个样本点。(21) The number of groups of sensor signal data collected by one of the corresponding sensors selected based on the sample data is m, and the number of data points included in each group is n, that is, the sensor has collected m groups of data, and each group of data has n sample points.

(22)确定中值滤波窗口的长度k。(22) Determine the length k of the median filter window.

(23)在第一组数据中取前k个数据,则:(23) Take the first k data in the first set of data, then:

N(k+i-1)/2=median(Ni,Ni+1,…,Ni+k-1)N (k+i-1)/2 =median(Ni,N i+1 ,...,N i+k-1 )

式中,Ni表示第i个数据;i初始为1;median()表示求中值,当(k+i-1)为奇数时加1。In the formula, N i represents the i-th data; i is initially 1; median() represents the median value, and 1 is added when (k+i-1) is an odd number.

(24)i=i+1,重复步骤(23),直到i+k=n,如此所有m组数据都经过处理以得到中值滤波后的数据集。(24) i=i+1, repeat step (23) until i+k=n, so all m sets of data are processed to obtain a median-filtered data set.

(25)基于所述样本数据选择所对应的传感器中的另一个传感器所得到的m组传感器数据(每组包含n个数据点)进行步骤(22)至步骤(24)的处理。(25) Select m sets of sensor data (each set includes n data points) obtained by another sensor in the corresponding sensor based on the sample data, and perform the processing from step (22) to step (24).

本实施方式中所使用的8个统计特征值见表1。See Table 1 for the 8 statistical feature values used in this embodiment.

表1统计特征指标Table 1 Statistical characteristic indicators

表中,xi为第i次采样的信号序列;i=1,2,…,N,N为总采样次数;σ为数据的标准差。所提出的统计特征集合为F=[F1,F2,…,F8],统计特征参数包含有量纲特征参数和无量纲特征参数,[F1,F2,…,F5]为有量纲统计指标,[F6,F7,F8]为无量纲统计特征指标。In the table, x i is the signal sequence of the i-th sampling; i=1, 2, ..., N, N is the total sampling times; σ is the standard deviation of the data. The proposed statistical feature set is F=[F 1 , F 2 ,…,F 8 ], the statistical feature parameters include dimensioned feature parameters and dimensionless feature parameters, [F 1 , F 2 ,…,F 5 ] is Dimensioned statistical indicators, [F 6 , F 7 , F 8 ] are dimensionless statistical characteristic indicators.

步骤三,基于皮尔森相关系数计算所得到的特征数据与所述健康状态数据之间的相关性,并将所述特征数据按照对应的相关性自大到小进行排序,选取排在前预定个数的特征作为训练特征。Step 3: Calculate the correlation between the characteristic data obtained based on the Pearson correlation coefficient and the health status data, and sort the characteristic data according to the corresponding correlation from large to small, and select the top predetermined Number features are used as training features.

具体地,所述预定个数为N,其可以根据最终效果进行改变。使用皮尔森相关系数进行的特征选择主要包括以下步骤:Specifically, the predetermined number is N, which can be changed according to the final effect. Feature selection using the Pearson correlation coefficient mainly includes the following steps:

(31)确定所提取的特征数据对应的真实的健康状态标签数据集Y,所选择的传感器中的一个的特征数据集X,X中包含提取的8种特征数据,X[i]表示第i种特征的样本数据。(31) Determine the real health status label data set Y corresponding to the extracted feature data, the feature data set X of one of the selected sensors, X contains 8 kinds of feature data extracted, and X[i] represents the i-th sample data of this feature.

(32)使用以下公式计算获得X[i]与Y的相关性系数:(32) Use the following formula to calculate the correlation coefficient between X[i] and Y:

得到8种特征的8组相关系数,其中cov(X,Y)表示协方差;σX表示X的标准差,σY表示Y的标准差。Get 8 sets of correlation coefficients for 8 features, where cov(X, Y) means covariance; σ X means the standard deviation of X, and σ Y means the standard deviation of Y.

(33)根据每个特征得到的相关系数,取相关系数最大的3种特征作为堆叠(Stacking)集成广义多分类支持向量机模型的输入数据集。(33) According to the correlation coefficient obtained by each feature, take the three features with the largest correlation coefficient as the input data set of the stacking integrated generalized multi-class support vector machine model.

(34)对所选择的另一个传感器提取的特征进行步骤(31)至步骤(33)处理,同样选择3种特征作为堆叠集成广义多分类支持向量机模型的输入数据集。(34) Perform steps (31) to (33) on the features extracted by another selected sensor, and also select three types of features as the input data set of the stacked and integrated generalized multi-classification support vector machine model.

步骤四,基于所述训练特征及所述训练特征所对应的健康状态数据,采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了由堆叠集成学习方法集成的广义支持向量机检测模型。Step 4, based on the training features and the health status data corresponding to the training features, use the stacked ensemble learning method to construct multiple generalized support vector machine models, and use the data output during the generalized support vector machine model training process To train the random forest model, and thus obtain the generalized support vector machine detection model integrated by the stacked ensemble learning method.

具体地,使用得到的N个特征与每个训练特征对应的健康状态,构建K×M个广义支持向量机模型并得到模型训练过程中输出的数据,使用K×M个广义支持向量机的输出数据再去训练一个随机森林模型,至此得到由堆叠(Stacking)方法集成的广义支持向量机检测模型。Specifically, use the obtained N features and the health status corresponding to each training feature to construct K×M generalized support vector machine models and obtain the output data during model training, and use the output of K×M generalized support vector machines The data is then used to train a random forest model, so far a generalized support vector machine detection model integrated by the stacking method is obtained.

本实施方式中,广义支持向量机检测模型的构建主要包括以下步骤:In this embodiment, the construction of the generalized support vector machine detection model mainly includes the following steps:

(41)使用得到的两种传感器的经过选择的特征值数据,每一个传感器的特征值数据有m组,将每个传感器的m组特征值数据合并至一个大数据集X,假设每个传感器有4种特征,一个传感器有100组数据,则X中数据为2*4=8列,100行。(41) Using the selected eigenvalue data of the two sensors obtained, each sensor has m sets of eigenvalue data, and merges the m sets of eigenvalue data of each sensor into a large data set X, assuming that each sensor There are 4 types of features, one sensor has 100 sets of data, then the data in X is 2*4=8 columns, 100 rows.

(42)使用大数据集X中的每一行特征值数据对应的健康状态标签数据Y,Y为m行1列。(42) Use the health status label data Y corresponding to each row of eigenvalue data in the large data set X, and Y is m rows and 1 column.

(43)确定K折交叉验证中参数K的值,将X按行均分为K个集合,每个集合有行数据,将K个集合组合K次,每种组合包含训练集与校验集,确保K个集合中的每一个集合都在K种组合中作为一次校验集,而每种组合的训练集为剩下的K-1个集合的组合。具体的,假如K=3,则将X按行均分为3个集合a,b,c;将3个集合组合三次,其中第一个组合里a为校验集,b、c共同作为训练集,第二个组合里b为校验集,a、c共同作为训练集,第三个组合类似。(43) Determine the value of the parameter K in the K-fold cross-validation, and divide X into K sets by row, and each set has Row data, combine K sets K times, each combination contains training set and verification set, ensure that each set of K sets is used as a verification set in K combinations, and the training set of each combination It is the combination of the remaining K-1 sets. Specifically, if K=3, then divide X into three sets a, b, and c by row; combine the three sets three times, in which a is the verification set in the first combination, and b and c are jointly used as training set, b in the second combination is the verification set, a and c are used as the training set together, and the third combination is similar.

(44)使用K个线性核的广义多分类支持向量机对K种组合进行训练与分类,每次训练后对校验集进行预测,以得到对校验集的预测数据,K个已训练的线性核广义支持向量机模型对校验集预测得到K个预测数据集合,将K个集合校验集预测数据合并,以得到与原始X数据集的行数m相同行数的数据,将这个集合命名为H,H作为随机森林模型的训练输入数据。(44) Use the generalized multi-classification support vector machine with K linear kernels to train and classify K combinations, and predict the verification set after each training to obtain the prediction data for the verification set. K trained The linear kernel generalized support vector machine model predicts the verification set to obtain K prediction data sets, and merges the K sets of verification set prediction data to obtain data with the same number of rows as the original X data set m. Named as H, H is used as the training input data of the random forest model.

(45)使用K个多项式(Poly)核,K个径向基(RBF)核的广义多分类支持向量机,重复步骤(44),以得到另外两个集合H2,H3。所述支持向量机包括多个不同核函数。(45) Using K polynomial (Poly) kernels and K radial basis (RBF) kernels generalized multi-classification support vector machine, repeat step (44) to obtain another two sets H 2 , H 3 . The support vector machine includes a plurality of different kernel functions.

(46)使用随机森林模型,用数据集H,H2,H3训练一个随机森林模型,最终得到由堆叠(Stacking)集成的广义多分类支持向量机检测模型。(46) Use the random forest model, train a random forest model with the data sets H, H 2 , H 3 , and finally obtain a generalized multi-class support vector machine detection model integrated by stacking.

步骤五,在线实时获取待测液压部件的传感器信号数据,并将得到的传感器信号数据输入到所述广义支持向量机检测模型,进而所述广义支持向量机检测模型对待测液压部件进行健康状态检测。其中,所述传感器信号数据所对应的特征与所述训练特征相一致。Step 5: Obtain the sensor signal data of the hydraulic component to be tested in real time online, and input the obtained sensor signal data into the generalized support vector machine detection model, and then the generalized support vector machine detection model performs health status detection of the hydraulic component to be tested . Wherein, the features corresponding to the sensor signal data are consistent with the training features.

为了对本发明进行进一步的详细说明,使用液压系统健康状态数据来验证本方法,实验装置模拟了四种故障,其故障模式与控制参数如表2所示。In order to further describe the present invention in detail, the health state data of the hydraulic system is used to verify the method. The experimental device simulates four types of faults, and the fault modes and control parameters are shown in Table 2.

表2液压系统故障模式及其控制参数Table 2 Hydraulic system failure modes and their control parameters

每种进行健康状态感知的部件都有不同的健康状态,如表3所示:Each component that performs health status awareness has a different health status, as shown in Table 3:

表3健康状态标签与含义Table 3 Health Status Labels and Meanings

液压系统平台实验数据集中包含18种传感器数据,包括压力、功率、流量、温度、振动、效率传感器等,各传感器分别布置在相应部件位置,压力、功率传感器频率为100Hz,流量传感器频率为10Hz,温度、振动、效率传感器频率为1Hz。共采集了2205组数据,一组数据为60s,根据传感器频率不同每种传感器的一组数据中数据点也不同,每组数据都有相应的健康状态标签。The experimental data set of the hydraulic system platform contains 18 kinds of sensor data, including pressure, power, flow, temperature, vibration, efficiency sensors, etc., and each sensor is arranged at the corresponding component position. The frequency of the pressure and power sensors is 100 Hz, and the frequency of the flow sensor is 10 Hz. Temperature, vibration, efficiency sensor frequency is 1Hz. A total of 2205 sets of data were collected, one set of data was 60s, and the data points in each set of data of each sensor were different according to the frequency of the sensor, and each set of data had a corresponding health status label.

对于冷却器的健康状态感知选择了温度传感器TS3,TS4;对于阀的健康状态感知选择了压力传感器PS1、PS2;对于泵的健康状态感知选择了流量传感器FS1,系统效率传感器数据SE;对于蓄能器的健康状态感知选择功率传感器EPS1,压力传感器PS2。进行中值滤波得到滤波后的数据,之后对每种部件所选取的每个传感器的每组数据提取8种统计特征。The temperature sensors TS3 and TS4 are selected for the health status perception of the cooler; the pressure sensors PS1 and PS2 are selected for the health status perception of the valve; the flow sensor FS1 and the system efficiency sensor data SE are selected for the health status perception of the pump; for energy storage Select the power sensor EPS1 and pressure sensor PS2 for the health status perception of the device. Perform median filtering to obtain the filtered data, and then extract 8 statistical features for each set of data of each sensor selected for each component.

使用皮尔森系数进行相关性分析,以得到和部件的健康状态数据相关性最强的特征值数据。和冷却系统健康状态相关性最强的特征为有效值、绝对均值、峭度指标;和阀健康状态相关性最强的特征为方根幅值、歪度、波形指标;和泵健康状态相关性最强的特征为方根幅值、绝对均值、歪度指标;由于蓄能器的相关特征值的相关系数都很小,故使用所有的特征值。Correlation analysis is performed using the Pearson coefficient to obtain the eigenvalue data with the strongest correlation with the health status data of the component. The features most correlated with cooling system health status are effective value, absolute mean value, and kurtosis index; the features most correlated with valve health status are square root amplitude, skewness, waveform index; and pump health status correlation The strongest features are square root magnitude, absolute mean, and skewness index; since the correlation coefficients of the relevant eigenvalues of the accumulator are very small, all eigenvalues are used.

将每种部件对应的特征值数据分别合并为一个矩阵,即每种部件都有2205组数据,而由于选择特征值数目、传感器数量不同,得到的列数不同。同时得到相对应的4种2205行1列的标签数据集,每种数据集的标签种类如表3所示。Merge the eigenvalue data corresponding to each component into a matrix, that is, each component has 2205 sets of data, and the number of columns obtained is different due to the number of selected eigenvalues and the number of sensors. At the same time, four corresponding label data sets with 2205 rows and one column are obtained. The label types of each data set are shown in Table 3.

对每种部件的特征值矩阵中随机抽取70%作为训练集,30%作为测试集。使用Stacking集成广义支持向量机方法,其中取3折交叉验证,广义支持向量机分别取线性核、Poly核、RBF核。Randomly select 70% of the eigenvalue matrix of each component as the training set and 30% as the test set. Use Stacking to integrate the generalized support vector machine method, in which 3-fold cross-validation is adopted, and the generalized support vector machine adopts linear kernel, poly kernel and RBF kernel respectively.

对于每种核的广义支持向量机,在试验中不断调试参数,最终得到的参数如下:For the generalized support vector machine of each kernel, the parameters are continuously debugged in the experiment, and the final parameters are as follows:

线性核:k=0,p=1,lambda=1*10-8,epsilon=1*10-6Linear kernel: k=0, p=1, lambda=1*10 -8 , epsilon=1*10 -6 ;

Poly核:k=0,p=2,lambda=1*10-8,epsilon=1*10-6Poly core: k=0, p=2, lambda=1*10 -8 , epsilon=1*10 -6 ;

RBF核:k=0,p=1,lambda=1*10-8,epsilon=1*10-6RBF kernel: k=0, p=1, lambda=1*10 -8 , epsilon=1*10 -6 .

最终结果如表4所示:The final results are shown in Table 4:

表4分类结果与准确率Table 4 Classification results and accuracy

故障位置fault location 训练集个数Number of training sets 测试集个数Number of test sets 第一层准确率first layer accuracy 测试集预测准确率Test set prediction accuracy 冷却系统cooling system 15061506 649649 100%100% 100%100% 阀V10Valve V10 15061506 649649 100%100% 100%100% MP1泵MP1 pump 13351335 580580 100%100% 100%100% 蓄能器泄露accumulator leak 13351335 580580 82%82% 76%76%

为了说明本方法的准确性,将本方法与LDA方法、ANN方法、线性核SVM、RBF核SVM进行了对比,结果如表5所示,本方法的健康状态感知正确率优于其他方法。In order to illustrate the accuracy of this method, this method was compared with the LDA method, ANN method, linear kernel SVM, and RBF kernel SVM. The results are shown in Table 5. The health status perception accuracy of this method is better than other methods.

表5不同方法之间比较Table 5 Comparison between different methods

本发明提供的基于广义支持向量机的液压部件健康状态检测方法,所述检测方法结合了皮尔森相关系数、堆叠集成学习方法及广义支持向量机,提高了最终分类效果的精度,进而提高检测精度,且灵活性较好,适用性较强。The health state detection method of hydraulic components based on the generalized support vector machine provided by the present invention, the detection method combines the Pearson correlation coefficient, the stacking ensemble learning method and the generalized support vector machine, improves the accuracy of the final classification effect, and then improves the detection accuracy , and has good flexibility and strong applicability.

本领域的技术人员容易理解,以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。Those skilled in the art can easily understand that the above are only preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present invention, etc., All should be included within the protection scope of the present invention.

Claims (8)

1. a kind of hydraulic unit method for detecting health status based on Generalized Support Vector Machine, which is characterized in that this method includes Following steps:
(1) it obtains multiple groups sample hydraulic unit and be in the sensor signal under different health status using as sample data, and together When obtain state of health data locating for the corresponding sample hydraulic unit of each sample data;
(2) data cleansing is carried out to the sample data, and statistical nature is extracted to the sample data after cleaning;
(3) correlation between obtained characteristic and the state of health data is calculated based on Pearson correlation coefficients, And be ranked up the characteristic from large to small according to corresponding correlation, choose the feature conduct of predetermined number before coming Training characteristics;
(4) based on state of health data corresponding to the training characteristics and the training characteristics, using stacking integrated study side Method constructs multiple Generalized Support Vector Machine models, and using the number exported during the Generalized Support Vector Machine model training According to training to obtain Random Forest model, the Generalized Support Vector Machine detection integrated by stacking integrated learning approach is resulted in Model;
(5) it obtains the sensor signal data of hydraulic unit to be measured in real time online, and obtained sensor signal data is inputted To the Generalized Support Vector Machine detection model, and then the Generalized Support Vector Machine detection model carries out hydraulic unit to be measured Health status detection.
2. the hydraulic unit method for detecting health status based on Generalized Support Vector Machine, feature exist as described in claim 1 In: the sensor signal includes temperature, flow, pressure and power.
3. the hydraulic unit method for detecting health status based on Generalized Support Vector Machine, feature exist as described in claim 1 In: the statistical nature includes absolute mean, virtual value, root amplitude, flexure, kurtosis, kurtosis index, flexure index and waveform Index.
4. the hydraulic unit method for detecting health status based on Generalized Support Vector Machine, feature exist as described in claim 1 In: the sample data is cleaned using median filtering.
5. the hydraulic unit method for detecting health status based on Generalized Support Vector Machine, feature exist as described in claim 1 In: when carrying out feature selecting using Pearson correlation coefficients, the relative coefficient of sample data and health status label data Calculation formula are as follows:
In formula, cov (X, Y) indicates covariance;σXIndicate the standard deviation of extracted characteristic data set X;σYIndicate the standard deviation of Y; X [i] indicates the sample data of i-th kind of feature;Y indicates the corresponding health status label data of sample data.
6. the hydraulic unit method for detecting health status as described in any one in claim 1-5 based on Generalized Support Vector Machine, It is characterized by: the predetermined number is 3.
7. the hydraulic unit method for detecting health status as described in any one in claim 1-5 based on Generalized Support Vector Machine, It is characterized by: the Generalized Support Vector Machine model includes multiple and different kernel functions.
8. the hydraulic unit method for detecting health status as described in any one in claim 1-5 based on Generalized Support Vector Machine, It is characterized by: obtaining the signal of two sensors associated by the sample hydraulic unit under different health status as sample number According to.
CN201910614733.7A 2019-07-09 2019-07-09 A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine Pending CN110427974A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910614733.7A CN110427974A (en) 2019-07-09 2019-07-09 A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910614733.7A CN110427974A (en) 2019-07-09 2019-07-09 A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine

Publications (1)

Publication Number Publication Date
CN110427974A true CN110427974A (en) 2019-11-08

Family

ID=68409126

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910614733.7A Pending CN110427974A (en) 2019-07-09 2019-07-09 A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine

Country Status (1)

Country Link
CN (1) CN110427974A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111831708A (en) * 2020-07-16 2020-10-27 平安科技(深圳)有限公司 Sample analysis method, device, electronic device and medium based on missing data
CN112964460A (en) * 2021-03-29 2021-06-15 华南理工大学 System and method for monitoring loosening of dental implant repair screw
CN114239161A (en) * 2021-11-29 2022-03-25 武汉欧格莱液压动力设备有限公司 Mechanical state detection method for hydraulic element

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107895212A (en) * 2017-12-01 2018-04-10 国网山东省电力公司信息通信公司 Lead-acid battery life-span prediction method based on sliding window and various visual angles Fusion Features
CN108959566A (en) * 2018-07-04 2018-12-07 哈尔滨工业大学 A kind of medical text based on Stacking integrated study goes privacy methods and system
CN108985335A (en) * 2018-06-19 2018-12-11 中国原子能科学研究院 The integrated study prediction technique of nuclear reactor cladding materials void swelling
CN108986869A (en) * 2018-07-26 2018-12-11 南京群顶科技有限公司 A kind of disk failure detection method predicted using multi-model
CN109186964A (en) * 2018-07-28 2019-01-11 华中科技大学 Rotary machinery fault diagnosis method based on angle resampling and ROC-SVM
US10209974B1 (en) * 2017-12-04 2019-02-19 Banjo, Inc Automated model management methods
CN109359193A (en) * 2018-09-25 2019-02-19 济南大学 A method and system for abnormal telephone identification based on PCA dimensionality reduction with stacked two-layer framework
CN109514349A (en) * 2018-11-12 2019-03-26 西安交通大学 Tool wear state monitoring method based on vibration signal and Stacking integrated model
CN109781706A (en) * 2019-02-11 2019-05-21 上海应用技术大学 A training method for Raman spectroscopy recognition model of foodborne pathogens based on PCA-Stacking
CN109918708A (en) * 2019-01-21 2019-06-21 昆明理工大学 A Construction Method of Material Properties Prediction Model Based on Heterogeneous Ensemble Learning

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107895212A (en) * 2017-12-01 2018-04-10 国网山东省电力公司信息通信公司 Lead-acid battery life-span prediction method based on sliding window and various visual angles Fusion Features
US10209974B1 (en) * 2017-12-04 2019-02-19 Banjo, Inc Automated model management methods
CN108985335A (en) * 2018-06-19 2018-12-11 中国原子能科学研究院 The integrated study prediction technique of nuclear reactor cladding materials void swelling
CN108959566A (en) * 2018-07-04 2018-12-07 哈尔滨工业大学 A kind of medical text based on Stacking integrated study goes privacy methods and system
CN108986869A (en) * 2018-07-26 2018-12-11 南京群顶科技有限公司 A kind of disk failure detection method predicted using multi-model
CN109186964A (en) * 2018-07-28 2019-01-11 华中科技大学 Rotary machinery fault diagnosis method based on angle resampling and ROC-SVM
CN109359193A (en) * 2018-09-25 2019-02-19 济南大学 A method and system for abnormal telephone identification based on PCA dimensionality reduction with stacked two-layer framework
CN109514349A (en) * 2018-11-12 2019-03-26 西安交通大学 Tool wear state monitoring method based on vibration signal and Stacking integrated model
CN109918708A (en) * 2019-01-21 2019-06-21 昆明理工大学 A Construction Method of Material Properties Prediction Model Based on Heterogeneous Ensemble Learning
CN109781706A (en) * 2019-02-11 2019-05-21 上海应用技术大学 A training method for Raman spectroscopy recognition model of foodborne pathogens based on PCA-Stacking

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
YIWEI CHENG 等: "Multisensory Data-Driven Health Degradation Monitoring of Machining Tools by Generalized Multiclass Support Vector Machine", 《IEEE ACCESS》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111831708A (en) * 2020-07-16 2020-10-27 平安科技(深圳)有限公司 Sample analysis method, device, electronic device and medium based on missing data
WO2021151305A1 (en) * 2020-07-16 2021-08-05 平安科技(深圳)有限公司 Sample analysis method, apparatus, electronic device, and medium based on missing data
CN112964460A (en) * 2021-03-29 2021-06-15 华南理工大学 System and method for monitoring loosening of dental implant repair screw
CN114239161A (en) * 2021-11-29 2022-03-25 武汉欧格莱液压动力设备有限公司 Mechanical state detection method for hydraulic element

Similar Documents

Publication Publication Date Title
CN110132598B (en) A Fault Noise Diagnosis Algorithm for Rolling Bearings of Rotating Equipment
CN112257530B (en) Rolling bearing fault diagnosis method based on blind signal separation and support vector machine
CN111964908B (en) Bearing fault diagnosis method under variable working condition based on multi-scale weight distribution convolutional neural network model
CN113537328B (en) A deep learning-based rotating machinery fault diagnosis method and device
CN104712542B (en) A kind of reciprocating compressor sensitive features based on Internet of Things are extracted and method for diagnosing faults
CN113255848A (en) Water turbine cavitation sound signal identification method based on big data learning
CN110084316B (en) Fault diagnosis method based on fine time-shifting multi-scale arrangement entropy and firefly algorithm optimization support vector machine
CN112001110B (en) Structural damage identification monitoring method based on vibration signal space real-time recurrent graph convolutional neural network
CN109858104A (en) A kind of rolling bearing health evaluating and method for diagnosing faults and monitoring system
CN106769049A (en) A kind of Fault Diagnosis of Roller Bearings based on Laplce's score value and SVMs
CN116593157A (en) Gear fault diagnosis method for complex working conditions based on matching meta-learning under few samples
CN103646177B (en) Civil engineering structure damage early warning method considering temperature influence
CN110427974A (en) A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine
TW200819723A (en) Method and system for diagnosing equipment, and recording medium thereof
CN109299727A (en) An Improved Extreme Learning Machine Fault Diagnosis Method Based on Information Reconstruction
CN114742115B (en) Method for constructing fault diagnosis model of rolling bearing and diagnosis method
Miao et al. Adaptive densely connected convolutional auto-encoder-based feature learning of gearbox vibration signals
CN110887899B (en) A method for monitoring and identifying water erosion defects of steam turbine blades
CN114297921A (en) AM-TCN-based fault diagnosis method
CN111678699A (en) A method and system for early fault monitoring and diagnosis of rolling bearing
CN116337447B (en) Method and equipment for diagnosing faults of wheel pair bearings of railway vehicle under non-stationary working condition
CN103994334A (en) Oil transportation pipeline leakage flow estimating device and method based on KPCA-RBF curve fitting
CN109902731B (en) A method and device for detecting performance faults based on support vector machines
CN113553759A (en) Water pump residual service life prediction method, device, equipment and storage medium
CN114528906A (en) Fault diagnosis method, device, equipment and medium for rotary machine

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191108

RJ01 Rejection of invention patent application after publication