CN110427974A - A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine - Google Patents
A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine Download PDFInfo
- Publication number
- CN110427974A CN110427974A CN201910614733.7A CN201910614733A CN110427974A CN 110427974 A CN110427974 A CN 110427974A CN 201910614733 A CN201910614733 A CN 201910614733A CN 110427974 A CN110427974 A CN 110427974A
- Authority
- CN
- China
- Prior art keywords
- support vector
- data
- vector machine
- health status
- generalized support
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Testing And Monitoring For Control Systems (AREA)
Abstract
本发明属于设备状态监测相关技术领域,其公开了一种基于广义支持向量机的液压部件健康状态检测方法,该方法包括以下步骤:(1)获取多组样本液压部件处于不同健康状态下的传感器信号以作为样本数据,且同时获得对应的的健康状态数据;(2)计算所得到的特征数据与所述健康状态数据之间的相关性,并选取训练特征;(3)采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了广义支持向量机检测模型;(4)将得到的待测液压部件的实时传感器信号数据输入到所述广义支持向量机检测模型,以进行健康状态检测。本发明提高了检测精度及灵活性,适用性较强。
The invention belongs to the related technical field of equipment state monitoring, and discloses a method for detecting the health status of hydraulic components based on generalized support vector machines. The signal is used as sample data, and the corresponding health status data is obtained at the same time; (2) the correlation between the obtained feature data and the health status data is calculated, and the training features are selected; (3) the stacking integration learning method is adopted Build a plurality of generalized support vector machine models, and use the data output in the generalized support vector machine model training process to train to obtain the random forest model, thus obtained the generalized support vector machine detection model; (4) will obtain The real-time sensor signal data of the measured hydraulic components is input to the generalized support vector machine detection model for health status detection. The invention improves detection precision and flexibility, and has strong applicability.
Description
技术领域technical field
本发明属于设备状态监测相关技术领域,更具体地,涉及一种基于广义支持向量机的液压部件健康状态检测方法。The invention belongs to the related technical field of equipment state monitoring, and more specifically relates to a method for detecting the health state of hydraulic components based on a generalized support vector machine.
背景技术Background technique
液压系统是工业机电设备中的关键组成部分,其运行状态直接影响整个机电设备的稳定性与可靠性。若液压系统出现突发故障,将会导致设备停机,影响企业生产效率,给企业带来严重经济损失。因此,对液压系统进行健康状态感知是非常必要的。The hydraulic system is a key component of industrial electromechanical equipment, and its operating status directly affects the stability and reliability of the entire electromechanical equipment. If the hydraulic system fails suddenly, it will cause the equipment to stop, affect the production efficiency of the enterprise, and bring serious economic losses to the enterprise. Therefore, it is very necessary to perceive the health status of the hydraulic system.
液压系统健康状态并不能简单地划分为正常运行与故障损坏,其损坏过程是一个长期的、不断加深的过程。健康状态感知是对其损坏过程中的任意时间节点进行感知,得到的故障状态可能会有多种,工作人员需要根据对液压系统部件的健康状态感知得到的不同的故障状态做出相应的维修处理措施,以防止事故的发生,避免经济损失及人员伤害。The health status of the hydraulic system cannot be simply divided into normal operation and fault damage, and the damage process is a long-term and deepening process. Health status perception is to perceive any time node in the damage process, and the obtained fault states may be various. The staff need to make corresponding repairs according to the different fault states obtained from the health state perception of hydraulic system components Measures to prevent accidents, avoid economic losses and personal injuries.
支持向量机(Support vector machine,SVM)是一种有监督的机器学习方法,可以实现对损坏与非损坏的特征值分类,具有样本量需求小,诊断速度快等优点。然而,经典支持向量机只适用于二分类的诊断问题,常用的帮助支持向量机进行多分类的方法OVO、OVA等又会导致诊断效果下降,且其他分类方法的样本需求量较大,诊断速度较慢。相应地,本领域存在着发展一种精度较好的基于广义支持向量机的液压部件健康状态检测方法的技术需求,以实现机电设备液压系统的健康状态的准确感知。Support vector machine (Support vector machine, SVM) is a supervised machine learning method, which can realize the classification of damaged and non-damaged eigenvalues, and has the advantages of small sample size requirements and fast diagnosis speed. However, the classic support vector machine is only suitable for the diagnosis of two classifications. The commonly used methods to help the support vector machine to perform multi-classification, such as OVO and OVA, will lead to a decline in the diagnosis effect, and the demand for samples of other classification methods is large, and the diagnosis speed slower. Correspondingly, there is a technical requirement in this field to develop a method for detecting the health status of hydraulic components based on generalized support vector machines with better accuracy, so as to realize accurate perception of the health status of the hydraulic system of electromechanical equipment.
发明内容Contents of the invention
针对现有技术的以上缺陷或改进需求,本发明提供了一种基于广义支持向量机的液压部件健康状态检测方法,其基于现有液压系统的健康状态监测特点,研究及设计了一种精度较好的基于广义支持向量机的液压部件健康状态检测方法。所述检测方法采用了基于不同核函数的广义支持向量机对液压系统的各种液压部件的多种健康状态进行感知,并使用集成学习方法将基于广义支持向量机的健康状态感知结果进行集成以得到最终的健康状态感知结果,由此实现了液压部件的健康状态的感知,提高了检测精度及灵活性,适用性较强。Aiming at the above defects or improvement needs of the prior art, the present invention provides a generalized support vector machine-based method for detecting the health status of hydraulic components. A good health detection method for hydraulic components based on generalized support vector machines. The detection method uses a generalized support vector machine based on different kernel functions to perceive multiple health states of various hydraulic components of the hydraulic system, and uses an integrated learning method to integrate the health state perception results based on the generalized support vector machine. The final health status perception result is obtained, thereby realizing the health status perception of hydraulic components, improving detection accuracy and flexibility, and having strong applicability.
为实现上述目的,本发明提供了一种基于广义支持向量机的液压部件健康状态检测方法,所述检测方法包括以下步骤:In order to achieve the above object, the present invention provides a method for detecting the state of health of hydraulic components based on generalized support vector machines, the detection method comprising the following steps:
(1)获取多组样本液压部件处于不同健康状态下的传感器信号以作为样本数据,且同时获得每个样本数据对应的样本液压部件所处的健康状态数据;(1) Obtain multiple sets of sensor signals of sample hydraulic components in different health states as sample data, and simultaneously obtain the health state data of the sample hydraulic components corresponding to each sample data;
(2)对所述样本数据进行数据清洗,并对清洗后的所述样本数据提取统计特征;(2) performing data cleaning on the sample data, and extracting statistical features from the cleaned sample data;
(3)基于皮尔森相关系数计算所得到的特征数据与所述健康状态数据之间的相关性,并将所述特征数据按照对应的相关性自大到小进行排序,选取排在前预定个数的特征作为训练特征;(3) Calculate the correlation between the characteristic data obtained based on the Pearson correlation coefficient and the health status data, and sort the characteristic data according to the corresponding correlation from large to small, and select the first predetermined one The features of numbers are used as training features;
(4)基于所述训练特征及所述训练特征所对应的健康状态数据,采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了由堆叠集成学习方法集成的广义支持向量机检测模型;(4) Based on the health status data corresponding to the training features and the training features, a plurality of generalized support vector machine models are constructed by stacking integrated learning methods, and the data output during the training process of the generalized support vector machine models are used To train the random forest model, and thus obtain the generalized support vector machine detection model integrated by the stacked ensemble learning method;
(5)在线实时获取待测液压部件的传感器信号数据,并将得到的传感器信号数据进行处理后输入到所述广义支持向量机检测模型,进而所述广义支持向量机检测模型对待测液压部件进行健康状态检测。(5) Obtain the sensor signal data of the hydraulic component to be tested in real time online, and input the obtained sensor signal data into the generalized support vector machine detection model after processing, and then perform the generalized support vector machine detection model on the hydraulic component to be measured Health status detection.
进一步地,所述传感器信号包括温度、流量、压力及功率。Further, the sensor signal includes temperature, flow, pressure and power.
进一步地,所述统计特征包括绝对均值、有效值、方根幅值、歪度、峭度、峭度指标、歪度指标及波形指标。Further, the statistical features include absolute mean value, effective value, square root amplitude, skewness, kurtosis, kurtosis index, skewness index and waveform index.
进一步地,采用中值滤波对所述样本数据进行清洗。Further, median filtering is used to clean the sample data.
进一步地,采用皮尔森相关系数进行特征选择时,样本数据与健康状态标签数据的相关性系数的计算公式为:Further, when using the Pearson correlation coefficient for feature selection, the calculation formula of the correlation coefficient between the sample data and the health status label data is:
式中,cov(X,Y)表示协方差;σX表示所提取的特征数据集X的标准差;σY表示Y的标准差;X[i]表示第i种特征的样本数据;Y表示样本数据对应的健康状态标签数据。In the formula, cov(X, Y) represents the covariance; σ X represents the standard deviation of the extracted feature data set X; σ Y represents the standard deviation of Y; X[i] represents the sample data of the i-th feature; The health status label data corresponding to the sample data.
进一步地,所述预定个数为3个。Further, the predetermined number is 3.
进一步地,所述广义支持向量机模型包括多个不同核函数。Further, the generalized support vector machine model includes multiple different kernel functions.
进一步地,获取不同健康状态下的样本液压部件所关联的两个传感器的信号作为样本数据。Further, the signals of two sensors associated with the sample hydraulic components in different health states are acquired as sample data.
总体而言,通过本发明所构思的以上技术方案与现有技术相比,本发明提供的基于广义支持向量机的液压部件健康状态检测方法主要具有以下有益效果:Generally speaking, compared with the prior art through the above technical solutions conceived by the present invention, the generalized support vector machine-based hydraulic component health status detection method provided by the present invention mainly has the following beneficial effects:
1.基于皮尔森相关系数计算所得到的特征数据与所述健康状态数据之间的相关性,使用皮尔森相关性系数,可以直观反映提取特征值与部件健康状态的相关程度,以用来选择相关性最高,效果最好的特征值数据,进而提高最终分类效果的精度,进而提高检测精度。1. Based on the correlation between the characteristic data calculated by the Pearson correlation coefficient and the health status data, using the Pearson correlation coefficient can intuitively reflect the degree of correlation between the extracted feature value and the health status of the component for selection The eigenvalue data with the highest correlation and the best effect can improve the accuracy of the final classification effect, thereby improving the detection accuracy.
2.采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了由堆叠集成学习方法集成的广义支持向量机检测模型,使用Stacking集成学习方法,可以将多种分类器集成起来,提高了准确率,其中Stacking方法中还包括K折交叉验证,此方法可以有效降低分类器训练时的过拟合。2. Use the stacked ensemble learning method to construct multiple generalized support vector machine models, and use the data output in the generalized support vector machine model training process to train the random forest model, thus obtaining the integrated by the stacked ensemble learning method The generalized support vector machine detection model uses the Stacking integrated learning method to integrate multiple classifiers to improve the accuracy rate. The Stacking method also includes K-fold cross-validation, which can effectively reduce the overfitting of the classifier training. combine.
3.使用广义支持向量机方法,可以弥补传统SVM只能二分类的问题,而且对比其他的多分类方法,广义支持向量机训练时间短,分类效果更好。3. Using the generalized support vector machine method can make up for the problem that the traditional SVM can only be divided into two categories. Compared with other multi-classification methods, the generalized support vector machine has a shorter training time and better classification effect.
4.所述检测方法能够及时感知液压部件的健康状态,并提高了液压部件健康状态感知的准确性和有效性,且还可以用于液压系统健康状态的感知,为解决液压系统健康状态感知问题提供了一种新思路。4. The detection method can sense the health state of hydraulic components in time, and improves the accuracy and effectiveness of hydraulic component health state perception, and can also be used for the perception of hydraulic system health state, in order to solve the problem of hydraulic system health state perception A new way of thinking is provided.
附图说明Description of drawings
图1是本发明提供的基于广义支持向量机的液压部件健康状态检测方法的流程示意图;Fig. 1 is the schematic flow chart of the method for detecting the state of health of a hydraulic component based on a generalized support vector machine provided by the present invention;
图2是图1中的基于广义支持向量机的液压部件健康状态检测方法涉及的堆叠集成广义多分类支持向量机的模型构建与在线样本测试的流程示意图。Fig. 2 is a schematic flowchart of model building and online sample testing of the stacked integrated generalized multi-classification support vector machine involved in the generalized support vector machine-based hydraulic component health state detection method in Fig. 1 .
具体实施方式Detailed ways
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。此外,下面所描述的本发明各个实施方式中所涉及到的技术特征只要彼此之间未构成冲突就可以相互组合。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.
请参阅图1及图2,本发明提供的基于广义支持向量机的液压部件健康状态检测方法,所述检测方法能够实现机电设备健康状态诊断的精确化,其主要包括以下步骤:Please refer to Fig. 1 and Fig. 2, the generalized support vector machine based hydraulic component health status detection method provided by the present invention, the detection method can realize the precision of the health status diagnosis of electromechanical equipment, and it mainly includes the following steps:
步骤一,获取多组样本液压部件处于不同健康状态下的传感器信号以作为样本数据,且同时获得每个样本数据对应的样本液压部件所处的健康状态数据。Step 1: Obtain multiple sets of sensor signals of the sample hydraulic components in different health states as sample data, and simultaneously obtain the health state data of the sample hydraulic components corresponding to each sample data.
具体地,获取不同健康状态下的样本液压部件所关联的两个传感器的信号,如温度、流量、压力、功率等传感器信号作为样本数据,并且得到每个传感器信号所对应的样本液压部件的健康状态数据。本实施方式中,获取样本液压部件在Y种健康状态下的2个关联传感器信号,而每个信号样本对应的状态也是可以获取的;当然,当对象为液压系统时,可以针对液压系统的多个液压部件分别进行健康状态检测,进而综合得到的多个所述健康状态检测结果来确定所述液压系统的健康状态,这里关联的传感器是由液压系统来确定的,不同的液压系统里的某个液压部件对应的传感器不同,选择的传感器数量也就不同。Specifically, the signals of two sensors associated with the sample hydraulic components in different health states, such as temperature, flow, pressure, power and other sensor signals, are obtained as sample data, and the health of the sample hydraulic components corresponding to each sensor signal is obtained status data. In this embodiment, the two associated sensor signals of the sample hydraulic components in Y health states are obtained, and the state corresponding to each signal sample can also be obtained; of course, when the object is a hydraulic system, multiple Each hydraulic component performs health status detection respectively, and then the health status of the hydraulic system is determined by combining multiple health status detection results obtained. Here, the associated sensor is determined by the hydraulic system. A certain hydraulic system in different hydraulic systems The sensors corresponding to each hydraulic component are different, and the number of sensors selected is also different.
步骤二,对所述样本数据进行数据清洗,并对清洗后的所述样本数据提取统计特征。Step 2, performing data cleaning on the sample data, and extracting statistical features from the cleaned sample data.
具体地,对得到的所述样本数据进行数据清洗以去除奇异值,并对清洗后的所述样本数据提取统计特征,所述统计特征包括绝对均值、有效值、方根幅值、歪度、峭度、峭度指标、歪度指标及波形指标。Specifically, data cleaning is performed on the obtained sample data to remove singular values, and statistical features are extracted from the cleaned sample data, and the statistical features include absolute mean value, effective value, square root amplitude, skewness, Kurtosis, kurtosis index, skewness index and waveform index.
对所述样本数据进行清洗时涉及到中值滤波,具体包括以下步骤:The median filtering is involved in cleaning the sample data, which specifically includes the following steps:
(21)基于所述样本数据选择所对应的传感器中的一个传感器所采集的传感器信号数据的组数为m,每组包含数据点数为n,即传感器采集了m组数据,每组数据具有n个样本点。(21) The number of groups of sensor signal data collected by one of the corresponding sensors selected based on the sample data is m, and the number of data points included in each group is n, that is, the sensor has collected m groups of data, and each group of data has n sample points.
(22)确定中值滤波窗口的长度k。(22) Determine the length k of the median filter window.
(23)在第一组数据中取前k个数据,则:(23) Take the first k data in the first set of data, then:
N(k+i-1)/2=median(Ni,Ni+1,…,Ni+k-1)N (k+i-1)/2 =median(Ni,N i+1 ,...,N i+k-1 )
式中,Ni表示第i个数据;i初始为1;median()表示求中值,当(k+i-1)为奇数时加1。In the formula, N i represents the i-th data; i is initially 1; median() represents the median value, and 1 is added when (k+i-1) is an odd number.
(24)i=i+1,重复步骤(23),直到i+k=n,如此所有m组数据都经过处理以得到中值滤波后的数据集。(24) i=i+1, repeat step (23) until i+k=n, so all m sets of data are processed to obtain a median-filtered data set.
(25)基于所述样本数据选择所对应的传感器中的另一个传感器所得到的m组传感器数据(每组包含n个数据点)进行步骤(22)至步骤(24)的处理。(25) Select m sets of sensor data (each set includes n data points) obtained by another sensor in the corresponding sensor based on the sample data, and perform the processing from step (22) to step (24).
本实施方式中所使用的8个统计特征值见表1。See Table 1 for the 8 statistical feature values used in this embodiment.
表1统计特征指标Table 1 Statistical characteristic indicators
表中,xi为第i次采样的信号序列;i=1,2,…,N,N为总采样次数;σ为数据的标准差。所提出的统计特征集合为F=[F1,F2,…,F8],统计特征参数包含有量纲特征参数和无量纲特征参数,[F1,F2,…,F5]为有量纲统计指标,[F6,F7,F8]为无量纲统计特征指标。In the table, x i is the signal sequence of the i-th sampling; i=1, 2, ..., N, N is the total sampling times; σ is the standard deviation of the data. The proposed statistical feature set is F=[F 1 , F 2 ,…,F 8 ], the statistical feature parameters include dimensioned feature parameters and dimensionless feature parameters, [F 1 , F 2 ,…,F 5 ] is Dimensioned statistical indicators, [F 6 , F 7 , F 8 ] are dimensionless statistical characteristic indicators.
步骤三,基于皮尔森相关系数计算所得到的特征数据与所述健康状态数据之间的相关性,并将所述特征数据按照对应的相关性自大到小进行排序,选取排在前预定个数的特征作为训练特征。Step 3: Calculate the correlation between the characteristic data obtained based on the Pearson correlation coefficient and the health status data, and sort the characteristic data according to the corresponding correlation from large to small, and select the top predetermined Number features are used as training features.
具体地,所述预定个数为N,其可以根据最终效果进行改变。使用皮尔森相关系数进行的特征选择主要包括以下步骤:Specifically, the predetermined number is N, which can be changed according to the final effect. Feature selection using the Pearson correlation coefficient mainly includes the following steps:
(31)确定所提取的特征数据对应的真实的健康状态标签数据集Y,所选择的传感器中的一个的特征数据集X,X中包含提取的8种特征数据,X[i]表示第i种特征的样本数据。(31) Determine the real health status label data set Y corresponding to the extracted feature data, the feature data set X of one of the selected sensors, X contains 8 kinds of feature data extracted, and X[i] represents the i-th sample data of this feature.
(32)使用以下公式计算获得X[i]与Y的相关性系数:(32) Use the following formula to calculate the correlation coefficient between X[i] and Y:
得到8种特征的8组相关系数,其中cov(X,Y)表示协方差;σX表示X的标准差,σY表示Y的标准差。Get 8 sets of correlation coefficients for 8 features, where cov(X, Y) means covariance; σ X means the standard deviation of X, and σ Y means the standard deviation of Y.
(33)根据每个特征得到的相关系数,取相关系数最大的3种特征作为堆叠(Stacking)集成广义多分类支持向量机模型的输入数据集。(33) According to the correlation coefficient obtained by each feature, take the three features with the largest correlation coefficient as the input data set of the stacking integrated generalized multi-class support vector machine model.
(34)对所选择的另一个传感器提取的特征进行步骤(31)至步骤(33)处理,同样选择3种特征作为堆叠集成广义多分类支持向量机模型的输入数据集。(34) Perform steps (31) to (33) on the features extracted by another selected sensor, and also select three types of features as the input data set of the stacked and integrated generalized multi-classification support vector machine model.
步骤四,基于所述训练特征及所述训练特征所对应的健康状态数据,采用堆叠集成学习方法构建多个广义支持向量机模型,并采用所述广义支持向量机模型训练过程中所输出的数据来训练得到随机森林模型,由此得到了由堆叠集成学习方法集成的广义支持向量机检测模型。Step 4, based on the training features and the health status data corresponding to the training features, use the stacked ensemble learning method to construct multiple generalized support vector machine models, and use the data output during the generalized support vector machine model training process To train the random forest model, and thus obtain the generalized support vector machine detection model integrated by the stacked ensemble learning method.
具体地,使用得到的N个特征与每个训练特征对应的健康状态,构建K×M个广义支持向量机模型并得到模型训练过程中输出的数据,使用K×M个广义支持向量机的输出数据再去训练一个随机森林模型,至此得到由堆叠(Stacking)方法集成的广义支持向量机检测模型。Specifically, use the obtained N features and the health status corresponding to each training feature to construct K×M generalized support vector machine models and obtain the output data during model training, and use the output of K×M generalized support vector machines The data is then used to train a random forest model, so far a generalized support vector machine detection model integrated by the stacking method is obtained.
本实施方式中,广义支持向量机检测模型的构建主要包括以下步骤:In this embodiment, the construction of the generalized support vector machine detection model mainly includes the following steps:
(41)使用得到的两种传感器的经过选择的特征值数据,每一个传感器的特征值数据有m组,将每个传感器的m组特征值数据合并至一个大数据集X,假设每个传感器有4种特征,一个传感器有100组数据,则X中数据为2*4=8列,100行。(41) Using the selected eigenvalue data of the two sensors obtained, each sensor has m sets of eigenvalue data, and merges the m sets of eigenvalue data of each sensor into a large data set X, assuming that each sensor There are 4 types of features, one sensor has 100 sets of data, then the data in X is 2*4=8 columns, 100 rows.
(42)使用大数据集X中的每一行特征值数据对应的健康状态标签数据Y,Y为m行1列。(42) Use the health status label data Y corresponding to each row of eigenvalue data in the large data set X, and Y is m rows and 1 column.
(43)确定K折交叉验证中参数K的值,将X按行均分为K个集合,每个集合有行数据,将K个集合组合K次,每种组合包含训练集与校验集,确保K个集合中的每一个集合都在K种组合中作为一次校验集,而每种组合的训练集为剩下的K-1个集合的组合。具体的,假如K=3,则将X按行均分为3个集合a,b,c;将3个集合组合三次,其中第一个组合里a为校验集,b、c共同作为训练集,第二个组合里b为校验集,a、c共同作为训练集,第三个组合类似。(43) Determine the value of the parameter K in the K-fold cross-validation, and divide X into K sets by row, and each set has Row data, combine K sets K times, each combination contains training set and verification set, ensure that each set of K sets is used as a verification set in K combinations, and the training set of each combination It is the combination of the remaining K-1 sets. Specifically, if K=3, then divide X into three sets a, b, and c by row; combine the three sets three times, in which a is the verification set in the first combination, and b and c are jointly used as training set, b in the second combination is the verification set, a and c are used as the training set together, and the third combination is similar.
(44)使用K个线性核的广义多分类支持向量机对K种组合进行训练与分类,每次训练后对校验集进行预测,以得到对校验集的预测数据,K个已训练的线性核广义支持向量机模型对校验集预测得到K个预测数据集合,将K个集合校验集预测数据合并,以得到与原始X数据集的行数m相同行数的数据,将这个集合命名为H,H作为随机森林模型的训练输入数据。(44) Use the generalized multi-classification support vector machine with K linear kernels to train and classify K combinations, and predict the verification set after each training to obtain the prediction data for the verification set. K trained The linear kernel generalized support vector machine model predicts the verification set to obtain K prediction data sets, and merges the K sets of verification set prediction data to obtain data with the same number of rows as the original X data set m. Named as H, H is used as the training input data of the random forest model.
(45)使用K个多项式(Poly)核,K个径向基(RBF)核的广义多分类支持向量机,重复步骤(44),以得到另外两个集合H2,H3。所述支持向量机包括多个不同核函数。(45) Using K polynomial (Poly) kernels and K radial basis (RBF) kernels generalized multi-classification support vector machine, repeat step (44) to obtain another two sets H 2 , H 3 . The support vector machine includes a plurality of different kernel functions.
(46)使用随机森林模型,用数据集H,H2,H3训练一个随机森林模型,最终得到由堆叠(Stacking)集成的广义多分类支持向量机检测模型。(46) Use the random forest model, train a random forest model with the data sets H, H 2 , H 3 , and finally obtain a generalized multi-class support vector machine detection model integrated by stacking.
步骤五,在线实时获取待测液压部件的传感器信号数据,并将得到的传感器信号数据输入到所述广义支持向量机检测模型,进而所述广义支持向量机检测模型对待测液压部件进行健康状态检测。其中,所述传感器信号数据所对应的特征与所述训练特征相一致。Step 5: Obtain the sensor signal data of the hydraulic component to be tested in real time online, and input the obtained sensor signal data into the generalized support vector machine detection model, and then the generalized support vector machine detection model performs health status detection of the hydraulic component to be tested . Wherein, the features corresponding to the sensor signal data are consistent with the training features.
为了对本发明进行进一步的详细说明,使用液压系统健康状态数据来验证本方法,实验装置模拟了四种故障,其故障模式与控制参数如表2所示。In order to further describe the present invention in detail, the health state data of the hydraulic system is used to verify the method. The experimental device simulates four types of faults, and the fault modes and control parameters are shown in Table 2.
表2液压系统故障模式及其控制参数Table 2 Hydraulic system failure modes and their control parameters
每种进行健康状态感知的部件都有不同的健康状态,如表3所示:Each component that performs health status awareness has a different health status, as shown in Table 3:
表3健康状态标签与含义Table 3 Health Status Labels and Meanings
液压系统平台实验数据集中包含18种传感器数据,包括压力、功率、流量、温度、振动、效率传感器等,各传感器分别布置在相应部件位置,压力、功率传感器频率为100Hz,流量传感器频率为10Hz,温度、振动、效率传感器频率为1Hz。共采集了2205组数据,一组数据为60s,根据传感器频率不同每种传感器的一组数据中数据点也不同,每组数据都有相应的健康状态标签。The experimental data set of the hydraulic system platform contains 18 kinds of sensor data, including pressure, power, flow, temperature, vibration, efficiency sensors, etc., and each sensor is arranged at the corresponding component position. The frequency of the pressure and power sensors is 100 Hz, and the frequency of the flow sensor is 10 Hz. Temperature, vibration, efficiency sensor frequency is 1Hz. A total of 2205 sets of data were collected, one set of data was 60s, and the data points in each set of data of each sensor were different according to the frequency of the sensor, and each set of data had a corresponding health status label.
对于冷却器的健康状态感知选择了温度传感器TS3,TS4;对于阀的健康状态感知选择了压力传感器PS1、PS2;对于泵的健康状态感知选择了流量传感器FS1,系统效率传感器数据SE;对于蓄能器的健康状态感知选择功率传感器EPS1,压力传感器PS2。进行中值滤波得到滤波后的数据,之后对每种部件所选取的每个传感器的每组数据提取8种统计特征。The temperature sensors TS3 and TS4 are selected for the health status perception of the cooler; the pressure sensors PS1 and PS2 are selected for the health status perception of the valve; the flow sensor FS1 and the system efficiency sensor data SE are selected for the health status perception of the pump; for energy storage Select the power sensor EPS1 and pressure sensor PS2 for the health status perception of the device. Perform median filtering to obtain the filtered data, and then extract 8 statistical features for each set of data of each sensor selected for each component.
使用皮尔森系数进行相关性分析,以得到和部件的健康状态数据相关性最强的特征值数据。和冷却系统健康状态相关性最强的特征为有效值、绝对均值、峭度指标;和阀健康状态相关性最强的特征为方根幅值、歪度、波形指标;和泵健康状态相关性最强的特征为方根幅值、绝对均值、歪度指标;由于蓄能器的相关特征值的相关系数都很小,故使用所有的特征值。Correlation analysis is performed using the Pearson coefficient to obtain the eigenvalue data with the strongest correlation with the health status data of the component. The features most correlated with cooling system health status are effective value, absolute mean value, and kurtosis index; the features most correlated with valve health status are square root amplitude, skewness, waveform index; and pump health status correlation The strongest features are square root magnitude, absolute mean, and skewness index; since the correlation coefficients of the relevant eigenvalues of the accumulator are very small, all eigenvalues are used.
将每种部件对应的特征值数据分别合并为一个矩阵,即每种部件都有2205组数据,而由于选择特征值数目、传感器数量不同,得到的列数不同。同时得到相对应的4种2205行1列的标签数据集,每种数据集的标签种类如表3所示。Merge the eigenvalue data corresponding to each component into a matrix, that is, each component has 2205 sets of data, and the number of columns obtained is different due to the number of selected eigenvalues and the number of sensors. At the same time, four corresponding label data sets with 2205 rows and one column are obtained. The label types of each data set are shown in Table 3.
对每种部件的特征值矩阵中随机抽取70%作为训练集,30%作为测试集。使用Stacking集成广义支持向量机方法,其中取3折交叉验证,广义支持向量机分别取线性核、Poly核、RBF核。Randomly select 70% of the eigenvalue matrix of each component as the training set and 30% as the test set. Use Stacking to integrate the generalized support vector machine method, in which 3-fold cross-validation is adopted, and the generalized support vector machine adopts linear kernel, poly kernel and RBF kernel respectively.
对于每种核的广义支持向量机,在试验中不断调试参数,最终得到的参数如下:For the generalized support vector machine of each kernel, the parameters are continuously debugged in the experiment, and the final parameters are as follows:
线性核:k=0,p=1,lambda=1*10-8,epsilon=1*10-6;Linear kernel: k=0, p=1, lambda=1*10 -8 , epsilon=1*10 -6 ;
Poly核:k=0,p=2,lambda=1*10-8,epsilon=1*10-6;Poly core: k=0, p=2, lambda=1*10 -8 , epsilon=1*10 -6 ;
RBF核:k=0,p=1,lambda=1*10-8,epsilon=1*10-6。RBF kernel: k=0, p=1, lambda=1*10 -8 , epsilon=1*10 -6 .
最终结果如表4所示:The final results are shown in Table 4:
表4分类结果与准确率Table 4 Classification results and accuracy
为了说明本方法的准确性,将本方法与LDA方法、ANN方法、线性核SVM、RBF核SVM进行了对比,结果如表5所示,本方法的健康状态感知正确率优于其他方法。In order to illustrate the accuracy of this method, this method was compared with the LDA method, ANN method, linear kernel SVM, and RBF kernel SVM. The results are shown in Table 5. The health status perception accuracy of this method is better than other methods.
表5不同方法之间比较Table 5 Comparison between different methods
本发明提供的基于广义支持向量机的液压部件健康状态检测方法,所述检测方法结合了皮尔森相关系数、堆叠集成学习方法及广义支持向量机,提高了最终分类效果的精度,进而提高检测精度,且灵活性较好,适用性较强。The health state detection method of hydraulic components based on the generalized support vector machine provided by the present invention, the detection method combines the Pearson correlation coefficient, the stacking ensemble learning method and the generalized support vector machine, improves the accuracy of the final classification effect, and then improves the detection accuracy , and has good flexibility and strong applicability.
本领域的技术人员容易理解,以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。Those skilled in the art can easily understand that the above are only preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present invention, etc., All should be included within the protection scope of the present invention.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910614733.7A CN110427974A (en) | 2019-07-09 | 2019-07-09 | A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910614733.7A CN110427974A (en) | 2019-07-09 | 2019-07-09 | A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN110427974A true CN110427974A (en) | 2019-11-08 |
Family
ID=68409126
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201910614733.7A Pending CN110427974A (en) | 2019-07-09 | 2019-07-09 | A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN110427974A (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111831708A (en) * | 2020-07-16 | 2020-10-27 | 平安科技(深圳)有限公司 | Sample analysis method, device, electronic device and medium based on missing data |
| CN112964460A (en) * | 2021-03-29 | 2021-06-15 | 华南理工大学 | System and method for monitoring loosening of dental implant repair screw |
| CN114239161A (en) * | 2021-11-29 | 2022-03-25 | 武汉欧格莱液压动力设备有限公司 | Mechanical state detection method for hydraulic element |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107895212A (en) * | 2017-12-01 | 2018-04-10 | 国网山东省电力公司信息通信公司 | Lead-acid battery life-span prediction method based on sliding window and various visual angles Fusion Features |
| CN108959566A (en) * | 2018-07-04 | 2018-12-07 | 哈尔滨工业大学 | A kind of medical text based on Stacking integrated study goes privacy methods and system |
| CN108985335A (en) * | 2018-06-19 | 2018-12-11 | 中国原子能科学研究院 | The integrated study prediction technique of nuclear reactor cladding materials void swelling |
| CN108986869A (en) * | 2018-07-26 | 2018-12-11 | 南京群顶科技有限公司 | A kind of disk failure detection method predicted using multi-model |
| CN109186964A (en) * | 2018-07-28 | 2019-01-11 | 华中科技大学 | Rotary machinery fault diagnosis method based on angle resampling and ROC-SVM |
| US10209974B1 (en) * | 2017-12-04 | 2019-02-19 | Banjo, Inc | Automated model management methods |
| CN109359193A (en) * | 2018-09-25 | 2019-02-19 | 济南大学 | A method and system for abnormal telephone identification based on PCA dimensionality reduction with stacked two-layer framework |
| CN109514349A (en) * | 2018-11-12 | 2019-03-26 | 西安交通大学 | Tool wear state monitoring method based on vibration signal and Stacking integrated model |
| CN109781706A (en) * | 2019-02-11 | 2019-05-21 | 上海应用技术大学 | A training method for Raman spectroscopy recognition model of foodborne pathogens based on PCA-Stacking |
| CN109918708A (en) * | 2019-01-21 | 2019-06-21 | 昆明理工大学 | A Construction Method of Material Properties Prediction Model Based on Heterogeneous Ensemble Learning |
-
2019
- 2019-07-09 CN CN201910614733.7A patent/CN110427974A/en active Pending
Patent Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107895212A (en) * | 2017-12-01 | 2018-04-10 | 国网山东省电力公司信息通信公司 | Lead-acid battery life-span prediction method based on sliding window and various visual angles Fusion Features |
| US10209974B1 (en) * | 2017-12-04 | 2019-02-19 | Banjo, Inc | Automated model management methods |
| CN108985335A (en) * | 2018-06-19 | 2018-12-11 | 中国原子能科学研究院 | The integrated study prediction technique of nuclear reactor cladding materials void swelling |
| CN108959566A (en) * | 2018-07-04 | 2018-12-07 | 哈尔滨工业大学 | A kind of medical text based on Stacking integrated study goes privacy methods and system |
| CN108986869A (en) * | 2018-07-26 | 2018-12-11 | 南京群顶科技有限公司 | A kind of disk failure detection method predicted using multi-model |
| CN109186964A (en) * | 2018-07-28 | 2019-01-11 | 华中科技大学 | Rotary machinery fault diagnosis method based on angle resampling and ROC-SVM |
| CN109359193A (en) * | 2018-09-25 | 2019-02-19 | 济南大学 | A method and system for abnormal telephone identification based on PCA dimensionality reduction with stacked two-layer framework |
| CN109514349A (en) * | 2018-11-12 | 2019-03-26 | 西安交通大学 | Tool wear state monitoring method based on vibration signal and Stacking integrated model |
| CN109918708A (en) * | 2019-01-21 | 2019-06-21 | 昆明理工大学 | A Construction Method of Material Properties Prediction Model Based on Heterogeneous Ensemble Learning |
| CN109781706A (en) * | 2019-02-11 | 2019-05-21 | 上海应用技术大学 | A training method for Raman spectroscopy recognition model of foodborne pathogens based on PCA-Stacking |
Non-Patent Citations (1)
| Title |
|---|
| YIWEI CHENG 等: "Multisensory Data-Driven Health Degradation Monitoring of Machining Tools by Generalized Multiclass Support Vector Machine", 《IEEE ACCESS》 * |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111831708A (en) * | 2020-07-16 | 2020-10-27 | 平安科技(深圳)有限公司 | Sample analysis method, device, electronic device and medium based on missing data |
| WO2021151305A1 (en) * | 2020-07-16 | 2021-08-05 | 平安科技(深圳)有限公司 | Sample analysis method, apparatus, electronic device, and medium based on missing data |
| CN112964460A (en) * | 2021-03-29 | 2021-06-15 | 华南理工大学 | System and method for monitoring loosening of dental implant repair screw |
| CN114239161A (en) * | 2021-11-29 | 2022-03-25 | 武汉欧格莱液压动力设备有限公司 | Mechanical state detection method for hydraulic element |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110132598B (en) | A Fault Noise Diagnosis Algorithm for Rolling Bearings of Rotating Equipment | |
| CN112257530B (en) | Rolling bearing fault diagnosis method based on blind signal separation and support vector machine | |
| CN111964908B (en) | Bearing fault diagnosis method under variable working condition based on multi-scale weight distribution convolutional neural network model | |
| CN113537328B (en) | A deep learning-based rotating machinery fault diagnosis method and device | |
| CN104712542B (en) | A kind of reciprocating compressor sensitive features based on Internet of Things are extracted and method for diagnosing faults | |
| CN113255848A (en) | Water turbine cavitation sound signal identification method based on big data learning | |
| CN110084316B (en) | Fault diagnosis method based on fine time-shifting multi-scale arrangement entropy and firefly algorithm optimization support vector machine | |
| CN112001110B (en) | Structural damage identification monitoring method based on vibration signal space real-time recurrent graph convolutional neural network | |
| CN109858104A (en) | A kind of rolling bearing health evaluating and method for diagnosing faults and monitoring system | |
| CN106769049A (en) | A kind of Fault Diagnosis of Roller Bearings based on Laplce's score value and SVMs | |
| CN116593157A (en) | Gear fault diagnosis method for complex working conditions based on matching meta-learning under few samples | |
| CN103646177B (en) | Civil engineering structure damage early warning method considering temperature influence | |
| CN110427974A (en) | A Health Status Detection Method of Hydraulic Components Based on Generalized Support Vector Machine | |
| TW200819723A (en) | Method and system for diagnosing equipment, and recording medium thereof | |
| CN109299727A (en) | An Improved Extreme Learning Machine Fault Diagnosis Method Based on Information Reconstruction | |
| CN114742115B (en) | Method for constructing fault diagnosis model of rolling bearing and diagnosis method | |
| Miao et al. | Adaptive densely connected convolutional auto-encoder-based feature learning of gearbox vibration signals | |
| CN110887899B (en) | A method for monitoring and identifying water erosion defects of steam turbine blades | |
| CN114297921A (en) | AM-TCN-based fault diagnosis method | |
| CN111678699A (en) | A method and system for early fault monitoring and diagnosis of rolling bearing | |
| CN116337447B (en) | Method and equipment for diagnosing faults of wheel pair bearings of railway vehicle under non-stationary working condition | |
| CN103994334A (en) | Oil transportation pipeline leakage flow estimating device and method based on KPCA-RBF curve fitting | |
| CN109902731B (en) | A method and device for detecting performance faults based on support vector machines | |
| CN113553759A (en) | Water pump residual service life prediction method, device, equipment and storage medium | |
| CN114528906A (en) | Fault diagnosis method, device, equipment and medium for rotary machine |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191108 |
|
| RJ01 | Rejection of invention patent application after publication |