[go: up one dir, main page]

CN113378725B - Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium - Google Patents

Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium Download PDF

Info

Publication number
CN113378725B
CN113378725B CN202110662716.8A CN202110662716A CN113378725B CN 113378725 B CN113378725 B CN 113378725B CN 202110662716 A CN202110662716 A CN 202110662716A CN 113378725 B CN113378725 B CN 113378725B
Authority
CN
China
Prior art keywords
branch
scale
layer
channel attention
tool
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110662716.8A
Other languages
Chinese (zh)
Other versions
CN113378725A (en
Inventor
袁东风
狄子钧
周晓天
李东阳
梁道君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong University
Original Assignee
Shandong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong University filed Critical Shandong University
Priority to CN202110662716.8A priority Critical patent/CN113378725B/en
Publication of CN113378725A publication Critical patent/CN113378725A/en
Application granted granted Critical
Publication of CN113378725B publication Critical patent/CN113378725B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/02Preprocessing
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B23MACHINE TOOLS; METAL-WORKING NOT OTHERWISE PROVIDED FOR
    • B23QDETAILS, COMPONENTS, OR ACCESSORIES FOR MACHINE TOOLS, e.g. ARRANGEMENTS FOR COPYING OR CONTROLLING; MACHINE TOOLS IN GENERAL CHARACTERISED BY THE CONSTRUCTION OF PARTICULAR DETAILS OR COMPONENTS; COMBINATIONS OR ASSOCIATIONS OF METAL-WORKING MACHINES, NOT DIRECTED TO A PARTICULAR RESULT
    • B23Q17/00Arrangements for observing, indicating or measuring on machine tools
    • B23Q17/09Arrangements for observing, indicating or measuring on machine tools for indicating or measuring cutting pressure or for determining cutting-tool condition, e.g. cutting ability, load on tool
    • B23Q17/0952Arrangements for observing, indicating or measuring on machine tools for indicating or measuring cutting pressure or for determining cutting-tool condition, e.g. cutting ability, load on tool during machining
    • B23Q17/0957Detection of tool breakage
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12Classification; Matching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Mechanical Engineering (AREA)
  • Testing Of Devices, Machine Parts, Or Other Structures Thereof (AREA)

Abstract

The invention relates to a tool fault diagnosis method, equipment and storage medium based on a multi-scale-channel attention network, which comprises the following steps: (1) data acquisition; (2) data preprocessing; (3) constructing a multi-scale-channel attention network model; (4) training; (5) testing. According to the invention, the multi-scale-channel attention network is adopted to carry out tool wear fault diagnosis, and a channel attention mechanism is introduced into residual connection of the multi-scale network, so that different importance degrees of vibration signals of a machine tool spindle in three directions on tool wear state classification tasks are mapped to a characteristic learning process, and data are better fused.

Description

一种基于多尺度-通道注意力网络的刀具故障诊断方法、设备 及存储介质A tool fault diagnosis method, device and storage medium based on multi-scale-channel attention network

技术领域Technical Field

本发明涉及智能制造产品质量管控领域,特别涉及一种基于多尺度-通道注意力网络的刀具故障诊断方法、设备及存储介质。The present invention relates to the field of intelligent manufacturing product quality control, and in particular to a tool fault diagnosis method, device and storage medium based on a multi-scale-channel attention network.

背景技术Background Art

数控机床作为“工业母机”,在生产加工过程中应用非常广泛。刀具作为数控机床的切削工具,其实时健康状态直接影响着机床的加工效率和产品质量。刀具与工件直接接触并相互作用,在刀具高速切削的过程中,会产生不可避免的磨损损伤,对刀具磨损状态的精准监测有助于避免因刀具失效导致的产品质量问题。As the "mother machine of industry", CNC machine tools are widely used in the production and processing process. As the cutting tool of CNC machine tools, the real-time health status of the tool directly affects the processing efficiency and product quality of the machine tool. The tool is in direct contact and interacts with the workpiece. In the process of high-speed cutting of the tool, inevitable wear and damage will occur. Accurate monitoring of the tool wear status helps to avoid product quality problems caused by tool failure.

现阶段,在实际工业现场仍使用人工检测的方法,这种检测方法耗时耗力,并且存在由作业人员自身原因而引起的误差。随着制造业由自动化向智能化转型,人工智能技术与制造业深度融合成为关键。我们希望在故障诊断方法中导入深度学习,取代传统人工直接接触的检测方法,从而实现刀具精准的故障诊断。At present, manual inspection methods are still used in actual industrial sites. This inspection method is time-consuming and labor-intensive, and there are errors caused by the operators themselves. As the manufacturing industry transforms from automation to intelligence, the deep integration of artificial intelligence technology and manufacturing has become the key. We hope to introduce deep learning into the fault diagnosis method to replace the traditional manual direct contact inspection method, so as to achieve accurate fault diagnosis of cutting tools.

一般的智能化检测过程为采集监测信号(例如切削力信号、振动信号、声发射信号、主轴电流信号等),并分析这些信号与刀具磨损状态之间的映射关系。目前,已经有大量国内外学者开展了此类研究。Hu等提出了一种多尺度网络(Multiscale Network,MSNet),该网络包含一个三分支结构,每个分支具有不同的卷积层深度,从而可以提取一维振动信号不同层次的特征,并通过全连接合并这些特征。李鹏等使用一维卷积网络对加工过程中机床主轴和工作台的振动与声发射信号进行初步特征提取,之后将信号特征输入长短时记忆网络进行分析,最后得出刀具磨损状态的评估结果,分类准确率达到93.8%。Hsieh等使用快速傅里叶变换将振动时域信号转换为频域信号,并通过类均值散射准则提取频域特征,将特征输入单层的卷积神经网络,对刀具磨损状态进行分类,根据实验结果,发现Z方向振动信号与X方向或Y方向振动信号的组合可以得到更好的分类结果。为了优化模型的梯度传播,文章在具有相同特征尺度寸的卷积层之间加入残差连接。然而,上述方法存在以下问题:(1)受限于数据集的有效且单一,只能将单通道振动信号转换为单通道图像,单通道包含的特征匮乏,可能会对刀具故障分类精度有所影响;(2)受限于特征提取与特征分析分步完成,模型的自学习能力弱,最终的刀具故障诊断的准确率低;(3)受限于模型结构简单,没有使用有效的数据融合方法,导致不同方向的振动信号特征融合效果差,无法有效地提取数据特征。The general intelligent detection process is to collect monitoring signals (such as cutting force signals, vibration signals, acoustic emission signals, spindle current signals, etc.) and analyze the mapping relationship between these signals and tool wear status. At present, a large number of domestic and foreign scholars have carried out such research. Hu et al. proposed a multiscale network (MSNet), which contains a three-branch structure, each branch has a different convolution layer depth, so that the features of different levels of one-dimensional vibration signals can be extracted and merged through full connection. Li Peng et al. used a one-dimensional convolutional network to perform preliminary feature extraction on the vibration and acoustic emission signals of the machine tool spindle and worktable during processing, and then input the signal features into the long short-term memory network for analysis, and finally obtained the evaluation results of the tool wear status, with a classification accuracy of 93.8%. Hsieh et al. used fast Fourier transform to convert the vibration time domain signal into frequency domain signal, and extracted the frequency domain features through the mean scattering criterion, and input the features into a single-layer convolutional neural network to classify the tool wear status. According to the experimental results, it was found that the combination of Z-direction vibration signal and X-direction or Y-direction vibration signal can obtain better classification results. In order to optimize the gradient propagation of the model, the article adds residual connections between convolutional layers with the same feature scale. However, the above method has the following problems: (1) Due to the validity and singleness of the data set, only single-channel vibration signals can be converted into single-channel images. The single channel contains insufficient features, which may affect the accuracy of tool fault classification; (2) Due to the fact that feature extraction and feature analysis are completed step by step, the self-learning ability of the model is weak, and the accuracy of the final tool fault diagnosis is low; (3) Due to the simple model structure, no effective data fusion method is used, resulting in poor fusion effect of vibration signal features in different directions, and the inability to effectively extract data features.

发明内容Summary of the invention

针对以上存在的问题,本发明提出了一种基于多尺度-通道注意力网络的刀具故障诊断方法。In view of the above problems, the present invention proposes a tool fault diagnosis method based on a multi-scale-channel attention network.

本发明通过刀具磨损试验平台,采集了符合实际工业生产现场的机床主轴振动信号与刀具磨损值,振动信号包含X、Y、Z三个方向。基于特征融合的思路,将X、Y、Z三个方向的振动信号进行拼接,构成三通道的多通道特征图。The present invention uses a tool wear test platform to collect machine tool spindle vibration signals and tool wear values that conform to actual industrial production sites. The vibration signals include three directions: X, Y, and Z. Based on the idea of feature fusion, the vibration signals in the three directions of X, Y, and Z are spliced to form a three-channel multi-channel feature map.

本发明采用卷积神经网络提取特征图的特征并进行分类任务,卷积神经网络具有出色的自适应特征学习能力,可以很好地挖掘数据的潜在特征。The present invention uses a convolutional neural network to extract the features of the feature map and perform classification tasks. The convolutional neural network has excellent adaptive feature learning capabilities and can well mine the potential features of the data.

在本发明中,为了将机床主轴三个方向振动信号对刀具磨损状态分类任务的不同重要程度映射至特征学习的过程中,使得数据更好地融合,对多尺度网络做出改进,将通道注意力引入多尺度网络。In the present invention, in order to map the different importance of the vibration signals in three directions of the machine tool spindle to the tool wear state classification task into the feature learning process, so that the data can be better integrated, the multi-scale network is improved and channel attention is introduced into the multi-scale network.

本发明还提供了一种计算机设备及存储介质。The invention also provides a computer device and a storage medium.

本发明的技术方案为:The technical solution of the present invention is:

一种基于多尺度-通道注意力网络的刀具故障诊断方法,包括步骤如下:A tool fault diagnosis method based on a multi-scale-channel attention network comprises the following steps:

(1)数据采集:(1) Data collection:

分别采集机床主轴X、Y、Z三个轴的振动信号与每一次走刀后的刀具磨损值,其中,机床主轴X、Y、Z三个轴所处的主轴坐标系根据右手笛卡尔直角坐标系建立;The vibration signals of the three axes X, Y, and Z of the machine tool spindle and the tool wear value after each tool pass are collected respectively. The spindle coordinate system of the three axes X, Y, and Z of the machine tool spindle is established according to the right-handed Cartesian rectangular coordinate system;

(2)数据预处理:(2) Data preprocessing:

根据刀具磨损值进行刀具磨损阶段分类,以分类结果作为标签,将振动信号进行分类;The tool wear stages are classified according to the tool wear values, and the vibration signals are classified using the classification results as labels;

将机床主轴X、Y、Z三个轴的振动信号按照时间顺序分段为长度为n2的切片,之后将机床主轴X、Y、Z三个轴的切片进行拼接,构建为n×n×3的三通道输入特征图,其中,n为振动信号切片转换为特征图时,特征图的高或宽;The vibration signals of the machine tool spindle X, Y, and Z axes are segmented into slices of length n 2 in time order, and then the slices of the machine tool spindle X, Y, and Z axes are spliced to construct a three-channel input feature map of n×n×3, where n is the height or width of the feature map when the vibration signal slice is converted into a feature map;

数据预处理后的数据中选取训练集、验证集和测试集;Select training set, validation set and test set from the data after data preprocessing;

(3)构建多尺度-通道注意力网络模型:(3) Constructing a multi-scale-channel attention network model:

所述多尺度-通道注意力网络模型包括输入层、三个分支、全连接层;The multi-scale-channel attention network model includes an input layer, three branches, and a fully connected layer;

三个分支包括第一分支、第二分支、第三分支,第一分支包括5个卷积层;第二分支包括2个卷积层;第三分支包括1个卷积层;The three branches include a first branch, a second branch, and a third branch. The first branch includes 5 convolutional layers; the second branch includes 2 convolutional layers; and the third branch includes 1 convolutional layer.

在第一分支的第二个卷积层与第二分支的第一个卷积层、在第一分支的第四个卷积层与第二分支的第二个卷积层、在第二分支的第一个卷积层与第三分支的第一个卷积层之间均设置有残差连接,残差加和结果分别输入卷积层第一分支的第三个卷积层、第二分支的第四个卷积层、第二分支的第二个卷积层;Residual connections are set between the second convolution layer of the first branch and the first convolution layer of the second branch, between the fourth convolution layer of the first branch and the second convolution layer of the second branch, and between the first convolution layer of the second branch and the first convolution layer of the third branch. The residual summation results are respectively input into the third convolution layer of the first branch of the convolution layer, the fourth convolution layer of the second branch, and the second convolution layer of the second branch;

第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图做残差连接之前输入通道注意力模块。The output feature map of the first convolutional layer of the second branch, the output feature map of the second convolutional layer of the second branch, and the output feature map of the first convolutional layer of the third branch are input into the channel attention module before the residual connection.

(4)训练:将训练集输入多尺度-通道注意力网络模型进行训练,在训练的同时记录每一次训练周期的训练集准确率与损失函数;(4) Training: The training set is input into the multi-scale-channel attention network model for training, and the training set accuracy and loss function of each training cycle are recorded during training;

(5)测试:将测试集输入训练好的多尺度-通道注意力网络模型,输出测试集数据对应的刀具磨损阶段。(5) Testing: The test set is input into the trained multi-scale-channel attention network model, and the tool wear stage corresponding to the test set data is output.

根据本发明优选的,数据预处理后的数据的80%作为训练集,20%作为测试集,训练集中的80%用作训练,训练集中的20%作为验证集。Preferably according to the present invention, 80% of the data after data preprocessing is used as a training set, and 20% is used as a test set, 80% of the training set is used for training, and 20% of the training set is used as a validation set.

进一步优选的,三个分支包括第一分支、第二分支、第三分支的数学表达式分别如式(Ⅰ)、式(Ⅱ)、式(III)所示:Further preferably, the three branches include the first branch, the second branch, and the third branch, and their mathematical expressions are shown in formula (I), formula (II), and formula (III), respectively:

Figure GDA0004197768550000031
Figure GDA0004197768550000031

Figure GDA0004197768550000032
Figure GDA0004197768550000032

Figure GDA0004197768550000033
Figure GDA0004197768550000033

式(Ⅰ)、式(Ⅱ)、式(III)中,i1=1,2,…5,i2=1,2,i3=1;In formula (I), formula (II) and formula (III), i 1 =1, 2, ... 5, i 2 =1, 2, i 3 =1;

Figure GDA0004197768550000034
是指第一分支的第i1个卷积层的输出特征图,
Figure GDA0004197768550000035
是指第一分支的第i1个卷积层,
Figure GDA0004197768550000036
是指第一分支的第i1-1个卷积层的输出特征图;
Figure GDA0004197768550000034
refers to the output feature map of the i1th convolutional layer of the first branch,
Figure GDA0004197768550000035
refers to the i1th convolutional layer of the first branch,
Figure GDA0004197768550000036
It refers to the output feature map of the i 1 -1th convolutional layer of the first branch;

Figure GDA0004197768550000037
是指第二分支的第i2个卷积层的输出特征图,
Figure GDA0004197768550000038
是指第二分支的第i2个卷积层,
Figure GDA0004197768550000039
是指第二分支的第i2-1个卷积层的输出特征图;
Figure GDA0004197768550000037
refers to the output feature map of the i2th convolutional layer of the second branch,
Figure GDA0004197768550000038
refers to the i 2 th convolutional layer of the second branch,
Figure GDA0004197768550000039
It refers to the output feature map of the i 2 -1th convolutional layer of the second branch;

Figure GDA00041977685500000310
是指第三分支的第i3个卷积层的输出特征图,
Figure GDA00041977685500000311
是指第三分支的第i3个卷积层,
Figure GDA00041977685500000312
是指第三分支的第i3-1个卷积层的输出特征图。
Figure GDA00041977685500000310
refers to the output feature map of the i 3rd convolutional layer of the third branch,
Figure GDA00041977685500000311
refers to the 3rd convolutional layer of the third branch,
Figure GDA00041977685500000312
It refers to the output feature map of the i 3 -1 th convolutional layer of the third branch.

进一步优选的,每个卷积层的卷积核为3×3,每个卷积层之后加一个ReLU非线性激活函数。Further preferably, the convolution kernel of each convolution layer is 3×3, and a ReLU nonlinear activation function is added after each convolution layer.

进一步优选的,第一分支的5个卷积层的输出特征图尺度分别为64×64×3,32×32×3,16×16×3,8×8×3,4×4×3;第二分支的2个卷积层的输出特征图尺度分别为32×32×3,8×8×3;第三分支的1个卷积层的输出特征图尺度为32×32×3。Further preferably, the output feature map scales of the five convolutional layers of the first branch are 64×64×3, 32×32×3, 16×16×3, 8×8×3, and 4×4×3 respectively; the output feature map scales of the two convolutional layers of the second branch are 32×32×3 and 8×8×3 respectively; and the output feature map scale of one convolutional layer of the third branch is 32×32×3.

进一步优选的,通道注意力模块分别采用全局平均池化操作与全局最大池化操作获取全局感受野,再将它们分别输入一个两层的神经网络,第一层神经元个数为1,激活函数为ReLU,第二层神经元个数为3,这个两层的神经网络是共享的,之后将它们进行相加生成通道权重,最后通过归一化层将通道权重归一化至(0,1)之间。Further preferably, the channel attention module uses global average pooling operation and global maximum pooling operation to obtain the global receptive field, and then inputs them into a two-layer neural network, the number of neurons in the first layer is 1, the activation function is ReLU, and the number of neurons in the second layer is 3. The two-layer neural network is shared, and then they are added to generate channel weights, and finally the channel weights are normalized to between (0,1) through the normalization layer.

进一步优选的,将第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图,做残差连接之前输入通道注意力模块,经过池化操作得到通道权重,之后通道权重与原始输出特征图进行相乘,将通道权重映射到浅层分支卷积层的输出特征图上,带有通道权重的特征图与深层分支卷积层包括第一分支的第二个卷积层、在第一分支的第四个卷积层、在第二分支的第一个卷积层的输出特征图进行残差连接,输入深层分支卷积层的下一个卷积层。Further preferably, the output feature map of the first convolutional layer of the second branch, the output feature map of the second convolutional layer of the second branch, and the output feature map of the first convolutional layer of the third branch are input into the channel attention module before the residual connection, and the channel weight is obtained after the pooling operation. Then, the channel weight is multiplied by the original output feature map, and the channel weight is mapped to the output feature map of the shallow branch convolutional layer. The feature map with the channel weight is residually connected with the output feature map of the deep branch convolutional layer including the second convolutional layer of the first branch, the fourth convolutional layer of the first branch, and the first convolutional layer of the second branch, and is input into the next convolutional layer of the deep branch convolutional layer.

根据本发明优选的,步骤(1)中,采用型号为KS903的振动传感器采集机床主轴X、Y、Z三个轴的振动信号,采样频率为10240Hz;采用19JC数字式万能工具显微镜采集刀具磨损值。Preferably, according to the present invention, in step (1), a vibration sensor of model KS903 is used to collect vibration signals of the three axes X, Y, and Z of the machine tool spindle, and the sampling frequency is 10240 Hz; and a 19JC digital universal tool microscope is used to collect tool wear values.

根据本发明优选的,步骤(2)中,根据刀具磨损值进行刀具磨损阶段分类,以分类结果作为标签,将振动信号进行分类;具体是指:According to the preferred embodiment of the present invention, in step (2), the tool wear stage is classified according to the tool wear value, and the vibration signal is classified using the classification result as a label; specifically, it refers to:

绘制刀具全生命周期副后刀面最大磨损值的曲线图,与机床主轴X轴的全生命周期振动信号的时域图;其中,最大磨损值的曲线图的横坐标为走刀次数,纵坐标为最大磨损值大小;振动信号的时域图的横坐标为时间,纵坐标为振动信号幅值;Draw a curve graph of the maximum wear value of the secondary flank of the tool during its entire life cycle, and a time domain graph of the vibration signal of the X-axis of the machine tool spindle during its entire life cycle; the abscissa of the curve graph of the maximum wear value is the number of tool passes, and the ordinate is the maximum wear value; the abscissa of the time domain graph of the vibration signal is time, and the ordinate is the vibration signal amplitude;

根据最大磨损值曲线斜率变化将振动信号分为三类,最大磨损值曲线斜率不大于0.01时,对应的振动信号处于快速初始磨损阶段,最大磨损值曲线斜率在0.01至0.3之间时,对应的振动信号处于稳态磨损阶段,最大磨损值曲线斜率不小于0.3时,对应的振动信号处于急速磨损阶段;The vibration signal is divided into three categories according to the change of the slope of the maximum wear value curve. When the slope of the maximum wear value curve is not greater than 0.01, the corresponding vibration signal is in the rapid initial wear stage. When the slope of the maximum wear value curve is between 0.01 and 0.3, the corresponding vibration signal is in the steady-state wear stage. When the slope of the maximum wear value curve is not less than 0.3, the corresponding vibration signal is in the rapid wear stage.

根据本发明优选的,步骤(4)中,使用交叉熵函数L作为损失函数,如式(IV)所示:Preferably, according to the present invention, in step (4), a cross entropy function L is used as a loss function, as shown in formula (IV):

Figure GDA0004197768550000041
Figure GDA0004197768550000041

式(IV)中,yi表示真实值,

Figure GDA0004197768550000042
表示预测值,N为样本数量。In formula (IV), yi represents the true value,
Figure GDA0004197768550000042
represents the predicted value, and N is the number of samples.

一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer device comprises a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps of a tool fault diagnosis method based on a multi-scale-channel attention network are implemented.

一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer-readable storage medium stores a computer program, which, when executed by a processor, implements the steps of a tool fault diagnosis method based on a multi-scale-channel attention network.

本发明的有益效果为:The beneficial effects of the present invention are:

1.本发明根据机床主轴振动信号与刀具磨损状态的映射关系,采用卷积神经网络对刀具故障进行诊断。1. The present invention uses a convolutional neural network to diagnose tool faults based on the mapping relationship between the machine tool spindle vibration signal and the tool wear status.

2.本发明基于特征融合的思路,三个方向的振动信号作为输入特征图的三个通道,将机床主轴三个轴的振动信号进行拼接,之后输入卷积神经网络。2. The present invention is based on the idea of feature fusion. The vibration signals in three directions are used as the three channels of the input feature map. The vibration signals of the three axes of the machine tool spindle are spliced and then input into the convolutional neural network.

3.本发明为了使三个方向的振动信号更有效地进行融合,将通道注意力机制引入多尺度卷积神经网络,利用通道之间的相互依赖性,将三个方向的振动信号对刀具状态分类的不同重要程度映射至特征学习的过程中。3. In order to more effectively fuse the vibration signals in three directions, the present invention introduces the channel attention mechanism into the multi-scale convolutional neural network, and utilizes the interdependence between channels to map the different importance of the vibration signals in three directions to the tool state classification into the feature learning process.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1(a)为刀具全生命周期副后刀面最大磨损值曲线图;Figure 1(a) is a graph showing the maximum wear value of the secondary flank during the tool life cycle;

图1(b)为机床主轴X方向的全生命周期振动信号时域图;Figure 1(b) is a time domain diagram of the full life cycle vibration signal of the machine tool spindle in the X direction;

图2为多尺度-通道注意力网络模型的结构示意图;Figure 2 is a schematic diagram of the structure of a multi-scale-channel attention network model;

图3(a)为多尺度-通道注意力网络模型训练过程的准确率曲线图;Figure 3(a) is a graph showing the accuracy of the multi-scale-channel attention network model training process;

图3(b)为多尺度-通道注意力网络模型训练过程的损失函数曲线图;Figure 3(b) is a graph of the loss function during the training process of the multi-scale-channel attention network model;

图4为混淆矩阵示意图;Figure 4 is a schematic diagram of a confusion matrix;

具体实施方式DETAILED DESCRIPTION

下面结合说明书附图和实施例对本发明作进一步限定,但不限于此。The present invention will be further defined below in conjunction with the accompanying drawings and embodiments, but is not limited thereto.

实施例1Example 1

一种基于多尺度-通道注意力网络的刀具故障诊断方法,包括步骤如下:A tool fault diagnosis method based on a multi-scale-channel attention network comprises the following steps:

(1)数据采集:(1) Data collection:

为了采集符合实际工业生产场景的真实数据,本发明设计了刀具磨损试验平台。利用刀具磨损试验平台,分别采集机床主轴X、Y、Z三个轴的振动信号与每一次走刀后的刀具磨损值,其中,机床主轴X、Y、Z三个轴所处的主轴坐标系根据右手笛卡尔直角坐标系建立;In order to collect real data that conforms to actual industrial production scenarios, the present invention designs a tool wear test platform. Using the tool wear test platform, the vibration signals of the three axes of the machine tool spindle X, Y, and Z and the tool wear value after each tool pass are collected respectively, wherein the spindle coordinate system of the three axes of the machine tool spindle X, Y, and Z is established according to the right-handed Cartesian rectangular coordinate system;

(2)数据预处理:(2) Data preprocessing:

刀具磨损值曲线的斜率反映了当前阶段刀具磨损的程度,本发明根据刀具磨损值进行刀具磨损阶段分类,以分类结果作为标签,将振动信号进行分类;The slope of the tool wear value curve reflects the degree of tool wear at the current stage. The present invention classifies the tool wear stages according to the tool wear value, and uses the classification results as labels to classify the vibration signals;

将机床主轴X、Y、Z三个轴的振动信号按照时间顺序分段为长度为n2的切片,之后将机床主轴X、Y、Z三个轴的切片进行拼接,构建为n×n×3的三通道输入特征图,其中,n为振动信号切片转换为特征图时,特征图的高或宽;该数字为人为定义大小;The vibration signals of the X, Y, and Z axes of the machine tool spindle are segmented into slices of length n 2 in time order, and then the slices of the X, Y, and Z axes of the machine tool spindle are spliced to construct a three-channel input feature map of n×n×3, where n is the height or width of the feature map when the vibration signal slice is converted into a feature map; this number is an artificially defined size;

数据预处理后的数据的80%作为训练集,20%作为测试集,训练集中的80%用作训练,训练集中的20%作为验证集。80% of the preprocessed data is used as the training set, 20% is used as the test set, 80% of the training set is used for training, and 20% of the training set is used as the validation set.

(3)构建多尺度-通道注意力网络模型:(3) Constructing a multi-scale-channel attention network model:

多尺度网络包含三个分支,不同分支的卷积层深度不同,较深的网络分支可以提取到更多的局部信息,较浅的网络分支可以提取到更多的局部信息,最后在全连接层将不同层次的特征进行特征融合。多尺度网络在不同分支且具有相同输出特征图尺度的卷积层之间设置残差连接,这样帮助网络梯度传输,提升网络的性能。The multi-scale network consists of three branches. The depth of the convolution layer of different branches is different. The deeper network branches can extract more local information, and the shallower network branches can extract more local information. Finally, the features of different levels are fused in the fully connected layer. The multi-scale network sets residual connections between the convolution layers of different branches with the same output feature map scale, which helps the network gradient transmission and improves the network performance.

在较浅的网络分支的卷积层做残差连接之前,将卷积层的输出特征图输入通道注意力模块,通过池化操作,捕捉X、Y、Z三个方向振动信号的相关性,得到通道注意力权重,之后通道权重与输出特征图进行像素点相乘,将通道权重应设置特征图上。带有通道注意力权重的特征图与较深网络分支的卷积层的输出特征图做残差连接,从而将三个方向的不同重要程度映射至特征学习的过程中。Before the convolution layer of the shallower network branch is connected with the residual connection, the output feature map of the convolution layer is input into the channel attention module. Through the pooling operation, the correlation of the vibration signals in the three directions of X, Y, and Z is captured to obtain the channel attention weight. Then the channel weight is multiplied by the output feature map pixel by pixel, and the channel weight should be set on the feature map. The feature map with the channel attention weight is connected with the output feature map of the convolution layer of the deeper network branch with the residual connection, so as to map the different importance of the three directions to the feature learning process.

如图2所示,多尺度-通道注意力网络模型包括输入层、三个分支、全连接层;As shown in Figure 2, the multi-scale-channel attention network model includes an input layer, three branches, and a fully connected layer;

三个分支包括第一分支、第二分支、第三分支,第一分支包括5个卷积层;第二分支包括2个卷积层;第三分支包括1个卷积层;The three branches include a first branch, a second branch, and a third branch. The first branch includes 5 convolutional layers; the second branch includes 2 convolutional layers; and the third branch includes 1 convolutional layer.

在不同分支、具有相同输出特征图尺度的卷积层之间实现恒等映射,将浅层卷积网络分支的特征图与深层卷积网络分支的特征图做残差连接,之后输入深层卷积网络分支的下一层卷积层。在第一分支的第二个卷积层与第二分支的第一个卷积层、在第一分支的第四个卷积层与第二分支的第二个卷积层、在第二分支的第一个卷积层与第三分支的第一个卷积层之间均设置有残差连接,残差加和结果分别输入卷积层第一分支的第三个卷积层、第二分支的第四个卷积层、第二分支的第二个卷积层;残差连接可以在前向过程中帮助网络中的特征进行恒等映射,当浅层网络的输出已经达到最优时,特征直接传递到深层网络;在反向过程中帮助传导梯度,让更深的模型能够成功训练,提高网络的性能。To achieve identity mapping between convolutional layers of different branches with the same output feature map scale, the feature map of the shallow convolutional network branch is residually connected with the feature map of the deep convolutional network branch, and then input into the next convolutional layer of the deep convolutional network branch. Residual connections are set between the second convolutional layer of the first branch and the first convolutional layer of the second branch, the fourth convolutional layer of the first branch and the second convolutional layer of the second branch, and the first convolutional layer of the second branch and the first convolutional layer of the third branch. The residual summation results are input into the third convolutional layer of the first branch of the convolutional layer, the fourth convolutional layer of the second branch, and the second convolutional layer of the second branch respectively; the residual connection can help the features in the network to be identically mapped in the forward process. When the output of the shallow network has reached the optimal level, the features are directly transferred to the deep network; in the reverse process, it helps to conduct gradients, so that deeper models can be successfully trained and the performance of the network can be improved.

在浅层分支卷积层的输出特征图做残差连接之前输入通道注意力模块,第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图做残差连接之前输入通道注意力模块。The channel attention module is input before the residual connection is made to the output feature map of the shallow branch convolution layer, and the channel attention module is input before the residual connection is made to the output feature map of the first convolution layer of the second branch, the output feature map of the second convolution layer of the second branch, and the output feature map of the first convolution layer of the third branch.

三个分支包括第一分支、第二分支、第三分支的数学表达式分别如式(Ⅰ)、式(Ⅱ)、式(III)所示:The mathematical expressions of the three branches including the first branch, the second branch and the third branch are shown in formula (I), formula (II) and formula (III) respectively:

Figure GDA0004197768550000061
Figure GDA0004197768550000061

Figure GDA0004197768550000062
Figure GDA0004197768550000062

Figure GDA0004197768550000063
Figure GDA0004197768550000063

式(Ⅰ)、式(Ⅱ)、式(III)中,i1=1,2,…5,i2=1,2,i3=1;In formula (I), formula (II) and formula (III), i 1 =1, 2, ... 5, i 2 =1, 2, i 3 =1;

Figure GDA0004197768550000064
是指第一分支的第i1个卷积层的输出特征图,
Figure GDA0004197768550000065
是指第一分支的第i1个卷积层,
Figure GDA0004197768550000066
是指第一分支的第i1-1个卷积层的输出特征图;
Figure GDA0004197768550000064
refers to the output feature map of the i1th convolutional layer of the first branch,
Figure GDA0004197768550000065
refers to the i1th convolutional layer of the first branch,
Figure GDA0004197768550000066
It refers to the output feature map of the i 1 -1th convolutional layer of the first branch;

Figure GDA0004197768550000067
是指第二分支的第i2个卷积层的输出特征图,
Figure GDA0004197768550000068
是指第二分支的第i2个卷积层,
Figure GDA0004197768550000069
是指第二分支的第i2-1个卷积层的输出特征图;
Figure GDA0004197768550000067
refers to the output feature map of the i2th convolutional layer of the second branch,
Figure GDA0004197768550000068
refers to the i 2 th convolutional layer of the second branch,
Figure GDA0004197768550000069
It refers to the output feature map of the i 2 -1th convolutional layer of the second branch;

Figure GDA00041977685500000610
是指第三分支的第i3个卷积层的输出特征图,
Figure GDA00041977685500000611
是指第三分支的第i3个卷积层,
Figure GDA00041977685500000612
是指第三分支的第i3-1个卷积层的输出特征图。
Figure GDA00041977685500000610
refers to the output feature map of the i 3rd convolutional layer of the third branch,
Figure GDA00041977685500000611
refers to the 3rd convolutional layer of the third branch,
Figure GDA00041977685500000612
It refers to the output feature map of the i 3 -1 th convolutional layer of the third branch.

每个卷积层的卷积核为3×3,每个卷积层之后加一个ReLU非线性激活函数。The convolution kernel of each convolution layer is 3×3, and a ReLU nonlinear activation function is added after each convolution layer.

第一分支的5个卷积层的输出特征图尺度分别为64×64×3,32×32×3,16×16×3,8×8×3,4×4×3;第二分支的2个卷积层的输出特征图尺度分别为32×32×3,8×8×3;第三分支的1个卷积层的输出特征图尺度为32×32×3。The output feature map scales of the five convolutional layers of the first branch are 64×64×3, 32×32×3, 16×16×3, 8×8×3, and 4×4×3 respectively; the output feature map scales of the two convolutional layers of the second branch are 32×32×3 and 8×8×3 respectively; the output feature map scale of the one convolutional layer of the third branch is 32×32×3.

通道注意力模块分别采用全局平均池化操作与全局最大池化操作获取全局感受野,再将它们分别输入一个两层的神经网络,第一层神经元个数为1,激活函数为ReLU,第二层神经元个数为3,这个两层的神经网络是共享的,之后将它们进行相加生成通道权重,最后通过归一化层将通道权重归一化至(0,1)之间。The channel attention module uses global average pooling and global maximum pooling operations to obtain the global receptive field, and then inputs them into a two-layer neural network. The number of neurons in the first layer is 1, the activation function is ReLU, and the number of neurons in the second layer is 3. The two-layer neural network is shared, and then they are added to generate channel weights. Finally, the channel weights are normalized to between (0,1) through the normalization layer.

在浅层分支卷积层的输出特征图做残差连接之前输入通道注意力模块,利用通道之间的相互依赖性,捕捉X、Y、Z三个方向振动信号的相关性,从而可以将三个方向的不同重要程度通过通道注意力学习的方式映射至特征提取的过程中。将第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图,做残差连接之前输入通道注意力模块,经过池化操作得到通道权重,之后通道权重与原始输出特征图进行相乘,将通道权重映射到浅层分支卷积层的输出特征图上,带有通道权重的特征图与深层分支卷积层包括第一分支的第二个卷积层、在第一分支的第四个卷积层、在第二分支的第一个卷积层的输出特征图进行残差连接,输入深层分支卷积层的下一个卷积层。Before the output feature map of the shallow branch convolution layer is connected with the residual connection, the channel attention module is input. The mutual dependence between channels is used to capture the correlation of the vibration signals in the three directions of X, Y, and Z, so that the different importance of the three directions can be mapped to the feature extraction process through channel attention learning. The output feature map of the first convolution layer of the second branch, the output feature map of the second convolution layer of the second branch, and the output feature map of the first convolution layer of the third branch are input with the channel attention module before the residual connection. The channel weight is obtained through pooling operation, and then the channel weight is multiplied with the original output feature map. The channel weight is mapped to the output feature map of the shallow branch convolution layer. The feature map with the channel weight is residually connected with the output feature map of the deep branch convolution layer, including the second convolution layer of the first branch, the fourth convolution layer of the first branch, and the first convolution layer of the second branch, and input into the next convolution layer of the deep branch convolution layer.

以卷积层输出特征图

Figure GDA0004197768550000071
Figure GDA0004197768550000072
为例:
Figure GDA0004197768550000073
Figure GDA0004197768550000074
表示第一分支的第三个卷积层的输入,A表示通道注意力模块,
Figure GDA0004197768550000075
表示第一分支的第二个卷积层的输出特征图,
Figure GDA0004197768550000076
表示第二分支的第一个卷积层的输出特征图。通过高效通道注意力模块,提升影响因素较强的通道特征权重,而抑制影响因素较弱的通道特征权重。Output feature map with convolutional layer
Figure GDA0004197768550000071
and
Figure GDA0004197768550000072
For example:
Figure GDA0004197768550000073
Figure GDA0004197768550000074
represents the input of the third convolutional layer of the first branch, A represents the channel attention module,
Figure GDA0004197768550000075
represents the output feature map of the second convolutional layer of the first branch,
Figure GDA0004197768550000076
The output feature map of the first convolutional layer of the second branch. Through the efficient channel attention module, the weights of channel features with stronger influencing factors are increased, while the weights of channel features with weaker influencing factors are suppressed.

(4)训练:将训练集输入多尺度-通道注意力网络模型进行训练,在训练的同时记录每一次训练周期的训练集准确率与损失函数;(4) Training: The training set is input into the multi-scale-channel attention network model for training, and the training set accuracy and loss function of each training cycle are recorded during training;

(5)测试:将测试集输入训练好的多尺度-通道注意力网络模型,输出测试集数据对应的刀具磨损阶段。记录测试集每一种类别的准确率,并绘制混淆矩阵。采用准确率(Accuracy)、精确率(Precision)、召回率(Recall)、F1分数(F1 Score)四个指标说明网络的性能。(5) Testing: Input the test set into the trained multi-scale-channel attention network model and output the tool wear stage corresponding to the test set data. Record the accuracy of each category in the test set and draw the confusion matrix. Use the four indicators of accuracy, precision, recall, and F1 score to illustrate the performance of the network.

实施例2Example 2

根据实施例1所述的一种基于多尺度-通道注意力网络的刀具故障诊断方法,其区别在于:The tool fault diagnosis method based on a multi-scale-channel attention network according to embodiment 1 is different in that:

步骤(1)中,采用型号为KS903的振动传感器采集机床主轴X、Y、Z三个轴的振动信号,采样频率为10240Hz;采用19JC数字式万能工具显微镜采集刀具磨损值。数据采集在刀具磨损试验平台进行,工件切削过程在数控机床立式加工中心(VDF-850)完成,刀具为直径10mm的三刃立铣刀,切削工件为45号钢圆柱形工件。数控机床数控机床主轴转速为2000r/min,进给速度为764mm/min。铣削深度和宽度分别为2mm和5mm。为了加速刀具磨损,采用无切削液的干切削方式。铣刀的加工总长度在110米左右,使刀具达到严重磨损的程度,并且有小面积的铣屑。每把刀具采集35-47组试验数据,每组铣削过程耗时4分17秒。同时,在每组切削完成之后,采用19JC数字式万能工具显微镜采集刀具磨损值,采用副后刀面最大磨损宽度VBmax作为刀具磨损阶段的分类标签。In step (1), a vibration sensor of model KS903 is used to collect vibration signals of the three axes X, Y, and Z of the machine tool spindle, and the sampling frequency is 10240Hz; a 19JC digital universal tool microscope is used to collect tool wear values. Data collection is carried out on a tool wear test platform, and the workpiece cutting process is completed on a CNC machine tool vertical machining center (VDF-850). The tool is a three-edge end mill with a diameter of 10mm, and the cutting workpiece is a 45# steel cylindrical workpiece. The spindle speed of the CNC machine tool is 2000r/min, and the feed speed is 764mm/min. The milling depth and width are 2mm and 5mm respectively. In order to accelerate tool wear, a dry cutting method without cutting fluid is adopted. The total processing length of the milling cutter is about 110 meters, so that the tool reaches a serious degree of wear and there is a small area of milling chips. 35-47 sets of test data are collected for each tool, and each set of milling process takes 4 minutes and 17 seconds. At the same time, after each group of cutting is completed, a 19JC digital universal tool microscope is used to collect tool wear values, and the maximum wear width VBmax of the secondary flank is used as the classification label of the tool wear stage.

步骤(2)中,根据刀具磨损值进行刀具磨损阶段分类,以分类结果作为标签,将振动信号进行分类;具体是指:In step (2), the tool wear stage is classified according to the tool wear value, and the vibration signal is classified using the classification result as a label; specifically, it means:

绘制刀具全生命周期副后刀面最大磨损值的曲线图,与机床主轴X轴的全生命周期振动信号的时域图;其中,最大磨损值的曲线图的横坐标为走刀次数,纵坐标为最大磨损值大小;振动信号的时域图的横坐标为时间,纵坐标为振动信号幅值;Draw a curve graph of the maximum wear value of the secondary flank of the tool during its entire life cycle, and a time domain graph of the vibration signal of the X-axis of the machine tool spindle during its entire life cycle; the abscissa of the curve graph of the maximum wear value is the number of tool passes, and the ordinate is the maximum wear value; the abscissa of the time domain graph of the vibration signal is time, and the ordinate is the vibration signal amplitude;

根据最大磨损值曲线斜率变化将振动信号分为三类,最大磨损值曲线斜率不大于0.01时,对应的振动信号处于快速初始磨损阶段,最大磨损值曲线斜率在0.01至0.3之间时,对应的振动信号处于稳态磨损阶段,最大磨损值曲线斜率不小于0.3时,对应的振动信号处于急速磨损阶段;The vibration signal is divided into three categories according to the change of the slope of the maximum wear value curve. When the slope of the maximum wear value curve is not greater than 0.01, the corresponding vibration signal is in the rapid initial wear stage. When the slope of the maximum wear value curve is between 0.01 and 0.3, the corresponding vibration signal is in the steady-state wear stage. When the slope of the maximum wear value curve is not less than 0.3, the corresponding vibration signal is in the rapid wear stage.

根据最大磨损值曲线斜率变化将振动信号分为三类,分别对应三个不同的刀具磨损阶段:快速初始磨损阶段、稳态磨损阶段和急速磨损阶段。刀具全生命周期最大磨损值如图1(a)所示。由图1(a)可以看出,刀具在1-5次走刀阶段的磨损较快,此阶段磨损曲线斜率较大。在6-41次走刀阶段,磨损值均匀增长,直到达到极限值,此阶段为刀具的有效工作时间。在42-47次走刀阶段,刀具磨损值急速上升并导致刀具失效,此阶段的磨损曲线斜率急速增大。虽然在图1(a)中初始磨损阶段不是很明显,但在图1(b)的振动信号时域图中,初始阶段振动信号的幅值较大,产生这种现象的原因是新刀表面粗糙不平,接触应力较大,同时,新刀有脱碳、氧化层引起的表面缺陷。根据以上分析,将振动信号分为三个阶段。According to the change of the slope of the maximum wear value curve, the vibration signal is divided into three categories, corresponding to three different tool wear stages: rapid initial wear stage, steady-state wear stage and rapid wear stage. The maximum wear value of the tool in the whole life cycle is shown in Figure 1(a). As can be seen from Figure 1(a), the tool wears faster in the 1-5 pass stage, and the slope of the wear curve in this stage is larger. In the 6-41 pass stage, the wear value increases evenly until it reaches the limit value. This stage is the effective working time of the tool. In the 42-47 pass stage, the tool wear value rises rapidly and causes the tool to fail. The slope of the wear curve in this stage increases rapidly. Although the initial wear stage is not very obvious in Figure 1(a), in the vibration signal time domain diagram of Figure 1(b), the amplitude of the vibration signal in the initial stage is large. The reason for this phenomenon is that the surface of the new tool is rough and uneven, the contact stress is large, and the new tool has surface defects caused by decarburization and oxidation layer. According to the above analysis, the vibration signal is divided into three stages.

在振动信号输入多尺度-通道注意力网络模型之前,将振动信号按照信号的时间顺序分段为长度为642的切片,之后将X、Y、Z三个方向的振动信号切片作为输入特征图的三个通道进行拼接。假设有X、Y、Z方向的三个同时段的信号切片,

Figure GDA0004197768550000081
将切片构建为64×64×3的输入特征图输入多尺度-通道注意力网络模型。Before the vibration signal is input into the multi-scale-channel attention network model, the vibration signal is segmented into slices of length 64 2 according to the time order of the signal, and then the vibration signal slices in the three directions of X, Y, and Z are spliced as the three channels of the input feature map. Assuming there are three signal slices in the same period of the X, Y, and Z directions,
Figure GDA0004197768550000081
The slices are constructed as 64×64×3 input feature maps and input into the multi-scale-channel attention network model.

步骤(4)中,训练的迭代次数设置为100次,使用交叉熵函数L作为损失函数,如式(IV)所示:In step (4), the number of training iterations is set to 100, and the cross entropy function L is used as the loss function, as shown in formula (IV):

Figure GDA0004197768550000091
Figure GDA0004197768550000091

式(IV)中,yi表示真实值,

Figure GDA0004197768550000092
表示预测值,N为样本数量。In formula (IV), yi represents the true value,
Figure GDA0004197768550000092
represents the predicted value, and N is the number of samples.

训练过程中记录每一次训练周期的训练集与验证集的准确率与损失函数值。训练过程结束后保存最优的验证模型做测试,并绘制训练集准确率曲线与训练过程损失函数曲线,图3(a)为多尺度-通道注意力网络模型训练过程的准确率曲线图;图3(b)为多尺度-通道注意力网络模型训练过程的损失函数曲线图;During the training process, the accuracy and loss function values of the training set and validation set are recorded for each training cycle. After the training process is completed, the optimal validation model is saved for testing, and the training set accuracy curve and the training process loss function curve are plotted. Figure 3(a) is the accuracy curve of the multi-scale-channel attention network model training process; Figure 3(b) is the loss function curve of the multi-scale-channel attention network model training process;

记录测试集每一种类别的准确率,并绘制混淆矩阵,图4为混淆矩阵示意图;表1为多尺度-通道注意力网络模型的四种评价指标表。包括准确率(Accuracy)、精确率(Precision)、召回率(Recall)、F1分数(F1 Score)四个指标说明多尺度-通道注意力网络模型的性能,Record the accuracy of each category of the test set and draw a confusion matrix. Figure 4 is a schematic diagram of the confusion matrix. Table 1 is a table of four evaluation indicators of the multi-scale-channel attention network model. The four indicators of accuracy, precision, recall, and F1 score illustrate the performance of the multi-scale-channel attention network model.

表1Table 1

Figure GDA0004197768550000093
Figure GDA0004197768550000093

从表1可知,准确率(Accuracy)、精确率(Precision)、召回率(Recall)、F1分数(F1Score)四个指标都较高,说明多尺度-通道注意力网络模型性能较优。As can be seen from Table 1, the four indicators of accuracy, precision, recall and F1 score are all high, indicating that the multi-scale-channel attention network model has better performance.

实施例3Example 3

一种计算机设备,包括存储器和处理器,存储器存储有计算机程序,处理器执行计算机程序时实现实施例1或2所述的基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer device includes a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps of the tool fault diagnosis method based on a multi-scale-channel attention network described in Example 1 or 2 are implemented.

实施例4Example 4

一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现实施例1或2所述的基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer-readable storage medium having a computer program stored thereon, which, when executed by a processor, implements the steps of the tool fault diagnosis method based on a multi-scale-channel attention network described in Example 1 or 2.

Claims (9)

1. The tool fault diagnosis method based on the multiscale-channel attention network is characterized by comprising the following steps of:
(1) And (3) data acquisition:
respectively collecting vibration signals of three axes of a machine tool spindle X, Y, Z and cutter abrasion values after each feeding, wherein a spindle coordinate system of the machine tool spindle X, Y, Z is established according to a right-hand Cartesian rectangular coordinate system;
(2) Data preprocessing:
classifying the cutter abrasion stage according to the cutter abrasion value, and classifying the vibration signals by taking the classification result as a label;
the vibration signals of the three axes of the main shaft X, Y, Z of the machine tool are segmented into the length n according to the time sequence 2 Then splicing the sections of the three axes of the main shaft X, Y, Z of the machine tool to construct an n multiplied by 3 three-channel input feature map, wherein n is the height or width of the feature map when the vibration signal section is converted into the feature map;
selecting a training set, a verification set and a test set from the data after the data preprocessing;
(3) Constructing a multi-scale-channel attention network model:
the multi-scale-channel attention network model comprises an input layer, three branches and a full-connection layer;
the three branches comprise a first branch, a second branch and a third branch, and the first branch comprises 5 convolution layers; the second branch comprises 2 convolution layers; the third branch comprises 1 convolution layer;
residual error connection is arranged between the second convolution layer of the first branch and the first convolution layer of the second branch, between the fourth convolution layer of the first branch and the second convolution layer of the second branch and between the first convolution layer of the second branch and the first convolution layer of the third branch, and residual error addition results are respectively input into the third convolution layer of the first branch, the fifth convolution layer of the first branch and the second convolution layer of the second branch;
the input channel attention module is used for carrying out residual connection on the output characteristic diagram of the first convolution layer of the second branch, the output characteristic diagram of the second convolution layer of the second branch and the output characteristic diagram of the first convolution layer of the third branch;
(4) Training: inputting the training set into a multi-scale-channel attention network model for training, and recording the accuracy and the loss function of the training set in each training period while training;
(5) And (3) testing: inputting the test set into a trained multi-scale-channel attention network model, and outputting a cutter abrasion stage corresponding to the test set data;
the mathematical expressions of the three branches including the first branch, the second branch and the third branch are respectively shown as the formula (I), the formula (II) and the formula (III):
Figure FDA0004173027160000021
Figure FDA0004173027160000022
Figure FDA0004173027160000023
in the formula (I), the formula (II) and the formula (III), i 1 =1,2,…5,i 2 =1,2,i 3 =1;
Figure FDA0004173027160000024
Refers to the ith branch of the first branch 1 Output feature map of the convolutional layers, +.>
Figure FDA0004173027160000025
Refers to the ith branch of the first branch 1 The number of the convolution layers is one,
Figure FDA0004173027160000026
refers to the ith branch of the first branch 1 -an output profile of 1 convolutional layer;
Figure FDA0004173027160000027
refers to the ith of the second branch 2 Output feature map of the convolutional layers, +.>
Figure FDA0004173027160000028
Refers to the ith of the second branch 2 The number of the convolution layers is one,
Figure FDA0004173027160000029
refers to the ith of the second branch 2 -an output profile of 1 convolutional layer;
Figure FDA00041730271600000210
refers to the ith branch of the third branch 3 Output feature map of the convolutional layers, +.>
Figure FDA00041730271600000211
Refers to the ith branch of the third branch 3 Convolutional layers->
Figure FDA00041730271600000212
Refers to the ith branch of the third branch 3 -an output profile of 1 convolutional layer;
the channel attention module acquires global receptive fields by adopting global average pooling operation and global maximum pooling operation respectively, inputs the global receptive fields into a two-layer neural network respectively, wherein the number of neurons of the first layer is 1, the activation function is ReLU, the number of neurons of the second layer is 3, the two-layer neural networks are shared, then the two-layer neural networks are added to generate channel weights, and finally the channel weights are normalized to be between (0 and 1) through a normalization layer;
and (3) inputting a channel attention module before residual connection to obtain channel weights by pooling operation, multiplying the channel weights by the original output characteristic diagram, mapping the channel weights onto the output characteristic diagram of the shallow branch convolution layer, and carrying out residual connection on the characteristic diagram with the channel weights and the deep branch convolution layer comprising the second convolution layer of the first branch, the fourth convolution layer of the first branch and the output characteristic diagram of the first convolution layer of the second branch.
2. The multi-scale channel attention network based tool fault diagnosis method according to claim 1, wherein the convolution kernel of each convolution layer is 3 x 3, and a ReLU nonlinear activation function is added after each convolution layer.
3. The multi-scale-channel attention network based tool fault diagnosis method according to claim 1, wherein the output feature map scale of the 5 convolution layers of the first branch is 64×64×3, 32×32×3, 16×16×3,8×8×3,4×4×3, respectively; the output feature map scales of the 2 convolution layers of the second branch are respectively 32×32×3,8×8×3; the output feature map scale of 1 convolution layer of the third branch is 32×32×3.
4. The method for diagnosing a tool failure based on a multi-scale and channel attention network according to claim 1, wherein in the step (1), vibration signals of three axes of a main shaft X, Y, Z of a machine tool are collected by using a vibration sensor with a model KS903, and the sampling frequency is 10240Hz; and acquiring the cutter abrasion value by using a 19JC digital universal tool microscope.
5. The multi-scale channel attention network based tool fault diagnosis method according to claim 1, wherein 80% of the data after the data preprocessing is used as a training set, 20% is used as a test set, 80% of the training set is used as training, and 20% of the training set is used as a verification set.
6. The multi-scale-channel attention network based tool fault diagnosis method according to claim 1, wherein in the step (2), tool wear stage classification is performed according to tool wear values, and vibration signals are classified by using classification results as labels; the method specifically comprises the following steps:
drawing a graph of the maximum abrasion value of the tool surface of the auxiliary tool in the whole life cycle and a time domain graph of a vibration signal in the whole life cycle of the X axis of a main shaft of a machine tool; wherein, the abscissa of the graph of the maximum abrasion value is the number of times of feeding, and the ordinate is the maximum abrasion value; the abscissa of the time domain graph of the vibration signal is time, and the ordinate is vibration signal amplitude;
vibration signals are classified into three types according to the slope change of the maximum abrasion value curve, when the slope of the maximum abrasion value curve is not more than 0.01, the corresponding vibration signals are in a rapid initial abrasion stage, when the slope of the maximum abrasion value curve is between 0.01 and 0.3, the corresponding vibration signals are in a steady abrasion stage, and when the slope of the maximum abrasion value curve is not less than 0.3, the corresponding vibration signals are in a rapid abrasion stage.
7. The multi-scale-channel attention network based tool fault diagnosis method according to any one of claims 1 to 6, wherein in step (4), a cross entropy function L is used as a loss function, as shown in formula (iv):
Figure FDA0004173027160000031
in the formula (IV), y i The true value is represented by a value that is true,
Figure FDA0004173027160000032
representing the predicted value, N is the number of samples.
8. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor, when executing the computer program, implements the steps of the multi-scale-channel attention network based tool fault diagnosis method of any of claims 1-7.
9. A computer readable storage medium having stored thereon a computer program, characterized in that the computer program when executed by a processor implements the steps of the multi-scale-channel attention network based tool fault diagnosis method of any of claims 1-7.
CN202110662716.8A 2021-06-15 2021-06-15 Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium Active CN113378725B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110662716.8A CN113378725B (en) 2021-06-15 2021-06-15 Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110662716.8A CN113378725B (en) 2021-06-15 2021-06-15 Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN113378725A CN113378725A (en) 2021-09-10
CN113378725B true CN113378725B (en) 2023-06-06

Family

ID=77574456

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110662716.8A Active CN113378725B (en) 2021-06-15 2021-06-15 Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113378725B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114066838A (en) * 2021-11-11 2022-02-18 南方电网电力科技股份有限公司 Detection method and device for product surface defects based on multi-scale attention mechanism
CN114161228B (en) * 2022-02-14 2022-06-21 湖南大学 Tool wear prediction method, device, equipment and storage medium
CN114559298B (en) * 2022-02-25 2022-12-06 中国科学院合肥物质科学研究院 Cutter wear monitoring method based on physical information fusion
CN114818795B (en) * 2022-04-13 2024-10-22 浙大宁波理工学院 Transfer learning network model, model training method and model-based monitoring system
CN114897028B (en) * 2022-05-27 2023-04-07 清华大学 Numerical control machining surface roughness prediction method and device
CN115310486B (en) * 2022-08-09 2023-09-26 重庆大学 Intelligent welding quality detection method
CN115582733A (en) * 2022-10-28 2023-01-10 大连工业大学 A Milling Cutter Wear Monitoring Method Based on Residual Structure and Convolutional Neural Network
CN115564789A (en) * 2022-12-01 2023-01-03 北京矩视智能科技有限公司 Method and device for segmenting defect region of workpiece by cross-level fusion and storage medium
CN115971970B (en) * 2022-12-02 2024-03-26 西南交通大学 A milling cutter wear monitoring method based on multi-parameter guided spatial attention mechanism
CN116690313B (en) * 2023-08-01 2023-12-12 成都飞机工业(集团)有限责任公司 Failure monitoring method for machining cutter of web plate of aircraft structural member

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111709265A (en) * 2019-12-11 2020-09-25 深学科技(杭州)有限公司 Camera monitoring state classification method based on attention mechanism residual error network
CN111738111A (en) * 2020-06-10 2020-10-02 杭州电子科技大学 A Road Extraction Method Based on Multi-branch Cascaded Hole Spatial Pyramid for High Resolution Remote Sensing Images
CN112284736A (en) * 2020-10-23 2021-01-29 天津大学 A Convolutional Neural Network Fault Diagnosis Method Based on Multi-Channel Attention Module
CN112576492A (en) * 2020-12-09 2021-03-30 中国石油天然气集团有限公司 Intelligent diagnosis method for electric submersible pump production well fault

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11676284B2 (en) * 2019-03-22 2023-06-13 Nvidia Corporation Shape fusion for image analysis
CN111721536B (en) * 2020-07-20 2022-05-27 哈尔滨理工大学 Rolling bearing fault diagnosis method for improving model migration strategy
CN112254964A (en) * 2020-09-03 2021-01-22 太原理工大学 Rolling bearing fault diagnosis method based on rapid multi-scale convolution neural network
CN112233026A (en) * 2020-09-29 2021-01-15 南京理工大学 A SAR image denoising method based on multi-scale residual attention network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111709265A (en) * 2019-12-11 2020-09-25 深学科技(杭州)有限公司 Camera monitoring state classification method based on attention mechanism residual error network
CN111738111A (en) * 2020-06-10 2020-10-02 杭州电子科技大学 A Road Extraction Method Based on Multi-branch Cascaded Hole Spatial Pyramid for High Resolution Remote Sensing Images
CN112284736A (en) * 2020-10-23 2021-01-29 天津大学 A Convolutional Neural Network Fault Diagnosis Method Based on Multi-Channel Attention Module
CN112576492A (en) * 2020-12-09 2021-03-30 中国石油天然气集团有限公司 Intelligent diagnosis method for electric submersible pump production well fault

Also Published As

Publication number Publication date
CN113378725A (en) 2021-09-10

Similar Documents

Publication Publication Date Title
CN113378725B (en) Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium
CN110509109B (en) Cutter wear monitoring method based on multi-scale depth convolution cyclic neural network
CN111037365B (en) Tool Condition Monitoring Dataset Enhancement Method Based on Generative Adversarial Networks
CN112257530B (en) Rolling bearing fault diagnosis method based on blind signal separation and support vector machine
CN113011256B (en) Cross-category fault diagnosis method and system based on small sample learning and storage medium
WO2024065777A1 (en) Method, apparatus, electronic device, and storage medium for diagnosing industrial fault
CN113128561A (en) Machine tool bearing fault diagnosis method
CN112541233A (en) Rotary machine fault diagnosis method based on improved convolutional neural network
CN112766182B (en) Broaching tool wear state identification method
CN112487890B (en) Bearing acoustic signal fault diagnosis method based on parallel sparse filtering
CN116787225B (en) A method for monitoring wear of rotating tools based on acoustic and vibration signal feature fusion graph
CN111753891A (en) A rolling bearing fault diagnosis method based on unsupervised feature learning
CN117520825B (en) Industrial master machining workpiece quality prediction method based on multi-scale feature fusion
CN108393744B (en) Multi-sensing monitoring method for cutter state
CN114462480A (en) A method for fault diagnosis of multi-source sensor rolling mill based on unbalanced data set
CN117628005A (en) A fusion signal hydraulic motor fault diagnosis method and system
CN110488607A (en) A kind of recognition methods that lathe tool is worn based on convolution residual error network and transfer learning
CN116625683A (en) Wind turbine generator system bearing fault identification method, system and device and electronic equipment
CN115877205A (en) Intelligent fault diagnosis system and method for servo motor
CN109434562A (en) Milling cutter state of wear recognition methods based on partition clustering
CN112861275A (en) Rotary machine fault diagnosis method based on minimum information entropy feature learning model
CN113807431A (en) An intelligent spindle state assessment method and system based on multi-source information fusion
CN108427375A (en) A method of based on the more sensor monitoring cutting tool states of bandpass filtering treatment
Li et al. Tool fault diagnosis based on improved multiscale network and feature fusion
CN114997333B (en) Fault diagnosis method and device for wind driven generator

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant