CN113378725B

CN113378725B - Multi-scale-channel attention network-based tool fault diagnosis method, equipment and storage medium

Info

Publication number: CN113378725B
Application number: CN202110662716.8A
Authority: CN
Inventors: 袁东风; 狄子钧; 周晓天; 李东阳; 梁道君
Original assignee: Shandong University
Current assignee: Shandong University
Priority date: 2021-06-15
Filing date: 2021-06-15
Publication date: 2023-06-06
Anticipated expiration: 2041-06-15
Also published as: CN113378725A

Abstract

The invention relates to a tool fault diagnosis method, equipment and storage medium based on a multi-scale-channel attention network, which comprises the following steps: (1) data acquisition; (2) data preprocessing; (3) constructing a multi-scale-channel attention network model; (4) training; (5) testing. According to the invention, the multi-scale-channel attention network is adopted to carry out tool wear fault diagnosis, and a channel attention mechanism is introduced into residual connection of the multi-scale network, so that different importance degrees of vibration signals of a machine tool spindle in three directions on tool wear state classification tasks are mapped to a characteristic learning process, and data are better fused.

Description

A tool fault diagnosis method, device and storage medium based on multi-scale-channel attention network

技术领域Technical Field

本发明涉及智能制造产品质量管控领域，特别涉及一种基于多尺度-通道注意力网络的刀具故障诊断方法、设备及存储介质。The present invention relates to the field of intelligent manufacturing product quality control, and in particular to a tool fault diagnosis method, device and storage medium based on a multi-scale-channel attention network.

背景技术Background Art

数控机床作为“工业母机”，在生产加工过程中应用非常广泛。刀具作为数控机床的切削工具，其实时健康状态直接影响着机床的加工效率和产品质量。刀具与工件直接接触并相互作用，在刀具高速切削的过程中，会产生不可避免的磨损损伤，对刀具磨损状态的精准监测有助于避免因刀具失效导致的产品质量问题。As the "mother machine of industry", CNC machine tools are widely used in the production and processing process. As the cutting tool of CNC machine tools, the real-time health status of the tool directly affects the processing efficiency and product quality of the machine tool. The tool is in direct contact and interacts with the workpiece. In the process of high-speed cutting of the tool, inevitable wear and damage will occur. Accurate monitoring of the tool wear status helps to avoid product quality problems caused by tool failure.

现阶段，在实际工业现场仍使用人工检测的方法，这种检测方法耗时耗力，并且存在由作业人员自身原因而引起的误差。随着制造业由自动化向智能化转型，人工智能技术与制造业深度融合成为关键。我们希望在故障诊断方法中导入深度学习，取代传统人工直接接触的检测方法，从而实现刀具精准的故障诊断。At present, manual inspection methods are still used in actual industrial sites. This inspection method is time-consuming and labor-intensive, and there are errors caused by the operators themselves. As the manufacturing industry transforms from automation to intelligence, the deep integration of artificial intelligence technology and manufacturing has become the key. We hope to introduce deep learning into the fault diagnosis method to replace the traditional manual direct contact inspection method, so as to achieve accurate fault diagnosis of cutting tools.

一般的智能化检测过程为采集监测信号(例如切削力信号、振动信号、声发射信号、主轴电流信号等)，并分析这些信号与刀具磨损状态之间的映射关系。目前，已经有大量国内外学者开展了此类研究。Hu等提出了一种多尺度网络(Multiscale Network，MSNet)，该网络包含一个三分支结构，每个分支具有不同的卷积层深度，从而可以提取一维振动信号不同层次的特征，并通过全连接合并这些特征。李鹏等使用一维卷积网络对加工过程中机床主轴和工作台的振动与声发射信号进行初步特征提取，之后将信号特征输入长短时记忆网络进行分析，最后得出刀具磨损状态的评估结果，分类准确率达到93.8％。Hsieh等使用快速傅里叶变换将振动时域信号转换为频域信号，并通过类均值散射准则提取频域特征，将特征输入单层的卷积神经网络，对刀具磨损状态进行分类，根据实验结果，发现Z方向振动信号与X方向或Y方向振动信号的组合可以得到更好的分类结果。为了优化模型的梯度传播，文章在具有相同特征尺度寸的卷积层之间加入残差连接。然而，上述方法存在以下问题：(1)受限于数据集的有效且单一，只能将单通道振动信号转换为单通道图像，单通道包含的特征匮乏，可能会对刀具故障分类精度有所影响；(2)受限于特征提取与特征分析分步完成，模型的自学习能力弱，最终的刀具故障诊断的准确率低；(3)受限于模型结构简单，没有使用有效的数据融合方法，导致不同方向的振动信号特征融合效果差，无法有效地提取数据特征。The general intelligent detection process is to collect monitoring signals (such as cutting force signals, vibration signals, acoustic emission signals, spindle current signals, etc.) and analyze the mapping relationship between these signals and tool wear status. At present, a large number of domestic and foreign scholars have carried out such research. Hu et al. proposed a multiscale network (MSNet), which contains a three-branch structure, each branch has a different convolution layer depth, so that the features of different levels of one-dimensional vibration signals can be extracted and merged through full connection. Li Peng et al. used a one-dimensional convolutional network to perform preliminary feature extraction on the vibration and acoustic emission signals of the machine tool spindle and worktable during processing, and then input the signal features into the long short-term memory network for analysis, and finally obtained the evaluation results of the tool wear status, with a classification accuracy of 93.8%. Hsieh et al. used fast Fourier transform to convert the vibration time domain signal into frequency domain signal, and extracted the frequency domain features through the mean scattering criterion, and input the features into a single-layer convolutional neural network to classify the tool wear status. According to the experimental results, it was found that the combination of Z-direction vibration signal and X-direction or Y-direction vibration signal can obtain better classification results. In order to optimize the gradient propagation of the model, the article adds residual connections between convolutional layers with the same feature scale. However, the above method has the following problems: (1) Due to the validity and singleness of the data set, only single-channel vibration signals can be converted into single-channel images. The single channel contains insufficient features, which may affect the accuracy of tool fault classification; (2) Due to the fact that feature extraction and feature analysis are completed step by step, the self-learning ability of the model is weak, and the accuracy of the final tool fault diagnosis is low; (3) Due to the simple model structure, no effective data fusion method is used, resulting in poor fusion effect of vibration signal features in different directions, and the inability to effectively extract data features.

发明内容Summary of the invention

针对以上存在的问题，本发明提出了一种基于多尺度-通道注意力网络的刀具故障诊断方法。In view of the above problems, the present invention proposes a tool fault diagnosis method based on a multi-scale-channel attention network.

本发明通过刀具磨损试验平台，采集了符合实际工业生产现场的机床主轴振动信号与刀具磨损值，振动信号包含X、Y、Z三个方向。基于特征融合的思路，将X、Y、Z三个方向的振动信号进行拼接，构成三通道的多通道特征图。The present invention uses a tool wear test platform to collect machine tool spindle vibration signals and tool wear values that conform to actual industrial production sites. The vibration signals include three directions: X, Y, and Z. Based on the idea of feature fusion, the vibration signals in the three directions of X, Y, and Z are spliced to form a three-channel multi-channel feature map.

本发明采用卷积神经网络提取特征图的特征并进行分类任务，卷积神经网络具有出色的自适应特征学习能力，可以很好地挖掘数据的潜在特征。The present invention uses a convolutional neural network to extract the features of the feature map and perform classification tasks. The convolutional neural network has excellent adaptive feature learning capabilities and can well mine the potential features of the data.

在本发明中，为了将机床主轴三个方向振动信号对刀具磨损状态分类任务的不同重要程度映射至特征学习的过程中，使得数据更好地融合，对多尺度网络做出改进，将通道注意力引入多尺度网络。In the present invention, in order to map the different importance of the vibration signals in three directions of the machine tool spindle to the tool wear state classification task into the feature learning process, so that the data can be better integrated, the multi-scale network is improved and channel attention is introduced into the multi-scale network.

本发明还提供了一种计算机设备及存储介质。The invention also provides a computer device and a storage medium.

本发明的技术方案为：The technical solution of the present invention is:

一种基于多尺度-通道注意力网络的刀具故障诊断方法，包括步骤如下：A tool fault diagnosis method based on a multi-scale-channel attention network comprises the following steps:

(1)数据采集：(1) Data collection:

分别采集机床主轴X、Y、Z三个轴的振动信号与每一次走刀后的刀具磨损值，其中，机床主轴X、Y、Z三个轴所处的主轴坐标系根据右手笛卡尔直角坐标系建立；The vibration signals of the three axes X, Y, and Z of the machine tool spindle and the tool wear value after each tool pass are collected respectively. The spindle coordinate system of the three axes X, Y, and Z of the machine tool spindle is established according to the right-handed Cartesian rectangular coordinate system;

(2)数据预处理：(2) Data preprocessing:

根据刀具磨损值进行刀具磨损阶段分类，以分类结果作为标签，将振动信号进行分类；The tool wear stages are classified according to the tool wear values, and the vibration signals are classified using the classification results as labels;

将机床主轴X、Y、Z三个轴的振动信号按照时间顺序分段为长度为n²的切片，之后将机床主轴X、Y、Z三个轴的切片进行拼接，构建为n×n×3的三通道输入特征图，其中，n为振动信号切片转换为特征图时，特征图的高或宽；The vibration signals of the machine tool spindle X, Y, and Z axes are segmented into slices of length n ² in time order, and then the slices of the machine tool spindle X, Y, and Z axes are spliced to construct a three-channel input feature map of n×n×3, where n is the height or width of the feature map when the vibration signal slice is converted into a feature map;

数据预处理后的数据中选取训练集、验证集和测试集；Select training set, validation set and test set from the data after data preprocessing;

(3)构建多尺度-通道注意力网络模型：(3) Constructing a multi-scale-channel attention network model:

所述多尺度-通道注意力网络模型包括输入层、三个分支、全连接层；The multi-scale-channel attention network model includes an input layer, three branches, and a fully connected layer;

三个分支包括第一分支、第二分支、第三分支，第一分支包括5个卷积层；第二分支包括2个卷积层；第三分支包括1个卷积层；The three branches include a first branch, a second branch, and a third branch. The first branch includes 5 convolutional layers; the second branch includes 2 convolutional layers; and the third branch includes 1 convolutional layer.

在第一分支的第二个卷积层与第二分支的第一个卷积层、在第一分支的第四个卷积层与第二分支的第二个卷积层、在第二分支的第一个卷积层与第三分支的第一个卷积层之间均设置有残差连接，残差加和结果分别输入卷积层第一分支的第三个卷积层、第二分支的第四个卷积层、第二分支的第二个卷积层；Residual connections are set between the second convolution layer of the first branch and the first convolution layer of the second branch, between the fourth convolution layer of the first branch and the second convolution layer of the second branch, and between the first convolution layer of the second branch and the first convolution layer of the third branch. The residual summation results are respectively input into the third convolution layer of the first branch of the convolution layer, the fourth convolution layer of the second branch, and the second convolution layer of the second branch;

第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图做残差连接之前输入通道注意力模块。The output feature map of the first convolutional layer of the second branch, the output feature map of the second convolutional layer of the second branch, and the output feature map of the first convolutional layer of the third branch are input into the channel attention module before the residual connection.

(4)训练：将训练集输入多尺度-通道注意力网络模型进行训练，在训练的同时记录每一次训练周期的训练集准确率与损失函数；(4) Training: The training set is input into the multi-scale-channel attention network model for training, and the training set accuracy and loss function of each training cycle are recorded during training;

(5)测试：将测试集输入训练好的多尺度-通道注意力网络模型，输出测试集数据对应的刀具磨损阶段。(5) Testing: The test set is input into the trained multi-scale-channel attention network model, and the tool wear stage corresponding to the test set data is output.

根据本发明优选的，数据预处理后的数据的80％作为训练集，20％作为测试集，训练集中的80％用作训练，训练集中的20％作为验证集。Preferably according to the present invention, 80% of the data after data preprocessing is used as a training set, and 20% is used as a test set, 80% of the training set is used for training, and 20% of the training set is used as a validation set.

进一步优选的，三个分支包括第一分支、第二分支、第三分支的数学表达式分别如式(Ⅰ)、式(Ⅱ)、式(III)所示：Further preferably, the three branches include the first branch, the second branch, and the third branch, and their mathematical expressions are shown in formula (I), formula (II), and formula (III), respectively:

式(Ⅰ)、式(Ⅱ)、式(III)中，i₁＝1,2,…5，i₂＝1,2，i₃＝1；In formula (I), formula (II) and formula (III), i ₁ =1, 2, ... 5, i ₂ =1, 2, i ₃ =1;

是指第一分支的第i₁个卷积层的输出特征图，

是指第一分支的第i₁个卷积层，

是指第一分支的第i₁-1个卷积层的输出特征图；

refers to the output feature map of the _i1th convolutional layer of the first branch,

refers to the _i1th convolutional layer of the first branch,

It refers to the output feature map of the i ₁ -1th convolutional layer of the first branch;

是指第二分支的第i₂个卷积层的输出特征图，

是指第二分支的第i₂个卷积层，

是指第二分支的第i₂-1个卷积层的输出特征图；

refers to the output feature map of the _i2th convolutional layer of the second branch,

refers to the i ₂ th convolutional layer of the second branch,

It refers to the output feature map of the i ₂ -1th convolutional layer of the second branch;

是指第三分支的第i₃个卷积层的输出特征图，

是指第三分支的第i₃个卷积层，

是指第三分支的第i₃-1个卷积层的输出特征图。

refers to the output feature map of the i _3rd convolutional layer of the third branch,

refers to the _3rd convolutional layer of the third branch,

It refers to the output feature map of the i ₃ -1 th convolutional layer of the third branch.

进一步优选的，每个卷积层的卷积核为3×3，每个卷积层之后加一个ReLU非线性激活函数。Further preferably, the convolution kernel of each convolution layer is 3×3, and a ReLU nonlinear activation function is added after each convolution layer.

进一步优选的，第一分支的5个卷积层的输出特征图尺度分别为64×64×3，32×32×3，16×16×3，8×8×3，4×4×3；第二分支的2个卷积层的输出特征图尺度分别为32×32×3，8×8×3；第三分支的1个卷积层的输出特征图尺度为32×32×3。Further preferably, the output feature map scales of the five convolutional layers of the first branch are 64×64×3, 32×32×3, 16×16×3, 8×8×3, and 4×4×3 respectively; the output feature map scales of the two convolutional layers of the second branch are 32×32×3 and 8×8×3 respectively; and the output feature map scale of one convolutional layer of the third branch is 32×32×3.

进一步优选的，通道注意力模块分别采用全局平均池化操作与全局最大池化操作获取全局感受野，再将它们分别输入一个两层的神经网络，第一层神经元个数为1，激活函数为ReLU，第二层神经元个数为3，这个两层的神经网络是共享的，之后将它们进行相加生成通道权重，最后通过归一化层将通道权重归一化至(0,1)之间。Further preferably, the channel attention module uses global average pooling operation and global maximum pooling operation to obtain the global receptive field, and then inputs them into a two-layer neural network, the number of neurons in the first layer is 1, the activation function is ReLU, and the number of neurons in the second layer is 3. The two-layer neural network is shared, and then they are added to generate channel weights, and finally the channel weights are normalized to between (0,1) through the normalization layer.

进一步优选的，将第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图，做残差连接之前输入通道注意力模块，经过池化操作得到通道权重，之后通道权重与原始输出特征图进行相乘，将通道权重映射到浅层分支卷积层的输出特征图上，带有通道权重的特征图与深层分支卷积层包括第一分支的第二个卷积层、在第一分支的第四个卷积层、在第二分支的第一个卷积层的输出特征图进行残差连接，输入深层分支卷积层的下一个卷积层。Further preferably, the output feature map of the first convolutional layer of the second branch, the output feature map of the second convolutional layer of the second branch, and the output feature map of the first convolutional layer of the third branch are input into the channel attention module before the residual connection, and the channel weight is obtained after the pooling operation. Then, the channel weight is multiplied by the original output feature map, and the channel weight is mapped to the output feature map of the shallow branch convolutional layer. The feature map with the channel weight is residually connected with the output feature map of the deep branch convolutional layer including the second convolutional layer of the first branch, the fourth convolutional layer of the first branch, and the first convolutional layer of the second branch, and is input into the next convolutional layer of the deep branch convolutional layer.

根据本发明优选的，步骤(1)中，采用型号为KS903的振动传感器采集机床主轴X、Y、Z三个轴的振动信号，采样频率为10240Hz；采用19JC数字式万能工具显微镜采集刀具磨损值。Preferably, according to the present invention, in step (1), a vibration sensor of model KS903 is used to collect vibration signals of the three axes X, Y, and Z of the machine tool spindle, and the sampling frequency is 10240 Hz; and a 19JC digital universal tool microscope is used to collect tool wear values.

根据本发明优选的，步骤(2)中，根据刀具磨损值进行刀具磨损阶段分类，以分类结果作为标签，将振动信号进行分类；具体是指：According to the preferred embodiment of the present invention, in step (2), the tool wear stage is classified according to the tool wear value, and the vibration signal is classified using the classification result as a label; specifically, it refers to:

绘制刀具全生命周期副后刀面最大磨损值的曲线图，与机床主轴X轴的全生命周期振动信号的时域图；其中，最大磨损值的曲线图的横坐标为走刀次数，纵坐标为最大磨损值大小；振动信号的时域图的横坐标为时间，纵坐标为振动信号幅值；Draw a curve graph of the maximum wear value of the secondary flank of the tool during its entire life cycle, and a time domain graph of the vibration signal of the X-axis of the machine tool spindle during its entire life cycle; the abscissa of the curve graph of the maximum wear value is the number of tool passes, and the ordinate is the maximum wear value; the abscissa of the time domain graph of the vibration signal is time, and the ordinate is the vibration signal amplitude;

根据最大磨损值曲线斜率变化将振动信号分为三类，最大磨损值曲线斜率不大于0.01时，对应的振动信号处于快速初始磨损阶段，最大磨损值曲线斜率在0.01至0.3之间时，对应的振动信号处于稳态磨损阶段，最大磨损值曲线斜率不小于0.3时，对应的振动信号处于急速磨损阶段；The vibration signal is divided into three categories according to the change of the slope of the maximum wear value curve. When the slope of the maximum wear value curve is not greater than 0.01, the corresponding vibration signal is in the rapid initial wear stage. When the slope of the maximum wear value curve is between 0.01 and 0.3, the corresponding vibration signal is in the steady-state wear stage. When the slope of the maximum wear value curve is not less than 0.3, the corresponding vibration signal is in the rapid wear stage.

根据本发明优选的，步骤(4)中，使用交叉熵函数L作为损失函数，如式(IV)所示：Preferably, according to the present invention, in step (4), a cross entropy function L is used as a loss function, as shown in formula (IV):

式(IV)中，y_i表示真实值，

表示预测值，N为样本数量。In formula (IV), _yi represents the true value,

represents the predicted value, and N is the number of samples.

一种计算机设备，包括存储器和处理器，所述存储器存储有计算机程序，所述处理器执行所述计算机程序时实现基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer device comprises a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps of a tool fault diagnosis method based on a multi-scale-channel attention network are implemented.

一种计算机可读存储介质，其上存储有计算机程序，所述计算机程序被处理器执行时实现基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer-readable storage medium stores a computer program, which, when executed by a processor, implements the steps of a tool fault diagnosis method based on a multi-scale-channel attention network.

本发明的有益效果为：The beneficial effects of the present invention are:

1.本发明根据机床主轴振动信号与刀具磨损状态的映射关系，采用卷积神经网络对刀具故障进行诊断。1. The present invention uses a convolutional neural network to diagnose tool faults based on the mapping relationship between the machine tool spindle vibration signal and the tool wear status.

2.本发明基于特征融合的思路，三个方向的振动信号作为输入特征图的三个通道，将机床主轴三个轴的振动信号进行拼接，之后输入卷积神经网络。2. The present invention is based on the idea of feature fusion. The vibration signals in three directions are used as the three channels of the input feature map. The vibration signals of the three axes of the machine tool spindle are spliced and then input into the convolutional neural network.

3.本发明为了使三个方向的振动信号更有效地进行融合，将通道注意力机制引入多尺度卷积神经网络，利用通道之间的相互依赖性，将三个方向的振动信号对刀具状态分类的不同重要程度映射至特征学习的过程中。3. In order to more effectively fuse the vibration signals in three directions, the present invention introduces the channel attention mechanism into the multi-scale convolutional neural network, and utilizes the interdependence between channels to map the different importance of the vibration signals in three directions to the tool state classification into the feature learning process.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1(a)为刀具全生命周期副后刀面最大磨损值曲线图；Figure 1(a) is a graph showing the maximum wear value of the secondary flank during the tool life cycle;

图1(b)为机床主轴X方向的全生命周期振动信号时域图；Figure 1(b) is a time domain diagram of the full life cycle vibration signal of the machine tool spindle in the X direction;

图2为多尺度-通道注意力网络模型的结构示意图；Figure 2 is a schematic diagram of the structure of a multi-scale-channel attention network model;

图3(a)为多尺度-通道注意力网络模型训练过程的准确率曲线图；Figure 3(a) is a graph showing the accuracy of the multi-scale-channel attention network model training process;

图3(b)为多尺度-通道注意力网络模型训练过程的损失函数曲线图；Figure 3(b) is a graph of the loss function during the training process of the multi-scale-channel attention network model;

图4为混淆矩阵示意图；Figure 4 is a schematic diagram of a confusion matrix;

具体实施方式DETAILED DESCRIPTION

下面结合说明书附图和实施例对本发明作进一步限定，但不限于此。The present invention will be further defined below in conjunction with the accompanying drawings and embodiments, but is not limited thereto.

实施例1Example 1

(1)数据采集：(1) Data collection:

为了采集符合实际工业生产场景的真实数据，本发明设计了刀具磨损试验平台。利用刀具磨损试验平台，分别采集机床主轴X、Y、Z三个轴的振动信号与每一次走刀后的刀具磨损值，其中，机床主轴X、Y、Z三个轴所处的主轴坐标系根据右手笛卡尔直角坐标系建立；In order to collect real data that conforms to actual industrial production scenarios, the present invention designs a tool wear test platform. Using the tool wear test platform, the vibration signals of the three axes of the machine tool spindle X, Y, and Z and the tool wear value after each tool pass are collected respectively, wherein the spindle coordinate system of the three axes of the machine tool spindle X, Y, and Z is established according to the right-handed Cartesian rectangular coordinate system;

(2)数据预处理：(2) Data preprocessing:

刀具磨损值曲线的斜率反映了当前阶段刀具磨损的程度，本发明根据刀具磨损值进行刀具磨损阶段分类，以分类结果作为标签，将振动信号进行分类；The slope of the tool wear value curve reflects the degree of tool wear at the current stage. The present invention classifies the tool wear stages according to the tool wear value, and uses the classification results as labels to classify the vibration signals;

将机床主轴X、Y、Z三个轴的振动信号按照时间顺序分段为长度为n²的切片，之后将机床主轴X、Y、Z三个轴的切片进行拼接，构建为n×n×3的三通道输入特征图，其中，n为振动信号切片转换为特征图时，特征图的高或宽；该数字为人为定义大小；The vibration signals of the X, Y, and Z axes of the machine tool spindle are segmented into slices of length n ² in time order, and then the slices of the X, Y, and Z axes of the machine tool spindle are spliced to construct a three-channel input feature map of n×n×3, where n is the height or width of the feature map when the vibration signal slice is converted into a feature map; this number is an artificially defined size;

数据预处理后的数据的80％作为训练集，20％作为测试集，训练集中的80％用作训练，训练集中的20％作为验证集。80% of the preprocessed data is used as the training set, 20% is used as the test set, 80% of the training set is used for training, and 20% of the training set is used as the validation set.

多尺度网络包含三个分支，不同分支的卷积层深度不同，较深的网络分支可以提取到更多的局部信息，较浅的网络分支可以提取到更多的局部信息，最后在全连接层将不同层次的特征进行特征融合。多尺度网络在不同分支且具有相同输出特征图尺度的卷积层之间设置残差连接，这样帮助网络梯度传输，提升网络的性能。The multi-scale network consists of three branches. The depth of the convolution layer of different branches is different. The deeper network branches can extract more local information, and the shallower network branches can extract more local information. Finally, the features of different levels are fused in the fully connected layer. The multi-scale network sets residual connections between the convolution layers of different branches with the same output feature map scale, which helps the network gradient transmission and improves the network performance.

在较浅的网络分支的卷积层做残差连接之前，将卷积层的输出特征图输入通道注意力模块，通过池化操作，捕捉X、Y、Z三个方向振动信号的相关性，得到通道注意力权重，之后通道权重与输出特征图进行像素点相乘，将通道权重应设置特征图上。带有通道注意力权重的特征图与较深网络分支的卷积层的输出特征图做残差连接，从而将三个方向的不同重要程度映射至特征学习的过程中。Before the convolution layer of the shallower network branch is connected with the residual connection, the output feature map of the convolution layer is input into the channel attention module. Through the pooling operation, the correlation of the vibration signals in the three directions of X, Y, and Z is captured to obtain the channel attention weight. Then the channel weight is multiplied by the output feature map pixel by pixel, and the channel weight should be set on the feature map. The feature map with the channel attention weight is connected with the output feature map of the convolution layer of the deeper network branch with the residual connection, so as to map the different importance of the three directions to the feature learning process.

如图2所示，多尺度-通道注意力网络模型包括输入层、三个分支、全连接层；As shown in Figure 2, the multi-scale-channel attention network model includes an input layer, three branches, and a fully connected layer;

在不同分支、具有相同输出特征图尺度的卷积层之间实现恒等映射，将浅层卷积网络分支的特征图与深层卷积网络分支的特征图做残差连接，之后输入深层卷积网络分支的下一层卷积层。在第一分支的第二个卷积层与第二分支的第一个卷积层、在第一分支的第四个卷积层与第二分支的第二个卷积层、在第二分支的第一个卷积层与第三分支的第一个卷积层之间均设置有残差连接，残差加和结果分别输入卷积层第一分支的第三个卷积层、第二分支的第四个卷积层、第二分支的第二个卷积层；残差连接可以在前向过程中帮助网络中的特征进行恒等映射，当浅层网络的输出已经达到最优时，特征直接传递到深层网络；在反向过程中帮助传导梯度，让更深的模型能够成功训练，提高网络的性能。To achieve identity mapping between convolutional layers of different branches with the same output feature map scale, the feature map of the shallow convolutional network branch is residually connected with the feature map of the deep convolutional network branch, and then input into the next convolutional layer of the deep convolutional network branch. Residual connections are set between the second convolutional layer of the first branch and the first convolutional layer of the second branch, the fourth convolutional layer of the first branch and the second convolutional layer of the second branch, and the first convolutional layer of the second branch and the first convolutional layer of the third branch. The residual summation results are input into the third convolutional layer of the first branch of the convolutional layer, the fourth convolutional layer of the second branch, and the second convolutional layer of the second branch respectively; the residual connection can help the features in the network to be identically mapped in the forward process. When the output of the shallow network has reached the optimal level, the features are directly transferred to the deep network; in the reverse process, it helps to conduct gradients, so that deeper models can be successfully trained and the performance of the network can be improved.

在浅层分支卷积层的输出特征图做残差连接之前输入通道注意力模块，第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图做残差连接之前输入通道注意力模块。The channel attention module is input before the residual connection is made to the output feature map of the shallow branch convolution layer, and the channel attention module is input before the residual connection is made to the output feature map of the first convolution layer of the second branch, the output feature map of the second convolution layer of the second branch, and the output feature map of the first convolution layer of the third branch.

三个分支包括第一分支、第二分支、第三分支的数学表达式分别如式(Ⅰ)、式(Ⅱ)、式(III)所示：The mathematical expressions of the three branches including the first branch, the second branch and the third branch are shown in formula (I), formula (II) and formula (III) respectively:

是指第一分支的第i₁个卷积层的输出特征图，

是指第一分支的第i₁个卷积层，

是指第一分支的第i₁-1个卷积层的输出特征图；

refers to the _i1th convolutional layer of the first branch,

是指第二分支的第i₂个卷积层的输出特征图，

是指第二分支的第i₂个卷积层，

是指第二分支的第i₂-1个卷积层的输出特征图；

refers to the i ₂ th convolutional layer of the second branch,

是指第三分支的第i₃个卷积层的输出特征图，

是指第三分支的第i₃个卷积层，

是指第三分支的第i₃-1个卷积层的输出特征图。

refers to the _3rd convolutional layer of the third branch,

每个卷积层的卷积核为3×3，每个卷积层之后加一个ReLU非线性激活函数。The convolution kernel of each convolution layer is 3×3, and a ReLU nonlinear activation function is added after each convolution layer.

第一分支的5个卷积层的输出特征图尺度分别为64×64×3，32×32×3，16×16×3，8×8×3，4×4×3；第二分支的2个卷积层的输出特征图尺度分别为32×32×3，8×8×3；第三分支的1个卷积层的输出特征图尺度为32×32×3。The output feature map scales of the five convolutional layers of the first branch are 64×64×3, 32×32×3, 16×16×3, 8×8×3, and 4×4×3 respectively; the output feature map scales of the two convolutional layers of the second branch are 32×32×3 and 8×8×3 respectively; the output feature map scale of the one convolutional layer of the third branch is 32×32×3.

通道注意力模块分别采用全局平均池化操作与全局最大池化操作获取全局感受野，再将它们分别输入一个两层的神经网络，第一层神经元个数为1，激活函数为ReLU，第二层神经元个数为3，这个两层的神经网络是共享的，之后将它们进行相加生成通道权重，最后通过归一化层将通道权重归一化至(0,1)之间。The channel attention module uses global average pooling and global maximum pooling operations to obtain the global receptive field, and then inputs them into a two-layer neural network. The number of neurons in the first layer is 1, the activation function is ReLU, and the number of neurons in the second layer is 3. The two-layer neural network is shared, and then they are added to generate channel weights. Finally, the channel weights are normalized to between (0,1) through the normalization layer.

在浅层分支卷积层的输出特征图做残差连接之前输入通道注意力模块，利用通道之间的相互依赖性，捕捉X、Y、Z三个方向振动信号的相关性，从而可以将三个方向的不同重要程度通过通道注意力学习的方式映射至特征提取的过程中。将第二分支的第一个卷积层的输出特征图、第二分支的第二个卷积层的输出特征图及第三分支的第一个卷积层的输出特征图，做残差连接之前输入通道注意力模块，经过池化操作得到通道权重，之后通道权重与原始输出特征图进行相乘，将通道权重映射到浅层分支卷积层的输出特征图上，带有通道权重的特征图与深层分支卷积层包括第一分支的第二个卷积层、在第一分支的第四个卷积层、在第二分支的第一个卷积层的输出特征图进行残差连接，输入深层分支卷积层的下一个卷积层。Before the output feature map of the shallow branch convolution layer is connected with the residual connection, the channel attention module is input. The mutual dependence between channels is used to capture the correlation of the vibration signals in the three directions of X, Y, and Z, so that the different importance of the three directions can be mapped to the feature extraction process through channel attention learning. The output feature map of the first convolution layer of the second branch, the output feature map of the second convolution layer of the second branch, and the output feature map of the first convolution layer of the third branch are input with the channel attention module before the residual connection. The channel weight is obtained through pooling operation, and then the channel weight is multiplied with the original output feature map. The channel weight is mapped to the output feature map of the shallow branch convolution layer. The feature map with the channel weight is residually connected with the output feature map of the deep branch convolution layer, including the second convolution layer of the first branch, the fourth convolution layer of the first branch, and the first convolution layer of the second branch, and input into the next convolution layer of the deep branch convolution layer.

以卷积层输出特征图

和

为例：

表示第一分支的第三个卷积层的输入，A表示通道注意力模块，

表示第一分支的第二个卷积层的输出特征图，

表示第二分支的第一个卷积层的输出特征图。通过高效通道注意力模块，提升影响因素较强的通道特征权重，而抑制影响因素较弱的通道特征权重。Output feature map with convolutional layer

and

For example:

represents the input of the third convolutional layer of the first branch, A represents the channel attention module,

represents the output feature map of the second convolutional layer of the first branch,

The output feature map of the first convolutional layer of the second branch. Through the efficient channel attention module, the weights of channel features with stronger influencing factors are increased, while the weights of channel features with weaker influencing factors are suppressed.

(5)测试：将测试集输入训练好的多尺度-通道注意力网络模型，输出测试集数据对应的刀具磨损阶段。记录测试集每一种类别的准确率，并绘制混淆矩阵。采用准确率(Accuracy)、精确率(Precision)、召回率(Recall)、F1分数(F1 Score)四个指标说明网络的性能。(5) Testing: Input the test set into the trained multi-scale-channel attention network model and output the tool wear stage corresponding to the test set data. Record the accuracy of each category in the test set and draw the confusion matrix. Use the four indicators of accuracy, precision, recall, and F1 score to illustrate the performance of the network.

实施例2Example 2

根据实施例1所述的一种基于多尺度-通道注意力网络的刀具故障诊断方法，其区别在于：The tool fault diagnosis method based on a multi-scale-channel attention network according to embodiment 1 is different in that:

步骤(1)中，采用型号为KS903的振动传感器采集机床主轴X、Y、Z三个轴的振动信号，采样频率为10240Hz；采用19JC数字式万能工具显微镜采集刀具磨损值。数据采集在刀具磨损试验平台进行，工件切削过程在数控机床立式加工中心(VDF-850)完成，刀具为直径10mm的三刃立铣刀，切削工件为45号钢圆柱形工件。数控机床数控机床主轴转速为2000r/min，进给速度为764mm/min。铣削深度和宽度分别为2mm和5mm。为了加速刀具磨损，采用无切削液的干切削方式。铣刀的加工总长度在110米左右，使刀具达到严重磨损的程度，并且有小面积的铣屑。每把刀具采集35-47组试验数据，每组铣削过程耗时4分17秒。同时，在每组切削完成之后，采用19JC数字式万能工具显微镜采集刀具磨损值，采用副后刀面最大磨损宽度VBmax作为刀具磨损阶段的分类标签。In step (1), a vibration sensor of model KS903 is used to collect vibration signals of the three axes X, Y, and Z of the machine tool spindle, and the sampling frequency is 10240Hz; a 19JC digital universal tool microscope is used to collect tool wear values. Data collection is carried out on a tool wear test platform, and the workpiece cutting process is completed on a CNC machine tool vertical machining center (VDF-850). The tool is a three-edge end mill with a diameter of 10mm, and the cutting workpiece is a 45# steel cylindrical workpiece. The spindle speed of the CNC machine tool is 2000r/min, and the feed speed is 764mm/min. The milling depth and width are 2mm and 5mm respectively. In order to accelerate tool wear, a dry cutting method without cutting fluid is adopted. The total processing length of the milling cutter is about 110 meters, so that the tool reaches a serious degree of wear and there is a small area of milling chips. 35-47 sets of test data are collected for each tool, and each set of milling process takes 4 minutes and 17 seconds. At the same time, after each group of cutting is completed, a 19JC digital universal tool microscope is used to collect tool wear values, and the maximum wear width VBmax of the secondary flank is used as the classification label of the tool wear stage.

步骤(2)中，根据刀具磨损值进行刀具磨损阶段分类，以分类结果作为标签，将振动信号进行分类；具体是指：In step (2), the tool wear stage is classified according to the tool wear value, and the vibration signal is classified using the classification result as a label; specifically, it means:

根据最大磨损值曲线斜率变化将振动信号分为三类，分别对应三个不同的刀具磨损阶段：快速初始磨损阶段、稳态磨损阶段和急速磨损阶段。刀具全生命周期最大磨损值如图1(a)所示。由图1(a)可以看出，刀具在1-5次走刀阶段的磨损较快，此阶段磨损曲线斜率较大。在6-41次走刀阶段，磨损值均匀增长，直到达到极限值，此阶段为刀具的有效工作时间。在42-47次走刀阶段，刀具磨损值急速上升并导致刀具失效，此阶段的磨损曲线斜率急速增大。虽然在图1(a)中初始磨损阶段不是很明显，但在图1(b)的振动信号时域图中，初始阶段振动信号的幅值较大，产生这种现象的原因是新刀表面粗糙不平，接触应力较大，同时，新刀有脱碳、氧化层引起的表面缺陷。根据以上分析，将振动信号分为三个阶段。According to the change of the slope of the maximum wear value curve, the vibration signal is divided into three categories, corresponding to three different tool wear stages: rapid initial wear stage, steady-state wear stage and rapid wear stage. The maximum wear value of the tool in the whole life cycle is shown in Figure 1(a). As can be seen from Figure 1(a), the tool wears faster in the 1-5 pass stage, and the slope of the wear curve in this stage is larger. In the 6-41 pass stage, the wear value increases evenly until it reaches the limit value. This stage is the effective working time of the tool. In the 42-47 pass stage, the tool wear value rises rapidly and causes the tool to fail. The slope of the wear curve in this stage increases rapidly. Although the initial wear stage is not very obvious in Figure 1(a), in the vibration signal time domain diagram of Figure 1(b), the amplitude of the vibration signal in the initial stage is large. The reason for this phenomenon is that the surface of the new tool is rough and uneven, the contact stress is large, and the new tool has surface defects caused by decarburization and oxidation layer. According to the above analysis, the vibration signal is divided into three stages.

在振动信号输入多尺度-通道注意力网络模型之前，将振动信号按照信号的时间顺序分段为长度为64²的切片，之后将X、Y、Z三个方向的振动信号切片作为输入特征图的三个通道进行拼接。假设有X、Y、Z方向的三个同时段的信号切片，

将切片构建为64×64×3的输入特征图输入多尺度-通道注意力网络模型。Before the vibration signal is input into the multi-scale-channel attention network model, the vibration signal is segmented into slices of length 64 ² according to the time order of the signal, and then the vibration signal slices in the three directions of X, Y, and Z are spliced as the three channels of the input feature map. Assuming there are three signal slices in the same period of the X, Y, and Z directions,

The slices are constructed as 64×64×3 input feature maps and input into the multi-scale-channel attention network model.

步骤(4)中，训练的迭代次数设置为100次，使用交叉熵函数L作为损失函数，如式(IV)所示：In step (4), the number of training iterations is set to 100, and the cross entropy function L is used as the loss function, as shown in formula (IV):

式(IV)中，y_i表示真实值，

represents the predicted value, and N is the number of samples.

训练过程中记录每一次训练周期的训练集与验证集的准确率与损失函数值。训练过程结束后保存最优的验证模型做测试，并绘制训练集准确率曲线与训练过程损失函数曲线，图3(a)为多尺度-通道注意力网络模型训练过程的准确率曲线图；图3(b)为多尺度-通道注意力网络模型训练过程的损失函数曲线图；During the training process, the accuracy and loss function values of the training set and validation set are recorded for each training cycle. After the training process is completed, the optimal validation model is saved for testing, and the training set accuracy curve and the training process loss function curve are plotted. Figure 3(a) is the accuracy curve of the multi-scale-channel attention network model training process; Figure 3(b) is the loss function curve of the multi-scale-channel attention network model training process;

记录测试集每一种类别的准确率，并绘制混淆矩阵，图4为混淆矩阵示意图；表1为多尺度-通道注意力网络模型的四种评价指标表。包括准确率(Accuracy)、精确率(Precision)、召回率(Recall)、F1分数(F1 Score)四个指标说明多尺度-通道注意力网络模型的性能，Record the accuracy of each category of the test set and draw a confusion matrix. Figure 4 is a schematic diagram of the confusion matrix. Table 1 is a table of four evaluation indicators of the multi-scale-channel attention network model. The four indicators of accuracy, precision, recall, and F1 score illustrate the performance of the multi-scale-channel attention network model.

表1Table 1

从表1可知，准确率(Accuracy)、精确率(Precision)、召回率(Recall)、F1分数(F1Score)四个指标都较高，说明多尺度-通道注意力网络模型性能较优。As can be seen from Table 1, the four indicators of accuracy, precision, recall and F1 score are all high, indicating that the multi-scale-channel attention network model has better performance.

实施例3Example 3

一种计算机设备，包括存储器和处理器，存储器存储有计算机程序，处理器执行计算机程序时实现实施例1或2所述的基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer device includes a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps of the tool fault diagnosis method based on a multi-scale-channel attention network described in Example 1 or 2 are implemented.

实施例4Example 4

一种计算机可读存储介质，其上存储有计算机程序，计算机程序被处理器执行时实现实施例1或2所述的基于多尺度-通道注意力网络的刀具故障诊断方法的步骤。A computer-readable storage medium having a computer program stored thereon, which, when executed by a processor, implements the steps of the tool fault diagnosis method based on a multi-scale-channel attention network described in Example 1 or 2.

Claims

1. The tool fault diagnosis method based on the multiscale-channel attention network is characterized by comprising the following steps of:

(1) And (3) data acquisition:

respectively collecting vibration signals of three axes of a machine tool spindle X, Y, Z and cutter abrasion values after each feeding, wherein a spindle coordinate system of the machine tool spindle X, Y, Z is established according to a right-hand Cartesian rectangular coordinate system;

(2) Data preprocessing:

classifying the cutter abrasion stage according to the cutter abrasion value, and classifying the vibration signals by taking the classification result as a label;

the vibration signals of the three axes of the main shaft X, Y, Z of the machine tool are segmented into the length n according to the time sequence ² Then splicing the sections of the three axes of the main shaft X, Y, Z of the machine tool to construct an n multiplied by 3 three-channel input feature map, wherein n is the height or width of the feature map when the vibration signal section is converted into the feature map;

selecting a training set, a verification set and a test set from the data after the data preprocessing;

(3) Constructing a multi-scale-channel attention network model:

the multi-scale-channel attention network model comprises an input layer, three branches and a full-connection layer;

the three branches comprise a first branch, a second branch and a third branch, and the first branch comprises 5 convolution layers; the second branch comprises 2 convolution layers; the third branch comprises 1 convolution layer;

residual error connection is arranged between the second convolution layer of the first branch and the first convolution layer of the second branch, between the fourth convolution layer of the first branch and the second convolution layer of the second branch and between the first convolution layer of the second branch and the first convolution layer of the third branch, and residual error addition results are respectively input into the third convolution layer of the first branch, the fifth convolution layer of the first branch and the second convolution layer of the second branch;

the input channel attention module is used for carrying out residual connection on the output characteristic diagram of the first convolution layer of the second branch, the output characteristic diagram of the second convolution layer of the second branch and the output characteristic diagram of the first convolution layer of the third branch;

(4) Training: inputting the training set into a multi-scale-channel attention network model for training, and recording the accuracy and the loss function of the training set in each training period while training;

(5) And (3) testing: inputting the test set into a trained multi-scale-channel attention network model, and outputting a cutter abrasion stage corresponding to the test set data;

the mathematical expressions of the three branches including the first branch, the second branch and the third branch are respectively shown as the formula (I), the formula (II) and the formula (III):

in the formula (I), the formula (II) and the formula (III), i ₁ ＝1,2,…5，i ₂ ＝1,2，i ₃ ＝1；

Refers to the ith branch of the first branch ₁ Output feature map of the convolutional layers, +.>

Refers to the ith branch of the first branch ₁ The number of the convolution layers is one,

refers to the ith branch of the first branch ₁ -an output profile of 1 convolutional layer;

refers to the ith of the second branch ₂ Output feature map of the convolutional layers, +.>

Refers to the ith of the second branch ₂ The number of the convolution layers is one,

refers to the ith of the second branch ₂ -an output profile of 1 convolutional layer;

refers to the ith branch of the third branch ₃ Output feature map of the convolutional layers, +.>

Refers to the ith branch of the third branch ₃ Convolutional layers->

Refers to the ith branch of the third branch ₃ -an output profile of 1 convolutional layer;

the channel attention module acquires global receptive fields by adopting global average pooling operation and global maximum pooling operation respectively, inputs the global receptive fields into a two-layer neural network respectively, wherein the number of neurons of the first layer is 1, the activation function is ReLU, the number of neurons of the second layer is 3, the two-layer neural networks are shared, then the two-layer neural networks are added to generate channel weights, and finally the channel weights are normalized to be between (0 and 1) through a normalization layer;

and (3) inputting a channel attention module before residual connection to obtain channel weights by pooling operation, multiplying the channel weights by the original output characteristic diagram, mapping the channel weights onto the output characteristic diagram of the shallow branch convolution layer, and carrying out residual connection on the characteristic diagram with the channel weights and the deep branch convolution layer comprising the second convolution layer of the first branch, the fourth convolution layer of the first branch and the output characteristic diagram of the first convolution layer of the second branch.

2. The multi-scale channel attention network based tool fault diagnosis method according to claim 1, wherein the convolution kernel of each convolution layer is 3 x 3, and a ReLU nonlinear activation function is added after each convolution layer.

3. The multi-scale-channel attention network based tool fault diagnosis method according to claim 1, wherein the output feature map scale of the 5 convolution layers of the first branch is 64×64×3, 32×32×3, 16×16×3,8×8×3,4×4×3, respectively; the output feature map scales of the 2 convolution layers of the second branch are respectively 32×32×3,8×8×3; the output feature map scale of 1 convolution layer of the third branch is 32×32×3.

4. The method for diagnosing a tool failure based on a multi-scale and channel attention network according to claim 1, wherein in the step (1), vibration signals of three axes of a main shaft X, Y, Z of a machine tool are collected by using a vibration sensor with a model KS903, and the sampling frequency is 10240Hz; and acquiring the cutter abrasion value by using a 19JC digital universal tool microscope.

5. The multi-scale channel attention network based tool fault diagnosis method according to claim 1, wherein 80% of the data after the data preprocessing is used as a training set, 20% is used as a test set, 80% of the training set is used as training, and 20% of the training set is used as a verification set.

6. The multi-scale-channel attention network based tool fault diagnosis method according to claim 1, wherein in the step (2), tool wear stage classification is performed according to tool wear values, and vibration signals are classified by using classification results as labels; the method specifically comprises the following steps:

drawing a graph of the maximum abrasion value of the tool surface of the auxiliary tool in the whole life cycle and a time domain graph of a vibration signal in the whole life cycle of the X axis of a main shaft of a machine tool; wherein, the abscissa of the graph of the maximum abrasion value is the number of times of feeding, and the ordinate is the maximum abrasion value; the abscissa of the time domain graph of the vibration signal is time, and the ordinate is vibration signal amplitude;

vibration signals are classified into three types according to the slope change of the maximum abrasion value curve, when the slope of the maximum abrasion value curve is not more than 0.01, the corresponding vibration signals are in a rapid initial abrasion stage, when the slope of the maximum abrasion value curve is between 0.01 and 0.3, the corresponding vibration signals are in a steady abrasion stage, and when the slope of the maximum abrasion value curve is not less than 0.3, the corresponding vibration signals are in a rapid abrasion stage.

7. The multi-scale-channel attention network based tool fault diagnosis method according to any one of claims 1 to 6, wherein in step (4), a cross entropy function L is used as a loss function, as shown in formula (iv):

in the formula (IV), y _i The true value is represented by a value that is true,

representing the predicted value, N is the number of samples.

8. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor, when executing the computer program, implements the steps of the multi-scale-channel attention network based tool fault diagnosis method of any of claims 1-7.

9. A computer readable storage medium having stored thereon a computer program, characterized in that the computer program when executed by a processor implements the steps of the multi-scale-channel attention network based tool fault diagnosis method of any of claims 1-7.