[go: up one dir, main page]

CN110084220A - A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode - Google Patents

A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode Download PDF

Info

Publication number
CN110084220A
CN110084220A CN201910378462.XA CN201910378462A CN110084220A CN 110084220 A CN110084220 A CN 110084220A CN 201910378462 A CN201910378462 A CN 201910378462A CN 110084220 A CN110084220 A CN 110084220A
Authority
CN
China
Prior art keywords
image
scale
histogram
fourier
vehicle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910378462.XA
Other languages
Chinese (zh)
Inventor
方昳凡
许清
陆相羽
黄子恒
易和阳
滕飞宇
杨森元
戈洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University of Post and Telecommunications
Original Assignee
Chongqing University of Post and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University of Post and Telecommunications filed Critical Chongqing University of Post and Telecommunications
Priority to CN201910378462.XA priority Critical patent/CN110084220A/en
Publication of CN110084220A publication Critical patent/CN110084220A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/10Image enhancement or restoration using non-spatial domain filtering
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/50Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/59Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
    • G06V20/597Recognising the driver's state or behaviour, e.g. attention or drowsiness
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20048Transform domain processing
    • G06T2207/20056Discrete and fast Fourier transform, [DFT, FFT]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/467Encoded features or binary features, e.g. local binary patterns [LBP]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The present invention relates to a kind of vehicle-mounted fatigue detection methods based on multiple dimensioned binary mode, what is solved is the low technical problem of detection accuracy, several unduplicated subregions are divided by using by training sample image, feature extraction is carried out using multiple dimensioned local binary pattern, obtains multiple dimensioned partial binary characteristics of image;Discrete fourier variation is carried out to multiple dimensioned partial binary characteristics of image, obtains histogram Fourier's feature vector of multiple dimensioned binary mode;Step 4, connect and compose histogram Fourier's feature vector of multiple dimensioned binary mode, for characterizing characteristics of image, select kernel function, classification based training is carried out with MLBP feature of the Nonlinear Support Vector Machines to sample image, and the technical solution of the svm classifier model after being trained and parameter, the problem is preferably resolved, can be used in vehicle-mounted fatigue detection.

Description

一种基于多尺度二进制模式的车载疲劳检测方法A Vehicle Fatigue Detection Method Based on Multi-scale Binary Patterns

技术领域technical field

本发明涉及人工智能技术领域,具体涉及一种基于多尺度二进制模式的车载疲劳检测方法。The invention relates to the technical field of artificial intelligence, in particular to a vehicle fatigue detection method based on a multi-scale binary pattern.

背景技术Background technique

随着社会的发展和经济的进步,汽车已经成为人们拓展生活空间,提高生活效率,提升生活品质的必备交通工具。汽车数量的与日剧增表现为社会繁荣的同时也带来了诸多社会问题。其中首要问题是道路安全问题,道路交通事故业已成为造成人类非正常死亡首要因素。有统计表明,在所有道路交通事故中,人为因素占80%,而疲劳驾驶又是最普遍的人为因素。因此,进行疲劳驾驶识别和预警对于避免恶性交通事故发生,保障人们生命和财产安全将起到至关重要的作用。With the development of society and the progress of economy, cars have become a necessary means of transportation for people to expand their living space, improve their living efficiency and improve their quality of life. The rapid increase in the number of automobiles has brought many social problems while the society is prosperous. The primary issue is road safety, and road traffic accidents have become the primary factor causing abnormal deaths of human beings. Statistics show that in all road traffic accidents, human factors account for 80%, and fatigue driving is the most common human factor. Therefore, the recognition and early warning of fatigue driving will play a vital role in avoiding vicious traffic accidents and ensuring the safety of people's lives and property.

现有技术中,LBP方法己经在很多方法中得到应用,但人脸识别受姿态和光照变化影响,光照变化、角度变化会对特征提取形成阻碍,影响检测率,导致疲劳检测不够准确。LBP算子在它的应用中有以下局限性。第一,单一尺度LBP算子在计算捕获图像结构的特征时不能够检测主要的纹理特征,并且它们对于图像变换和旋转鲁棒性不强。第二,在单信道,比如灰度图像空间,特征提取仅仅捕获单色强度信息,这限制了识别性能。In the prior art, the LBP method has been applied in many methods. However, face recognition is affected by posture and illumination changes. Illumination changes and angle changes will hinder feature extraction, affect the detection rate, and cause fatigue detection to be inaccurate. The LBP operator has the following limitations in its application. First, single-scale LBP operators cannot detect dominant texture features when computing features that capture image structures, and they are not robust to image transformations and rotations. Second, in single-channel, such as grayscale image spaces, feature extraction only captures monochromatic intensity information, which limits recognition performance.

本发明庭一种针对提高疲劳检测准确性的算法,随着从物理和生物视觉引出的计算机视觉、图像分析、信号处理发展功能的融合,能够解决上述技术问题。The present invention is an algorithm aimed at improving the accuracy of fatigue detection. With the fusion of computer vision, image analysis, and signal processing development functions derived from physical and biological vision, it can solve the above technical problems.

发明内容SUMMARY OF THE INVENTION

本发明所要解决的技术问题是现有技术中存在的检测精度差的技术问题。提供一种新的基于多尺度二进制模式的车载疲劳检测方法,该基于多尺度二进制模式的车载疲劳检测方法具有检测精度高的特点。The technical problem to be solved by the invention is the technical problem of poor detection accuracy existing in the prior art. A new vehicle fatigue detection method based on multi-scale binary patterns is provided, and the vehicle fatigue detection method based on multi-scale binary patterns has the characteristics of high detection accuracy.

为解决上述技术问题,采用的技术方案如下:In order to solve the above technical problems, the technical scheme adopted is as follows:

一种基于多尺度二进制模式的车载疲劳检测方法,所述基于多尺度二进制模式的车载疲劳检测方法包括:A vehicle fatigue detection method based on a multi-scale binary pattern, the vehicle fatigue detection method based on a multi-scale binary pattern comprises:

步骤一,对驾驶员图像人工分类标记疲劳样本和不疲劳样本,分别选择训练样本和测试样本,对训练样本和测试样本进行预处理;Step 1: Manually classify and mark fatigue samples and non-fatigue samples on the driver image, select training samples and test samples respectively, and preprocess the training samples and test samples;

步骤二,将训练样本图像划分成若干不重复的子区域,使用多尺度局部二进制模式进行特征提取,获得多尺度局部二进制图像特征;Step 2, divide the training sample image into several non-repetitive sub-regions, use multi-scale local binary patterns for feature extraction, and obtain multi-scale local binary image features;

步骤三,对多尺度局部二进制图像特征进行离散傅里叶变化,得到多尺度二进制模式的直方图傅里叶特征向量;Step 3, performing discrete Fourier transformation on the multi-scale local binary image features to obtain the histogram Fourier feature vector of the multi-scale binary pattern;

步骤四,连接构成多尺度二进制模式的直方图傅里叶特征向量,用于表征图像特征,选择核函数,用非线性支持向量机对样本图像的MLBP特征进行分类训练,并得到训练后的SVM分类模型和参数;Step 4, connect the histogram Fourier feature vectors that constitute the multi-scale binary pattern to characterize the image features, select the kernel function, use the nonlinear support vector machine to classify and train the MLBP features of the sample image, and obtain the trained SVM Classification models and parameters;

步骤四,对测试样本重复步骤二至步骤三,利用训练得到的SVM分类模型和参数对测试样本的多尺度二进制模式的直方图傅里叶特征进行检测。Step 4: repeat steps 2 to 3 for the test sample, and use the trained SVM classification model and parameters to detect the histogram Fourier feature of the multi-scale binary pattern of the test sample.

本发明的工作原理:发明给出了一种基于多尺度二进制模式的直方图傅里叶特征(MLBP-HF)和支持向量机SVM的车载疲劳检测方法。主要是以统计学习理论为思想基础的、按监督学习方式进行数据分类的支持向量机。Working principle of the present invention: the invention provides a vehicle fatigue detection method based on multi-scale binary pattern histogram Fourier feature (MLBP-HF) and support vector machine SVM. It is mainly based on the statistical learning theory and supports vector machines for data classification according to the supervised learning method.

首先基于多尺度局部二进制模式的特征提取:First, feature extraction based on multi-scale local binary patterns:

将图像划分为多个同一大小但并不重叠的子区域,M0,M1,…Mj-1,每一块子区域的不同半径的LBP直方图,用以下公式计算:Divide the image into multiple sub-regions of the same size but not overlapping, M 0 , M 1 ,...M j-1 , the LBP histogram of each sub-region with different radii, calculated with the following formula:

B(v)为Bool指针,是对于每一个不同尺度的区域(Mj)计算直方图提供区域性信息,L为直方图bins的数量。B(v) is a Bool pointer, which is to calculate the histogram for each area of different scales (M j ) to provide regional information, and L is the number of histogram bins.

通过以下公式连接上述各个子区域的直方图得到一个单向量,是最终的多尺度特征描述算子:Connect the histograms of the above sub-regions by the following formula to obtain a single vector, which is the final multi-scale feature description operator:

fj=[hP,1,j,hP,2,j,...,hP,R,j]。f j =[h P,1,j ,h P,2,j ,...,h P,R,j ].

特征提取方法可以有效地表征疲劳,并取得了较好的检测效果,检测速度快,受光照变化影响较小,但是同时发现对于驾驶员人脸存在倾斜或偏转时检测率很低,漏检率和误检率较高。在对驾驶员人脸图像提取出MLBP特征后,使用傅里叶变换来对MLBP特征进行处理,进行离散傅里叶变换对于具有一定偏转角度的图像能够从特征上进行矫正,进而减小对疲劳检测的影响,提高检测率。The feature extraction method can effectively represent fatigue and achieve better detection results. The detection speed is fast and is less affected by illumination changes. However, it is found that the detection rate is very low when the driver's face is tilted or deflected, and the missed detection rate and high false positive rate. After extracting the MLBP features from the driver's face image, use Fourier transform to process the MLBP features, and perform discrete Fourier transform to correct the image with a certain deflection angle from the feature, thereby reducing the fatigue Detection of the impact, improve the detection rate.

上述方案中,为优化,进一步地,所述预处理包括将RGB图像转换为灰度图像及图像标准化处理。In the above solution, for optimization, further, the preprocessing includes converting the RGB image into a grayscale image and image standardization.

进一步地,所述基于多尺度局部二进制模式进行特征提取包括:Further, the feature extraction based on multi-scale local binary patterns includes:

步骤1,用LBP8,1,LBP8,2,LBP8,3三种LBP算子对原图进行计算,得到在不同尺度下的图像,分别为I1,I2,I3;Step 1, use LBP 8,1 , LBP 8,2 , LBP 8,3 three LBP operators to calculate the original image, and obtain images at different scales, respectively I1, I2, I3;

步骤2,对I1,I2,I3进行分块,分成的子块分别为:Step 2, divide I1, I2, and I3 into blocks, and the sub-blocks are as follows:

I1(1,n),I2(1,n),I3(1,n);I1(1,n),I2(1,n),I3(1,n);

步骤3,对不同尺度下对应的子区域进行拼接;Step 3, splicing the corresponding sub-regions at different scales;

步骤4,对步骤3中的每一个拼接结果计算LBP直方图;Step 4, calculating the LBP histogram for each splicing result in step 3;

其中,n为正整数。Wherein, n is a positive integer.

进一步地,步骤三包括:Further, step three includes:

步骤3.1,用离散傅里叶变换对多尺度局部二进制图像特征进行变换,设HP,r(n,·)为直方图hP,r(U(n,c))的第n行的DFT,即:Step 3.1, use discrete Fourier transform to transform multi-scale local binary image features, let H P,r (n, ) be the DFT of the nth row of the histogram h P,r (U(n,c)) ,which is:

HP,r(n,u)=∑hP,r(U(n,c))e-i2πur/P H P,r (n,u)=∑h P,r (U(n,c))e -i2πur/P

步骤3.2,对于离散傅里叶变换输入适量的循环位移带来的DFT系数中的相移,判定出h'P,r(U(n,c))=hP,r(U(n,c-a)),Step 3.2, for the phase shift in the DFT coefficients brought by the discrete Fourier transform input with an appropriate amount of cyclic shift, it is determined that h'P ,r (U(n,c))=hP ,r (U(n,ca )),

则定义H'P,r(n,c)=HP,r(n,u)e-i2πur/PThen define H' P,r (n,c)=H P,r (n,u)e -i2πur/P ;

定义对于任意的1≤n1,n2≤P-1,有:Definition For any 1≤n 1 , n 2 ≤P-1, there are:

其中,为HP,r(n2,u)的复共轭;in, is the complex conjugate of H P,r (n 2 ,u);

对于任意1≤n1,n2≤P-1和多尺度二进制模式的直方图傅里叶特征,计算出:For any 1≤n 1 , n 2 ≤P-1 and histogram Fourier features of multiscale binary patterns, compute:

定义傅里叶幅度谱为 Define the Fourier magnitude spectrum as

定义 definition

则LBPr/u2作为一个子集包含于傅里叶幅度谱。Then LBP r/u2 is included as a subset in the Fourier magnitude spectrum.

用U(n,c)表示图像的规范LBP模式,圆形领域为(8,R)的58种不同的规范模式中,不同的行表示中心像素经过规范LBP算子计算得到的位序列中1的数量,不同的列表示图像的旋转角度。Use U(n,c) to represent the canonical LBP mode of the image. Among the 58 different canonical modes with the circular area (8,R), different rows represent the 1 in the bit sequence calculated by the canonical LBP operator for the central pixel. The number of different columns represent the rotation angle of the image.

用n用表示模式屮位的数量,也即行数,c为旋转角度也即列数。如果样本点数量为P,则0≤n≤P+1,其中n=P+1表示所有非规范模式,并且,当1≤n≤P-1时,旋转度0≤c≤P-1。加入用f'(x,y)表示f(x,y)偏转后的图像,偏转角度为α。偏转后点(x,y)移至(x',y'),可以知道两点也偏转α度。Use n to represent the number of pattern bits, that is, the number of rows, and c is the rotation angle, that is, the number of columns. If the number of sample points is P, then 0≤n≤P+1, where n=P+1 represents all non-canonical modes, and when 1≤n≤P-1, the degree of rotation is 0≤c≤P-1. Add f'(x,y) to represent the deflected image of f(x,y), and the deflection angle is α. After the deflection, the point (x, y) moves to (x', y'), and it can be known that the two points are also deflected by α degrees.

根据多尺度局部二进算子的特点,偏转角度可以做到两个样本点角度的整数倍,即:According to the characteristics of multi-scale local binary operators, the deflection angle can be an integer multiple of the angle of two sample points, namely:

则旋转后图像的规范模式直方图,对应于原图像在每一行的直方图的循环移位。Then the canonical mode histogram of the rotated image corresponds to the cyclic shift of the histogram of the original image in each row.

h'P,r(U(n,c+a))=hP,r(U(n,c))。h'P ,r (U(n,c+a))=hP ,r (U(n,c)).

为了避免循环移位给旋转后的图像直方图的影响,对图像使用傅里叶变换。To avoid the effect of cyclic shift on the rotated image histogram, a Fourier transform is applied to the image.

用离散傅里叶变换DFT来对之前得到的MLBP特征进行变换:Use the discrete Fourier transform DFT to transform the previously obtained MLBP features:

设HP,r(n,·)为直方图hP,r(U(n,c))的第n行的DFT,即:Let H P,r (n, ) be the DFT of the nth row of the histogram h P,r (U(n,c)), namely:

HP,r(n,u)=∑hP,r(U(n,c))e-i2πur/PH P,r (n,u)=∑h P,r (U(n,c))e -i2πur/P ;

对于DFT输入适量的循环位移带来的DFT系数中的相移,如果:The phase shift in the DFT coefficients brought about by an appropriate amount of cyclic shift for the DFT input if:

h'P,r(U(n,c))=hP,r(U(n,c-a)),那么有H'P,r(n,c)=HP,r(n,u)e-i2πur/Ph' P,r (U(n,c))=h P,r (U(n,ca)), then H' P,r (n,c)=H P,r (n,u)e -i2πur/P ,

并且对于任意的1≤n1,n2≤P-1有:And for any 1≤n 1 , n 2 ≤P-1 have:

其中为HP,r(n2,u)的复共轭。in is the complex conjugate of H P,r (n 2 ,u).

显示对于任意1≤n1,n2≤P-1和MLBP-HF,特征:Show that for any 1≤n 1 , n 2 ≤P-1 and MLBP-HF, the features:

对于hP,r(U(n,c))的行的循环转换是不变的,并且对于输入图像f(x,y)的旋转也是不变的。The cyclic transformation of the rows of h P,r (U(n,c)) is invariant, and it is also invariant to the rotation of the input image f(x,y).

傅里叶幅度谱可以视为这些特征的一种特殊情况。Fourier magnitude spectrum can be considered a special case of these characteristics.

最终,既然那么LBPr/u2作为一个子集包含于傅里叶幅度谱中。Ultimately, since Then LBP r/u2 is included as a subset in the Fourier magnitude spectrum.

从而,在对驾驶员人脸图像提取出MLBP特征后,即可使用傅里叶变换来对特征进行处理,它的好处在于对于具有一定偏转角度的图像能够从特征上进行矫正,进而减小对疲劳检测的影响,提高检测率。Therefore, after extracting MLBP features from the driver's face image, Fourier transform can be used to process the features. Fatigue detection effect, improve detection rate.

本发明的有益效果:本发明针提出了一种基于多尺度二进制模式直方图傅里叶特征(MLBP-HF)和SVM的疲劳检测方法。方法中提取的MLBP-HF特征不仅能克服光照变化的影响,又能最大限度保留图像细节特征。而且,对MLBP特征进行傅里叶变化后能克服姿态变化的影响,解决了对于倾斜人脸检测率较低的问题。本发明有效的提高了人脸识别方面的识别率,使得识别受姿态和光照变化影响较小,疲劳检测率在一定程度上得到了提升。Beneficial effects of the present invention: the present invention proposes a fatigue detection method based on multi-scale binary pattern histogram Fourier feature (MLBP-HF) and SVM. The MLBP-HF features extracted in the method can not only overcome the influence of illumination changes, but also preserve the image details to the greatest extent. Moreover, the Fourier transform of MLBP features can overcome the influence of attitude changes and solve the problem of low detection rate for tilted faces. The invention effectively improves the recognition rate of face recognition, makes the recognition less affected by posture and illumination changes, and improves the fatigue detection rate to a certain extent.

附图说明Description of drawings

下面结合附图和实施例对本发明进一步说明。The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

图1,实施例1中的车载疲劳检测方法流程图。FIG. 1 is a flow chart of the vehicle fatigue detection method in Embodiment 1.

具体实施方式Detailed ways

为了使本发明的目的、技术方案及优点更加清楚明白,以下结合实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

实施例1Example 1

本实施例提供一种基于多尺度二进制模式的车载疲劳检测方法,如图1,所述基于多尺度二进制模式的车载疲劳检测方法包括:This embodiment provides a vehicle fatigue detection method based on multi-scale binary patterns, as shown in Figure 1, the vehicle fatigue detection method based on multi-scale binary patterns includes:

步骤一,对驾驶员图像人工分类标记疲劳样本和不疲劳样本,分别选择训练样本和测试样本,对训练样本和测试样本进行预处理;Step 1: Manually classify and mark fatigue samples and non-fatigue samples on the driver image, select training samples and test samples respectively, and preprocess the training samples and test samples;

步骤二,将训练样本图像划分成若干不重复的子区域,使用多尺度局部二进制模式进行特征提取,获得多尺度局部二进制图像特征;Step 2, divide the training sample image into several non-repetitive sub-regions, use multi-scale local binary patterns for feature extraction, and obtain multi-scale local binary image features;

步骤三,对多尺度局部二进制图像特征进行离散傅里叶变化,得到多尺度二进制模式的直方图傅里叶特征向量;Step 3, performing discrete Fourier transformation on the multi-scale local binary image features to obtain the histogram Fourier feature vector of the multi-scale binary pattern;

步骤四,连接构成多尺度二进制模式的直方图傅里叶特征向量,用于表征图像特征,选择核函数,用非线性支持向量机对样本图像的MLBP特征进行分类训练,并得到训练后的SVM分类模型和参数;Step 4, connect the histogram Fourier feature vectors that constitute the multi-scale binary pattern to characterize the image features, select the kernel function, use the nonlinear support vector machine to classify and train the MLBP features of the sample image, and obtain the trained SVM Classification models and parameters;

步骤四,对测试样本重复步骤二至步骤三,利用训练得到的SVM分类模型和参数对测试样本的多尺度二进制模式的直方图傅里叶特征进行检测。Step 4: repeat steps 2 to 3 for the test sample, and use the trained SVM classification model and parameters to detect the histogram Fourier feature of the multi-scale binary pattern of the test sample.

具体地,所述预处理包括将RGB图像转换为灰度图像及图像标准化处理。Specifically, the preprocessing includes converting the RGB image into a grayscale image and image standardization.

详细地,所述基于多尺度局部二进制模式进行特征提取包括:In detail, the feature extraction based on multi-scale local binary patterns includes:

步骤1,用LBP8,1,LBP8,2,LBP8,3三种LBP算子对原图进行计算,得到在不同尺度下的图像,分别为I1,I2,I3;Step 1, use LBP 8,1 , LBP 8,2 , LBP 8,3 three LBP operators to calculate the original image, and obtain images at different scales, respectively I1, I2, I3;

步骤2,对I1,I2,I3进行分块,分成的子块分别为:Step 2, divide I1, I2, and I3 into blocks, and the sub-blocks are as follows:

I1(1,n),I2(1,n),I3(1,n);I1(1,n),I2(1,n),I3(1,n);

步骤3,对不同尺度下对应的子区域进行拼接;Step 3, splicing the corresponding sub-regions at different scales;

步骤4,对步骤3中的每一个拼接结果计算LBP直方图。Step 4, calculate the LBP histogram for each splicing result in step 3.

详细地,步骤三包括:In detail, step three includes:

步骤3.1,用离散傅里叶变换对多尺度局部二进制图像特征进行变换,设HP,r(n,·)为直方图hP,r(U(n,c))的第n行的DFT,即:Step 3.1, use discrete Fourier transform to transform multi-scale local binary image features, let H P,r (n, ) be the DFT of the nth row of the histogram h P,r (U(n,c)) ,which is:

HP,r(n,u)=∑hP,r(U(n,c))e-i2πur/P H P,r (n,u)=∑h P,r (U(n,c))e -i2πur/P

步骤3.2,对于离散傅里叶变换输入适量的循环位移带来的DFT系数中的相移,判定出h'P,r(U(n,c))=hP,r(U(n,c-a)),Step 3.2, for the phase shift in the DFT coefficients brought by the discrete Fourier transform input with an appropriate amount of cyclic shift, it is determined that h'P ,r (U(n,c))=hP ,r (U(n,ca )),

则定义H'P,r(n,c)=HP,r(n,u)e-i2πur/PThen define H' P,r (n,c)=H P,r (n,u)e -i2πur/P ;

定义对于任意的1≤n1,n2≤P-1,有:Definition For any 1≤n 1 , n 2 ≤P-1, there are:

其中,为HP,r(n2,u)的复共轭;in, is the complex conjugate of H P,r (n 2 ,u);

对于任意1≤n1,n2≤P-1和多尺度二进制模式的直方图傅里叶特征,计算出:For any 1≤n 1 , n 2 ≤P-1 and histogram Fourier features of multiscale binary patterns, compute:

定义傅里叶幅度谱为 Define the Fourier magnitude spectrum as

定义 definition

则LBPr/u2作为一个子集包含于傅里叶幅度谱。Then LBP r/u2 is included as a subset in the Fourier magnitude spectrum.

如图1所示,基于MLBP特征提取,包含以下2个步骤:As shown in Figure 1, feature extraction based on MLBP includes the following two steps:

①将图像划分为多个同一大小但并不重叠的子区域,M0,M1,…Mj-1,每一块子区域的不同半径的LBP直方图用公式计算;① Divide the image into multiple sub-regions of the same size but not overlapping, M 0 , M 1 , ... M j-1 , the LBP histogram of each sub-region with different radii is calculated by the formula;

②通过公式连接上述各个子区域的直方图得到一个单向量,这就是最终的多尺度特征描述算子。② Connect the histograms of the above sub-regions through the formula to obtain a single vector, which is the final multi-scale feature description operator.

如图1所示,图像的MLBP特征获取,过程包含以下4个步骤:As shown in Figure 1, the MLBP feature acquisition process of the image includes the following four steps:

①用三种不同LBP算子对原图进行计算,得到在不同尺度下的图像;① Use three different LBP operators to calculate the original image to obtain images at different scales;

②对不同尺度下的图像进行分块;②Block images at different scales;

③对不同尺度下对应的子区域进行拼接;③ Splicing the corresponding sub-regions at different scales;

④对每一个拼接结果计算LBP直方图。④ Calculate the LBP histogram for each splicing result.

如图1所示,标准的SVM分类,分为两个阶段:As shown in Figure 1, the standard SVM classification is divided into two stages:

第一,训练阶段:First, the training phase:

①建立训练样本集;① Establish a training sample set;

②选择核函数及参数,将低维空间向高维空间变换;② Select the kernel function and parameters to transform the low-dimensional space to the high-dimensional space;

③输入样本正规化;③ Normalization of input samples;

④构造核矩阵;④ Construct a kernel matrix;

⑤求解拉格朗日系数;⑤ Solve the Lagrangian coefficient;

⑥计算出支持向量,求解分类超平面系数;⑥Calculate support vectors and solve classification hyperplane coefficients;

⑦根据求得的各类系数,建立训练数据的最优分类超平面,完成训练过程。⑦ According to the obtained various coefficients, the optimal classification hyperplane of the training data is established to complete the training process.

第二,检测阶段:Second, the detection phase:

①读入学习阶段训练得到的模板数据文件;① Read in the template data file trained in the learning phase;

②根据公式计算新输入特征数据的输出值;② Calculate the output value of the new input characteristic data according to the formula;

③利用分类函数计算,给出分类结果。③ Use the classification function to calculate and give the classification result.

本实施例提出了一种基于MLBP特征和SVM的车载疲劳检测方法,与现有技术相比本发明改善了具有一定倾斜免度的人脸图像会影响检测率的问题,对原基于MLBP算子的特征提取方法进行优。This embodiment proposes a vehicle-mounted fatigue detection method based on MLBP features and SVM. Compared with the prior art, the present invention improves the problem that a face image with a certain degree of inclination will affect the detection rate. The feature extraction method is optimized.

本发明分为训练样本和测试样本,实验分为训练和检测两个过程,在训练阶段,首先使用基于MLBP算子的方法对人脸训练样本图像进行特征提取,然后将特征向量输入SVM到中进行分类训练,并选取一种核函数获取SVM分类模型;在检测阶段,首先对测试样本用同样的方法进行特征提取,然后再用SVM分类模型进行疲劳检测。在提取特征时采用了图像分割技术,通过对不同的分割类型进行比较实验,并进行实验结果分析之后,针对该方法对于倾斜人脸检测率较低的问题。The present invention is divided into training samples and test samples, and the experiment is divided into two processes of training and detection. In the training stage, the method based on the MLBP operator is first used to extract the features of the face training sample images, and then the feature vector is input into the SVM. Carry out classification training, and select a kernel function to obtain the SVM classification model; in the detection stage, first use the same method to extract features from the test samples, and then use the SVM classification model for fatigue detection. Image segmentation technology is used in extracting features. After comparing experiments with different segmentation types and analyzing the experimental results, the method is aimed at the problem of low detection rate of tilted faces.

本实施例提出了一种基于多尺度二进制模式直方图傅里叶特征MLBP-HF和支持向量机SVM的疲劳检测方法。该方法中提取的MLBP-HF特征不仅能克服光照变化的影响,又能最大限度保留图像细节特征,而且,对MLBP特征进行傅里叶变化后能克服姿态变化的影响。本实施例在人脸识别方面有较好的识别率,受姿态和光照变化影响较小,并且使得疲劳检测率在一定程度上得到了提升。因此这种基于MLBP特征和SVM的车载疲劳检测方法对于车载疲劳检测领域具有重要的理论意义和应用价值。This embodiment proposes a fatigue detection method based on multi-scale binary pattern histogram Fourier feature MLBP-HF and support vector machine SVM. The MLBP-HF feature extracted in this method can not only overcome the influence of illumination changes, but also preserve the image detail features to the maximum extent, and can overcome the influence of attitude changes after performing Fourier transformation on MLBP features. This embodiment has a better recognition rate in face recognition, is less affected by posture and illumination changes, and improves the fatigue detection rate to a certain extent. Therefore, this vehicle fatigue detection method based on MLBP features and SVM has important theoretical significance and application value in the field of vehicle fatigue detection.

尽管上面对本发明说明性的具体实施方式进行了描述,以便于本技术领域的技术人员能够理解本发明,但是本发明不仅限于具体实施方式的范围,对本技术领域的普通技术人员而言,只要各种变化只要在所附的权利要求限定和确定的本发明精神和范围内,一切利用本发明构思的发明创造均在保护之列。Although the illustrative specific embodiments of the present invention have been described above, so that those skilled in the art can understand the present invention, the present invention is not limited to the scope of the specific embodiments. For those of ordinary skill in the art, as long as each As long as the changes are within the spirit and scope of the present invention defined and determined by the appended claims, all inventions and creations utilizing the concept of the present invention are included in the protection list.

Claims (4)

1. A vehicle-mounted fatigue detection method based on a multi-scale binary mode is characterized by comprising the following steps: the vehicle-mounted fatigue detection method based on the multi-scale binary mode comprises the following steps:
step one, marking a fatigue sample and a non-fatigue sample for manual classification of a driver image, respectively selecting a training sample and a testing sample, and preprocessing the training sample and the testing sample;
dividing the training sample image into a plurality of non-repetitive sub-regions, and performing feature extraction by using a multi-scale local binary mode to obtain multi-scale local binary image features;
performing discrete Fourier change on the multi-scale local binary image characteristics to obtain a histogram Fourier characteristic vector of the multi-scale binary mode;
connecting histogram Fourier feature vectors forming a multi-scale binary mode, representing image features, selecting a kernel function, carrying out classification training on MLBP (maximum likelihood back propagation) features of the sample image by using a nonlinear support vector machine, and obtaining a trained SVM classification model and parameters;
and step four, repeating the step two to the step three on the test sample, and detecting the histogram Fourier characteristics of the multi-scale binary mode of the test sample by utilizing the SVM classification model and the parameters obtained by training.
2. The vehicle-mounted fatigue detection method based on the multi-scale binary pattern according to claim 1, characterized in that: the preprocessing comprises converting the RGB image into a gray scale image and image standardization processing.
3. The vehicle-mounted fatigue detection method based on the multi-scale binary pattern according to claim 1, characterized in that: the feature extraction based on the multi-scale local binary pattern comprises the following steps:
step 1, with LBP8,1,LBP8,2,LBP8,3Calculating the original image by using the three LBP operators to obtain images under different scales, namely I1, I2 and I3;
step 2, partitioning I1, I2 and I3 into sub-blocks which are respectively:
I1(1,n),I2(1,n),I3(1,n);
step 3, splicing the corresponding sub-regions under different scales;
step 4, calculating an LBP histogram for each splicing result in the step 3;
wherein n is a positive integer.
4. The vehicle-mounted fatigue detection method based on the multi-scale binary pattern according to claim 1, characterized in that: the third step comprises:
step 3.1, transforming the multi-scale local binary image characteristics by using discrete Fourier transform, and setting HP,r(n,. cndot.) is a histogram hP,rDFT of the nth row of (U (n, c)), namely:
HP,r(n,u)=∑hP,r(U(n,c))e-i2πur/P
step 3.2, h 'is determined for the phase shift in DFT coefficient caused by inputting proper amount of cyclic shift in discrete Fourier transform'P,r(U(n,c))=hP,r(U(n,c-a)),
Then define H'P,r(n,c)=HP,r(n,u)e-i2πur/P
Definition for arbitrary 1 ≦ n1,n2P-1 is less than or equal to, and the following components are:
wherein,is HP,r(n2Complex conjugation of u);
for any 1 ≦ n1,n2And (3) calculating the histogram Fourier characteristics of the multi-scale binary mode and the P-1 or less, and calculating:
defining a Fourier magnitude spectrum as
Definition of
Then LBPr/u2As a subset, to the fourier magnitude spectrum.
CN201910378462.XA 2019-05-08 2019-05-08 A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode Pending CN110084220A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910378462.XA CN110084220A (en) 2019-05-08 2019-05-08 A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910378462.XA CN110084220A (en) 2019-05-08 2019-05-08 A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode

Publications (1)

Publication Number Publication Date
CN110084220A true CN110084220A (en) 2019-08-02

Family

ID=67419208

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910378462.XA Pending CN110084220A (en) 2019-05-08 2019-05-08 A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode

Country Status (1)

Country Link
CN (1) CN110084220A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113205081A (en) * 2021-06-11 2021-08-03 北京惠朗时代科技有限公司 SVM model worker fatigue accurate judgment method based on significance detection
US12443387B2 (en) 2019-08-21 2025-10-14 Micron Technology, Inc. Intelligent audio control in vehicles
US12518570B2 (en) 2019-12-18 2026-01-06 Lodestar Licensing Group Llc Predictive maintenance of automotive transmission
US12525111B2 (en) 2019-08-21 2026-01-13 Lodestar Licensing Group Llc Drowsiness detection for vehicle control

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105117688A (en) * 2015-07-29 2015-12-02 重庆电子工程职业学院 Face identification method based on texture feature fusion and SVM
CN105976809A (en) * 2016-05-25 2016-09-28 中国地质大学(武汉) Voice-and-facial-expression-based identification method and system for dual-modal emotion fusion
CN109190455A (en) * 2018-07-18 2019-01-11 东南大学 Black smoke vehicle recognition methods based on Gaussian Mixture and autoregressive moving-average model

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105117688A (en) * 2015-07-29 2015-12-02 重庆电子工程职业学院 Face identification method based on texture feature fusion and SVM
CN105976809A (en) * 2016-05-25 2016-09-28 中国地质大学(武汉) Voice-and-facial-expression-based identification method and system for dual-modal emotion fusion
CN109190455A (en) * 2018-07-18 2019-01-11 东南大学 Black smoke vehicle recognition methods based on Gaussian Mixture and autoregressive moving-average model

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵李坤: "基于人脸特征识别的驾驶员疲劳检测方法研究", 《中国优秀硕士学位论文全文数据库•工程科技Ⅱ辑》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12443387B2 (en) 2019-08-21 2025-10-14 Micron Technology, Inc. Intelligent audio control in vehicles
US12525111B2 (en) 2019-08-21 2026-01-13 Lodestar Licensing Group Llc Drowsiness detection for vehicle control
US12518570B2 (en) 2019-12-18 2026-01-06 Lodestar Licensing Group Llc Predictive maintenance of automotive transmission
CN113205081A (en) * 2021-06-11 2021-08-03 北京惠朗时代科技有限公司 SVM model worker fatigue accurate judgment method based on significance detection
CN113205081B (en) * 2021-06-11 2024-01-05 北京惠朗时代科技有限公司 SVM model worker fatigue accurate judging method based on significance detection

Similar Documents

Publication Publication Date Title
Wei et al. Multi-target defect identification for railway track line based on image processing and improved YOLOv3 model
Calik et al. Large-scale offline signature recognition via deep neural networks and feature embedding
CN102147858B (en) License plate character identification method
CN104598885B (en) The detection of word label and localization method in street view image
CN102163281B (en) Real-time human body detection method based on AdaBoost frame and colour of head
CN110084220A (en) A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode
CN105488536A (en) Agricultural pest image recognition method based on multi-feature deep learning technology
CN103679191B (en) An automatic fake-licensed vehicle detection method based on static state pictures
CN114220178A (en) Signature identification system and method based on channel attention mechanism
CN102867195B (en) Method for detecting and identifying a plurality of types of objects in remote sensing image
CN104282008B (en) The method and apparatus that Texture Segmentation is carried out to image
CN112200186A (en) Vehicle logo recognition method based on improved YOLO_V3 model
CA2435290A1 (en) Computer-aided image analysis
CN104751475A (en) Feature point optimization matching method for static image object recognition
Wang et al. Fully convolutional network based skeletonization for handwritten chinese characters
CN107886539A (en) High class gear visible detection method under a kind of industrial scene
CN104616016A (en) Global feature and local feature combined texture feature description method
CN100345152C (en) Face recognition method based on template matching
Cüceloğlu et al. Detecting handwritten signatures in scanned documents
CN103902965B (en) Spatial domain symbiosis image representing method and its application in image classification, identification
CN116343232A (en) Ancient book mathematical symbol recognition method based on pre-classification
Yan et al. Chinese text location under complex background using Gabor filter and SVM
CN116740443A (en) A multi-scale feature matching printing defect detection method based on few-shot learning
Girish et al. Tire imprint identification and classification using VGG19
Singh et al. Verifying bank checks using deep learning and image processing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190802

RJ01 Rejection of invention patent application after publication