CN110084220A - A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode - Google Patents
A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode Download PDFInfo
- Publication number
- CN110084220A CN110084220A CN201910378462.XA CN201910378462A CN110084220A CN 110084220 A CN110084220 A CN 110084220A CN 201910378462 A CN201910378462 A CN 201910378462A CN 110084220 A CN110084220 A CN 110084220A
- Authority
- CN
- China
- Prior art keywords
- image
- scale
- histogram
- fourier
- vehicle
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/10—Image enhancement or restoration using non-spatial domain filtering
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/50—Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/59—Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
- G06V20/597—Recognising the driver's state or behaviour, e.g. attention or drowsiness
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20048—Transform domain processing
- G06T2207/20056—Discrete and fast Fourier transform, [DFT, FFT]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/46—Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
- G06V10/467—Encoded features or binary features, e.g. local binary patterns [LBP]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Image Analysis (AREA)
Abstract
Description
技术领域technical field
本发明涉及人工智能技术领域,具体涉及一种基于多尺度二进制模式的车载疲劳检测方法。The invention relates to the technical field of artificial intelligence, in particular to a vehicle fatigue detection method based on a multi-scale binary pattern.
背景技术Background technique
随着社会的发展和经济的进步,汽车已经成为人们拓展生活空间,提高生活效率,提升生活品质的必备交通工具。汽车数量的与日剧增表现为社会繁荣的同时也带来了诸多社会问题。其中首要问题是道路安全问题,道路交通事故业已成为造成人类非正常死亡首要因素。有统计表明,在所有道路交通事故中,人为因素占80%,而疲劳驾驶又是最普遍的人为因素。因此,进行疲劳驾驶识别和预警对于避免恶性交通事故发生,保障人们生命和财产安全将起到至关重要的作用。With the development of society and the progress of economy, cars have become a necessary means of transportation for people to expand their living space, improve their living efficiency and improve their quality of life. The rapid increase in the number of automobiles has brought many social problems while the society is prosperous. The primary issue is road safety, and road traffic accidents have become the primary factor causing abnormal deaths of human beings. Statistics show that in all road traffic accidents, human factors account for 80%, and fatigue driving is the most common human factor. Therefore, the recognition and early warning of fatigue driving will play a vital role in avoiding vicious traffic accidents and ensuring the safety of people's lives and property.
现有技术中,LBP方法己经在很多方法中得到应用,但人脸识别受姿态和光照变化影响,光照变化、角度变化会对特征提取形成阻碍,影响检测率,导致疲劳检测不够准确。LBP算子在它的应用中有以下局限性。第一,单一尺度LBP算子在计算捕获图像结构的特征时不能够检测主要的纹理特征,并且它们对于图像变换和旋转鲁棒性不强。第二,在单信道,比如灰度图像空间,特征提取仅仅捕获单色强度信息,这限制了识别性能。In the prior art, the LBP method has been applied in many methods. However, face recognition is affected by posture and illumination changes. Illumination changes and angle changes will hinder feature extraction, affect the detection rate, and cause fatigue detection to be inaccurate. The LBP operator has the following limitations in its application. First, single-scale LBP operators cannot detect dominant texture features when computing features that capture image structures, and they are not robust to image transformations and rotations. Second, in single-channel, such as grayscale image spaces, feature extraction only captures monochromatic intensity information, which limits recognition performance.
本发明庭一种针对提高疲劳检测准确性的算法,随着从物理和生物视觉引出的计算机视觉、图像分析、信号处理发展功能的融合,能够解决上述技术问题。The present invention is an algorithm aimed at improving the accuracy of fatigue detection. With the fusion of computer vision, image analysis, and signal processing development functions derived from physical and biological vision, it can solve the above technical problems.
发明内容SUMMARY OF THE INVENTION
本发明所要解决的技术问题是现有技术中存在的检测精度差的技术问题。提供一种新的基于多尺度二进制模式的车载疲劳检测方法,该基于多尺度二进制模式的车载疲劳检测方法具有检测精度高的特点。The technical problem to be solved by the invention is the technical problem of poor detection accuracy existing in the prior art. A new vehicle fatigue detection method based on multi-scale binary patterns is provided, and the vehicle fatigue detection method based on multi-scale binary patterns has the characteristics of high detection accuracy.
为解决上述技术问题,采用的技术方案如下:In order to solve the above technical problems, the technical scheme adopted is as follows:
一种基于多尺度二进制模式的车载疲劳检测方法,所述基于多尺度二进制模式的车载疲劳检测方法包括:A vehicle fatigue detection method based on a multi-scale binary pattern, the vehicle fatigue detection method based on a multi-scale binary pattern comprises:
步骤一,对驾驶员图像人工分类标记疲劳样本和不疲劳样本,分别选择训练样本和测试样本,对训练样本和测试样本进行预处理;Step 1: Manually classify and mark fatigue samples and non-fatigue samples on the driver image, select training samples and test samples respectively, and preprocess the training samples and test samples;
步骤二,将训练样本图像划分成若干不重复的子区域,使用多尺度局部二进制模式进行特征提取,获得多尺度局部二进制图像特征;Step 2, divide the training sample image into several non-repetitive sub-regions, use multi-scale local binary patterns for feature extraction, and obtain multi-scale local binary image features;
步骤三,对多尺度局部二进制图像特征进行离散傅里叶变化,得到多尺度二进制模式的直方图傅里叶特征向量;Step 3, performing discrete Fourier transformation on the multi-scale local binary image features to obtain the histogram Fourier feature vector of the multi-scale binary pattern;
步骤四,连接构成多尺度二进制模式的直方图傅里叶特征向量,用于表征图像特征,选择核函数,用非线性支持向量机对样本图像的MLBP特征进行分类训练,并得到训练后的SVM分类模型和参数;Step 4, connect the histogram Fourier feature vectors that constitute the multi-scale binary pattern to characterize the image features, select the kernel function, use the nonlinear support vector machine to classify and train the MLBP features of the sample image, and obtain the trained SVM Classification models and parameters;
步骤四,对测试样本重复步骤二至步骤三,利用训练得到的SVM分类模型和参数对测试样本的多尺度二进制模式的直方图傅里叶特征进行检测。Step 4: repeat steps 2 to 3 for the test sample, and use the trained SVM classification model and parameters to detect the histogram Fourier feature of the multi-scale binary pattern of the test sample.
本发明的工作原理:发明给出了一种基于多尺度二进制模式的直方图傅里叶特征(MLBP-HF)和支持向量机SVM的车载疲劳检测方法。主要是以统计学习理论为思想基础的、按监督学习方式进行数据分类的支持向量机。Working principle of the present invention: the invention provides a vehicle fatigue detection method based on multi-scale binary pattern histogram Fourier feature (MLBP-HF) and support vector machine SVM. It is mainly based on the statistical learning theory and supports vector machines for data classification according to the supervised learning method.
首先基于多尺度局部二进制模式的特征提取:First, feature extraction based on multi-scale local binary patterns:
将图像划分为多个同一大小但并不重叠的子区域,M0,M1,…Mj-1,每一块子区域的不同半径的LBP直方图,用以下公式计算:Divide the image into multiple sub-regions of the same size but not overlapping, M 0 , M 1 ,...M j-1 , the LBP histogram of each sub-region with different radii, calculated with the following formula:
B(v)为Bool指针,是对于每一个不同尺度的区域(Mj)计算直方图提供区域性信息,L为直方图bins的数量。B(v) is a Bool pointer, which is to calculate the histogram for each area of different scales (M j ) to provide regional information, and L is the number of histogram bins.
通过以下公式连接上述各个子区域的直方图得到一个单向量,是最终的多尺度特征描述算子:Connect the histograms of the above sub-regions by the following formula to obtain a single vector, which is the final multi-scale feature description operator:
fj=[hP,1,j,hP,2,j,...,hP,R,j]。f j =[h P,1,j ,h P,2,j ,...,h P,R,j ].
特征提取方法可以有效地表征疲劳,并取得了较好的检测效果,检测速度快,受光照变化影响较小,但是同时发现对于驾驶员人脸存在倾斜或偏转时检测率很低,漏检率和误检率较高。在对驾驶员人脸图像提取出MLBP特征后,使用傅里叶变换来对MLBP特征进行处理,进行离散傅里叶变换对于具有一定偏转角度的图像能够从特征上进行矫正,进而减小对疲劳检测的影响,提高检测率。The feature extraction method can effectively represent fatigue and achieve better detection results. The detection speed is fast and is less affected by illumination changes. However, it is found that the detection rate is very low when the driver's face is tilted or deflected, and the missed detection rate and high false positive rate. After extracting the MLBP features from the driver's face image, use Fourier transform to process the MLBP features, and perform discrete Fourier transform to correct the image with a certain deflection angle from the feature, thereby reducing the fatigue Detection of the impact, improve the detection rate.
上述方案中,为优化,进一步地,所述预处理包括将RGB图像转换为灰度图像及图像标准化处理。In the above solution, for optimization, further, the preprocessing includes converting the RGB image into a grayscale image and image standardization.
进一步地,所述基于多尺度局部二进制模式进行特征提取包括:Further, the feature extraction based on multi-scale local binary patterns includes:
步骤1,用LBP8,1,LBP8,2,LBP8,3三种LBP算子对原图进行计算,得到在不同尺度下的图像,分别为I1,I2,I3;Step 1, use LBP 8,1 , LBP 8,2 , LBP 8,3 three LBP operators to calculate the original image, and obtain images at different scales, respectively I1, I2, I3;
步骤2,对I1,I2,I3进行分块,分成的子块分别为:Step 2, divide I1, I2, and I3 into blocks, and the sub-blocks are as follows:
I1(1,n),I2(1,n),I3(1,n);I1(1,n),I2(1,n),I3(1,n);
步骤3,对不同尺度下对应的子区域进行拼接;Step 3, splicing the corresponding sub-regions at different scales;
步骤4,对步骤3中的每一个拼接结果计算LBP直方图;Step 4, calculating the LBP histogram for each splicing result in step 3;
其中,n为正整数。Wherein, n is a positive integer.
进一步地,步骤三包括:Further, step three includes:
步骤3.1,用离散傅里叶变换对多尺度局部二进制图像特征进行变换,设HP,r(n,·)为直方图hP,r(U(n,c))的第n行的DFT,即:Step 3.1, use discrete Fourier transform to transform multi-scale local binary image features, let H P,r (n, ) be the DFT of the nth row of the histogram h P,r (U(n,c)) ,which is:
HP,r(n,u)=∑hP,r(U(n,c))e-i2πur/P H P,r (n,u)=∑h P,r (U(n,c))e -i2πur/P
步骤3.2,对于离散傅里叶变换输入适量的循环位移带来的DFT系数中的相移,判定出h'P,r(U(n,c))=hP,r(U(n,c-a)),Step 3.2, for the phase shift in the DFT coefficients brought by the discrete Fourier transform input with an appropriate amount of cyclic shift, it is determined that h'P ,r (U(n,c))=hP ,r (U(n,ca )),
则定义H'P,r(n,c)=HP,r(n,u)e-i2πur/P;Then define H' P,r (n,c)=H P,r (n,u)e -i2πur/P ;
定义对于任意的1≤n1,n2≤P-1,有:Definition For any 1≤n 1 , n 2 ≤P-1, there are:
其中,为HP,r(n2,u)的复共轭;in, is the complex conjugate of H P,r (n 2 ,u);
对于任意1≤n1,n2≤P-1和多尺度二进制模式的直方图傅里叶特征,计算出:For any 1≤n 1 , n 2 ≤P-1 and histogram Fourier features of multiscale binary patterns, compute:
定义傅里叶幅度谱为 Define the Fourier magnitude spectrum as
定义 definition
则LBPr/u2作为一个子集包含于傅里叶幅度谱。Then LBP r/u2 is included as a subset in the Fourier magnitude spectrum.
用U(n,c)表示图像的规范LBP模式,圆形领域为(8,R)的58种不同的规范模式中,不同的行表示中心像素经过规范LBP算子计算得到的位序列中1的数量,不同的列表示图像的旋转角度。Use U(n,c) to represent the canonical LBP mode of the image. Among the 58 different canonical modes with the circular area (8,R), different rows represent the 1 in the bit sequence calculated by the canonical LBP operator for the central pixel. The number of different columns represent the rotation angle of the image.
用n用表示模式屮位的数量,也即行数,c为旋转角度也即列数。如果样本点数量为P,则0≤n≤P+1,其中n=P+1表示所有非规范模式,并且,当1≤n≤P-1时,旋转度0≤c≤P-1。加入用f'(x,y)表示f(x,y)偏转后的图像,偏转角度为α。偏转后点(x,y)移至(x',y'),可以知道两点也偏转α度。Use n to represent the number of pattern bits, that is, the number of rows, and c is the rotation angle, that is, the number of columns. If the number of sample points is P, then 0≤n≤P+1, where n=P+1 represents all non-canonical modes, and when 1≤n≤P-1, the degree of rotation is 0≤c≤P-1. Add f'(x,y) to represent the deflected image of f(x,y), and the deflection angle is α. After the deflection, the point (x, y) moves to (x', y'), and it can be known that the two points are also deflected by α degrees.
根据多尺度局部二进算子的特点,偏转角度可以做到两个样本点角度的整数倍,即:According to the characteristics of multi-scale local binary operators, the deflection angle can be an integer multiple of the angle of two sample points, namely:
则旋转后图像的规范模式直方图,对应于原图像在每一行的直方图的循环移位。Then the canonical mode histogram of the rotated image corresponds to the cyclic shift of the histogram of the original image in each row.
h'P,r(U(n,c+a))=hP,r(U(n,c))。h'P ,r (U(n,c+a))=hP ,r (U(n,c)).
为了避免循环移位给旋转后的图像直方图的影响,对图像使用傅里叶变换。To avoid the effect of cyclic shift on the rotated image histogram, a Fourier transform is applied to the image.
用离散傅里叶变换DFT来对之前得到的MLBP特征进行变换:Use the discrete Fourier transform DFT to transform the previously obtained MLBP features:
设HP,r(n,·)为直方图hP,r(U(n,c))的第n行的DFT,即:Let H P,r (n, ) be the DFT of the nth row of the histogram h P,r (U(n,c)), namely:
HP,r(n,u)=∑hP,r(U(n,c))e-i2πur/P;H P,r (n,u)=∑h P,r (U(n,c))e -i2πur/P ;
对于DFT输入适量的循环位移带来的DFT系数中的相移,如果:The phase shift in the DFT coefficients brought about by an appropriate amount of cyclic shift for the DFT input if:
h'P,r(U(n,c))=hP,r(U(n,c-a)),那么有H'P,r(n,c)=HP,r(n,u)e-i2πur/P,h' P,r (U(n,c))=h P,r (U(n,ca)), then H' P,r (n,c)=H P,r (n,u)e -i2πur/P ,
并且对于任意的1≤n1,n2≤P-1有:And for any 1≤n 1 , n 2 ≤P-1 have:
其中为HP,r(n2,u)的复共轭。in is the complex conjugate of H P,r (n 2 ,u).
显示对于任意1≤n1,n2≤P-1和MLBP-HF,特征:Show that for any 1≤n 1 , n 2 ≤P-1 and MLBP-HF, the features:
对于hP,r(U(n,c))的行的循环转换是不变的,并且对于输入图像f(x,y)的旋转也是不变的。The cyclic transformation of the rows of h P,r (U(n,c)) is invariant, and it is also invariant to the rotation of the input image f(x,y).
傅里叶幅度谱可以视为这些特征的一种特殊情况。Fourier magnitude spectrum can be considered a special case of these characteristics.
最终,既然那么LBPr/u2作为一个子集包含于傅里叶幅度谱中。Ultimately, since Then LBP r/u2 is included as a subset in the Fourier magnitude spectrum.
从而,在对驾驶员人脸图像提取出MLBP特征后,即可使用傅里叶变换来对特征进行处理,它的好处在于对于具有一定偏转角度的图像能够从特征上进行矫正,进而减小对疲劳检测的影响,提高检测率。Therefore, after extracting MLBP features from the driver's face image, Fourier transform can be used to process the features. Fatigue detection effect, improve detection rate.
本发明的有益效果:本发明针提出了一种基于多尺度二进制模式直方图傅里叶特征(MLBP-HF)和SVM的疲劳检测方法。方法中提取的MLBP-HF特征不仅能克服光照变化的影响,又能最大限度保留图像细节特征。而且,对MLBP特征进行傅里叶变化后能克服姿态变化的影响,解决了对于倾斜人脸检测率较低的问题。本发明有效的提高了人脸识别方面的识别率,使得识别受姿态和光照变化影响较小,疲劳检测率在一定程度上得到了提升。Beneficial effects of the present invention: the present invention proposes a fatigue detection method based on multi-scale binary pattern histogram Fourier feature (MLBP-HF) and SVM. The MLBP-HF features extracted in the method can not only overcome the influence of illumination changes, but also preserve the image details to the greatest extent. Moreover, the Fourier transform of MLBP features can overcome the influence of attitude changes and solve the problem of low detection rate for tilted faces. The invention effectively improves the recognition rate of face recognition, makes the recognition less affected by posture and illumination changes, and improves the fatigue detection rate to a certain extent.
附图说明Description of drawings
下面结合附图和实施例对本发明进一步说明。The present invention will be further described below in conjunction with the accompanying drawings and embodiments.
图1,实施例1中的车载疲劳检测方法流程图。FIG. 1 is a flow chart of the vehicle fatigue detection method in Embodiment 1.
具体实施方式Detailed ways
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
实施例1Example 1
本实施例提供一种基于多尺度二进制模式的车载疲劳检测方法,如图1,所述基于多尺度二进制模式的车载疲劳检测方法包括:This embodiment provides a vehicle fatigue detection method based on multi-scale binary patterns, as shown in Figure 1, the vehicle fatigue detection method based on multi-scale binary patterns includes:
步骤一,对驾驶员图像人工分类标记疲劳样本和不疲劳样本,分别选择训练样本和测试样本,对训练样本和测试样本进行预处理;Step 1: Manually classify and mark fatigue samples and non-fatigue samples on the driver image, select training samples and test samples respectively, and preprocess the training samples and test samples;
步骤二,将训练样本图像划分成若干不重复的子区域,使用多尺度局部二进制模式进行特征提取,获得多尺度局部二进制图像特征;Step 2, divide the training sample image into several non-repetitive sub-regions, use multi-scale local binary patterns for feature extraction, and obtain multi-scale local binary image features;
步骤三,对多尺度局部二进制图像特征进行离散傅里叶变化,得到多尺度二进制模式的直方图傅里叶特征向量;Step 3, performing discrete Fourier transformation on the multi-scale local binary image features to obtain the histogram Fourier feature vector of the multi-scale binary pattern;
步骤四,连接构成多尺度二进制模式的直方图傅里叶特征向量,用于表征图像特征,选择核函数,用非线性支持向量机对样本图像的MLBP特征进行分类训练,并得到训练后的SVM分类模型和参数;Step 4, connect the histogram Fourier feature vectors that constitute the multi-scale binary pattern to characterize the image features, select the kernel function, use the nonlinear support vector machine to classify and train the MLBP features of the sample image, and obtain the trained SVM Classification models and parameters;
步骤四,对测试样本重复步骤二至步骤三,利用训练得到的SVM分类模型和参数对测试样本的多尺度二进制模式的直方图傅里叶特征进行检测。Step 4: repeat steps 2 to 3 for the test sample, and use the trained SVM classification model and parameters to detect the histogram Fourier feature of the multi-scale binary pattern of the test sample.
具体地,所述预处理包括将RGB图像转换为灰度图像及图像标准化处理。Specifically, the preprocessing includes converting the RGB image into a grayscale image and image standardization.
详细地,所述基于多尺度局部二进制模式进行特征提取包括:In detail, the feature extraction based on multi-scale local binary patterns includes:
步骤1,用LBP8,1,LBP8,2,LBP8,3三种LBP算子对原图进行计算,得到在不同尺度下的图像,分别为I1,I2,I3;Step 1, use LBP 8,1 , LBP 8,2 , LBP 8,3 three LBP operators to calculate the original image, and obtain images at different scales, respectively I1, I2, I3;
步骤2,对I1,I2,I3进行分块,分成的子块分别为:Step 2, divide I1, I2, and I3 into blocks, and the sub-blocks are as follows:
I1(1,n),I2(1,n),I3(1,n);I1(1,n),I2(1,n),I3(1,n);
步骤3,对不同尺度下对应的子区域进行拼接;Step 3, splicing the corresponding sub-regions at different scales;
步骤4,对步骤3中的每一个拼接结果计算LBP直方图。Step 4, calculate the LBP histogram for each splicing result in step 3.
详细地,步骤三包括:In detail, step three includes:
步骤3.1,用离散傅里叶变换对多尺度局部二进制图像特征进行变换,设HP,r(n,·)为直方图hP,r(U(n,c))的第n行的DFT,即:Step 3.1, use discrete Fourier transform to transform multi-scale local binary image features, let H P,r (n, ) be the DFT of the nth row of the histogram h P,r (U(n,c)) ,which is:
HP,r(n,u)=∑hP,r(U(n,c))e-i2πur/P H P,r (n,u)=∑h P,r (U(n,c))e -i2πur/P
步骤3.2,对于离散傅里叶变换输入适量的循环位移带来的DFT系数中的相移,判定出h'P,r(U(n,c))=hP,r(U(n,c-a)),Step 3.2, for the phase shift in the DFT coefficients brought by the discrete Fourier transform input with an appropriate amount of cyclic shift, it is determined that h'P ,r (U(n,c))=hP ,r (U(n,ca )),
则定义H'P,r(n,c)=HP,r(n,u)e-i2πur/P;Then define H' P,r (n,c)=H P,r (n,u)e -i2πur/P ;
定义对于任意的1≤n1,n2≤P-1,有:Definition For any 1≤n 1 , n 2 ≤P-1, there are:
其中,为HP,r(n2,u)的复共轭;in, is the complex conjugate of H P,r (n 2 ,u);
对于任意1≤n1,n2≤P-1和多尺度二进制模式的直方图傅里叶特征,计算出:For any 1≤n 1 , n 2 ≤P-1 and histogram Fourier features of multiscale binary patterns, compute:
定义傅里叶幅度谱为 Define the Fourier magnitude spectrum as
定义 definition
则LBPr/u2作为一个子集包含于傅里叶幅度谱。Then LBP r/u2 is included as a subset in the Fourier magnitude spectrum.
如图1所示,基于MLBP特征提取,包含以下2个步骤:As shown in Figure 1, feature extraction based on MLBP includes the following two steps:
①将图像划分为多个同一大小但并不重叠的子区域,M0,M1,…Mj-1,每一块子区域的不同半径的LBP直方图用公式计算;① Divide the image into multiple sub-regions of the same size but not overlapping, M 0 , M 1 , ... M j-1 , the LBP histogram of each sub-region with different radii is calculated by the formula;
②通过公式连接上述各个子区域的直方图得到一个单向量,这就是最终的多尺度特征描述算子。② Connect the histograms of the above sub-regions through the formula to obtain a single vector, which is the final multi-scale feature description operator.
如图1所示,图像的MLBP特征获取,过程包含以下4个步骤:As shown in Figure 1, the MLBP feature acquisition process of the image includes the following four steps:
①用三种不同LBP算子对原图进行计算,得到在不同尺度下的图像;① Use three different LBP operators to calculate the original image to obtain images at different scales;
②对不同尺度下的图像进行分块;②Block images at different scales;
③对不同尺度下对应的子区域进行拼接;③ Splicing the corresponding sub-regions at different scales;
④对每一个拼接结果计算LBP直方图。④ Calculate the LBP histogram for each splicing result.
如图1所示,标准的SVM分类,分为两个阶段:As shown in Figure 1, the standard SVM classification is divided into two stages:
第一,训练阶段:First, the training phase:
①建立训练样本集;① Establish a training sample set;
②选择核函数及参数,将低维空间向高维空间变换;② Select the kernel function and parameters to transform the low-dimensional space to the high-dimensional space;
③输入样本正规化;③ Normalization of input samples;
④构造核矩阵;④ Construct a kernel matrix;
⑤求解拉格朗日系数;⑤ Solve the Lagrangian coefficient;
⑥计算出支持向量,求解分类超平面系数;⑥Calculate support vectors and solve classification hyperplane coefficients;
⑦根据求得的各类系数,建立训练数据的最优分类超平面,完成训练过程。⑦ According to the obtained various coefficients, the optimal classification hyperplane of the training data is established to complete the training process.
第二,检测阶段:Second, the detection phase:
①读入学习阶段训练得到的模板数据文件;① Read in the template data file trained in the learning phase;
②根据公式计算新输入特征数据的输出值;② Calculate the output value of the new input characteristic data according to the formula;
③利用分类函数计算,给出分类结果。③ Use the classification function to calculate and give the classification result.
本实施例提出了一种基于MLBP特征和SVM的车载疲劳检测方法,与现有技术相比本发明改善了具有一定倾斜免度的人脸图像会影响检测率的问题,对原基于MLBP算子的特征提取方法进行优。This embodiment proposes a vehicle-mounted fatigue detection method based on MLBP features and SVM. Compared with the prior art, the present invention improves the problem that a face image with a certain degree of inclination will affect the detection rate. The feature extraction method is optimized.
本发明分为训练样本和测试样本,实验分为训练和检测两个过程,在训练阶段,首先使用基于MLBP算子的方法对人脸训练样本图像进行特征提取,然后将特征向量输入SVM到中进行分类训练,并选取一种核函数获取SVM分类模型;在检测阶段,首先对测试样本用同样的方法进行特征提取,然后再用SVM分类模型进行疲劳检测。在提取特征时采用了图像分割技术,通过对不同的分割类型进行比较实验,并进行实验结果分析之后,针对该方法对于倾斜人脸检测率较低的问题。The present invention is divided into training samples and test samples, and the experiment is divided into two processes of training and detection. In the training stage, the method based on the MLBP operator is first used to extract the features of the face training sample images, and then the feature vector is input into the SVM. Carry out classification training, and select a kernel function to obtain the SVM classification model; in the detection stage, first use the same method to extract features from the test samples, and then use the SVM classification model for fatigue detection. Image segmentation technology is used in extracting features. After comparing experiments with different segmentation types and analyzing the experimental results, the method is aimed at the problem of low detection rate of tilted faces.
本实施例提出了一种基于多尺度二进制模式直方图傅里叶特征MLBP-HF和支持向量机SVM的疲劳检测方法。该方法中提取的MLBP-HF特征不仅能克服光照变化的影响,又能最大限度保留图像细节特征,而且,对MLBP特征进行傅里叶变化后能克服姿态变化的影响。本实施例在人脸识别方面有较好的识别率,受姿态和光照变化影响较小,并且使得疲劳检测率在一定程度上得到了提升。因此这种基于MLBP特征和SVM的车载疲劳检测方法对于车载疲劳检测领域具有重要的理论意义和应用价值。This embodiment proposes a fatigue detection method based on multi-scale binary pattern histogram Fourier feature MLBP-HF and support vector machine SVM. The MLBP-HF feature extracted in this method can not only overcome the influence of illumination changes, but also preserve the image detail features to the maximum extent, and can overcome the influence of attitude changes after performing Fourier transformation on MLBP features. This embodiment has a better recognition rate in face recognition, is less affected by posture and illumination changes, and improves the fatigue detection rate to a certain extent. Therefore, this vehicle fatigue detection method based on MLBP features and SVM has important theoretical significance and application value in the field of vehicle fatigue detection.
尽管上面对本发明说明性的具体实施方式进行了描述,以便于本技术领域的技术人员能够理解本发明,但是本发明不仅限于具体实施方式的范围,对本技术领域的普通技术人员而言,只要各种变化只要在所附的权利要求限定和确定的本发明精神和范围内,一切利用本发明构思的发明创造均在保护之列。Although the illustrative specific embodiments of the present invention have been described above, so that those skilled in the art can understand the present invention, the present invention is not limited to the scope of the specific embodiments. For those of ordinary skill in the art, as long as each As long as the changes are within the spirit and scope of the present invention defined and determined by the appended claims, all inventions and creations utilizing the concept of the present invention are included in the protection list.
Claims (4)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910378462.XA CN110084220A (en) | 2019-05-08 | 2019-05-08 | A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910378462.XA CN110084220A (en) | 2019-05-08 | 2019-05-08 | A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN110084220A true CN110084220A (en) | 2019-08-02 |
Family
ID=67419208
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201910378462.XA Pending CN110084220A (en) | 2019-05-08 | 2019-05-08 | A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN110084220A (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113205081A (en) * | 2021-06-11 | 2021-08-03 | 北京惠朗时代科技有限公司 | SVM model worker fatigue accurate judgment method based on significance detection |
| US12443387B2 (en) | 2019-08-21 | 2025-10-14 | Micron Technology, Inc. | Intelligent audio control in vehicles |
| US12518570B2 (en) | 2019-12-18 | 2026-01-06 | Lodestar Licensing Group Llc | Predictive maintenance of automotive transmission |
| US12525111B2 (en) | 2019-08-21 | 2026-01-13 | Lodestar Licensing Group Llc | Drowsiness detection for vehicle control |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105117688A (en) * | 2015-07-29 | 2015-12-02 | 重庆电子工程职业学院 | Face identification method based on texture feature fusion and SVM |
| CN105976809A (en) * | 2016-05-25 | 2016-09-28 | 中国地质大学(武汉) | Voice-and-facial-expression-based identification method and system for dual-modal emotion fusion |
| CN109190455A (en) * | 2018-07-18 | 2019-01-11 | 东南大学 | Black smoke vehicle recognition methods based on Gaussian Mixture and autoregressive moving-average model |
-
2019
- 2019-05-08 CN CN201910378462.XA patent/CN110084220A/en active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105117688A (en) * | 2015-07-29 | 2015-12-02 | 重庆电子工程职业学院 | Face identification method based on texture feature fusion and SVM |
| CN105976809A (en) * | 2016-05-25 | 2016-09-28 | 中国地质大学(武汉) | Voice-and-facial-expression-based identification method and system for dual-modal emotion fusion |
| CN109190455A (en) * | 2018-07-18 | 2019-01-11 | 东南大学 | Black smoke vehicle recognition methods based on Gaussian Mixture and autoregressive moving-average model |
Non-Patent Citations (1)
| Title |
|---|
| 赵李坤: "基于人脸特征识别的驾驶员疲劳检测方法研究", 《中国优秀硕士学位论文全文数据库•工程科技Ⅱ辑》 * |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12443387B2 (en) | 2019-08-21 | 2025-10-14 | Micron Technology, Inc. | Intelligent audio control in vehicles |
| US12525111B2 (en) | 2019-08-21 | 2026-01-13 | Lodestar Licensing Group Llc | Drowsiness detection for vehicle control |
| US12518570B2 (en) | 2019-12-18 | 2026-01-06 | Lodestar Licensing Group Llc | Predictive maintenance of automotive transmission |
| CN113205081A (en) * | 2021-06-11 | 2021-08-03 | 北京惠朗时代科技有限公司 | SVM model worker fatigue accurate judgment method based on significance detection |
| CN113205081B (en) * | 2021-06-11 | 2024-01-05 | 北京惠朗时代科技有限公司 | SVM model worker fatigue accurate judging method based on significance detection |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Wei et al. | Multi-target defect identification for railway track line based on image processing and improved YOLOv3 model | |
| Calik et al. | Large-scale offline signature recognition via deep neural networks and feature embedding | |
| CN102147858B (en) | License plate character identification method | |
| CN104598885B (en) | The detection of word label and localization method in street view image | |
| CN102163281B (en) | Real-time human body detection method based on AdaBoost frame and colour of head | |
| CN110084220A (en) | A kind of vehicle-mounted fatigue detection method based on multiple dimensioned binary mode | |
| CN105488536A (en) | Agricultural pest image recognition method based on multi-feature deep learning technology | |
| CN103679191B (en) | An automatic fake-licensed vehicle detection method based on static state pictures | |
| CN114220178A (en) | Signature identification system and method based on channel attention mechanism | |
| CN102867195B (en) | Method for detecting and identifying a plurality of types of objects in remote sensing image | |
| CN104282008B (en) | The method and apparatus that Texture Segmentation is carried out to image | |
| CN112200186A (en) | Vehicle logo recognition method based on improved YOLO_V3 model | |
| CA2435290A1 (en) | Computer-aided image analysis | |
| CN104751475A (en) | Feature point optimization matching method for static image object recognition | |
| Wang et al. | Fully convolutional network based skeletonization for handwritten chinese characters | |
| CN107886539A (en) | High class gear visible detection method under a kind of industrial scene | |
| CN104616016A (en) | Global feature and local feature combined texture feature description method | |
| CN100345152C (en) | Face recognition method based on template matching | |
| Cüceloğlu et al. | Detecting handwritten signatures in scanned documents | |
| CN103902965B (en) | Spatial domain symbiosis image representing method and its application in image classification, identification | |
| CN116343232A (en) | Ancient book mathematical symbol recognition method based on pre-classification | |
| Yan et al. | Chinese text location under complex background using Gabor filter and SVM | |
| CN116740443A (en) | A multi-scale feature matching printing defect detection method based on few-shot learning | |
| Girish et al. | Tire imprint identification and classification using VGG19 | |
| Singh et al. | Verifying bank checks using deep learning and image processing |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190802 |
|
| RJ01 | Rejection of invention patent application after publication |