[go: up one dir, main page]

CN104134364B - Real-time traffic sign identification method and system with self-learning capacity - Google Patents

Real-time traffic sign identification method and system with self-learning capacity Download PDF

Info

Publication number
CN104134364B
CN104134364B CN201410363876.2A CN201410363876A CN104134364B CN 104134364 B CN104134364 B CN 104134364B CN 201410363876 A CN201410363876 A CN 201410363876A CN 104134364 B CN104134364 B CN 104134364B
Authority
CN
China
Prior art keywords
traffic sign
matrix
class
graph
mapping
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201410363876.2A
Other languages
Chinese (zh)
Other versions
CN104134364A (en
Inventor
李晶晶
鲁珂
谢昌元
张旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201410363876.2A priority Critical patent/CN104134364B/en
Publication of CN104134364A publication Critical patent/CN104134364A/en
Application granted granted Critical
Publication of CN104134364B publication Critical patent/CN104134364B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Image Analysis (AREA)

Abstract

The invention discloses a real-time traffic signal identification method and system with self-learning capacity. The real-time traffic signal identification method with the self-learning capacity includes the steps that acquired image data are detected, and then traffic sign images are acquired; the detected traffic sign images are identified according to a method based on dimensionality reduction. The acquired image data are detected, the traffic sign images are acquired, dimensionality reduction processing is conducted on the traffic sign images, the traffic sign images are compared with images in a classification library, the meanings of the traffic sign images are acquired, a mapping matrix obtained after dimensionality reduction is updated through self-learning, traffic signs are more accurately identified, the running speed is high according to the adopted dimensionality reduction method, and then the traffic signs are quickly and accurately identified.

Description

具有自我学习能力的实时交通标记识别方法及系统Real-time traffic sign recognition method and system with self-learning ability

技术领域technical field

本发明涉及交通标志识别技术领域,具体地,涉及一种具有自我学习能力的实时交通标记识别方法及系统。The invention relates to the technical field of traffic sign recognition, in particular to a real-time traffic sign recognition method and system with self-learning capability.

背景技术Background technique

随着Google无人驾驶汽车的发布,智能交通再一次成为人们热议的话题,在当前的道路交通组织方式下,无人驾驶要融入现有道路交通环境中,必须要解决交通标志的识别问题。另一方面,如果汽车或者车载设备能够识别交通标志,无疑将降低驾驶员的负担,带来更便捷的驾驶体验,与汽车控制系统联动,会带来更智能的驾驶方式,可以减少交通事故发生率。With the release of Google's driverless car, intelligent transportation has once again become a hot topic of discussion. Under the current road traffic organization method, if driverless driving is to be integrated into the existing road traffic environment, it is necessary to solve the problem of traffic sign recognition . On the other hand, if a car or vehicle-mounted equipment can recognize traffic signs, it will undoubtedly reduce the burden on the driver and bring a more convenient driving experience. Linkage with the car control system will bring a smarter driving method and reduce traffic accidents. Rate.

目前,在计算机系统上,已经提出了一些交通标志识别算法,但是这些成果大部分仅限制在研究和实验领域,或者只是运行在PC上,并没有应用于实际的汽车或车载设备中,经过调查分析,认为现有技术面对或存在以下问题:1)缺少合适的环境图像采集设备,2)算法识别率低,不能满足自动识别的需求,3)算法训练周期长,运行开销大,无法满足实时场景。At present, on the computer system, some traffic sign recognition algorithms have been proposed, but most of these achievements are only limited to the field of research and experimentation, or only run on the PC, and have not been applied to actual cars or vehicle-mounted equipment. After investigation According to the analysis, the existing technology faces or has the following problems: 1) lack of suitable environmental image acquisition equipment, 2) the recognition rate of the algorithm is low, which cannot meet the needs of automatic recognition, 3) the algorithm training period is long, and the operation cost is large, which cannot meet the requirements of automatic recognition. real-time scene.

发明内容Contents of the invention

本发明的目的在于,针对上述问题,提出一种具有自我学习能力的实时交通标记识别方法及系统,以实现快速准确的识别交通标记的优点。The purpose of the present invention is to address the above problems, to propose a real-time traffic sign recognition method and system with self-learning ability, so as to realize the advantages of fast and accurate recognition of traffic signs.

为实现上述目的,本发明采用的技术方案是:In order to achieve the above object, the technical scheme adopted in the present invention is:

一种具有自我学习能力的实时交通标记识别方法,包括将采集的图像数据进行检测,从而得到交通标志图像的步骤;A real-time traffic sign recognition method with self-learning ability, including the step of detecting the collected image data to obtain the traffic sign image;

对上述检测到的交通标志图像采用基于降维的方法进行识别的步骤;A step of identifying the above-mentioned detected traffic sign image using a method based on dimensionality reduction;

上述基于降维的方法为:将交通标志图像表示为一个矩阵X,X为高维矩阵,然后将X通过一个线性映射投影到一个低维空间,将X对应的低维空间矩阵表示为y,则映射关系为:The above method based on dimensionality reduction is: represent the traffic sign image as a matrix X, X is a high-dimensional matrix, and then project X to a low-dimensional space through a linear map, and represent the low-dimensional space matrix corresponding to X as y, Then the mapping relationship is:

y=XATy=XA T ,

其中,A是映射矩阵,是通过训练得到的,具体为,在初始化阶段,训练库预先设定样本为训练库,通过预先设定的样本,得到映射矩阵A,在实际运用中采集到新的交通标志图像后,将代表新的交通标志图像的特征矩阵利用A映射到一个低维空间,然后利用分类器将低维映射值与样本映射值分类,得出新的交通标志图像属于哪类,最终得到识别结果,如果识别正确,则不做处理,如果识别错误,则将新的交通标志图像发送给云端服务器,云端服务器将其加入到训练库中,重新训练得到新的映射矩阵A′,得到A′后,使用网络将该映射矩阵传输到安装在移动终端上的数据处理模块,使用A′替换A,即A′成为新映射矩阵。Among them, A is the mapping matrix, which is obtained through training. Specifically, in the initialization phase, the training library presets samples as the training library, and the mapping matrix A is obtained through the preset samples. After the traffic sign image, the feature matrix representing the new traffic sign image is mapped to a low-dimensional space by A, and then the classifier is used to classify the low-dimensional mapping value and the sample mapping value to obtain which category the new traffic sign image belongs to. Finally, the recognition result is obtained. If the recognition is correct, no processing will be performed. If the recognition is wrong, the new traffic sign image will be sent to the cloud server, and the cloud server will add it to the training library and retrain to obtain a new mapping matrix A'. After obtaining A', use the network to transmit the mapping matrix to the data processing module installed on the mobile terminal, and use A' to replace A, that is, A' becomes a new mapping matrix.

优选的,上述利用分类器将低维映射值与样本映射值分类中的分类器至少包括最近邻分类器和支持向量机分类器。Preferably, the above-mentioned classifiers for classifying low-dimensional mapping values and sample mapping values using classifiers include at least a nearest neighbor classifier and a support vector machine classifier.

优选的,上述基于降维的识别方法,方法为基于稀疏表示的图嵌入方法。Preferably, the above recognition method based on dimensionality reduction is a graph embedding method based on sparse representation.

优选的,所述基于稀疏表示的图嵌入方法具体为:Preferably, the graph embedding method based on sparse representation is specifically:

步骤401:将训练库中的交通标志图像进行分类得到分层图结构,在不同的层分别构建类内图和类间图;Step 401: classify the traffic sign images in the training library to obtain a hierarchical graph structure, and construct intra-class graphs and inter-class graphs in different layers;

步骤402:将上述分层图结构应用到图嵌入框架下,得到如下目标函数:Step 402: Apply the above layered graph structure to the graph embedding framework to obtain the following objective function:

其中:y表示低维空间矩阵,X表示采集的样本集合,Ww表示类内图的权重矩阵,Wb表示类间图的权重矩阵,Lw和Lb分别是类内图和类间图的拉普拉斯特征矩阵,定义为L=D-W,D是一个对角矩阵,其对角元素的值由Dii=∑jwij计算得到,其余位置元素值为0,子空间映射矩阵A通过求解如下式得到:Among them: y represents the low-dimensional space matrix, X represents the collected sample set, W w represents the weight matrix of the intra-class graph, W b represents the weight matrix of the inter-class graph, L w and L b are the intra-class graph and the inter-class graph respectively The Laplacian characteristic matrix of , defined as L=DW, D is a diagonal matrix, the value of its diagonal element is calculated by D ii =∑ j w ij , and the value of other position elements is 0, the subspace mapping matrix A It is obtained by solving the following formula:

AXTLwXAT=λAXTLbXATAX T L w XA T =λAX T L b XA T ,

假设a1,a2,……ad为求解上式得到的特征向量,λ1,λ2,……λd为对应的特征值,并且满足条件λ12<……<λd,映射关系表示为:Suppose a 1 , a 2 ,...a d are the eigenvectors obtained by solving the above formula, λ 1 , λ 2 ,...λ d are the corresponding eigenvalues, and satisfy the condition λ 12 <...<λ d , the mapping relationship is expressed as:

X→y=XAT,A=[a1,a1,......ad];X→y=XA T , A=[a 1 , a 1 ,...a d ];

步骤403:引入稀疏表示优化步骤402中的图嵌入;Step 403: Introducing sparse representation to optimize the graph embedding in step 402;

具体为首先,目标函数初步定义为: Specifically, first, the objective function is initially defined as:

为了使A满足稀疏性在目标函数中加入如下的正则项:min||A||2,1In order to make A satisfy the sparsity, the following regular term is added to the objective function: min||A|| 2 , 1,

将步骤402中的f转化为如下等价公式:Convert f in step 402 into the following equivalent formula:

min yTLwymin y T L w y

s.t. yTLby=I,st y T L b y = I,

得到最终的目标函数:Get the final objective function:

s.t. yTLby=I,st y T L b y = I,

其中,ω>0为平衡参数。将L对A求导,并令导数为零,得到A的表达式为:Among them, ω>0 is a balance parameter. Deriving L with respect to A, and setting the derivative to zero, the expression of A is:

其中,Δ是对角矩阵,其对角元素由以下公式计算where, Δ is a diagonal matrix whose diagonal elements are calculated by

非对角位置元素值为0。The value of the off-diagonal element is 0.

将得到的A带入到最终目标函数L中,然后用拉格朗日法解最优化问题,优化解为下式前d个最小特征值对应的特征向量:Bring the obtained A into the final objective function L, and then use the Lagrangian method to solve the optimization problem. The optimal solution is the eigenvector corresponding to the first d smallest eigenvalues of the following formula:

Γy=λLby,Γy=λL b y,

其中, in,

使用迭代法来解决此优化问题,即首先固定A,求解y,然后使用得到的y去更新A,如此往复,直到A和y收敛。 Use an iterative method to solve this optimization problem, that is, first fix A, solve y, and then use the obtained y to update A, and so on until A and y converge.

优选的,所述将检测的交通标志图像进行分层得到分层图结构具体为:采用类内图和类间图的方法,所述类内图:每类数据进行局部近邻链接,采用k近邻方法,根据实验效果,调整参数k的值,对于有链接的边,赋予权重,权重采用热核函数定义,然后每类的权重矩阵,组合起来,为类内图的权重矩阵Ww;其中热核函数的定义为如果节点i和j之间存在连接,则设置权值wij=exp(-||xi-xj||22),否则权值设为0。Preferably, the layered graph structure obtained by layering the detected traffic sign images is specifically: adopting the methods of intra-class graph and inter-class graph, and said intra-class graph: each type of data is connected with local neighbors, and k-nearest neighbors are used Method, according to the experimental results, adjust the value of the parameter k, assign weights to the edges with links, and the weights are defined by the thermal kernel function, and then the weight matrix of each class is combined to form the weight matrix W w of the intra-class graph; where the thermal The kernel function is defined as if there is a connection between nodes i and j, then set the weight w ij =exp(-||x i -x j || 22 ), otherwise set the weight to 0.

所述类间图:由于交通标记的特殊性,即某几类信号的相似度很高,存在小类的情况,因此,先对信号进行分类,标记好大类的记号,寻找一类与其他几类最近的点,进行链接,权重矩阵采用热核函数定义,然后对于大类之间,选取一大类与其他大类间最近的点进行连接,赋予权重值,获得类间图的权重矩阵WbThe inter-class graph: due to the particularity of traffic signs, that is, certain types of signals have a high similarity and there are small classes. Therefore, first classify the signals, mark the signs of the major classes, and search for one class and other classes. The nearest points of several categories are linked, and the weight matrix is defined by the thermal kernel function. Then, for the large categories, the nearest points between a large category and other categories are selected for connection, and weight values are assigned to obtain the weight matrix of the inter-category graph. W b .

优选的,上述类内图的k=4。Preferably, k=4 in the above intraclass graph.

同时本发明技术方案还公开一种运行具有自我学习能力的实时交通标记识别方法的系统,包括图像采集模块、结果输出模块和数据处理模块,所述图像采集模块采集的数据经数据处理模块处理后通过结果输出模块显示,所述图像采集模块和结果输出模块采用智能移动终端,所述数据处理模块由智能移动终端和云端服务器完成,具体为简单快速的线性运算由智能移动终端完成,所述线性运算包括特征降维和分类器,训练过程由云端服务器完成,且云端服务器和智能移动终端双向通信。At the same time, the technical solution of the present invention also discloses a system for operating a real-time traffic sign recognition method with self-learning ability, including an image acquisition module, a result output module and a data processing module, and the data collected by the image acquisition module is processed by the data processing module The result output module shows that the image acquisition module and the result output module adopt an intelligent mobile terminal, and the data processing module is completed by an intelligent mobile terminal and a cloud server, specifically, simple and fast linear operations are completed by an intelligent mobile terminal, and the linear The operation includes feature dimension reduction and classifier, and the training process is completed by the cloud server, and the cloud server communicates with the smart mobile terminal two-way.

优选的,所述智能移动终端为具有摄像头的智能手机。Preferably, the smart mobile terminal is a smart phone with a camera.

本发明的技术方案具有以下有益效果:The technical solution of the present invention has the following beneficial effects:

本发明的技术方案,通过对采集的图像数据进行检测,得到交通标志图像,并对交通标志图像进行降维处理,然后与分类库进行比对,从而得出交通标志图像的含义,并通过自我学习对降维的映射矩阵进行更新,从而使得交通标记的识别更加准确,而采用的降维方法运行速度快,在当前主流手机硬件配置水平下可以满足实时场景的应用,从而达到了快速准确的识别交通标记的目的。In the technical solution of the present invention, the traffic sign image is obtained by detecting the collected image data, and the traffic sign image is subjected to dimensionality reduction processing, and then compared with the classification library to obtain the meaning of the traffic sign image, and through self Learning to update the mapping matrix of dimension reduction, so that the recognition of traffic signs is more accurate, and the dimension reduction method adopted is fast, and can meet the application of real-time scenarios under the current mainstream mobile phone hardware configuration level, thus achieving fast and accurate recognition. Identify the purpose of traffic signs.

下面通过附图和实施例,对本发明的技术方案做进一步的详细描述。The technical solutions of the present invention will be described in further detail below with reference to the accompanying drawings and embodiments.

附图说明Description of drawings

图1为本发明实施例所述的具有自我学习能力的实时交通标记识别方法及系统的原理框图;Fig. 1 is the functional block diagram of the real-time traffic sign recognition method and system with self-learning ability described in the embodiment of the present invention;

图2a、图2c和图2e为二维数据示意图;Figure 2a, Figure 2c and Figure 2e are schematic diagrams of two-dimensional data;

图2b、图2d和图2f为采用LPP、LDA以及LDA over LPP算法降维后的一维数据示意图;Figure 2b, Figure 2d and Figure 2f are schematic diagrams of one-dimensional data after dimensionality reduction using LPP, LDA and LDA over LPP algorithms;

图3为本发明实施例所述的分层图结构示意图;FIG. 3 is a schematic diagram of a layered graph structure according to an embodiment of the present invention;

图4为本发明实施例所述的类内图构建示意图;FIG. 4 is a schematic diagram of constructing an intraclass graph according to an embodiment of the present invention;

图5a和图5b为本发明实施例所述的类间图的构建示意图;Fig. 5a and Fig. 5b are schematic diagrams of constructing the interclass diagram described in the embodiment of the present invention;

图6为本发明实施例所述的具有自我学习能力的实时交通标记识别方法应用示意图。Fig. 6 is a schematic diagram of the application of the real-time traffic sign recognition method with self-learning ability according to the embodiment of the present invention.

结合附图,本发明实施例中附图标记如下:In conjunction with the accompanying drawings, the reference signs in the embodiments of the present invention are as follows:

1-检测标记;2-图像捕捉工作区;3-识别结果显示区;4-设置菜单;5-切换镜头;6-语音提示;7-反馈结果;8-识别图像;9-实时识别。1- detection mark; 2- image capture work area; 3- recognition result display area; 4- setting menu; 5- switching camera; 6- voice prompt; 7- feedback result; 8- recognition image;

具体实施方式detailed description

以下结合附图对本发明的优选实施例进行说明,应当理解,此处所描述的优选实施例仅用于说明和解释本发明,并不用于限定本发明。The preferred embodiments of the present invention will be described below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

一种具有自我学习能力的实时交通标记识别方法,包括将采集的图像数据进行检测,从而得到交通标志图像的步骤;A real-time traffic sign recognition method with self-learning ability, including the step of detecting the collected image data to obtain the traffic sign image;

对上述检测到的交通标志图像采用基于降维的方法进行识别的步骤;A step of identifying the above-mentioned detected traffic sign image using a method based on dimensionality reduction;

上述基于降维的方法具体为:目前主流的交通标志识别方法,在性能准确率上能达到甚至超过人脑自然识别率的主要有两类方法,第一类是基于神经网络的,该类方法的特点是识别率高,但是训练开销非常大,无法适应实时识别场景;第二类是基于降维方法的,该类方法的特点是识别准确率相对第一类方法略低,但是训练开销更小。本专利技术方案采用基于降维的方法。该方法的基本思想是,将交通标志图像表示为一个矩阵X,X通常是一个高维矩阵,直接对X进行运算会需要很大的计算开销,将X通过一个线性映射投影到一个低维空间,如果将X对应的低维空间矩阵表示为y,则这种映射关系可以表示为:The above methods based on dimensionality reduction are specifically: the current mainstream traffic sign recognition methods, there are mainly two types of methods that can reach or even exceed the natural recognition rate of the human brain in terms of performance accuracy. The first type is based on neural networks. The characteristic of this type of method is that the recognition rate is high, but the training cost is very large, and it cannot adapt to the real-time recognition scene; the second type is based on the dimensionality reduction method, which is characterized by a slightly lower recognition accuracy than the first type method, but the training cost is higher. Small. The technical solution of this patent adopts a method based on dimensionality reduction. The basic idea of this method is to represent the traffic sign image as a matrix X, X is usually a high-dimensional matrix, directly operating on X will require a lot of computational overhead, and project X to a low-dimensional space through a linear map , if the low-dimensional space matrix corresponding to X is expressed as y, then this mapping relationship can be expressed as:

y=XAT, (1)y=XA T , (1)

其中,A是映射矩阵,通过复杂的训练得到。在机器学习领域一般的解决方案中,要得到A,通常需要进行特征分解运算,这种运算开销很大。同时,在实际环境中采集到的交通图像,因为光照和角度等差异,对识别是一个很大的挑战,如果只是采用既有的训练样本,得到的结果在测试机上可能会有令人满意的效果,但是不一定能适应真实的使用场景,所以,引入自我学习能力,具体表现为,在系统(算法)的初始化阶段,训练库中只有预先设定的样本,通过这些样本,得到映射矩阵A,当用户在实际使用中采集到新的交通标志图像后,将代表该图像的特征矩阵利用A映射到一个低维空间,然后利用分类器(比如最近邻分类和支持向量机)将低维映射值与样本映射值分类,得出该图像属于哪一类,最终得到识别结果,如果识别正确,则不做其他处理,如果识别错误,则将该图像发送给云端服务器,云端服务器将其加入到训练库中,重新训练得到新的映射矩阵A′,得到A′后,使用网络将该映射矩阵传输到安装在智能手机上的数据处理模块,使用A′替换A,此后,智能手机将使用A′对图像降维,如此往复,随着新训练样本的不断加入,系统可以始终保持令人满意的识别率。而且,在这整个过程中,智能手机的运算开销都非常小,可以满足实时应用的要求。具有自我学习能力的实时交通标记识别方法和识别系统的分工如图1所示。从图1中可以看出,从采集图像到显示结果的整个完整的识别过程都是在智能手机上完成的,并且仅仅用传输一个矩阵所产生的网络开销带来了识别率的提高和计算开销的大幅削减,使得系统能够适用于实时场景,同时还保持了自我学习能力。Among them, A is the mapping matrix, which is obtained through complex training. In the general solution in the field of machine learning, to get A, it usually needs to perform eigendecomposition operation, which is very expensive. At the same time, traffic images collected in the actual environment are a great challenge to recognition due to differences in illumination and angles. If only existing training samples are used, the results obtained on the test machine may be satisfactory. effect, but it may not be able to adapt to real usage scenarios. Therefore, the introduction of self-learning ability, specifically, in the initialization phase of the system (algorithm), there are only preset samples in the training library. Through these samples, the mapping matrix A is obtained. , when the user collects a new traffic sign image in actual use, the feature matrix representing the image is mapped to a low-dimensional space using A, and then the low-dimensional mapping is made using a classifier (such as nearest neighbor classification and support vector machine) Value and sample mapping value classification to get which category the image belongs to, and finally get the recognition result. If the recognition is correct, no other processing will be done. If the recognition is wrong, the image will be sent to the cloud server, and the cloud server will add it to the In the training library, retrain to get a new mapping matrix A', after getting A', use the network to transmit the mapping matrix to the data processing module installed on the smart phone, use A' to replace A, after that, the smart phone will use A 'Reduce the dimension of the image, and so on, with the continuous addition of new training samples, the system can always maintain a satisfactory recognition rate. Moreover, in the whole process, the computing overhead of the smart phone is very small, which can meet the requirements of real-time applications. The real-time traffic sign recognition method with self-learning ability and the division of labor of the recognition system are shown in Figure 1. It can be seen from Figure 1 that the entire complete recognition process from image acquisition to display results is completed on the smartphone, and only the network overhead generated by transmitting a matrix brings the improvement of recognition rate and computational overhead The drastic reduction of , makes the system applicable to real-time scenarios, while maintaining the ability of self-learning.

其具体工作如下:首先,通过智能手机的摄像头获取环境图像数据。获取方法是使用系统提供的摄像头接口,在启动手机端App时,默认打开手机摄像头,并保持屏幕在唤醒状态,系统可以以每秒最高12帧的速率在摄像头的取景窗口中获取图像数据,并将这些图像数据提交给数据处理模块中的交通标志检测单元检测。Its specific work is as follows: First, the environment image data is obtained through the camera of the smart phone. The acquisition method is to use the camera interface provided by the system. When the mobile phone app is started, the mobile phone camera is turned on by default and the screen is kept awake. The system can obtain image data in the viewfinder window of the camera at a rate of up to 12 frames per second, and Submit these image data to the traffic sign detection unit in the data processing module for detection.

交通标志检测单元逐帧检测由图像采集单元获取到的数据,如果在图像中检测到交通标志,则在图像相应的区域画一个方框,以直观地显示检测结果。交通标志的检测可以有很多方法,比如基于形状和颜色特征进行检测,本技术方案使用方向梯度直方图特征(HOG)进行检测。该检测方法使用的是现有比较成熟的算法。The traffic sign detection unit detects the data acquired by the image acquisition unit frame by frame. If a traffic sign is detected in the image, a box is drawn in the corresponding area of the image to visually display the detection result. There are many methods for traffic sign detection, such as detection based on shape and color features. This technical solution uses the histogram of oriented gradient feature (HOG) for detection. This detection method uses an existing relatively mature algorithm.

检测到交通标志后,将交通标志对应区域内的图像提取出来,进行识别。识别本质上是一个分类的过程,即将图像进行一系列处理,得到一个结果,然后将该结果与训练结果一起分类,检查目标图像会被分到哪一类。比如,输入图像最后的处理结果与训练库中的禁止左转弯最接近,则认为输入图像是禁止左转弯标志。目前主流的识别方法主要有两类,一类是基于神经网络的,一类是基于子空间的。由于基于神经网络的方法运算开销巨大,目前的硬件条件下,不适用于实时系统。所以使用第二种方法,即基于子空间的方法。基于子空间的方法有线性的和非线性的,比较而言,线性方法速度更快,本专利技术方案使用了线性方法,线性判别分析(LDA)和局部保持投影(LPP)是其中比较经典的线性方法,LDA注重图像数据间的可分性,即能较好地保持数据的全局判别信息,LPP则更关注于数据的局部关系,能很好地保留数据的局部结构特征。在交通标志识别系统中,不仅需要判别交通标志属于哪一个大类,比如限速或者警告,而且由于存在遮挡、光照和角度等的影响,还需要在类内做进一步判别,所以,在本技术方案中结合了LDA和LPP的优势,采用了一种新的算法,称之为LDA over LPP,该算法不仅可以保留数据的全局判别信息,还能保留数据的局部结构特征。LDA over LPP算法首先构建图结构:类内图和类间图,对于有监督的分类识别问题,希望一个类别的数据能够更加地紧凑的挨在一起,而不同类别之间的数据可以相对的远离,这样不同类之间会具有较好区分性。LDA over LPP的方法就是在LDA的基础上,引入尽可能多的保留局部流形结构的思想。LDA over LPP获得类间图和类内图后,根据拉普拉斯原理,可以获得以下两个目标函数,一个类间局部函数fb,一个类内局部函数fwAfter the traffic sign is detected, the image in the area corresponding to the traffic sign is extracted for recognition. Recognition is essentially a classification process, which is to perform a series of processing on the image to obtain a result, and then classify the result together with the training result to check which category the target image will be classified into. For example, if the final processing result of the input image is the closest to the no left turn in the training library, the input image is considered to be a no left turn sign. At present, there are two main types of mainstream recognition methods, one is based on neural networks, and the other is based on subspaces. Due to the huge computational overhead of the method based on the neural network, it is not suitable for real-time systems under the current hardware conditions. So use the second method, the subspace-based method. There are linear and nonlinear methods based on subspace. In comparison, the linear method is faster. This patented technical solution uses a linear method. Linear discriminant analysis (LDA) and local preservation projection (LPP) are more classic. In the linear method, LDA pays attention to the separability of image data, that is, it can better maintain the global discriminant information of the data, while LPP pays more attention to the local relationship of the data, which can well preserve the local structural characteristics of the data. In the traffic sign recognition system, it is not only necessary to determine which category the traffic sign belongs to, such as speed limit or warning, but also to make further discrimination within the category due to the influence of occlusion, illumination and angle. Therefore, in this technology The scheme combines the advantages of LDA and LPP, and adopts a new algorithm called LDA over LPP. This algorithm can not only retain the global discriminant information of the data, but also retain the local structural characteristics of the data. The LDA over LPP algorithm first constructs graph structures: intra-class graphs and inter-class graphs. For supervised classification and recognition problems, it is hoped that the data of a category can be more compact together, while the data between different categories can be relatively far away. , so that there will be better discrimination between different classes. The method of LDA over LPP is to introduce as many ideas as possible to preserve the local manifold structure on the basis of LDA. After LDA over LPP obtains the inter-class graph and intra-class graph, according to the Laplace principle, the following two objective functions can be obtained, an inter-class local function f b and an intra-class local function f w :

LDA over LPP算法的目标是最小化类内距离,同时最大化类间距离。这样做可以使得类间更加有区分性,并且类内局部流形结构能更好的保留,更加的紧凑,从而得到一个更加符合实际情况的低维子空间映射。根据Fisher准则,LDA over LPP的目标函数为:The goal of the LDA over LPP algorithm is to minimize the intra-class distance while maximizing the inter-class distance. Doing so can make the class more distinguishable, and the local manifold structure within the class can be better preserved and more compact, so as to obtain a low-dimensional subspace mapping that is more in line with the actual situation. According to Fisher's criterion, the objective function of LDA over LPP is:

该算法可以形象化地用图2a至图2f表示,在图2a至图2f中,将两类二维数据映射到一维,可以看出,在图2a和图2b中,三个算法都有效;在图2c和图2d中,两个不同的类距离很近,由于LPP没有提供全局的判别信息,所以将两类混在了一起;在图2e和图2f中,由于LDA忽略了数据局部流形结构,所以无法识别属于同一类的两个聚簇;总体结论是LPP和LDA在某些情况下会失效,但是LDA over LPP始终能良好工作。The algorithm can be visualized in Figure 2a to Figure 2f. In Figure 2a to Figure 2f, two types of two-dimensional data are mapped to one dimension. It can be seen that in Figure 2a and Figure 2b, all three algorithms are effective ; In Figure 2c and Figure 2d, the distance between two different classes is very close, because LPP does not provide global discriminative information, so the two classes are mixed together; in Figure 2e and Figure 2f, because LDA ignores the data local flow shape structure, so two clusters belonging to the same class cannot be identified; the general conclusion is that LPP and LDA fail in some cases, but LDA over LPP always works well.

本发明所公开的方法和系统中得到映射矩阵的核心是基于图嵌入的稀疏表示方法(SRGE),该算法的要点如下:The core of obtaining the mapping matrix in the method and system disclosed by the present invention is the sparse representation method (SRGE) based on graph embedding, and the key points of the algorithm are as follows:

(1)由于训练数据是标记数据,即有监督的模式,根据图嵌入的思想,首先需要建立图结构。为了同时满足全局判别信息和局部结构特征,我们提出了一种分层的图结构,分层的核心思想是逐层分类,首先将交通标志分为警告标志、禁令标志、限速标志和指路标志等大类,然后再递归地将大类分为更小的类,比如针对限速标志,我们可根据实际情况分为限速20、限速60和限速80等子类,每一个子类当中又包含了不同光照、遮挡和角度的训练样本。图3给出了我们所建立的分层图结构的思想。该思想的形式化描述为,给定m个训练样本,表示为X={x1,x2,……xm},可将这些点分为C类,第i类包含有pi个训练图像样本,于是有(1) Since the training data is labeled data, that is, a supervised pattern, according to the idea of graph embedding, it is first necessary to establish a graph structure. In order to satisfy both global discriminant information and local structural features, we propose a layered graph structure. The core idea of layering is layer-by-layer classification. Firstly, the traffic signs are divided into warning signs, prohibition signs, speed limit signs and guidance signs. Signs and other major categories, and then recursively divide the major categories into smaller categories. For example, for speed limit signs, we can divide them into subcategories such as speed limit 20, speed limit 60 and speed limit 80 according to the actual situation. Each subcategory The class contains training samples with different lighting, occlusion and angles. Figure 3 gives the idea of the hierarchical graph structure we built. The formal description of this idea is, given m training samples, expressed as X={x 1 ,x 2 ,...x m }, these points can be divided into C categories, and the i-th category contains p i training samples image sample, so we have

如图3所示,分层图结构由类内图和类间图来构造。在本文中,使用{Gb,Wb}来表示类间图,用{Gw,Ww}来表示类内图。分层图结构的构造方法为:As shown in Figure 3, the hierarchical graph structure is constructed by intra-class graphs and inter-class graphs. In this paper, {G b ,W b } is used to represent the inter-class graph, and {G w ,W w } is used to represent the intra-class graph. The construction method of the layered graph structure is:

a)类内图:每类数据进行局部近邻链接,采用k近邻方法,根据实验效果,调整参数k的值。对于有链接的边,赋予权重,本技术方案中权重采用热核函数定义。然后每类的权重矩阵,组合起来,为类内图的权重矩阵Ww。类内图的构建如图4所示,为了描述的方便,在图中每一类只选择了一个样本示例说明,在构建k近邻图时,选取k=4。a) Intra-class graph: Local neighbor links are performed for each type of data, and the k-nearest neighbor method is used to adjust the value of the parameter k according to the experimental results. For the edges with links, weights are assigned, and in this technical solution, the weights are defined by thermokernel functions. Then the weight matrix of each class is combined to form the weight matrix W w of the intra-class graph. The construction of the intra-class graph is shown in Figure 4. For the convenience of description, only one sample example is selected for each class in the figure. When constructing the k-nearest neighbor graph, k=4 is selected.

b)类间图:由于交通标记的特殊性,即某几类信号的相似度很高,存在小类的情况,因此,根据先验知识先对信号进行分类,标记好大类的记号,即限速信号属于大类1,三角信号属于大类2,以此类推,首先对于每一小类,构建类间图,寻找一类与其他几类最近的点,进行链接,权重矩阵仍然用热核函数定义。其中热核函数的定义为如果节点i和j之间存在连接,则设置权值b) Inter-class diagram: Due to the particularity of traffic signs, that is, certain types of signals have a high similarity and there are small classes. Therefore, according to the prior knowledge, the signals are first classified and the signs of the major classes are marked, that is, The speed limit signal belongs to the major category 1, the triangular signal belongs to the major category 2, and so on. First, for each sub-category, construct an inter-class graph, find the nearest points of one category and other categories, and link them. The weight matrix is still used Kernel function definition. The thermokernel function is defined as if there is a connection between nodes i and j, then set the weight

wij=exp(-||xi-xj||22) (6),w ij = exp(-||x i -x j || 22 ) (6),

其中σ是一个自由参数,可以根据识别结果调整,否则权值为0。然后对于大类之间,选取一大类与其他大类间最近的点进行连接,赋予权重值。结合两步,获得类间的权重矩阵Wb。类间图的构建如图5a和图5b所示,其中,子图5a表示一个子集下的类间图的构建,子图5b表示整个集合里面子图的构建。where σ is a free parameter that can be adjusted according to the recognition result, otherwise the weight is 0. Then, for the major categories, select the nearest point between a major category and other major categories to connect, and assign a weight value. Combining the two steps, the weight matrix W b between classes is obtained. The construction of the inter-class graph is shown in Figure 5a and Figure 5b, wherein the sub-graph 5a represents the construction of the inter-class graph under a subset, and the sub-graph 5b represents the construction of the sub-graph in the entire set.

(2)将分层图结构应用到图嵌入框架下,可以得到如下所示的目标函数:(2) Applying the hierarchical graph structure to the graph embedding framework, the objective function can be obtained as follows:

其中:y表示低维空间矩阵,X表示采集的样本集合,Ww表示类内图的权重矩阵,Wb表示类间图的权重矩阵,Lw和Lb分别是类内图和类间图的拉普拉斯特征矩阵,定义为L=D-W,D是一个对角矩阵,Among them: y represents the low-dimensional space matrix, X represents the collected sample set, W w represents the weight matrix of the intra-class graph, W b represents the weight matrix of the inter-class graph, L w and L b are the intra-class graph and the inter-class graph respectively The Laplacian characteristic matrix, defined as L=DW, D is a diagonal matrix,

Dii=∑jwij. (8),D ii =∑ j w ij . (8),

子空间映射矩阵A可以通过求解如下的等式得到:The subspace mapping matrix A can be obtained by solving the following equation:

AXTLwXAT=λAXTLbXAT. (9)AX T L w XA T =λAX T L b XA T . (9)

假设a1,a2,……ad为求解上式得到的特征向量,λ1,λ2,……λd为对应的特征指,并且满足条件λ12<……<λd,可将映射关系表示为:Suppose a 1 , a 2 ,...a d are the eigenvectors obtained by solving the above formula, λ 1 , λ 2 ,...λ d are the corresponding eigenvectors, and satisfy the condition λ 12 <...<λ d , the mapping relationship can be expressed as:

X→y=XAT,A=[a1,a1,......ad] (10),X→y=XA T , A=[a 1 , a 1 , . . . a d ] (10),

从而得到映射矩阵A,下面的步骤为得到更好的结果,对映射矩阵A进行优化。Thus, the mapping matrix A is obtained. In order to obtain better results, the following steps optimize the mapping matrix A.

(3)由于在实际环境中,拍摄到的交通标志会受光照和遮挡等的影响,为了提高复杂环境下系统的识别效率,引入稀疏表示来优化(2)中的图嵌入。首先,目标函数初步定义为:(3) Since in the actual environment, the captured traffic signs will be affected by illumination and occlusion, etc., in order to improve the recognition efficiency of the system in complex environments, a sparse representation is introduced to optimize the graph embedding in (2). First, the objective function is initially defined as:

为了使A满足稀疏性,我们在目标函数中加入如下的正则项:In order to make A satisfy sparsity, we add the following regular term to the objective function:

min||A||2,1 (12)将(2)中的f转化为如下的等价公式:min||A|| 2 , 1 (12) Transform f in (2) into the following equivalent formula:

min yTLwy s.t.yTLby=I, (13)min y T L w y sty T L b y = I, (13)

该公式是谱图理论中现有的知识,对于本领域技术人员是公知的,其中s.t.表示subject to。This formula is the existing knowledge in the spectrum theory and is well known to those skilled in the art, wherein s.t. represents subject to.

结合公式11、公式12和公式13,得到最终的目标函数:Combining Formula 11, Formula 12 and Formula 13, the final objective function is obtained:

s.t. yTLby=I (14)st y T L b y = I (14)

其中,ω和为平衡参数,控制相乘部分的贡献大小。将L对A求导,并令导数为零,可以得到A的表达式为:Among them, ω and As a balance parameter, it controls the contribution of the multiplication part. Deriving L to A, and setting the derivative to zero, the expression of A can be obtained as:

其中,Δ是对角矩阵,其对角元素的值由公式(16)计算,其余位置元素值为0。Among them, Δ is a diagonal matrix, the values of its diagonal elements are calculated by formula (16), and the values of other position elements are 0.

将此处得到的A带入到最终目标函数L中,然后用拉格朗日法解最优化问题,优化解为如下等式前d个最小特征值对应的特征向量:Bring the A obtained here into the final objective function L, and then use the Lagrangian method to solve the optimization problem. The optimal solution is the eigenvector corresponding to the first d smallest eigenvalues of the following equation:

Γy=λLby (17)Γy=λL b y (17)

其中,in,

采用迭代的方式来解决这个优化问题,即首先固定A,求解y,然后使用得到的y去更新A,如此往复,直到A和y收敛。Iterative approach is used to solve this optimization problem, that is, first fix A, solve y, and then use the obtained y to update A, and so on until A and y converge.

通过降维,可以得到维度很小的特征矩阵,接着将这些低维度的特征输入到分类器中,根据分类器的输出结果可以确定输入图像属于哪一类,然后根据类的标签得到图像的识别结果。Through dimensionality reduction, a feature matrix with a small dimension can be obtained, and then these low-dimensional features are input into the classifier, and according to the output result of the classifier, it can be determined which category the input image belongs to, and then the recognition of the image is obtained according to the class label result.

在得到图像识别结果后,通过智能手机的App界面将结果显示给用户或者通过智能手机的扬声器播放给用户。如果识别结果不是用户期待的,那么,用户可以通过点击App的反馈结果按钮或者直接使用语音输入的方式告诉系统识别结果有误。这时,系统会将识别有误的交通标志的原始特征矩阵通过网络发送到云端服务器,云端服务器在接收到用户的反馈后,会将反馈结果重新加入到训练库中进行训练,经过训练,会得到一个新的映射矩阵,同时将这个新的映射矩阵发送给智能手机客户端,当再有交通标志出现时,将使用新的变换矩阵对其进行降维,获得低维特征,然后使用分类器进行分类,得到识别结果。After obtaining the image recognition result, the result is displayed to the user through the App interface of the smart phone or played to the user through the speaker of the smart phone. If the recognition result is not what the user expects, the user can tell the system that the recognition result is wrong by clicking the App's feedback result button or directly using voice input. At this time, the system will send the original feature matrix of the incorrectly recognized traffic sign to the cloud server through the network. After receiving the feedback from the user, the cloud server will re-add the feedback result to the training database for training. Get a new mapping matrix, and send this new mapping matrix to the smartphone client at the same time. When there is another traffic sign, it will use the new transformation matrix to reduce the dimension, obtain low-dimensional features, and then use the classifier Classify and get the recognition result.

基于本专利技术方案所叙述的方法,在Android系统下实现了手机客户端,如图6所示。Based on the method described in the technical solution of this patent, a mobile phone client is implemented under the Android system, as shown in FIG. 6 .

本发明技术方案中运行具有自我学习能力的实时交通标记识别方法的系统,包括图像采集模块、结果输出模块和数据处理模块,图像采集模块采集的数据经数据处理模块处理后通过结果输出模块显示,图像采集模块和结果输出模块采用智能移动终端,数据处理模块由智能移动终端和云端服务器完成,具体为简单快速的线性运算由智能移动终端完成,线性运算包括特征降维和分类器,训练过程由云端服务器完成,且云端服务器和智能移动终端双向通信。The system running the real-time traffic sign recognition method with self-learning ability in the technical solution of the present invention includes an image acquisition module, a result output module and a data processing module, and the data collected by the image acquisition module is displayed by the result output module after being processed by the data processing module. The image acquisition module and the result output module use smart mobile terminals, and the data processing module is completed by smart mobile terminals and cloud servers. Specifically, simple and fast linear operations are completed by smart mobile terminals. Linear operations include feature dimensionality reduction and classifiers, and the training process is performed by cloud The server is completed, and the two-way communication between the cloud server and the smart mobile terminal.

交通标志识别系统应该至少包含三个模块,即图像采集模块,数据处理模块和结果输出模块。图像采集模块通常是一个摄像头,在行车过程中,不断采集现实场景中的图像;数据处理模块包括交通标志检测子模块和交通标志识别子模块,交通标志检测子模块用于在图像采集模块中得到的图像中检测是否存在交通标志,如果存在交通标志,则由数据处理模块中的交通标志识别子模块确定该交通标志的含义;结果输出模块用于将交通标志识别模块得到的结果输出给用户,输出方式可以是文本或声音等形式,同时,结果输出模块可以接受用户对结果的反馈。The traffic sign recognition system should contain at least three modules, namely image acquisition module, data processing module and result output module. The image acquisition module is usually a camera, which continuously collects images in the real scene during driving; the data processing module includes a traffic sign detection sub-module and a traffic sign recognition sub-module, and the traffic sign detection sub-module is used to obtain Detect whether there is a traffic sign in the image, if there is a traffic sign, the meaning of the traffic sign is determined by the traffic sign recognition sub-module in the data processing module; the result output module is used to output the result obtained by the traffic sign recognition module to the user, The output method can be in the form of text or sound, and at the same time, the result output module can accept the user's feedback on the result.

在本技术方案中,将图像采集模块和结果输出模块的功能交给智能手机(也可以为具有摄像头的平板电脑、车载平台等移动设备)完成,而将数据处理模块的功能分担给智能手机和云端服务器,具体来说是将简单快速的线性运算交给智能手机完成,而将复杂耗时的训练过程交给云端服务器完成。本技术方案描述的系统和方法中选择智能手机有很多优势:1)目前主流的智能手机设备硬件性能良好,摄像头分辨率普遍高于普通网络摄像头;2)处理器具有较强的运算能力,能够实时完成一些比较简单的图像处理;3)集成了网络传输模块,可以方便地与其他设备通信;4)硬件升级换代快,软件安装部署方便,并且用户有主动升级的意识;5)便于携带,易于集成更多功能。In this technical solution, the functions of the image acquisition module and the result output module are handed over to the smart phone (also can be a mobile device such as a tablet computer with a camera, a vehicle-mounted platform) to complete, and the functions of the data processing module are distributed to the smart phone and Cloud servers, specifically, hand over simple and fast linear calculations to smartphones, while handing over complex and time-consuming training processes to cloud servers. There are many advantages in choosing a smart phone in the system and method described in this technical solution: 1) the hardware performance of the current mainstream smart phone equipment is good, and the resolution of the camera is generally higher than that of an ordinary network camera; 2) the processor has strong computing power and can Real-time completion of some relatively simple image processing; 3) Integrated network transmission module, which can easily communicate with other devices; 4) Fast hardware upgrade, easy software installation and deployment, and users have the awareness of active upgrade; 5) Easy to carry, Easy to integrate more functions.

最后应说明的是:以上所述仅为本发明的优选实施例而已,并不用于限制本发明,尽管参照前述实施例对本发明进行了详细的说明,对于本领域的技术人员来说,其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。Finally, it should be noted that: the above is only a preferred embodiment of the present invention, and is not intended to limit the present invention. Although the present invention has been described in detail with reference to the foregoing embodiments, for those skilled in the art, it still The technical solutions recorded in the foregoing embodiments may be modified, or some technical features thereof may be equivalently replaced. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.

Claims (5)

1. A real-time traffic sign recognition method with self-learning ability is characterized by comprising the steps of detecting collected image data to obtain a traffic sign image;
identifying the detected traffic sign image by adopting a dimension reduction method;
the dimension reduction method comprises the following steps: the traffic sign image is represented as a matrix X, wherein X is a high-dimensional matrix, then X is projected to a low-dimensional space through a linear mapping, the low-dimensional space matrix corresponding to X is represented as y, and then the mapping relation is as follows:
y=XAT
the method comprises the steps that A is a mapping matrix obtained through training, specifically, in an initialization stage, a sample is preset in a training library as the training library, the mapping matrix A is obtained through the preset sample, after a new traffic sign image is collected in actual application, a feature matrix representing the new traffic sign image is mapped to a low-dimensional space through A, then a classifier is used for classifying the low-dimensional mapping value and the sample mapping value to obtain which type the new traffic sign image belongs to, finally, a recognition result is obtained, if the recognition is correct, no processing is carried out, if the recognition is wrong, the new traffic sign image is sent to a cloud server, the cloud server adds the new traffic sign image into the training library, retrains to obtain a new mapping matrix A ', and after A' is obtained, the mapping matrix is transmitted to a data processing module installed on a mobile terminal through a network, replacing A with A ', namely A' becomes a new mapping matrix;
the classifier for classifying the low-dimensional mapping values and the sample mapping values by using the classifier at least comprises a nearest neighbor classifier and a support vector machine classifier;
the identification method based on dimension reduction is a graph embedding method based on sparse representation;
the graph embedding method based on sparse representation specifically comprises the following steps:
step 401: classifying the traffic sign images in the training library to obtain a layered graph structure, and respectively constructing an intra-class graph and an inter-class graph at different layers;
step 402: applying the hierarchical graph structure to a graph embedding frame to obtain the following objective functions:
wherein i and j are indexes, and the value range is from 0 to the total number of training samples, yiRepresents the mapping of the i-th training sample in the low-dimensional space, yjRepresents the mapping of the jth training sample in the low-dimensional space
Wherein: y tableA low dimensional spatial matrix is shown, X represents the set of samples collected, WwWeight matrix, W, representing intra-class diagramsbWeight matrix, L, representing an inter-class graphwAnd LbLaplace feature matrices, defined as L, for intra-class and inter-class graphs, respectivelyw=Dw-Ww,Lb=Db-Wb,DwAnd DbAre two diagonal matrices whose diagonal elements have values respectively represented by WwAnd WbSumming all elements of corresponding row to obtain the value of the element at the other position as 0, i.e. D is a diagonal matrix, Dii=∑jwij
The subspace mapping matrix A is obtained by solving the following equation:
AXTLwXAT=λAXTLbXAT
suppose a1,a2,……,adTo solve the eigenvectors obtained by the above equation, λ1,λ2,……,λdIs a corresponding characteristic value and satisfies a condition lambda12<……<λdThe mapping relationship is expressed as:
X→y=XAT,A=[a1,a1,……ad];
step 403: graph embedding in the sparse representation optimization step 402 is introduced;
specifically, first, the objective function is defined as:
in order for A to satisfy sparsity, the following regularization term is added to the objective function: min | | A | luminance2,1
Converting f in step 402 into the following equation:
min yTLwy
s.t. yTLby=I,
s.t. represents a constraint condition, combines the sparsely represented objective function and the objective function embedded in the graph 402, and adds the two to obtain a final objective function L:
s.t.yTLby=I,
s.t. represents a constraint condition, wherein I represents an identity matrix, namely the diagonal position element value of the matrix is 1, the other position element values are 0,
wherein, ω andfor the balance parameter, L is derived from a and the derivative is made zero, resulting in an expression for a:
where Δ is a diagonal matrix
Wherein i is an index, AiRepresents the sum of all the values of the elements of the ith row of matrix a added together; and (3) bringing the obtained A into a final objective function L, solving the optimization problem by using a Lagrange method, and optimizing the solution into eigenvectors corresponding to the first d minimum eigenvalues of the following formula:
y=λLby,
wherein, an iterative approach is used to solve the optimization problem, i.e. first fix a, solve y, then use the resulting y to update a, and so on until a and y converge.
2. The method of claim 1The real-time traffic sign identification method with self-learning capability is characterized in that the method for layering the detected traffic sign images to obtain a layered graph structure specifically comprises the following steps: a method of employing an intra-class graph and an inter-class graph, the intra-class graph: performing local neighbor link on each type of data, adjusting the value of a parameter k according to an experimental effect by adopting a k neighbor method, giving a weight to a linked edge, defining the weight by adopting a thermonuclear function, and combining weight matrixes of each type to form a weight matrix W of an intra-class graphw(ii) a Wherein the definition of the thermal kernel function is that if there is a connection between nodes i and j, the weight w is setij=exp(-||xi-xj||22) Otherwise, the weight value is 0,
the inter-class diagram: because of the particularity of the traffic sign, that is, the similarity of some signal is very high, and there is a situation of subclass, therefore, firstly, the signal is classified, the mark of the major class is marked, the nearest point between one class and other classes is searched for and linked, the weight matrix is defined by the thermonuclear function, then, for the major classes, the nearest point between one major class and other major classes is selected to be connected, the weight value is given, and the weight matrix W of the inter-class diagram is obtainedb
3. The method as claimed in claim 2, wherein k is 4 in the intra-class diagram.
4. A system for operating the real-time traffic sign recognition method with self-learning ability of claims 1 to 3, comprising an image acquisition module, a result output module and a data processing module, wherein the data acquired by the image acquisition module is processed by the data processing module and then displayed by the result output module, the image acquisition module and the result output module adopt intelligent mobile terminals, the data processing module is completed by the intelligent mobile terminals and a cloud server, particularly, simple and rapid linear operation is completed by the intelligent mobile terminals, the linear operation comprises a feature descent sum classifier, the training process is completed by the cloud server, and the cloud server and the intelligent mobile terminals are in two-way communication.
5. The system according to claim 4, wherein the smart mobile terminal is a smartphone having a camera.
CN201410363876.2A 2014-07-29 2014-07-29 Real-time traffic sign identification method and system with self-learning capacity Expired - Fee Related CN104134364B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410363876.2A CN104134364B (en) 2014-07-29 2014-07-29 Real-time traffic sign identification method and system with self-learning capacity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410363876.2A CN104134364B (en) 2014-07-29 2014-07-29 Real-time traffic sign identification method and system with self-learning capacity

Publications (2)

Publication Number Publication Date
CN104134364A CN104134364A (en) 2014-11-05
CN104134364B true CN104134364B (en) 2017-02-15

Family

ID=51807024

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410363876.2A Expired - Fee Related CN104134364B (en) 2014-07-29 2014-07-29 Real-time traffic sign identification method and system with self-learning capacity

Country Status (1)

Country Link
CN (1) CN104134364B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104527427A (en) * 2014-12-23 2015-04-22 广东好帮手电子科技股份有限公司 Velocity-limiting mark recognizing and vehicle-velocity control method and system based on vehicle traveling recording instrument
CN104992165A (en) * 2015-07-24 2015-10-21 天津大学 Extreme learning machine based traffic sign recognition method
EP3144918B1 (en) * 2015-09-21 2018-01-10 Urban Software Institute GmbH Computer system and method for monitoring a traffic system
CN105528754A (en) * 2015-12-28 2016-04-27 湖南师范大学 Old people information service system based on dual neural network behavior recognition model
CN107220643A (en) * 2017-04-12 2017-09-29 广东工业大学 The Traffic Sign Recognition System of deep learning model based on neurological network
CN110852304B (en) * 2019-11-22 2022-03-22 重庆大学 Hyperspectral data processing method based on deep learning method
CN111008672B (en) * 2019-12-23 2022-06-10 腾讯科技(深圳)有限公司 Sample extraction method, sample extraction device, computer-readable storage medium and computer equipment
CN111444364B (en) * 2020-03-04 2024-01-30 中国建设银行股份有限公司 Image detection method and device
CN113837144B (en) * 2021-10-25 2022-09-13 广州微林软件有限公司 Intelligent image data acquisition and processing method for refrigerator

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102012213344A1 (en) * 2012-07-30 2014-01-30 Robert Bosch Gmbh Method for driver assistance on board of motor vehicle, particularly for traffic sign recognition, involves determining direction change of motor vehicle, selecting camera image as function of direction change, and determining traffic sign
CN103500341A (en) * 2013-09-16 2014-01-08 安徽工程大学 Recognition device used for road signboard

Also Published As

Publication number Publication date
CN104134364A (en) 2014-11-05

Similar Documents

Publication Publication Date Title
CN104134364B (en) Real-time traffic sign identification method and system with self-learning capacity
CN112232293B (en) Image processing model training method, image processing method and related equipment
CN112990211B (en) Training method, image processing method and device for neural network
CN111027378B (en) Pedestrian re-identification method, device, terminal and storage medium
CN109145759B (en) Vehicle attribute identification method, device, server and storage medium
CN111460968B (en) Unmanned aerial vehicle identification and tracking method and device based on video
US20180012082A1 (en) System and method for image analysis
CN110222686B (en) Object detection method, object detection device, computer equipment and storage medium
CN113807399A (en) Neural network training method, neural network detection method and neural network detection device
CN109101602A (en) Image encrypting algorithm training method, image search method, equipment and storage medium
CN114283351A (en) Video scene segmentation method, device, equipment and computer readable storage medium
KR20190029083A (en) Apparatus and Method for learning a neural network
CN110765882A (en) Video tag determination method, device, server and storage medium
CN113688814B (en) Image recognition method and device
CN116486524A (en) Alternating-current charging electronic lock control method based on scene recognition
CN112101185B (en) Method for training wrinkle detection model, electronic equipment and storage medium
Li et al. Lightweight convolutional neural network for aircraft small target real-time detection in Airport videos in complex scenes
EP4287145A1 (en) Statistical model-based false detection removal algorithm from images
CN113052150A (en) Living body detection method, living body detection device, electronic apparatus, and computer-readable storage medium
CN117854156B (en) Training method and related device for feature extraction model
WO2025016081A1 (en) Data processing method and related apparatus, and device and storage medium
EP4283529B1 (en) Method for training an object recognition model in a computing device
CN117475253A (en) Model training method and device, electronic equipment and storage medium
CN115984723A (en) Road damage detection method, system, device, storage medium and computer equipment
KR20230090979A (en) Method and apparatus for suggestion of consultation guide information using neural networks

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170215

Termination date: 20200729