CN102855498B - Character identifying method and device - Google Patents
Character identifying method and device Download PDFInfo
- Publication number
- CN102855498B CN102855498B CN201110192827.3A CN201110192827A CN102855498B CN 102855498 B CN102855498 B CN 102855498B CN 201110192827 A CN201110192827 A CN 201110192827A CN 102855498 B CN102855498 B CN 102855498B
- Authority
- CN
- China
- Prior art keywords
- character
- tuple
- identified
- perspective
- recognized
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 43
- 239000013598 vector Substances 0.000 claims abstract description 47
- 230000009466 transformation Effects 0.000 claims abstract description 13
- 239000000284 extract Substances 0.000 claims abstract description 7
- 238000000605 extraction Methods 0.000 claims description 6
- 230000008569 process Effects 0.000 claims description 6
- 230000011218 segmentation Effects 0.000 claims description 3
- 238000006243 chemical reaction Methods 0.000 claims 5
- 230000003252 repetitive effect Effects 0.000 claims 1
- 230000001131 transforming effect Effects 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 15
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 7
- 238000004590 computer program Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000013139 quantization Methods 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012015 optical character recognition Methods 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
Landscapes
- Character Discrimination (AREA)
Abstract
本发明涉及字符识别方法和装置,其中该字符识别方法包括:利用交比从待识别字符的凸包多边形上选择有序的四个点构成的四元组;将待识别字符变换到由所选择的四元组确定的透视不变坐标系中;从变换后的待识别字符中提取特征来获得待识别字符的特征向量;在预先存储的字符类别表中查找与所获得的待识别字符的特征向量匹配的记录,向查找到的记录所对应的字符类别进行投票;对于待识别字符的凸包多边形上的不同四元组重复上述步骤预定次数;以及将获得投票票数最多的字符类别确定为字符识别结果。
The present invention relates to a character recognition method and device, wherein the character recognition method includes: using the cross ratio to select a quadruple group composed of four points in order from the convex hull polygon of the character to be recognized; transforming the character to be recognized to the selected In the perspective-invariant coordinate system determined by the quadruple group; from the characters to be recognized after the transformation, extract features to obtain the feature vector of the characters to be recognized; in the pre-stored character category table, search and obtain the characteristics of the characters to be recognized For the records matched by the vector, vote for the character category corresponding to the found record; repeat the above steps for a predetermined number of times for different quadruples on the convex hull polygon of the character to be recognized; and determine the character category with the largest number of votes as the character Recognition results.
Description
技术领域technical field
本发明涉及模式识别领域,更具体地涉及一种字符识别方法和装置。The present invention relates to the field of pattern recognition, more particularly to a character recognition method and device.
背景技术Background technique
对大透视变形下的字符进行识别是一个非常重要的课题,因为识别透视变形的字符是很多实际应用的基础,而且,透视变形的字符广泛存在于我们的日常生活中,例如,真实场景下的字符识别。Recognition of characters under large perspective deformation is a very important topic, because recognizing characters with perspective deformation is the basis of many practical applications, and characters with perspective deformation widely exist in our daily life, for example, characters in real scenes Character recognition.
为了解决这个问题,一种基本方法是首先矫正透视变形的图像到正视图像,然后对矫正后的图像进行传统的OCR(光学字符识别)识别。但是这种方法受到具体应用的限制,如要求存在轮廓和字符线条以及一些结构等。因此技术人员开始专注于对每一个单字符进行识别。To solve this problem, a basic approach is to first rectify the perspective-distorted image to an orthographic image, and then perform traditional OCR (Optical Character Recognition) recognition on the rectified image. But this method is limited by specific applications, such as requiring the existence of outlines and character lines, as well as some structures. So technicians start to focus on recognizing each single character.
几何哈希算法(Geometric Hashing(GH))是一个通用的基于模型的物体识别算法,该算法在物体经历了各种变换和只有部分信息存在时也可以适用。几何哈希算法的优点在于可以简单并行处理,以及在只存在部分信息时也可以工作的能力。因此,几何哈希算法被用于仿射变换的物体识别和三维物体识别中。Geometric Hashing (GH) is a general-purpose model-based object recognition algorithm, which is also applicable when the object has undergone various transformations and only part of the information exists. The advantages of geometric hashing algorithms are that they can be easily parallelized and the ability to work when only partial information exists. Therefore, the geometric hash algorithm is used in object recognition of affine transformation and three-dimensional object recognition.
使用仿射变换模型来近似透射模型是识别透视变形字符的一种传统方法。M.Iwamura,T.Tsuji,A.Horimatsu和K.Kise等人在2009年的ICDAR发表的题为“Realtimecamera-based recognition of characters and pictograms”的文章中改进了几何哈希算法,并提出了一种对相机拍摄字符进行识别的实时算法。该算法采用仿射模型,为了构建仿射不变坐标系,需要3个坐标点(3元组)作为坐标系的基。采用仿射不变量,即重心和面积比(area ratio)来降低3元组的自由度。然而,由于仿射模型仅在物体的大小比物体与相机之间的距离足够小的条件下,即透视变形较小时,才能被认为是透视模型的近似,而当透视变形较大时,上述近似不再成立。所以,为了识别大透视变形下的字符,需要研发新的方法。Approximating the transmission model using an affine transformation model is a traditional method for recognizing perspective deformed characters. M.Iwamura, T.Tsuji, A.Horimatsu and K.Kise et al. improved the geometric hash algorithm in the article entitled "Realtime camera-based recognition of characters and pictograms" published by ICDAR in 2009, and proposed a A real-time algorithm for recognizing characters captured by a camera. The algorithm uses an affine model, and in order to construct an affine invariant coordinate system, three coordinate points (3-tuples) are needed as the basis of the coordinate system. Affine invariants, ie, center of gravity and area ratio, are used to reduce the degrees of freedom of 3-tuples. However, since the affine model can only be considered as an approximation of the perspective model when the size of the object is sufficiently smaller than the distance between the object and the camera, that is, the perspective distortion is small, and when the perspective distortion is large, the above approximation no longer holds true. Therefore, in order to recognize characters under large perspective deformation, new methods need to be developed.
另一种识别透视变形字符的现有方法是对于每个字符形成交比谱(cross ratiospectrum),通过比较当前字符的交比谱和模板字符的交比谱来识别字符(参见Linlin Li和Chew Lim Tan在2008年的ICPR发表的题为“Character recognition under severeperspective distortion”的文章)。这种方法的缺陷是字符识别所需的时间随着待识别字符的类别数量的增长而线性增长。因此这种方法在待识别字符的类别较多的应用中的使用是受到限制的。Another existing method for identifying perspective deformed characters is to form a cross ratio spectrum for each character, and identify characters by comparing the cross ratio spectrum of the current character with the cross ratio spectrum of the template character (see Linlin Li and Chew Lim Tan published an article entitled "Character recognition under severe perspective distortion" in ICPR in 2008). The disadvantage of this method is that the time required for character recognition increases linearly with the number of categories of characters to be recognized. Therefore, the use of this method in applications with many types of characters to be recognized is limited.
发明内容Contents of the invention
根据本发明的第一方面,提供了一种字符识别方法,包括:利用交比从待识别字符的凸包多边形上选择有序的四个点构成的四元组;将待识别字符变换到由所选择的四元组确定的透视不变坐标系中;从变换后的待识别字符中提取特征来获得待识别字符的特征向量;在预先存储的字符类别表中查找与所获得的待识别字符的特征向量匹配的记录,向查找到的记录所对应的字符类别进行投票;对于待识别字符的凸包多边形上的不同四元组重复上述步骤预定次数;以及将获得投票票数最多的字符类别确定为字符识别结果。According to the first aspect of the present invention, a kind of character recognition method is provided, comprising: utilize the intersection ratio to select the quaternion of ordered four points from the convex hull polygon of character to be recognized; In the perspective-invariant coordinate system determined by the selected quadruple; extract features from the converted character to be recognized to obtain the feature vector of the character to be recognized; search and obtain the character to be recognized in the pre-stored character category table For the records matched by the eigenvector of the found record, vote for the character category corresponding to the found record; repeat the above steps for a predetermined number of times for different quadruples on the convex hull polygon of the character to be recognized; and determine the character category that will obtain the largest number of votes is the character recognition result.
根据本发明的第二方面,提供了一种字符识别装置,包括:选择单元,被配置用于利用交比从待识别字符的凸包多边形上选择有序的四个点构成的四元组;变换单元,被配置用于将待识别字符变换到由所选择的四元组确定的透视不变坐标系中;提取单元,被配置用于从变换后的待识别字符中提取特征来获得待识别字符的特征向量;投票单元,被配置用于在预先存储的字符类别表中查找与所获得的待识别字符的特征向量匹配的记录,向查找到的记录所对应的字符类别进行投票;重复控制单元,被配置用于对于待识别字符的凸包多边形上的不同四元组重复上述步骤预定次数;以及确定单元,被配置用于将获得投票票数最多的字符类别确定为字符识别结果。According to a second aspect of the present invention, there is provided a character recognition device, comprising: a selection unit configured to select a quadruple composed of ordered four points from the convex hull polygon of the character to be recognized by using the cross ratio; The transformation unit is configured to transform the character to be recognized into the perspective invariant coordinate system determined by the selected quadruple; the extraction unit is configured to extract features from the transformed character to be recognized to obtain the character to be recognized The feature vector of the character; the voting unit is configured to search the record matched with the obtained character vector of the character to be recognized in the pre-stored character category table, and vote for the character category corresponding to the found record; repeat control A unit configured to repeat the above steps a predetermined number of times for different quadruples on the convex hull polygon of the character to be recognized; and a determining unit configured to determine the character category that has received the most votes as the character recognition result.
另外,本发明的实施例还提供了用于实现上述方法的计算机程序。In addition, embodiments of the present invention also provide computer programs for implementing the above methods.
此外,本发明的实施例还提供了至少计算机可读介质形式的计算机程序产品,其上记录有用于实现上述方法的计算机程序代码。In addition, embodiments of the present invention also provide at least a computer program product in the form of a computer-readable medium, on which computer program codes for implementing the above method are recorded.
通过本发明,可以识别大透视变形下的字符,与传统使用仿射模型的方法相比,本发明的识别率更高。Through the present invention, characters under large perspective deformation can be recognized, and compared with the traditional method using an affine model, the recognition rate of the present invention is higher.
另外,本发明的方法,在部分信息缺失时仍可以进行识别。In addition, the method of the present invention can still perform identification when some information is missing.
本发明的方法还可以区分识别不同字体的字符。The method of the present invention can also distinguish and recognize characters of different fonts.
通过以下结合附图对本发明的最佳实施例的详细说明,本发明的这些以及其它的优点将更加明显。These and other advantages of the present invention will become more apparent through the following detailed description of the preferred embodiments of the present invention with reference to the accompanying drawings.
附图说明Description of drawings
参照下面结合附图对本发明实施例的说明,会更加容易地理解本发明的以上和其它目的、特点和优点。附图中的部件只是为了示出本发明的原理。在附图中,相同的或类似的技术特征或部件将采用相同或类似的附图标记来表示。附图中:The above and other objects, features and advantages of the present invention will be more easily understood with reference to the following description of the embodiments of the present invention in conjunction with the accompanying drawings. The components in the drawings are only to illustrate the principles of the invention. In the drawings, the same or similar technical features or components will be denoted by the same or similar reference numerals. In the attached picture:
图1示出了根据本发明实施例的用于识别字符的方法的流程图;Fig. 1 shows a flow chart of a method for recognizing characters according to an embodiment of the present invention;
图2A示出了字符‘H’的凸包多边形和内部轮廓;Figure 2A shows the convex hull polygon and inner contour of the character 'H';
图2B示出了字符‘H’的凸包多边形上的两点之间的线段与内部轮廓的交点的示图;Figure 2B shows a diagram of the intersection of the line segment between two points on the convex hull polygon of the character 'H' and the inner contour;
图2C示出了发生了透视变形的字符‘H’的示图;Figure 2C shows a diagram of the character 'H' with perspective distortion;
图2D示出了字符‘H’的凸包多边形上的两点之间的线段与字符的内部轮廓上的锯齿相交的示图;Fig. 2D shows a diagram in which the line segment between two points on the convex hull polygon of the character 'H' intersects with the sawtooth on the inner contour of the character;
图3A示出了图2B和2C中的字符‘H’变换到透视不变坐标系下的示图;Fig. 3 A shows the figure that character ' H' in Fig. 2B and 2C is transformed under the perspective invariant coordinate system;
图3B示出了选择的无效的四元组变换到透视不变坐标系下的示图;FIG. 3B shows a diagram of selected invalid quadruples transformed into a perspective-invariant coordinate system;
图4A示出了将透视不变坐标系分割成4*4个方块的示图;FIG. 4A shows a diagram of dividing the perspective invariant coordinate system into 4*4 squares;
图4B示出了归一化的直方图;Figure 4B shows a normalized histogram;
图4C示出了哈希表中的一条记录的示图;Figure 4C shows a diagram of a record in the hash table;
图5A-5C分别示出了大透视变形下的、损坏的和缺失的字符的示图;Figures 5A-5C show views of damaged and missing characters, respectively, under large perspective distortion;
图6示出了根据本发明实施例的用于识别字符的装置的组成框图;FIG. 6 shows a block diagram of a device for recognizing characters according to an embodiment of the present invention;
图7示出了识别字符装置中的提取单元的组成框图;以及Fig. 7 shows the composition block diagram of the extracting unit in the character recognition device; And
图8示出了可用于实施根据本发明实施例的方法和装置的计算机的示意性框图。Fig. 8 shows a schematic block diagram of a computer that can be used to implement the method and device according to the embodiments of the present invention.
具体实施方式detailed description
下面参照附图来说明本发明的实施例。在本发明的一个附图或一种实施方式中描述的元素和特征可以与一个或更多个其它附图或实施方式中示出的元素和特征相结合。应当注意,为了清楚的目的,附图和说明中省略了与本发明无关的、本领域普通技术人员已知的部件和处理的表示和描述。Embodiments of the present invention will be described below with reference to the drawings. Elements and features described in one drawing or one embodiment of the present invention may be combined with elements and features shown in one or more other drawings or embodiments. It should be noted that representation and description of components and processes that are not related to the present invention and known to those of ordinary skill in the art are omitted from the drawings and descriptions for the purpose of clarity.
图1示出了根据本发明的实施例的用于识别字符的方法的流程图。Fig. 1 shows a flowchart of a method for recognizing characters according to an embodiment of the present invention.
首先,在步骤S102中,利用交比从待识别字符的凸包多边形上选择有序的四个点构成的四元组。First, in step S102, an orderly four-tuple group composed of four points is selected from the convex hull polygon of the character to be recognized by using the intersection ratio.
关于交比(Cross Radio)的基本概念如下:如果P0,A,B,P2四点共线,则交比Cr定义如下:The basic concept of Cross Radio is as follows: if P 0 , A, B, and P 2 are collinear, then the Cross Radio is defined as follows:
其中,D()表示两点之间的距离。Among them, D() represents the distance between two points.
本领域技术人员可知,交比是透视变形中的不变量,即,在任何透视变形下,交比Cr(P0,A,B,P2)保持不变。Those skilled in the art know that the cross ratio is an invariant variable in perspective deformation, that is, under any perspective deformation, the cross ratio Cr(P 0 , A, B, P 2 ) remains unchanged.
下面将参照图2A-2D,以字符‘H’为例,详细说明如何在待识别字符的凸包多边形上选择有序的四个点。Referring to Figures 2A-2D below, taking the character 'H' as an example, how to select four orderly points on the convex hull polygon of the character to be recognized will be described in detail.
在本发明中,凸包多边形的定义如下:凸包多边形是一个多边形,该多边形的顶点是字符的凸包。例如,在图2A中,字符‘H’的凸包多边形由实线表示,而字符‘H’的内部轮廓由虚线表示。对于字符‘H’的凸包多边形上的两点以及该两点之间的线段与字符‘H’的内部轮廓的两个交点计算的交比值在透视变形下保持不变,其中,当交点多于两个时,选用前两个交点来计算交比值。例如,在图2B中,P0,P1,P2和P3为字符‘H’的凸包多边形上的点,A和B为线段P0P2与字符‘H’的内部轮廓的前两个交点,C和D为线段P1P3与字符‘H’的内部轮廓的前两个交点。选定字符‘H’的凸包多边形上的P0点,可以确定凸包多边形上的P2点,使得P0,A,B,P2点的交比值Cr(P0,A,B,P2)最接近一个预定值。In the present invention, a convex-hull polygon is defined as follows: a convex-hull polygon is a polygon whose vertices are the convex hull of a character. For example, in FIG. 2A, the convex hull polygon of the character 'H' is represented by a solid line, while the inner contour of the character 'H' is represented by a dashed line. For the two points on the convex hull polygon of the character 'H' and the two intersection points of the line segment between the two points and the inner contour of the character 'H', the intersection ratio value remains unchanged under the perspective deformation, wherein, when the intersection point is more When there are two, the first two intersection points are selected to calculate the cross ratio value. For example, in Figure 2B, P 0 , P 1 , P 2 and P 3 are the points on the convex hull polygon of the character 'H', A and B are the front of the line segment P 0 P 2 and the inner contour of the character 'H' Two intersection points, C and D are the first two intersection points of the line segment P 1 P 3 and the inner contour of the character 'H'. Point P 0 on the convex-hull polygon of the selected character 'H' can determine P 2 points on the convex-hull polygon, so that P 0 , A, B, and the intersection ratio Cr (P 0 , A, B, P 2 ) is closest to a predetermined value.
由于字符的内部轮廓可能出现锯齿现象,因而凸包多边形上的两点之间的线段与字符的内部轮廓的交点可能是与字符的内部轮廓上的锯齿的交点,例如图2D中所示的B1和B2。在这种情况下,由于B1和B2之间的距离很短,从而计算出的交比值很大。为了避免出现与字符的内部轮廓上的锯齿相交的情况,选点准则不能选用太大的交比值。根据发明人在实践中的观察,一般来说,交比值应该在(1,2]的范围内,而采用接近1.5的交比值可以获得较好的效果。因此在实践中,交比值的预定值通常选取1.5。Since the inner contour of the character may appear jagged, the intersection of the line segment between two points on the convex hull polygon and the inner contour of the character may be the intersection with the sawtooth on the inner contour of the character, such as B shown in Figure 2D 1 and B2 . In this case, since the distance between B1 and B2 is very short, the calculated cross ratio value is large. In order to avoid intersecting with the sawtooth on the inner outline of the character, the point selection criterion cannot choose a too large cross ratio value. According to the inventor's observations in practice, generally speaking, the cross ratio value should be in the scope of (1, 2], and adopting a cross ratio value close to 1.5 can obtain better results. Therefore in practice, the predetermined value of the cross ratio value Usually choose 1.5.
如上所述,任意选定P0点,可以确定P2点,使得P0,A,B,P2满足|1.5-Cr(P0,A,B,P2)|最小;再选定P1点,其中,P0、P1、P2满足逆时针排列,可以确定P3点,使得P1,C,D,P3满足|1.5-Cr(P1,C,D,P3)|最小,这样就完成了从待识别字符的凸包多边形上选择有序的四个点。本领域技术人员可以理解,也可以使P0、P1、P2满足顺时针排列。As mentioned above, point P 0 can be selected arbitrarily, and point P 2 can be determined so that P 0 , A, B, P 2 satisfy |1.5-Cr(P 0 , A, B, P 2 )|minimum; then select P 1 point, where P 0 , P 1 , and P 2 are arranged counterclockwise, and P 3 points can be determined such that P 1 , C, D, and P 3 satisfy |1.5-Cr(P 1 , C, D, P 3 ) |Minimum, this completes the selection of ordered four points from the convex hull polygon of the character to be recognized. Those skilled in the art can understand that P 0 , P 1 , and P 2 can also be arranged clockwise.
图2C是图2B中的字符‘H’发生了透视变形的示图,其中的点P0′,P1′,P2′,P3′分别对应于图2B中的点P0,P1,P2和P3,A′,B′和C′,D′分别是线段P0′P2′和线段P1′P3′与发生透视变形的‘H’的内部轮廓的前两个交点,所以A′,B′,C′,D′分别对应于图2B中的A,B,C,D。由于交比是透视不变量,因此图2C中发生透视变形的‘H’上的点P0′,A′,B′,P2′的交比Cr(P0′,A′,B′,P2′)以及点P1′,C′,D′,P3′的交比Cr(P1′,C′,D′,P3′)保持不变。也就是说:Fig. 2C is a diagram showing perspective deformation of the character 'H' in Fig. 2B, where points P 0 ′, P 1 ′, P 2 ′, and P 3 ′ correspond to points P 0 and P 1 in Fig. 2B respectively , P 2 and P 3 , A', B' and C', D' are line segment P 0 ′P 2 ′ and line segment P 1 ′P 3 ′ and the first two internal contours of 'H' with perspective deformation intersection, so A', B', C', D' correspond to A, B, C, D in Fig. 2B, respectively. Since the cross ratio is a perspective invariant, the cross ratio Cr (P 0 ′ , A ′, B′, P 2 ′) and the cross ratio Cr(P 1 ′, C′, D′, P 3 ′) of the points P 1 ′, C′, D′, P 3 ′ remain unchanged. That is to say:
Cr(P0′,A′,B′,P2′)=Cr(P0,A,B,P2);Cr(P 0 ', A', B', P 2 ') = Cr(P 0 , A, B, P 2 );
Cr(P1′,C′,D′,P3′)=Cr(P1,C,D,P3)。Cr(P 1 ', C', D', P 3 ') = Cr(P 1 , C, D, P 3 ).
按照上述方式,给定了P0(P0′)点,就可以确定P2(P2′)点,无论发生什么样的透视变换;同样,给定P1(P1′)点,就可以确定P3(P3′)点。通过这样的方式选择的四元组,只有两个自由度,因而可以降低运算量。According to the above method, given the point P 0 (P 0 ′), the point P 2 (P 2 ′) can be determined, no matter what kind of perspective transformation occurs; similarly, given the point P 1 (P 1 ′), the The P 3 (P 3 ′) point can be determined. The quadruples selected in this way have only two degrees of freedom, thus reducing the amount of computation.
接下来,在步骤S104中,将待识别字符变换到由所选择的四元组确定的透视不变坐标系中。Next, in step S104, the character to be recognized is transformed into the perspective invariant coordinate system determined by the selected quadruple.
在本发明中,将透视不变坐标系限定为一个正方形,正方形的边长为L,中心为(xc,yc)。在选定图2B中的字符‘H’上的四元组P0,P1,P2,P3的情况下,字符‘H’变换到透视不变坐标系下的示图如图3A所示。图2B中的四个点P0,P1,P2,P3分别映射到图3A中的透视不变坐标系中的I0(xc-l,yc+l),I1(xc-l,yc-l),I2(xc+l,yc-l),I3(xc+l,yc+l),其中,l=α*L/2,α∈[0,1],即,I0,I1,I2和I3落入正方形以内。由于图2C是图2B中的字符‘H’发生了透视变形,在选定对应四元组P0′,P1′,P2′,P3′的情况下,图2C中的字符‘H’变换到透视不变坐标系下的示图也如图3A所示,并且图2C中的点P0′,P1′,P2′,P3′也同样分别映射到图3A中的透视不变坐标系中的I0,I1,I2和I3。In the present invention, the perspective invariant coordinate system is defined as a square, the side length of the square is L, and the center is (xc, yc). In the case of selecting the quaternion P 0 , P 1 , P 2 , P 3 on the character 'H' in Figure 2B, the diagram of character 'H' transformed to the perspective invariant coordinate system is shown in Figure 3A Show. The four points P 0 , P 1 , P 2 , and P 3 in Fig. 2B are respectively mapped to I 0 (xc-l, yc+l), I 1 (xc-l , yc-l), I 2 (xc+l, yc-l), I 3 (xc+l, yc+l), where, l=α*L/2, α∈[0,1], that is, I 0 , I 1 , I 2 and I 3 fall within the square. Since Figure 2C is the perspective deformation of the character 'H' in Figure 2B, in the case of selecting the corresponding quadruple P 0 ′, P 1 ′, P 2 ′, P 3 ′, the character 'H in Figure 2C The diagram transformed into the perspective-invariant coordinate system is also shown in Figure 3A, and the points P 0 ′, P 1 ′, P 2 ′, and P 3 ′ in Figure 2C are also mapped to the perspective in Figure 3A respectively I 0 , I 1 , I 2 and I 3 in the invariant coordinate system.
P0,P1,P2,P3(P0′,P1′,P2′,P3′)与I0,I1,I2和I3这四个对应对可以确定唯一的透视变换矩阵,根据该透视变换矩阵,可以将待识别字符变换到透视不变坐标系下。The four corresponding pairs of P 0 , P 1 , P 2 , P 3 (P 0 ′, P 1 ′, P 2 ′, P 3 ′) and I 0 , I 1 , I 2 and I 3 can determine a unique perspective Transformation matrix, according to the perspective transformation matrix, the character to be recognized can be transformed into a perspective invariant coordinate system.
确定透视变换矩阵的方法已经公知,相关文献有Multiple View Geometry inComputer Vision.Richard Hartley and Andrew Zisserman,Cambridge UniversityPress,2004,在此不做赘述。The method of determining the perspective transformation matrix is already known, and the relevant literature includes Multiple View Geometry in Computer Vision. Richard Hartley and Andrew Zisserman, Cambridge University Press, 2004, and will not be repeated here.
优选地,在将待识别字符变换到透视不变坐标系之后,可以根据待识别字符的凸包多边形上的点被变换到透视不变坐标系下的比例来确定所选择的四元组的有效性。当凸包多边形上的点被变换到透视不变坐标系下的比例小于预定比例时,可以确定所选择的四元组无效。Preferably, after transforming the character to be recognized into the perspective invariant coordinate system, the effective ratio of the selected quadruple can be determined according to the ratio of points on the convex hull polygon of the character to be recognized transformed into the perspective invariant coordinate system sex. When the ratio of points on the convex hull polygon transformed into the perspective invariant coordinate system is smaller than a predetermined ratio, it may be determined that the selected quadruple is invalid.
在一个示例中,如果待识别字符的凸包多边形上的点被变换到该透视不变坐标系下的比例多于或等于90%,则认为选取的四元组是有效的。In one example, if the proportion of points on the convex hull polygon of the character to be recognized transformed into the perspective invariant coordinate system is greater than or equal to 90%, the selected quadruple is considered valid.
如果待识别字符的凸包多边形上的点被变换到该透视不变坐标系下的比例小于90%,则可以确定所选择的四元组无效,舍弃该四元组,重新进行选择。If the proportion of the points on the convex hull polygon of the character to be recognized transformed into the perspective invariant coordinate system is less than 90%, it can be determined that the selected quadruple is invalid, discard the quadruple, and re-select.
本领域技术人员可以理解,上述预定比例也可以选取90%以外的其它值。Those skilled in the art can understand that the aforementioned predetermined ratio can also be other values than 90%.
图3B示出了待识别字符的凸包多边形上的点被变换到透视不变坐标系下的比例小于90%的情况,在这种情况下,所选取的四元组是无效的,在识别过程中要被舍弃。Fig. 3B shows the situation that the point on the convex hull polygon of the character to be recognized is transformed into the situation that the ratio under the perspective invariant coordinate system is less than 90%. In this case, the selected quadruple group is invalid. be discarded in the process.
接下来,在步骤S106中,从变换后的待识别字符中提取特征来获得待识别字符的特征向量。Next, in step S106, features are extracted from the converted character to be recognized to obtain a feature vector of the character to be recognized.
优选地,可以将透视不变坐标系分割成多个子区域,根据待识别字符在各个子区域中的像素数目来构建直方图,将该直方图作为特征向量。例如,可以将透视不变坐标系分割成m×m个方块,其中m是大于1的整数。这样,可以得到一个横坐标是子区域的序号、纵坐标是子区域内所包含的字符像素的数目的直方图,将该直方图作为特征向量fcurrent。进一步优选地,可以对该直方图进行归一化,以将归一化后的直方图作为特征向量fcurrent。本领域技术人员可以理解,也可以提取边缘方向、梯度信息等特征来形成特征向量。Preferably, the perspective-invariant coordinate system may be divided into multiple sub-regions, a histogram is constructed according to the number of pixels of the character to be recognized in each sub-region, and the histogram is used as a feature vector. For example, the perspective-invariant coordinate system may be divided into m×m squares, where m is an integer greater than 1. In this way, a histogram in which the abscissa is the serial number of the sub-region and the ordinate is the number of character pixels contained in the sub-region can be obtained, and the histogram is used as the feature vector fcurrent. Further preferably, the histogram can be normalized, so that the normalized histogram can be used as the feature vector fcurrent. Those skilled in the art can understand that features such as edge direction and gradient information can also be extracted to form feature vectors.
在步骤S108中,在预先存储的字符类别表中查找与所获得的待识别字符的特征向量匹配的记录,向查找到的记录所对应的字符类别进行投票。In step S108, a record matching the obtained feature vector of the character to be recognized is searched in the pre-stored character category table, and a vote is performed for the character category corresponding to the found record.
具体地说,字符类别表中的每条记录中包含字符类别和对应的特征向量。其中,特征向量可以是直方图、归一化的直方图等形式。Specifically, each record in the character category table contains a character category and a corresponding feature vector. Wherein, the feature vector may be in the form of a histogram, a normalized histogram, or the like.
在一个示例中,字符类别表可以是哈希表,其中哈希表中的每条记录还包含对其中的特征向量进行哈希处理而得到的索引值。In an example, the character category table may be a hash table, wherein each record in the hash table further includes an index value obtained by hashing the feature vector therein.
在该示例中,对特征向量进行哈希处理,例如,进行均匀二值量化,并将量化的特征向量转换成一个索引值bin。本领域技术人员可以理解,也可以对特征向量进行三级量化等多级量化,并将量化的特征向量转换成索引值bin。In this example, hash processing is performed on the feature vector, for example, uniform binary quantization is performed, and the quantized feature vector is converted into an index value bin. Those skilled in the art can understand that multi-level quantization such as three-level quantization can also be performed on the feature vectors, and the quantized feature vectors can be converted into index value bins.
根据获得的bin值,在预先存储的字符类别表(如哈希表)中查找到相应的字符类别d和特征向量fstored。According to the obtained bin value, the corresponding character category d and feature vector fstored are found in a pre-stored character category table (such as a hash table).
比较待识别字符的特征向量fcurrent与字符类别表中存储的特征向量fstored之间的欧氏距离,如果‖fcurrent-fstored‖小于一个较小的预定值,则给字符类别d投一票。Compare the Euclidean distance between the feature vector fcurrent of the character to be recognized and the feature vector fstored stored in the character category table, if ‖fcurrent-fstored‖ is smaller than a small predetermined value, vote for the character category d.
如果对于相同的bin值有多票投给同一字符类别d,则忽略从该bin值投给同一字符类别d的其他票数,即,只算一票。If there are multiple votes for the same character category d for the same bin value, the other votes for the same character category d from the bin value are ignored, ie, only one vote is counted.
图4A示出了以将透视不变坐标系分割成4*4个方块为例来得到特征向量。图4B示出了对每个方块中的字符像素数进行归一化得到的作为特征向量的直方图。将特征向量均匀二值量化,并将量化的特征向量转换成索引值bin,也就是图4C中所示的哈希表中的40944。根据该值,可以查找到哈希表中的字符类别d和相应的特征向量。FIG. 4A shows that the feature vector is obtained by dividing the perspective invariant coordinate system into 4*4 squares as an example. FIG. 4B shows a histogram obtained by normalizing the number of character pixels in each square as a feature vector. The eigenvectors are uniformly binary quantized, and the quantized eigenvectors are converted into index value bins, that is, 40944 in the hash table shown in FIG. 4C . According to this value, the character category d and the corresponding feature vector in the hash table can be found.
对于步骤S108中的字符类别表,可以在进行字符识别之前在学习阶段针对用于学习的每个模板字符通过以下方式来构建上述字符类别表:首先,要提取模板字符的特征向量,包括:利用交比从模板字符的凸包多边形中选择有序的四个点构成的四元组;将模板字符变换到由所选择的四元组确定的透视不变坐标系中;从变换后的模板字符中提取特征来获得模板字符的特征向量。提取模板字符的特征向量的步骤与上文中获得待识别字符的特征向量的步骤类似,在此不做赘述。For the character category table in step S108, the above-mentioned character category table can be constructed in the following manner for each template character used for learning in the learning phase before character recognition: first, the feature vector of the template character will be extracted, including: using Cross-ratio selects a quadruple of ordered four points from the convex hull polygon of the template character; transforms the template character into the perspective invariant coordinate system determined by the selected quadruple; from the transformed template character Extract features to obtain the feature vector of the template character. The steps of extracting the feature vectors of the template characters are similar to the steps of obtaining the feature vectors of the characters to be recognized above, and will not be repeated here.
需要说明的是,在将模板字符变换到透视不变坐标系之后,要根据模板字符的凸包多边形上的点被变换到透视不变坐标系下的比例来确定所选择的四元组的有效性。如果所选择的四元组无效,则舍弃该四元组,重新进行选择。这里确定所选择的四元组的有效性的方法与以上针对待识别字符确定所选择的四元组的有效性的方法类似,在此不做赘述。It should be noted that after the template character is transformed into the perspective-invariant coordinate system, the effective value of the selected quadruple should be determined according to the ratio of points on the convex hull polygon of the template character transformed into the perspective-invariant coordinate system. sex. If the selected quadruple is invalid, discard the quadruple and re-select. The method for determining the validity of the selected quadruple here is similar to the above method for determining the validity of the selected quadruple for the character to be recognized, and will not be repeated here.
在提取了模板字符的特征向量之后,将模板字符的字符类别和所获得的特征向量作为一条记录存放在字符类别表中。然后,重复从模板字符的凸包多边形中选择有序的四个点构成的四元组直到得到字符类别表中的一条记录的多个步骤,直到遍历模板字符的凸包多边形上的所有四元组。从而构建了包括每个模板字符的字符类别表。After the feature vector of the template character is extracted, the character category of the template character and the obtained feature vector are stored as a record in the character category table. Then, repeat the multiple steps of selecting ordered quadruples of four points from the convex hull polygon of the template character until a record in the character category table is obtained, until all quadruples on the convex hull polygon of the template character are traversed Group. Thus, a character category table including each template character is constructed.
在此需要注意的是,虽然在上面描述了怎样生成字符类别表,但是本领域的技术人员应当理解,对于根据本发明实施例的用于识别字符的方法来说,只需要预先存储了一个如上所述的字符类别表即可,而无需关心字符类别表是怎样生成的。It should be noted here that although the above describes how to generate the character category table, those skilled in the art should understand that for the method for recognizing characters according to the embodiment of the present invention, it is only necessary to pre-store one of the above The above character category table is sufficient, and there is no need to care about how the character category table is generated.
接下来,在步骤S110中,对于待识别字符的凸包多边形上的不同四元组重复上述步骤预定次数。这里,预定次数可以是预先设定的次数,也可以是在凸包多边形上的点中每隔一个点或每隔两个点进行选点,来执行上述步骤S102-S108,即,完成一次投票。Next, in step S110, the above steps are repeated for a predetermined number of times for different quadruples on the convex hull polygon of the character to be recognized. Here, the predetermined number of times can be a preset number of times, or it can be selected every other point or every two points among the points on the convex hull polygon to perform the above steps S102-S108, that is, to complete a vote .
最后,在步骤S112中,将获得投票票数最多的字符类别确定为字符识别结果。Finally, in step S112, the character category with the most votes is determined as the character recognition result.
上述字符类别也可以广义的使用,例如,在多种字体的情况下,字体A-Arial和A-Calibri可以算作一类,也可以算作两类来区分识别。The above character categories can also be used in a broad sense. For example, in the case of multiple fonts, the fonts A-Arial and A-Calibri can be counted as one type, or can be counted as two types to distinguish and recognize.
根据本发明的字符识别方法,在图5A所示的大透视变形下的字符,以及图5B和5C所示的字符有一些损坏或缺失的情况下,都可以进行字符识别。According to the character recognition method of the present invention, character recognition can be performed even when the characters under the large perspective deformation shown in FIG. 5A and the characters shown in FIGS. 5B and 5C are damaged or missing.
图6示出了根据本发明实施例的用于识别字符的装置的框图。字符识别装置600包括选择单元602、变换单元604、提取单元606、投票单元608、重复控制单元610和确定单元612。选择单元602,被配置用于利用交比从待识别字符的凸包多边形上选择有序的四个点构成的四元组。变换单元604,被配置用于将待识别字符变换到由所选择的四元组确定的透视不变坐标系中。提取单元606,被配置用于从变换后的待识别字符中提取特征来获得待识别字符的特征向量。投票单元608,被配置用于在预先存储的字符类别表中查找与所获得的待识别字符的特征向量匹配的记录,向查找到的记录所对应的字符类别进行投票。重复控制单元610,被配置用于对于待识别字符的凸包多边形上的不同四元组重复上述步骤预定次数。确定单元612,被配置用于将获得投票票数最多的字符类别确定为字符识别结果。Fig. 6 shows a block diagram of an apparatus for character recognition according to an embodiment of the present invention. The character recognition device 600 includes a selection unit 602 , a transformation unit 604 , an extraction unit 606 , a voting unit 608 , a repetition control unit 610 and a determination unit 612 . The selection unit 602 is configured to select an orderly quadruple composed of four points from the convex hull polygon of the character to be recognized by using the cross ratio. The transformation unit 604 is configured to transform the character to be recognized into a perspective-invariant coordinate system determined by the selected quadruple. The extraction unit 606 is configured to extract features from the transformed character to be recognized to obtain a feature vector of the character to be recognized. The voting unit 608 is configured to search a pre-stored character category table for a record matching the obtained feature vector of the character to be recognized, and vote for the character category corresponding to the found record. The repetition control unit 610 is configured to repeat the above steps for a predetermined number of times for different quadruples on the convex hull polygon of the character to be recognized. The determining unit 612 is configured to determine the character category with the most votes as the character recognition result.
类似地,字符类别表可以预先存储在用于识别字符的装置中,也可以在进行字符识别之前在学习阶段针对用于学习的每个模板字符来构建字符类别表。这里构建字符类别表的方式,与以上参照图1至图4描述的本发明的字符识别方法实施例中构建字符类别表的方式类似,在此不再赘述。Similarly, the character category table may be pre-stored in the device for character recognition, or a character category table may be constructed for each template character used for learning in the learning phase before character recognition. The manner of constructing the character category table here is similar to the manner of constructing the character category table in the embodiment of the character recognition method of the present invention described above with reference to FIGS. 1 to 4 , and will not be repeated here.
可选地,字符识别装置600还包括:判断单元(未示出),被配置用于根据待识别字符的凸包多边形上的点被变换到透视不变坐标系下的比例是否小于预定比例来判断所选择的四元组是否无效,如果确定所选择的四元组无效,则舍弃该四元组。Optionally, the character recognition device 600 further includes: a judging unit (not shown), configured to determine whether a point on the convex hull polygon of the character to be recognized is transformed into a perspective-invariant coordinate system according to a ratio smaller than a predetermined ratio. It is judged whether the selected quadruple is invalid, and if it is determined that the selected quadruple is invalid, the quadruple is discarded.
可选地,字符类别表采用哈希表,其中哈希表中的每条记录还包含对其中的特征向量进行哈希处理而得到的索引值。Optionally, the character category table is a hash table, where each record in the hash table also includes an index value obtained by hashing the feature vector therein.
可选地,提取单元606包括分割子单元6062和直方图构建子单元6064,分割子单元6062被配置用于将透视不变坐标系分割成多个子区域,直方图构建子单元6064被配置用于根据待识别字符在各个子区域中的像素数目来构建直方图,直方图被用作为特征向量。Optionally, the extraction unit 606 includes a segmentation subunit 6062 and a histogram construction subunit 6064, the segmentation subunit 6062 is configured to divide the perspective invariant coordinate system into multiple sub-regions, and the histogram construction subunit 6064 is configured to A histogram is constructed according to the number of pixels of the character to be recognized in each sub-region, and the histogram is used as a feature vector.
关于字符识别装置600的各个部分的操作和功能的细节可以参照结合图1至图4描述的本发明的实施例,这里不再详细描述。Details about the operation and functions of each part of the character recognition device 600 can refer to the embodiment of the present invention described in conjunction with FIG. 1 to FIG. 4 , and will not be described in detail here.
在此需要说明的是,图6和图7所示的字符识别装置600及其组成单元的结构仅仅是示例性的,本领域技术人员可以根据需要对图6和图7所示的结构框图进行修改。It should be noted here that the structure of the character recognition device 600 and its constituent units shown in FIG. 6 and FIG. 7 are only exemplary, and those skilled in the art can make detailed descriptions of the structural block diagrams shown in FIG. 6 and FIG. 7 as needed. Revise.
以上结合具体实施例描述了本发明的基本原理,但是,需要指出的是,对本领域的普通技术人员而言,能够理解本发明的方法和装置的全部或者任何步骤或者部件,可以在任何计算装置(包括处理器、存储介质等)或者计算装置的网络中,以硬件、固件、软件或者它们的组合加以实现,这是本领域普通技术人员在阅读了本发明的说明的情况下运用他们的基本编程技能就能实现的。The basic principles of the present invention have been described above in conjunction with specific embodiments, but it should be pointed out that those skilled in the art can understand that all or any steps or components of the method and device of the present invention can be implemented on any computing device (including processors, storage media, etc.) or a network of computing devices, implemented with hardware, firmware, software, or a combination thereof, this is a person of ordinary skill in the art who uses their basic knowledge after reading the description of the present invention programming skills will do.
因此,本发明的目的还可以通过在任何计算装置上运行一个程序或者一组程序来实现。所述计算装置可以是公知的通用装置。因此,本发明的目的也可以仅仅通过提供包含实现所述方法或者装置的程序代码的程序产品来实现。也就是说,这样的程序产品也构成本发明,并且存储有这样的程序产品的存储介质也构成本发明。显然,所述存储介质可以是任何公知的存储介质或者将来所开发出来的任何存储介质。Therefore, the object of the present invention can also be achieved by running a program or a group of programs on any computing device. The computing device may be a known general-purpose device. Therefore, the object of the present invention can also be achieved only by providing a program product including program codes for realizing the method or device. That is, such a program product also constitutes the present invention, and a storage medium storing such a program product also constitutes the present invention. Obviously, the storage medium may be any known storage medium or any storage medium developed in the future.
在通过软件和/或固件实现本发明的实施例的情况下,从存储介质或网络向具有专用硬件结构的计算机,例如图8所示的通用计算机800安装构成该软件的程序,该计算机在安装有各种程序时,能够执行各种功能等等。In the case of implementing the embodiments of the present invention by software and/or firmware, the program constituting the software is installed from a storage medium or network to a computer having a dedicated hardware configuration, such as a general-purpose computer 800 shown in FIG. When there are various programs, it is possible to perform various functions and so on.
图8示出了可用于实施根据本发明实施例的方法和装置的计算机的示意性框图。在图8中,中央处理单元(CPU)801根据只读存储器(ROM)802中存储的程序或从存储部分808加载到随机存取存储器(RAM)803的程序执行各种处理。在RAM 803中,还根据需要存储当CPU 801执行各种处理等等时所需的数据。CPU 801、ROM 802和RAM 803经由总线804彼此连接。输入/输出接口805也连接到总线804。Fig. 8 shows a schematic block diagram of a computer that can be used to implement the method and device according to the embodiments of the present invention. In FIG. 8 , a central processing unit (CPU) 801 executes various processes according to programs stored in a read only memory (ROM) 802 or loaded from a storage section 808 to a random access memory (RAM) 803 . In the RAM 803 , data required when the CPU 801 executes various processes and the like is also stored as necessary. The CPU 801 , ROM 802 , and RAM 803 are connected to each other via a bus 804 . The input/output interface 805 is also connected to the bus 804 .
下述部件连接到输入/输出接口805:输入部分806(包括键盘、鼠标等等)、输出部分807(包括显示器,比如阴极射线管(CRT)、液晶显示器(LCD)等,和扬声器等)、存储部分808(包括硬盘等)、通信部分809(包括网络接口卡比如LAN卡、调制解调器等)。通信部分809经由网络比如因特网执行通信处理。根据需要,驱动器810也可连接到输入/输出接口805。可拆卸介质811比如磁盘、光盘、磁光盘、半导体存储器等等可以根据需要被安装在驱动器810上,使得从中读出的计算机程序根据需要被安装到存储部分808中。The following components are connected to the input/output interface 805: an input section 806 (including a keyboard, a mouse, etc.), an output section 807 (including a display such as a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and a speaker, etc.), A storage section 808 (including a hard disk, etc.), a communication section 809 (including a network interface card such as a LAN card, a modem, etc.). The communication section 809 performs communication processing via a network such as the Internet. A driver 810 may also be connected to the input/output interface 805 as needed. A removable medium 811 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like can be mounted on the drive 810 as needed, so that a computer program read therefrom can be installed into the storage section 808 as needed.
在通过软件实现上述系列处理的情况下,从网络比如因特网或存储介质比如可拆卸介质811安装构成软件的程序。In the case of realizing the above-described series of processing by software, the programs constituting the software are installed from a network such as the Internet or a storage medium such as the removable medium 811 .
本领域的技术人员应当理解,这种存储介质不局限于图8所示的其中存储有程序、与设备相分离地分发以向用户提供程序的可拆卸介质811。可拆卸介质811的例子包含磁盘(包含软盘(注册商标))、光盘(包含光盘只读存储器(CD-ROM)和数字通用盘(DVD))、磁光盘(包含迷你盘(MD)(注册商标))和半导体存储器。或者,存储介质可以是ROM 802、存储部分808中包含的硬盘等等,其中存有程序,并且与包含它们的设备一起被分发给用户。Those skilled in the art should understand that such a storage medium is not limited to the removable medium 811 shown in FIG. 8 in which the program is stored and distributed separately from the device to provide the program to the user. Examples of the removable media 811 include magnetic disks (including floppy disks (registered trademark)), optical disks (including compact disk read only memory (CD-ROM) and digital versatile disks (DVD)), magneto-optical disks (including )) and semiconductor memory. Alternatively, the storage medium may be the ROM 802, a hard disk contained in the storage section 808, or the like, in which the programs are stored and distributed to users together with devices containing them.
本发明还提出一种存储有机器可读取的指令代码的程序产品。所述指令代码由机器读取并执行时,可执行上述根据本发明实施例的方法。The invention also proposes a program product storing machine-readable instruction codes. When the instruction code is read and executed by a machine, the above-mentioned method according to the embodiment of the present invention can be executed.
相应地,用于承载上述存储有机器可读取的指令代码的程序产品的存储介质也包括在本发明的公开中。所述存储介质包括但不限于软盘、光盘、磁光盘、存储卡、存储棒等等。Correspondingly, a storage medium for carrying the program product storing the above-mentioned machine-readable instruction codes is also included in the disclosure of the present invention. The storage medium includes, but is not limited to, a floppy disk, an optical disk, a magneto-optical disk, a memory card, a memory stick, and the like.
在上面对本发明具体实施例的描述中,针对一种实施方式描述和/或示出的特征可以以相同或类似的方式在一个或更多个其它实施方式中使用,与其它实施方式中的特征相组合,或替代其它实施方式中的特征。In the above description of specific embodiments of the present invention, features described and/or illustrated for one embodiment can be used in the same or similar manner in one or more other embodiments, and features in other embodiments Combination or replacement of features in other embodiments.
应该强调,术语“包括/包含”在本文使用时指特征、要素、步骤或组件的存在,但并不排除一个或更多个其它特征、要素、步骤或组件的存在或附加。It should be emphasized that the term "comprising/comprising" when used herein refers to the presence of a feature, element, step or component, but does not exclude the presence or addition of one or more other features, elements, steps or components.
此外,本发明的方法不限于按照说明书中描述的时间顺序来执行,也可以按照其他的时间顺序地、并行地或独立地执行。因此,本说明书中描述的方法的执行顺序不对本发明的技术范围构成限制。In addition, the method of the present invention is not limited to being executed in the chronological order described in the specification, and may also be executed in other chronological order, in parallel or independently. Therefore, the execution order of the methods described in this specification does not limit the technical scope of the present invention.
尽管上面已经通过对本发明的具体实施例的描述对本发明进行了披露,但是,应该理解,上述的所有实施例和示例均是示例性的,而非限制性的。本领域的技术人员可在所附权利要求的精神和范围内设计对本发明的各种修改、改进或者等同物。这些修改、改进或者等同物也应当被认为包括在本发明的保护范围内。Although the present invention has been disclosed by the description of specific embodiments of the present invention above, it should be understood that all the above embodiments and examples are illustrative rather than restrictive. Those skilled in the art can devise various modifications, improvements or equivalents to the present invention within the spirit and scope of the appended claims. These modifications, improvements or equivalents should also be considered to be included in the protection scope of the present invention.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110192827.3A CN102855498B (en) | 2011-07-01 | 2011-07-01 | Character identifying method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110192827.3A CN102855498B (en) | 2011-07-01 | 2011-07-01 | Character identifying method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102855498A CN102855498A (en) | 2013-01-02 |
CN102855498B true CN102855498B (en) | 2016-08-31 |
Family
ID=47402074
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201110192827.3A Active CN102855498B (en) | 2011-07-01 | 2011-07-01 | Character identifying method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102855498B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150006360A1 (en) * | 2013-06-28 | 2015-01-01 | Google Inc. | Threshold Confidence Levels for Extracted Card Data |
CN105184328B (en) * | 2015-08-17 | 2018-11-27 | 浪潮金融信息技术有限公司 | A kind of method and device identifying image |
CN117085969B (en) * | 2023-10-11 | 2024-02-13 | 中国移动紫金(江苏)创新研究院有限公司 | Artificial Intelligence Industrial Vision Inspection Methods, Devices, Equipment and Storage Media |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101373519A (en) * | 2007-08-20 | 2009-02-25 | 富士通株式会社 | Character recognition device and method |
CN101536035A (en) * | 2006-08-31 | 2009-09-16 | 公立大学法人大阪府立大学 | Image recognition method, image recognition device, and image recognition program |
-
2011
- 2011-07-01 CN CN201110192827.3A patent/CN102855498B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101536035A (en) * | 2006-08-31 | 2009-09-16 | 公立大学法人大阪府立大学 | Image recognition method, image recognition device, and image recognition program |
CN101373519A (en) * | 2007-08-20 | 2009-02-25 | 富士通株式会社 | Character recognition device and method |
Also Published As
Publication number | Publication date |
---|---|
CN102855498A (en) | 2013-01-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102388392B (en) | Pattern recognition device | |
WO2017016240A1 (en) | Banknote serial number identification method | |
CN107944020A (en) | Facial image lookup method and device, computer installation and storage medium | |
JP2014232533A (en) | System and method for ocr output verification | |
JP5283088B2 (en) | Image search device and computer program for image search applied to image search device | |
US20090041361A1 (en) | Character recognition apparatus, character recognition method, and computer product | |
CN101493887B (en) | Eyebrow image segmentation method based on semi-supervised learning and hash index | |
US8027978B2 (en) | Image search method, apparatus, and program | |
JP4300098B2 (en) | Object identification method and apparatus, program and recording medium | |
CN110363224B (en) | Object classification method and system based on image and electronic equipment | |
CN112883980B (en) | Data processing method and system | |
WO2022134580A1 (en) | Method and apparatus for acquiring certificate information, and storage medium and computer device | |
CN105512657A (en) | Character recognition method and apparatus | |
CN112926601A (en) | Image recognition method, device and equipment based on deep learning and storage medium | |
CN102855498B (en) | Character identifying method and device | |
CN117831056A (en) | Bill information extraction method, device and bill information extraction system | |
WO2009048696A1 (en) | Local image descriptors using linear discriminant embedding | |
Chakraborty et al. | Hand gesture recognition: A comparative study | |
CN108090728B (en) | Express information input method and system based on intelligent terminal | |
CN113657273B (en) | Method, device, electronic equipment and medium for determining commodity information | |
CN110020638A (en) | Facial expression recognizing method, device, equipment and medium | |
JP6151908B2 (en) | Learning device, identification device, and program thereof | |
Úbeda et al. | Pattern spotting in historical documents using convolutional models | |
CN113408553A (en) | Image category identification method and device, electronic equipment and storage equipment | |
CN113474786A (en) | Electronic purchase order identification method and device and terminal equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |