CN1025764C

CN1025764C - Characters recognition method and system

Info

Publication number: CN1025764C
Application number: CN 92103651
Authority: CN
Inventors: 杨源远; 路浩如; 杨震; 杨平勇; 李璇
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 1992-05-12
Filing date: 1992-05-12
Publication date: 1994-08-24
Anticipated expiration: 2007-05-12
Also published as: CN1066335A

Abstract

The present invention relates to a character recognition method and a system. Stroke characteristics of a character picture are extracted, and characters are classified, matched and identified by directly making use of the stroke characteristics. Structure meaning of the characters is expressed by adopting a framework form, important influence strokes and stroke connections in a framework are emphasized, strokes with less action is omitted, and a necessary comparison condition for stroke direction of variable permission and similar character discrimination are presented. The present invention is favorable for protruding distinction between the characters, and processes of matching and identification are simplified. Compared with the present universal character identification technology, the present invention has high identification rate and adaptability.

Description

本发明涉及一种字符识别方法和系统，尤其适用于识别手写体汉字和多字体印刷汉字的识别方法。The invention relates to a character recognition method and system, and is especially suitable for recognizing handwritten Chinese characters and multi-font printed Chinese characters.

国内外已经研制的若干字符识别系统，主要采用对字符图象的象元分布抽取特征参量，并以此参量为依据进行分类和匹配识别的字符识别方法。例如，1989年2月8日中国专利审定公告CN1003257B的字符识别系统，1990年11月21日中国专利审定公告CN1010512B所公开的技术。Several character recognition systems that have been developed at home and abroad mainly use the character recognition method of extracting characteristic parameters from the pixel distribution of character images, and classifying and matching recognition based on these parameters. For example, the character recognition system of CN1003257B on February 8, 1989, and the technology disclosed in CN1010512B on November 21, 1990.

因此，通常的技术有如下的问题：Therefore, conventional techniques have the following problems:

1.不能直接反映字符的结构特征，因而忽视了笔划结构作为字符构成的本质特点。1. It cannot directly reflect the structural features of characters, thus ignoring the stroke structure as the essential feature of character formation.

2.大字符集的情况下难以达到高的识别率。2. It is difficult to achieve a high recognition rate in the case of a large character set.

3.区分形态相似或笔划结构复杂的字符十分困难。3. It is very difficult to distinguish characters with similar shapes or complex stroke structures.

4.在手写体字符情况下，字形书写变化很大，所抽取的特征参量分散性大，且需采用高维特征矢量。4. In the case of handwritten characters, the writing of glyphs varies greatly, and the extracted feature parameters are highly dispersed, and high-dimensional feature vectors are required.

本发明的目的是创造一种字符识别方法，力求准确地抽取字符图象的笔划特征，充分反映字符的结构本质;直接利用字符的笔划结构词义对字符分类和匹配识别;运用知识表达字符的结构词义，达到简化字符的匹配识别过程，提高辨认相似字符的准确性和识别方法的适应能力。The purpose of the present invention is to create a character recognition method, strive to accurately extract the stroke features of the character image, fully reflect the structural nature of the character; directly use the stroke structure and meaning of the character to classify and match the character; use knowledge to express the structure of the character Word meaning, to simplify the matching and recognition process of characters, improve the accuracy of recognizing similar characters and the adaptability of recognition methods.

本发明所涉及的字符识别方法包括：对书写有字符的页面扫描获得字符图象为第一步骤;字符图象二值化、字符切分及规格化为第二步骤;抽取字符二值化点阵的笔划结构特征为第三步骤;由结构特征求得分类特征码以确定所属分类为第四步骤;将结构特征与所属分类的字符模型进行匹配并识别之为第五步骤;将识别结果转为可见输出为第六步骤。The character recognition method involved in the present invention includes: scanning the page with characters written on it to obtain a character image is the first step; character image binarization, character segmentation and normalization are the second step Steps: extracting the stroke structure feature of character binarized lattice is the third step; obtaining the classification feature code from the structure feature to determine the category to which it belongs is the fourth step; matching the structure feature with the character model of the category to which it belongs and identifying it as The fifth step; converting the recognition result into visible output is the sixth step.

所述的第三步骤包括：Said third step comprises:

1.字符结构模式作为模式整体可以分解为元字符、笔划和笔划元三种子模式。元字符是构造字符的字符。笔划分解为直线段即为笔划元。笔划元是最低级子模式，用作描述字符模式的结构基元，其结构特征包括笔划元中心坐标、长度、方向和连接关系。1. The character structure pattern as a whole can be decomposed into three sub-patterns: metacharacter, stroke and stroke meta. Metacharacters are the characters that construct characters. Strokes are decomposed into straight line segments which are stroke elements. The stroke unit is the lowest sub-pattern, used as a structural primitive to describe the character mode, and its structural features include stroke unit center coordinates, length, direction and connection relationship.

2.对字符点阵作一次简单的扫描，检测每一象元在8个方向上与相邻象元的连接情况，将其区分为笔划的始端、终端、连接区或普通笔划元素并标记相应的符号，从而将字符点阵平面（CDP）转换成字符象元属性平面（CAP）。2. Do a simple scan of the character lattice, detect the connection between each pixel and the adjacent pixel in 8 directions, distinguish it as the beginning, end, connection area or ordinary stroke elements of the stroke and mark the corresponding The symbol, thus converting the character dot matrix plane (CDP) into the character pixel attribute plane (CAP).

3.除属于连接区的象元以外，在CAP上处于边缘点的象元，计算其“｜”、“-”、“/”、“＼”四个方向上连续的象元个数en，en最大的方向取作该边缘点的纤维主方向。在主方向上的en值称作纤维长度，纤维长度上连接的象元赋以主方向相应的权值。各边缘点的纤维可能相交形成交织区，交织区的象元其方向权值累加。所有边缘点完成上述计算后即可求得字符纤维结构图（CFP）。3. Except for the pixels belonging to the connection area, calculate the number of continuous pixels en in the four directions of "|", "-", "/", and "\" for the pixels at the edge points on the CAP, The direction in which en is greatest is taken as the main fiber direction of the edge point. The value of en in the main direction is called the fiber length, and the pixels connected on the fiber length are given the corresponding weight of the main direction. The fibers at each edge point may intersect to form an interweaving area, and the direction weights of the pixels in the interweaving area are accumulated. After all the edge points complete the above calculations, the character fiber structure map (CFP) can be obtained.

4.对照CAP连接区的方向特征，除去CFP中的噪声纤维，将属于“｜”、“-”、“/”、“＼”四个方向的纤维分别置于V、h、s、b四个平面中，即可求得每一笔划元的中心坐标、长度和方向。4. Compared with the directional characteristics of the CAP connection area, the noise fibers in the CFP were removed, and the fibers belonging to the four directions of "|", "-", "/", and "\" were placed in V, h, s, and b respectively. In a plane, the center coordinates, length and direction of each stroke element can be obtained.

5.利用CAP的端点和连接区特征，结合已经求到的笔划元中心坐标、长度和方向可以计算笔划元的连接关系。5. The connection relationship of stroke elements can be calculated by using the features of endpoints and connection areas of CAP, combined with the obtained center coordinates, length and direction of stroke elements.

所述的第四步骤包括：Said fourth step comprises:

1.应用字符外围结构的四角特征和四边特征作为字符的分类特征，在二个层次上进行外围结构的描述和分类。由已知字符的四角特征和四边特征建立预分类字典。1. Using the four-corner and four-edge features of the character's peripheral structure as the classification feature of the character, the description and classification of the peripheral structure are carried out on two levels. A pre-classification dictionary is established from the four-corner features and four-edge features of known characters.

2.在字符的笔划平面上（CSP）以平面的四个角为中心，搜索距离四角最近的笔划元。2. On the stroke plane (CSP) of the character, take the four corners of the plane as the center, and search for the stroke element closest to the four corners.

3.判断最近角点的笔划元方向属性，并分成横、竖、撇、捺、角、交六种类型，赋以相应的编码，称作角码。由四个角码组成的码串构成字符的第一分类特征。3. Determine the stroke element direction attribute of the nearest corner point, and divide it into six types: horizontal, vertical, left, right, corner, and cross, and assign corresponding codes, which are called corner codes. The code string composed of four corner codes constitutes the first classification feature of the character.

4.在CSP上由中心引出射线，按顺时针扫描，获得射线与字符最外层笔划元所组成的多边形作为字符外围轮廓，抽取其超过某一阈值的凸点，分别计数每一边的凸点数求得四边的码串构成字符的第二分类特征。4. On the CSP, the ray is drawn from the center and scanned clockwise to obtain the polygon composed of the ray and the outermost stroke element of the character as the outline of the character, extract the convex points exceeding a certain threshold, and count the number of convex points on each side Obtain the second classification feature of the characters formed by the code strings of the four sides.

5.查找预分类字典中与待识字符四角码及四边码相同的同类字符代码，完成第四步骤。5. Find the same kind of character codes in the pre-classification dictionary as the four-corner code and the four-side code of the character to be recognized, and complete the fourth step.

所述的第五步骤：The fifth step described:

1.字符结构词义采用框架形式的知识表达，由字符框架表达每一字符模式。在框架中，构成字符的全部笔划元分别在h、v、s、b四个平面上分组排序，并列出必要的笔划连接关系和相似字之间笔划元特征的辨析条件。在字符框架中参与分组排序的每一个笔划元由笔划元框架描述。笔划元框架表达笔划元之正常方向、中心位置和长度。此外，还给出该笔划的权重和允许的畸变方向。字符框架中的必要连接关系和笔划元框架中的权重属于运用知识表达、强调对识别结果有重要影响的笔划元及其连接关系而忽视那些冗余的或影响不大的成份。相似字辨析条件和允许的畸变方向使得识别过程既能顾及在结构复杂而且数量庞大的字符集中辨认不同字符间笔划结构的细微差别，又能对变化万千的字形具有良好的适应能力。1. The word meaning of the character structure adopts the knowledge expression in the form of a frame, and each character pattern is expressed by the character frame. In the framework, all the stroke elements constituting a character are grouped and sorted on the four planes of h, v, s, and b respectively, and the necessary stroke connection relations and the discrimination conditions of stroke element features between similar characters are listed. Each stroke element that participates in the grouping sort in the character frame is described by the stroke element frame. The stroke element frame expresses the normal direction, center position and length of the stroke element. Additionally, the stroke's weight and allowed distortion directions are given. The necessary connections in the character frame and the weights in the stroke element frame belong to the use of knowledge expression, emphasizing the stroke elements and their connections that have an important impact on the recognition results and ignoring those redundant or insignificant components. The similar character discrimination conditions and the allowed distortion direction make the recognition process take into account the It can recognize the nuances of the stroke structure between different characters in a complex and large number of character sets, and has good adaptability to the ever-changing glyphs.

2.取出预分类同类的字符模型，依次与待识字符的笔划元特征进行搜索匹配、计算属性距离，若距离小于某一阈值认为匹配成功，否则认为匹配失败。如此过程在每个模型的四个笔划元子平面上依次执行直至结束。2. Take out the pre-classified character models of the same type, search and match with the stroke meta-features of the characters to be recognized in turn, and calculate the attribute distance. If the distance is less than a certain threshold, the match is considered successful, otherwise the match is considered failed. This process is executed sequentially on the four stroke element sub-planes of each model until the end.

3.按照笔划元框架指定的权重计算笔划元属性的加权距离。对字符结构起关键作用的笔划元由于有最高的权重而便于区分字符间笔划的细微差异，影响不大的笔划元有较小的权重，从而达到忽略冗余笔划的目的。3. Calculate the weighted distance of the stroke meta attribute according to the weight specified by the stroke meta frame. The stroke elements that play a key role in the character structure have the highest weight, which is convenient for distinguishing the subtle differences of strokes between characters, and the stroke elements that have little influence have smaller weights, so as to achieve the purpose of ignoring redundant strokes.

4.匹配未成的笔划元中若存在容许畸变方向的、转向相应方向的样本子平面搜索匹配。4. If there is a sample sub-plane in the unmatched stroke element that allows the distortion direction and turns to the corresponding direction, search and match.

5.对必要的连接关系进行检测，不满足这一要求时退出匹配候选列。5. Check the necessary connection relationship, and exit the matching candidate column if this requirement is not met.

6.检测笔划元比较和相似字符辨析条件，不满足要求时退出匹配候选列。6. Detect stroke element comparison and similar character discrimination conditions, and exit the matching candidate list if the requirements are not met.

7.匹配总距离在阈值范围内的所有字符，按距离从小到大排序，取出最小的几个作为识别候选字，若无识别候选字以拒识处理。7. Match all characters whose total distance is within the threshold range, sort by distance from small to large, and take out the smallest few as recognition candidates. If there is no recognition candidate, it will be rejected.

本发明具有的独特优点可概括如下：The unique advantages that the present invention has can be summarized as follows:

准确抽取笔划结构特征从而充分反映了字符的本质特点。直接利用笔划特征描述字符之结构骨架而以笔划属性矢量适应字符形态的种种变化，实现字符分类和匹配识别。对字符的结构词义模型运用框架形式的知识表达，既便于强调重要的笔划或笔划连接关系，又可忽视对识别字符影响不大的笔划，十分有利于突出字符间的区别简化匹配识别过程。框架中表达了相似字的辨析条件，使得辨认字符间细微的笔划差异成为可能，例如：风、凤;士、土;澜、谰……，从而极大地提高了字符的识别率。在笔划框架中还给出允许畸变的方向，使得识别的灵活性和适应能力显著提高。与现有的技术比较，既避免统计方法中因采用高维特征存在特征选择和模式可分性方面的困难而限制识别率的提高。也避免了结构方法难以适应字符形态多变的缺陷。Accurately extract stroke structure features to fully reflect the essential characteristics of characters. Directly use stroke features to describe the structural skeleton of characters, and use stroke attribute vectors to adapt to various changes in character shapes, and realize character classification and matching recognition. The knowledge expression in the form of a frame is used for the structure and semantic model of characters, which not only facilitates emphasizing important strokes or stroke connection relationships, but also ignores strokes that have little influence on character recognition, which is very conducive to highlighting the differences between characters and simplifying the matching and recognition process. The analysis conditions of similar characters are expressed in the framework, so that the subtle differences between characters can be identified Stroke differences become possible, for example: Feng, Feng; Shi, Tu; Lan, Qiao..., thus greatly improving the recognition rate of characters. The direction of allowable distortion is also given in the stroke frame, so that the flexibility and adaptability of recognition are significantly improved. Compared with the existing technology, it avoids the difficulty of feature selection and pattern separability in the statistical method due to the use of high-dimensional features, which limits the improvement of the recognition rate. It also avoids the defect that the structural method is difficult to adapt to the changeable character form.

本发明的实施例由图文扫描仪、微型计算机主机、显示器、打印机、磁带机及有关接口板组成。扫描仪包括手持扫描在内各种型式均可适用。微型计算机主机使用DOS操作系统最为通用。磁带机不是必要的设备可以作为主机存储器的扩充或后备自由选用。系统的工作原理结合下面的附图逐步说明。The embodiment of the present invention is made up of graphic scanner, microcomputer mainframe, display, printer, magnetic tape drive and relevant interface board. Various types of scanners are available, including handheld scanners. The DOS operating system is the most common for microcomputer mainframes. If the tape drive is not necessary, it can be freely selected as the expansion or backup of the host memory. The working principle of the system is explained step by step with the following drawings.

附图说明：Description of drawings:

图1是本发明实施例的方块结构图Fig. 1 is the block diagram of the embodiment of the present invention

图2是结构特征抽取工作流程图Figure 2 is a workflow diagram of structural feature extraction

图3是结构特征抽取的实例Figure 3 is an example of structural feature extraction

图4是笔划元连接关系描述Figure 4 is a description of the stroke element connection relationship

图5是预分类工作流程图Figure 5 is a flow chart of the pre-classification workflow

图6是四角特征码表Figure 6 is a four-corner feature code table

图7是字符框架Figure 7 is the character frame

图8是笔划元条件排序结构图Figure 8 is a structural diagram of stroke element conditional sorting

图9是笔划元条件排序工作流程图Fig. 9 is a flow chart of stroke element conditional sorting

图10是笔划元框架Figure 10 is the stroke meta frame

图11是运用知识引导的匹配识别工作流程图Figure 11 is a flow chart of matching recognition using knowledge guidance

图12是子平面h笔划元匹配工作流程图。Fig. 12 is a flow chart of sub-plane h stroke element matching.

图1是实施例的系统方块图，书写在纸张上的字符用图文扫描仪扫描页面，每页扫描得到一幅图象文件，按所选的灰度阈值转换成二值化（0，1）点阵，经接口板存入计算机内。由页面切分程序模块搜索点阵的起始行，行总数，字首和字数自动完成字的切分，经规格化处理后得到每个字符的点阵（例如32×32或64×64字符点阵），抽取每个字符点阵的笔划特征，进行分类、匹配进而识别该字符至存于机内的字符点阵全部识别完毕，以机内码表示识别结果。最后以标准字形显示或打印出书写在样张上全部字符的识别结果，或者继续进行必要的编辑。Fig. 1 is the system block diagram of the embodiment, the characters written on the paper scan the page with a graphic scanner, and each page scans to obtain an image file, which is converted into binary by the selected gray scale threshold. Value (0, 1) dot matrix, stored in the computer through the interface board. The page segmentation program module searches for the starting line of the dot matrix, the total number of lines, the prefix and the number of words to automatically complete the word segmentation, and obtain the dot matrix of each character after normalization processing (for example, 32×32 or 64×64 characters Dot matrix), extract the stroke features of each character dot matrix, classify, match and then recognize the character until the character dot matrix stored in the machine is fully recognized, and the recognition result is represented by the machine internal code. Finally, display or print out the recognition results of all the characters written on the sample in standard fonts, or continue to make necessary edits.

图2是结构特征抽取的流程图，以规格化处理后的字符点阵（CDP）作为该流程的起点，扫描CDP的行和列，检测在行和列二个方向取值为1的连续象元数X，记录出现次数最多的X作为笔划宽度wi，在行和列方向用笔划宽度量度连续象元素不足wi时，分别用“｜”和“-”标记该象元。在“-”象元的两侧检测其是否为0，如左侧为0属于左端点，标记为“W”。如右侧为0属于右端点，标记为“E”。在“｜”象元的上、下二方检测其是否为0，上方为0属于上端点标记为“N”，下方为0属于下端点标记为“S”。在CDP中所有既不是“-”亦不是“｜”的象元，按其区域的坐标顺序用小写英文字符标记。该英文字符标记的区域即为笔划的连接区，并计算该连接区的特征。CDP的每一个象元按上述要求由指定符号标记之后即赋予笔划的始端、终端、连接区或普通象元等不同的属性称为字符象元属性平面（CAP），图3示出书写字符“毗”字的结构特征抽取实例。其中左上方是CAP图，下方是连接区特征表，第一列是序号、第二列是连接区代号、第三、四列分别是起始和终结的列坐标、第五、六列分别是起始和终结的横坐标。最后一列是连接区的连接特征，连接特征用代码表示示于图4。对CAP的每一个边缘点，除连接区的象元外，在行、列、左斜、右斜四个方向上计算其连续非0的象元数，取其象元数最大的方向作为该边缘点的纤维主方向，主方向上连接的象元数为纤维长度，各象元赋以主方向相应的权值。各边缘点的纤维可能相交形成交织区，交织区象元其方向权值累加。所有边缘点完成上述计算后即求得字符纤维结构图（CFP）。除去交织区的噪声纤维，将属于行、列、左斜、右斜四个方向的纤维分别置于h、v、s、b四个平面中即可求得每一笔划元的中心坐标、长度和方向，再利用CAP的端点和连接特征求得笔划元的连接关系，从而取得字符的全部结构特征。图3的右上图示出了“毗”字结构特征的实例。Figure 2 is a flow chart of structural feature extraction. The normalized character dot matrix (CDP) is used as the starting point of the process, and the rows and columns of the CDP are scanned to detect continuous images with a value of 1 in the two directions of the row and column. As for the number X, record the most frequently occurring X as the stroke width wi, and when the continuous pixel elements measured by the stroke width in the row and column directions are less than wi, mark the pixel with "|" and "-" respectively. Check whether it is 0 on both sides of the "-" pixel. If the left side is 0, it belongs to the left endpoint, and it is marked as "W". If the 0 on the right side belongs to the right endpoint, it is marked as "E". Check whether it is 0 at the upper and lower sides of the "|" pixel. If the upper part is 0, it belongs to the upper endpoint and is marked as "N", and if the lower part is 0, it belongs to the lower endpoint and is marked as "S". All pixels that are neither "-" nor "|" in CDP are marked with lowercase English characters according to the coordinate order of their area. The area marked by the English characters is the connection area of the stroke, and the features of the connection area are calculated. Each pixel of the CDP is marked by the specified symbol according to the above requirements, and then endowed with different attributes such as the beginning, end, connection area or common pixel of the stroke, which is called the character pixel attribute plane (CAP). Figure 3 shows the writing character " Structural feature extraction example of "" character. Among them, the upper left is the CAP diagram, and the lower part is the connection area feature table. The first column is the serial number, the second column is the connection area code, the third and fourth columns are the starting and ending column coordinates, and the fifth and sixth columns are respectively The abscissa of the start and end. The last column is the connection feature of the connection area, and the connection feature is shown in Figure 4 in code. For each edge point of the CAP, except for the pixels in the connection area, calculate the number of continuous non-zero pixels in the four directions of row, column, left oblique, and right oblique, and choose the direction with the largest number of pixels. To the main direction of the fiber as the edge point, the number of pixels connected in the main direction is the length of the fiber, and each pixel is assigned the corresponding weight of the main direction. The fibers at each edge point may intersect to form an interweaving area, and the direction weights of the pixels in the interweaving area are accumulated. After all the edge points complete the above calculations, the character fiber structure map (CFP) is obtained. Remove the noise fibers in the interweaving area, and place the fibers belonging to the four directions of row, column, left oblique, and right oblique respectively in the four planes h, v, s, and b to obtain the center coordinates and length of each stroke element and direction, and then use the endpoints and connection features of CAP to obtain the connection relationship of stroke elements, so as to obtain all the structural features of characters. The upper right figure of Fig. 3 has shown the example of the structural feature of " adjacent " character.

图5是预分类工作流程图。在字符的笔划平面上，以平面的四个角为中心，搜索距离四角最近的笔划元。判断该笔划元的方向属性，把它们分成横、竖、撇、捺、角、交和空七种类型。它们的编码如图6所示称为角码，由四个角码组成的码串构成字符的第一分类特征。在字符笔划平面上再由中心引出射线，按顺时针扫描，获得射线与字符最外层笔划元所组成的多边形作为字符的外围轮廓，抽取其超过某一阈值的凸点，分别计算每一边的凸点数作为边码，四个边码构成四边码串即为字符的第二分类特征。由四边码和四角码查找预分类字典，获得同类字符代码。Figure 5 is a flow chart of the pre-classification workflow. On the stroke plane of the character, with the four corners of the plane as the center, search for the stroke element closest to the four corners. Judge the direction attribute of this stroke element, divide them into horizontal, vertical, left, right, angle, intersection and empty seven types. Their coding is called angle code as shown in Figure 6, and the code string that is made up of four angle codes constitutes the first classification characteristic of character. Draw a ray from the center on the stroke plane of the character, and scan clockwise to obtain the polygon composed of the ray and the outermost stroke element of the character as the outer contour of the character, extract the convex points exceeding a certain threshold, and calculate the The number of salient points is used as the side code, and the four side codes form a four-side code string, which is the second classification feature of the character. Look up the pre-classified dictionary from the four-sided code and the four-corner code to obtain the same character code.

图7是表达字符结构词义模型的框架，其中带下标的ε_i表示第i个笔划元，分别在h、v、s、b四个子面上分组排序，图8为笔划元条件排序结构图，排序条件可参照图9。必要的连接关系Ω_mn是指该字符第m个笔划元和第n个笔划元之间必须满足的连接关系，例如：“夫”字，第一横笔和竖笔之间必须是相交的关系，而天则无此要求。笔划元比较槽口则用以辨别字符内部笔划长短比较或方向的不同，例如：土、士;天、夭，而相似字符辨析条件则判断某一笔划元缺少或存在时，候选字符的转移方向，例如：风、凤;梁、粱等等。图10是笔划元框架表达图7中的每一个笔规元ε_i的结构特征。包括笔划元的正常方向之量化为横、竖、撇、捺为四个方向分别用h、v、s、b代表;笔划元中心坐标（ x_o， y_o）_i和笔划长度。框架中还给出了该笔划允许畸变的方向ε^′ _i和结构权重wi，前者使匹配过程灵活而提高系统对字形变化的适应能力，后者则突出重点简化匹配。图7和图10组成系统的结构模型。图11示出运用知识引导的匹配识别工作流程图。图12是某子平面笔划元匹配工作流程图。按照预分类所给出的同类字符代码从知识库中逐个取出相应的字符模型，由图9表示的条件排序程序模块对已求得之笔划元进行排序。在h、v、s、b四个子平面上依次将模型中的笔划元与待识字符之笔划元先组内后组间依次匹配，笔划之间的属性距离小于规定阈值δ时认为匹配成功，否则认为匹配失败。若匹配失败，向下搜索字符笔划元是否匹配，如无匹配可能，取下一个模型笔划元进行匹配。这一过程一直进行到最后一个模型笔划元。若模型笔划元全部匹配成功或字符笔划元匹配完毕，则按照指定的权重计算全部笔划的属性距离，距离在阈值△范围之内时认为可列入匹配候选。模型中匹配未成的笔划元中若存在允许畸变方向的，转向相应方向的样本子平面搜索匹配，方法相同。对于列入匹配候选的字符模型进一步检测待识字符是否满足指定的连接关系Ω_mn，例如：夫、天;力、刀;……，夫和力相交关系都是必要的，不满足这一要求时退出匹配候选列。如果模型框架中存在笔划元比较的要求则检查是否满足要求，不满足比较条件的退出候选列。重复上述匹配比较直至全部分类模型匹配完毕。匹配总距离在阈值范围内的所有字符按距离从小到大的次序排列作为识别候选字排列首位的是第一候选字，通常情况下取为识别结果。若无识别候选字则以拒识处理。Fig. 7 is a framework for expressing the word meaning model of character structure, wherein the subscript ε _i represents the i-th stroke element, which is grouped and sorted on the four sub-faces of h, v, s, and b respectively. Fig. 8 is a conditional sorting structure diagram of stroke elements, Refer to Figure 9 for sorting conditions. The necessary connection relationship Ω _mn refers to the connection relationship that must be satisfied between the mth stroke element and the nth stroke element of the character, for example: "Fu", the first horizontal stroke and the vertical stroke must be intersecting , but God has no such requirement. The stroke element comparison notch is used to distinguish the difference in stroke length or direction within the character, for example: Earth, Shi; Tian, Yao, and the similar character discrimination condition is to judge the transfer direction of the candidate character when a certain stroke element is missing or exists. , For example: wind, phoenix; beam, beam and so on. Fig. 10 is a stroke element framework expressing the structural features of each stroke element ε _i in Fig. 7 . Quantization of the normal direction including the stroke element is horizontal, vertical, left and right, and the four directions are represented by h, v, s, b respectively; the central coordinates of the stroke element ( _xo , _yo ) _i and the length of the stroke. The framework also gives the direction ε ^′ _i of the allowable distortion of the stroke and the structural weight wi. The former makes the matching process flexible and improves the system's ability to adapt to font changes, while the latter emphasizes the emphasis and simplifies the matching. Figure 7 and Figure 10 constitute the structural model of the system. Fig. 11 shows the flow chart of matching identification using knowledge guidance. Fig. 12 is a flow chart of a certain sub-plane stroke element matching. According to the similar character codes given by the pre-classification, the corresponding character models are taken out one by one from the knowledge base, and the conditional sorting program module shown in Figure 9 sorts the obtained stroke elements. On the four sub-planes of h, v, s, b, the stroke elements in the model and the stroke elements of the character to be recognized are first matched within the group and then between the groups. When the attribute distance between the strokes is less than the specified threshold δ, the matching is considered successful. Otherwise, the match is considered a failure. If the matching fails, search down whether the character stroke element matches, if there is no match possible, take the next model stroke element to match. This process continues until the last model stroke element. If all the model stroke elements are successfully matched or the character stroke elements are matched, the attribute distance of all strokes is calculated according to the specified weight. If the distance is within the threshold △ range, it can be considered as a matching candidate. If there is an allowable distortion direction in the unmatched stroke element in the model, turn to the sample sub-plane search matching in the corresponding direction, and the method is the same. For the character models included in the matching candidates, further check whether the characters to be recognized meet the specified connection relationship Ω _mn , for example: husband, sky; force, knife;..., the intersecting relationship of husband and force is necessary, and this requirement is not met Exit the matching candidate column. If there is a requirement for stroke element comparison in the model framework, check whether the requirement is met, and the exit candidate column that does not meet the comparison condition. Repeat the above matching comparison until all classification models are matched. All characters whose matching total distance is within the threshold range are arranged in ascending order of distance as recognition candidates. The first candidate is the first candidate, which is usually taken as the recognition result. If there is no candidate for recognition, it will be treated as rejection.

Claims

1. A character recognition method, the first step is to scan the page with characters written on it to obtain the character image; the second step is to convert the character image into binarization, character segmentation and normalization: to extract the character binarization lattice The third step is the stroke structural feature; the fourth step is to obtain the classification feature code from the structural feature to determine the category to which it belongs; the fifth step is to match the structural feature with the character model of the category to which it belongs; the fifth step is to make the recognition result visible Output is the sixth step, and the feature of the present invention is:

Said third step comprises:

(1) The character structure pattern as a whole can be decomposed into three sub-patterns: meta-character, stroke and stroke-meta. A metacharacter is a character for constructing a character, and a stroke is decomposed into a straight line segment, which is a stroke element. The stroke unit is the lowest sub-pattern, used as a structural primitive to describe the character mode, and its structural features include stroke unit center coordinates, length, direction and connection relationship.

(2) Do a simple scan on the character lattice, detect the connection between each pixel and the adjacent pixel in 8 directions, and distinguish it as the beginning, end, connection area or common stroke elements of the stroke and mark them Corresponding symbols, thereby converting the character dot matrix plane (CDP) into the character pixel attribute plane (CAP).

(3) Except for the pixels belonging to the connection area, for the pixels at the edge points on the CAP, calculate the number of continuous pixels en in the four directions of "|", "一", "/", and "\" , the direction with the largest en is taken as the main fiber direction of the edge point. The value of en in the main direction is called the fiber length, and the pixels connected on the fiber length are given the corresponding weight of the main direction. The fibers at each edge point may intersect to form an interweaving area, and the direction weights of the pixels in the interweaving area are accumulated. After all the edge points complete the above calculations, the character fiber structure map (CFP) can be obtained.

(4) Compared with the directional characteristics of the CAP connection area, the noise fibers in the CFP were removed, and the fibers belonging to the four directions of "|", "一", "/", and "\" were placed in V, h, s, and b respectively. In the four planes, the center coordinates, length and direction of each stroke element can be obtained.

(5) Using the features of endpoints and connection areas of CAP, combined with the obtained center coordinates, length and direction of stroke elements, the connection relationship of stroke elements can be calculated.

Said fourth step comprises:

(1) Using the four-corner feature and four-edge feature of the character's peripheral structure as the classification feature of the character, the description and classification of the peripheral structure are carried out on two levels. A pre-classification dictionary is established from the four-corner features and four-edge features of known characters.

(2) On the stroke plane (CSP) of the character, with the four corners of the plane as the center, search for the stroke element closest to the four corners.

(3) Judge the stroke element direction attribute of the nearest corner point, and divide into six types of horizontal, vertical, left, right, corner, and cross, and give corresponding codes, which are called corner codes. The code string composed of four corner codes constitutes the first classification feature of the character.

(4) On the CSP, the ray is drawn from the center and scanned clockwise to obtain the polygon composed of the ray and the outermost stroke element of the character as the outer contour of the character, extract the convex points exceeding a certain threshold, and count the convex points on each side Points are obtained to obtain the second classification feature of the code strings on the four sides.

(5) Find the same kind of character codes in the pre-classification dictionary as the four-corner code and the four-side code of the character to be recognized, and complete the fourth step.

The fifth step described:

(1) The word meaning of the character structure adopts the knowledge expression in the form of a frame, and each character pattern is expressed by the character frame. In the framework, all the stroke elements constituting a character are grouped and sorted on the four planes of h, v, s, and b respectively, and the necessary stroke connection relations and the discrimination conditions of stroke element features between similar characters are listed. Each stroke element that participates in the grouping sort in the character frame is described by the stroke element frame. The stroke element frame expresses the normal direction, center position and length of the stroke element. Additionally, the stroke's weight and allowed distortion directions are given. The necessary connections in the character frame and the weights in the stroke element frame belong to the use of knowledge expression, emphasizing the stroke elements and their connections that have an important impact on the recognition results and ignoring those redundant or insignificant components. The similar character discrimination conditions and the allowed distortion direction enable the recognition process to take into account the subtle differences in the stroke structure of different characters in a complex and large character set, and to have good adaptability to the ever-changing glyphs.

(2) Take out the pre-classified character models of the same kind, search and match with the stroke meta-features of the character to be recognized in turn, and calculate the attribute distance. If the distance is less than a certain threshold, the match is considered successful, otherwise the match is considered failed. This process is executed sequentially on the four stroke element sub-planes of each model until the end.

(3) Calculate the weighted distance of the stroke meta attribute according to the weight specified by the stroke meta frame. The stroke elements that play a key role in the character structure have the highest weight, which is convenient for distinguishing the subtle differences of strokes between characters, and the stroke elements that have little influence have smaller weights, so as to achieve the purpose of ignoring redundant strokes.

(4) If there is a sample sub-plane in the unmatched stroke element that allows the distortion direction and turns to the corresponding direction, search and match.

(5) Detect the necessary connection relationship, and exit the matching candidate column if this requirement is not met.

(6) Detect stroke element comparison and similar character discrimination conditions, and exit the matching candidate list if the requirements are not met.

(7) Match all characters whose total distance is within the threshold range, sort according to the distance from small to large, and take out the smallest few as recognition candidates, if there is no recognition candidate, it will be rejected.