[go: up one dir, main page]

CN115205872B - A method for irregular arc distortion of text images for OCR text recognition - Google Patents

A method for irregular arc distortion of text images for OCR text recognition Download PDF

Info

Publication number
CN115205872B
CN115205872B CN202210814225.5A CN202210814225A CN115205872B CN 115205872 B CN115205872 B CN 115205872B CN 202210814225 A CN202210814225 A CN 202210814225A CN 115205872 B CN115205872 B CN 115205872B
Authority
CN
China
Prior art keywords
arc
image
text
point
annotation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210814225.5A
Other languages
Chinese (zh)
Other versions
CN115205872A (en
Inventor
蔡志旻
沈峰
魏建刚
陈羽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Runhe Runyun Technology Co ltd
Original Assignee
Nanjing Runhe Runyun Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Runhe Runyun Technology Co ltd filed Critical Nanjing Runhe Runyun Technology Co ltd
Priority to CN202210814225.5A priority Critical patent/CN115205872B/en
Publication of CN115205872A publication Critical patent/CN115205872A/en
Application granted granted Critical
Publication of CN115205872B publication Critical patent/CN115205872B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/191Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06V30/19147Obtaining sets of training patterns; Bootstrap methods, e.g. bagging or boosting

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Input (AREA)

Abstract

本发明公开了一种面向OCR文字识别的文本图像不规则弧形扭曲方法,获取原始文本图像和原始文本框标注文件;计算原始文本图像的最大弧形扭曲高度值;根据原始图像的宽度对图像进行分区,不同区域生成不同的弧形扭曲高度值和弧形波个数;根据不同区域的弧形扭曲高度值和弧形波个数计算该区域的图像像素点在Y轴方向上的偏移距离,得到图像像素点新坐标,将像素点按新坐标复制到空白图像上,生成不规则弧形扭曲图像;根据不同区域弧形扭曲高度值和弧形波个数重计算原始文本框标注坐标,生成新的多点标注文本框和标注文件。本发明生成不规则弧形扭曲的文本图像以及对应标注文件,为OCR训练弧形文本识别模型提供大量训练样本。

The present invention discloses an irregular arc distortion method for text images for OCR text recognition, which includes obtaining an original text image and an original text box annotation file; calculating the maximum arc distortion height value of the original text image; partitioning the image according to the width of the original image, generating different arc distortion height values and arc wave numbers in different regions; calculating the offset distance of the image pixel points in the Y-axis direction according to the arc distortion height values and arc wave numbers in different regions, obtaining new coordinates of the image pixel points, copying the pixel points to a blank image according to the new coordinates, generating an irregular arc distortion image; recalculating the original text box annotation coordinates according to the arc distortion height values and arc wave numbers in different regions, generating a new multi-point annotation text box and annotation file. The present invention generates irregular arc distortion text images and corresponding annotation files, and provides a large number of training samples for OCR training arc text recognition models.

Description

Text image irregular arc-shaped distortion method for OCR (optical character recognition)
Technical Field
The invention relates to the technical field of computer image recognition, in particular to an irregular arc-shaped distortion method of a text image for OCR (optical character recognition).
Background
OCR text recognition applications are becoming more and more widespread, and tens of thousands of sample images are typically required to train a text recognition system for different recognition scenarios. Most OCR characters in the market are rectangular texts, and the effect of recognizing arc-shaped distorted text scenes is poor, so that a large number of samples containing arc-shaped distorted images are required to be provided for OCR training models in order to improve the accuracy of OCR character recognition. However, in the actual collection process, the image sample with arc distortion is difficult to collect, and time and effort are consumed when the characters are marked in a multi-point manner.
Disclosure of Invention
In order to solve the defects in the prior art, the invention aims to provide an irregular arc-shaped distortion method for text images facing OCR character recognition, which automatically generates arc-shaped distortion sample images suitable for OCR arc-shaped distortion character recognition.
In order to achieve the purpose of the invention, the technical scheme adopted by the invention is as follows:
An irregular arc-shaped distortion method of a text image facing OCR text recognition comprises the following steps:
step one, acquiring an original text image and an original text box annotation file;
step two, calculating the maximum arc distortion height value of the original text image;
Partitioning the image according to the width of the original image, and generating different arc-shaped distortion height values and the number of arc-shaped waves in different areas;
calculating offset distances of image pixels of different areas in the Y-axis direction according to the arc distortion height values and the number of arc waves of the different areas to obtain new coordinates of the image pixels;
and fifthly, calculating the original text box annotation coordinates according to the arc distortion height values and the arc wave numbers of different areas, and generating a new multi-point annotation text box and an annotation file.
Further, in the first step, an original text image and an original text box annotation file are obtained, including:
reading the width Weight and the Height of the original text image;
the original text box annotation file is read, wherein the file name comprises annotation point coordinates [ (x 1, y 1), (x 2, y 2), (x 3, y 3), (x 4, y 4) ], namely [ text box upper left corner annotation point coordinates, text box upper right corner annotation point coordinates, text box lower left corner annotation point coordinates ], and text content is annotated.
Further, in the second step, calculating a maximum arc distortion height value of the original text image includes:
(2.1) calculating the height values of all text boxes in the annotation file, and taking the minimum height value;
calculating the upper boundary distance values of all the text boxes in the annotation file from the original image, and taking the minimum distance value;
(2.3) taking the minimum value of the three calculated values as the maximum height value max_radius_value of the irregular arc distortion of the image.
Further, in step (2.1), the coordinates of the labeling point of each text box are read, the minimum circumscribed rectangle of the text box is obtained, the height value of the text box is obtained through the minimum circumscribed rectangle, and the height value of each text box in the labeling file is calculated.
Further, in step (2.2), the coordinates of the labeling point of each text box are read, wherein the values y1 and y2 represent the upper boundary distance between the text box and the original text image, the values Height-y3 and Height-y4 represent the lower boundary distance between the text box and the original text image, and the upper and lower boundary distances between each text box in the labeling file and the original text image are calculated.
Further, in the third step, different areas generate different arc distortion height values and arc wave numbers, including:
The different regions randomly generate arc twist height values, and a value random_radius_value is randomly generated between the 1-max_radius_value values as the arc twist height value.
The number of the arc waves is randomly generated in different areas, and the recommended value of the number of the arc waves is between 1 and 5.
Further, in the fourth step, an offset value of the image pixel point in the Y-axis direction is calculated:
Y’=random_radian_value*sin((X_n/Width)*2*3.14*num)(1)
Wherein, range_radius_value represents the arc distortion height value of the image of the region, num represents the number of arc waves, width represents the Width value of the image of the region, and X_n represents the coordinate of the pixel point of the image on the X axis;
the new coordinates of the pixel point are (X _ n, Y _ n + Y'), where Y _ n represents the coordinates of the image pixel point in the Y-axis.
Further, in the fifth step, recalculating the original text box annotation coordinates to generate a new multi-point annotation text box, including:
Dividing the original four-point annotation into multiple point annotation points according to the number of characters of the annotation text content, calculating new coordinates of the divided annotation points, namely multiple point annotation coordinate points, according to the arc distortion height value and the number of arc waves, generating a new text box according to the generated irregular arc distortion image and the generated new coordinates, and generating an annotation file by the new coordinates and the original annotation text content.
Further, the original four-point label is divided into multiple-point labels by adopting a single-line segment point dividing method, which comprises the following steps:
Connecting every two adjacent marking points into a line segment to generate four directional line segments, wherein the four directional line segments are respectively an upper left marking point- > -an upper right marking point, an upper right marking point- > -a lower right marking point, a lower right marking point- > -a lower left marking point and an upper left marking point- >;
taking a line segment formed by the upper left marking point-the upper right marking point and the lower right marking point-the lower left marking point as a horizontal line segment, dividing the line segment into multiple points, generating multiple marking points, and calculating an x value;
Taking a line segment formed by the upper right marking point, the lower right marking point and the lower left marking point as vertical line segments, dividing the line segment into multiple points, generating multiple marking points, and calculating y values;
and combining the calculated x and y values in a one-to-one correspondence manner to generate a multipoint labeling point.
Compared with the prior art, the text image irregular arc distortion method for OCR character recognition has the advantages that the irregular arc distortion of the text image and the recalculation of the corresponding text box are achieved, the text image with the irregular arc distortion and the corresponding annotation file are generated, and a large number of training samples are provided for an OCR training arc text recognition model.
According to the method, the minimum height value of all the text boxes, the minimum distance value from the text boxes to the upper boundary of the original image and the minimum distance value from the text boxes to the lower boundary of the original image are calculated, and the minimum value in the three values is taken as the maximum height value of the maximum arc distortion of the image, so that the arc distortion height of each image can be adjusted in a self-adaptive mode, and the problem that the image content cannot be identified due to overlarge arc distortion degree of the image is avoided.
According to the invention, the image is divided into different areas according to the image width, and the different areas are subjected to arc distortion according to different arc distortion height values, so that a large number of different irregular arc distortion images are produced.
According to the method, the coordinate points of the original image are segmented into multi-point labeling coordinates according to the arc distortion height and the number of characters labeling the text content, and a multi-point labeling frame conforming to the arc distortion text is generated after subsequent recalculation, so that the characters are perfectly attached.
Drawings
FIG. 1 is a flow chart of a method for irregular arc distortion of text images for OCR character recognition according to the present invention;
FIG. 2 is a flow chart for calculating a maximum arc distortion height value for an original text image;
FIG. 3 is an original annotation image provided by an embodiment of the present invention;
FIG. 4 is a graph showing the segmentation of an original coordinate frame into multiple point labels according to the number of characters and the height of the distortion of the labels according to an embodiment of the present invention;
FIG. 5 is an image of irregular arc distortion provided by an embodiment of the present invention;
FIG. 6 is an irregular arc-shaped distortion image with an excessively large arc-shaped distortion height provided by an embodiment of the present invention;
FIG. 7 is an irregular curved warped image with a text box that is oversized from the upper and lower boundaries of the image, provided by an embodiment of the present invention;
FIG. 8 is an irregular curved distorted image with an excessive number of curved waves provided by an embodiment of the present invention.
Detailed Description
The technical scheme of the application is further described below with reference to the accompanying drawings and examples. The following examples are only for more clearly illustrating the technical aspects of the present application, and are not intended to limit the scope of the present application.
As shown in fig. 1, the irregular arc distortion method for text images facing OCR word recognition according to the present invention includes the following steps:
Step one, acquiring an original text image and a corresponding labeling file of an original text box;
the width (Weight) and Height (Height) of the original text image are read in preparation for subsequent segmentation of the text image and calculation of new coordinates of the image pixels.
And reading an original text box annotation file, and obtaining all annotation data, wherein the annotation data comprises file names, and the annotation point coordinates [ (x 1, y 1), (x 2, y 2), (x 3, y 3), (x 4, y 4) ], namely [ text box upper left corner annotation point coordinates, text box upper right corner annotation point coordinates, text box lower left corner annotation point coordinates ], and annotation text contents.
Step two, calculating the maximum arc distortion height value of the original text image, as shown in fig. 2;
(2.1) calculating the height values of all text boxes in the annotation file, and taking the minimum height value;
And D, obtaining the minimum circumscribed rectangle of each text box through the coordinates of the marking point of each text box read in the step I, obtaining the height value of the text box through the minimum circumscribed rectangle, and calculating the height value of each text box in the marking file, and taking the minimum value.
The minimum value is taken because the text content in the original text image is inconsistent in size, for example, the partial text size is one number, the partial text size is seven number, when the arc distortion height is too large, the effect on the large text is small, but the effect on the small text is large, even the small text is distorted and cannot be recognized, and the text content is destroyed, as shown in fig. 6. Therefore, the invention can summarize and view the actual production results, take the minimum text box height value to carry out arc distortion, not only can ensure that large-size fonts generate arc distortion, but also can ensure that small-size fonts do not destroy text contents while generating arc distortion.
Calculating the upper boundary distance values of all the text boxes in the annotation file from the original image, and taking the minimum distance value;
And (3) calculating the upper and lower boundary distances from each text box in the annotation file to the original text image by the annotation point coordinates of each text box read in the step one, wherein the values y1 and y2 represent the upper boundary distance from the text box to the original text image, the values Height-y3 and Height-y4 represent the lower boundary distance from the text box to the original text image, and the minimum value is taken.
This minimum is taken because when the upper or lower boundary is too small, and the arc distortion height is too large, it causes the text to shift excessively beyond the original text image height and disappear, as shown in fig. 7. Therefore, the invention can ensure that the text can be subjected to arc distortion by taking the minimum boundary distance value through summarizing and checking the actual production results, and can also ensure that the text content cannot exceed the height of the original image because of overlarge arc distortion.
(2.3) Taking the minimum value of the three calculated values as the maximum height value of the irregular arc distortion of the image;
the maximum arc distortion height value of each image can be calculated in a good self-adaptive manner by calculating the minimum height value of the text box, the minimum distance value from the text box to the upper boundary of the original image and the minimum distance value from the text box to the lower boundary of the original image and taking the minimum value of the three values as the maximum arc distortion height value max_radius_value of the image, so that the situation that the text content cannot be distinguished due to overlarge distortion of the image after arc distortion and the text content is lost due to overlarge distortion of the image after arc distortion is avoided.
Dividing the image according to the width of the original image, generating an arc distortion height value and the number of arc waves for each area, and generating different arc distortion height values and the number of arc waves for different areas;
for example, the original image is divided into two adjacent areas with the center point [ Weight/2,0] in the width direction of the original image as a demarcation point, and an arc distortion height value is randomly generated for each area. The original image may be divided into a plurality of adjacent areas as desired.
The method of randomly generating the arc-shaped warp height value is to randomly generate a value random_radius_value between 1-max_radius_value values. Since the range_radius_value is different for each block region, the arc distortion height is different for each block region, so that the original image can be deformed with irregular arc distortion.
The number of arc waves is randomly generated, the frequency of arc distortion can be controlled, and the recommended value is 1-5. If the number of the arc waves is too large, the arc distortion frequency of the image is too high, a plurality of arc waves appear, and the image content is destroyed, as shown in fig. 8. Therefore, the method and the device have the advantages that the actual production result summary is used for checking, the value is 1-5, the text content is not destroyed, and the image can be distorted well.
Calculating offset distances of image pixels of the area in the Y-axis direction according to the arc distortion height values and the number of arc waves of different areas, and calculating new coordinates of the image pixels;
The original text image is subjected to irregular arc distortion, only the offset value of each pixel point coordinate in the original image in the Y-axis direction is needed to be calculated, no change occurs in the X-axis direction, and calculation is not needed.
The formula for calculating the offset value of the coordinates of the image pixels in the Y-axis direction is as follows:
Y’=random_radian_value*sin((X_n/Width)*2*3.14*num) (1)
Wherein, range_radius_value represents the arc distortion height value of the image of the region, num represents the number of arc waves, width represents the Width value of the image of the region, and x_n represents the coordinate of the pixel point of the image on the X axis.
Calculating the new Y-axis coordinates of the image pixel points:
Y_n’=Y_n+Y’ (2)
wherein Y_n represents the coordinate of the pixel point on the Y axis, and Y' represents the offset distance value of the pixel point on the Y axis.
The new coordinates of the pixel point are (x_n, y_n'). And simultaneously, generating a blank image with the same width and height as those of the original image, copying the pixel points to the blank image according to the new coordinates when the new coordinates are calculated in the steps, and generating the irregular arc-shaped distorted image after all the pixel points on the original image are copied to the blank image.
And fifthly, dividing the original four-point annotation into multi-point annotation points according to the number of characters of the annotation text content, and recalculating original text box annotation coordinates according to the arc distortion height values and the number of arc waves of different areas to generate a new multi-point annotation text box and an annotation file.
After the irregular arc distortion of the image, the image content changes, the original text box cannot frame the arc distortion text content, and the original text box needs to be recalculated into the multi-point labeling text box, so that the text content is completely framed.
In order to convert the four-point marked text box into the multi-point marked text box, the invention also designs a single-line segment and point method for converting the four-point marked text box into the multi-point marked text box, which comprises the following specific steps:
As shown in fig. 3, the four-point labeling text box is composed of four labeling points, and every two adjacent labeling points are connected into a line segment, namely four directional line segments are generated, wherein the four directional line segments are respectively an upper left labeling point- > -an upper right labeling point, an upper right labeling point- > -a lower right labeling point, a lower right labeling point- > -a lower left labeling point and an upper left labeling point- >. And dividing the original text box into a multi-point labeling box according to the number of characters of the labeling text content.
And taking a line segment formed by the upper left marking point, the upper right marking point and the lower right marking point as horizontal line segments, dividing the line segment into multiple points, and generating multiple marking points. The calculation method comprises the following steps:
x=[x for x in np.linspace(x0,x1,math.ceil(len(word)+1))] (3)
wherein X0 and X1 respectively represent coordinates of two endpoints of a horizontal line segment on an X axis; len (word) +1 denotes the number of marked text characters plus 1, np.linspace (X0, X1, match.ceil (len (word) +1) denotes the uniform division of the line segment between the coordinates X0, X1 into len (word) +1 points.
And taking a line segment formed by the upper right marking point, the lower right marking point and the lower left marking point as vertical line segments, dividing the line segment into multiple points, and generating multiple marking points. The calculation method comprises the following steps:
y=[y for y in np.linspace(y0,y1,math.ceil(len(word)+1))] (4)
Wherein Y0, Y1 respectively represent coordinates of two endpoints of a vertical line segment on a Y axis, len (word) +1 represents the number of marked text characters added with 1, np.
And combining the calculated x and y values in a one-to-one correspondence manner to generate a multi-point labeling point, as shown in fig. 4.
And calculating new coordinates of the divided marking points according to the arc distortion height value and the number of the arc waves, namely putting all coordinate points into the formula (1) and the formula (2) for calculation to obtain corresponding new coordinates, namely the multi-point marking coordinate points. And (3) generating a new text box according to the irregular arc distorted image generated in the step (4) and the generated new coordinates, wherein a final effect diagram is shown in fig. 5, and generating a labeling file by the new coordinates and the original labeling text content.
While the applicant has described and illustrated the embodiments of the present invention in detail with reference to the drawings, it should be understood by those skilled in the art that the above embodiments are only preferred embodiments of the present invention, and the detailed description is only for the purpose of helping the reader to better understand the spirit of the present invention, and not to limit the scope of the present invention, but any improvements or modifications based on the spirit of the present invention should fall within the scope of the present invention.

Claims (8)

1. An irregular arc-shaped distortion method of a text image for OCR character recognition is characterized by comprising the following steps:
step one, acquiring an original text image and an original text box annotation file;
Step two, calculating a maximum arc distortion height value of the original text image, which comprises the following steps:
(2.1) calculating the height values of all text boxes in the annotation file, and taking the minimum height value;
calculating the upper boundary distance values of all the text boxes in the annotation file from the original image, and taking the minimum distance value;
(2.3) taking the minimum value of the three calculated values as the maximum height value max_radius_value of the irregular arc distortion of the image;
Partitioning the image according to the width of the original image, and generating different arc-shaped distortion height values and the number of arc-shaped waves in different areas;
calculating offset distances of image pixels of different areas in the Y-axis direction according to the arc distortion height values and the number of arc waves of the different areas to obtain new coordinates of the image pixels;
and fifthly, calculating the original text box annotation coordinates according to the arc distortion height values and the arc wave numbers of different areas, and generating a new multi-point annotation text box and an annotation file.
2. The method for irregular arc distortion of text images for OCR word recognition according to claim 1,
In the first step, an original text image and an original text box annotation file are obtained, which comprises the following steps:
reading the width Weight and the Height of the original text image;
the original text box annotation file is read, wherein the file name comprises annotation point coordinates [ (x 1, y 1), (x 2, y 2), (x 3, y 3), (x 4, y 4) ], namely [ text box upper left corner annotation point coordinates, text box upper right corner annotation point coordinates, text box lower left corner annotation point coordinates ], and text content is annotated.
3. The method for irregular arc distortion of text images for OCR word recognition according to claim 1,
In the step (2.1), the coordinates of the marking point of each text box are read, the minimum circumscribed rectangle of the text box is obtained, the height value of the text box is obtained through the minimum circumscribed rectangle, and the height value of each text box in the marking file is calculated.
4. The method for irregular arc distortion of text images for OCR word recognition according to claim 1,
In the step (2.2), the coordinate of the labeling point of each text box is read, wherein the values y1 and y2 represent the upper boundary distance between the text box and the original text image, the values Height-y3 and Height-y4 represent the lower boundary distance between the text box and the original text image, and the upper and lower boundary distances between each text box in the labeling file and the original text image are calculated.
5. The method for irregular arc distortion of text images for OCR word recognition according to claim 1,
In the third step, different areas generate different arc distortion height values and arc wave numbers, including:
the arc-shaped distortion height values are randomly generated in different areas, and a value random_radius_value is randomly generated between 1-max_radius_value values to serve as the arc-shaped distortion height value;
The number of the arc waves is randomly generated in different areas, and the recommended value of the number of the arc waves is between 1 and 5.
6. The method for irregular arc distortion of text images for OCR word recognition according to claim 1,
In the fourth step, calculating offset values of the image pixels in the Y-axis direction:
Y’ = random_radian_value * sin((X_n / Width)* 2 * 3.14 * num)(1)
Wherein, range_radius_value represents the arc distortion height value of the image of the region, num represents the number of arc waves, width represents the Width value of the image of the region, and X_n represents the coordinate of the pixel point of the image on the X axis;
The new coordinates of the pixel point are (X _ n, Y _ n + Y'), where Y _ n represents the coordinates of the image pixel point in the Y-axis.
7. The method for irregular arc distortion of text images for OCR word recognition according to claim 1,
In the fifth step, the original text box annotation coordinates are recalculated to generate a new multi-point annotation text box, which comprises the following steps:
Dividing the original four-point annotation into multiple point annotation points according to the number of characters of the annotation text content, calculating new coordinates of the divided annotation points, namely multiple point annotation coordinate points, according to the arc distortion height value and the number of arc waves, generating a new text box according to the generated irregular arc distortion image and the generated new coordinates, and generating an annotation file by the new coordinates and the original annotation text content.
8. The method for irregular arc distortion of text images for OCR word recognition according to claim 7,
Dividing the original four-point label into multi-point labels by adopting a single-line segment point dividing method, wherein the method comprises the following steps:
Connecting every two adjacent marking points into a line segment to generate four directional line segments, wherein the four directional line segments are respectively an upper left marking point- > -an upper right marking point, an upper right marking point- > -a lower right marking point, a lower right marking point- > -a lower left marking point and an upper left marking point- >;
taking a line segment formed by the upper left marking point-the upper right marking point and the lower right marking point-the lower left marking point as a horizontal line segment, dividing the line segment into multiple points, generating multiple marking points, and calculating an x value;
Taking a line segment formed by the upper right marking point, the lower right marking point and the lower left marking point as vertical line segments, dividing the line segment into multiple points, generating multiple marking points, and calculating y values;
and combining the calculated x and y values in a one-to-one correspondence manner to generate a multipoint labeling point.
CN202210814225.5A 2022-07-11 2022-07-11 A method for irregular arc distortion of text images for OCR text recognition Active CN115205872B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210814225.5A CN115205872B (en) 2022-07-11 2022-07-11 A method for irregular arc distortion of text images for OCR text recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210814225.5A CN115205872B (en) 2022-07-11 2022-07-11 A method for irregular arc distortion of text images for OCR text recognition

Publications (2)

Publication Number Publication Date
CN115205872A CN115205872A (en) 2022-10-18
CN115205872B true CN115205872B (en) 2025-02-14

Family

ID=83579435

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210814225.5A Active CN115205872B (en) 2022-07-11 2022-07-11 A method for irregular arc distortion of text images for OCR text recognition

Country Status (1)

Country Link
CN (1) CN115205872B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113392780A (en) * 2021-06-17 2021-09-14 南开大学 Character image distortion deformation method for AI recognition of Chinese characters
CN113505741A (en) * 2021-07-27 2021-10-15 京东科技控股股份有限公司 Text image processing method and device, electronic equipment and storage medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009065640A (en) * 2007-08-10 2009-03-26 Sharp Corp Image forming device, image formation method, image formation program, and computer-readable recording medium with recording image forming program stored
CN108549893B (en) * 2018-04-04 2020-03-31 华中科技大学 An End-to-End Recognition Method for Scene Texts of Arbitrary Shapes

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113392780A (en) * 2021-06-17 2021-09-14 南开大学 Character image distortion deformation method for AI recognition of Chinese characters
CN113505741A (en) * 2021-07-27 2021-10-15 京东科技控股股份有限公司 Text image processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN115205872A (en) 2022-10-18

Similar Documents

Publication Publication Date Title
CN110210400B (en) Table file detection method and equipment
JP4539756B2 (en) Image processing apparatus and image processing program
CN112085022B (en) Method, system and equipment for recognizing characters
CN110516554A (en) A multi-scene multi-font Chinese character detection and recognition method
CN102184383B (en) Automatic generation method of image sample of printed character
CN111126266B (en) Text processing method, text processing system, equipment and medium
CN105160343A (en) Information identification method and device applied to film on-demand-printing system
CN111291752A (en) Invoice identification method, equipment and medium
CN114663897A (en) Table extraction method and table extraction system
CN118430000A (en) Bill OCR recognition method and system based on deep learning
US20080131000A1 (en) Method for generating typographical line
CN115082935B (en) Method, device and storage medium for correcting document images
CN115205872B (en) A method for irregular arc distortion of text images for OCR text recognition
CN111274863A (en) Text prediction method based on text peak probability density
CN117876348B (en) Automatic pre-detection method and device for paper cup design defects
JP2019159932A (en) Information processing device and program
CN112766418A (en) Image text direction classification method, device, equipment and storage medium
CN111898402A (en) Intelligent typesetting system
CN118135584A (en) Automatic handwriting form recognition method and system based on deep learning
CN115249362B (en) OCR table recognition method and system based on connectivity of pixels in stable direction
CN115147858A (en) Method, device, device and medium for generating handwritten form image data
KR102193403B1 (en) Deep learning-based learning data generation method including an outline and a caption elimination method in a drawing and learning data generation system
CN115019310A (en) Image-text identification method and equipment
US12254615B1 (en) Method and device for automatically pre-detecting paper cup design defects
CN116704518A (en) Text recognition method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant