CN115294571A

CN115294571A - Character recognition method of character scanning image, intelligent watch and electronic equipment

Info

Publication number: CN115294571A
Application number: CN202210971002.XA
Authority: CN
Inventors: 马兵会
Original assignee: Nanjing Baige Zhengliu Network Technology Co ltd
Current assignee: Beijing Baige Feichi Technology Co.,Ltd.
Priority date: 2022-08-14
Filing date: 2022-08-14
Publication date: 2022-11-04

Abstract

The invention discloses a text recognition method for text scanning images, a smart watch and an electronic device. The text recognition method for text scanning images is characterized in that: obtaining a single-frame time-series image of text scanning; Perform image preprocessing on the sequence images to obtain a preprocessed scanned image; perform image splicing operations based on image pyramid template matching on the preprocessed scanned images of adjacent frames to obtain a spliced image; perform text recognition on the spliced image to obtain a text scanned image text recognition results. The text recognition method for text scanning images of the present invention is especially suitable for smart watches equipped with a special scanning camera for text scanning recognition, and solves the problem that the existing text recognition method is applied to smart watches equipped with a special scanning camera for text scanning recognition. Low recognition rate and poor user experience.

Description

Character recognition method for scanned character image, smart watch and electronic device

技术领域technical field

本发明涉及智能移动设备领域，具体的涉及文字扫描图像的文字识别方法、智能手表及电子设备。The invention relates to the field of smart mobile devices, in particular to a character recognition method for scanned character images, smart watches and electronic devices.

背景技术Background technique

在工作和生活中经常需要扫描识别图片里的文字，比如遇到一些无法直接复制文字的网页，或者PPT、PDF文档，手动抄录它们效率实在太差，借助于文字扫描识别技术可以极大的提高图片中文字信息的获取效率。In work and life, it is often necessary to scan and identify text in pictures. For example, when encountering some web pages that cannot directly copy text, or PPT and PDF documents, the efficiency of manually copying them is too low. With the help of text scanning and recognition technology, it can be greatly improved. Acquisition efficiency of text information in pictures.

随着社会的信息化发展，文字扫描识别功能被越来越多的配置在各种软硬件产品上，对于智能手表而言，将自身的功能进行了拓展实现文字扫描识别功能。现有智能手表实现文字扫描识别功能一般通过自身配备的拍摄摄像头拍摄文本对象，由于拍摄摄像头与专用的扫描摄像头存在硬件结构的区别，拍摄摄像头拍摄的文本图片在进行文字识别时识别率会有所降低，用户使用智能手表进行文字扫描识别的使用体验不好。With the development of social informatization, text scanning and recognition functions are more and more configured on various hardware and software products. For smart watches, their own functions have been expanded to realize text scanning and recognition functions. Existing smart watches realize the function of text scanning and recognition, and generally use their own camera to capture text objects. Due to the difference in hardware structure between the camera and the dedicated scanning camera, the recognition rate of text images captured by the camera will be different when performing character recognition. Reduced, the user experience of using smart watches for text scanning and recognition is not good.

为了解决智能手表的文字扫描的识别率，现有智能手表上结合专门用于文字扫描的扫描摄像头，在使用智能手表进行文字扫描识别时，采用配置的扫描摄像头进行文字扫描，提升了文字识别率，且用户使用较为方便。而扫描摄像头被成熟应用在移动智能硬件的一个成功案例就是——智能扫描笔，智能扫描笔通过将扫描摄像头设置在笔头上与被扫描文本对象保持固定的角度和距离，可以获取更适于被识别出文字信息的扫描文本图片。智能手表受限于表体体积，虽然可以空间优化将扫描摄像头装入表体内部，但是扫描摄像头与被扫描文本对象之间的距离、角度都与智能扫描笔不同。因此，如果只是简单的将扫描摄像头“塞进”智能手表的表体，而不进行软件上算法的优化，智能手表集成扫描摄像头并不能有效的提升文字扫描识别率。In order to solve the text scanning recognition rate of the smart watch, the existing smart watch is combined with a scanning camera specially used for text scanning. When using the smart watch for text scanning recognition, the configured scanning camera is used for text scanning, which improves the text recognition rate. , and the user is more convenient to use. A successful case of the mature application of the scanning camera in mobile smart hardware is the intelligent scanning pen. By setting the scanning camera on the tip of the pen and keeping a fixed angle and distance from the scanned text object, the intelligent scanning pen can obtain a text object that is more suitable for being scanned. Image of scanned text with recognized text information. Smart watches are limited by the size of the watch body. Although the space can be optimized to install the scanning camera inside the watch body, the distance and angle between the scanning camera and the scanned text object are different from those of the smart scanning pen. Therefore, if the scanning camera is simply "plugged" into the body of the smart watch without optimizing the algorithm on the software, the integrated scanning camera of the smart watch cannot effectively improve the text scanning recognition rate.

因此，本发明所要解决的技术问题为：针对性配置专门的扫描摄像头进行文字扫描识别的智能手表，如何针对性的进行软件算法优化提升文字识别的识别率？Therefore, the technical problem to be solved by the present invention is: how to specifically optimize the software algorithm to improve the recognition rate of character recognition for a smart watch equipped with a special scanning camera for character scanning and recognition?

有鉴于此，特提出本发明。In view of this, the present invention is proposed.

发明内容Contents of the invention

为了解决上述问题，本发明的第一发明目的是提供一种适应性用于配置专门的扫描摄像头进行文字扫描识别的智能手表的文字扫描图像的文字识别方法，具体地，采用如下技术方案：In order to solve the above problems, the first object of the present invention is to provide a text recognition method adaptable to the text scanning image of a smart watch equipped with a special scanning camera for text scanning recognition. Specifically, the following technical solutions are adopted:

一种文字扫描图像的文字识别方法，包括：A character recognition method for a character scanned image, comprising:

获取文字扫描的单帧时间序列图像；Obtain a single-frame time-series image of text scanning;

针对所述单帧时间序列图像进行图像预处理，得到预处理扫描图像；performing image preprocessing on the single-frame time-series image to obtain a pre-processed scanned image;

将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作，得到拼接图像；performing an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;

针对所述拼接图像进行文字识别，得到文字扫描图像的文字识别结果。Character recognition is performed on the spliced image to obtain a character recognition result of the scanned character image.

作为本发明的可选实施方式，本实施例的一种文字扫描图像的文字识别方法中，所述获取文字扫描的单帧时间序列图像包括：As an optional implementation mode of the present invention, in a method for character recognition of a character scanned image in this embodiment, the acquisition of a single-frame time-series image of the character scan includes:

获取文字扫描的n幅单帧时间序列图像{P1,…,Pn}，图像Pn表示第n幅单帧时间序列图像；Obtain n single-frame time-series images {P1,...,Pn} of text scanning, and image Pn represents the nth single-frame time-series image;

针对所述单帧时间序列图像进行图片预处理，得到预处理扫描图像包括：Perform image preprocessing on the single-frame time-series image, and obtain the pre-processed scanned image including:

针对所述单帧时间序列图像{P1,…,Pn}中任一单帧时间序列图像Pn进行剪裁处理，去除单帧时间序列图像Pn中按照预设条件选出的目标剪裁部分；Perform clipping processing on any single-frame time-series image Pn in the single-frame time-series images {P1,...,Pn}, and remove the target clipping part selected according to preset conditions in the single-frame time-series image Pn;

针对剪裁处理后的单帧时间序列图像Pn进行亮度补偿处理；performing brightness compensation processing on the clipped single-frame time-series images Pn;

针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正处理，得到预处理扫描图像。A tilt correction process is performed on the single-frame time-series image Pn that has undergone the brightness compensation process to obtain a pre-processed scanned image.

作为本发明的可选实施方式，本实施例的一种文字扫描图像的文字识别方法中，所述针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正，得到预处理扫描图像包括：As an optional implementation mode of the present invention, in a character recognition method of a scanned character image in this embodiment, the tilt correction is performed on the single-frame time-series image Pn that has undergone the brightness compensation process, and the pre-processed scanned image obtained includes :

针对所述亮度补偿处理的单帧时间序列图像Pn依次进行倾斜校正和透视变换，所述倾斜校正的校正角度参数为预设值α；Sequentially performing tilt correction and perspective transformation on the single-frame time-series images Pn processed by the brightness compensation, the correction angle parameter of the tilt correction is a preset value α;

针对校正的图像进行Gamma补偿和动态线性补偿，获取均匀的背景亮度，得到预处理扫描图像；Perform Gamma compensation and dynamic linear compensation on the corrected image to obtain a uniform background brightness and obtain a pre-processed scanned image;

将n幅单帧时间序列图像{P1,…,Pn}经过图片预处理后对应得到n幅预处理扫描图像{newP1,newP2,…,newPn}。Preprocess the n single-frame time series images {P1,...,Pn} to obtain n preprocessed scanned images {newP1, newP2,...,newPn}.

作为本发明的可选实施方式，本实施例的一种文字扫描图像的文字识别方法中，所述将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作，得到拼接图像包括：As an optional implementation of the present invention, in the text recognition method of a scanned text image in this embodiment, the image stitching operation based on image pyramid template matching is performed on the pre-processed scanned images of adjacent frames, and the stitched image includes: :

将按时序到来的当前帧图像newPn提取sift特征；Extract the sift feature from the current frame image newPn arriving in time sequence;

将sift特征图做多层图像金字塔采样，逐层与上一帧图像newPre的图像金字塔特征图像进行做模板匹配，上层图像金字塔的匹配位置作为下层匹配位置的候选参数，The sift feature map is sampled in a multi-layer image pyramid, and template matching is performed layer by layer with the image pyramid feature image of the previous frame image newPre, and the matching position of the upper image pyramid is used as a candidate parameter for the matching position of the lower layer.

加速匹配过程，其中，对于第一帧图像的前一帧图像默认与当前帧相同；Accelerate the matching process, where the previous frame image of the first frame image is the same as the current frame by default;

将位置匹配完成的图像进行图像拷贝拼接，得到拼接图像，输出拼接图像并在屏幕显示。Copy and stitch the images whose positions have been matched to obtain a stitched image, output the stitched image and display it on the screen.

作为本发明的可选实施方式，本实施例的一种文字扫描图像的文字识别方法中，所述针对所述拼接图像进行文字识别，得到文字扫描图像的文字识别结果包括：As an optional implementation mode of the present invention, in a method for character recognition of a scanned character image in this embodiment, performing character recognition on the spliced image to obtain a character recognition result of the scanned character image includes:

所述n幅单帧时间序列图像{P1,…,Pn}经过图像拼接操作后对应得到n幅拼接图像{SP1,SP2,…,SPn}；The n pieces of single-frame time-series images {P1,...,Pn} undergo an image stitching operation to obtain n pieces of spliced images {SP1,SP2,...,SPn};

将当前拼接图像SPn的增量部分输入CNN网络，获得增量部分dFS(FeatureSequence)；Input the incremental part of the current spliced image SPn into the CNN network to obtain the incremental part dFS (FeatureSequence);

将一帧或者多帧拼接图像SPn的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别，得到文字扫描图像的文字识别结果。Input the incremental part dFS (Feature Sequence) of one or more frames of spliced image SPn into CRNN+CTC network for character sequence recognition, and obtain the character recognition result of the character scanned image.

作为本发明的可选实施方式，本实施例的一种文字扫描图像的文字识别方法包括：As an optional implementation mode of the present invention, a text recognition method of a text scanning image in this embodiment includes:

预设增量预置max{dFS(Feature Sequence)}；preset increment preset max{dFS(Feature Sequence)};

当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)达到预设增量预置max{dFS(Feature Sequence)}时，将当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别。When the incremental part dFS (Feature Sequence) of the stitched image of the previous frame or multiple frames reaches the preset incremental preset max{dFS (Feature Sequence)}, the incremental part dFS of the stitched image of the current frame or multiple frames (Feature Sequence) Enter the CRNN+CTC network for text sequence recognition.

将当前拼接图像SPn的增量部分输入CNN网络，获得增量部分的dFS(FeatureSequence),判断所述增量部分dFS(Feature Sequence)是否达到预设增量预置max{dFS(Feature Sequence)}；Input the incremental part of the current spliced image SPn into the CNN network to obtain the dFS (Feature Sequence) of the incremental part, and judge whether the incremental part dFS (Feature Sequence) reaches the preset incremental preset max{dFS (Feature Sequence)} ;

若判断结果为是，则将当前拼接图像SPn的增量部分的dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别，若判断结果为否，则将当前拼接图像SPn的增量部分与前一帧或者前几帧未输入CRNN+CTC网络的拼接图像SPnPre的增量部分的dFS(FeatureSequence)进行合并，进一步判断是否达到预设增量预置max{dFS(Feature Sequence)}；If the judgment result is yes, then input the dFS (Feature Sequence) of the incremental part of the current spliced image SPn into the CRNN+CTC network for character sequence recognition; Merge the dFS (Feature Sequence) of the incremental part of the spliced image SPnPre that has not been input into the CRNN+CTC network in one frame or the previous few frames, and further judge whether it reaches the preset incremental preset max{dFS (Feature Sequence)};

若判断结果为是，则将当前拼接图像SPn与前一帧或前几帧的拼接图像的增量部分的dFS(Feature Sequence)合并输入至CRNN+CTC网络进行文字序列识别，若判断结果为否，则继续获取下一帧拼接图像SPnext的增量部分并进行合并直至合并的增量部分达到max{dFS(Feature Sequence)}，或者所有拼接图像处理完成。If the judgment result is yes, merge the dFS (Feature Sequence) of the incremental part of the stitched image of the current spliced image SPn and the previous frame or previous frames into the CRNN+CTC network for text sequence recognition, if the judgment result is no , then continue to acquire the incremental part of the next frame of stitched image SPnext and merge it until the merged incremental part reaches max{dFS(Feature Sequence)}, or all stitched images are processed.

本发明的第二发明目的是提供一种智能手表，采用所述文字扫描图像的文字识别方法，对于通过配备扫描摄像头进行硬件改进的同时，通过采用本发明的文字扫描图像的文字识别方法，进行软件算法优化，可以提升文字扫描识别功能的识别率，提升用户使用体验，具体地，采用如下技术方案：The second object of the present invention is to provide a smart watch, using the text recognition method of the text scanning image, while improving the hardware by using the scanning camera, by using the text recognition method of the text scanning image of the present invention. The optimization of the software algorithm can improve the recognition rate of the text scanning recognition function and improve the user experience. Specifically, the following technical solutions are adopted:

一种智能手表，包括：A smart watch, comprising:

扫描模块，获取文字扫描的单帧时间序列图像；The scanning module acquires single-frame time-series images of text scanning;

图像预处理模块，针对所述单帧时间序列图像进行图像预处理，得到预处理扫描图像；An image preprocessing module, which performs image preprocessing on the single-frame time series image to obtain a preprocessed scanned image;

图像拼接模块，将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作，得到拼接图像；The image stitching module performs an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;

及文字识别模块，针对所述拼接图像进行文字识别，得到文字扫描图像的文字识别结果。and a character recognition module, which performs character recognition on the spliced image to obtain a character recognition result of the character scanned image.

本发明同时提供电子设备，包括处理器和存储器，所述存储器用于存储计算机可执行程序，当所述计算机程序被所述处理器执行时，所述处理器执行所述的一种文字扫描图像的文字识别方法。The present invention also provides electronic equipment, including a processor and a memory, the memory is used to store a computer executable program, and when the computer program is executed by the processor, the processor executes the above-mentioned text scanning image text recognition method.

本发明还提供一种计算机可读存储介质，存储有计算机可执行程序，所述计算机可执行程序被执行时，实现所述的一种文字扫描图像的文字识别方法。The present invention also provides a computer-readable storage medium storing a computer-executable program. When the computer-executable program is executed, the above-mentioned method for character recognition of a scanned character image is realized.

与现有技术相比，本发明的有益效果：Compared with prior art, the beneficial effect of the present invention:

本发明的文字扫描图像的文字识别方法针对扫描的单帧时间序列图像进行图像预处理，预处理后的扫描图像再进行后续的图片拼接及文字识别，这样，即使扫描的单帧时间序列图像存在缺陷和问题，也可以通过图片预处理的方式提升扫描图像的质量，从而提升了文字识别的识别率，特别适用于配置专门的扫描摄像头进行文字扫描识别的智能手表，解决现有文字识别方法应用在配置专门的扫描摄像头进行文字扫描识别的智能手表，文字扫描识别率低，用户使用体验差的问题。The text recognition method for scanned text images of the present invention performs image preprocessing on scanned single-frame time-series images, and the pre-processed scanned images are then subjected to subsequent image splicing and text recognition. In this way, even if the scanned single-frame time-series images exist Defects and problems can also improve the quality of scanned images through image preprocessing, thereby improving the recognition rate of text recognition. It is especially suitable for smart watches equipped with special scanning cameras for text scanning and recognition, and solves the application of existing text recognition methods. In smart watches equipped with special scanning cameras for text scanning and recognition, the text scanning recognition rate is low and the user experience is poor.

本发明的文字扫描图像的文字识别方法将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作，可以加速匹配过程，提升图像拼接速度，加快得到文字识别结果。The character recognition method of the scanned character image of the present invention performs the image stitching operation based on image pyramid template matching on the preprocessed scanned images of adjacent frames, which can accelerate the matching process, increase the speed of image stitching, and accelerate the obtaining of character recognition results.

附图说明：Description of drawings:

图1本发明实施例文字扫描图像的文字识别方法的流程图；Fig. 1 is the flow chart of the text recognition method of the text scanning image of the embodiment of the present invention;

图2本发明实施例智能手表的文字扫描识别流程图；Figure 2 is a flow chart of text scanning and recognition of smart watches according to an embodiment of the present invention;

图3本发明实施例智能手表的立体结构示意图；Fig. 3 is a schematic diagram of a three-dimensional structure of a smart watch according to an embodiment of the present invention;

图4本发明实施例智能手表的表体的立体结构示意图。Fig. 4 is a schematic diagram of the three-dimensional structure of the watch body of the smart watch according to the embodiment of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚，下面将结合附图，对本发明实施例中的技术方案进行清楚、完整的描述。显然，所描述的实施例是本发明的一部分实施例，而不是全部的实施例。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some, not all, embodiments of the present invention.

因此，以下对本发明的实施例的详细描述并非旨在限制要求保护的本发明的范围，而是仅仅表示本发明的部分实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。Therefore, the following detailed description of the embodiments of the present invention is not intended to limit the scope of the claimed invention, but merely represents some embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

需要说明的是，在不冲突的情况下，本发明中的实施例及实施例中的特征和技术方案可以相互组合。It should be noted that, in the case of no conflict, the embodiments of the present invention and the features and technical solutions in the embodiments can be combined with each other.

应注意到：相似的标号和字母在下面的附图中表示类似项，因此，一旦某一项在一个附图中被定义，则在随后的附图中不需要对其进行进一步定义和解释。It should be noted that like numerals and letters denote similar items in the following figures, therefore, once an item is defined in one figure, it does not require further definition and explanation in subsequent figures.

在本发明的描述中，需要说明的是，术语“上”、“下”等指示的方位或位置关系为基于附图所示的方位或位置关系，或者是该发明产品使用时惯常摆放的方位或位置关系，或者是本领域技术人员惯常理解的方位或位置关系，这类术语仅是为了便于描述本发明和简化描述，而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作，因此不能理解为对本发明的限制。此外，术语“第一”、“第二”等仅用于区分描述，而不能理解为指示或暗示相对重要性。In the description of the present invention, it should be noted that the orientation or positional relationship indicated by the terms "upper", "lower", etc. is based on the orientation or positional relationship shown in the drawings, or the conventionally placed position when the product of the invention is used. Orientation or positional relationship, or the orientation or positional relationship commonly understood by those skilled in the art, such terms are only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the referred device or element must have a specific orientation, Constructed and operative in a particular orientation and therefore are not to be construed as limitations of the invention. In addition, the terms "first", "second", etc. are only used for distinguishing descriptions, and should not be construed as indicating or implying relative importance.

参见图1所示，本实施例的一种文字扫描图像的文字识别方法，包括：Referring to shown in Fig. 1, a kind of text recognition method of text scanning image of the present embodiment comprises:

本实施例的文字扫描图像的文字识别方法针对扫描的单帧时间序列图像进行图像预处理，预处理后的扫描图像再进行后续的图片拼接及文字识别，这样，即使扫描的单帧时间序列图像存在缺陷和问题，也可以通过图片预处理的方式提升扫描图像的质量，从而提升了文字识别的识别率，特别适用于配置专门的扫描摄像头进行文字扫描识别的智能手表，解决现有文字识别方法应用在配置专门的扫描摄像头进行文字扫描识别的智能手表，文字扫描识别率低，用户使用体验差的问题。The text recognition method for scanned text images in this embodiment performs image preprocessing on scanned single-frame time-series images, and the pre-processed scanned images are then subjected to subsequent image splicing and text recognition. In this way, even scanned single-frame time-series images There are defects and problems, and the quality of scanned images can also be improved through image preprocessing, thereby improving the recognition rate of text recognition. It is especially suitable for smart watches equipped with special scanning cameras for text scanning and recognition, and solves the existing text recognition methods. Applied to smart watches equipped with a special scanning camera for text scanning and recognition, the text scanning recognition rate is low, and the user experience is poor.

本实施例的文字扫描图像的文字识别方法将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作，可以加速匹配过程，提升图像拼接速度，加快得到文字识别结果。The text recognition method for scanned text images of this embodiment performs image stitching operations based on image pyramid template matching on preprocessed scanned images of adjacent frames, which can speed up the matching process, increase the speed of image stitching, and speed up obtaining text recognition results.

进一步地，本实施例的一种文字扫描图像的文字识别方法中，所述获取文字扫描的单帧时间序列图像包括：Further, in a method for character recognition of scanned character images in this embodiment, the acquisition of single-frame time-series images of character scans includes:

获取文字扫描的n幅单帧时间序列图像{P1,…,Pn}，图像Pn表示第n幅单帧时间序列图像。Acquire n single-frame time-series images {P1,...,Pn} of text scanning, and image Pn represents the nth single-frame time-series image.

针对所述单帧时间序列图像{P1,…,Pn}中任一单帧时间序列图像Pn进行剪裁处理，去除单帧时间序列图像Pn中按照预设条件选出的目标剪裁部分，具体地，通过预设条件可将单帧时间序列图像Pn中的不需要和被遮挡的部分去除，不需要的部分包括例如设定的图形、图画、标识等；Perform clipping processing on any single-frame time-series image Pn in the single-frame time-series images {P1,...,Pn}, and remove the target clipping part selected according to preset conditions in the single-frame time-series image Pn, specifically, Unnecessary and covered parts in the single-frame time series image Pn can be removed through preset conditions, and the unnecessary parts include, for example, set graphics, pictures, logos, etc.;

进一步地，本市实施的一种文字扫描图像的文字识别方法中，所述针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正，得到预处理扫描图像包括：Further, in a text recognition method for scanned text images implemented in this city, the tilt correction is performed on the single-frame time-series images Pn that have undergone the brightness compensation processing, and the pre-processed scanned images obtained include:

具体地，本实施例倾斜校正的校正角度参数预设值α需要考虑用户在使用智能手表进行文字扫描图像时相对于被扫描文本对象的倾斜角度，由大量图像试验所确定，具体地，所述的校正角度参数预设值α可选取65°。Specifically, the preset value α of the correction angle parameter of the tilt correction in this embodiment needs to consider the tilt angle of the user relative to the scanned text object when using the smart watch to scan the image, which is determined by a large number of image experiments. Specifically, the The preset value α of the correction angle parameter can be selected as 65°.

本实施例补偿参数由大量图像试验获取，具体地，采用的Gamma系数为-0.3，动态线性补偿系数为330。The compensation parameters in this embodiment are obtained from a large number of image experiments. Specifically, the Gamma coefficient used is -0.3, and the dynamic linear compensation coefficient is 330.

进一步地，本实施例的一种文字扫描图像的文字识别方法中，所述将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作，得到拼接图像包括：Further, in the text recognition method of a text scanned image of the present embodiment, performing an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image includes:

具体地，将sift特征图做3层图像金字塔采样，逐层与上一帧图像newPre的图像金字塔特征图像进行做模板匹配，上层金字塔的匹配位置作为下层匹配位置的候选参数，以加速匹配过程。另外，对于第一帧图像的前一帧默认与当前帧相同。Specifically, the sift feature map is sampled in a three-layer image pyramid, and template matching is performed layer by layer with the image pyramid feature image of the previous frame image newPre, and the matching position of the upper pyramid is used as a candidate parameter for the matching position of the lower layer to speed up the matching process. In addition, the previous frame of the first frame image is the same as the current frame by default.

另外，本实施例的拼接图像会被显示与智能手表的屏幕上，在智能手表扫描移动过程中，时间序列图像newP连续到来时，图像连续拼接增长，屏幕上形成随着扫描移动而连续增长的动态图。In addition, the mosaic image in this embodiment will be displayed on the screen of the smart watch. During the scanning movement of the smart watch, when the time-series image newP arrives continuously, the images will continue to be spliced and grow, forming a continuously growing pattern on the screen with the scanning movement. dynamic picture.

进一步地，本实施例的文字扫描图像的文字识别方法中，所述针对所述拼接图像进行文字识别，得到文字扫描图像的文字识别结果包括：Further, in the character recognition method of the scanned character image in this embodiment, performing character recognition on the spliced image to obtain the character recognition result of the scanned character image includes:

具体地，本实施例的一种文字扫描图像的文字识别方法，包括：Specifically, a character recognition method of a character scanned image in this embodiment includes:

当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)达到预设增量预置max{dFS(Feature Sequence)}时，将当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别。增量识别大大加快了文字识别的速度，可设置一个增量阈值max{dFS(Feature Sequence)}(经验值是28)，减少CRNN+CTC网络启动次数。扫描完成，手表抬起时，只要增量>1，最后再启动一次增量识别。When the incremental part dFS (Feature Sequence) of the stitched image of the previous frame or multiple frames reaches the preset incremental preset max{dFS (Feature Sequence)}, the incremental part dFS of the stitched image of the current frame or multiple frames (Feature Sequence) Enter the CRNN+CTC network for text sequence recognition. Incremental recognition greatly speeds up text recognition, and an incremental threshold max{dFS(Feature Sequence)} (experience value is 28) can be set to reduce the number of CRNN+CTC network startups. After the scan is completed, when the watch is lifted, as long as the increment is > 1, the increment recognition will be started again at last.

本实施例同时提供一种智能手表，包括：This embodiment also provides a smart watch, including:

本实施例智能手表的文字扫描模块300专门用于字符扫描识别，字符包括字母、数字、运算符号、标点符号和其他符号，以及一些功能性符号；文字扫描模块300一般采用定焦的扫描摄像头，扫描摄像头的视场角FOV一般较大以获取较为完整的扫描字符，配合设定亮度的补光光源，在使用时文字扫描模块300需要与被扫描对象保持设定的距离以及角度，配合补光光源的补光，摄像头可以拍摄出更加适于进行字符识别的图像，提高字符扫描识别的准确率。The text scanning module 300 of the smart watch of this embodiment is specially used for character scanning and recognition, and the characters include letters, numbers, arithmetic symbols, punctuation marks and other symbols, as well as some functional symbols; the text scanning module 300 generally uses a fixed-focus scanning camera, The field of view FOV of the scanning camera is generally large to obtain relatively complete scanned characters. With the supplementary light source with set brightness, the text scanning module 300 needs to maintain a set distance and angle from the scanned object when in use, and cooperate with the supplementary light. With the supplementary light of the light source, the camera can capture images that are more suitable for character recognition, improving the accuracy of character scanning and recognition.

本实施例智能手表的文字扫描模块300直接设置在表体100上，这在智能手表行业属于创新性的结构设计，将文字扫描模块300设置在表体100上更易于功能模组的集成化安装，而且需要可以针对表体100进行综合性的防水结构设计，用户在使用时可以在表体100上进行各种功能操作，方便用户操作和使用。The text scanning module 300 of the smart watch in this embodiment is directly arranged on the watch body 100, which is an innovative structural design in the smart watch industry. Setting the text scanning module 300 on the watch body 100 is easier for the integrated installation of functional modules , and it is necessary to carry out a comprehensive waterproof structure design for the watch body 100, the user can perform various functional operations on the watch body 100 during use, which is convenient for the user to operate and use.

本实施例的智能手表通过配置专门用于文字扫描的文字扫描模块300，实现了智能手表的文字扫描识别功能，本实施例的智能手表采用的文字扫描图像的文字识别方法，针对扫描的单帧时间序列图像进行图像预处理，预处理后的扫描图像再进行后续的图片拼接及文字识别，这样，即使扫描的单帧时间序列图像存在缺陷和问题，也可以通过图片预处理的方式提升扫描图像的质量，从而提升了文字识别的识别率，特别适用于配置专门的扫描摄像头进行文字扫描识别的智能手表，解决现有文字识别方法应用在配置专门的扫描摄像头进行文字扫描识别的智能手表，文字扫描识别率低，用户使用体验差的问题。The smart watch of this embodiment realizes the text scanning recognition function of the smart watch by configuring the text scanning module 300 specially used for text scanning. The text recognition method of the text scanning image adopted by the smart watch of this embodiment is aimed at scanning single Image preprocessing is performed on the time series images, and the preprocessed scanned images are then subjected to subsequent image stitching and text recognition. In this way, even if there are defects and problems in the scanned single frame time series images, the scanned images can be improved through image preprocessing quality, thereby improving the recognition rate of text recognition, especially suitable for smart watches equipped with special scanning cameras for text scanning and recognition, solving the existing text recognition methods applied to smart watches equipped with special scanning cameras for text scanning and recognition, text Low scanning recognition rate and poor user experience.

参见图2所示的本实施例智能手表进行文字扫描识别的流程图。Refer to FIG. 2 for the flow chart of character scanning and recognition performed by the smart watch of this embodiment.

进一步地，本实施例智能手表的扫描模块获取文字扫描的单帧时间序列图像过程包括：Further, the scanning module of the smart watch in this embodiment acquires a single-frame time-series image process of text scanning including:

本实施例智能手表的图像预处理模块针对所述单帧时间序列图像进行图片预处理，得到预处理扫描图像包括：The image preprocessing module of the smart watch in this embodiment performs image preprocessing on the single-frame time-series image, and the obtained pre-processed scanned image includes:

本实施例智能手表的图像预处理模块针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正，得到预处理扫描图像包括：The image preprocessing module of the smart watch in this embodiment performs tilt correction on the single-frame time-series images Pn processed by the brightness compensation, and obtains the pre-processed scanned images including:

本实施例智能手表的图像拼接模块将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作，得到拼接图像包括：The image stitching module of the smart watch of this embodiment performs image stitching operations based on image pyramid template matching on the pre-processed scanned images of adjacent frames, and the stitched images obtained include:

本实施例智能手表的文字识别模块针对所述拼接图像进行文字识别，得到文字扫描图像的文字识别结果包括：The text recognition module of the smart watch of this embodiment performs text recognition on the spliced image, and the text recognition results obtained from the text scanning image include:

参见图3级图4所示，由于本实施例的智能手表采用专门的文字扫描模块300进行字符扫描，为了保证文字扫描模块300的字符扫描识别效果，需要针对智能手表进行结构改进以适配所述文字扫描模块，具体的，本实施例通过软件与硬件结合的方式来实现针对文字扫描模块300的控制，通过获取文字扫描模块300与被扫描对象之间的距离以及角度，从而来判断是否满足文字扫描模块300进行扫描的条件，若判断结果为是，则控制文字扫描模块300进行扫描，若判断结果为否，则文字扫描模块300不进行扫描，当智能手表运行扫描程序，但文字扫描模块300却不执行扫描操作时，需要在智能手表的屏幕101上进行信息提示，也可以对用户进行操作引导，以实现文字扫描模块300进行扫描。Referring to Figure 3 and Figure 4, since the smart watch of this embodiment uses a special character scanning module 300 to scan characters, in order to ensure the character scanning recognition effect of the character scanning module 300, it is necessary to improve the structure of the smart watch to adapt to the Describe the text scanning module, specifically, this embodiment implements the control for the text scanning module 300 through the combination of software and hardware, by obtaining the distance and angle between the text scanning module 300 and the scanned object, thereby judging whether it satisfies The condition that the text scanning module 300 scans, if the judgment result is yes, then the text scanning module 300 is controlled to scan, if the judgment result is no, then the text scanning module 300 does not scan, when the smart watch runs the scanning program, but the text scanning module 300 does not execute the scanning operation, it is necessary to prompt information on the screen 101 of the smart watch, and also provide operation guidance to the user, so as to realize the text scanning module 300 to scan.

本实施例公开的智能手表，在表体100上设置专门用于字符扫描的文字扫描模块300，通过文字扫描模块300可以实现文字的扫描识别，可以直接将扫描识别的文字直接进行同步显示，也可以针对扫描识别的文字进行同步翻译，实现文字扫描翻译功能。In the smart watch disclosed in this embodiment, a character scanning module 300 specially used for character scanning is arranged on the watch body 100, and the character scanning and recognition can be realized through the character scanning module 300, and the characters recognized by scanning can be directly displayed synchronously. Simultaneous translation can be performed on the scanned and recognized text to realize the text scanning and translation function.

具体地，为了便于用户利用文字扫描模块300进行字符扫描，本实施例所述的表体100上设置扫描定位部400，所述的扫描定位部400在文字扫描模块300进行扫描时与被扫描对象接触，使得所述文字扫描模块300与被扫描对象保持设定的距离和角度。Specifically, in order to facilitate the user to use the character scanning module 300 to scan characters, the watch body 100 described in this embodiment is provided with a scanning positioning part 400, and the scanning positioning part 400 is aligned with the scanned object when the character scanning module 300 scans. contact, so that the text scanning module 300 maintains a set distance and angle with the object to be scanned.

作为本实施例的可选实施方式，本实施例所述的扫描定位部400为设置在表体100上靠近所述文字扫描模块300的定位凸块，所述的定位凸块具有倾斜定位面，所述的倾斜定位面与被扫描对象贴合接触使得文字扫描模块300与被扫描对象保持预设距离和预设倾斜角度，文字扫描模块300即满足预设的扫描条件可以进行字符扫描。As an optional implementation of this embodiment, the scanning positioning part 400 described in this embodiment is a positioning bump disposed on the watch body 100 close to the character scanning module 300, and the positioning bump has an inclined positioning surface. The inclined positioning surface is in close contact with the scanned object so that the character scanning module 300 maintains a preset distance and a preset tilt angle from the scanned object, and the character scanning module 300 satisfies the preset scanning conditions and can scan characters.

进一步地，为了判断所述的扫描定位部400是否处于正确的扫描定位位置，本实施例所述的表体上设置用于监测所述扫描定位部400位置的检测传感器，所述的检测传感器在所述扫描定位部与被扫描对象接触时被触发检测信号，所述智能手表的主控模块根据所述检测信号触发所述文字扫描模块进行扫描。Further, in order to judge whether the scanning positioning part 400 is in the correct scanning positioning position, a detection sensor for monitoring the position of the scanning positioning part 400 is arranged on the watch body described in this embodiment, and the detection sensor is in the When the scanning positioning part is in contact with the object to be scanned, a detection signal is triggered, and the main control module of the smart watch triggers the character scanning module to scan according to the detection signal.

作为本实施例的可选实施方式，本实施例的检测传感器为设置在所述倾斜定位面上的距离传感器或者按压开关，所述的距离传感器可检测倾斜定位面与所述被扫描对象之间的距离从而判断两者是否贴合，所述的按压开关在所述倾斜定位面与所述被扫描对象接触贴合时闭合，发出电信号。As an optional implementation of this embodiment, the detection sensor in this embodiment is a distance sensor or a push switch arranged on the inclined positioning surface, and the distance sensor can detect the distance between the inclined positioning surface and the object to be scanned. In order to judge whether the two are attached, the push switch is closed when the inclined positioning surface is in contact with the object to be scanned, and an electric signal is sent.

本实施例的智能手表在使用字符扫描功能时，先触发启动所述文字扫描模块，所述文字扫描模块被触发启动后并非直接开始进行扫描；进一步地，需要判断当前智能手表与被扫描对象之间的距离以及角度是否满足文字扫描模块进行扫描工作的条件，具体通过判断扫描定位部是否处于被扫描对象的定位位置，若扫描定位部处于被扫描对象的定位位置，则直接触发文字扫描模块的补光灯开启，扫描摄像头开始进行图像扫描，若扫描定位部未处于被扫描对象的定位位置，则控制所述文字扫描模块处于待机状态，不进行扫描，并同时在智能手表的屏幕上进行信息提示，也可以对用户进行操作引导，以实现文字扫描模块进行扫描。When the smart watch of this embodiment uses the character scanning function, it first triggers and starts the character scanning module. After the character scanning module is triggered and starts, it does not start scanning directly; further, it is necessary to determine the current smart watch and the scanned object. Whether the distance and angle between them meet the conditions for the text scanning module to perform scanning work. Specifically, by judging whether the scanning positioning part is at the positioning position of the scanned object, if the scanning positioning part is at the positioning position of the scanned object, it will directly trigger the text scanning module. The supplementary light is turned on, and the scanning camera starts to scan the image. If the scanning positioning part is not in the positioning position of the scanned object, the text scanning module is controlled to be in a standby state, and the scanning is not performed, and information is displayed on the screen of the smart watch at the same time. As a reminder, the user can also be guided to operate, so as to realize the text scanning module to scan.

作为本实施例的可选实施方式，本实施例的智能手表，包括：As an optional implementation of this embodiment, the smart watch of this embodiment includes:

触发模块，设置在所述表体上，所述触发模块与所述主控模块通讯连接，启动扫描识别程序后，所述触发模块监测与被扫描对象之间的距离，当监测到所述距离小于预设值时，控制所述文字扫描模块开启，获取扫描图片。The trigger module is arranged on the watch body, and the trigger module is connected to the main control module in communication. After the scanning recognition program is started, the trigger module monitors the distance between the scanned object and the object to be scanned. When the distance is detected When the value is less than the preset value, the text scanning module is controlled to be turned on to acquire scanned pictures.

作为本实施例的可选实施方式，本实施例所述的智能手表包括表带组件200，所述表体100与表带组件200之间为活动连接，所述智能手表具有：As an optional implementation of this embodiment, the smart watch described in this embodiment includes a strap assembly 200, the watch body 100 and the strap assembly 200 are actively connected, and the smart watch has:

第一状态，所述表体100与表带组件200相互锁定，用于智能手表的佩戴；In the first state, the watch body 100 and the strap assembly 200 are mutually locked for wearing a smart watch;

第二状态，所述表体100与表带组件200解除锁定，所述表体100可相对表带组件200运动至预设位置，用于所述文字扫描模块300扫描进行字符信息扫描。In the second state, the watch body 100 and the strap assembly 200 are unlocked, and the watch body 100 can move relative to the strap assembly 200 to a preset position for scanning by the text scanning module 300 to scan character information.

本实施例通过将表体100与表带组件200之间采用活动连接的方式，用户根据使用需求可以调节表体100与表带组件200之间的相对状态，从而极大的方便用户进行字符扫描操作。In this embodiment, the user can adjust the relative state between the watch body 100 and the strap assembly 200 according to the use requirements by adopting a flexible connection between the watch body 100 and the strap assembly 200, thereby greatly facilitating the user to scan characters operate.

进一步地，所述表体100与表带组件200之间为可拆卸连接，所述智能手表具有：Further, the watch body 100 and the strap assembly 200 are detachably connected, and the smart watch has:

第一状态，所述的表体100与所述表带组件200固定连接，用于智能手表的佩戴；In the first state, the watch body 100 is fixedly connected with the strap assembly 200 for wearing a smart watch;

第二状态，所述的表体100由所述表带组件200上拆卸分离，用于所述文字扫描模块进行字符信息扫描。In the second state, the watch body 100 is disassembled and separated from the strap assembly 200 for the character scanning module to scan character information.

本实施例通过将表体100与表带组件200之间采用可拆卸的连接方式，用户在将智能手表佩戴在手腕上之后，若要利用文字扫描模块300进行字符扫描，则可直接将表体100由表带组件200上拆卸下来进行扫描操作，在完成扫描后，再将表体装回表带组件上即可。In this embodiment, by adopting a detachable connection between the watch body 100 and the strap assembly 200, after the user wears the smart watch on his wrist, if he wants to use the text scanning module 300 to scan characters, he can directly connect the watch body 100 is disassembled from the strap assembly 200 for scanning operation, and after the scanning is completed, the watch body can be put back on the strap assembly.

进一步地，本实施例的表体100与表带组件200之间具有第一状态和第二状态，且第二状态主要用于实现扫描组件的字符扫描，因此，本实施例可以通过判断所述表体100相对于表带组件200的状态判断是否自动触发文字扫描模块开启，具体方案如下：Further, there is a first state and a second state between the watch body 100 and the strap assembly 200 of this embodiment, and the second state is mainly used to realize the character scanning of the scanning assembly, therefore, this embodiment can judge the Whether the state of the watch body 100 relative to the strap assembly 200 is judged to automatically trigger the opening of the text scanning module, the specific scheme is as follows:

具体地，本实施例的智能手表的触发模块，设置在所述表体100上，所述的触发模块被配置为：监测并判断所述表体100相对于表带组件200为第二状态时，自动生成触发指令控制所述文字扫描模块进行字符信息扫描。Specifically, the trigger module of the smart watch of this embodiment is set on the watch body 100, and the trigger module is configured to: monitor and determine when the watch body 100 is in the second state relative to the strap assembly 200 and automatically generate a trigger instruction to control the character scanning module to scan character information.

进一步地，为了实现智能手表图像采集类应用的实现，例如拍摄图片、视频，视频通话以及人脸识别等，本实施例的智能手表包括摄像模块500，所述的摄像模块500设置在所述表体100上，所述的摄像模块500被配置为进行图像采集应用，所述的图像采集应用包括拍摄照片或者视频、视频通话及人脸识别。Further, in order to realize the application of smart watch image collection, such as taking pictures, videos, video calls and face recognition, etc., the smart watch of this embodiment includes a camera module 500, and the camera module 500 is set on the watch On the body 100, the camera module 500 is configured to perform image capture applications, and the image capture applications include taking pictures or videos, video calling and face recognition.

本实施例同时提供一种计算机可读存储介质，存储有计算机可执行程序，所述计算机可执行程序被执行时，实现所述文字扫描图像的文字识别方法。This embodiment also provides a computer-readable storage medium storing a computer-executable program. When the computer-executable program is executed, the character recognition method of the scanned character image is implemented.

本实施例所述存储介质可以包括在基带中或者作为载波一部分传播的数据信号，其中承载了可读程序代码。这种传播的数据信号可以采用多种形式，包括但不限于电磁信号、光信号或上述的任意合适的组合。可读存储介质还可以是可读存储介质以外的任何可读介质，该可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。可读存储介质上包含的程序代码可以用任何适当的介质传输，包括但不限于无线、有线、光缆、RF等等，或者上述的任意合适的组合。The storage medium in this embodiment may include a data signal propagated in baseband or as part of a carrier wave, and readable program codes are carried therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A readable storage medium may also be any readable medium other than a readable storage medium that can send, propagate or transport a program for use by or in conjunction with an instruction execution system, apparatus or device. The program code contained on the readable storage medium may be transmitted by any suitable medium, including but not limited to wireless, cable, optical cable, RF, etc., or any suitable combination of the above.

本实施例还提供了一种电子设备，包括处理器和存储器，所述存储器用于存储计算机可执行程序，当所述计算机程序被所述处理器执行时，所述处理器执行所述文字扫描图像的文字识别方法。This embodiment also provides an electronic device, including a processor and a memory, the memory is used to store a computer executable program, and when the computer program is executed by the processor, the processor executes the text scanning Text recognition methods for images.

电子设备以通用计算设备的形式表现。其中处理器可以是一个，也可以是多个并且协同工作。本发明也不排除进行分布式处理，即处理器可以分散在不同的实体设备中。本发明的电子设备并不限于单一实体，也可以是多个实体设备的总和。Electronic devices take the form of general-purpose computing devices. There can be one or more processors and they work together. The present invention does not exclude distributed processing, that is, processors may be dispersed in different physical devices. The electronic device of the present invention is not limited to a single entity, but may also be the sum of multiple entity devices.

所述存储器存储有计算机可执行程序，通常是机器可读的代码。所述计算机可读程序可以被所述处理器执行，以使得电子设备能够执行本发明的方法，或者方法中的至少部分步骤。The memory stores a computer executable program, typically machine readable code. The computer-readable program can be executed by the processor, so that the electronic device can execute the method of the present invention, or at least part of the steps in the method.

所述存储器包括易失性存储器，例如随机存取存储单元(RAM)和/或高速缓存存储单元，还可以是非易失性存储器，如只读存储单元(ROM)。The memory includes a volatile memory, such as a random access memory unit (RAM) and/or a cache memory unit, and may also be a nonvolatile memory, such as a read only memory unit (ROM).

应当理解，本发明的电子设备中还可以包括上述示例中未示出的元件或组件。例如，有些电子设备中还包括有显示屏等显示单元，有些电子设备还包括人机交互元件，例如按钮、键盘等。只要该电子设备能够执行存储器中的计算机可读程序以实现本发明方法或方法的至少部分步骤，均可认为是本发明所涵盖的电子设备。It should be understood that the electronic device of the present invention may also include elements or components not shown in the above examples. For example, some electronic devices also include display units such as display screens, and some electronic devices also include human-computer interaction components, such as buttons and keyboards. As long as the electronic device can execute the computer-readable program in the memory to realize the method or at least some steps of the method of the present invention, it can be considered as the electronic device covered by the present invention.

通过以上对实施方式的描述，本领域的技术人员易于理解，本发明可以由能够执行特定计算机程序的硬件来实现，例如本发明的系统，以及系统中包含的电子处理单元、服务器、客户端、手机、控制单元、处理器等。本发明也可以由执行本发明的方法的计算机软件来实现，例如由微处理器、电子控制单元，客户端、服务器端等执行的控制软件来实现。但需要说明的是，执行本发明的方法的计算机软件并不限于由一个或特定个的硬件实体中执行，其也可以是由不特定具体硬件的以分布式的方式来实现。对于计算机软件，软件产品可以存储在一个计算机可读的存储介质(可以是CD-ROM，U盘，移动硬盘等)中，也可以分布式存储于网络上，只要其能使得电子设备执行根据本发明的方法。Through the above description of the embodiments, those skilled in the art can easily understand that the present invention can be realized by hardware capable of executing specific computer programs, such as the system of the present invention, and the electronic processing unit, server, client, Mobile phones, control units, processors, etc. The present invention can also be implemented by computer software that executes the method of the present invention, such as by control software executed by a microprocessor, an electronic control unit, a client, a server, and the like. However, it should be noted that the computer software for executing the method of the present invention is not limited to be executed by one or a specific hardware entity, and it may also be implemented in a distributed manner by unspecified specific hardware. For computer software, the software product can be stored in a computer-readable storage medium (which can be CD-ROM, U disk, mobile hard disk, etc.), or can be distributed and stored on the network, as long as it can make the electronic equipment execute according to this document. method of invention.

以上实施例仅用以说明本发明而并非限制本发明所描述的技术方案，尽管本说明书参照上述的各个实施例对本发明已进行了详细的说明，但本发明不局限于上述具体实施方式，因此任何对本发明进行修改或等同替换；而一切不脱离发明的精神和范围的技术方案及其改进，其均涵盖在本发明的权利要求范围当中。The above embodiments are only used to illustrate the present invention and are not intended to limit the technical solutions described in the present invention. Although the specification has described the present invention in detail with reference to the above-mentioned embodiments, the present invention is not limited to the above-mentioned specific implementation methods, so Any modifications or equivalent replacements to the present invention; and all technical solutions and improvements that do not depart from the spirit and scope of the invention are covered by the scope of the claims of the present invention.

Claims

1. A text recognition method of text scanning image, is characterized in that, comprises:

Obtain a single-frame time-series image of text scanning;

performing image preprocessing on the single-frame time-series image to obtain a pre-processed scanned image;

performing an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;

Character recognition is performed on the spliced image to obtain a character recognition result of the scanned character image.

2. the text recognition method of a kind of text scanning image according to claim 1, is characterized in that, the single-frame time series image of described acquisition text scanning comprises:

Obtain n single-frame time-series images {P1,...,Pn} of text scanning, and image Pn represents the nth single-frame time-series image;

Perform image preprocessing on the single-frame time-series image, and obtain the pre-processed scanned image including:

Perform clipping processing on any single-frame time-series image Pn in the single-frame time-series images {P1,...,Pn}, and remove the target clipping part selected according to preset conditions in the single-frame time-series image Pn;

performing brightness compensation processing on the clipped single-frame time-series images Pn;

A tilt correction process is performed on the single-frame time-series image Pn that has undergone the brightness compensation process to obtain a pre-processed scanned image.

3. The character recognition method of a scanned character image according to claim 2, wherein said performing tilt correction on the single-frame time-series image Pn processed through said brightness compensation to obtain a pre-processed scanned image comprises:

Sequentially performing tilt correction and perspective transformation on the single-frame time-series images Pn processed by the brightness compensation, the correction angle parameter of the tilt correction is a preset value α;

Perform Gamma compensation and dynamic linear compensation on the corrected image to obtain a uniform background brightness and obtain a pre-processed scanned image;

Preprocess the n single-frame time series images {P1,...,Pn} to obtain n preprocessed scanned images {newP1, newP2,...,newPn}.

4. the text recognition method of a kind of text scanning image according to claim 3, it is characterized in that, described preprocessing scanning image of adjacent frame is carried out the image splicing operation based on image pyramid template matching, obtains splicing image and comprises:

Extract the sift feature from the current frame image newPn arriving in time sequence;

The sift feature map is sampled in a multi-layer image pyramid, and template matching is performed layer by layer with the image pyramid feature image of the previous frame image newPre, and the matching position of the upper image pyramid is used as a candidate parameter for the matching position of the lower layer.

Accelerate the matching process, where the previous frame image of the first frame image is the same as the current frame by default;

Copy and stitch the images whose positions have been matched to obtain a stitched image, output the stitched image and display it on the screen.

5. the text recognition method of a kind of text scanning image according to claim 4, it is characterized in that, described character recognition is carried out for described mosaic image, the text recognition result that obtains text scanning image comprises:

The n pieces of single-frame time-series images {P1,...,Pn} undergo an image stitching operation to obtain n pieces of spliced images {SP1,SP2,...,SPn};

Input the incremental part of the current spliced image SPn into the CNN network to obtain the incremental part dFS (Feature Sequence);

Input the incremental part dFS (Feature Sequence) of one or more frames of spliced image SPn into CRNN+CTC network for character sequence recognition, and obtain the character recognition result of the character scanned image.

6. The text recognition method of a kind of text scanning image according to claim 5, is characterized in that, comprises:

preset increment preset max{dFS(Feature Sequence)};

When the incremental part dFS (Feature Sequence) of the stitched image of the previous frame or multiple frames reaches the preset incremental preset max{dFS (Feature Sequence)}, the incremental part dFS of the stitched image of the current frame or multiple frames (Feature Sequence) Enter the CRNN+CTC network for text sequence recognition.

7. The character recognition method of a kind of character scan image according to claim 6, is characterized in that, comprises:

Input the incremental part of the current spliced image SPn into the CNN network to obtain the dFS (Feature Sequence) of the incremental part, and judge whether the incremental part dFS (Feature Sequence) reaches the preset incremental preset max{dFS (Feature Sequence)} ;

If the judgment result is yes, then input the dFS (Feature Sequence) of the incremental part of the current spliced image SPn into the CRNN+CTC network for character sequence recognition; Merge the dFS (Feature Sequence) of the incremental part of the spliced image SPnPre that has not been input into the CRNN+CTC network in one frame or the previous few frames, and further judge whether it reaches the preset incremental preset max{dFS (Feature Sequence)};

If the judgment result is yes, merge the dFS (Feature Sequence) of the incremental part of the stitched image of the current spliced image SPn and the previous frame or previous frames into the CRNN+CTC network for text sequence recognition, if the judgment result is no , then continue to acquire the incremental part of the next frame of stitched image SPnext and merge it until the merged incremental part reaches max{dFS(Feature Sequence)}, or all stitched images are processed.

8. A smart watch, characterized in that it comprises:

The scanning module acquires single-frame time-series images of text scanning;

An image preprocessing module, which performs image preprocessing on the single-frame time series image to obtain a preprocessed scanned image;

The image stitching module performs an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;

and a character recognition module, which performs character recognition on the spliced image to obtain a character recognition result of the character scanned image.

9. Electronic equipment, comprising a processor and a memory for storing computer-executable programs, characterized in that:

When the computer program is executed by the processor, the processor executes a character recognition method for a character scanned image according to any one of claims 1-7.

10. A computer-readable storage medium, storing a computer-executable program, characterized in that, when the computer-executable program is executed, the text scanning image according to any one of claims 1-7 is realized. Text recognition method.