CN115294571A - Character recognition method of character scanning image, intelligent watch and electronic equipment - Google Patents
Character recognition method of character scanning image, intelligent watch and electronic equipment Download PDFInfo
- Publication number
- CN115294571A CN115294571A CN202210971002.XA CN202210971002A CN115294571A CN 115294571 A CN115294571 A CN 115294571A CN 202210971002 A CN202210971002 A CN 202210971002A CN 115294571 A CN115294571 A CN 115294571A
- Authority
- CN
- China
- Prior art keywords
- image
- text
- scanning
- character
- scanned
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4038—Image mosaicing, e.g. composing plane images from plane sub-images
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4046—Scaling of whole images or parts thereof, e.g. expanding or contracting using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/146—Aligning or centring of the image pick-up or image-field
- G06V30/1463—Orientation detection or correction, e.g. rotation of multiples of 90 degrees
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/16—Image preprocessing
- G06V30/1607—Correcting image deformation, e.g. trapezoidal deformation caused by perspective
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/32—Indexing scheme for image data processing or generation, in general involving image mosaicing
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Character Discrimination (AREA)
Abstract
本发明公开了文字扫描图像的文字识别方法、智能手表及电子设备,所述文字扫描图像的文字识别方法,其特征在于,包括:获取文字扫描的单帧时间序列图像;针对所述单帧时间序列图像进行图像预处理,得到预处理扫描图像;将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像;针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果。本发明的文字扫描图像的文字识别方法,特别适用于配置专门的扫描摄像头进行文字扫描识别的智能手表,解决现有文字识别方法应用在配置专门的扫描摄像头进行文字扫描识别的智能手表,文字扫描识别率低,用户使用体验差的问题。
The invention discloses a text recognition method for text scanning images, a smart watch and an electronic device. The text recognition method for text scanning images is characterized in that: obtaining a single-frame time-series image of text scanning; Perform image preprocessing on the sequence images to obtain a preprocessed scanned image; perform image splicing operations based on image pyramid template matching on the preprocessed scanned images of adjacent frames to obtain a spliced image; perform text recognition on the spliced image to obtain a text scanned image text recognition results. The text recognition method for text scanning images of the present invention is especially suitable for smart watches equipped with a special scanning camera for text scanning recognition, and solves the problem that the existing text recognition method is applied to smart watches equipped with a special scanning camera for text scanning recognition. Low recognition rate and poor user experience.
Description
技术领域technical field
本发明涉及智能移动设备领域,具体的涉及文字扫描图像的文字识别方法、智能手表及电子设备。The invention relates to the field of smart mobile devices, in particular to a character recognition method for scanned character images, smart watches and electronic devices.
背景技术Background technique
在工作和生活中经常需要扫描识别图片里的文字,比如遇到一些无法直接复制文字的网页,或者PPT、PDF文档,手动抄录它们效率实在太差,借助于文字扫描识别技术可以极大的提高图片中文字信息的获取效率。In work and life, it is often necessary to scan and identify text in pictures. For example, when encountering some web pages that cannot directly copy text, or PPT and PDF documents, the efficiency of manually copying them is too low. With the help of text scanning and recognition technology, it can be greatly improved. Acquisition efficiency of text information in pictures.
随着社会的信息化发展,文字扫描识别功能被越来越多的配置在各种软硬件产品上,对于智能手表而言,将自身的功能进行了拓展实现文字扫描识别功能。现有智能手表实现文字扫描识别功能一般通过自身配备的拍摄摄像头拍摄文本对象,由于拍摄摄像头与专用的扫描摄像头存在硬件结构的区别,拍摄摄像头拍摄的文本图片在进行文字识别时识别率会有所降低,用户使用智能手表进行文字扫描识别的使用体验不好。With the development of social informatization, text scanning and recognition functions are more and more configured on various hardware and software products. For smart watches, their own functions have been expanded to realize text scanning and recognition functions. Existing smart watches realize the function of text scanning and recognition, and generally use their own camera to capture text objects. Due to the difference in hardware structure between the camera and the dedicated scanning camera, the recognition rate of text images captured by the camera will be different when performing character recognition. Reduced, the user experience of using smart watches for text scanning and recognition is not good.
为了解决智能手表的文字扫描的识别率,现有智能手表上结合专门用于文字扫描的扫描摄像头,在使用智能手表进行文字扫描识别时,采用配置的扫描摄像头进行文字扫描,提升了文字识别率,且用户使用较为方便。而扫描摄像头被成熟应用在移动智能硬件的一个成功案例就是——智能扫描笔,智能扫描笔通过将扫描摄像头设置在笔头上与被扫描文本对象保持固定的角度和距离,可以获取更适于被识别出文字信息的扫描文本图片。智能手表受限于表体体积,虽然可以空间优化将扫描摄像头装入表体内部,但是扫描摄像头与被扫描文本对象之间的距离、角度都与智能扫描笔不同。因此,如果只是简单的将扫描摄像头“塞进”智能手表的表体,而不进行软件上算法的优化,智能手表集成扫描摄像头并不能有效的提升文字扫描识别率。In order to solve the text scanning recognition rate of the smart watch, the existing smart watch is combined with a scanning camera specially used for text scanning. When using the smart watch for text scanning recognition, the configured scanning camera is used for text scanning, which improves the text recognition rate. , and the user is more convenient to use. A successful case of the mature application of the scanning camera in mobile smart hardware is the intelligent scanning pen. By setting the scanning camera on the tip of the pen and keeping a fixed angle and distance from the scanned text object, the intelligent scanning pen can obtain a text object that is more suitable for being scanned. Image of scanned text with recognized text information. Smart watches are limited by the size of the watch body. Although the space can be optimized to install the scanning camera inside the watch body, the distance and angle between the scanning camera and the scanned text object are different from those of the smart scanning pen. Therefore, if the scanning camera is simply "plugged" into the body of the smart watch without optimizing the algorithm on the software, the integrated scanning camera of the smart watch cannot effectively improve the text scanning recognition rate.
因此,本发明所要解决的技术问题为:针对性配置专门的扫描摄像头进行文字扫描识别的智能手表,如何针对性的进行软件算法优化提升文字识别的识别率?Therefore, the technical problem to be solved by the present invention is: how to specifically optimize the software algorithm to improve the recognition rate of character recognition for a smart watch equipped with a special scanning camera for character scanning and recognition?
有鉴于此,特提出本发明。In view of this, the present invention is proposed.
发明内容Contents of the invention
为了解决上述问题,本发明的第一发明目的是提供一种适应性用于配置专门的扫描摄像头进行文字扫描识别的智能手表的文字扫描图像的文字识别方法,具体地,采用如下技术方案:In order to solve the above problems, the first object of the present invention is to provide a text recognition method adaptable to the text scanning image of a smart watch equipped with a special scanning camera for text scanning recognition. Specifically, the following technical solutions are adopted:
一种文字扫描图像的文字识别方法,包括:A character recognition method for a character scanned image, comprising:
获取文字扫描的单帧时间序列图像;Obtain a single-frame time-series image of text scanning;
针对所述单帧时间序列图像进行图像预处理,得到预处理扫描图像;performing image preprocessing on the single-frame time-series image to obtain a pre-processed scanned image;
将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像;performing an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;
针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果。Character recognition is performed on the spliced image to obtain a character recognition result of the scanned character image.
作为本发明的可选实施方式,本实施例的一种文字扫描图像的文字识别方法中,所述获取文字扫描的单帧时间序列图像包括:As an optional implementation mode of the present invention, in a method for character recognition of a character scanned image in this embodiment, the acquisition of a single-frame time-series image of the character scan includes:
获取文字扫描的n幅单帧时间序列图像{P1,…,Pn},图像Pn表示第n幅单帧时间序列图像;Obtain n single-frame time-series images {P1,...,Pn} of text scanning, and image Pn represents the nth single-frame time-series image;
针对所述单帧时间序列图像进行图片预处理,得到预处理扫描图像包括:Perform image preprocessing on the single-frame time-series image, and obtain the pre-processed scanned image including:
针对所述单帧时间序列图像{P1,…,Pn}中任一单帧时间序列图像Pn进行剪裁处理,去除单帧时间序列图像Pn中按照预设条件选出的目标剪裁部分;Perform clipping processing on any single-frame time-series image Pn in the single-frame time-series images {P1,...,Pn}, and remove the target clipping part selected according to preset conditions in the single-frame time-series image Pn;
针对剪裁处理后的单帧时间序列图像Pn进行亮度补偿处理;performing brightness compensation processing on the clipped single-frame time-series images Pn;
针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正处理,得到预处理扫描图像。A tilt correction process is performed on the single-frame time-series image Pn that has undergone the brightness compensation process to obtain a pre-processed scanned image.
作为本发明的可选实施方式,本实施例的一种文字扫描图像的文字识别方法中,所述针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正,得到预处理扫描图像包括:As an optional implementation mode of the present invention, in a character recognition method of a scanned character image in this embodiment, the tilt correction is performed on the single-frame time-series image Pn that has undergone the brightness compensation process, and the pre-processed scanned image obtained includes :
针对所述亮度补偿处理的单帧时间序列图像Pn依次进行倾斜校正和透视变换,所述倾斜校正的校正角度参数为预设值α;Sequentially performing tilt correction and perspective transformation on the single-frame time-series images Pn processed by the brightness compensation, the correction angle parameter of the tilt correction is a preset value α;
针对校正的图像进行Gamma补偿和动态线性补偿,获取均匀的背景亮度,得到预处理扫描图像;Perform Gamma compensation and dynamic linear compensation on the corrected image to obtain a uniform background brightness and obtain a pre-processed scanned image;
将n幅单帧时间序列图像{P1,…,Pn}经过图片预处理后对应得到n幅预处理扫描图像{newP1,newP2,…,newPn}。Preprocess the n single-frame time series images {P1,...,Pn} to obtain n preprocessed scanned images {newP1, newP2,...,newPn}.
作为本发明的可选实施方式,本实施例的一种文字扫描图像的文字识别方法中,所述将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像包括:As an optional implementation of the present invention, in the text recognition method of a scanned text image in this embodiment, the image stitching operation based on image pyramid template matching is performed on the pre-processed scanned images of adjacent frames, and the stitched image includes: :
将按时序到来的当前帧图像newPn提取sift特征;Extract the sift feature from the current frame image newPn arriving in time sequence;
将sift特征图做多层图像金字塔采样,逐层与上一帧图像newPre的图像金字塔特征图像进行做模板匹配,上层图像金字塔的匹配位置作为下层匹配位置的候选参数,The sift feature map is sampled in a multi-layer image pyramid, and template matching is performed layer by layer with the image pyramid feature image of the previous frame image newPre, and the matching position of the upper image pyramid is used as a candidate parameter for the matching position of the lower layer.
加速匹配过程,其中,对于第一帧图像的前一帧图像默认与当前帧相同;Accelerate the matching process, where the previous frame image of the first frame image is the same as the current frame by default;
将位置匹配完成的图像进行图像拷贝拼接,得到拼接图像,输出拼接图像并在屏幕显示。Copy and stitch the images whose positions have been matched to obtain a stitched image, output the stitched image and display it on the screen.
作为本发明的可选实施方式,本实施例的一种文字扫描图像的文字识别方法中,所述针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果包括:As an optional implementation mode of the present invention, in a method for character recognition of a scanned character image in this embodiment, performing character recognition on the spliced image to obtain a character recognition result of the scanned character image includes:
所述n幅单帧时间序列图像{P1,…,Pn}经过图像拼接操作后对应得到n幅拼接图像{SP1,SP2,…,SPn};The n pieces of single-frame time-series images {P1,...,Pn} undergo an image stitching operation to obtain n pieces of spliced images {SP1,SP2,...,SPn};
将当前拼接图像SPn的增量部分输入CNN网络,获得增量部分dFS(FeatureSequence);Input the incremental part of the current spliced image SPn into the CNN network to obtain the incremental part dFS (FeatureSequence);
将一帧或者多帧拼接图像SPn的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别,得到文字扫描图像的文字识别结果。Input the incremental part dFS (Feature Sequence) of one or more frames of spliced image SPn into CRNN+CTC network for character sequence recognition, and obtain the character recognition result of the character scanned image.
作为本发明的可选实施方式,本实施例的一种文字扫描图像的文字识别方法包括:As an optional implementation mode of the present invention, a text recognition method of a text scanning image in this embodiment includes:
预设增量预置max{dFS(Feature Sequence)};preset increment preset max{dFS(Feature Sequence)};
当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)达到预设增量预置max{dFS(Feature Sequence)}时,将当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别。When the incremental part dFS (Feature Sequence) of the stitched image of the previous frame or multiple frames reaches the preset incremental preset max{dFS (Feature Sequence)}, the incremental part dFS of the stitched image of the current frame or multiple frames (Feature Sequence) Enter the CRNN+CTC network for text sequence recognition.
作为本发明的可选实施方式,本实施例的一种文字扫描图像的文字识别方法包括:As an optional implementation mode of the present invention, a text recognition method of a text scanning image in this embodiment includes:
将当前拼接图像SPn的增量部分输入CNN网络,获得增量部分的dFS(FeatureSequence),判断所述增量部分dFS(Feature Sequence)是否达到预设增量预置max{dFS(Feature Sequence)};Input the incremental part of the current spliced image SPn into the CNN network to obtain the dFS (Feature Sequence) of the incremental part, and judge whether the incremental part dFS (Feature Sequence) reaches the preset incremental preset max{dFS (Feature Sequence)} ;
若判断结果为是,则将当前拼接图像SPn的增量部分的dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别,若判断结果为否,则将当前拼接图像SPn的增量部分与前一帧或者前几帧未输入CRNN+CTC网络的拼接图像SPnPre的增量部分的dFS(FeatureSequence)进行合并,进一步判断是否达到预设增量预置max{dFS(Feature Sequence)};If the judgment result is yes, then input the dFS (Feature Sequence) of the incremental part of the current spliced image SPn into the CRNN+CTC network for character sequence recognition; Merge the dFS (Feature Sequence) of the incremental part of the spliced image SPnPre that has not been input into the CRNN+CTC network in one frame or the previous few frames, and further judge whether it reaches the preset incremental preset max{dFS (Feature Sequence)};
若判断结果为是,则将当前拼接图像SPn与前一帧或前几帧的拼接图像的增量部分的dFS(Feature Sequence)合并输入至CRNN+CTC网络进行文字序列识别,若判断结果为否,则继续获取下一帧拼接图像SPnext的增量部分并进行合并直至合并的增量部分达到max{dFS(Feature Sequence)},或者所有拼接图像处理完成。If the judgment result is yes, merge the dFS (Feature Sequence) of the incremental part of the stitched image of the current spliced image SPn and the previous frame or previous frames into the CRNN+CTC network for text sequence recognition, if the judgment result is no , then continue to acquire the incremental part of the next frame of stitched image SPnext and merge it until the merged incremental part reaches max{dFS(Feature Sequence)}, or all stitched images are processed.
本发明的第二发明目的是提供一种智能手表,采用所述文字扫描图像的文字识别方法,对于通过配备扫描摄像头进行硬件改进的同时,通过采用本发明的文字扫描图像的文字识别方法,进行软件算法优化,可以提升文字扫描识别功能的识别率,提升用户使用体验,具体地,采用如下技术方案:The second object of the present invention is to provide a smart watch, using the text recognition method of the text scanning image, while improving the hardware by using the scanning camera, by using the text recognition method of the text scanning image of the present invention. The optimization of the software algorithm can improve the recognition rate of the text scanning recognition function and improve the user experience. Specifically, the following technical solutions are adopted:
一种智能手表,包括:A smart watch, comprising:
扫描模块,获取文字扫描的单帧时间序列图像;The scanning module acquires single-frame time-series images of text scanning;
图像预处理模块,针对所述单帧时间序列图像进行图像预处理,得到预处理扫描图像;An image preprocessing module, which performs image preprocessing on the single-frame time series image to obtain a preprocessed scanned image;
图像拼接模块,将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像;The image stitching module performs an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;
及文字识别模块,针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果。and a character recognition module, which performs character recognition on the spliced image to obtain a character recognition result of the character scanned image.
本发明同时提供电子设备,包括处理器和存储器,所述存储器用于存储计算机可执行程序,当所述计算机程序被所述处理器执行时,所述处理器执行所述的一种文字扫描图像的文字识别方法。The present invention also provides electronic equipment, including a processor and a memory, the memory is used to store a computer executable program, and when the computer program is executed by the processor, the processor executes the above-mentioned text scanning image text recognition method.
本发明还提供一种计算机可读存储介质,存储有计算机可执行程序,所述计算机可执行程序被执行时,实现所述的一种文字扫描图像的文字识别方法。The present invention also provides a computer-readable storage medium storing a computer-executable program. When the computer-executable program is executed, the above-mentioned method for character recognition of a scanned character image is realized.
与现有技术相比,本发明的有益效果:Compared with prior art, the beneficial effect of the present invention:
本发明的文字扫描图像的文字识别方法针对扫描的单帧时间序列图像进行图像预处理,预处理后的扫描图像再进行后续的图片拼接及文字识别,这样,即使扫描的单帧时间序列图像存在缺陷和问题,也可以通过图片预处理的方式提升扫描图像的质量,从而提升了文字识别的识别率,特别适用于配置专门的扫描摄像头进行文字扫描识别的智能手表,解决现有文字识别方法应用在配置专门的扫描摄像头进行文字扫描识别的智能手表,文字扫描识别率低,用户使用体验差的问题。The text recognition method for scanned text images of the present invention performs image preprocessing on scanned single-frame time-series images, and the pre-processed scanned images are then subjected to subsequent image splicing and text recognition. In this way, even if the scanned single-frame time-series images exist Defects and problems can also improve the quality of scanned images through image preprocessing, thereby improving the recognition rate of text recognition. It is especially suitable for smart watches equipped with special scanning cameras for text scanning and recognition, and solves the application of existing text recognition methods. In smart watches equipped with special scanning cameras for text scanning and recognition, the text scanning recognition rate is low and the user experience is poor.
本发明的文字扫描图像的文字识别方法将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,可以加速匹配过程,提升图像拼接速度,加快得到文字识别结果。The character recognition method of the scanned character image of the present invention performs the image stitching operation based on image pyramid template matching on the preprocessed scanned images of adjacent frames, which can accelerate the matching process, increase the speed of image stitching, and accelerate the obtaining of character recognition results.
附图说明:Description of drawings:
图1本发明实施例文字扫描图像的文字识别方法的流程图;Fig. 1 is the flow chart of the text recognition method of the text scanning image of the embodiment of the present invention;
图2本发明实施例智能手表的文字扫描识别流程图;Figure 2 is a flow chart of text scanning and recognition of smart watches according to an embodiment of the present invention;
图3本发明实施例智能手表的立体结构示意图;Fig. 3 is a schematic diagram of a three-dimensional structure of a smart watch according to an embodiment of the present invention;
图4本发明实施例智能手表的表体的立体结构示意图。Fig. 4 is a schematic diagram of the three-dimensional structure of the watch body of the smart watch according to the embodiment of the present invention.
具体实施方式Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合附图,对本发明实施例中的技术方案进行清楚、完整的描述。显然,所描述的实施例是本发明的一部分实施例,而不是全部的实施例。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some, not all, embodiments of the present invention.
因此,以下对本发明的实施例的详细描述并非旨在限制要求保护的本发明的范围,而是仅仅表示本发明的部分实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。Therefore, the following detailed description of the embodiments of the present invention is not intended to limit the scope of the claimed invention, but merely represents some embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
需要说明的是,在不冲突的情况下,本发明中的实施例及实施例中的特征和技术方案可以相互组合。It should be noted that, in the case of no conflict, the embodiments of the present invention and the features and technical solutions in the embodiments can be combined with each other.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步定义和解释。It should be noted that like numerals and letters denote similar items in the following figures, therefore, once an item is defined in one figure, it does not require further definition and explanation in subsequent figures.
在本发明的描述中,需要说明的是,术语“上”、“下”等指示的方位或位置关系为基于附图所示的方位或位置关系,或者是该发明产品使用时惯常摆放的方位或位置关系,或者是本领域技术人员惯常理解的方位或位置关系,这类术语仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。此外,术语“第一”、“第二”等仅用于区分描述,而不能理解为指示或暗示相对重要性。In the description of the present invention, it should be noted that the orientation or positional relationship indicated by the terms "upper", "lower", etc. is based on the orientation or positional relationship shown in the drawings, or the conventionally placed position when the product of the invention is used. Orientation or positional relationship, or the orientation or positional relationship commonly understood by those skilled in the art, such terms are only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the referred device or element must have a specific orientation, Constructed and operative in a particular orientation and therefore are not to be construed as limitations of the invention. In addition, the terms "first", "second", etc. are only used for distinguishing descriptions, and should not be construed as indicating or implying relative importance.
参见图1所示,本实施例的一种文字扫描图像的文字识别方法,包括:Referring to shown in Fig. 1, a kind of text recognition method of text scanning image of the present embodiment comprises:
获取文字扫描的单帧时间序列图像;Obtain a single-frame time-series image of text scanning;
针对所述单帧时间序列图像进行图像预处理,得到预处理扫描图像;performing image preprocessing on the single-frame time-series image to obtain a pre-processed scanned image;
将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像;performing an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;
针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果。Character recognition is performed on the spliced image to obtain a character recognition result of the scanned character image.
本实施例的文字扫描图像的文字识别方法针对扫描的单帧时间序列图像进行图像预处理,预处理后的扫描图像再进行后续的图片拼接及文字识别,这样,即使扫描的单帧时间序列图像存在缺陷和问题,也可以通过图片预处理的方式提升扫描图像的质量,从而提升了文字识别的识别率,特别适用于配置专门的扫描摄像头进行文字扫描识别的智能手表,解决现有文字识别方法应用在配置专门的扫描摄像头进行文字扫描识别的智能手表,文字扫描识别率低,用户使用体验差的问题。The text recognition method for scanned text images in this embodiment performs image preprocessing on scanned single-frame time-series images, and the pre-processed scanned images are then subjected to subsequent image splicing and text recognition. In this way, even scanned single-frame time-series images There are defects and problems, and the quality of scanned images can also be improved through image preprocessing, thereby improving the recognition rate of text recognition. It is especially suitable for smart watches equipped with special scanning cameras for text scanning and recognition, and solves the existing text recognition methods. Applied to smart watches equipped with a special scanning camera for text scanning and recognition, the text scanning recognition rate is low, and the user experience is poor.
本实施例的文字扫描图像的文字识别方法将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,可以加速匹配过程,提升图像拼接速度,加快得到文字识别结果。The text recognition method for scanned text images of this embodiment performs image stitching operations based on image pyramid template matching on preprocessed scanned images of adjacent frames, which can speed up the matching process, increase the speed of image stitching, and speed up obtaining text recognition results.
进一步地,本实施例的一种文字扫描图像的文字识别方法中,所述获取文字扫描的单帧时间序列图像包括:Further, in a method for character recognition of scanned character images in this embodiment, the acquisition of single-frame time-series images of character scans includes:
获取文字扫描的n幅单帧时间序列图像{P1,…,Pn},图像Pn表示第n幅单帧时间序列图像。Acquire n single-frame time-series images {P1,...,Pn} of text scanning, and image Pn represents the nth single-frame time-series image.
针对所述单帧时间序列图像进行图片预处理,得到预处理扫描图像包括:Perform image preprocessing on the single-frame time-series image, and obtain the pre-processed scanned image including:
针对所述单帧时间序列图像{P1,…,Pn}中任一单帧时间序列图像Pn进行剪裁处理,去除单帧时间序列图像Pn中按照预设条件选出的目标剪裁部分,具体地,通过预设条件可将单帧时间序列图像Pn中的不需要和被遮挡的部分去除,不需要的部分包括例如设定的图形、图画、标识等;Perform clipping processing on any single-frame time-series image Pn in the single-frame time-series images {P1,...,Pn}, and remove the target clipping part selected according to preset conditions in the single-frame time-series image Pn, specifically, Unnecessary and covered parts in the single-frame time series image Pn can be removed through preset conditions, and the unnecessary parts include, for example, set graphics, pictures, logos, etc.;
针对剪裁处理后的单帧时间序列图像Pn进行亮度补偿处理;performing brightness compensation processing on the clipped single-frame time-series images Pn;
针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正处理,得到预处理扫描图像。A tilt correction process is performed on the single-frame time-series image Pn that has undergone the brightness compensation process to obtain a pre-processed scanned image.
进一步地,本市实施的一种文字扫描图像的文字识别方法中,所述针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正,得到预处理扫描图像包括:Further, in a text recognition method for scanned text images implemented in this city, the tilt correction is performed on the single-frame time-series images Pn that have undergone the brightness compensation processing, and the pre-processed scanned images obtained include:
针对所述亮度补偿处理的单帧时间序列图像Pn依次进行倾斜校正和透视变换,所述倾斜校正的校正角度参数为预设值α;Sequentially performing tilt correction and perspective transformation on the single-frame time-series images Pn processed by the brightness compensation, the correction angle parameter of the tilt correction is a preset value α;
针对校正的图像进行Gamma补偿和动态线性补偿,获取均匀的背景亮度,得到预处理扫描图像;Perform Gamma compensation and dynamic linear compensation on the corrected image to obtain a uniform background brightness and obtain a pre-processed scanned image;
将n幅单帧时间序列图像{P1,…,Pn}经过图片预处理后对应得到n幅预处理扫描图像{newP1,newP2,…,newPn}。Preprocess the n single-frame time series images {P1,...,Pn} to obtain n preprocessed scanned images {newP1, newP2,...,newPn}.
具体地,本实施例倾斜校正的校正角度参数预设值α需要考虑用户在使用智能手表进行文字扫描图像时相对于被扫描文本对象的倾斜角度,由大量图像试验所确定,具体地,所述的校正角度参数预设值α可选取65°。Specifically, the preset value α of the correction angle parameter of the tilt correction in this embodiment needs to consider the tilt angle of the user relative to the scanned text object when using the smart watch to scan the image, which is determined by a large number of image experiments. Specifically, the The preset value α of the correction angle parameter can be selected as 65°.
本实施例补偿参数由大量图像试验获取,具体地,采用的Gamma系数为-0.3,动态线性补偿系数为330。The compensation parameters in this embodiment are obtained from a large number of image experiments. Specifically, the Gamma coefficient used is -0.3, and the dynamic linear compensation coefficient is 330.
进一步地,本实施例的一种文字扫描图像的文字识别方法中,所述将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像包括:Further, in the text recognition method of a text scanned image of the present embodiment, performing an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image includes:
将按时序到来的当前帧图像newPn提取sift特征;Extract the sift feature from the current frame image newPn arriving in time sequence;
将sift特征图做多层图像金字塔采样,逐层与上一帧图像newPre的图像金字塔特征图像进行做模板匹配,上层图像金字塔的匹配位置作为下层匹配位置的候选参数,The sift feature map is sampled in a multi-layer image pyramid, and template matching is performed layer by layer with the image pyramid feature image of the previous frame image newPre, and the matching position of the upper image pyramid is used as a candidate parameter for the matching position of the lower layer.
加速匹配过程,其中,对于第一帧图像的前一帧图像默认与当前帧相同;Accelerate the matching process, where the previous frame image of the first frame image is the same as the current frame by default;
将位置匹配完成的图像进行图像拷贝拼接,得到拼接图像,输出拼接图像并在屏幕显示。Copy and stitch the images whose positions have been matched to obtain a stitched image, output the stitched image and display it on the screen.
具体地,将sift特征图做3层图像金字塔采样,逐层与上一帧图像newPre的图像金字塔特征图像进行做模板匹配,上层金字塔的匹配位置作为下层匹配位置的候选参数,以加速匹配过程。另外,对于第一帧图像的前一帧默认与当前帧相同。Specifically, the sift feature map is sampled in a three-layer image pyramid, and template matching is performed layer by layer with the image pyramid feature image of the previous frame image newPre, and the matching position of the upper pyramid is used as a candidate parameter for the matching position of the lower layer to speed up the matching process. In addition, the previous frame of the first frame image is the same as the current frame by default.
另外,本实施例的拼接图像会被显示与智能手表的屏幕上,在智能手表扫描移动过程中,时间序列图像newP连续到来时,图像连续拼接增长,屏幕上形成随着扫描移动而连续增长的动态图。In addition, the mosaic image in this embodiment will be displayed on the screen of the smart watch. During the scanning movement of the smart watch, when the time-series image newP arrives continuously, the images will continue to be spliced and grow, forming a continuously growing pattern on the screen with the scanning movement. dynamic picture.
进一步地,本实施例的文字扫描图像的文字识别方法中,所述针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果包括:Further, in the character recognition method of the scanned character image in this embodiment, performing character recognition on the spliced image to obtain the character recognition result of the scanned character image includes:
所述n幅单帧时间序列图像{P1,…,Pn}经过图像拼接操作后对应得到n幅拼接图像{SP1,SP2,…,SPn};The n pieces of single-frame time-series images {P1,...,Pn} undergo an image stitching operation to obtain n pieces of spliced images {SP1,SP2,...,SPn};
将当前拼接图像SPn的增量部分输入CNN网络,获得增量部分dFS(FeatureSequence);Input the incremental part of the current spliced image SPn into the CNN network to obtain the incremental part dFS (FeatureSequence);
将一帧或者多帧拼接图像SPn的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别,得到文字扫描图像的文字识别结果。Input the incremental part dFS (Feature Sequence) of one or more frames of spliced image SPn into CRNN+CTC network for character sequence recognition, and obtain the character recognition result of the character scanned image.
具体地,本实施例的一种文字扫描图像的文字识别方法,包括:Specifically, a character recognition method of a character scanned image in this embodiment includes:
预设增量预置max{dFS(Feature Sequence)};preset increment preset max{dFS(Feature Sequence)};
当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)达到预设增量预置max{dFS(Feature Sequence)}时,将当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别。增量识别大大加快了文字识别的速度,可设置一个增量阈值max{dFS(Feature Sequence)}(经验值是28),减少CRNN+CTC网络启动次数。扫描完成,手表抬起时,只要增量>1,最后再启动一次增量识别。When the incremental part dFS (Feature Sequence) of the stitched image of the previous frame or multiple frames reaches the preset incremental preset max{dFS (Feature Sequence)}, the incremental part dFS of the stitched image of the current frame or multiple frames (Feature Sequence) Enter the CRNN+CTC network for text sequence recognition. Incremental recognition greatly speeds up text recognition, and an incremental threshold max{dFS(Feature Sequence)} (experience value is 28) can be set to reduce the number of CRNN+CTC network startups. After the scan is completed, when the watch is lifted, as long as the increment is > 1, the increment recognition will be started again at last.
具体地,本实施例的一种文字扫描图像的文字识别方法,包括:Specifically, a character recognition method of a character scanned image in this embodiment includes:
将当前拼接图像SPn的增量部分输入CNN网络,获得增量部分的dFS(FeatureSequence),判断所述增量部分dFS(Feature Sequence)是否达到预设增量预置max{dFS(Feature Sequence)};Input the incremental part of the current spliced image SPn into the CNN network to obtain the dFS (Feature Sequence) of the incremental part, and judge whether the incremental part dFS (Feature Sequence) reaches the preset incremental preset max{dFS (Feature Sequence)} ;
若判断结果为是,则将当前拼接图像SPn的增量部分的dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别,若判断结果为否,则将当前拼接图像SPn的增量部分与前一帧或者前几帧未输入CRNN+CTC网络的拼接图像SPnPre的增量部分的dFS(FeatureSequence)进行合并,进一步判断是否达到预设增量预置max{dFS(Feature Sequence)};If the judgment result is yes, then input the dFS (Feature Sequence) of the incremental part of the current spliced image SPn into the CRNN+CTC network for character sequence recognition; Merge the dFS (Feature Sequence) of the incremental part of the spliced image SPnPre that has not been input into the CRNN+CTC network in one frame or the previous few frames, and further judge whether it reaches the preset incremental preset max{dFS (Feature Sequence)};
若判断结果为是,则将当前拼接图像SPn与前一帧或前几帧的拼接图像的增量部分的dFS(Feature Sequence)合并输入至CRNN+CTC网络进行文字序列识别,若判断结果为否,则继续获取下一帧拼接图像SPnext的增量部分并进行合并直至合并的增量部分达到max{dFS(Feature Sequence)},或者所有拼接图像处理完成。If the judgment result is yes, merge the dFS (Feature Sequence) of the incremental part of the stitched image of the current spliced image SPn and the previous frame or previous frames into the CRNN+CTC network for text sequence recognition, if the judgment result is no , then continue to acquire the incremental part of the next frame of stitched image SPnext and merge it until the merged incremental part reaches max{dFS(Feature Sequence)}, or all stitched images are processed.
本实施例同时提供一种智能手表,包括:This embodiment also provides a smart watch, including:
扫描模块,获取文字扫描的单帧时间序列图像;The scanning module acquires single-frame time-series images of text scanning;
图像预处理模块,针对所述单帧时间序列图像进行图像预处理,得到预处理扫描图像;An image preprocessing module, which performs image preprocessing on the single-frame time series image to obtain a preprocessed scanned image;
图像拼接模块,将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像;The image stitching module performs an image stitching operation based on image pyramid template matching on the pre-processed scanned images of adjacent frames to obtain a stitched image;
及文字识别模块,针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果。and a character recognition module, which performs character recognition on the spliced image to obtain a character recognition result of the character scanned image.
本实施例智能手表的文字扫描模块300专门用于字符扫描识别,字符包括字母、数字、运算符号、标点符号和其他符号,以及一些功能性符号;文字扫描模块300一般采用定焦的扫描摄像头,扫描摄像头的视场角FOV一般较大以获取较为完整的扫描字符,配合设定亮度的补光光源,在使用时文字扫描模块300需要与被扫描对象保持设定的距离以及角度,配合补光光源的补光,摄像头可以拍摄出更加适于进行字符识别的图像,提高字符扫描识别的准确率。The
本实施例智能手表的文字扫描模块300直接设置在表体100上,这在智能手表行业属于创新性的结构设计,将文字扫描模块300设置在表体100上更易于功能模组的集成化安装,而且需要可以针对表体100进行综合性的防水结构设计,用户在使用时可以在表体100上进行各种功能操作,方便用户操作和使用。The
本实施例的智能手表通过配置专门用于文字扫描的文字扫描模块300,实现了智能手表的文字扫描识别功能,本实施例的智能手表采用的文字扫描图像的文字识别方法,针对扫描的单帧时间序列图像进行图像预处理,预处理后的扫描图像再进行后续的图片拼接及文字识别,这样,即使扫描的单帧时间序列图像存在缺陷和问题,也可以通过图片预处理的方式提升扫描图像的质量,从而提升了文字识别的识别率,特别适用于配置专门的扫描摄像头进行文字扫描识别的智能手表,解决现有文字识别方法应用在配置专门的扫描摄像头进行文字扫描识别的智能手表,文字扫描识别率低,用户使用体验差的问题。The smart watch of this embodiment realizes the text scanning recognition function of the smart watch by configuring the
参见图2所示的本实施例智能手表进行文字扫描识别的流程图。Refer to FIG. 2 for the flow chart of character scanning and recognition performed by the smart watch of this embodiment.
进一步地,本实施例智能手表的扫描模块获取文字扫描的单帧时间序列图像过程包括:Further, the scanning module of the smart watch in this embodiment acquires a single-frame time-series image process of text scanning including:
获取文字扫描的n幅单帧时间序列图像{P1,…,Pn},图像Pn表示第n幅单帧时间序列图像。Acquire n single-frame time-series images {P1,...,Pn} of text scanning, and image Pn represents the nth single-frame time-series image.
本实施例智能手表的图像预处理模块针对所述单帧时间序列图像进行图片预处理,得到预处理扫描图像包括:The image preprocessing module of the smart watch in this embodiment performs image preprocessing on the single-frame time-series image, and the obtained pre-processed scanned image includes:
针对所述单帧时间序列图像{P1,…,Pn}中任一单帧时间序列图像Pn进行剪裁处理,去除单帧时间序列图像Pn中按照预设条件选出的目标剪裁部分;Perform clipping processing on any single-frame time-series image Pn in the single-frame time-series images {P1,...,Pn}, and remove the target clipping part selected according to preset conditions in the single-frame time-series image Pn;
针对剪裁处理后的单帧时间序列图像Pn进行亮度补偿处理;performing brightness compensation processing on the clipped single-frame time-series images Pn;
针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正处理,得到预处理扫描图像。A tilt correction process is performed on the single-frame time-series image Pn that has undergone the brightness compensation process to obtain a pre-processed scanned image.
本实施例智能手表的图像预处理模块针对经过所述亮度补偿处理的单帧时间序列图像Pn进行倾斜校正,得到预处理扫描图像包括:The image preprocessing module of the smart watch in this embodiment performs tilt correction on the single-frame time-series images Pn processed by the brightness compensation, and obtains the pre-processed scanned images including:
针对所述亮度补偿处理的单帧时间序列图像Pn依次进行倾斜校正和透视变换,所述倾斜校正的校正角度参数为预设值α;Sequentially performing tilt correction and perspective transformation on the single-frame time-series images Pn processed by the brightness compensation, the correction angle parameter of the tilt correction is a preset value α;
针对校正的图像进行Gamma补偿和动态线性补偿,获取均匀的背景亮度,得到预处理扫描图像;Perform Gamma compensation and dynamic linear compensation on the corrected image to obtain a uniform background brightness and obtain a pre-processed scanned image;
将n幅单帧时间序列图像{P1,…,Pn}经过图片预处理后对应得到n幅预处理扫描图像{newP1,newP2,…,newPn}。Preprocess the n single-frame time series images {P1,...,Pn} to obtain n preprocessed scanned images {newP1, newP2,...,newPn}.
本实施例智能手表的图像拼接模块将相邻帧的预处理扫描图像进行基于图像金字塔模板匹配的图像拼接操作,得到拼接图像包括:The image stitching module of the smart watch of this embodiment performs image stitching operations based on image pyramid template matching on the pre-processed scanned images of adjacent frames, and the stitched images obtained include:
将按时序到来的当前帧图像newPn提取sift特征;Extract the sift feature from the current frame image newPn arriving in time sequence;
将sift特征图做多层图像金字塔采样,逐层与上一帧图像newPre的图像金字塔特征图像进行做模板匹配,上层图像金字塔的匹配位置作为下层匹配位置的候选参数,The sift feature map is sampled in a multi-layer image pyramid, and template matching is performed layer by layer with the image pyramid feature image of the previous frame image newPre, and the matching position of the upper image pyramid is used as a candidate parameter for the matching position of the lower layer.
加速匹配过程,其中,对于第一帧图像的前一帧图像默认与当前帧相同;Accelerate the matching process, where the previous frame image of the first frame image is the same as the current frame by default;
将位置匹配完成的图像进行图像拷贝拼接,得到拼接图像,输出拼接图像并在屏幕显示。Copy and stitch the images whose positions have been matched to obtain a stitched image, output the stitched image and display it on the screen.
本实施例智能手表的文字识别模块针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果包括:The text recognition module of the smart watch of this embodiment performs text recognition on the spliced image, and the text recognition results obtained from the text scanning image include:
所述n幅单帧时间序列图像{P1,…,Pn}经过图像拼接操作后对应得到n幅拼接图像{SP1,SP2,…,SPn};The n pieces of single-frame time-series images {P1,...,Pn} undergo an image stitching operation to obtain n pieces of spliced images {SP1,SP2,...,SPn};
将当前拼接图像SPn的增量部分输入CNN网络,获得增量部分dFS(FeatureSequence);Input the incremental part of the current spliced image SPn into the CNN network to obtain the incremental part dFS (FeatureSequence);
将一帧或者多帧拼接图像SPn的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别,得到文字扫描图像的文字识别结果。Input the incremental part dFS (Feature Sequence) of one or more frames of spliced image SPn into CRNN+CTC network for character sequence recognition, and obtain the character recognition result of the character scanned image.
本实施例智能手表的文字识别模块针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果包括:The text recognition module of the smart watch of this embodiment performs text recognition on the spliced image, and the text recognition results obtained from the text scanning image include:
预设增量预置max{dFS(Feature Sequence)};preset increment preset max{dFS(Feature Sequence)};
当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)达到预设增量预置max{dFS(Feature Sequence)}时,将当前一帧或者多帧的拼接图像的增量部分dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别。When the incremental part dFS (Feature Sequence) of the stitched image of the previous frame or multiple frames reaches the preset incremental preset max{dFS (Feature Sequence)}, the incremental part dFS of the stitched image of the current frame or multiple frames (Feature Sequence) Enter the CRNN+CTC network for text sequence recognition.
本实施例智能手表的文字识别模块针对所述拼接图像进行文字识别,得到文字扫描图像的文字识别结果包括:The text recognition module of the smart watch of this embodiment performs text recognition on the spliced image, and the text recognition results obtained from the text scanning image include:
将当前拼接图像SPn的增量部分输入CNN网络,获得增量部分的dFS(FeatureSequence),判断所述增量部分dFS(Feature Sequence)是否达到预设增量预置max{dFS(Feature Sequence)};Input the incremental part of the current spliced image SPn into the CNN network to obtain the dFS (Feature Sequence) of the incremental part, and judge whether the incremental part dFS (Feature Sequence) reaches the preset incremental preset max{dFS (Feature Sequence)} ;
若判断结果为是,则将当前拼接图像SPn的增量部分的dFS(Feature Sequence)输入CRNN+CTC网络进行文字序列识别,若判断结果为否,则将当前拼接图像SPn的增量部分与前一帧或者前几帧未输入CRNN+CTC网络的拼接图像SPnPre的增量部分的dFS(FeatureSequence)进行合并,进一步判断是否达到预设增量预置max{dFS(Feature Sequence)};If the judgment result is yes, then input the dFS (Feature Sequence) of the incremental part of the current spliced image SPn into the CRNN+CTC network for character sequence recognition; Merge the dFS (Feature Sequence) of the incremental part of the spliced image SPnPre that has not been input into the CRNN+CTC network in one frame or the previous few frames, and further judge whether it reaches the preset incremental preset max{dFS (Feature Sequence)};
若判断结果为是,则将当前拼接图像SPn与前一帧或前几帧的拼接图像的增量部分的dFS(Feature Sequence)合并输入至CRNN+CTC网络进行文字序列识别,若判断结果为否,则继续获取下一帧拼接图像SPnext的增量部分并进行合并直至合并的增量部分达到max{dFS(Feature Sequence)},或者所有拼接图像处理完成。If the judgment result is yes, merge the dFS (Feature Sequence) of the incremental part of the stitched image of the current spliced image SPn and the previous frame or previous frames into the CRNN+CTC network for text sequence recognition, if the judgment result is no , then continue to acquire the incremental part of the next frame of stitched image SPnext and merge it until the merged incremental part reaches max{dFS(Feature Sequence)}, or all stitched images are processed.
参见图3级图4所示,由于本实施例的智能手表采用专门的文字扫描模块300进行字符扫描,为了保证文字扫描模块300的字符扫描识别效果,需要针对智能手表进行结构改进以适配所述文字扫描模块,具体的,本实施例通过软件与硬件结合的方式来实现针对文字扫描模块300的控制,通过获取文字扫描模块300与被扫描对象之间的距离以及角度,从而来判断是否满足文字扫描模块300进行扫描的条件,若判断结果为是,则控制文字扫描模块300进行扫描,若判断结果为否,则文字扫描模块300不进行扫描,当智能手表运行扫描程序,但文字扫描模块300却不执行扫描操作时,需要在智能手表的屏幕101上进行信息提示,也可以对用户进行操作引导,以实现文字扫描模块300进行扫描。Referring to Figure 3 and Figure 4, since the smart watch of this embodiment uses a special
本实施例公开的智能手表,在表体100上设置专门用于字符扫描的文字扫描模块300,通过文字扫描模块300可以实现文字的扫描识别,可以直接将扫描识别的文字直接进行同步显示,也可以针对扫描识别的文字进行同步翻译,实现文字扫描翻译功能。In the smart watch disclosed in this embodiment, a
具体地,为了便于用户利用文字扫描模块300进行字符扫描,本实施例所述的表体100上设置扫描定位部400,所述的扫描定位部400在文字扫描模块300进行扫描时与被扫描对象接触,使得所述文字扫描模块300与被扫描对象保持设定的距离和角度。Specifically, in order to facilitate the user to use the
作为本实施例的可选实施方式,本实施例所述的扫描定位部400为设置在表体100上靠近所述文字扫描模块300的定位凸块,所述的定位凸块具有倾斜定位面,所述的倾斜定位面与被扫描对象贴合接触使得文字扫描模块300与被扫描对象保持预设距离和预设倾斜角度,文字扫描模块300即满足预设的扫描条件可以进行字符扫描。As an optional implementation of this embodiment, the
进一步地,为了判断所述的扫描定位部400是否处于正确的扫描定位位置,本实施例所述的表体上设置用于监测所述扫描定位部400位置的检测传感器,所述的检测传感器在所述扫描定位部与被扫描对象接触时被触发检测信号,所述智能手表的主控模块根据所述检测信号触发所述文字扫描模块进行扫描。Further, in order to judge whether the
作为本实施例的可选实施方式,本实施例的检测传感器为设置在所述倾斜定位面上的距离传感器或者按压开关,所述的距离传感器可检测倾斜定位面与所述被扫描对象之间的距离从而判断两者是否贴合,所述的按压开关在所述倾斜定位面与所述被扫描对象接触贴合时闭合,发出电信号。As an optional implementation of this embodiment, the detection sensor in this embodiment is a distance sensor or a push switch arranged on the inclined positioning surface, and the distance sensor can detect the distance between the inclined positioning surface and the object to be scanned. In order to judge whether the two are attached, the push switch is closed when the inclined positioning surface is in contact with the object to be scanned, and an electric signal is sent.
本实施例的智能手表在使用字符扫描功能时,先触发启动所述文字扫描模块,所述文字扫描模块被触发启动后并非直接开始进行扫描;进一步地,需要判断当前智能手表与被扫描对象之间的距离以及角度是否满足文字扫描模块进行扫描工作的条件,具体通过判断扫描定位部是否处于被扫描对象的定位位置,若扫描定位部处于被扫描对象的定位位置,则直接触发文字扫描模块的补光灯开启,扫描摄像头开始进行图像扫描,若扫描定位部未处于被扫描对象的定位位置,则控制所述文字扫描模块处于待机状态,不进行扫描,并同时在智能手表的屏幕上进行信息提示,也可以对用户进行操作引导,以实现文字扫描模块进行扫描。When the smart watch of this embodiment uses the character scanning function, it first triggers and starts the character scanning module. After the character scanning module is triggered and starts, it does not start scanning directly; further, it is necessary to determine the current smart watch and the scanned object. Whether the distance and angle between them meet the conditions for the text scanning module to perform scanning work. Specifically, by judging whether the scanning positioning part is at the positioning position of the scanned object, if the scanning positioning part is at the positioning position of the scanned object, it will directly trigger the text scanning module. The supplementary light is turned on, and the scanning camera starts to scan the image. If the scanning positioning part is not in the positioning position of the scanned object, the text scanning module is controlled to be in a standby state, and the scanning is not performed, and information is displayed on the screen of the smart watch at the same time. As a reminder, the user can also be guided to operate, so as to realize the text scanning module to scan.
作为本实施例的可选实施方式,本实施例的智能手表,包括:As an optional implementation of this embodiment, the smart watch of this embodiment includes:
触发模块,设置在所述表体上,所述触发模块与所述主控模块通讯连接,启动扫描识别程序后,所述触发模块监测与被扫描对象之间的距离,当监测到所述距离小于预设值时,控制所述文字扫描模块开启,获取扫描图片。The trigger module is arranged on the watch body, and the trigger module is connected to the main control module in communication. After the scanning recognition program is started, the trigger module monitors the distance between the scanned object and the object to be scanned. When the distance is detected When the value is less than the preset value, the text scanning module is controlled to be turned on to acquire scanned pictures.
作为本实施例的可选实施方式,本实施例所述的智能手表包括表带组件200,所述表体100与表带组件200之间为活动连接,所述智能手表具有:As an optional implementation of this embodiment, the smart watch described in this embodiment includes a
第一状态,所述表体100与表带组件200相互锁定,用于智能手表的佩戴;In the first state, the
第二状态,所述表体100与表带组件200解除锁定,所述表体100可相对表带组件200运动至预设位置,用于所述文字扫描模块300扫描进行字符信息扫描。In the second state, the
本实施例通过将表体100与表带组件200之间采用活动连接的方式,用户根据使用需求可以调节表体100与表带组件200之间的相对状态,从而极大的方便用户进行字符扫描操作。In this embodiment, the user can adjust the relative state between the
进一步地,所述表体100与表带组件200之间为可拆卸连接,所述智能手表具有:Further, the
第一状态,所述的表体100与所述表带组件200固定连接,用于智能手表的佩戴;In the first state, the
第二状态,所述的表体100由所述表带组件200上拆卸分离,用于所述文字扫描模块进行字符信息扫描。In the second state, the
本实施例通过将表体100与表带组件200之间采用可拆卸的连接方式,用户在将智能手表佩戴在手腕上之后,若要利用文字扫描模块300进行字符扫描,则可直接将表体100由表带组件200上拆卸下来进行扫描操作,在完成扫描后,再将表体装回表带组件上即可。In this embodiment, by adopting a detachable connection between the
进一步地,本实施例的表体100与表带组件200之间具有第一状态和第二状态,且第二状态主要用于实现扫描组件的字符扫描,因此,本实施例可以通过判断所述表体100相对于表带组件200的状态判断是否自动触发文字扫描模块开启,具体方案如下:Further, there is a first state and a second state between the
具体地,本实施例的智能手表的触发模块,设置在所述表体100上,所述的触发模块被配置为:监测并判断所述表体100相对于表带组件200为第二状态时,自动生成触发指令控制所述文字扫描模块进行字符信息扫描。Specifically, the trigger module of the smart watch of this embodiment is set on the
进一步地,为了实现智能手表图像采集类应用的实现,例如拍摄图片、视频,视频通话以及人脸识别等,本实施例的智能手表包括摄像模块500,所述的摄像模块500设置在所述表体100上,所述的摄像模块500被配置为进行图像采集应用,所述的图像采集应用包括拍摄照片或者视频、视频通话及人脸识别。Further, in order to realize the application of smart watch image collection, such as taking pictures, videos, video calls and face recognition, etc., the smart watch of this embodiment includes a
本实施例同时提供一种计算机可读存储介质,存储有计算机可执行程序,所述计算机可执行程序被执行时,实现所述文字扫描图像的文字识别方法。This embodiment also provides a computer-readable storage medium storing a computer-executable program. When the computer-executable program is executed, the character recognition method of the scanned character image is implemented.
本实施例所述存储介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了可读程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。可读存储介质还可以是可读存储介质以外的任何可读介质,该可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。可读存储介质上包含的程序代码可以用任何适当的介质传输,包括但不限于无线、有线、光缆、RF等等,或者上述的任意合适的组合。The storage medium in this embodiment may include a data signal propagated in baseband or as part of a carrier wave, and readable program codes are carried therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A readable storage medium may also be any readable medium other than a readable storage medium that can send, propagate or transport a program for use by or in conjunction with an instruction execution system, apparatus or device. The program code contained on the readable storage medium may be transmitted by any suitable medium, including but not limited to wireless, cable, optical cable, RF, etc., or any suitable combination of the above.
本实施例还提供了一种电子设备,包括处理器和存储器,所述存储器用于存储计算机可执行程序,当所述计算机程序被所述处理器执行时,所述处理器执行所述文字扫描图像的文字识别方法。This embodiment also provides an electronic device, including a processor and a memory, the memory is used to store a computer executable program, and when the computer program is executed by the processor, the processor executes the text scanning Text recognition methods for images.
电子设备以通用计算设备的形式表现。其中处理器可以是一个,也可以是多个并且协同工作。本发明也不排除进行分布式处理,即处理器可以分散在不同的实体设备中。本发明的电子设备并不限于单一实体,也可以是多个实体设备的总和。Electronic devices take the form of general-purpose computing devices. There can be one or more processors and they work together. The present invention does not exclude distributed processing, that is, processors may be dispersed in different physical devices. The electronic device of the present invention is not limited to a single entity, but may also be the sum of multiple entity devices.
所述存储器存储有计算机可执行程序,通常是机器可读的代码。所述计算机可读程序可以被所述处理器执行,以使得电子设备能够执行本发明的方法,或者方法中的至少部分步骤。The memory stores a computer executable program, typically machine readable code. The computer-readable program can be executed by the processor, so that the electronic device can execute the method of the present invention, or at least part of the steps in the method.
所述存储器包括易失性存储器,例如随机存取存储单元(RAM)和/或高速缓存存储单元,还可以是非易失性存储器,如只读存储单元(ROM)。The memory includes a volatile memory, such as a random access memory unit (RAM) and/or a cache memory unit, and may also be a nonvolatile memory, such as a read only memory unit (ROM).
应当理解,本发明的电子设备中还可以包括上述示例中未示出的元件或组件。例如,有些电子设备中还包括有显示屏等显示单元,有些电子设备还包括人机交互元件,例如按钮、键盘等。只要该电子设备能够执行存储器中的计算机可读程序以实现本发明方法或方法的至少部分步骤,均可认为是本发明所涵盖的电子设备。It should be understood that the electronic device of the present invention may also include elements or components not shown in the above examples. For example, some electronic devices also include display units such as display screens, and some electronic devices also include human-computer interaction components, such as buttons and keyboards. As long as the electronic device can execute the computer-readable program in the memory to realize the method or at least some steps of the method of the present invention, it can be considered as the electronic device covered by the present invention.
通过以上对实施方式的描述,本领域的技术人员易于理解,本发明可以由能够执行特定计算机程序的硬件来实现,例如本发明的系统,以及系统中包含的电子处理单元、服务器、客户端、手机、控制单元、处理器等。本发明也可以由执行本发明的方法的计算机软件来实现,例如由微处理器、电子控制单元,客户端、服务器端等执行的控制软件来实现。但需要说明的是,执行本发明的方法的计算机软件并不限于由一个或特定个的硬件实体中执行,其也可以是由不特定具体硬件的以分布式的方式来实现。对于计算机软件,软件产品可以存储在一个计算机可读的存储介质(可以是CD-ROM,U盘,移动硬盘等)中,也可以分布式存储于网络上,只要其能使得电子设备执行根据本发明的方法。Through the above description of the embodiments, those skilled in the art can easily understand that the present invention can be realized by hardware capable of executing specific computer programs, such as the system of the present invention, and the electronic processing unit, server, client, Mobile phones, control units, processors, etc. The present invention can also be implemented by computer software that executes the method of the present invention, such as by control software executed by a microprocessor, an electronic control unit, a client, a server, and the like. However, it should be noted that the computer software for executing the method of the present invention is not limited to be executed by one or a specific hardware entity, and it may also be implemented in a distributed manner by unspecified specific hardware. For computer software, the software product can be stored in a computer-readable storage medium (which can be CD-ROM, U disk, mobile hard disk, etc.), or can be distributed and stored on the network, as long as it can make the electronic equipment execute according to this document. method of invention.
以上实施例仅用以说明本发明而并非限制本发明所描述的技术方案,尽管本说明书参照上述的各个实施例对本发明已进行了详细的说明,但本发明不局限于上述具体实施方式,因此任何对本发明进行修改或等同替换;而一切不脱离发明的精神和范围的技术方案及其改进,其均涵盖在本发明的权利要求范围当中。The above embodiments are only used to illustrate the present invention and are not intended to limit the technical solutions described in the present invention. Although the specification has described the present invention in detail with reference to the above-mentioned embodiments, the present invention is not limited to the above-mentioned specific implementation methods, so Any modifications or equivalent replacements to the present invention; and all technical solutions and improvements that do not depart from the spirit and scope of the invention are covered by the scope of the claims of the present invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210971002.XA CN115294571A (en) | 2022-08-14 | 2022-08-14 | Character recognition method of character scanning image, intelligent watch and electronic equipment |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210971002.XA CN115294571A (en) | 2022-08-14 | 2022-08-14 | Character recognition method of character scanning image, intelligent watch and electronic equipment |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN115294571A true CN115294571A (en) | 2022-11-04 |
Family
ID=83829246
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202210971002.XA Pending CN115294571A (en) | 2022-08-14 | 2022-08-14 | Character recognition method of character scanning image, intelligent watch and electronic equipment |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN115294571A (en) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2009114967A1 (en) * | 2008-03-19 | 2009-09-24 | 东莞市步步高教育电子产品有限公司 | Motion scan-based image processing method and device |
| CN101930594A (en) * | 2010-04-14 | 2010-12-29 | 山东山大鸥玛软件有限公司 | Rapid correction method for scanning document image |
| WO2012163112A1 (en) * | 2011-05-27 | 2012-12-06 | 汉王科技股份有限公司 | Frame-skipping scanning and recognizing device and method |
| CN110298282A (en) * | 2019-06-21 | 2019-10-01 | 华南师范大学 | Document image processing method, storage medium and calculating equipment |
-
2022
- 2022-08-14 CN CN202210971002.XA patent/CN115294571A/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2009114967A1 (en) * | 2008-03-19 | 2009-09-24 | 东莞市步步高教育电子产品有限公司 | Motion scan-based image processing method and device |
| CN101930594A (en) * | 2010-04-14 | 2010-12-29 | 山东山大鸥玛软件有限公司 | Rapid correction method for scanning document image |
| WO2012163112A1 (en) * | 2011-05-27 | 2012-12-06 | 汉王科技股份有限公司 | Frame-skipping scanning and recognizing device and method |
| CN110298282A (en) * | 2019-06-21 | 2019-10-01 | 华南师范大学 | Document image processing method, storage medium and calculating equipment |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8345106B2 (en) | Camera-based scanning | |
| US10225428B2 (en) | Image processing for handheld scanner | |
| US8582182B2 (en) | Automatic sizing of images acquired by a handheld scanner | |
| US5611033A (en) | Apparatus and method for automerging images by matching features and aligning images | |
| US9131150B1 (en) | Automatic exposure control and illumination for head tracking | |
| US20100296137A1 (en) | Continuous scanning with a handheld scanner | |
| US8884985B2 (en) | Interface apparatus, method, and recording medium | |
| US9436870B1 (en) | Automatic camera selection for head tracking using exposure control | |
| US20100296140A1 (en) | Handheld scanner with high image quality | |
| CN101558416A (en) | Text detection on mobile communications devices | |
| US10440223B2 (en) | System and method for constructing document image from snapshots taken by image sensor panel | |
| US10002291B2 (en) | Method and system of identifying fillable fields of an electronic form | |
| CN113744172B (en) | Document image processing method and device and training sample generation method and device | |
| CN109508713B (en) | Picture acquisition method, device, terminal and storage medium | |
| CN115294571A (en) | Character recognition method of character scanning image, intelligent watch and electronic equipment | |
| TWM457241U (en) | Picture character recognition system by combining augmented reality | |
| WO2022111461A1 (en) | Recognition method and apparatus, and electronic device | |
| JPWO2017126056A1 (en) | Mobile terminal, image processing method, and program | |
| JP2019016843A (en) | Document reading device, control method of document reading device, and program | |
| CN115273107A (en) | Character scanning recognition control method, smart watch, electronic device and storage medium | |
| JP2018191094A (en) | Document reader, document reader control method, and program | |
| CN102890596A (en) | Computer system, image acquisition device and servo method of image acquisition device | |
| JP6203070B2 (en) | Scanning system, control method therefor, and program | |
| JP2017123561A (en) | Image processing device, image processing method for image processing device, control method for image processing device, and program | |
| CN114202764A (en) | Method and device for displaying recognized characters, terminal equipment and storage medium |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| TA01 | Transfer of patent application right | ||
| TA01 | Transfer of patent application right |
Effective date of registration: 20230605 Address after: 6001, 6th Floor, No.1 Kaifeng Road, Shangdi Information Industry Base, Haidian District, Beijing, 100085 Applicant after: Beijing Baige Feichi Technology Co.,Ltd. Address before: 211103 science and innovation center, Shiqiu street, Lishui District, Nanjing City, Jiangsu Province Applicant before: Nanjing Baige zhengliu Network Technology Co.,Ltd. |
