
CN107909080A - Text extraction system and method - Google Patents


Info

Publication number
CN107909080A
Authority
CN
China
Prior art keywords
value
color value
brightness value
image
pending image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711025339.7A
Other languages
Chinese (zh)
Inventor
温九江
袁松平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangxi Xiaocao Information Industry Co Ltd
Original Assignee
Guangxi Xiaocao Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangxi Xiaocao Information Industry Co Ltd
Priority to CN201711025339.7A
Publication of CN107909080A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/56 Extraction of image or video features relating to colour
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/44 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Character Input (AREA)

Abstract

The present invention provides a text extraction system and method. The system includes: a scanning module for scanning a background picture containing text to obtain an image to be processed; a first extraction module for extracting a first color value and a first brightness value of the image to be processed as a whole, and a second color value and a second brightness value of the text portion of that image; an adjustment module for adjusting the first color value and the second color value according to a preset color value, and the first brightness value and the second brightness value according to a preset brightness value; a second extraction module for extracting a text image from the adjusted image; and a conversion module for converting the extracted text image into the corresponding character symbols. By widening the color difference between the background and the text portion, the invention makes the text portion easier to extract and recognition more reliable.

Description

A text extraction system and method
Technical field
The invention relates to the technical field of information processing, and in particular to a text extraction system and method.
Background
Text recognition is the process of extracting a text image from an image and then converting that text image into plain text.
In existing techniques for converting images into text, the color difference and brightness difference between the overall background and the text portion are often small. Recognition performance is therefore poor and recognition errors are common, so the results need a second round of correction, which is tedious and lowers working efficiency.
Summary of the invention
To address the above technical problem, the present invention provides a text extraction system and method.
The technical solution by which the present invention solves the above technical problem is as follows: a text extraction system, including:
a scanning module, for scanning a background picture containing text to obtain an image to be processed;
a first extraction module, for extracting a first color value and a first brightness value of the image to be processed as a whole, and for extracting a second color value and a second brightness value of the text portion of the image to be processed;
an adjustment module, for adjusting, according to a preset color value, the first color value of the image as a whole and the second color value of the text portion so as to increase the difference between the first color value and the second color value, and for adjusting, according to a preset brightness value, the first brightness value of the image as a whole and the second brightness value of the text portion so as to increase the difference between the first brightness value and the second brightness value;
a second extraction module, for extracting a text image from the adjusted image;
a conversion module, for converting the extracted text image into the corresponding character symbols.
The beneficial effect of the invention is that, by processing the background and the text portion separately and adjusting the color and brightness values, the text portion is made to stand out, so that it is easier to extract and recognition is more reliable.
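The source does not say how the color and brightness values are computed, or how the text portion is located before extraction. The following is a minimal Python sketch under stated assumptions: the image is a list of rows of (R, G, B) tuples, a rough `text_mask` of booleans is already available, a color value is a mean RGB triple, and brightness is approximated by Rec. 601 luma. All function and key names are hypothetical.

```python
def mean_color(pixels):
    """Average (R, G, B) over a non-empty list of pixels."""
    n = len(pixels)
    return tuple(sum(p[c] for p in pixels) / n for c in range(3))


def brightness(rgb):
    """Rec. 601 luma of an (R, G, B) triple; one possible 'brightness value'."""
    r, g, b = rgb
    return 0.299 * r + 0.587 * g + 0.114 * b


def extract_values(image, text_mask):
    """Return the first (whole image) and second (text portion) values.

    image: rows of (R, G, B) tuples; text_mask: rows of bools (assumed given).
    """
    all_pixels = [p for row in image for p in row]
    text_pixels = [p for row, mask_row in zip(image, text_mask)
                   for p, is_text in zip(row, mask_row) if is_text]
    first_color = mean_color(all_pixels)
    second_color = mean_color(text_pixels)
    return {
        "first_color": first_color,
        "first_brightness": brightness(first_color),
        "second_color": second_color,
        "second_brightness": brightness(second_color),
    }
```

For a two-pixel image with one light background pixel and one black text pixel, `extract_values` reports a mid-gray first color and a black second color, which is the kind of pair the adjustment module then works on.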
On the basis of the above technical solution, the present invention can also be improved as follows.
Further, the adjustment module is specifically configured to subtract the preset color value from the first color value to obtain a new first color value, add the preset color value to the second color value to obtain a new second color value, and adjust the image to be processed according to the new first color value and the new second color value;
and to subtract the preset brightness value from the first brightness value to obtain a new first brightness value, add the preset brightness value to the second brightness value to obtain a new second brightness value, and adjust the image to be processed according to the new first brightness value and the new second brightness value.
The beneficial effect of this technical feature is that the color difference and the brightness difference between the overall background and the text portion are widened, which makes the text image easier to recognize.
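The subtract-and-add rule can be sketched in a few lines of Python. This is an illustration only, assuming single-channel values in the range 0 to 255, clamping at the range limits (the source does not mention clamping), and treating every non-text pixel as background when the new values are applied to the image. The names `adjust_values` and `apply_adjustment` are hypothetical.

```python
def clamp(value, lo=0, hi=255):
    """Keep a value inside the assumed 0..255 range."""
    return max(lo, min(hi, value))


def adjust_values(first, second, preset):
    """Literal rule from the text: first - preset, second + preset."""
    return clamp(first - preset), clamp(second + preset)


def apply_adjustment(gray, text_mask, preset):
    """Shift text pixels up and all other pixels down by `preset`.

    gray: rows of ints; text_mask: rows of bools of the same shape.
    """
    return [
        [clamp(v + preset) if is_text else clamp(v - preset)
         for v, is_text in zip(row, mask_row)]
        for row, mask_row in zip(gray, text_mask)
    ]
```

The same functions can be applied once per color channel and once to the brightness channel, using the preset color value and the preset brightness value respectively.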
Further, the second extraction module is specifically configured to perform outline tracing on the text portion of the adjusted image to obtain the text outline, and to extract the text image according to the text outline.
The beneficial effect of this technical feature is that, because the color and brightness of the overall background and the text portion have been adjusted, the text portion stands out more, which makes the text image easier to extract.
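The source does not name a specific outline-tracing algorithm. One simple possibility, sketched here in Python as an assumption, is to binarize the adjusted image with a threshold, mark every text pixel that touches a non-text pixel (or the image border) as outline, and crop the text image to the bounding box of that outline. The threshold and all function names are hypothetical, and text is assumed to be brighter than the background after adjustment.

```python
def binarize(gray, threshold):
    """True where a pixel is at least `threshold` (assumed to be text)."""
    return [[v >= threshold for v in row] for row in gray]


def trace_outline(mask):
    """Return the set of (row, col) text pixels that lie on the boundary."""
    h, w = len(mask), len(mask[0])
    outline = set()
    for y in range(h):
        for x in range(w):
            if not mask[y][x]:
                continue
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                outside = not (0 <= ny < h and 0 <= nx < w)
                if outside or not mask[ny][nx]:
                    outline.add((y, x))
                    break
    return outline


def crop_to_outline(mask, outline):
    """Cut the text image out of the mask using the outline's bounding box."""
    ys = [y for y, _ in outline]
    xs = [x for _, x in outline]
    top, bottom, left, right = min(ys), max(ys), min(xs), max(xs)
    return [row[left:right + 1] for row in mask[top:bottom + 1]]
```

A production system would more likely use connected-component labelling or a library contour finder so that each character gets its own outline; this sketch treats the whole text portion as one region.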
Further, the conversion module is specifically configured to match the extracted text image against the text images in a preset character library to obtain a matching text image, and to obtain the corresponding character symbol from the matched text image.
The beneficial effect of this further feature is that the extracted text image is matched against pre-stored text images, and the corresponding character symbol is then obtained from the matched text image.
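The matching criterion is not specified in the source. As a hedged illustration, the Python sketch below models the preset character library as a dictionary from character symbol to a small binary template and picks the template that agrees with the extracted glyph on the largest fraction of pixels. The library contents, the equal-size requirement, and the function names are all assumptions made for this example.

```python
def similarity(glyph, template):
    """Fraction of pixels on which two same-sized binary images agree."""
    if len(glyph) != len(template) or len(glyph[0]) != len(template[0]):
        return 0.0  # sizes differ; a real system would rescale first
    total = len(glyph) * len(glyph[0])
    same = sum(a == b
               for row_g, row_t in zip(glyph, template)
               for a, b in zip(row_g, row_t))
    return same / total


def match_character(glyph, library):
    """Return (symbol, score) of the best-matching library template."""
    best_symbol, best_score = None, -1.0
    for symbol, template in library.items():
        score = similarity(glyph, template)
        if score > best_score:
            best_symbol, best_score = symbol, score
    return best_symbol, best_score


# Hypothetical 3x3 library with two templates.
LIBRARY = {
    "I": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    "L": [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
}
```

Calling `match_character` with a slightly damaged "L" glyph still selects "L", because it agrees with that template on more pixels than with "I".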
Another technical solution by which the present invention solves the above technical problem is as follows: a text extraction method, including:
scanning a background picture containing text to obtain an image to be processed;
extracting a first color value and a first brightness value of the image to be processed as a whole, and extracting a second color value and a second brightness value of the text portion of the image to be processed;
adjusting, according to a preset color value, the first color value of the image as a whole and the second color value of the text portion so as to increase the difference between the first color value and the second color value, and adjusting, according to a preset brightness value, the first brightness value of the image as a whole and the second brightness value of the text portion so as to increase the difference between the first brightness value and the second brightness value;
extracting a text image from the adjusted image;
converting the extracted text image into the corresponding character symbols.
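The order of the five method steps can be sketched as a small Python function that only fixes the data flow and takes each step as a callable. This is a structural illustration under assumptions: the parameter names are hypothetical, and the source does not prescribe any particular interface between the steps.

```python
def extract_text(picture, scan, measure, adjust, extract_glyphs, convert):
    """Run the five steps in order and join the recognized symbols.

    scan:           picture -> image to be processed
    measure:        image -> first/second color and brightness values
    adjust:         (image, values) -> adjusted image
    extract_glyphs: adjusted image -> iterable of text images
    convert:        text image -> character symbol (a string)
    """
    pending = scan(picture)                 # step 1: scanning
    values = measure(pending)               # step 2: first extraction
    adjusted = adjust(pending, values)      # step 3: adjustment
    glyphs = extract_glyphs(adjusted)       # step 4: second extraction
    return "".join(convert(g) for g in glyphs)  # step 5: conversion
```

Passing the steps in as parameters keeps each one replaceable, which mirrors the way the text describes the system as separate modules.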
On the basis of the above technical solution, the present invention can also be improved as follows.
Further, adjusting the first color value and the second color value according to the preset color value, and adjusting the first brightness value and the second brightness value according to the preset brightness value, specifically includes:
subtracting the preset color value from the first color value to obtain a new first color value, adding the preset color value to the second color value to obtain a new second color value, and adjusting the image to be processed according to the new first color value and the new second color value;
subtracting the preset brightness value from the first brightness value to obtain a new first brightness value, adding the preset brightness value to the second brightness value to obtain a new second brightness value, and adjusting the image to be processed according to the new first brightness value and the new second brightness value.
The beneficial effect of this further feature is that the color difference between the background and the text portion of the image to be processed is increased, which helps to obtain the image of the text portion.
Further, extracting the text image from the adjusted image specifically includes: performing outline tracing on the text portion of the adjusted image to obtain the text outline, and extracting the text image according to the text outline.
Further, converting the extracted text image into the corresponding character symbols specifically includes: matching the extracted text image against the text images in a preset character library to obtain a matching text image, and obtaining the corresponding character symbol from the matched text image.
The beneficial effect of this further feature is that the extracted text image is matched against pre-stored text images, and the corresponding character symbol is then obtained from the matched text image.
Brief description of the drawings
Fig. 1 is a block diagram of the text extraction system provided by one embodiment of the present invention;
Fig. 2 is a flowchart of the text extraction method provided by another embodiment of the present invention.
Detailed description
The principles and features of the present invention are described below with reference to the accompanying drawings. The examples given serve only to explain the invention and are not intended to limit its scope.
Fig. 1 is a block diagram of the text extraction system provided by one embodiment of the present invention.
As shown in Fig. 1, a text extraction system includes:
a scanning module, for scanning a background picture containing text to obtain an image to be processed;
a first extraction module, for extracting a first color value and a first brightness value of the image to be processed as a whole, and for extracting a second color value and a second brightness value of the text portion of the image to be processed;
an adjustment module, for adjusting, according to a preset color value, the first color value of the image as a whole and the second color value of the text portion so as to increase the difference between the first color value and the second color value, and for adjusting, according to a preset brightness value, the first brightness value of the image as a whole and the second brightness value of the text portion so as to increase the difference between the first brightness value and the second brightness value;
a second extraction module, for extracting a text image from the adjusted image;
a conversion module, for converting the extracted text image into the corresponding character symbols.
Optionally, as one embodiment of the present invention, the adjustment module is specifically configured to subtract the preset color value from the first color value to obtain a new first color value, add the preset color value to the second color value to obtain a new second color value, and adjust the image to be processed according to the new first color value and the new second color value;
and to subtract the preset brightness value from the first brightness value to obtain a new first brightness value, add the preset brightness value to the second brightness value to obtain a new second brightness value, and adjust the image to be processed according to the new first brightness value and the new second brightness value.
Optionally, as one embodiment of the present invention, the second extraction module is specifically configured to perform outline tracing on the text portion of the adjusted image to obtain the text outline, and to extract the text image according to the text outline.
Optionally, as one embodiment of the present invention, the conversion module is specifically configured to match the extracted text image against the text images in a preset character library to obtain a matching text image, and to obtain the corresponding character symbol from the matched text image.
Fig. 2 is a flowchart of the text extraction method provided by another embodiment of the present invention.
Optionally, as another embodiment of the present invention, as shown in Fig. 2, a text extraction method includes:
scanning a background picture containing text to obtain an image to be processed;
extracting a first color value and a first brightness value of the image to be processed as a whole, and extracting a second color value and a second brightness value of the text portion of the image to be processed;
adjusting, according to a preset color value, the first color value of the image as a whole and the second color value of the text portion so as to increase the difference between the first color value and the second color value, and adjusting, according to a preset brightness value, the first brightness value of the image as a whole and the second brightness value of the text portion so as to increase the difference between the first brightness value and the second brightness value;
extracting a text image from the adjusted image;
converting the extracted text image into the corresponding character symbols.
Optionally, as one embodiment of the present of invention, it is described according to pre-set color value respectively to the first color value and Second colors value is adjusted, and the first brightness value and the second brightness value are adjusted and specifically included according to predetermined luminance value:
First color value is subtracted into pre-set color and is worth to the first new color value, the second color value is added into pre-set color The second new color value is worth to, pending image is adjusted according to the first new color value and the second new color value;
First brightness value is subtracted into predetermined luminance and is worth to the first new brightness value, the second brightness value is added into predetermined luminance The second new brightness value is worth to, pending image is adjusted according to the first new brightness value and the second new brightness value.
In above-described embodiment, the aberration of the background colour in pending image and word segment is increased, is conducive to obtain text The image of character segment.
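As a worked illustration of this arithmetic, write C_1 and C_2 for the first and second color values and P for the preset color value. These symbols and the sample numbers below are introduced here for illustration and do not come from the source; range limits are ignored.

```latex
C_1' = C_1 - P, \qquad C_2' = C_2 + P, \qquad C_2' - C_1' = (C_2 - C_1) + 2P
```

For example, with C_1 = 120, C_2 = 140 and P = 30, the adjusted values are 90 and 170, so the gap grows from 20 to 80. The same relation holds for the brightness values with the preset brightness value in place of P. Read literally, the signed gap always increases by 2P, so the absolute difference is guaranteed to grow when C_2 is at least C_1; the source does not say how the case of text darker than its background is handled.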
Optionally, as one embodiment of the present of invention, word graph is extracted in the pending image after adjustment As specifically including:Word segment is carried out from the pending image after adjustment to retouch side processing, text profile is obtained, according to word Contours extract goes out character image.
Optionally, it is described that the character image extracted is converted into corresponding word as one embodiment of the present of invention Symbol specifically includes:Matched, obtained matched with the character image in default literal pool according to the character image extracted Character image, corresponding letter symbol is obtained by the character image matched.
In above-described embodiment, the character image plucked out is matched with the character image to prestore, then by matching Character image obtains corresponding letter symbol.
The reader should understand that, in this specification, descriptions referring to the terms "one embodiment", "some embodiments", "example", "specific example" or "some examples" mean that the specific features, structures, materials or characteristics described in connection with that embodiment or example are included in at least one embodiment or example of the present invention. Such references do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. In addition, provided they do not conflict with each other, those skilled in the art may combine the different embodiments or examples described in this specification and the features of those embodiments or examples.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the devices and units described above can be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, may each exist physically on their own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and is sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes instructions that cause a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the methods of the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk or an optical disc.
The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited to them. Any person skilled in the art can readily conceive of various equivalent modifications or substitutions within the technical scope disclosed by the invention, and these modifications or substitutions shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be determined by the protection scope of the claims.

Claims (8)

  1. A text extraction system, characterized by comprising:
    a scanning module, for scanning a background picture containing text to obtain an image to be processed;
    a first extraction module, for extracting a first color value and a first brightness value of the image to be processed as a whole, and for extracting a second color value and a second brightness value of the text portion of the image to be processed;
    an adjustment module, for adjusting, according to a preset color value, the first color value of the image as a whole and the second color value of the text portion so as to increase the difference between the first color value and the second color value, and for adjusting, according to a preset brightness value, the first brightness value of the image as a whole and the second brightness value of the text portion so as to increase the difference between the first brightness value and the second brightness value;
    a second extraction module, for extracting a text image from the adjusted image;
    a conversion module, for converting the extracted text image into the corresponding character symbols.
  2. The text extraction system according to claim 1, characterized in that the adjustment module is specifically configured to subtract the preset color value from the first color value to obtain a new first color value, add the preset color value to the second color value to obtain a new second color value, and adjust the image to be processed according to the new first color value and the new second color value;
    and to subtract the preset brightness value from the first brightness value to obtain a new first brightness value, add the preset brightness value to the second brightness value to obtain a new second brightness value, and adjust the image to be processed according to the new first brightness value and the new second brightness value.
  3. The text extraction system according to claim 1, characterized in that the second extraction module is specifically configured to perform outline tracing on the text portion of the adjusted image to obtain the text outline, and to extract the text image according to the text outline.
  4. The text extraction system according to claim 3, characterized in that the conversion module is specifically configured to match the extracted text image against the text images in a preset character library to obtain a matching text image, and to obtain the corresponding character symbol from the matched text image.
  5. A text extraction method, characterized by comprising:
    scanning a background picture containing text to obtain an image to be processed;
    extracting a first color value and a first brightness value of the image to be processed as a whole, and extracting a second color value and a second brightness value of the text portion of the image to be processed;
    adjusting, according to a preset color value, the first color value of the image as a whole and the second color value of the text portion so as to increase the difference between the first color value and the second color value, and adjusting, according to a preset brightness value, the first brightness value of the image as a whole and the second brightness value of the text portion so as to increase the difference between the first brightness value and the second brightness value;
    extracting a text image from the adjusted image;
    converting the extracted text image into the corresponding character symbols.
  6. The text extraction method according to claim 5, characterized in that adjusting the first color value and the second color value according to the preset color value, and adjusting the first brightness value and the second brightness value according to the preset brightness value, specifically includes:
    subtracting the preset color value from the first color value to obtain a new first color value, adding the preset color value to the second color value to obtain a new second color value, and adjusting the image to be processed according to the new first color value and the new second color value;
    subtracting the preset brightness value from the first brightness value to obtain a new first brightness value, adding the preset brightness value to the second brightness value to obtain a new second brightness value, and adjusting the image to be processed according to the new first brightness value and the new second brightness value.
  7. The text extraction method according to claim 5, characterized in that extracting the text image from the adjusted image specifically includes: performing outline tracing on the text portion of the adjusted image to obtain the text outline, and extracting the text image according to the text outline.
  8. The text extraction method according to claim 7, characterized in that converting the extracted text image into the corresponding character symbols specifically includes: matching the extracted text image against the text images in a preset character library to obtain a matching text image, and obtaining the corresponding character symbol from the matched text image.
CN201711025339.7A 2017-10-27 2017-10-27 Text extraction system and method Pending CN107909080A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711025339.7A CN107909080A (en) 2017-10-27 2017-10-27 Text extraction system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711025339.7A CN107909080A (en) 2017-10-27 2017-10-27 Text extraction system and method

Publications (1)

Publication Number Publication Date
CN107909080A true CN107909080A (en) 2018-04-13

Family

ID=61842104

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711025339.7A Pending CN107909080A (en) 2017-10-27 2017-10-27 Text extraction system and method

Country Status (1)

Country Link
CN (1) CN107909080A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108877030A (en) * 2018-07-19 2018-11-23 深圳怡化电脑股份有限公司 Image processing method, device, terminal and computer readable storage medium
CN109409377A (en) * 2018-12-03 2019-03-01 龙马智芯(珠海横琴)科技有限公司 The detection method and device of text in image

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102262619A (en) * 2010-05-31 2011-11-30 汉王科技股份有限公司 Method and device for extracting characters of document
CN102750540A (en) * 2012-06-12 2012-10-24 大连理工大学 Morphological filtering enhancement-based maximally stable extremal region (MSER) video text detection method
CN102768763A (en) * 2011-05-05 2012-11-07 方正国际软件(北京)有限公司 Method and device for outlining characters
CN104899586A (en) * 2014-03-03 2015-09-09 阿里巴巴集团控股有限公司 Method for recognizing character contents included in image and device thereof
CN106228157A (en) * 2016-07-26 2016-12-14 江苏鸿信系统集成有限公司 Coloured image word paragraph segmentation based on image recognition technology and recognition methods
CN106934386A (en) * 2017-03-30 2017-07-07 湖南师范大学 A kind of natural scene character detecting method and system based on from heuristic strategies
CN107004396A (en) * 2014-11-21 2017-08-01 乐天株式会社 Information processor, information processing method and message handling program

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102262619A (en) * 2010-05-31 2011-11-30 汉王科技股份有限公司 Method and device for extracting characters of document
CN102768763A (en) * 2011-05-05 2012-11-07 方正国际软件(北京)有限公司 Method and device for outlining characters
CN102750540A (en) * 2012-06-12 2012-10-24 大连理工大学 Morphological filtering enhancement-based maximally stable extremal region (MSER) video text detection method
CN104899586A (en) * 2014-03-03 2015-09-09 阿里巴巴集团控股有限公司 Method for recognizing character contents included in image and device thereof
CN107004396A (en) * 2014-11-21 2017-08-01 乐天株式会社 Information processor, information processing method and message handling program
CN106228157A (en) * 2016-07-26 2016-12-14 江苏鸿信系统集成有限公司 Coloured image word paragraph segmentation based on image recognition technology and recognition methods
CN106934386A (en) * 2017-03-30 2017-07-07 湖南师范大学 A kind of natural scene character detecting method and system based on from heuristic strategies

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
易博 (Yi Bo): "服务外包业中OCR前期对图片的处理" [Image preprocessing for OCR in the service outsourcing industry], 《电子测试》 [Electronic Test] *

Similar Documents

Publication Publication Date Title
US9196024B2 (en) Method and apparatus for enhancing color
EP2390838A1 (en) Image identifier extracting apparatus
Brisinello et al. Improving optical character recognition performance for low quality images
CN111127307A (en) Image processing method, image processing device, electronic equipment and computer readable storage medium
US9922404B2 (en) Inpainting device and method using segmentation of reference region
JP2006085678A (en) Image generation method, image generation apparatus, and image generation program
CN107256543B (en) Image processing method, image processing device, electronic equipment and storage medium
CN106780417A (en) A kind of Enhancement Method and system of uneven illumination image
CN110969046B (en) Face recognition method, face recognition device and computer-readable storage medium
US20150278605A1 (en) Apparatus and method for managing representative video images
CN109389560A (en) A kind of adaptive weighted filter image denoising method, device and image processing equipment
CN109064419A (en) A kind of removing rain based on single image method based on WLS filtering and multiple dimensioned sparse expression
CN107909080A (en) A kind of Word Input system and method
CN108805873A (en) Image processing method and device
Song et al. Real-time video decolorization using bilateral filtering
CN114494066B (en) A portrait sharpening method, device, equipment and medium based on Hessian filter
CN108877030B (en) Image processing method, device, terminal and computer readable storage medium
CN110545414B (en) Image sharpening method
Chen et al. Bregman-tanimoto based method for contrast preserving decolorization
Vazquez-Corral et al. Angular-based preprocessing for image denoising
CN116363010A (en) Image processing method and device
EP4287112A1 (en) Determination method, determination program, and information processing device
CN113989137A (en) Method for extracting pigmentation of facial skin image and forming spectrum of brown region
CN109558876B (en) Character recognition processing method and device
JP5265058B1 (en) Product image processing apparatus, product image processing method, information recording medium, and program

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180413