WO2014145999A3 - Searching text by optical character recognition - Google Patents

Info

Publication number
WO2014145999A3
Authority
WO
WIPO (PCT)
Prior art keywords
character
character recognition
optical character
ocr
searching text
Prior art date
Application number
PCT/US2014/030867
Other languages
French (fr)
Other versions
WO2014145999A2 (en)
Inventor
Sergio David SUAREZ Jr.
Joshua Daniel MESKE
Original Assignee
Suarez Sergio David Jr
Meske Joshua Daniel
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suarez Sergio David Jr and Meske Joshua Daniel
Publication of WO2014145999A2
Publication of WO2014145999A3

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00: Arrangements for image or video recognition or understanding
    • G06V10/98: Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Abstract

A method for generating a character-by-character substitution in an optical character recognition (OCR) text output of a document including at least one character includes executing, on a processor, instructions for substituting an OCR key for the at least one character. The instructions include: identifying a class corresponding to the at least one character, wherein the class includes a character shape corresponding to at least a portion of the at least one character; substituting the OCR key corresponding to the character shape for the at least one character; and generating a searchable substituted document including the OCR key.
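
The three steps in the abstract (identify a shape class, substitute its OCR key, produce a searchable document) can be illustrated with a minimal Python sketch. The abstract does not list the classes or keys, so the `SHAPE_CLASSES` table, the function names `substitute_ocr_keys` and `search`, and the choice of which characters count as similar in shape are all assumptions made for illustration, not the applicants' actual mapping.

```python
# Hypothetical shape classes (not taken from the patent): each OCR key
# stands for a group of characters with a similar shape that an OCR
# engine might confuse with one another.
SHAPE_CLASSES = {
    "1": "1lI|",   # tall vertical stroke
    "0": "0Oo",    # closed round shape
    "5": "5Ss",    # S-like curve
    "8": "8B",     # two stacked loops
}

# Invert the table so each character maps directly to its OCR key.
CHAR_TO_KEY = {ch: key for key, chars in SHAPE_CLASSES.items() for ch in chars}


def substitute_ocr_keys(text):
    """Replace each character that belongs to a shape class with its OCR key.

    Characters outside every class are kept unchanged, so the output has
    the same length as the input and offsets are preserved.
    """
    return "".join(CHAR_TO_KEY.get(ch, ch) for ch in text)


def search(document, query):
    """Return the start offsets of query in document after key substitution.

    Both sides are normalized with the same substitution, so an OCR
    misreading inside one shape class (for example "1" for "I") still matches.
    """
    substituted_doc = substitute_ocr_keys(document)
    substituted_query = substitute_ocr_keys(query)
    if not substituted_query:
        return []
    hits = []
    start = substituted_doc.find(substituted_query)
    while start != -1:
        hits.append(start)
        start = substituted_doc.find(substituted_query, start + 1)
    return hits
```

Under these assumed classes, `search("B1LL OF SALE", "BILL")` finds the misread word at offset 0, because both strings normalize to a form beginning with `81LL`. The trade-off of this kind of normalization is more false positives: distinct words that differ only within a shape class become indistinguishable.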
PCT/US2014/030867 2013-03-15 2014-03-17 System and method for searching through text transcribed from an image processed by optical character recognition WO2014145999A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201361798223P 2013-03-15 2013-03-15
US61/798,223 2013-03-15

Publications (2)

Publication Number Publication Date
WO2014145999A2 (en) 2014-09-18
WO2014145999A3 (en) 2014-11-06

Family

ID=51538590

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/030867 System and method for searching through text transcribed from an image processed by optical character recognition (WO2014145999A2, en) 2013-03-15 2014-03-17

Country Status (1)

Country Link
WO (1) WO2014145999A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114648002B (en) * 2020-12-17 2024-12-24 永中软件股份有限公司 Method for outputting multiple Office document content images through multiple processes

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7415171B2 (en) * 2005-03-28 2008-08-19 United States Postal Service Multigraph optical character reader enhancement systems and methods
US20100246963A1 (en) * 2009-03-26 2010-09-30 Al-Muhtaseb Husni A Automatic arabic text image optical character recognition method

Also Published As

Publication number Publication date
WO2014145999A2 (en) 2014-09-18

Similar Documents

Publication Publication Date Title
WO2015200110A3 (en) Techniques for machine language translation of text from an image based on non-textual context information from the image
WO2016109307A3 (en) Discriminating ambiguous expressions to enhance user experience
WO2013009578A3 (en) Systems and methods for speech command processing
CA2879417A1 (en) Structured search queries based on social-graph information
EP3136257A3 (en) Document-specific gazetteers for named entity recognition
EP2757487A3 (en) Machine translation-driven authoring system and method
WO2011159460A3 (en) Identifying establishments in images
MX350680B (en) Grammar model for structured search queries.
PH12015000372A1 (en) Conversion of documents of different types to a uniform and an editable or a searchable format
EP4428742A3 (en) Enhancing reading accuracy, efficiency and retention
EP2811414A3 (en) Confidence-driven rewriting of source texts for improved translation
MX370232B (en) Learning and using contextual content retrieval rules for query disambiguation.
WO2014209810A3 (en) Methods and apparatuses for mining synonymous phrases, and for searching related content
Matthewson et al. Inchoativity meets the perfect time span: The Niuean perfect
WO2012134972A3 (en) Systems and methods for paragraph-based document searching
GB2542053A (en) Automatically generating a semantic mapping for a relational database
Yang et al. Multi-criteria semantic dominance: a linguistic decision aiding technique based on incomplete preference information
BR112014026626A8 (en) creating social networking groups
WO2015038408A3 (en) Creating inforgraphics from text data in electronic documents
WO2019022567A3 (en) Method for automatically providing gesture-based auto-complete suggestions and electronic device thereof
CL2016000984A1 (en) System and method for implementing multi-faceted search queries
WO2013025624A3 (en) Searching encrypted electronic books
WO2012122212A3 (en) Processing medical records
MY194297A (en) A method and device for providing search engine label
WO2017106610A8 (en) Method and system for providing automated localized feedback for an extracted component of an electronic document file

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 14763293; Country of ref document: EP; Kind code of ref document: A2)
122 EP: PCT application non-entry in European phase (Ref document number: 14763293; Country of ref document: EP; Kind code of ref document: A2)