OCR and text recognition
2,599 Followers
Recent papers in OCR and text recognition
Distortion is the moment at which the physical means of transmitting a text irrupt into a reader's experience of it. I will discuss distortion here as a phenomenon occurring in printed materials, but I do not wish to exclude other... more
This research focused on the methods of optical character recognition technologies that applied to android-based translator application. The high number of deaf-born babies in Indonesia causes a high number of children who will grow but... more
Artificial neural networks are information processing systems that have characteristics similar to human neural networks. Learning models need to be done on an artificial neural network before being used to solve problems by examining and... more
In the last decade, there has been international interest in providing special e-services for our social partnership that are visually disabled. In particular, they face hard and numerous problems in their daily activities such as paying... more
Optical character recognition for the English text may be considered one of the most important research topics, whether, printed or handwritten. Although excellent results have been reached in the English text, there is a lack of this... more
BanglaOCR is currently the only open source optical character recognition (OCR) software for the Bangla (Bengali) script developed by the Center for Research on Bangla Language Processing (CRBLP). Tesseract, maintained by Google, is... more
A few decades ago, research in the field of Character Recognition was limited to document images acquired with flatbed desktop scanners. The usability of such systems is limited as they are not portable because of large size of the... more
The paper is a comprehensive review of the current research trends in the area of Arabic language especially state-of-the-art approaches to highlight the current status of diverse research aspects of that area to facilitate the adaption... more
Mass-digitization projects in libraries have received world-wide attention. Today a significant amount of historical books, newspapers and journals are already digitized and available online for search and retrieval. The situation is... more
The purpose of this project is to develop an Automatic gate control application which recognizes license plates from cars at entrance gate and take an action to let cars enter or not. And regular PC with camera, catches image frame when... more
Optical character recognition is the machine replication of hu- man reading . It can be described as Mechanical or electronic conversion of scanned images where images can be hand written, type- written or printed text. This paper... more
Optical character recognition has remained a challenge for comics, given the high variability of placement of text on the page, the wide variety of frequently handwritten fonts, and the limited availability and small size of datasets.... more
This paper was written to create awareness among the library professionals about AIDC (Automatic Identification and Data Capture) technologies and their potential application in libraries/information centres. Data entry is an... more
Abstract: In current situation, we come across various problems in traffic regulations in India which can be solved with different ideas. Riding motorcycle/mopeds without wearing helmet is a traffic violation which has resulted in... more
Optical character recognition is popular field for researchers during last decade of research, which is able to successfully recognize the scanned English image into editable text form. However, optical character systems for other... more
In this research we propose a statistical method and morpho-lexical analysis for correcting Arabic words as a post processor for Arabic words output from OCR systems. Dictionaries of words were built for the comparison to the attached... more
UNIVERSIDAD DE TURKU Instituto de Lenguas y Traducción / Facultad de Humanidades SAMMALVUO, TAPANI: Trabajo terminológico basado en corpus: caso práctico de terminología del ajedrez Trabajo de fin de máster, 74 pág., 7 pág. de... more
— Meter reading is an Application designed to automatically collecting consumption, diagnostic, and status data from utility meters and transferring the retrieved data to a central database for billing, troubleshooting, and analyzing etc.... more
The paper discusses the automatic text recognition capabilities of neural network models specifically trained to recognize different styles of Church Slavonic handwriting within the software platform Transkribus. Computed character error... more
This issue (DOI: 10.13140/RG.2.2.13132.92807) includes the following articles; P1151705543, author="Tieling Chen", title="Detail Preserving Sorted Difference Filter", P1151738585, author="Vladimir A. Kulyukin", title="GreedyHaarSpiker: An... more
Studies on handwriting recognition systems have gain a great attention since it has been considered as an important technology in computer science, especially that handwriting documents have continued to be the most used mean of... more
A well designed image database is very necessary for the recognition of any language and many of language of the world have their own database for the text recognition. In this paper we are presenting the comprehensive database for Sindhi... more
There are two most popular writing styles of Urdu i.e. Naskh and Nastaliq. Considering Arabic OCR research, ample amount of work has been done on Naskh writing style; focusing on Urdu, which uses Arabic character set commonly used... more
The purpose of this research is to improve the recognition rate of on-line Arabic handwriting recognition using HMM (Hidden Markov Model). Delayed strokes are removed from the on-line Arabic word to avoid the difficulty and the... more
Da molti anni ormai mezzi di stampa e addetti ai lavori sottolineano le alterne vicende del libro cartaceo e del suo equivalente digitale, l’e-book. Oltre che per le consuete, scoraggianti statistiche sull’abitudine alla lettura, gli... more
The overall objective of READ is to implement a Virtual Research Environment where archivists, humanities scholars, computer scientists and volunteers are collaborating with the ultimate goal of boosting research, innovation, development... more
The paper deals with the programs and web applications useful in the translation and interpretation studies, as well as in translation in practice.
This paper proposes a method to detect and identify the vehicle number plate that will help in the linking it with owner's bank account or a pre paid account for automatic deduction of the parking fee. This system is an application of... more
This paper presents a novel hybrid method for extracting license plates and recognizing characters from the digital camera image using morphological operations. The main problem in extracting text from the images is caused by several... more
Письменные источники - это документы, с которыми связана история как наука. Документы истории и история документов тесно переплетены. Автоматический контент-анализ и компьютерная обработка исторических текстов обрели свое собственное... more
Werden für eine Liegenschaftsvermessung historische Katasterunterlagen verwendet, ist es oft notwendig, die handschriftlichen Aufzeichnungen der früheren Berufskollegen zu entziffern. Was vor einigen Jahrzehnten vielen noch ohne allzu... more
The technologies underlying fire and smoke detection systems play a crucial role in ensuring and delivering optimal performance in modern surveillance environments. In fact, fire can cause significant damage to lives and properties.... more