Sankar et al., 2006 - Google Patents
Digitizing a million books: Challenges for document analysisSankar et al., 2006
View PDF- Document ID
- 14348063909477179129
- Author
- Sankar K
- Ambati V
- Pratha L
- Jawahar C
- Publication year
- Publication venue
- International Workshop on Document Analysis Systems
External Links
Snippet
This paper describes the challenges for document image analysis community for building large digital libraries with diverse document categories. The challenges are identified from the experience of the on-going activities toward digitizing and archiving one million books …
- 238000004458 analytical method 0 title description 8
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30011—Document retrieval systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30067—File systems; File servers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/2054—Selective acquisition/locating/processing of specific regions, e.g. highlighted text, fiducial marks, predetermined fields, document type identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99951—File or database maintenance
- Y10S707/99952—Coherency, e.g. same view to multiple users
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sankar et al. | Digitizing a million books: Challenges for document analysis | |
Hazen et al. | Selecting Research Collections for Digitization. | |
US7561310B2 (en) | Method and apparatus for digital scanning and archiving | |
US20060235855A1 (en) | Digital library system | |
Balasubramanian et al. | Retrieval from document image collections | |
US20080162602A1 (en) | Document archiving system | |
Miller | All text considered: A perspective on mass digitizing and archival processing | |
US6810136B2 (en) | System and method for automatic preparation of data repositories from microfilm-type materials | |
Stančić et al. | Optimisation of archival processes involving digitisation of typewritten documents | |
WO2001013279A2 (en) | Word searchable database from high volume scanning of newspaper data | |
Bhat | Development of digital libraries in India: A survey of digital collection of national digital library of India | |
Ling et al. | Taking it to the people: Why the National Archives of Australia embraced digitisation on demand | |
Yacoub et al. | Document digitization lifecycle for complex magazine collection | |
Chakravarty | E-Preservation of Old and Rare Books: A Structured Approach for Creating a Digital Collection | |
US20060204141A1 (en) | Method and system of converting film images to digital format for viewing | |
Moore | Strategies for implementing a mass digitization program | |
Galloway et al. | The Heinz Electronic Library Interactive On-line System (HELIOS): An Update | |
Bender | An Optical Disk‐Based Information Retrieval System | |
Henthorne | Digitization and the creation of virtual libraries: the Princeton University image card catalog-reaping the benefits of imaging | |
ROY | Development of Digital Archive: Initiatives Taken by Central Library, University of Calcutta. | |
Zhao et al. | Technical issues on the China-US million book digital library project | |
Joergensen | The Rutgers Law Library US Congressional Documents Digitization Collection | |
JP4947763B2 (en) | Information retrieval method | |
Brown et al. | TIME'S PIVOTAL POINT–PRESERVING THE | |
Amollo | Digitization and digital libraries: Where and how do you begin?(Practical approach) |