


18th ICDAR 2024: Athens, Greece - Part IV
- Elisa H. Barney Smith, Marcus Liwicki, Liangrui Peng:
Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30 - September 4, 2024, Proceedings, Part IV. Lecture Notes in Computer Science 14807, Springer 2024, ISBN 978-3-031-70545-8
Layout Analysis and Document Classification
- Yamato Okamoto, Youngmin Baek, Geewook Kim, Ryota Nakao, DongHyun Kim, Moonbin Yim, Seunghyun Park, Bado Lee:
CREPE: Coordinate-Aware End-to-End Document Parser. 3-20
- Tahira Shehzadi, Didier Stricker, Muhammad Zeshan Afzal:
A Hybrid Approach for Document Layout Analysis in Document Images. 21-39
- Jiawei Wang, Kai Hu, Qiang Huo:
DLAFormer: An End-to-End Transformer For Document Layout Analysis. 40-57
- Francisco J. Castellanos, Juan P. Martinez-Esteso, Alejandro Galán-Cuenca, Antonio Javier Gallego:
A Region-Based Approach for Layout Analysis of Music Score Images in Scarce Data Scenarios. 58-75
- Qilin Deng, Mayire Ibrayim, Askar Hamdulla, Hailong Luo, Chunhu Zhang:
Doc-DINO: A Transformer Model for Complex Logical Document Layout Analysis. 76-89
- Lei Kang, Mohamed Ali Souibgui, Fei Yang, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas:
Machine Unlearning for Document Classification. 90-102
- Saifullah Saifullah, Stefan Agne, Andreas Dengel, Sheraz Ahmed:
DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification. 103-123
- Sankalp Sinha, Muhammad Saif Ullah Khan, Talha Uddin Sheikh, Didier Stricker, Muhammad Zeshan Afzal:
CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification. 124-141
- Marcel Lamott, Yves-Noel Weweler, Adrian Ulges, Faisal Shafait, Dirk Krechel, Darko Obradovic:
LAPDoc: Layout-Aware Prompting for Documents. 142-159
- Wiam Adnan, Joël Tang, Yassine Bel Khayat Zouggari, Seif Edinne Laatiri, Laurent Lam, Fabien Caspani:
A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents. 160-174
- Anna Scius-Bertrand, Atefeh Fakhari, Lars Vögtlin, Daniel Ribeiro Cabral, Andreas Fischer:
Are Layout Analysis and OCR Still Useful for Document Information Extraction Using Foundation Models? 175-191
Machine Learning Methods
- Jordy Van Landeghem, Subhajit Maity, Ayan Banerjee, Matthew B. Blaschko, Marie-Francine Moens, Josep Lladós, Sanket Biswas:
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications. 195-217
- Martin Kiss, Michal Hradis:
Self-supervised Pre-training of Text Recognizers. 218-235
- Qiangang Pan, Yahong Hu, Youbai Xie, Xianghui Meng, Yilun Zhang:
Deep Learning-Driven Innovative Model for Generating Functional Knowledge Units. 236-252
- Wenjun Sun, Tran Thi Hong Hanh, Carlos-Emiliano González-Gallardo, Mickaël Coustaty, Antoine Doucet:
Global-SEG: Text Semantic Segmentation Based on Global Semantic Pair Relations. 253-269
- Omar Hamed, Souhail Bakkali, Matthew B. Blaschko, Sien Moens, Jordy Van Landeghem:
Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting. 270-286
- Yiran Zhao, Di Wu, Shuqi Dai, Tong Li:
Integrating Dependency Type and Directionality into Adapted Graph Attention Networks to Enhance Relation Extraction. 287-305
- Manh-Tu Vu, Marie Beurton-Aimar:
ViT-ED: Transformer Network for Image Similarity Measurement. 306-323
- Jerod Weinman, Amelia Gómez Grabowska, Dimosthenis Karatzas:
Counting the Corner Cases: Revisiting Robust Reading Challenge Data Sets, Evaluation Protocols, and Metrics. 324-342
- Weiguang Zhang, Qiufeng Wang, Kaizhu Huang, Xiaomeng Gu, Fengjun Guo:
Coarse-to-Fine Document Image Registration for Dewarping. 343-358
- Daria M. Ershova, Alexander V. Gayer, Alexander Sheshkus, Vladimir V. Arlazarov:
An Ultra-lightweight Approach for Machine Readable Zone Detection via Semantic Segmentation and Fast Hough Transform. 359-374
- Tong Zhang, Jianing Zhang, Rong Yan:
Synergistic Diverse Perspective for Topic Evolution Analysis on Weibo. 375-388
- Sho Shimotsumagari, Shumpei Takezaki, Daichi Haraguchi, Seiichi Uchida:
Cross-Domain Image Conversion by CycleDM. 389-406
- Ahana Kundu, Ujjwal Bhattacharya:
YOLO Assisted A* Algorithm for Robust Line Segmentation of Degraded Document Images. 407-424
- George Retsinas, Konstantina Nikolaidou, Giorgos Sfikas:
Enhancing CRNN HTR Architectures with Transformer Blocks. 425-440
- Yujie Lu, Dean Wu, Yuhong Zhang:
Dynamic Reasoning with Language Model and Knowledge Graph for Question Answering. 441-455
