


16th ICDAR 2021: Lausanne, Switzerland - Part II
- Josep Lladós, Daniel Lopresti, Seiichi Uchida:
16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Switzerland, September 5-10, 2021, Proceedings, Part II. Lecture Notes in Computer Science 12822, Springer 2021, ISBN 978-3-030-86330-2
Document Analysis for Literature Search
- Rongyu Cao, Hongwei Li, Ganbin Zhou, Ping Luo:
Towards Document Panoptic Segmentation with Pinpoint Accuracy: Method and Evaluation. 3-18
- Ayush Kumar Shah, Abhisek Dey, Richard Zanibbi:
A Math Formula Extraction and Evaluation Framework for PDF Documents. 19-34
- Laura E. Brandt, William T. Freeman:
Toward Automatic Interpretation of 3D Plots. 35-50
Document Summarization and Translation
- Marta Esther Vicente, Robiert Sepúlveda-Torres, Cristina Barros, Estela Saquete, Elena Lloret:
Can Text Summarization Enhance the Headline Stance Detection Task? Benefits and Drawbacks. 53-67
- Justin Wood, Wei Wang, Corey W. Arnold:
The Biased Coin Flip Process for Nonparametric Topic Modeling. 68-83
- Sayali Kulkarni, Sheide Chammas, Wan Zhu, Fei Sha, Eugene Ie:
CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-document Summarization. 84-98
- Tonghua Su, Shuchen Liu, Shengjie Zhou:
RTNet: An End-to-End Method for Handwritten Text Image Translation. 99-113
Multimedia Document Analysis
- Ziyi Zhu, Liangcai Gao, Yibo Li, Yilun Huang, Lin Du, Ning Lu, Xianfeng Wang:
NTable: A Dataset for Camera-Based Table Detection. 117-129
- Tianqi Ji, Jun Li, Jianhua Xu:
Label Selection Algorithm Based on Boolean Interpolative Decomposition with Sequential Backward Selection for Multi-label Classification. 130-144
- Quang Huy Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen, Masaki Nakagawa:
GSSF: A Generative Sequence Similarity Function Based on a Seq2Seq Model for Clustering Online Handwritten Mathematical Answers. 145-159
- Vaibhavi Gupta, Vinay Detani, Vivek Khokar, Chiranjoy Chattopadhyay:
C2VNet: A Deep Learning Framework Towards Comic Strip to Audio-Visual Scene Synthesis. 160-175
- Jie He, Xingjiao Wu, Wenxin Hu, Jing Yang:
LSTMVAEF: Vivid Layout via LSTM-Based Variational Autoencoder Framework. 176-189
Mobile Text Recognition
- Andrii Grygoriev, Illya Degtyarenko, Ivan Deriuga, Serhii Polotskyi, Volodymyr Melnyk, Dmytro Zakharchuk, Olga Radyvonenko:
HCRNN: A Novel Architecture for Fast Online Handwritten Stroke Classification. 193-208
- Daniil Matalov, Elena Limonova, Natalya Skoryukina, Vladimir V. Arlazarov:
RFDoc: Memory Efficient Local Descriptors for ID Documents Localization and Classification. 209-224
- Haibo Qin, Chun Yang, Xiaobin Zhu, Xu-Cheng Yin:
Dynamic Receptive Field Adaptation for Attention-Based Text Recognition. 225-239
- Ryota Yoshihashi, Tomohiro Tanaka, Kenji Doi, Takumi Fujino, Naoaki Yamashita:
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition. 240-257
- Yulia S. Chernyshova, Ekaterina Emelianova, Alexander Sheshkus, Vladimir V. Arlazarov:
MIDV-LAIT: A Challenging Dataset for Recognition of IDs with Perso-Arabic, Thai, and Indian Scripts. 258-272
- Konstantin B. Bulatov, Vladimir V. Arlazarov:
Determining Optimal Frame Processing Strategies for Real-Time Document Recognition Systems. 273-288
Document Analysis for Social Good
- Eugen Rusakov, Turna Somel, Gerfrid G. W. Müller, Gernot A. Fink:
Embedded Attributes for Cuneiform Sign Spotting. 291-305
- Adrià Molina, Pau Riba, Lluís Gómez, Oriol Ramos Terrades, Josep Lladós:
Date Estimation in the Wild of Scanned Historical Photos: An Image Retrieval Approach. 306-320
- Muhammad Osama Zeeshan, Imran Siddiqi, Momina Moetesum:
Two-Step Fine-Tuned Convolutional Neural Networks for Multi-label Classification of Children's Drawings. 321-334
- Tamal Chowdhury, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Ramachandra Raghavendra, Sukalpa Chanda:
DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection Through Skin Region. 335-350
- Fatma Najar, Nizar Bouguila:
Sparse Document Analysis Using Beta-Liouville Naive Bayes with Vocabulary Knowledge. 351-363
- Sk Md Obaidullah, Mridul Ghosh, Himadri Mukherjee, Kaushik Roy, Umapada Pal:
Automatic Signature-Based Writer Identification in Mixed-Script Scenarios. 364-377
Indexing and Retrieval of Documents
- Pau Riba, Adrià Molina, Lluís Gómez, Oriol Ramos Terrades, Josep Lladós:
Learning to Rank Words: Optimizing Ranking Metrics for Word Spotting. 381-395
- Trung Tan Ngo, Hung Tuan Nguyen, Masaki Nakagawa:
A-VLAD: An End-to-End Attention-Based Neural Network for Writer Identification in Historical Documents. 396-409
- Nhu-Van Nguyen, Christophe Rigaud, Arnaud Revel, Jean-Christophe Burie:
Manga-MMTL: Multimodal Multitask Transfer Learning for Manga Character Analysis. 410-425
- Enrique Vidal, Alejandro H. Toselli:
Probabilistic Indexing and Search for Hyphenated Words. 426-442
Physical and Logical Layout Analysis
- Sieben Bocklandt, Gust Verbruggen, Thomas Winters:
SandSlide: Automatic Slideshow Normalization. 445-461
- Alejandro H. Toselli, Si Wu, David A. Smith:
Digital Editions as Distant Supervision for Layout Analysis of Printed Books. 462-476
- Prema Satish Sharan, Sowmya Aitha, Amandeep Kumar, Abhishek Trivedi, Aaron Augustine, Ravi Kiran Sarvadevabhatla:
Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts. 477-491
- Oldrich Kodym, Michal Hradis:
Page Layout Analysis System for Unconstrained Historic Documents. 492-506
- José Ramón Prieto, Enrique Vidal:
Improved Graph Methods for Table Layout Understanding. 507-522
- Berat Kurar Barakat, Ahmad Droby, Raid Saabni, Jihad El-Sana:
Unsupervised Learning of Text Line Segmentation by Differentiating Coarse Patterns. 523-537
Recognition of Tables and Formulas
- Yibo Li, Yilun Huang, Ziyi Zhu, Lemeng Pan, Yongshuai Huang, Lin Du, Zhi Tang, Liangcai Gao:
Rethinking Table Structure Recognition Using Sequence Labeling Methods. 541-553
- Harsh Desai, Pratik Kayal, Mayank Singh:
TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables. 554-569
- Wenqi Zhao, Liangcai Gao, Zuoyu Yan, Shuai Peng, Lin Du, Ziyin Zhang:
Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer. 570-584
- Umar Khan, Sohaib Zahid, Muhammad Asad Ali, Adnan Ul-Hasan, Faisal Shafait:
TabAug: Data Driven Augmentation for Enhanced Table Structure Recognition. 585-601
- Haisong Ding, Kai Chen, Qiang Huo:
An Encoder-Decoder Approach to Handwritten Mathematical Expression Recognition with Multi-head Attention and Stacked Decoder. 602-616
- Cuong Tuan Nguyen, Thanh-Nghia Truong, Hung Tuan Nguyen, Masaki Nakagawa:
Global Context for Improving Recognition of Online Handwritten Mathematical Expressions. 617-631
- Koji Ichikawa:
Image-Based Relation Classification Approach for Table Structure Recognition. 632-647
- Shuai Peng, Liangcai Gao, Ke Yuan, Zhi Tang:
Image to LaTeX with Graph Neural Network for Mathematical Formula Recognition. 648-663
NLP for Document Understanding
- Badal Agrawal, Mohit Mishra, Varun Parashar:
A Novel Method for Automated Suggestion of Similar Software Incidents Using 2-Stage Filtering: Findings on Primary Data. 667-682
- Lianxi Wang, Xiaotian Lin, Nankai Lin:
Research on Pseudo-label Technology for Multi-label News Classification. 683-698
- Ahmed Hamdi, Elodie Carel, Aurélie Joseph, Mickaël Coustaty, Antoine Doucet:
Information Extraction from Invoices. 699-714
- Apoorva Singh, Sriparna Saha:
Are You Really Complaining? A Multi-task Framework for Complaint Identification, Emotion, and Sentiment Classification. 715-731
- Rafal Powalski, Lukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michal Pietruszka, Gabriela Palka:
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer. 732-747
- Luisa März, Stefan Schweter, Nina Pörner, Benjamin Roth, Hinrich Schütze:
Data Centric Domain Adaptation for Historical Text with OCR Errors. 748-761
- Nafaa Haffar, Rami Ayadi, Emna Hkiri, Mounir Zrigui:
Temporal Ordering of Events via Deep Neural Networks. 762-777
- Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny:
Document Collection Visual Question Answering. 778-792
- Jirí Martínek, Pavel Král, Ladislav Lenc:
Dialogue Act Recognition Using Visual Information. 793-807
- Oliver Tüselmann, Fabian Wolf, Gernot A. Fink:
Are End-to-End Systems Really Necessary for NER on Handwritten Document Images? 808-822
- Harsh Kohli:
Training Bi-Encoders for Word Sense Disambiguation. 823-837
- Freddy C. Chua, Nigel P. Duffy:
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction. 838-853
- Djedjiga Belhadj, Yolande Belaïd, Abdel Belaïd:
Consideration of the Word's Neighborhood in GATs for Information Extraction in Semi-structured Documents. 854-869
