15th ICDAR 2019: Sydney, NSW, Australia
- 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia, September 20-25, 2019. IEEE 2019, ISBN 978-1-7281-3014-9
Oral Session 1: Handwritten Text Recognition
- Chris Tensmeyer, Curtis Wigington:
Training Full-Page Handwritten Text Recognition Models without Annotated Line Breaks. 1-8
- Shanyu Xiao, Liangrui Peng, Ruijie Yan, Shengjin Wang:
Deep Network with Pixel-Level Rectification and Robust Training for Handwriting Recognition. 9-16
- R. Reeve Ingle, Yasuhisa Fujii, Thomas Deselaers, Jonathan Baccash, Ashok C. Popat:
A Scalable Handwritten Text Recognition System. 17-24
- Dezhi Peng, Lianwen Jin, Yaqiang Wu, Zhepeng Wang, Mingxiang Cai:
A Fast and Accurate Fully Convolutional Network for End-to-End Handwritten Chinese Text Segmentation and Recognition. 25-30
- Martin Schall, Marc-Peter Schambach, Matthias O. Franz:
Dissecting Multi-line Handwriting for Multi-dimensional Connectionist Classification. 31-38
Oral Session 2: Document Image Processing
- Osman Tursun, Rui Zeng, Simon Denman, Sabesan Sivapalan, Sridha Sridharan, Clinton Fookes:
MTRNet: A Generic Scene Text Eraser. 39-44
- Xujun Peng, Chao Wang, Huaigu Cao:
Document Binarization via Multi-resolutional Attention Model with DRD Loss. 45-50
- Ranajit Saha, Ajoy Mondal, C. V. Jawahar:
Graphical Object Detection in Document Images. 51-58
- Anupama Ray, Manoj Sharma, Avinash Upadhyay, Megh Makwana, Santanu Chaudhury, Akkshita Trivedi, Ajay Pratap Singh, Anil K. Saini:
An End-to-End Trainable Framework for Joint Optimization of Document Enhancement and Recognition. 59-64
- Ranjan Mondal, Deepayan Chakraborty, Bhabatosh Chanda:
Learning 2D Morphological Network for Old Document Image Binarization. 65-70
Oral Session 3: Document Understanding
- Rajiv Jain, Curtis Wigington:
Multimodal Document Image Classification. 71-77
- Xusen Yin, Nada Aldarrab, Beáta Megyesi, Kevin Knight:
Decipherment of Historical Manuscript Images. 78-85
- Yosuke Onitsuka, Wataru Ohyama, Seiichi Uchida:
Training Convolutional Autoencoders with Metric Learning. 86-91
- Julien Maitre, Michel Ménard, Guillaume Chiron, Alain Bouju, Nicolas Sidère:
A Meaningful Information Extraction System for Interactive Analysis of Documents. 92-99
- Najah-Imane Bentabet, Rémi Juge, Sira Ferradans:
Table-of-Contents Generation on Contemporary Documents. 100-107
- Alejandro Héctor Toselli, Verónica Romero-Gomez, Joan-Andreu Sánchez, Enrique Vidal-Ruiz:
Making Two Vast Historical Manuscript Collections Searchable and Extracting Meaningful Textual Features Through Large-Scale Probabilistic Indexing. 108-113
Oral Session 4: Table Analysis
- Chris Tensmeyer, Vlad I. Morariu, Brian L. Price, Scott Cohen, Tony R. Martinez:
Deep Splitting and Merging for Table Structure Decomposition. 114-121
- Pau Riba, Anjan Dutta, Lutz Goldmann, Alicia Fornés, Oriol Ramos Terrades, Josep Lladós:
Table Detection in Invoice Documents by Graph Neural Networks. 122-127
- Shubham Singh Paliwal, Vishwanath D, Rohit Rahul, Monika Sharma, Lovekesh Vig:
TableNet: Deep Learning Model for End-to-end Table Detection and Tabular Data Extraction from Scanned Document Images. 128-133
- Brian L. Davis, Bryan S. Morse, Scott Cohen, Brian L. Price, Chris Tensmeyer:
Deep Visual Template-Free Form Parsing. 134-141
- Shah Rukh Qasim, Hassan Mahmood, Faisal Shafait:
Rethinking Table Recognition using Graph Neural Networks. 142-147
- Hubert Mara, Bartosz Bogacz:
Breaking the Code on Broken Tablets: The Learning Challenge for Annotated Cuneiform Script in Normalized 2D and 3D Datasets. 148-153
Poster Session 1
- Rohit Saluja, Ayush Maheshwari, Ganesh Ramakrishnan, Parag Chaudhuri, Mark J. Carman:
OCR On-the-Go: Robust End-to-End Systems for Reading License Plates & Street Signs. 154-159
- Rohit Saluja, Mayur Punjabi, Mark J. Carman, Ganesh Ramakrishnan, Parag Chaudhuri:
Sub-Word Embeddings for OCR Corrections in Highly Fusional Indic Languages. 160-165
- Chuang Li, Feng Lin, Zhiyong Wang, Gang Yu, Liou Yuan, Haiqiang Wang:
DeepHSV: User-Independent Offline Signature Verification Using Two-Channel CNN. 166-171
- Chris Tensmeyer, Mike Brodie, Daniel Saunders, Tony R. Martinez:
Generating Realistic Binarization Data with Generative Adversarial Networks. 172-177
- Junyang Cai, Liangrui Peng, Yejun Tang, Changsong Liu, Pengchao Li:
TH-GAN: Generative Adversarial Network Based Transfer Learning for Historical Chinese Character Recognition. 178-183
- Victor Storchan:
Data Augmentation via Adversarial Networks for Optical Character Recognition/Conference Submissions. 184-189
- Chandranath Adak, Bidyut B. Chaudhuri, Chin-Teng Lin, Michael Blumenstein:
Detecting Named Entities in Unstructured Bengali Manuscript Images. 196-201
- Guangwei Zhang, Yinliang Zhao:
Target-Directed MixUp for Labeling Tangut Characters. 202-207
- Qingqing Wang, Wenjing Jia, Xiangjian He, Yue Lu, Michael Blumenstein, Ye Huang, Shujing Lyu:
DeepText: Detecting Text from the Wild with Multi-ASPP-Assembled DeepLab. 208-213
- Sivan Keret, Lior Wolf, Nachum Dershowitz, Eric Werner, Orna Almogi, Dorji Wangchuk:
Transductive Learning for Reading Handwritten Tibetan Manuscripts. 214-221
- Majeed Kassis, Jihad El-Sana:
Learning Free Line Detection in Manuscripts using Distance Transform Graph. 222-227
- Nishatul Majid, Elisa H. Barney Smith:
Segmentation-Free Bangla Offline Handwriting Recognition using Sequential Detection of Characters and Diacritics with a Faster R-CNN. 228-233
- Zhichao Fu, Yu Kong, Yingbin Zheng, Hao Ye, Wenxin Hu, Jing Yang, Liang He:
Cascaded Detail-Preserving Networks for Super-Resolution of Document Images. 240-245
- Daoerji Fan, Guanglai Gao, Huijuan Wu:
Sub-Word Based Mongolian Offline Handwriting Recognition. 246-253
- He Guo, Xiameng Qin, Jiaming Liu, Junyu Han, Jingtuo Liu, Errui Ding:
EATEN: Entity-Aware Attention for Single Shot Visual Text Extraction. 254-259
- Mohammed Al-Rawi, Ernest Valveny, Dimosthenis Karatzas:
Can One Deep Learning Model Learn Script-Independent Multilingual Word-Spotting? 260-267
- Romain Karpinski, Abdel Belaïd:
Semi-Synthetic Data Augmentation of Scanned Historical Documents. 268-273
- Yunlong Huang, Canjie Luo, Lianwen Jin, Qingxiang Lin, Weiying Zhou:
Attention After Attention: Reading Text in the Wild with Cross Attention. 274-280
- Zhuoyao Zhong, Lei Sun, Qiang Huo:
A Teacher-Student Learning Based Born-Again Training Approach to Improving Scene Text Detection Accuracy. 281-286
- Animesh Prasad, Hervé Déjean, Jean-Luc Meunier:
Versatile Layout Understanding via Conjugate Graph. 287-294
- Marcin Namysl, Iuliu Konya:
Efficient, Lexicon-Free OCR using Deep Learning. 295-301
- Ryohei Tanaka, Soichiro Ono, Akio Furuhata:
Fast Distributional Smoothing for Regularization in CTC Applied to Text Recognition. 302-308
- Yi-Kang Zhang, Heng Zhang, Yong-Ge Liu, Qing Yang, Cheng-Lin Liu:
Oracle Character Recognition by Nearest Neighbor Classification with Deep Metric Learning. 309-314
- Yao Xiao, Dan Meng, Cewu Lu, Chi-Keung Tang:
Template-Instance Loss for Offline Handwritten Chinese Character Recognition. 315-322
- Asghar Ali, Mark Pickering:
Urdu-Text: A Dataset and Benchmark for Urdu Text Detection and Recognition in Natural Scenes. 323-328
- Rasmus Berg Palm, Florian Laws, Ole Winther:
Attend, Copy, Parse End-to-end Information Extraction from Documents. 329-336
- Omer Arshad, Ignazio Gallo, Shah Nawaz, Alessandro Calefati:
Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition. 337-342
- Monica Haurilet, Alina Roitberg, Manuel Martínez, Rainer Stiefelhagen:
WiSe - Slide Segmentation in the Wild. 343-348
- Nicholas R. Howe, Ji-Won Chung:
Symmetric Inkball Alignment with Loopy Models. 349-354
- Gideon Maillette de Buy Wenniger, Lambert Schomaker, Andy Way:
No Padding Please: Efficient Neural Handwriting Recognition. 355-362
- Zhaohui Jiang, Zheng Huang, Yunrui Lian, Jie Guo, Weidong Qiu:
Integrating Coordinates with Context for Information Extraction in Document Images. 363-368
- Olfa Mechi, Maroua Mehri, Rolf Ingold, Najoua Essoukri Ben Amara:
Text Line Segmentation in Historical Document Images Using an Adaptive U-Net Architecture. 369-374
- Chen Du, Chunheng Wang, Yanna Wang, Zipeng Feng, Jiyuan Zhang:
TextEdge: Multi-oriented Scene Text Detection via Region Segmentation and Edge Classification. 375-380
- Zelun Wang, Donald J. Beyette, Jason Lin, Jyh-Charn Liu:
Extraction of Math Expressions from PDF Documents Based on Unsupervised Modeling of Fonts. 381-386
- Xing Wang, Zelun Wang, Jyh-Charn Liu:
Bigram Label Regularization to Reduce Over-Segmentation on Inline Math Expression Detection. 387-392
- Quang Anh Bui, David Mollard, Salvatore Tabbone:
Automatic Synthetic Document Image Generation using Generative Adversarial Networks: Application in Mobile-Captured Document Analysis. 393-400
- Ryo Nakao, Brian Kenji Iwana, Seiichi Uchida:
Selective Super-Resolution for Scene Text Images. 401-406
- Taichi Sumi, Brian Kenji Iwana, Hideaki Hayashi, Seiichi Uchida:
Modality Conversion of Handwritten Patterns by Cross Variational Autoencoders. 407-412
- Yaoxiong Huang, Zecheng Xie, Lianwen Jin, Yuanzhi Zhu, Shuaitao Zhang:
Adversarial Feature Enhancing Network for End-to-End Handwritten Paragraph Recognition. 413-419
- Kha Cong Nguyen, Cuong Tuan Nguyen, Seiji Hotta, Masaki Nakagawa:
A Character Attention Generative Adversarial Network for Degraded Historical Document Restoration. 420-425
- Vijay Rowtula, Subba Reddy Oota, C. V. Jawahar:
Towards Automated Evaluation of Handwritten Assessments. 426-433
- Himanshu Sharad Bhatt, Shourya Roy, Lokesh Bhatnagar, Chetan Lohani, Vinit Jain:
Digital Auditor: A Framework for Matching Duplicate Invoices. 434-441
- Hao Song, Hongzhen Wang, Shan Huang, Pei Xu, Shen Huang, Qi Ju:
Text Siamese Network for Video Textual Keyframe Detection. 442-447
- Kohei Baba, Seiichi Uchida, Brian Kenji Iwana:
On the Ability of a CNN to Realize Image-to-Image Language Conversion. 448-453
- Laiphangbam Melinda, Chakravarthy Bhagvati:
Parameter-Free Table Detection Method. 454-460
- Hervé Déjean, Jean-Luc Meunier:
Table Rows Segmentation. 461-466
- Vincent Poulain D'Andecy, Aurélie Joseph, Joaquín Cuenca, Jean-Marc Ogier:
Discourse Descriptor for Document Incremental Classification Comparison with Deep Learning. 467-472
- Kwon-Young Choi, Bertrand Coüasnon, Yann Ricquebourg, Richard Zanibbi:
CNN-Based Accidental Detection in Dense Printed Piano Scores. 473-480
- Eloi Alonso, Bastien Moysset, Ronaldo O. Messina:
Adversarial Generation of Handwritten Text Images Conditioned on Sequences. 481-486
- Ciprian Tomoiaga, Paul Feng, Mathieu Salzmann, Patrick Jayet:
Field Typing for Improved Recognition on Heterogeneous Handwritten Forms. 487-493
- Mohammad Reza Sarshogh, Keegan E. Hines:
A Multi-task Network for Localization and Recognition of Text in Images. 494-501
- Thibault Lupinski, Abdel Belaïd, Afef Kacem Echi:
On the Use of Attention Mechanism in a Seq2Seq Based Approach for Off-Line Handwritten Digit String Recognition. 502-507
- Xi Liu, Rui Zhang, Yongsheng Zhou, Dong Wang:
Scene Text Detection with Feature Pyramid Network and Linking Segments. 508-513
- Xiaohui Li, Fei Yin, Tao Xue, Long Liu, Jean-Marc Ogier, Cheng-Lin Liu:
Instance Aware Document Image Segmentation using Label Pyramid Networks and Deep Watershed Transformation. 514-519
- Miaotong Jiang, Jie-Bo Hou, Chun Yang, Xiaobin Zhu, Xu-Cheng Yin:
Detecting Text in News Images with Similarity Embedded Proposals. 520-525
- Chuang Li, Xing Zhang, Feng Lin, Zhiyong Wang, Jun'E Liu, Rui Zhang, Haiqiang Wang:
A Stroke-Based RNN for Writer-Independent Online Signature Verification. 526-532
- Fabian Hollaus, Simon Brenner, Robert Sablatnig:
CNN Based Binarization of MultiSpectral Document Images. 533-538
- Ruochen Wang, Xiaojie Xia, Chunyan Zhang, Xiaoyi Yu, Jun Sun, Satoshi Naoi:
Text Line Adjustment Based on Neural Network. 539-544
- Yahia Hamdi, Houcine Boubaker, Thameur Dhieb, Abdelkarim Elbaati, Adel M. Alimi:
Hybrid DBLSTM-SVM Based Beta-Elliptic-CNN Models for Online Arabic Characters Recognition. 545-550
- Hongyu Li, Fan Zhu, Junhua Qiu:
Towards Document Image Quality Assessment: A Text Line Based Framework and a Synthetic Text Line Image Dataset. 551-558
- Xugong Qin, Yu Zhou, Dongbao Yang, Weiping Wang:
Curved Text Detection in Natural Scene Images with Semi- and Weakly-Supervised Learning. 559-564
- Jirí Martínek, Ladislav Lenc, Pavel Král, Anguelos Nicolaou, Vincent Christlein:
Hybrid Training Data for Historical Text OCR. 565-570
- Xiaoxue Liu, Ting Zhang, Xinguo Yu:
An End-to-End Trainable System for Offline Handwritten Chemical Formulae Recognition. 577-582
- Fabian Wolf, Philipp Oberdiek, Gernot A. Fink:
Exploring Confidence Measures for Word Spotting in Heterogeneous Datasets. 583-588
- Xiang Ao, Xu-Yao Zhang, Hong-Ming Yang, Fei Yin, Cheng-Lin Liu:
Cross-Modal Prototype Learning for Zero-Shot Handwriting Recognition. 589-594
- Qingquan Xu, Xiang Bai, Wenyu Liu:
Multiple Comparative Attention Network for Offline Handwritten Chinese Character Recognition. 595-600
Oral Session 5: Text Detection and Recognition
- Hongyuan Yu, Chengquan Zhang, Xuan Li, Junyu Han, Errui Ding, Liang Wang:
An End-to-End Video Text Detector with Online Tracking. 601-606
- Tarin Clanuwat, Alex Lamb, Asanobu Kitamoto:
KuroNet: Pre-Modern Japanese Kuzushiji Character Recognition with Deep Learning. 607-614
- Jian Wei, Kai Chen, Jianhua He, Zheng Huang, Yunrui Lian, Yi Zhou:
A New Approach for Integrated Recognition and Correction of Texts from Images. 615-620
- Ayumu Nagai:
On the Improvement of Recognizing Single-Line Strings of Japanese Historical Cursive. 621-628
- Nam Tuan Ly, Cuong Tuan Nguyen, Masaki Nakagawa:
An Attention-Based End-to-End Model for Multiple Text Lines Recognition in Japanese Historical Documents. 629-634
Oral Session 6: Mathematical Expression and Text Recognition
- Zelin Hong, Ning You, Jun Tan, Ning Bi:
Residual BiRNN Based Seq2Seq Model with Transition Probability Matrix for Online Handwritten Mathematical Expression Recognition. 635-640
- Arnaud Lods, Éric Anquetil, Sébastien Macé:
Fuzzy Visibility Graph for Structural Analysis of Online Handwritten Mathematical Expressions. 641-646
- Mahshad Mahdavi, Michael Condon, Kenny Davila, Richard Zanibbi:
LPGA: Line-of-Sight Parsing with Graph-Based Attention for Math Formula Recognition. 647-654
- Deepayan Das, Jerin Philip, Minesh Mathew, C. V. Jawahar:
A Cost Efficient Approach to Correct OCR Errors in Large Document Collections. 655-662
- Ashish Arora, Paola García, Shinji Watanabe, Vimal Manohar, Yiwen Shao, Sanjeev Khudanpur, Chun-Chieh Chang, Babak Rekabdar, Bagher BabaAli, Daniel Povey, David Etter, Desh Raj, Hossein Hadian, Jan Trmal:
Using ASR Methods for OCR. 663-668
Poster Session 2
- Pei Xu, Shan Huang, Hongzhen Wang, Hao Song, Shen Huang, Qi Ju:
A Multi-oriented Chinese Keyword Spotter Guided by Text Line Detection. 669-674
- Seokjun Kang, Brian Kenji Iwana, Seiichi Uchida:
Cascading Modular U-Nets for Document Image Binarization. 675-680
- Shuangping Huang, Haobin Wang, Yongge Liu, Xiaosong Shi, Lianwen Jin:
OBC306: A Large-Scale Oracle Bone Character Recognition Dataset. 681-688
- Hao Kong, Dongqi Tang, Xi Meng, Tong Lu:
GARN: A Novel Generative Adversarial Recognition Network for End-to-End Scene Character Recognition. 689-694
- Yao Xiao, Minglong Xue, Tong Lu, Yirui Wu, Shivakumara Palaiahnakote:
A Text-Context-Aware CNN Network for Multi-oriented and Multi-language Scene Text Detection. 695-700
- Najoua Rahal, Maroua Tounsi, Tarek M. Hamdani, Adel M. Alimi:
Handwritten Words and Digits Recognition using Deep Learning Based Bag of Features Framework. 701-706
- Chixiang Ma, Zhuoyao Zhong, Lei Sun, Qiang Huo:
A Relation Network Based Approach to Curved Text Detection. 707-713
- Xiaojie Xia, Xiaoyi Yu, Wei Liu, Chunyan Zhang, Jun Sun, Satoshi Naoi:
An Efficient off-Line Handwritten Japanese Address Recognition System. 714-719
- Linda Studer, Michele Alberti, Vinaychandran Pondenkandath, Pinar Goktepe, Thomas Kolonko, Andreas Fischer, Marcus Liwicki, Rolf Ingold:
A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis. 720-725
- Hussein Adnan Mohammed, Isabelle Marthot-Santaniello, Volker Märgner:
GRK-Papyri: A Dataset of Greek Handwriting on Papyri for the Task of Writer Identification. 726-731
- Berat Kurar Barakat, Jihad El-Sana, Irina Rabaev:
The Pinkas Dataset. 732-737
- Reem Alaasam, Berat Kurar, Jihad El-Sana:
Layout Analysis on Challenging Historical Arabic Manuscripts using Siamese Network. 738-742
- Vivek Venugopal, Suresh Sundaram:
Online Writer Identification using GMM Based Feature Representation and Writer-Specific Weights. 743-748
- Wenyuan Xue, Qingyong Li, Dacheng Tao:
ReS2TIM: Reconstruct Syntactic Structures from Table Images. 749-755
- Emanuela Boros, Alexis Toumi, Erwan Rouchet, Bastien Abadie, Dominique Stutzmann, Christopher Kermorvant:
Automatic Page Classification in a Large Collection of Manuscripts Based on the International Image Interoperability Framework. 756-762
- Yibo Li, Liangcai Gao, Zhi Tang, Qinqin Yan, Yilun Huang:
A GAN-Based Feature Generator for Table Detection. 763-768
- Liu Yang, Yonghong Song, Yuanlin Zhang:
Enhanced EAST: Improving Network's Feature Extraction Ability and Text Complete Shape Perception. 769-774
- Jiyuan Zhang, Chen Du, Zipeng Feng, Yanna Wang, Chunheng Wang:
A Text Localization Method Based on Weak Supervision. 775-780
- Fenfen Sheng, Zhineng Chen, Bo Xu:
NRTR: A No-Recurrence Sequence-to-Sequence Model for Scene Text Recognition. 781-786