default search action
DocEng 2021: Limerick, Ireland
- Patrick Healy, Mihai Bilauca, Alexandra Bonnici:
DocEng '21: ACM Symposium on Document Engineering 2021, Limerick, Ireland, August 24-27, 2021. ACM 2021, ISBN 978-1-4503-8596-1
Tutorials
- Verislav Djukic, Juha-Pekka Tolvanen:
Domain-specific modeling in document engineering. 1:1-1:2 - Charles Nicholas, Robert J. Joyce, Steve Simske:
Document engineering issues in malware analysis. 2:1
DocEng'21 challenges
- Rafael Dueire Lins, Steven J. Simske, Rodrigo Barros Bernardino:
Binarisation of photographed documents image quality and processing time assessment. 3:1-3:6
Keynote I
- Ophir Frieder:
Searching harsh documents. 4:1
Document content analysis
- Md. Rashadul Hasan Rakib, Norbert Zeh, Evangelos E. Milios:
Efficient clustering of short text streams using online-offline clustering. 5:1-5:10 - Johannes Knittel, Steffen Koch, Thomas Ertl:
Efficient sparse spherical k-means for document clustering. 6:1-6:4 - Marcel Schaeben, Gioele Barabucci:
Small-step pipelines reduce the complexity of XSLT/XPath programs. 7:1-7:4 - Fatemeh Rahimi, Evangelos E. Milios, Stan Matwin:
MTLV: a library for building deep multi-task learning architectures. 8:1-8:4 - Johannes Knittel, Steffen Koch, Thomas Ertl:
ELSKE: efficient large-scale keyphrase extraction. 9:1-9:4
Generation, manipulation and presentation
- Rémi Calizzano, Malte Ostendorff, Georg Rehm:
Ordering sentences and paragraphs with pre-trained encoder-decoder transformers and pointer ensembles. 10:1-10:9 - Athar Sefid, Prasenjit Mitra, C. Lee Giles:
SlideGen: an abstractive section-based slide generator for scholarly documents. 11:1-11:4 - Kevin Fenton, Steven Simske:
Engineering of an artificial intelligence safety data sheet document processing system for environmental, health, and safety compliance. 12:1-12:4
Keynote II
- Justin Picard:
20 years of physical document and product protection using digital methods. 13:1
Security and sensitive documents
- Fabian Singhofer, Aygul Garifullina, Mathias Kern, Ansgar Scherp:
A novel approach on the joint de-identification of textual and relational data with a modified mondrian algorithm. 14:1-14:10 - André Tabone, Kenneth P. Camilleri, Alexandra Bonnici, Stefania Cristina, Reuben A. Farrugia, Mark Borg:
Pornographic content classification using deep-learning. 15:1-15:10 - Justin Picard, Paul Landry, Michael Bolay:
Counterfeit detection with QR codes. 16:1-16:4 - Francisco Jáñez-Martino, Rocío Alaíz-Rodríguez, Víctor González-Castro, Eduardo Fidalgo:
Trustworthiness of spam email addresses using machine learning. 17:1-17:4
Applications and user experiences
- Ajit Jain, Andruid Kerne, Nic Lupfer, Gabriel Britain, Aaron Perrine, Yoonsuck Choe, John Keyser, Ruihong Huang:
Recognizing creative visual design: multiscale design characteristics in free-form web curation documents. 18:1-18:10 - Ogundepo Odunayo, Naveela N. Sookoo, Gautam Bathla, Anthony Cavallin, Bhaleka D. Persaud, Kathy Szigeti, Philippe Van Cappellen, Jimmy Lin:
Rescuing historical climate observations to support hydrological research: a case study of solar radiation data. 19:1-19:4 - Rajkumar Ramamurthy, Maren Pielka, Robin Stenzel, Christian Bauckhage, Rafet Sifa, Tim Dilmaghani Khameneh, Ulrich Warning, Bernd Kliem, Rüdiger Loitz:
ALiBERT: improved automated list inspection (ALI) with BERT. 20:1-20:4 - Soundarya Nurani Sundareswara, Mukund Srinath, Shomir Wilson, C. Lee Giles:
A large-scale exploration of terms of service documents on the web. 21:1-21:4 - Yasith Jayawardana, Gavindya Jayawardena, Andrew T. Duchowski, Sampath Jayarathna:
Metadata-driven eye tracking for real-time applications. 22:1-22:4
Systems for visual document analysis
- Manabu Ohta, Ryoya Yamada, Teruhito Kanazawa, Atsuhiro Takasu:
Table-structure recognition method using neural networks for implicit ruled line estimation and cell estimation. 23:1-23:7 - Lucas N. Kirsten, Ricardo Piccoli, Ricardo Ribani:
Evaluating deep neural networks for image document enhancement. 24:1-24:4 - Shrey Mishra, Lucas Pluvinage, Pierre Senellart:
Towards extraction of theorems and proofs in scholarly articles. 25:1-25:4 - Daniela S. Costa, Carlos A. B. Mello, Marcelo d'Amorim:
A comparative study on methods and tools for handwritten mathematical expression recognition. 26:1-26:4 - Adi Azran, Alon Schclar, Raid Saabni:
Text line extraction using deep learning and minimal sub seams. 27:1-27:4 - Rafael Dueire Lins, Rodrigo Barros Bernardino, Ricardo da Silva Barboza, Zanoni Dueire Lins:
Direct binarization a quality-and-time efficient binarization strategy. 28:1-28:4 - Jennil Thiyam, Sanasam Ranbir Singh, Prabin Kumar Bora:
Challenges in chart image classification: a comparative study of different deep learning methods. 29:1-29:4
Collections, systems and management
- Eugene Yang, David D. Lewis, Ophir Frieder:
On minimizing cost in legal document review workflows. 30:1-30:10 - Eugene Yang, David D. Lewis, Ophir Frieder:
Heuristic stopping rules for technology-assisted review. 31:1-31:10 - Maxime Cauz, Julien Albert, Anne Wallemacq, Isabelle Linden, Bruno Dumas:
Shock wave: a graph layout algorithm for text analyzing. 32:1-32:4 - Maksim Ekin Eren, Nick Solovyev, Chris Hamer, Renee McDonald, Boian S. Alexandrov, Charles Nicholas:
COVID-19 multidimensional kaggle literature organization. 33:1-33:4
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.