| 2024 | A Heuristic Algorithm for Mathematical Markup Encoding Based on the Relative Positions of Characters. Chun-Min Lin, Jason Lin, Shin-Hung Lin, Jo-Kai Liao |
| 2024 | An Efficient PDF Malware Detection Method Using Highly Compact Features. Ran Liu, Cynthia Matuszek, Charles Nicholas |
| 2024 | Assessing Abstractive and Extractive Methods for Automatic News Summarization. Hilário Oliveira, Rafael Dueire Lins |
| 2024 | Assessing the Reliability and Validity of the Measures for Automatic Text Summarization. Rafael Dueire Lins, Hilário Oliveira, Steven J. Simske |
| 2024 | Automatically producing accessible and reusable PDFs with LATEX. Frank Mittelbach, Ulrike Fischer, David Carlisle, Joseph Wright |
| 2024 | CatalogBank: A Structured and Interoperable Catalog Dataset with a Semi-Automatic Annotation Tool (DocumentLabeler) for Engineering System Design. Hasan Sinan Bank, Daniel R. Herber |
| 2024 | Competition on Binarizing Photographed Document Images 2024 Quality, Time and Space Report. Rafael Dueire Lins, Gustavo P. Chaves, Gabriel de F. P. e Silva, Thaylor Vieira, Ricardo da Silva Barboza, Steven J. Simske |
| 2024 | Detecting AI-Generated Texts in Cross-Domains. You Zhou, Jie Wang |
| 2024 | Graph Detective: A User Interface for Intuitive Graph Exploration Through Visualized Queries. Dominik Opitz, Andreas Hamm, Roxanne El Baff, Jasper W. Korte, Tobias Hecking |
| 2024 | Handheld Video Document Scanning: A Robust On-Device Model for Multi-Page Document Scanning. Curtis Wigington |
| 2024 | LexBoost: Improving Lexical Document Retrieval with Nearest Neighbors. Hrishikesh Kulkarni, Nazli Goharian, Ophir Frieder, Sean MacAvaney |
| 2024 | Post-OCR Correction with OpenAI's GPT Models on Challenging English Prosody Texts. James Zhang, Wouter Haverals, Mary Naydan, Brian W. Kernighan |
| 2024 | Proceedings of the ACM Symposium on Document Engineering 2024, DocEng 2024, San Jose, CA, USA, August 20-23, 2024 |
| 2024 | Similarity Problems in Paragraph Justification: An Extension to the Knuth-Plass Algorithm. Didier Verna |
| 2024 | Texture-based Document Binarization. Rodrigo Barros Bernardino, Rafael Dueire Lins, Ricardo da Silva Barboza |
| 2024 | TopicTag: Automatic Annotation of NMF Topic Models Using Chain of Thought and Prompt Tuning with LLMs. Selma Wanna, Nicholas Solovyev, Ryan Barron, Maksim Ekin Eren, Manish Bhattarai, Kim Ø. Rasmussen, Boian S. Alexandrov |
| 2024 | Which is the most suitable scanner resolution for documents? Detailing the answer given to the question raised by Professor George Nagy. Rafael Dueire Lins, Daniela Raposo Nunes de Mello, Raimundo Correa de Oliveira |
| 2024 | ZigZag: A Robust Adaptive Approach to Non-Uniformly Illuminated Document Image Binarization. Jean-Luc Bloechle, Jean Hennebert, Christophe Gisler |