| 2025 | A Comprehensive AI-Powered Editing and Typesetting Platform for Enhancing Academic Writing. Jie Wang |
| 2025 | A Hybrid, Neuro-symbolic Approach for Scholarly Knowledge Organization. Hassan Hussein, Allard Oelen, Sören Auer |
| 2025 | A Proposal of Post-OCR Spelling Correction Using Monolingual Byte-level Language Models. Sávio Santos de Araújo, Byron Leite Dantas Bezerra, Arthur Flor de Sousa Neto |
| 2025 | An Adaptive Agentic Tool Building Architecture leveraging Expert-in-the-Loop Guidance, applied to Document Generation. Xavier Daull, Elisabeth Murisasco, Patrice Bellot, Emmanuel Bruno, Vincent Martin |
| 2025 | Binarizing Photographed Document Images 2025 Quality, Time and Space Assessment. Gustavo P. Chaves, Thaylor Vieira, Gabriel de F. P. e Silva, Rafael Dueire Lins, Steven J. Simske |
| 2025 | BioReadNet: A Transformer-Driven Hybrid Model for Target Audience-Aware Biomedical Text Readability Assessment. Anya Amel Nait Djoudi, Patrice Bellot, Adrian-Gabriel Chifu |
| 2025 | Celebrating 25 Years of Document Engineering. Ethan V. Munson |
| 2025 | Designing Visual Tools for Writing Process Analysis. Cerstin Mahlow |
| 2025 | Detecting and Documenting Plagiarism and GenAI Use. Debora Weber-Wulff |
| 2025 | Document Classification using File Names. Zhijian Li, Stefan Larson, Kevin Leach |
| 2025 | Document Encryption in Practice: A Comparative Framework and Evaluation. Isaac Henry Teuscher, Benjamin L. Schooley |
| 2025 | Exploiting Query Reformulation and Reciprocal Rank Fusion in Math-Aware Search Engines. Besat Kassaie, Andrew Kane, Frank Wm. Tompa |
| 2025 | Hierarchical Clustering of the SOREL Malware Corpus. Raguvir S, Charles Nicholas |
| 2025 | Improving Lightweight Named Entity Recognition in Handwritten Documents by Predicting Pyramidal Histograms of Characters. David Villanova-Aparisi, Carlos D. Martínez-Hinarejos, Verónica Romero, Moisés Pastor-i-Gadea |
| 2025 | Issues in Document Security. Charles Nicholas |
| 2025 | LLM-assisted Automatic Feature Extraction for Document Understanding and Analytics. Sirisha Velampalli |
| 2025 | Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval. Alexander Most, Joseph Winjum, Manish Bhattarai, Shawn M. Jones, Nishath Rajiv Ranasinghe, Ayan Biswas, Dan O'Malley |
| 2025 | MathML and other XML Technologies for Accessible PDF from LATEX. Frank Mittelbach, Ulrike Fischer, David Carlisle, Joseph Wright |
| 2025 | Measuring temporal gains in assisted document transcription. Shad Mohammad, Elöd Egyed-Zsigmond, Franck Lebourgeois, Michiel Streijger, Michela Bussotti, Luis Tovar Pimentel, Vincent Paillusson |
| 2025 | Mining a Century of Swiss Trademark Data. Daniel Travaglia, Jesper Findahl, Marco D'Ambros, Andrea Mocci, Raphael Parchet |
| 2025 | OPERA: An Environment Extending Coreference Annotation to Relations Between Entities. Antoine Boiteau, Yann Mathet, Antoine Widlöcher |
| 2025 | Old Greek OCR Result Correction Using LLMs. Andreas Evaggelatos, Konstantinos Palaiologos, Basilis Gatos, Panagiotis Kaddas, Aikaterini Christopoulou, Vassilis Katsouros |
| 2025 | Preserving Measurement Data Records Long-term: A Field Study on Information Management in the Wake of the 1986 Chernobyl Disaster. Uwe M. Borghoff, Peter Rödig |
| 2025 | Proceedings of the 2025 ACM Symposium on Document Engineering, DocEng 2025, Nottingham, UK, September 2-5, 2025 Steven R. Bagley, Steven J. Simske, Charlotte Curtis, Cerstin Mahlow |
| 2025 | Reinforcing Document Privacy in Nigeria: A Framework for Trust in National Data Systems. Fatima-Taslima Hassan, Richey Okoh-Michael |
| 2025 | Robust Image Classifiers Fail Under Shifted Adversarial Perturbations. Fatemeh Amerehi, Patrick Healy |
| 2025 | Session details: DocEng Demonstrations. Cerstin Mahlow |
| 2025 | Session details: Document Analysis and Generation. Didier Verna |
| 2025 | Session details: Document Classification. Besat Kassaie |
| 2025 | Session details: Document Information Retrieval. Patrick Healy |
| 2025 | Session details: Document Organization and Generation. Ethan Munson |
| 2025 | Session details: Document Trust and Security. Charlotte Curtis |
| 2025 | Session details: Optical Character Recognition. Steve Simske |
| 2025 | SoAC and SoACer: A Sector-Based Corpus and LLM-Based Framework for Sectoral Website Classification. Shahriar Shayesteh, Mukund Srinath, Lee Matheson, Lu Xian, Sinjoy Saha, C. Lee Giles, Shomir Wilson |
| 2025 | Spurious Cues in RVL-CDIP and Tobacco3482 Document Classification: The Case of ID Codes. Stefan Larson, Sharad Duwal, Brian Vilnrotter, Gayatri Chakkithara, Vedant Padwal, Kevin Leach |
| 2025 | Synthetic Document Generation with Full Annotation: A Framework Utilizing Open-Weight Large Language Models. Pablo Melendez Abarca, Clemens Havas |
| 2025 | Text Image Super-Resolution for Improved OCR in Real-Life Scenarios using Swin Transformers. Philipp Hildebrandt, Maximilian Schulze, Sarel Cohen, Vanja Doskoc, Raid Saabni, Tobias Friedrich |
| 2025 | The Di2Win Document Intelligence Platform. Afonso Ferreira, Cleber Zanchettin, Romulo Andrade, Byron Leite Dantas Bezerra |
| 2025 | Topic Modeling and Link-Prediction for Material Property Discovery. Ryan C. Barron, Maksim Ekin Eren, Valentin G. Stanev, Cynthia Matuszek, Boian S. Alexandrov |
| 2025 | Towards More Homogeneous Paragraphs. Didier Verna |
| 2025 | Use Case Demonstration @ DocEng2025: Conversation-Driven Multi-LLM Framework for Web Document Sentiment Analysis. Dominik Opitz, Andreas Hamm |
| 2025 | Visual Large Language Models for Graphics Understanding: A Case Study on Floorplan Images. Valeria Nardoni, Kimiya Noor Ali, Zahra Ziran, Simone Marinai |
| 2025 | Well-Tagged PDF and Universal Accessibility with LATEX. Frank Mittelbach, Ulrike Fischer, David Carlisle, Joseph Wright |