DocEng B

43 papers

YearTitle / Authors
2025A Comprehensive AI-Powered Editing and Typesetting Platform for Enhancing Academic Writing.
Jie Wang
2025A Hybrid, Neuro-symbolic Approach for Scholarly Knowledge Organization.
Hassan Hussein, Allard Oelen, Sören Auer
2025A Proposal of Post-OCR Spelling Correction Using Monolingual Byte-level Language Models.
Sávio Santos de Araújo, Byron Leite Dantas Bezerra, Arthur Flor de Sousa Neto
2025An Adaptive Agentic Tool Building Architecture leveraging Expert-in-the-Loop Guidance, applied to Document Generation.
Xavier Daull, Elisabeth Murisasco, Patrice Bellot, Emmanuel Bruno, Vincent Martin
2025Binarizing Photographed Document Images 2025 Quality, Time and Space Assessment.
Gustavo P. Chaves, Thaylor Vieira, Gabriel de F. P. e Silva, Rafael Dueire Lins, Steven J. Simske
2025BioReadNet: A Transformer-Driven Hybrid Model for Target Audience-Aware Biomedical Text Readability Assessment.
Anya Amel Nait Djoudi, Patrice Bellot, Adrian-Gabriel Chifu
2025Celebrating 25 Years of Document Engineering.
Ethan V. Munson
2025Designing Visual Tools for Writing Process Analysis.
Cerstin Mahlow
2025Detecting and Documenting Plagiarism and GenAI Use.
Debora Weber-Wulff
2025Document Classification using File Names.
Zhijian Li, Stefan Larson, Kevin Leach
2025Document Encryption in Practice: A Comparative Framework and Evaluation.
Isaac Henry Teuscher, Benjamin L. Schooley
2025Exploiting Query Reformulation and Reciprocal Rank Fusion in Math-Aware Search Engines.
Besat Kassaie, Andrew Kane, Frank Wm. Tompa
2025Hierarchical Clustering of the SOREL Malware Corpus.
Raguvir S, Charles Nicholas
2025Improving Lightweight Named Entity Recognition in Handwritten Documents by Predicting Pyramidal Histograms of Characters.
David Villanova-Aparisi, Carlos D. Martínez-Hinarejos, Verónica Romero, Moisés Pastor-i-Gadea
2025Issues in Document Security.
Charles Nicholas
2025LLM-assisted Automatic Feature Extraction for Document Understanding and Analytics.
Sirisha Velampalli
2025Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval.
Alexander Most, Joseph Winjum, Manish Bhattarai, Shawn M. Jones, Nishath Rajiv Ranasinghe, Ayan Biswas, Dan O'Malley
2025MathML and other XML Technologies for Accessible PDF from LATEX.
Frank Mittelbach, Ulrike Fischer, David Carlisle, Joseph Wright
2025Measuring temporal gains in assisted document transcription.
Shad Mohammad, Elöd Egyed-Zsigmond, Franck Lebourgeois, Michiel Streijger, Michela Bussotti, Luis Tovar Pimentel, Vincent Paillusson
2025Mining a Century of Swiss Trademark Data.
Daniel Travaglia, Jesper Findahl, Marco D'Ambros, Andrea Mocci, Raphael Parchet
2025OPERA: An Environment Extending Coreference Annotation to Relations Between Entities.
Antoine Boiteau, Yann Mathet, Antoine Widlöcher
2025Old Greek OCR Result Correction Using LLMs.
Andreas Evaggelatos, Konstantinos Palaiologos, Basilis Gatos, Panagiotis Kaddas, Aikaterini Christopoulou, Vassilis Katsouros
2025Preserving Measurement Data Records Long-term: A Field Study on Information Management in the Wake of the 1986 Chernobyl Disaster.
Uwe M. Borghoff, Peter Rödig
2025Proceedings of the 2025 ACM Symposium on Document Engineering, DocEng 2025, Nottingham, UK, September 2-5, 2025
Steven R. Bagley, Steven J. Simske, Charlotte Curtis, Cerstin Mahlow
2025Reinforcing Document Privacy in Nigeria: A Framework for Trust in National Data Systems.
Fatima-Taslima Hassan, Richey Okoh-Michael
2025Robust Image Classifiers Fail Under Shifted Adversarial Perturbations.
Fatemeh Amerehi, Patrick Healy
2025Session details: DocEng Demonstrations.
Cerstin Mahlow
2025Session details: Document Analysis and Generation.
Didier Verna
2025Session details: Document Classification.
Besat Kassaie
2025Session details: Document Information Retrieval.
Patrick Healy
2025Session details: Document Organization and Generation.
Ethan Munson
2025Session details: Document Trust and Security.
Charlotte Curtis
2025Session details: Optical Character Recognition.
Steve Simske
2025SoAC and SoACer: A Sector-Based Corpus and LLM-Based Framework for Sectoral Website Classification.
Shahriar Shayesteh, Mukund Srinath, Lee Matheson, Lu Xian, Sinjoy Saha, C. Lee Giles, Shomir Wilson
2025Spurious Cues in RVL-CDIP and Tobacco3482 Document Classification: The Case of ID Codes.
Stefan Larson, Sharad Duwal, Brian Vilnrotter, Gayatri Chakkithara, Vedant Padwal, Kevin Leach
2025Synthetic Document Generation with Full Annotation: A Framework Utilizing Open-Weight Large Language Models.
Pablo Melendez Abarca, Clemens Havas
2025Text Image Super-Resolution for Improved OCR in Real-Life Scenarios using Swin Transformers.
Philipp Hildebrandt, Maximilian Schulze, Sarel Cohen, Vanja Doskoc, Raid Saabni, Tobias Friedrich
2025The Di2Win Document Intelligence Platform.
Afonso Ferreira, Cleber Zanchettin, Romulo Andrade, Byron Leite Dantas Bezerra
2025Topic Modeling and Link-Prediction for Material Property Discovery.
Ryan C. Barron, Maksim Ekin Eren, Valentin G. Stanev, Cynthia Matuszek, Boian S. Alexandrov
2025Towards More Homogeneous Paragraphs.
Didier Verna
2025Use Case Demonstration @ DocEng2025: Conversation-Driven Multi-LLM Framework for Web Document Sentiment Analysis.
Dominik Opitz, Andreas Hamm
2025Visual Large Language Models for Graphics Understanding: A Case Study on Floorplan Images.
Valeria Nardoni, Kimiya Noor Ali, Zahra Ziran, Simone Marinai
2025Well-Tagged PDF and Universal Accessibility with LATEX.
Frank Mittelbach, Ulrike Fischer, David Carlisle, Joseph Wright