| 2025 | A Computational Framework for Estimating Days of Maintenance Delay of Naval Ships. Gerald White, Deep Mistry, Kevin Chhoa, Senjuti Basu Roy, Lingyi Zhang, Adam Bienkowski, Krishna R. Pattipati |
| 2025 | A Deep Dive Into Cross-Dataset Entity Matching with Large and Small Language Models. Zeyu Zhang, Paul Groth, Iacer Calixto, Sebastian Schelter |
| 2025 | ASSO: the Automated Schemaless Stream Overseer. Chiara Forresi, Matteo Francia, Enrico Gallinucci, Matteo Golfarelli |
| 2025 | An Empirical Evaluation of Serverless Cloud Infrastructure for Large-Scale Data Processing. Thomas Bodner, Theo Radig, David Justen, Daniel Ritter, Tilmann Rabl |
| 2025 | An Experimental Comparison of Partitioning Strategies for Distributed Graph Neural Network Training. Nikolai Merkel, Daniel Stoll, Ruben Mayer, Hans-Arno Jacobsen |
| 2025 | An Interactive Analysis of Serverless Cloud Infrastructure. Thomas Bodner, Tilmann Rabl |
| 2025 | An RFD-based approach for concept drift detection in Machine Learning Systems. Loredana Caruccio, Stefano Cirillo, Giuseppe Polese, Roberto Stanzione |
| 2025 | Analysis of Text-to-SQL Benchmarks: Limitations, Challenges and Opportunities. Anna Mitsopoulou, Georgia Koutrika |
| 2025 | Apache Ignite + Calcite Composable Database System: Experimental Evaluation and Analysis. Mark Dodds, Khuzaima Daudjee |
| 2025 | AprèsCoT: Explaining LLM Answers with Knowledge Graphs and Chain of Thought. Moein Shirdel, Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta |
| 2025 | Automated Data Quality Validation in an End-to-End GNN Framework. Sijie Dong, Soror Sahri, Themis Palpanas, Qitong Wang |
| 2025 | Benchmarking Analytical Query Processing in Intel SGXv2. Adrian Lutsch, Muhammad El-Hindi, Matthias Heinrich, Daniel Ritter, Zsolt István, Carsten Binnig |
| 2025 | Benchmarking, Analyzing, and Optimizing WA of Partial Compaction in RocksDB. Ran Wei, Zichen Zhu, Andrew Kryczka, Jay Zhuang, Manos Athanassoulis |
| 2025 | Breaking Down the Data-metadata Barrier for Effective Property Graph Data Management. Sepehr Sadoughi, Nikolay Yakovets, George Fletcher |
| 2025 | Can Operations Research bring you to the next level? Basics and application. Vincent T'kindt, Patrick Marcel |
| 2025 | ComCrawler: General Crawling Solution for Article Comments. Zhijia Chen, Weiyi Meng, Eduard C. Dragut |
| 2025 | Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging. Michail Theologitis, Georgios Frangias, Georgios Anestis, Vasilis Samoladas, Antonios Deligiannakis |
| 2025 | CompoDB: A Demonstration of Modular Data Systems in Practice. Haralampos Gavriilidis, Lennart Behme, Christian Munz, Varun Pandey, Volker Markl |
| 2025 | Coping With Data Drift in Online Video Analytics. Ioannis Xarchakos, Nick Koudas |
| 2025 | DBCopilot: Natural Language Querying over Massive Databases via Schema Routing. Tianshu Wang, Xiaoyang Chen, Hongyu Lin, Xianpei Han, Le Sun, Hao Wang, Zhenyu Zeng |
| 2025 | Data Completion In E-commerce. Liat Antwarg Friedman, Gal Lavee, Bracha Shapira, Dorin Shmaryahu |
| 2025 | DataLens: ML-Oriented Interactive Tabular Data Quality Dashboard. Mohamed Abdelaal, Samuel Lokadjaja, Arne Kreuz, Harald Schöning |
| 2025 | DataSculpt: Cost-Efficient Label Function Design via Prompting Large Language Models. Naiqing Guan, Kaiwen Chen, Nick Koudas |
| 2025 | Database is All You Need: Serving LLMs with Relational Queries. Wenbo Sun, Ziyu Li, Vaishnav Srinidhi, Rihan Hai |
| 2025 | Dataset Discovery using Semantic Matching. Enas Khwaileh, Yannis Velegrakis |
| 2025 | Deep Skyline Community Search. Minglang Xie, Jianye Yang, Wenjie Zhang, Shiyu Yang, Xuemin Lin |
| 2025 | Dema: Efficient Decentralized Aggregation for Non-Decomposable Quantile Functions. Wang Yue, Martin Boissier, Manisha Luthra, Tilmann Rabl |
| 2025 | Differentially Private Publication of Smart Electricity Grid Data. Sina Shaham, Gabriel Ghinita, Bhaskar Krishnamachari, Cyrus Shahabi |
| 2025 | Do Research, not Data Visualization! How to Create More Consistent Plots for Experimental Research Papers in Less Time. Justus Henneberg, Felix Schuhknecht |
| 2025 | Effective and Efficient Community Search over Large-Scale Hypergraphs. Yu Liu, Qi Luo, Yanwei Zheng, Wenjie Zhang, Xuemin Lin, Dongxiao Yu |
| 2025 | Efficient Enumeration of Large Maximal k-Plexes. Qihao Cheng, Da Yan, Tianhao Wu, Lyuheng Yuan, Ji Cheng, Zhongyi Huang, Yang Zhou |
| 2025 | Efficient Multicore Discovery of Small, High-Quality k-Plex Teams in Multi-attributed Networks. Parisa Esmaeilian Ghahroudi, Sean Chester, Alex Thomo |
| 2025 | Efficiently Indexing Large Data on GPUs with Fast Interconnects. Josef Schmeißer, Clemens Lutz, Volker Markl |
| 2025 | Enabling Complex Event Processing in NebulaStream. Ariane Ziehn, Lily Seidl, Samira Akili, Steffen Zeuch, Volker Markl |
| 2025 | Ensembling Object Detectors for Effective Video Query Processing. Daren Chao, Nick Koudas, Xiaohui Yu, Yueting Chen |
| 2025 | Entity Matching using Large Language Models. Ralph Peeters, Aaron Steiner, Christian Bizer |
| 2025 | Evaluating SQL Understanding in Large Language Models. Ananya Rahaman, Anny Zheng, Mostafa Milani, Fei Chiang, Rachel Pottinger |
| 2025 | Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries. Jonathan Fürst, Catherine Kosten, Farhad Nooralahzadeh, Yi Zhang, Kurt Stockinger |
| 2025 | Evaluating the Feasibility of Sampling-Based Techniques for Training Multilayer Perceptrons. Sana Ebrahimi, Rishi Advani, Abolfazl Asudeh |
| 2025 | Evaluation of Dataframe Libraries for Data Preparation on a Single Machine. Angelo Mozzillo, Luca Zecchini, Luca Gagliardelli, Adeel Aslam, Sonia Bergamaschi, Giovanni Simonini |
| 2025 | Everything You Always Wanted to Know About JSON Schema (But Were Afraid to Ask). Mohamed-Amine Baazizi, Dario Colazzo, Giorgio Ghelli, Carlo Sartiani, Stefanie Scherzinger |
| 2025 | ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale. Otmar Ertl |
| 2025 | Explaining Fairness Violations using Machine Unlearning. Tanmay Surve, Romila Pradhan |
| 2025 | FISQL: Enhancing Text-to-SQL Systems with Rich Interactive Feedback. Rakesh Menon, Kun Qian, Liqun Chen, Ishika Joshi, Daniel Pandyan, Shashank Srivastava, Yunyao Li |
| 2025 | FairnessEval: a Framework for Evaluating Fairness of Machine Learning Models. Andrea Baraldi, Matteo Brucato, Miroslav Dudík, Francesco Guerra, Matteo Interlandi |
| 2025 | Fantastic Tables and Where to Find Them: Table Search in Semantic Data Lakes. Martin Pekár Christensen, Aristotelis Leventidis, Matteo Lissandrini, Laura Di Rocco, Renée J. Miller, Katja Hose |
| 2025 | Fast Geosocial Reachability Queries. Panagiotis Bouros, Theodoros Chondrogiannis, Daniel Kowalski |
| 2025 | Fast, Highly Available, and Recoverable Transactions on Disaggregated Data Stores. Mahesh Dananjaya, Vasilis Gavrielatos, Antonios Katsarakis, Nikos Ntarmos, Vijay Nagarajan |
| 2025 | FedForecaster: An Automated Federated Learning Approach for Time-series Forecasting. Mohamed Maher, Osama Fayez Oun, Mahmoud Saeed Mesmeh, Radwa El Shawi |
| 2025 | From Feature Selection to Resource Prediction: An Analysis of Commonly Applied Workflows and Techniques. Ling Zhang, Shaleen Deep, Joyce Cahoon, Jignesh M. Patel, Anja Gruenheid |
| 2025 | GLOVES: Global Counterfactual-based Visual Explanations. Panagiotis Gidarakos, Nikolaos Theologitis, Stavros Maroulis, Loukas Kavouras, Giorgos Giannopoulos, George Papastefanatos |
| 2025 | GPU Architectures in Graph Analytics: A Comparative Experimental Study. Peichen Xie, Zhigao Zheng, Yongluan Zhou, Yang Xiu, Hao Liu, Zhixiang Yang, Yu Zhang, Bo Du |
| 2025 | GRAIL: Graph Retrieval-Augmented In-Context Learning for Node Classification in Real-World Textual-Attributed Graphs. Chanuk Lim, Kyong-Ha Lee, Hyun Ji Jeong, Sungsu Lim |
| 2025 | Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions. Hafiz Tayyab Rauf, Alex Teodor Bogatu, Norman W. Paton, André Freitas |
| 2025 | Generating Activity Definitions with Large Language Models. Andreas Kouvaras, Periklis Mantenoglou, Alexander Artikis |
| 2025 | Generating Skyline Datasets for Data Science Models. Mengying Wang, Hanchao Ma, Yiyang Bian, Yangxin Fan, Yinghui Wu |
| 2025 | GraLMatch: Matching Groups of Entities with Graphs and Language Models. Fernando de Meer Pardo, Claude Lehmann, Dennis Gehrig, Andrea Nagy, Stefano Nicoli, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger |
| 2025 | Graph Consistency Rule Mining with LLMs: an Exploratory Study. Hoa Thi Le, Angela Bonifati, Andrea Mauri |
| 2025 | High-dimensional density-based clustering using locality-sensitive hashing. Camilla Birch Okkels, Martin Aumüller, Viktor Bello Thomsen, Arthur Zimek |
| 2025 | How Green is AutoML for Tabular Data? Felix Neutatz, Marius Lindauer, Ziawasch Abedjan |
| 2025 | Hyppo: Efficient Discovery and Execution of Data Science Pipelines in Collaborative Environments. Antonios Kontaxakis, Dimitris Sacharidis, Alkis Simitsis, Alberto Abelló, Sergi Nadal |
| 2025 | Icewafl: A Configurable Data Stream Polluter. Christoph Schinninger, Fabian Panse, Constantin Kühne, Lisa Ehrlinger |
| 2025 | LADYBUG: an LLM Agent DeBUGger for data-driven applications. Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta |
| 2025 | LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration. Tavor Lipman, Tova Milo, Amit Somech, Tomer Wolfson, Oz Zafar |
| 2025 | Learned Indexes with Distribution Smoothing via Virtual Points. Kasun Amarasinghe, Farhana Choudhury, Jianzhong Qi, James Bailey |
| 2025 | Legally-Compliant Spatial Fairness Framework: Advancing Beyond Spatial Fairness. Nripsuta Ani Saxena, Ronit Mathur, Cyrus Shahabi |
| 2025 | LogicLM: Robust Application of Large Language Models with Logic Programming for Data Analytics. Evgeny S. Skvortsov, Shayan Mirjafari, Ojaswa Garg, Yilin Xia, Shawn Bowers, Bertram Ludäscher |
| 2025 | MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems. Arnab Phani, Matthias Boehm |
| 2025 | MaTElDa: Multi-Table Error Detection. Fatemeh Ahmadi, Marc Speckmann, Malte F. Kuhlmann, Ziawasch Abedjan |
| 2025 | Metadata Unification in Open Data with Gnomon. Christina Christodoulakis, Moshe Gabel, Angela Demke Brown |
| 2025 | Model Lakes. Koyena Pal, David Bau, Renée J. Miller |
| 2025 | Modifying an existing sort order with offset-value codes. Goetz Graefe, Marius Kuhrt, Bernhard Seeger |
| 2025 | No Time to Halt: In-Situ Analysis for Large-Scale Data Processing via Virtual Snapshotting. Reza Salkhordeh, Felix Martin Schuhknecht, Hossein Asadi, Steffen Eiden, André Brinkmann |
| 2025 | OmniMatch: Overcoming the Cold-Start Problem in Cross-Domain Recommendations using Auxiliary Reviews. Yingjun Dai, Ahmed El-Roby, Elmira Adeeb, Vivek Thaker |
| 2025 | PEG: Local Differential Privacy for Edge-Labeled Graphs. André L. C. Mendonça, Felipe T. Brito, Javam C. Machado |
| 2025 | PRISMA: A Privacy-Preserving Schema Matcher using Functional Dependencies. Jan-Eric Hellenberg, Fabian Mahling, Lukas Laskowski, Felix Naumann, Matteo Paganelli, Fabian Panse |
| 2025 | PROLIT: Supporting the Transparency of Data Preparation Pipelines through Narratives over Data Provenance. Pasquale Leonardo Lazzaro, Marialaura Lazzaro, Paolo Missier, Riccardo Torlone |
| 2025 | Parallel Spatial Join Processing with Adaptive Replication. Nikolaos Koutroumanis, Christos Doulkeridis, Akrivi Vlachou |
| 2025 | Path-based Algebraic Foundations of Graph Query Languages. Renzo Angles, Angela Bonifati, Roberto García, Domagoj Vrgoc |
| 2025 | PhoebeDB: A Disk-Based RDBMS Kernel for High-Performance and Cost-Effective OLTP. Boge Liu, Chunling Wang, Xiaoshuang Chen, Yu Hao, Zhengyi Yang, Yi Jin, Yixing Yang, Wenke Yang, Wanchuan Zhang, Wenjie Zhang |
| 2025 | Private Approximate Query over Horizontal Data Federation. Ala Eddine Laouir, Abdessamad Imine |
| 2025 | Progressive Querying on Knowledge Graphs. Angela Bonifati, Stefania Dumbrava, Haridimos Kondylakis, Georgia Troullinou, Giannis Vassiliou |
| 2025 | Pythia: A Neural Model for Data Prefetching. Akshay A. Bapat, Saravanan Thirumuruganathan, Nick Koudas |
| 2025 | QuIT your B+-tree for the Quick Insertion Tree. Aneesh Raman, Konstantinos Karatsenidis, Shaolin Xie, Matthaios Olma, Subhadeep Sarkar, Manos Athanassoulis |
| 2025 | Query Rewriting-Based View Generation for Efficient Multi-Relation Multi-Query with Differential Privacy. Xinglin Du, Peng Tang, Rui Chen, Ning Wang, Chengyu Hu, Shanqing Guo |
| 2025 | QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data. Giorgos Alexiou, George Papastefanatos, Vassilis Stamatopoulos, Georgia Koutrika, Nectarios Koziris |
| 2025 | RASP: Robust Mining of Frequent Temporal Sequential Patterns under Temporal Variations. Hyunjin Choo, Minho Eom, Gyuri Kim, Young-Gyu Yoon, Kijung Shin |
| 2025 | REACT: REcourse Analysis with Counterfactuals and Explanation Tables. Anastasiia Avksientieva, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta |
| 2025 | SPO-Join: Efficient Stream Inequality Join. Adeel Aslam, Kaustubh Beedkar, Giovanni Simonini |
| 2025 | Secure and Transparent Data Sharing with TrustShare: A GDPR-Compliant Platform. Sven Rasmusen, Konstantina Pityanou, Dimitra Papatsaroucha, Sofiane Lagraa, Moussa Ouedraogo, Evangelos K. Markakis |
| 2025 | Selecting Comparative Sets of Reviews Across Multiple Items. Trung-Hoang Le, Hady W. Lauw |
| 2025 | Selective Evolving Centrality in Temporal Heterogeneous Graphs. Landy Andriamampianina, Franck Ravat, Jiefu Song, Nathalie Vallès-Parlangeau, Yanpei Wang |
| 2025 | SemaSK: Answering Semantics-aware Spatial Keyword Queries with Large Language Models. Zesong Zhang, Jianzhong Qi, Xin Cao, Christian S. Jensen |
| 2025 | Stable Tree Labelling for Accelerating Distance Queries on Dynamic Road Networks. Henning Koehler, Muhammad Farhan, Qing Wang |
| 2025 | Step-by-Step Data Cleaning Recommendations to Improve ML Prediction Accuracy. Sedir Mohammed, Felix Naumann, Hazar Harmouch |
| 2025 | Supporting Data Discovery Tasks at Scale with FREYJA. Marc Maynou, Sergi Nadal |
| 2025 | Synopses for Summarizing Spatial Data Streams. Jacco Johannes Egbert Kiezebrink, Wieger R. Punter, Odysseas Papapetrou, Kevin Verbeek |
| 2025 | Systems for Scalable Graph Analytics and Machine Learning: Trends and Methods. Da Yan, Lyuheng Yuan, Akhlaque Ahmad, Saugat Adhikari |
| 2025 | TETYS: Configurable Topic Modeling Exploration for Big Corpora of Text Documents. Francesco Invernici, Anna Bernasconi, Francesca Curati, Jelena Jakimov, Amirhossein Samavi |
| 2025 | Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting. Gyanendra Shrestha, Chutian Jiang, Sai Akula, Vivek Yannam, Anna Pyayt, Michael N. Gubanov |
| 2025 | Taming the Beast of User-Programmed Transactions on Blockchains: A Declarative Transaction Approach. Nodirbek Korchiev, Akash Pateria, Vodelina Samatova, Sogolsadat Mansouri, Kemafor Anyanwu |
| 2025 | Taste: Towards Practical Deep Learning-based Approaches for Semantic Type Detection in the Cloud. Tao Li, Feng Liang, Jinqi Quan, Huang Chuang, Teng Wang, Runhuai Huang, Jie Wu, Xiping Hu |
| 2025 | Template-based Explainable Inference over High-Stakes Financial Knowledge Graphs. Andrea Colombo, Teodoro Baldazzi, Luigi Bellomarini, Emanuel Sallinger, Stefano Ceri |
| 2025 | Time-Related Patterns Of Schema Evolution. Panos Vassiliadis, Alexandros Karakasidis |
| 2025 | Toward Standardized Data Preparation: A Bottom-Up Approach. Eugenie Y. Lai, Yuze Lou, Brit Youngmann, Michael J. Cafarella |
| 2025 | Towards Hybrid Graphs: Unifying Property Graphs and Time Series. Mouna Ammar, Christopher Rost, Riccardo Tommasini, Shubhangi Agarwal, Angela Bonifati, Petra Selmer, Evgeny Kharlamov, Erhard Rahm |
| 2025 | Towards Reliable Conversational Data Analytics. Sihem Amer-Yahia, Jasmina Bogojeska, Roberta Facchinetti, Valeria Franceschi, Aristides Gionis, Katja Hose, Georgia Koutrika, Roger D. Kouyos, Matteo Lissandrini, Silviu Maniu, Katsiaryna Mirylenka, Davide Mottin, Themis Palpanas, Mattia Rigotti, Yannis Velegrakis |
| 2025 | TransforMMer: A Universal Multi-Model Data Generator. Jáchym Bártík, Alzbeta Srutková, Irena Holubová |
| 2025 | Transforming Maritime Safety: Data-driven Applications for the Real-Time Detection and Mitigation of Maritime Incidents. Georgios Grigoropoulos, Alexandros Troupiotis-Kapeliaris, Ilias Chamatidis, Evangelia Filippou, Konstantina Bereta |
| 2025 | UniAsk: AI-powered search for banking knowledge bases. Ilaria Bordino, Francesco Di Iorio, Andrea Galliani, Alessio Rosatelli, Lorenzo Severini |
| 2025 | Unifying Large Language Models and Knowledge Graphs for Question Answering: Recent Advances and Opportunities. Chuangtao Ma, Yongrui Chen, Tianxing Wu, Arijit Khan, Haofen Wang |
| 2025 | Using A Probabilistic Database in an Image Retrieval Application. Fajrian Yunus, Pratik Karmakar, Pierre Senellart, Talel Abdessalem, Stéphane Bressan |
| 2025 | VCrypt: Leveraging Vectorized and Compressed Execution for Client-side Encryption. Charlotte Felius, Peter Boncz |
| 2025 | Virtual: Compressing Data Lake Files. Mihail Stoian, Alexander van Renen, Jan Kobiolka, Ping-Lin Kuo, Andreas Zimmerer, Josif Grabocka, Andreas Kipf |
| 2025 | Watermarking Decision Tree Ensembles. Stefano Calzavara, Lorenzo Cazzaro, Donald Gera, Salvatore Orlando |
| 2025 | Z-Shadow: An Efficient Method for Estimating Bicliques in Massive Graphs Using Füredi's Theorem. Bole Chang, Linxin Xie, Wei Li, Meng Qin, Jianfeng Hou |
| 2025 | hybridNDP: Dynamic Operation Offloading and Cooperative Query Execution in Smart Storage Settings. Christian Knödler, Naeem Ramzan, Ilia Petrov |
| 2024 | Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025. Alkis Simitsis, Bettina Kemme, Anna Queralt, Oscar Romero, Petar Jovanovic |