| 2017 | A Case for Abstract Cost Models for Distributed Execution of Analytics Operators. Rundong Li, Ningfang Mi, Mirek Riedewald, Yizhou Sun, Yi Yao |
| 2017 | A Lightweight Elastic Queue Middleware for Distributed Streaming Pipeline. Weiping Qu, Stefan Dessloch |
| 2017 | A Machine Learning Trainable Model to Assess the Accuracy of Probabilistic Record Linkage. Robespierre Pita, Everton Mendonça, Sandra Reis, Marcos Barreto, Spiros C. Denaxas |
| 2017 | A Relativistic Opinion Mining Approach to Detect Factual or Opinionated News Sources. Erhan Sezerer, Selma Tekir |
| 2017 | A Reliability-Based Approach for Influence Maximization Using the Evidence Theory. Siwar Jendoubi, Arnaud Martin |
| 2017 | Accelerating K-Means by Grouping Points Automatically. Qiao Yu, Bi-Ru Dai |
| 2017 | Air Quality Monitoring System and Benchmarking. Xiufeng Liu, Per Sieverts Nielsen |
| 2017 | An Efficient Approach for Instance Selection. Joel Luis Carbonera |
| 2017 | An Efficient Map-Reduce Framework to Mine Periodic Frequent Patterns. Alampally Anirudh, R. Uday Kiran, P. Krishna Reddy, Masashi Toyoda, Masaru Kitsuregawa |
| 2017 | Automatic Segmentation of Big Data of Patent Texts. Mustafa Sofean |
| 2017 | Belief Temporal Analysis of Expert Users: Case Study Stack Overflow. Dorra Attiaoui, Arnaud Martin, Boutheina Ben Yaghlane |
| 2017 | Big Data Analytics and Knowledge Discovery - 19th International Conference, DaWaK 2017, Lyon, France, August 28-31, 2017, Proceedings. Ladjel Bellatreche, Sharma Chakravarthy |
| 2017 | Detecting Feature Interactions in Agricultural Trade Data Using a Deep Neural Network. Jim O'Donoghue, Mark Roantree, Andrew McCarren |
| 2017 | Diverse Selection of Feature Subsets for Ensemble Regression. Arvind Kumar Shekar, Patricia Iglesias Sánchez, Emmanuel Müller |
| 2017 | Electric Vehicle Charging Station Deployment for Minimizing Construction Cost. Kai Li, Shuai Wang |
| 2017 | Enforcing Privacy in Cloud Databases. Somayeh Sobati Moghadam, Jérôme Darmont, Gérald Gavin |
| 2017 | Evaluation of Data Warehouse Design Methodologies in the Context of Big Data. Francesco Di Tria, Ezio Lefons, Filippo Tangorra |
| 2017 | Exploiting Mathematical Structures of Statistical Measures for Comparison of RDF Data Cubes. Claudia Diamantini, Domenico Potena, Emanuele Storti |
| 2017 | Extracting Non-redundant Correlated Purchase Behaviors by Utility Measure. Wensheng Gan, Jerry Chun-Wei Lin, Philippe Fournier-Viger, Han-Chieh Chao |
| 2017 | K-Means Clustering Using Homomorphic Encryption and an Updatable Distance Matrix: Secure Third Party Data Clustering with Limited Data Owner Interaction. Nawal Almutairi, Frans Coenen, Keith Dures |
| 2017 | Knowledge Discovery of Complex Data Using Gaussian Mixture Models. Linfei Zhou, Wei Ye, Claudia Plant, Christian Böhm |
| 2017 | Leveraging Hierarchy and Community Structure for Determining Influencers in Networks. Sharanjit Kaur, Rakhi Saxena, Vasudha Bhatnagar |
| 2017 | MDA-Based Approach for NoSQL Databases Modelling. Fatma Abdelhédi, Amal Ait Brahim, Faten Atigui, Gilles Zurfluh |
| 2017 | MapReduce-Based Complex Big Data Analytics over Uncertain and Imprecise Social Networks. Peter Braun, Alfredo Cuzzocrea, Fan Jiang, Carson Kai-Sang Leung, Adam G. M. Pazdor |
| 2017 | MiSeRe-Hadoop: A Large-Scale Robust Sequential Classification Rules Mining Framework. Elias Egho, Dominique Gay, Romain Trinquart, Marc Boullé, Nicolas Voisine, Fabrice Clérot |
| 2017 | Modeling Data Flow Execution in a Parallel Environment. Georgia Kougka, Anastasios Gounaris, Ulf Leser |
| 2017 | Optimal Task Ordering in Chain Data Flows: Exploring the Practicality of Non-scalable Solutions. Georgia Kougka, Anastasios Gounaris |
| 2017 | Optimized Mining of Potential Positive and Negative Association Rules. Parfait Bemarisika, André Totohasina |
| 2017 | Pre-processing and Indexing Techniques for Constellation Queries in Big Data. Amir Khatibi, Fábio Porto, João Guilherme Nobre Rittmeyer, Eduardo S. Ogasawara, Patrick Valduriez, Dennis E. Shasha |
| 2017 | Reweighting Forest for Extreme Multi-label Classification. Zhun-Zheng Lin, Bi-Ru Dai |
| 2017 | S2D: Shared Distributed Datasets, Storing Shared Data for Multiple and Massive Queries Optimization in a Distributed Data Warehouse. Rado Ratsimbazafy, Omar Boussaid, Fadila Bentayeb |
| 2017 | Search Result Personalization in Twitter Using Neural Word Embeddings. Sameendra Samarawickrama, Shanika Karunasekera, Aaron Harwood, Ramamohanarao Kotagiri |
| 2017 | Sentiment Analysis on Twitter to Improve Time Series Contextual Anomaly Detection for Detecting Stock Market Manipulation. Koosha Golmohammadi, Osmar R. Zaïane |
| 2017 | TARDIS: Optimal Execution of Scientific Workflows in Apache Spark. Daniel Gaspar, Fábio Porto, Reza Akbarinia, Esther Pacitti |
| 2017 | Tag Me a Label with Multi-arm: Active Learning for Telugu Sentiment Analysis. Sandeep Sricharan Mukku, Subba Reddy Oota, Radhika Mamidi |
| 2017 | Using Social Media for Word-of-Mouth Marketing. Nagendra Kumar, Yash Chandarana, Anand Konjengbam, Manish Singh |