| 2017 | "The Leicester City Fairytale?": Utilizing New Soccer Analytics Tools to Compare Performance in the 15/16 & 16/17 EPL Seasons. Héctor Ruiz, Paul Power, Xinyu Wei, Patrick Lucey |
| 2017 | A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations. Yuxiao Dong, Hao Ma, Zhihong Shen, Kuansan Wang |
| 2017 | A Context-aware Attention Network for Interactive Question Answering. Huayu Li, Martin Renqiang Min, Yong Ge, Asim Kadav |
| 2017 | A Data Mining Framework for Valuing Large Portfolios of Variable Annuities. Guojun Gan, Jimmy Xiangji Huang |
| 2017 | A Data Science Approach to Understanding Residential Water Contamination in Flint. Alex Chojnacki, Chengyu Dai, Arya Farahi, Guangsha Shi, Jared Webb, Daniel T. Zhang, Jacob D. Abernethy, Eric M. Schwartz |
| 2017 | A Data-driven Process Recommender Framework. Sen Yang, Xin Dong, Leilei Sun, Yichen Zhou, Richard A. Farneth, Hui Xiong, Randall S. Burd, Ivan Marsic |
| 2017 | A Dirty Dozen: Twelve Common Metric Interpretation Pitfalls in Online Controlled Experiments. Pavel A. Dmitriev, Somit Gupta, Dong Woo Kim, Garnet Jason Vaz |
| 2017 | A Hierarchical Algorithm for Extreme Clustering. Ari Kobren, Nicholas Monath, Akshay Krishnamurthy, Andrew McCallum |
| 2017 | A Hybrid Framework for Text Modeling with Convolutional RNN. Chenglong Wang, Feijun Jiang, Hongxia Yang |
| 2017 | A Local Algorithm for Structure-Preserving Graph Cut. Dawei Zhou, Si Zhang, Mehmet Yigit Yildirim, Scott Alcorn, Hanghang Tong, Hasan Davulcu, Jingrui He |
| 2017 | A Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town Users. Hao Wang, Yanmei Fu, Qinyong Wang, Hongzhi Yin, Changying Du, Hui Xiong |
| 2017 | A Minimal Variance Estimator for the Cardinality of Big Data Set Intersection. Reuven Cohen, Liran Katzir, Aviv Yehezkel |
| 2017 | A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications. Amr Ahmed, James Long, Daniel Silva, Yuan Wang |
| 2017 | A Practical Exploration System for Search Advertising. Parikshit Shah, Ming Yang, Sachidanand Alle, Adwait Ratnaparkhi, Ben Shahshahani, Rohit Chandra |
| 2017 | A Quasi-experimental Estimate of the Impact of P2P Transportation Platforms on Urban Consumer Patterns. Zhe Zhang, Beibei Li |
| 2017 | A Taxi Order Dispatch Model based On Combinatorial Optimization. Lingyu Zhang, Tao Hu, Yue Min, Guobin Wu, Junying Zhang, Pengcheng Feng, Pinghua Gong, Jieping Ye |
| 2017 | A Temporally Heterogeneous Survival Framework with Application to Social Behavior Dynamics. Linyun Yu, Peng Cui, Chaoming Song, Tianyang Zhang, Shiqiang Yang |
| 2017 | AESOP: Automatic Policy Learning for Predicting and Mitigating Network Service Impairments. Supratim Deb, Zihui Ge, Sastry Isukapalli, Sarat C. Puthenpura, Shobha Venkataraman, He Yan, Jennifer Yates |
| 2017 | Accelerating Innovation Through Analogy Mining. Tom Hope, Joel Chan, Aniket Kittur, Dafna Shahaf |
| 2017 | Achieving Non-Discrimination in Data Release. Lu Zhang, Yongkai Wu, Xintao Wu |
| 2017 | Ad Serving with Multiple KPIs. Brendan Kitts, Michael Krishnan, Ishadutta Yadav, Yongbo Zeng, Garrett Badeau, Andrew Potter, Sergey Tolkachov, Ethan Thornburg, Satyanarayana Reddy Janga |
| 2017 | Addressing Challenges with Big Data for Media Measurement. Mainak Mazumdar |
| 2017 | Adversary Resistant Deep Neural Networks with an Application to Malware Detection. Qinglong Wang, Wenbo Guo, Kaixuan Zhang, Alexander G. Ororbia II, Xinyu Xing, Xue Liu, C. Lee Giles |
| 2017 | Algorithmic Decision Making and the Cost of Fairness. Sam Corbett-Davies, Emma Pierson, Avi Feller, Sharad Goel, Aziz Huq |
| 2017 | An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance. Edward Raff, Charles K. Nicholas |
| 2017 | An Efficient Bandit Algorithm for Realtime Multivariate Optimization. Daniel N. Hill, Houssam Nassif, Yi Liu, Anand Iyer, S. V. N. Vishwanathan |
| 2017 | An Intelligent Customer Care Assistant System for Large-Scale Cellular Network Diagnosis. Lujia Pan, Jianfeng Zhang, Patrick P. C. Lee, Hong Cheng, Cheng He, Caifeng He, Keli Zhang |
| 2017 | Anarchists, Unite: Practical Entropy Approximation for Distributed Streams. Moshe Gabel, Daniel Keren, Assaf Schuster |
| 2017 | AnnexML: Approximate Nearest Neighbor Search for Extreme Multi-label Classification. Yukihiro Tagami |
| 2017 | Anomaly Detection in Streams with Extreme Value Theory. Alban Siffer, Pierre-Alain Fouque, Alexandre Termier, Christine Largouët |
| 2017 | Anomaly Detection with Robust Deep Autoencoders. Chong Zhou, Randy C. Paffenroth |
| 2017 | Aspect Based Recommendations: Recommending Items with the Most Valuable Aspects Based on User Reviews. Konstantin Bauman, Bing Liu, Alexander Tuzhilin |
| 2017 | Automated Categorization of Onion Sites for Analyzing the Darkweb Ecosystem. Shalini Ghosh, Ariyam Das, Phillip A. Porras, Vinod Yegneswaran, Ashish Gehani |
| 2017 | Automatic Application Identification from Billions of Files. Kyle Soska, Christopher S. Gates, Kevin A. Roundy, Nicolas Christin |
| 2017 | Automatic Synonym Discovery with Knowledge Bases. Meng Qu, Xiang Ren, Jiawei Han |
| 2017 | BDT: Gradient Boosted Decision Tables for High Accuracy and Scoring Efficiency. Yin Lou, Mikhail Obukhov |
| 2017 | Backpage and Bitcoin: Uncovering Human Traffickers. Rebecca S. Portnoff, Danny Yuxing Huang, Periwinkle Doerfler, Sadia Afroz, Damon McCoy |
| 2017 | Behavior Informatics to Discover Behavior Insight for Active and Tailored Client Management. Longbing Cao |
| 2017 | Benchmarks and Process Management in Data Science: Will We Ever Get Over the Mess? Usama M. Fayyad, Arno Candel, Eduardo Ariño de la Rubia, Szilárd Pafka, Anthony Chong, Jeong-Yoon Lee |
| 2017 | Big Data in Climate: Opportunities and Challenges for Machine Learning. Anuj Karpatne, Vipin Kumar |
| 2017 | Bolt: Accelerated Data Mining with Fast Vector Compression. Davis W. Blalock, John V. Guttag |
| 2017 | Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation. Carl Yang, Lanxiao Bai, Chao Zhang, Quan Yuan, Jiawei Han |
| 2017 | Cascade Ranking for Operational E-commerce Search. Shichen Liu, Fei Xiao, Wenwu Ou, Luo Si |
| 2017 | Clustering Individual Transactional Data for Masses of Users. Riccardo Guidotti, Anna Monreale, Mirco Nanni, Fosca Giannotti, Dino Pedreschi |
| 2017 | Collaborative Variational Autoencoder for Recommender Systems. Xiaopeng Li, James She |
| 2017 | Collaboratively Improving Topic Discovery and Word Embeddings by Coordinating Global and Local Contexts. Guangxu Xun, Yaliang Li, Jing Gao, Aidong Zhang |
| 2017 | Collecting and Analyzing Millions of mHealth Data Streams. Tom Quisel, Luca Foschini, Alessio Signorini, David C. Kale |
| 2017 | Communication-Efficient Distributed Block Minimization for Nonlinear Kernel Machines. Cho-Jui Hsieh, Si Si, Inderjit S. Dhillon |
| 2017 | Compass: Spatio Temporal Sentiment Analysis of US Election What Twitter Says! Debjyoti Paul, Feifei Li, Murali Krishna Teja, Xin Yu, Richie Frost |
| 2017 | Construction of Directed 2K Graphs. Bálint Tillman, Athina Markopoulou, Carter T. Butts, Minas Gjoka |
| 2017 | Constructivism Learning: A Learning Paradigm for Transparent Predictive Analytics. Xiaoli Li, Jun Huan |
| 2017 | Contextual Motifs: Increasing the Utility of Motifs using Contextual Data. Ian Fox, Lynn Ang, Mamta Jaiswal, Rodica Pop-Busui, Jenna Wiens |
| 2017 | Contextual Spatial Outlier Detection with Metric Learning. Guanjie Zheng, Susan L. Brantley, Thomas Lauvaux, Zhenhui Li |
| 2017 | Convex Factorization Machine for Toxicogenomics Prediction. Makoto Yamada, Wenzhao Lian, Amit Goyal, Jianhui Chen, Kishan Wimalawarne, Suleiman A. Khan, Samuel Kaski, Hiroshi Mamitsuka, Yi Chang |
| 2017 | Coresets for Kernel Regression. Yan Zheng, Jeff M. Phillips |
| 2017 | Customer Lifetime Value Prediction Using Embeddings. Benjamin Paul Chamberlain, Ângelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth |
| 2017 | Decomposed Normalized Maximum Likelihood Codelength Criterion for Selecting Hierarchical Latent Variable Models. Tianyi Wu, Shinya Sugawara, Kenji Yamanishi |
| 2017 | Deep Choice Model Using Pointer Networks for Airline Itinerary Prediction. Alejandro Mottini, Rodrigo Acuna-Agost |
| 2017 | Deep Design: Product Aesthetics for Heterogeneous Markets. Yanxin Pan, Alexander Burnap, Jeffrey Hartley, Richard Gonzalez, Panos Y. Papalambros |
| 2017 | Deep Embedding Forest: Forest-based Serving with Deep Embedding Features. Jie Zhu, Ying Shan, J. C. Mao, Dong Yu, Holakou Rahmanian, Yi Zhang |
| 2017 | DeepMood: Modeling Mobile Phone Typing Dynamics for Mood Detection. Bokai Cao, Lei Zheng, Chenwei Zhang, Philip S. Yu, Andrea Piscitello, John Zulueta, Olu Ajilore, Kelly Ryan, Alex D. Leow |
| 2017 | DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks. Zi Yin, Keng-hao Chang, Ruofei Zhang |
| 2017 | DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution. Thomas Vandal, Evan Kodra, Sangram Ganguly, Andrew R. Michaelis, Ramakrishna R. Nemani, Auroop R. Ganguly |
| 2017 | DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams. Kijung Shin, Bryan Hooi, Jisu Kim, Christos Faloutsos |
| 2017 | Designing AI at Scale to Power Everyday Life. Rajesh Parekh |
| 2017 | Detecting Network Effects: Randomizing Over Randomized Experiments. Martin Saveski, Jean Pouget-Abadie, Guillaume Saint-Jacques, Weitao Duan, Souvik Ghosh, Ya Xu, Edoardo M. Airoldi |
| 2017 | Developing a Comprehensive Framework for Multimodal Feature Extraction. Quinten McNamara, Alejandro de la Vega, Tal Yarkoni |
| 2017 | Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks. Fenglong Ma, Radha Chitta, Jing Zhou, Quanzeng You, Tong Sun, Jing Gao |
| 2017 | Discovering Enterprise Concepts Using Spreadsheet Tables. Keqian Li, Yeye He, Kris Ganjam |
| 2017 | Discovering Pollution Sources and Propagation Patterns in Urban Area. Xiucheng Li, Yun Cheng, Gao Cong, Lisi Chen |
| 2017 | Discovering Reliable Approximate Functional Dependencies. Panagiotis Mandros, Mario Boley, Jilles Vreeken |
| 2017 | Discrete Content-aware Matrix Factorization. Defu Lian, Rui Liu, Yong Ge, Kai Zheng, Xing Xie, Longbing Cao |
| 2017 | Dispatch with Confidence: Integration of Machine Learning, Optimization and Simulation for Open Pit Mines. Kosta Ristovski, Chetan Gupta, Kunihiko Harada, Hsiu-Khuern Tang |
| 2017 | Distributed Local Outlier Detection in Big Data. Yizhou Yan, Lei Cao, Caitlin Kuhlman, Elke A. Rundensteiner |
| 2017 | Distributed Multi-Task Relationship Learning. Sulin Liu, Sinno Jialin Pan, Qirong Ho |
| 2017 | Dynamic Attention Deep Model for Article Recommendation by Learning Human Editors' Demonstration. Xuejian Wang, Lantao Yu, Kan Ren, Guanyu Tao, Weinan Zhang, Yong Yu, Jun Wang |
| 2017 | Effective Evaluation Using Logged Bandit Feedback from Multiple Loggers. Aman Agarwal, Soumya Basu, Tobias Schnabel, Thorsten Joachims |
| 2017 | Effective and Real-time In-App Activity Analysis in Encrypted Internet Traffic Streams. Junming Liu, Yanjie Fu, Jingci Ming, Yong Ren, Leilei Sun, Hui Xiong |
| 2017 | Efficient Correlated Topic Modeling with Topic Embedding. Junxian He, Zhiting Hu, Taylor Berg-Kirkpatrick, Ying Huang, Eric P. Xing |
| 2017 | Ego-Splitting Framework: from Non-Overlapping to Overlapping Clusters. Alessandro Epasto, Silvio Lattanzi, Renato Paes Leme |
| 2017 | EmbedJoin: Efficient Edit Similarity Joins via Embeddings. Haoyu Zhang, Qin Zhang |
| 2017 | Embedding-based News Recommendation for Millions of Users. Shumpei Okura, Yukihiro Tagami, Shingo Ono, Akira Tajima |
| 2017 | End-to-end Learning for Short Text Expansion. Jian Tang, Yue Wang, Kai Zheng, Qiaozhu Mei |
| 2017 | Estimating Treatment Effect in the Wild via Differentiated Confounder Balancing. Kun Kuang, Peng Cui, Bo Li, Meng Jiang, Shiqiang Yang |
| 2017 | Estimation of Recent Ancestral Origins of Individuals on a Large Scale. Ross E. Curtis, Ahna Reza Girshick |
| 2017 | Evaluating U.S. Electoral Representation with a Joint Statistical Model of Congressional Roll-Calls, Legislative Text, and Voter Registration Data. Zhengming Xing, Sunshine Hillygus, Lawrence Carin |
| 2017 | Extremely Fast Decision Tree Mining for Evolving Data Streams. Albert Bifet, Jiajin Zhang, Wei Fan, Cheng He, Jianfeng Zhang, Jianfeng Qian, Geoff Holmes, Bernhard Pfahringer |
| 2017 | FIRST: Fast Interactive Attributed Subgraph Matching. Boxin Du, Si Zhang, Nan Cao, Hanghang Tong |
| 2017 | FLAP: An End-to-End Event Log Analysis Platform for System Management. Tao Li, Yexi Jiang, Chunqiu Zeng, Bin Xia, Zheng Liu, Wubai Zhou, Xiaolong Zhu, Wentao Wang, Liang Zhang, Jun Wu, Li Xue, Dewei Bao |
| 2017 | FORA: Simple and Effective Approximate Single-Source Personalized PageRank. Sibo Wang, Renchi Yang, Xiaokui Xiao, Zhewei Wei, Yin Yang |
| 2017 | Fast Enumeration of Large k-Plexes. Alessio Conte, Donatella Firmani, Caterina Mordente, Maurizio Patrignani, Riccardo Torlone |
| 2017 | Fast Newton Hard Thresholding Pursuit for Sparsity Constrained Nonconvex Optimization. Jinghui Chen, Quanquan Gu |
| 2017 | Federated Tensor Factorization for Computational Phenotyping. Yejin Kim, Jimeng Sun, Hwanjo Yu, Xiaoqian Jiang |
| 2017 | Finding Precursors to Anomalous Drop in Airspeed During a Flight's Takeoff. Vijay Manikandan Janakiraman, Bryan L. Matthews, Nikunj C. Oza |
| 2017 | Foreword to the Applied Data Science: Invited Talks Track at KDD-2017. Usama M. Fayyad, Evangelos Simoudis, Ashok Srivastava |
| 2017 | Formative Essay Feedback Using Predictive Scoring Models. Bronwyn Woods, David Adamson, Shayne Miel, Elijah Mayfield |
| 2017 | Functional Annotation of Human Protein Coding Isoforms via Non-convex Multi-Instance Learning. Tingjin Luo, Weizhong Zhang, Shang Qiu, Yang Yang, Dongyun Yi, Guangtao Wang, Jieping Ye, Jie Wang |
| 2017 | Functional Zone Based Hierarchical Demand Prediction For Bike System Expansion. Junming Liu, Leilei Sun, Qiao Li, Jingci Ming, Yanchi Liu, Hui Xiong |
| 2017 | GELL: Automatic Extraction of Epidemiological Line Lists from Open Sources. Saurav Ghosh, Prithwish Chakraborty, Bryan L. Lewis, Maimuna S. Majumder, Emily Cohn, John S. Brownstein, Madhav V. Marathe, Naren Ramakrishnan |
| 2017 | GRAM: Graph-based Attention Model for Healthcare Representation Learning. Edward Choi, Mohammad Taha Bahadori, Le Song, Walter F. Stewart, Jimeng Sun |
| 2017 | Google Vizier: A Service for Black-Box Optimization. Daniel Golovin, Benjamin Solnik, Subhodeep Moitra, Greg Kochanski, John Karro, D. Sculley |
| 2017 | Graph Edge Partitioning via Neighborhood Heuristic. Chenzi Zhang, Fan Wei, Qin Liu, Zhihao Gavin Tang, Zhenguo Li |
| 2017 | Groups-Keeping Solution Path Algorithm for Sparse Regression with Automatic Feature Grouping. Bin Gu, Guodong Liu, Heng Huang |
| 2017 | HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network. Shifu Hou, Yanfang Ye, Yangqiu Song, Melih Abdulhayoglu |
| 2017 | HoORaYs: High-order Optimization of Rating Distance for Recommender Systems. Jingwei Xu, Yuan Yao, Hanghang Tong, XianPing Tao, Jian Lu |
| 2017 | Human Mobility Synchronization and Trip Purpose Detection with Mixture of Hawkes Processes. Pengfei Wang, Yanjie Fu, Guannan Liu, Wenqing Hu, Charu C. Aggarwal |
| 2017 | HyperLogLog Hyperextended: Sketches for Concave Sublinear Frequency Statistics. Edith Cohen |
| 2017 | Improved Degree Bounds and Full Spectrum Power Laws in Preferential Attachment Networks. Chen Avin, Zvi Lotker, Yinon Nahum, David Peleg |
| 2017 | Incremental Dual-memory LSTM in Land Cover Prediction. Xiaowei Jia, Ankush Khandelwal, Guruprasad Nayak, James Gerber, Kimberly Carlson, Paul C. West, Vipin Kumar |
| 2017 | Inductive Semi-supervised Multi-Label Learning with Co-Training. Wang Zhan, Min-Ling Zhang |
| 2017 | Industrial Machine Learning. Josh Bloom |
| 2017 | Inferring the Strength of Social Ties: A Community-Driven Approach. Polina Rozenshtein, Nikolaj Tatti, Aristides Gionis |
| 2017 | Internet Device Graphs. Matthew Malloy, Paul Barford, Enis Ceyhun Alp, Jonathan Koller, Adria Jewell |
| 2017 | Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking. Gabriele Tolomei, Fabrizio Silvestri, Andrew Haines, Mounia Lalmas |
| 2017 | Is the Whole Greater Than the Sum of Its Parts? Liangyue Li, Hanghang Tong, Yong Wang, Conglei Shi, Nan Cao, Norbou Buchler |
| 2017 | It Takes More than Math and Engineering to Hit the Bullseye with Data. Paritosh Desai |
| 2017 | KATE: K-Competitive Autoencoder for Text. Yu Chen, Mohammed J. Zaki |
| 2017 | KunPeng: Parameter Server based Distributed Learning Systems and Its Applications in Alibaba and Ant Financial. Jun Zhou, Xiaolong Li, Peilin Zhao, Chaochao Chen, Longfei Li, Xinxing Yang, Qing Cui, Jin Yu, Xu Chen, Yi Ding, Yuan (Alan) Qi |
| 2017 | LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity. Yutao Zhang, Robert Chen, Jie Tang, Walter F. Stewart, Jimeng Sun |
| 2017 | Large Scale Sentiment Learning with Limited Labels. Vasileios Iosifidis, Eirini Ntoutsi |
| 2017 | Large-scale Collaborative Ranking in Near-Linear Time. Liwei Wu, Cho-Jui Hsieh, James Sharpnack |
| 2017 | Learning Certifiably Optimal Rule Lists. Elaine Angelino, Nicholas Larus-Stone, Daniel Alabi, Margo I. Seltzer, Cynthia Rudin |
| 2017 | Learning Temporal State of Diabetes Patients via Combining Behavioral and Demographic Data. Houping Xiao, Jing Gao, Long H. Vu, Deepak S. Turaga |
| 2017 | Learning Tree-Structured Detection Cascades for Heterogeneous Networks of Embedded Devices. Hamid Dadkhahi, Benjamin M. Marlin |
| 2017 | Learning from Labeled and Unlabeled Vertices in Networks. Wei Ye, Linfei Zhou, Dominik Mautz, Claudia Plant, Christian Böhm |
| 2017 | Learning from Multiple Teacher Networks. Shan You, Chang Xu, Chao Xu, Dacheng Tao |
| 2017 | Learning to Count Mosquitoes for the Sterile Insect Technique. Yaniv Ovadia, Yoni Halpern, Dilip Krishnan, Josh Livni, Daniel E. Newburger, Ryan Poplin, Tiantian Zha, D. Sculley |
| 2017 | Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention. Bin Tong, Martin Klinkigt, Makoto Iwayama, Toshihiko Yanase, Yoshiyuki Kobayashi, Anshuman Sahu, Ravigopal Vennelakanti |
| 2017 | Let's See Your Digits: Anomalous-State Detection using Benford's Law. Samuel Maurus, Claudia Plant |
| 2017 | LiJAR: A System for Job Application Redistribution towards Efficient Career Marketplace. Fedor Borisyuk, Liang Zhang, Krishnaram Kenthapadi |
| 2017 | Linearized GMM Kernels and Normalized Random Fourier Features. Ping Li |
| 2017 | Local Algorithm for User Action Prediction Towards Display Ads. Hongxia Yang, Yada Zhu, Jingrui He |
| 2017 | Local Higher-Order Graph Clustering. Hao Yin, Austin R. Benson, Jure Leskovec, David F. Gleich |
| 2017 | Long Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity. Chengxi Zang, Peng Cui, Christos Faloutsos, Wenwu Zhu |
| 2017 | Luck is Hard to Beat: The Difficulty of Sports Prediction. Raquel Y. S. Aoki, Renato Martins Assunção, Pedro O. S. Vaz de Melo |
| 2017 | MARAS: Signaling Multi-Drug Adverse Reactions. Xiao Qin, Tabassum Kakar, Susmitha Wunnava, Elke A. Rundensteiner, Lei Cao |
| 2017 | MOLIERE: Automatic Biomedical Hypothesis Generation System. Justin Sybrandt, Michael Shtutman, Ilya Safro |
| 2017 | Machine Learning Software in Practice: Quo Vadis? Szilárd Pafka |
| 2017 | Machine Learning for Encrypted Malware Traffic Classification: Accounting for Noisy Labels and Non-Stationarity. Blake Anderson, David A. McGrew |
| 2017 | Matching Restaurant Menus to Crowdsourced Food Data: A Scalable Machine Learning Approach. Hesam Salehian, Patrick D. Howell, Chul Lee |
| 2017 | Matrix Profile V: A Generic Technique to Incorporate Domain Knowledge into Motif Discovery. Hoang Anh Dau, Eamonn J. Keogh |
| 2017 | Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks. Huan Zhao, Quanming Yao, Jianda Li, Yangqiu Song, Dik Lun Lee |
| 2017 | MetaPAD: Meta Pattern Discovery from Massive Text Corpora. Meng Jiang, Jingbo Shang, Taylor Cassidy, Xiang Ren, Lance M. Kaplan, Timothy P. Hanratty, Jiawei Han |
| 2017 | Mining Big Data in NeuroGenetics to Understand Muscular Dystrophy. Andy Berglund |
| 2017 | Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting. Guo-Jun Qi, Jiliang Tang, Jingdong Wang, Jiebo Luo |
| 2017 | More than the Sum of its Parts: Building Domino Data Lab. Eduardo Ariño de la Rubia |
| 2017 | Multi-Aspect Streaming Tensor Completion. Qingquan Song, Xiao Huang, Hancheng Ge, James Caverlee, Xia Hu |
| 2017 | Multi-Modality Disease Modeling via Collective Deep Matrix Factorization. Qi Wang, Mengying Sun, Liang Zhan, Paul Thompson, Shuiwang Ji, Jiayu Zhou |
| 2017 | Multi-task Function-on-function Regression with Co-grouping Structured Sparsity. Pei Yang, Qi Tan, Jingrui He |
| 2017 | Multi-view Learning over Retinal Thickness and Visual Sensitivity on Glaucomatous Eyes. Toshimitsu Uesaka, Kai Morino, Hiroki Sugiura, Taichi Kiwaki, Hiroshi Murata, Ryo Asaoka, Kenji Yamanishi |
| 2017 | Network Inference via the Time-Varying Graphical Lasso. David Hallac, Youngsuk Park, Stephen P. Boyd, Jure Leskovec |
| 2017 | No Longer Sleeping with a Bomb: A Duet System for Protecting Urban Safety from Dangerous Goods. Jingyuan Wang, Chao Chen, Junjie Wu, Zhang Xiong |
| 2017 | Not All Passes Are Created Equal: Objectively Measuring the Risk and Reward of Passes in Soccer from Tracking Data. Paul Power, Héctor Ruiz, Xinyu Wei, Patrick Lucey |
| 2017 | On Finding Socially Tenuous Groups for Online Social Networks. Chih-Ya Shen, Liang-Hao Huang, De-Nian Yang, Hong-Han Shuai, Wang-Chien Lee, Ming-Syan Chen |
| 2017 | On Sampling Strategies for Neural Network-based Collaborative Filtering. Ting Chen, Yizhou Sun, Yue Shi, Liangjie Hong |
| 2017 | Online Ranking with Constraints: A Primal-Dual Algorithm and Applications to Web Traffic-Shaping. Parikshit Shah, Akshay Soni, Troy Chevalier |
| 2017 | Optimization Beyond Prediction: Prescriptive Price Optimization. Shinji Ito, Ryohei Fujimaki |
| 2017 | Optimized Cost per Click in Taobao Display Advertising. Han Zhu, Junqi Jin, Chang Tan, Fei Pan, Yifan Zeng, Han Li, Kun Gai |
| 2017 | Optimized Risk Scores. Berk Ustun, Cynthia Rudin |
| 2017 | PAMAE: Parallel Hwanjun Song, Jae-Gil Lee, Wook-Shin Han |
| 2017 | PNP: Fast Path Ensemble Method for Movie Design. Danai Koutra, Abhilash Dighe, Smriti Bhagat, Udi Weinsberg, Stratis Ioannidis, Christos Faloutsos, Jean Bolot |
| 2017 | PPDsparse: A Parallel Primal-Dual Sparse Method for Extreme Classification. Ian En-Hsu Yen, Xiangru Huang, Wei Dai, Pradeep Ravikumar, Inderjit S. Dhillon, Eric P. Xing |
| 2017 | PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks. Yu Shi, Po-Wei Chan, Honglei Zhuang, Huan Gui, Jiawei Han |
| 2017 | Patient Subtyping via Time-Aware LSTM Networks. Inci M. Baytas, Cao Xiao, Xi Zhang, Fei Wang, Anil K. Jain, Jiayu Zhou |
| 2017 | Peeking at A/B Tests: Why it matters, and what to do about it. Ramesh Johari, Pete Koomen, Leonid Pekelis, David Walsh |
| 2017 | Pharmacovigilance via Baseline Regularization with Large-Scale Longitudinal Observational Data. Zhaobin Kuang, Peggy L. Peissig, Vítor Santos Costa, Richard Maclin, David Page |
| 2017 | Planning Bike Lanes based on Sharing-Bikes' Trajectories. Jie Bao, Tianfu He, Sijie Ruan, Yanhua Li, Yu Zheng |
| 2017 | Planning and Learning under Uncertainty: Theory and Practice. Jonathan P. How |
| 2017 | Point-of-Interest Demand Modeling with Human Mobility Patterns. Yanchi Liu, Chuanren Liu, Xinjiang Lu, Mingfei Teng, Hengshu Zhu, Hui Xiong |
| 2017 | Post Processing Recommender Systems for Diversity. Arda Antikacioglu, R. Ravi |
| 2017 | Predicting Clinical Outcomes Across Changing Electronic Health Record Systems. Jen J. Gong, Tristan Naumann, Peter Szolovits, John V. Guttag |
| 2017 | Predicting Optimal Facility Location without Customer Locations. Emre Yilmaz, Sanem Elbasi, Hakan Ferhatosmanoglu |
| 2017 | Privacy-Preserving Distributed Multi-Task Learning with Asynchronous Updates. Liyang Xie, Inci M. Baytas, Kaixiang Lin, Jiayu Zhou |
| 2017 | Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13 - 17, 2017 |
| 2017 | Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning. Saba Emrani, Anya McGuirk, Wei Xiao |
| 2017 | Prospecting the Career Development of Talents: A Survival Analysis Perspective. Huayu Li, Yong Ge, Hengshu Zhu, Hui Xiong, Hongke Zhao |
| 2017 | Quick Access: Building a Smart Experience for Google Drive. Sandeep Tata, Alexandrin Popescul, Marc Najork, Mike Colagrosso, Julian Gibbons, Alan Green, Alexandre Mah, Michael Smith, Divanshu Garg, Cayden Meyer, Reuben Kan |
| 2017 | REMIX: Automated Exploration for Interactive Outlier Detection. Yanjie Fu, Charu C. Aggarwal, Srinivasan Parthasarathy, Deepak S. Turaga, Hui Xiong |
| 2017 | RUSH!: Targeted Time-limited Coupons via Purchase Forecasts. Emaad A. Manzoor, Leman Akoglu |
| 2017 | Randomization or Condensation?: Linear-Cost Matrix Sketching Via Cascaded Compression Sampling. Kai Zhang, Chuanren Liu, Jie Zhang, Hui Xiong, Eric P. Xing, Jieping Ye |
| 2017 | Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods. Suhang Wang, Charu C. Aggarwal, Huan Liu |
| 2017 | Real-Time Optimization of Web Publisher RTB Revenues. Pedro Chahuara, Nicolas Grislain, Grégoire Jauvion, Jean-Michel Renders |
| 2017 | ReasoNet: Learning to Stop Reading in Machine Comprehension. Yelong Shen, Po-Sen Huang, Jianfeng Gao, Weizhu Chen |
| 2017 | Recurrent Poisson Factorization for Temporal Recommendation. Seyyed Abbas Hosseini, Keivan Alizadeh, Ali Khodadadi, Ali Arabzadeh, Mehrdad Farajtabar, Hongyuan Zha, Hamid R. Rabiee |
| 2017 | Relay-Linking Models for Prominence and Obsolescence in Evolving Networks. Mayank Singh, Rajdeep Sarkar, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti |
| 2017 | Resolving the Bias in Electronic Medical Records. Kaiping Zheng, Jinyang Gao, Kee Yuan Ngiam, Beng Chin Ooi, James Wei Luen Yip |
| 2017 | Retrospective Higher-Order Markov Processes for User Trails. Tao Wu, David F. Gleich |
| 2017 | Revisiting Power-law Distributions in Spectra of Real World Networks. Nicole Eikmeier, David F. Gleich |
| 2017 | Robust Spectral Clustering for Noisy Data: Modeling Sparse Corruptions Improves Latent Embeddings. Aleksandar Bojchevski, Yves Matkovic, Stephan Günnemann |
| 2017 | Robust Top- Xiaojun Chang, Yaoliang Yu, Yi Yang |
| 2017 | SPARTan: Scalable PARAFAC2 for Large & Sparse Data. Ioakeim Perros, Evangelos E. Papalexakis, Fei Wang, Richard W. Vuduc, Elizabeth Searles, Michael Thompson, Jimeng Sun |
| 2017 | SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis. Qiming Huang, Michael Zhu |
| 2017 | STAR: A System for Ticket Analysis and Resolution. Wubai Zhou, Wei Xue, Ramesh Baral, Qing Wang, Chunqiu Zeng, Tao Li, Jian Xu, Zheng Liu, Larisa Shwartz, Genady Ya. Grabarnik |
| 2017 | Scalable Top-n Local Outlier Detection. Yizhou Yan, Lei Cao, Elke A. Rundensteiner |
| 2017 | Scalable and Sustainable Deep Learning via Randomized Hashing. Ryan Spring, Anshumali Shrivastava |
| 2017 | Semi-Supervised Techniques for Mining Learning Outcomes and Prerequisites. Igor Labutov, Yun Huang, Peter Brusilovsky, Daqing He |
| 2017 | Similarity Forests. Saket Sathe, Charu C. Aggarwal |
| 2017 | Small Batch or Large Batch?: Gaussian Walk with Rebound Can Teach. Peifeng Yin, Ping Luo, Taiga Nakamura |
| 2017 | Spaceborne Data Enters the Mainstream. David Potere |
| 2017 | Sparse Compositional Local Metric Learning. Joseph St. Amand, Jun Huan |
| 2017 | Statistical Emerging Pattern Mining with Multiple Testing Correction. Junpei Komiyama, Masakazu Ishihata, Hiroki Arimura, Takashi Nishibayashi, Shin-ichi Minato |
| 2017 | Stock Price Prediction via Discovering Multi-Frequency Trading Patterns. Liheng Zhang, Charu C. Aggarwal, Guo-Jun Qi |
| 2017 | Structural Deep Brain Network Mining. Shen Wang, Lifang He, Bokai Cao, Chun-Ta Lu, Philip S. Yu, Ann B. Ragin |
| 2017 | Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks. Yuxiao Dong, Reid A. Johnson, Jian Xu, Nitesh V. Chawla |
| 2017 | Structural Event Detection from Log Messages. Fei Wu, Pranay Anchuri, Zhenhui Li |
| 2017 | Supporting Employer Name Normalization at both Entity and Cluster Level. Qiaoling Liu, Faizan Javed, Vachik S. Dave, Ankita Joshi |
| 2017 | TFX: A TensorFlow-Based Production-Scale Machine Learning Platform. Denis Baylor, Eric Breck, Heng-Tze Cheng, Noah Fiedel, Chuan Yu Foo, Zakaria Haque, Salem Haykal, Mustafa Ispir, Vihan Jain, Levent Koc, Chiu Yuen Koo, Lukasz Lew, Clemens Mewald, Akshay Naresh Modi, Neoklis Polyzotis, Sukriti Ramesh, Sudip Roy, Steven Euijong Whang, Martin Wicke, Jarek Wilkiewicz, Xin Zhang, Martin Zinkevich |
| 2017 | TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks. Heng-Tze Cheng, Zakaria Haque, Lichan Hong, Mustafa Ispir, Clemens Mewald, Illia Polosukhin, Georgios Roumpos, D. Sculley, Jamie Smith, David Soergel, Yuan Tang, Philipp Tucker, Martin Wicke, Cassandra Xia, Jianwei Xie |
| 2017 | The Co-Evolution Model for Social Network Evolving and Opinion Migration. Yupeng Gu, Yizhou Sun, Jianxi Gao |
| 2017 | The Fake vs Real Goods Problem: Microscopy and Machine Learning to the Rescue. Ashlesh Sharma, Vidyuth Srinivasan, Vishal Kanchan, Lakshminarayanan Subramanian |
| 2017 | The Future of Artificially Intelligent Assistants. Muthu Muthukrishnan, Andrew Tomkins, Larry P. Heck, Alborz Geramifard, Deepak Agarwal |
| 2017 | The Future of Data Integration. Renée J. Miller |
| 2017 | The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables. Himabindu Lakkaraju, Jon M. Kleinberg, Jure Leskovec, Jens Ludwig, Sendhil Mullainathan |
| 2017 | The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms. Yongxin Tong, Yuqiang Chen, Zimu Zhou, Lei Chen, Jie Wang, Qiang Yang, Jieping Ye, Weifeng Lv |
| 2017 | Three Principles of Data Science: Predictability, Stability and Computability. Bin Yu |
| 2017 | Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data. David Hallac, Sagar Vare, Stephen P. Boyd, Jure Leskovec |
| 2017 | Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster. Naeemul Hassan, Fatma Arslan, Chengkai Li, Mark Tremayne |
| 2017 | Towards an Optimal Subspace for K-Means. Dominik Mautz, Wei Ye, Claudia Plant, Christian Böhm |
| 2017 | Tracking the Dynamics in Crowdfunding. Hongke Zhao, Hefu Zhang, Yong Ge, Qi Liu, Enhong Chen, Huayu Li, Le Wu |
| 2017 | TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams. Chao Zhang, Liyuan Liu, Dongming Lei, Quan Yuan, Honglei Zhuang, Tim Hanratty, Jiawei Han |
| 2017 | Tripoles: A New Class of Relationships in Time Series Data. Saurabh Agrawal, Gowtham Atluri, Anuj Karpatne, William Haltom, Stefan Liess, Snigdhansu Chatterjee, Vipin Kumar |
| 2017 | Unsupervised Discovery of Drug Side-Effects from Heterogeneous Data Sources. Fenglong Ma, Chuishi Meng, Houping Xiao, Qi Li, Jing Gao, Lu Su, Aidong Zhang |
| 2017 | Unsupervised Feature Selection in Signed Social Networks. Kewei Cheng, Jundong Li, Huan Liu |
| 2017 | Unsupervised Network Discovery for Brain Imaging Data. Zilong Bai, Peter B. Walker, Anna E. Tschiffely, Fei Wang, Ian Davidson |
| 2017 | Unsupervised P2P Rental Recommendations via Integer Programming. Yanjie Fu, Guannan Liu, Mingfei Teng, Charu C. Aggarwal |
| 2017 | Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale. Adrian Albert, Jasleen Kaur, Marta C. González |
| 2017 | Visual Search at eBay. Fan Yang, Ajinkya Kale, Yury Bubnov, Leon Stein, Qiaosong Wang, M. Hadi Kiapour, Robinson Piramuthu |
| 2017 | Visualizing Attributed Graphs via Terrain Metaphor. Yang Zhang, Yusu Wang, Srinivasan Parthasarathy |
| 2017 | Weisfeiler-Lehman Neural Machine for Link Prediction. Muhan Zhang, Yixin Chen |
| 2017 | What's Fair? Cynthia Dwork |
| 2017 | When is a Network a Network?: Multi-Order Graphical Model Selection in Pathways and Temporal Networks. Ingo Scholtes |
| 2017 | metapath2vec: Scalable Representation Learning for Heterogeneous Networks. Yuxiao Dong, Nitesh V. Chawla, Ananthram Swami |