SC A

83 papers

Year Title / Authors
2015 A case for application-oblivious energy-efficient MPI runtime.
Akshay Venkatesh, Abhinav Vishnu, Khaled Hamidouche, Nathan R. Tallent, Dhabaleswar K. Panda, Darren J. Kerbyson, Adolfy Hoisie
2015 A kernel-independent FMM in general dimensions.
William B. March, Bo Xiao, Sameer Tharakan, Chenhan D. Yu, George Biros
2015 A parallel connectivity algorithm for de Bruijn graphs in metagenomic applications.
Patrick Flick, Chirag Jain, Tony Pan, Srinivas Aluru
2015 A practical approach to reconciling availability, performance, and capacity in provisioning extreme-scale storage systems.
Lipeng Wan, Feiyi Wang, Sarp Oral, Devesh Tiwari, Sudharshan S. Vazhkudai, Qing Cao
2015 A work-efficient algorithm for parallel unordered depth-first search.
Umut A. Acar, Arthur Charguéraud, Mike Rainey
2015 Adaptive and transparent cache bypassing for GPUs.
Ang Li, Gert-Jan van den Braak, Akash Kumar, Henk Corporaal
2015 Adaptive data placement for staging-based coupled scientific workflows.
Qian Sun, Tong Jin, Melissa Romanus, Hoang Bui, Fan Zhang, Hongfeng Yu, Hemanth Kolla, Scott Klasky, Jacqueline Chen, Manish Parashar
2015 An elegant sufficiency: load-aware differentiated scheduling of data transfers.
Rajkumar Kettimuthu, Gayane Vardoyan, Gagan Agrawal, P. Sadayappan, Ian T. Foster
2015 An extreme-scale implicit solver for complex PDEs: highly heterogeneous flow in earth's mantle.
Johann Rudi, A. Cristiano I. Malossi, Tobin Isaac, Georg Stadler, Michael Gurnis, Peter W. J. Staar, Yves Ineichen, Costas Bekas, Alessandro Curioni, Omar Ghattas
2015 An input-adaptive and in-place approach to dense tensor-times-matrix multiply.
Jiajia Li, Casey Battaglino, Ioakeim Perros, Jimeng Sun, Richard W. Vuduc
2015 AnalyzeThis: an analysis workflow-aware storage system.
Hyogi Sim, Youngjae Kim, Sudharshan S. Vazhkudai, Devesh Tiwari, Ali Anwar, Ali Raza Butt, Lavanya Ramakrishnan
2015 Analyzing and mitigating the impact of manufacturing variability in power-constrained supercomputing.
Yuichi Inadomi, Tapasya Patki, Koji Inoue, Mutsumi Aoyagi, Barry Rountree, Martin Schulz, David K. Lowenthal, Yasutaka Wada, Keiichiro Fukazawa, Masatsugu Ueda, Masaaki Kondo, Ikuo Miyoshi
2015 Automatic sharing classification and timely push for cache-coherent systems.
Malek Musleh, Vijay S. Pai
2015 BD-CATS: big data clustering at trillion particle scale.
Md. Mostofa Ali Patwary, Surendra Byna, Nadathur Rajagopalan Satish, Narayanan Sundaram, Zarija Lukic, Vadim Roytershteyn, Michael J. Anderson, Yushu Yao, Prabhat, Pradeep Dubey
2015 Big omics data experience.
Patricia H. Kovatch, Anthony Costa, Zachary Giles, Eugene Fluder, Hyung Min Cho, Svetlana Mazurkova
2015 Bridging OpenCL and CUDA: a comparative analysis and translation.
Junghyun Kim, Thanh Tuan Dao, Jaehoon Jung, Jinyoung Joo, Jaejin Lee
2015 CIVL: the concurrency intermediate verification language.
Stephen F. Siegel, Manchun Zheng, Ziqing Luo, Timothy K. Zirkel, Andre V. Marianiello, John G. Edenhofner, Matthew B. Dwyer, Michael S. Rogers
2015 CilkSpec: optimistic concurrency for Cilk.
Shaizeen Aga, Sriram Krishnamoorthy, Satish Narayanasamy
2015 Clock delta compression for scalable order-replay of non-deterministic parallel applications.
Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz
2015 Cost-effective diameter-two topologies: analysis and evaluation.
Georgios Kathareios, Cyriel Minkenberg, Bogdan Prisacari, Germán Rodríguez, Torsten Hoefler
2015 Data partitioning strategies for graph workloads on heterogeneous clusters.
Michael LeBeane, Shuang Song, Reena Panda, Jee Ho Ryoo, Lizy K. John
2015 Dynamic power sharing for higher job throughput.
Daniel A. Ellsworth, Allen D. Malony, Barry Rountree, Martin Schulz
2015 ELF: maximizing memory-level parallelism for GPUs with coordinated warp and fetch scheduling.
Jason Jong Kyu Park, Yongjun Park, Scott A. Mahlke
2015 Efficient implementation of quantum materials simulations on distributed CPU-GPU systems.
Raffaele Solcà, Anton Kozhevnikov, Azzam Haidar, Stanimire Tomov, Jack J. Dongarra, Thomas C. Schulthess
2015 Elastic job bundling: an adaptive resource request strategy for large-scale parallel applications.
Feng Liu, Jon B. Weissman
2015 Energy-aware data transfer algorithms.
Ismail Alan, Engin Arslan, Tevfik Kosar
2015 Engineering inhibitory proteins with InSiPS: the in-silico protein synthesizer.
Andrew Schoenrock, Daniel J. Burnside, Houman Moteshareie, Alex Wong, Ashkan Golshani, Frank Dehne
2015 Enterprise: breadth-first graph traversal on GPUs.
Hang Liu, H. Howie Huang
2015 Exploiting asynchrony from exact forward recovery for DUE in iterative solvers.
Luc Jaulmes, Marc Casas, Miquel Moretó, Eduard Ayguadé, Jesús Labarta, Mateo Valero
2015 Exploring network optimizations for large-scale graph analytics.
Xinyu Que, Fabio Checconi, Fabrizio Petrini, Xing Liu, Daniele Buono
2015 Fault tolerant MapReduce-MPI for HPC clusters.
Yanfei Guo, Wesley Bland, Pavan Balaji, Xiaobo Zhou
2015 Finding the limits of power-constrained application performance.
Peter E. Bailey, Aniruddha Marathe, David K. Lowenthal, Barry Rountree, Martin Schulz
2015 Frugal ECC: efficient and versatile memory error protection through fine-grained compression.
Jungrae Kim, Michael B. Sullivan, Seong-Lyong Gong, Mattan Erez
2015 Full correlation matrix analysis of fMRI data on Intel® Xeon Phi™ coprocessors.
Yida Wang, Michael J. Anderson, Jonathan D. Cohen, Alexander Heinecke, Kai Li, Nadathur Satish, Narayanan Sundaram, Nicholas B. Turk-Browne, Theodore L. Willke
2015 GossipMap: a distributed community detection algorithm for billion-edge directed graphs.
Seung-Hee Bae, Bill Howe
2015 GraphBIG: understanding graph computing in the context of industrial solutions.
Lifeng Nai, Yinglong Xia, Ilie Gabriel Tanase, Hyesoon Kim, Ching-Yung Lin
2015 GraphReduce: processing large-scale graphs on accelerator-based systems.
Dipanjan Sengupta, Shuaiwen Leon Song, Kapil Agarwal, Karsten Schwan
2015 High-performance algebraic multigrid solver optimized for multi-core based distributed parallel systems.
Jongsoo Park, Mikhail Smelyanskiy, Ulrike Meier Yang, Dheevatsa Mudigere, Pradeep Dubey
2015 HipMer: an extreme-scale de novo genome assembler.
Evangelos Georganas, Aydin Buluç, Jarrod Chapman, Steven A. Hofmeyr, Chaitanya Aluru, Rob Egan, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick
2015 HydraDB: a resilient RDMA-driven key-value middleware for in-memory cluster computing.
Yandong Wang, Li Zhang, Jian Tan, Min Li, Yuqing Gao, Xavier Guerin, Xiaoqiao Meng, Shicong Meng
2015 IOrchestra: supporting high-performance data-intensive applications in the cloud via collaborative virtualization.
Ron Chi-Lung Chiang, H. Howie Huang, Timothy Wood, Changbin Liu, Oliver Spatscheck
2015 Implicit nonlinear wave simulation with 1.08T DOF and 0.270T unstructured finite elements to enhance comprehensive earthquake simulation.
Tsuyoshi Ichimura, Kohei Fujita, Pher Errol Balde Quinay, Lalith Maddegedara, Muneo Hori, Seizo Tanaka, Yoshihisa Shizawa, Hiroshi Kobayashi, Kazuo Minami
2015 Improving backfilling by using machine learning to predict running times.
Éric Gaussier, David Glesser, Valentin Reis, Denis Trystram
2015 Improving concurrency and asynchrony in multithreaded MPI applications using software offloading.
Karthikeyan Vaidyanathan, Dhiraj D. Kalamkar, Kiran Pamnany, Jeff R. Hammond, Pavan Balaji, Dipankar Das, Jongsoo Park, Bálint Joó
2015 Improving the scalability of the ocean barotropic solver in the community earth system model.
Yong Hu, Xiaomeng Huang, Allison H. Baker, Yu-heng Tseng, Frank O. Bryan, John M. Dennis, Guangwen Yang
2015 Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach.
Christopher M. Sewell, Katrin Heitmann, Hal Finkel, George Zagaris, Suzanne Parete-Koon, Patricia K. Fasel, Adrian Pope, Nicholas Frontiere, Li-Ta Lo, O. E. Bronson Messer, Salman Habib, James P. Ahrens
2015 Local recovery and failure masking for stencil-based applications at extreme scales.
Marc Gamell, Keita Teranishi, Michael A. Heroux, Jackson R. Mayo, Hemanth Kolla, Jacqueline Chen, Manish Parashar
2015 Mantle: a programmable metadata load balancer for the ceph file system.
Michael A. Sevilla, Noah Watkins, Carlos Maltzahn, Ike Nassi, Scott A. Brandt, Sage A. Weil, Greg Farnum, Sam Fineberg
2015 Massively parallel models of the human circulatory system.
Amanda Randles, Erik W. Draeger, Tomas Oppelstrup, Liam Krauss, John A. Gunnels
2015 Massively parallel phase-field simulations for ternary eutectic directional solidification.
Martin Bauer, Johannes Hötzer, Marcus Jainta, Philipp Steinmetz, Marco Berghoff, Florian Schornbaum, Christian Godenschwager, Harald Köstler, Britta Nestler, Ulrich Rüde
2015 Memory access patterns: the missing piece of the multi-GPU puzzle.
Tal Ben-Nun, Ely Levy, Amnon Barak, Eri Rubin
2015 Monetary cost optimizations for MPI-based HPC applications on Amazon clouds: checkpoints and replicated execution.
Yifan Gong, Bingsheng He, Amelie Chi Zhou
2015 Multi-objective job placement in clusters.
Sergey Blagodurov, Alexandra Fedorova, Evgeny Vinnik, Tyler Dwyer, Fabien Hermenier
2015 Network endpoint congestion control for fine-grained communication.
Nan Jiang, Larry R. Dennison, William J. Dally
2015 Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG.
Thomas Scogland, Jonathan Azose, David Rohr, Suzanne Rivoire, Natalie J. Bates, Daniel Hackenberg
2015 Optimal scheduling of in-situ analysis for large-scale scientific simulations.
Preeti Malakar, Venkatram Vishwanath, Todd S. Munson, Christopher Knight, Mark Hereld, Sven Leyffer, Michael E. Papka
2015 PGX.D: a fast distributed graph processing engine.
Sungpack Hong, Siegfried Depner, Thomas Manhardt, Jan Van Der Lugt, Merijn Verstraaten, Hassan Chafi
2015 Parallel distributed memory construction of suffix and longest common prefix arrays.
Patrick Flick, Srinivas Aluru
2015 Parallel implementation and performance optimization of the configuration-interaction method.
Hongzhang Shan, Samuel Williams, Calvin W. Johnson, Kenneth S. McElvain, W. Erich Ormand
2015 Particle tracking in open simulation laboratories.
Kalin Kanov, Randal C. Burns
2015 Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs.
Théo Mary, Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov, Jack J. Dongarra
2015 Performance optimization for the k-nearest neighbors kernel on x86 architectures.
Chenhan D. Yu, Jianyu Huang, Woody Austin, Bo Xiao, George Biros
2015 Practical scalable consensus for pseudo-synchronous distributed systems.
Thomas Hérault, Aurélien Bouteiller, George Bosilca, Marc Gamell, Keita Teranishi, Manish Parashar, Jack J. Dongarra
2015 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2015, Austin, TX, USA, November 15-20, 2015
Jackie Kern, Jeffrey S. Vetter
2015 Profile-based power shifting in interconnection networks with on/off links.
Shinobu Miwa, Hiroshi Nakamura
2015 Pushing back the limit of
Mauro Calderara, Sascha Brück, Andreas Pedersen, Mohammad H. Bani-Hashemian, Joost VandeVondele, Mathieu Luisier
2015 Randomized algorithms to update partial singular value decomposition on a hybrid CPU/GPU cluster.
Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra
2015 Recovering logical structure from Charm++ event traces.
Katherine E. Isaacs, Abhinav Bhatele, Jonathan Lifflander, David Böhme, Todd Gamblin, Martin Schulz, Bernd Hamann, Peer-Timo Bremer
2015 Regent: a high-productivity programming language for HPC with logical regions.
Elliott Slaughter, Wonchan Lee, Sean Treichler, Michael Bauer, Alex Aiken
2015 Relative debugging for a highly parallel hybrid computer system.
Luiz De Rose, Andrew Gontarek, Aaron Vose, Robert Moench, David Abramson, Minh Ngoc Dinh, Chao Jin
2015 Reliability lessons learned from GPU experience with the Titan supercomputer at Oak Ridge leadership computing facility.
Devesh Tiwari, Saurabh Gupta, George Gallarno, Jim Rogers, Don Maxwell
2015 Runtime-driven shared last-level cache management for task-parallel programs.
Abhisek Pan, Vijay S. Pai
2015 STELLA: a domain-specific tool for structured grid methods in weather and climate models.
Tobias Gysi, Carlos Osuna, Oliver Fuhrer, Mauro Bianco, Thomas C. Schulthess
2015 STS-k: a multilevel sparse triangular solution scheme for NUMA multicores.
Humayun Kabir, Joshua Dennis Booth, Guillaume Aupy, Anne Benoit, Yves Robert, Padma Raghavan
2015 ScaAnalyzer: a tool to identify memory scalability bottlenecks in parallel programs.
Xu Liu, Bo Wu
2015 Scalable sparse tensor decompositions in distributed memory systems.
Oguz Kaya, Bora Uçar
2015 Scaling iterative graph computations with GraphMap.
Kisung Lee, Ling Liu, Karsten Schwan, Calton Pu, Qi Zhang, Yang Zhou, Emre Yigitoglu, Pingpeng Yuan
2015 Scientific benchmarking of parallel computing systems: twelve ways to tell the masses when reporting performance results.
Torsten Hoefler, Roberto Belli
2015 Smart: a MapReduce-like framework for in-situ scientific analytics.
Yi Wang, Gagan Agrawal, Tekin Bicer, Wei Jiang
2015 The Spack package manager: bringing order to HPC software chaos.
Todd Gamblin, Matthew P. LeGendre, Michael R. Collette, Gregory L. Lee, Adam Moody, Bronis R. de Supinski, Scott Futral
2015 The in-silico lab-on-a-chip: petascale and high-throughput simulations of microfluidics at cell resolution.
Diego Rossinelli, Yu-Hang Tang, Kirill Lykov, Dmitry Alexeev, Massimo Bernaschi, Panagiotis E. Hadjidoukas, Mauro Bisson, Wayne Joubert, Christian Conti, George E. Karniadakis, Massimiliano Fatica, Igor Pivkin, Petros Koumoutsakos
2015 Understanding the propagation of transient errors in HPC applications.
Rizwan A. Ashraf, Roberto Gioiosa, Gokcen Kestor, Ronald F. DeMara, Chen-Yong Cher, Pradip Bose
2015 VOCL-FT: introducing techniques for efficient soft error coprocessor recovery.
Antonio J. Peña, Wesley Bland, Pavan Balaji