SC A

105 papers

Year Title / Authors
2021 3D acoustic-elastic coupling with gravity: the dynamics of the 2018 Palu, Sulawesi earthquake and tsunami.
Lukas Krenz, Carsten Uphoff, Thomas Ulrich, Alice-Agnes Gabriel, Lauren S. Abrahams, Eric M. Dunham, Michael Bader
2021 A 400 trillion-grid Vlasov simulation on Fugaku supercomputer: large-scale distribution of cosmic relic neutrinos in a six-dimensional phase space.
Kohji Yoshikawa, Satoshi Tanaka, Naoki Yoshida
2021 A next-generation discontinuous Galerkin fluid dynamics solver with application to high-resolution lung airflow simulations.
Martin Kronbichler, Niklas Fehn, Peter Munch, Maximilian Bergbauer, Karl-Robert Wichmann, Carolin Geitner, Momme Allalen, Martin Schulz, Wolfgang A. Wall
2021 APNN-TC: accelerating arbitrary precision neural networks on Ampere GPU tensor cores.
Boyuan Feng, Yuke Wang, Tong Geng, Ang Li, Yufei Ding
2021 Accelerating XOR-based erasure coding using program optimization techniques.
Yuya Uezato
2021 Accelerating all-electron
Honghui Shang, Fang Li, Yunquan Zhang, Ying Liu, Libo Zhang, Mingchuan Wu, Yangjun Wu, Di Wei, Huimin Cui, Xin Liu, Fei Wang, Yuxi Ye, Yingxiang Gao, Shuang Ni, Xin Chen, Dexun Chen
2021 Accelerating applications using edge tensor processing units.
Kuan-Chieh Hsu, Hung-Wei Tseng
2021 Accelerating bandwidth-bound deep learning inference with main-memory accelerators.
Benjamin Y. Cho, Jeageun Jung, Mattan Erez
2021 Accelerating large scale
Muaaz Gul Awan, Steven A. Hofmeyr, Rob Egan, Nan Ding, Aydin Buluç, Jack Deslippe, Leonid Oliker, Katherine A. Yelick
2021 AgEBO-tabular: joint neural architecture and hyperparameter search with autotuned data-parallel training for tabular data.
Romain Égelé, Prasanna Balaprakash, Isabelle Guyon, Venkatram Vishwanath, Fangfang Xia, Rick Stevens, Zhengying Liu
2021 Anton 3: twenty microseconds of molecular dynamics simulation before lunch.
David E. Shaw, Peter J. Adams, Asaph Azaria, Joseph A. Bank, Brannon Batson, Alistair Bell, Michael Bergdorf, Jhanvi Bhatt, J. Adam Butts, Timothy Correia, Robert M. Dirks, Ron O. Dror, Michael P. Eastwood, Bruce Edwards, Amos Even, Peter Feldmann, Michael Fenn, Christopher H. Fenton, Anthony Forte, Joseph Gagliardo, Gennette Gill, Maria Gorlatova, Brian Greskamp, J. P. Grossman, Justin Gullingsrud, Anissa Harper, William Hasenplaugh, Mark Heily, Benjamin Colin Heshmat, Jeremy Hunt, Douglas J. Ierardi, Lev Iserovich, Bryan L. Jackson, Nick P. Johnson, Mollie M. Kirk, John L. Klepeis, Jeffrey S. Kuskin, Kenneth M. Mackenzie, Roy J. Mader, Richard McGowen, Adam McLaughlin, Mark A. Moraes, Mohamed H. Nasr, Lawrence J. Nociolo, Lief O'Donnell, Andrew Parker, Jon L. Peticolas, Goran Pocina, Cristian Predescu, Terry Quan, John K. Salmon, Carl Schwink, Keun Sup Shim, Naseer Siddique, Jochen Spengler, Tamas Szalay, Raymond Tabladillo, Reinhard Tartler, Andrew G. Taube, Michael Theobald, Brian Towles, William Vick, Stanley C. Wang, Michael Wazlowski, Madeleine J. Weingarten, John M. Williams, Kevin A. Yuh
2021 Arithmetic-intensity-guided fault tolerance for neural network inference on GPUs.
Jack Kosaian, K. V. Rashmi
2021 BAASH: lightweight, efficient, and reliable blockchain-as-a-service for HPC systems.
Abdullah Al-Mamun, Feng Yan, Dongfang Zhao
2021 Billion atom molecular dynamics simulations of carbon at extreme conditions and experimental time and length scales.
Kien Nguyen-Cong, Jonathan T. Willman, Stan G. Moore, Anatoly B. Belonoshko, Rahulkumar Gayatri, Evan Weinberg, Mitchell A. Wood, Aidan P. Thompson, Ivan I. Oleynik
2021 Bootstrapping in-situ workflow auto-tuning via combining performance models of component applications.
Tong Shu, Yanfei Guo, Justin M. Wozniak, Xiaoning Ding, Ian T. Foster, Tahsin M. Kurç
2021 CAKE: matrix multiplication using constant-bandwidth blocks.
H. T. Kung, Vikas Natesh, Andrew Sabot
2021 Characterization and prediction of deep learning workloads in large-scale GPU datacenters.
Qinghao Hu, Peng Sun, Shengen Yan, Yonggang Wen, Tianwei Zhang
2021 Chimera: efficiently training large-scale neural networks with bidirectional pipelines.
Shigang Li, Torsten Hoefler
2021 Clairvoyant prefetching for distributed machine learning I/O.
Nikoli Dryden, Roman Böhringer, Tal Ben-Nun, Torsten Hoefler
2021 Closing the "quantum supremacy" gap: achieving real-time simulation of a random quantum circuit using a new Sunway supercomputer.
Yong (Alexander) Liu, Xin (Lucy) Liu, Fang (Nancy) Li, Haohuan Fu, Yuling Yang, Jiawei Song, Pengpeng Zhao, Zhen Wang, Dajia Peng, Huarong Chen, Chu Guo, Heliang Huang, Wenzhao Wu, Dexun Chen
2021 Cuttlefish: library for achieving energy efficiency in multicore parallel programs.
Sunil Kumar, Akshat Gupta, Vivek Kumar, Sridutt Bhalachandra
2021 DeltaFS: a scalable no-ground-truth filesystem for massively-parallel computing.
Qing Zheng, Charles D. Cranor, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary A. Grider
2021 Discovering and balancing fundamental cycles in large signed graphs.
Ghadeer Alabandi, Jelena Tesic, Lucas Rusnak, Martin Burtscher
2021 DistGNN: scalable distributed training for large-scale graph neural networks.
Md. Vasimuddin, Sanchit Misra, Guixiang Ma, Ramanarayan Mohanty, Evangelos Georganas, Alexander Heinecke, Dhiraj D. Kalamkar, Nesreen K. Ahmed, Sasikanth Avancha
2021 Distributed multigrid neural solvers on megavoxel domains.
Aditya Balu, Sergio Botelho, Biswajit Khara, Vinay Rao, Soumik Sarkar, Chinmay Hegde, Adarsh Krishnamurthy, Santi Adavani, Baskar Ganapathysubramanian
2021 Distributed quantum computing with QMPI.
Thomas Häner, Damian S. Steiger, Torsten Hoefler, Matthias Troyer
2021 Dr. Top-k: delegate-centric Top-k on GPUs.
Anil Gaihre, Da Zheng, Scott Weitze, Lingda Li, Shuaiwen Leon Song, Caiwen Ding, Xiaoye S. Li, Hang Liu
2021 E.T.: re-thinking self-attention for transformer models on GPUs.
Shiyang Chen, Shaoyi Huang, Santosh Pandey, Bingbing Li, Guang R. Gao, Long Zheng, Caiwen Ding, Hang Liu
2021 EIGA: elastic and scalable dynamic graph analysis.
Kasimir Gabert, Kaan Sancak, M. Yusuf Özkaya, Ali Pinar, Ümit V. Çatalyürek
2021 Efficient large-scale language model training on GPU clusters using Megatron-LM.
Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia
2021 Efficient scaling of dynamic graph neural networks.
Venkatesan T. Chakaravarthy, Shivmaran S. Pandian, Saurabh Raje, Yogish Sabharwal, Toyotaro Suzumura, Shashanka Ubaru
2021 Efficient tensor core-based GPU kernels for structured sparsity under reduced precision.
Zhaodong Chen, Zheng Qu, Liu Liu, Yufei Ding, Yuan Xie
2021 Empirical evaluation of circuit approximations on noisy quantum devices.
Ellis Wilson, Frank Mueller, Lindsay Bassman, Costin Iancu
2021 Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.
Weihao Cui, Han Zhao, Quan Chen, Ningxin Zheng, Jingwen Leng, Jieru Zhao, Zhuo Song, Tao Ma, Yong Yang, Chao Li, Minyi Guo
2021 Enabling and scaling the HPCG benchmark on the newest generation Sunway supercomputer with 42 million heterogeneous cores.
Qianchao Zhu, Hao Luo, Chao Yang, Mingshuo Ding, Wanwang Yin, Xinhui Yuan
2021 Enabling large-scale correlated electronic structure calculations: scaling the RI-MP2 method on Summit.
Giuseppe M. J. Barca, Jorge L. Galvez Vallejo, David L. Poole, Melisa Alkan, Ryan Stocks, Alistair P. Rendell, Mark S. Gordon
2021 Error-controlled, progressive, and adaptable retrieval of scientific data with multilevel decomposition.
Xin Liang, Qian Gong, Jieyang Chen, Ben Whitney, Lipeng Wan, Qing Liu, David Pugmire, Rick Archibald, Norbert Podhorszki, Scott Klasky
2021 Exploiting user activeness for data retention in HPC systems.
Wei Zhang, Suren Byna, Hyogi Sim, SangKeun Lee, Sudharshan Vazhkudai, Yong Chen
2021 Extreme-scale
Honghui Shang, Fang Li, Yunquan Zhang, Libo Zhang, You Fu, Yingxiang Gao, Yangjun Wu, Xiaohui Duan, Rongfen Lin, Xin Liu, Ying Liu, Dexun Chen
2021 FastZ: accelerating gapped whole genome alignment on GPUs.
Sree Charan Gundabolu, T. N. Vijaykumar, Mithuna Thottethodi
2021 FedAT: a high-performance and communication-efficient federated learning system with asynchronous tiers.
Zheng Chai, Yujing Chen, Ali Anwar, Liang Zhao, Yue Cheng, Huzefa Rangwala
2021 Flare: flexible in-network allreduce.
Daniele De Sensi, Salvatore Di Girolamo, Saleh Ashkboos, Shigang Li, Torsten Hoefler
2021 G-SEPM: building an accurate and efficient soft error prediction model for GPGPUs.
Hengshan Yue, Xiaohui Wei, Guangli Li, Jianpeng Zhao, Nan Jiang, Jingweijia Tan
2021 Generalizable coordination of large multiscale workflows: challenges and learnings at scale.
Harsh Bhatia, Francesco Di Natale, Joseph Y. Moon, Xiaohua Zhang, Joseph R. Chavez, Fikret Aydin, Christopher B. Stanley, Tomas Oppelstrup, Chris Neale, Sara Kokkila Schumacher, Dong H. Ahn, Stephen Herbein, Timothy S. Carpenter, Sandrasegaram Gnanakaran, Peer-Timo Bremer, James N. Glosli, Felice C. Lightstone, Helgi I. Ingólfsson
2021 HPAC: evaluating approximate computing techniques on HPC OpenMP applications.
Konstantinos Parasyris, Giorgis Georgakoudis, Harshitha Menon, James Diffenderfer, Ignacio Laguna, Daniel Osei-Kuffuor, Markus Schordan
2021 Hardware acceleration of tensor-structured multilevel Ewald summation method on MDGRAPE-4A, a special-purpose computer system for molecular dynamics simulations.
Gentaro Morimoto, Yohei M. Koyama, Hao Zhang, Teruhisa S. Komatsu, Yousuke Ohno, Keigo Nishida, Itta Ohmura, Hiroshi Koyama, Makoto Taiji
2021 Hardware-supported remote persistence for distributed persistent memory.
Zhuohui Duan, Haodi Lu, Haikun Liu, Xiaofei Liao, Hai Jin, Yu Zhang, Song Wu
2021 HatRPC: hint-accelerated Thrift RPC over RDMA.
Tianxi Li, Haiyang Shi, Xiaoyi Lu
2021 High performance uncertainty quantification with parallelized multilevel Markov chain Monte Carlo.
Linus Seelinger, Anne Reinarz, Leonhard Rannabauer, Michael Bader, Peter Bastian, Robert Scheichl
2021 High-throughput virtual screening of small molecule inhibitors for SARS-CoV-2 protein targets with deep fusion models.
Garrett A. Stevenson, Derek Jones, Hyojin Kim, W. F. Drew Bennett, Brian J. Bennion, Monica Borucki, Feliza Bourguet, Aidan Epstein, Magdalena Franco, Brooke Harmon, Stewart He, Max P. Katz, Daniel A. Kirshner, Victoria Lao, Edmond Y. Lau, Jacky Lo, Kevin McLoughlin, Richard Mosesso, Deepa K. Murugesh, Oscar A. Negrete, Edwin A. Saada, Brent Segelke, Maxwell Stefan, Marisa W. Torres, Dina Weilhammer, Sergio Ernesto Wong, Yue Yang, Adam T. Zemla, Xiaohua Zhang, Fangqiang Zhu, Felice C. Lightstone, Jonathan E. Allen
2021 Hybrid, scalable, trace-driven performance modeling of GPGPUs.
Yehia Arafa, Abdel-Hameed A. Badawy, Ammar ElWazir, Atanu Barai, Ali Eker, Gopinath Chennupati, Nandakishore Santhi, Stephan J. Eidenbenz
2021 In-depth analyses of unified virtual memory system for GPU accelerated computing.
Tyler N. Allen, Rong Ge
2021 Index launches: scalable, flexible representation of parallel task groups.
Rupanshu Soi, Michael Bauer, Sean Treichler, Manolis Papadakis, Wonchan Lee, Patrick S. McCormick, Alex Aiken, Elliott Slaughter
2021 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2021, St. Louis, Missouri, USA, November 14-19, 2021
Bronis R. de Supinski, Mary W. Hall, Todd Gamblin
2021 KAISA: an adaptive second-order optimizer framework for deep neural networks.
J. Gregory Pauloski, Qi Huang, Lei Huang, Shivaram Venkataraman, Kyle Chard, Ian T. Foster, Zhao Zhang
2021 Krill: a compiler and runtime system for concurrent graph processing.
Hongzheng Chen, Minghua Shen, Nong Xiao, Yutong Lu
2021 LCCG: a locality-centric hardware accelerator for high throughput of concurrent graph processing.
Jin Zhao, Yu Zhang, Xiaofei Liao, Ligang He, Bingsheng He, Hai Jin, Haikun Liu
2021 LIBSHALOM: optimizing small and irregular-shaped matrix multiplications on ARMv8 multi-cores.
Weiling Yang, Jianbin Fang, Dezun Dong, Xing Su, Zheng Wang
2021 LMFF: efficient and scalable layered materials force field on heterogeneous many-core processors.
Ping Gao, Xiaohui Duan, Jiaxu Guo, Jin Wang, Zhenya Song, Lizhen Cui, Xiangxu Meng, Xin Liu, Wusheng Zhang, Ming Ma, Guohui Li, Dexun Chen, Haohuan Fu, Wei Xue, Weiguo Liu, Guangwen Yang
2021 Linux vs. lightweight multi-kernels for high performance computing: experiences at pre-exascale.
Balazs Gerofi, Kohei Tarumizu, Lei Zhang, Takayuki Okamoto, Masamichi Takagi, Shinji Sumimoto, Yutaka Ishikawa
2021 LogECMem: coupling erasure-coded in-memory key-value stores with parity logging.
Liangfeng Cheng, Yuchong Hu, Zhaokang Ke, Jia Xu, Qiaori Yao, Dan Feng, Weichun Wang, Wei Chen
2021 Lunule: an agile and judicious metadata load balancer for CephFS.
Yiduo Wang, Cheng Li, Xinyang Shao, Youxu Chen, Feng Yan, Yinlong Xu
2021 MAPA: multi-accelerator pattern allocation policy for multi-tenant GPU servers.
Kiran Ranganath, Joshua D. Suetterlein, Joseph B. Manzano, Shuaiwen Leon Song, Daniel Wong
2021 Meeting the real-time challenges of ground-based telescopes using low-rank matrix computations.
Hatem Ltaief, Jesse Cranney, Damien Gratadour, Yuxi Hong, Laurent Gatineau, David E. Keyes
2021 Minimizing privilege for building HPC containers.
Reid Priedhorsky, Shane Richard Canon, Timothy Randles, Andrew J. Younge
2021 Non-recurring engineering (NRE) best practices: a case study with the NERSC/NVIDIA OpenMP contract.
Christopher S. Daley, Annemarie Southwell, Rahulkumar Gayatri, Scott Biersdorff, Craig Toepfer, Güray Özen, Nicholas J. Wright
2021 On the parallel I/O optimality of linear algebra kernels: near-optimal matrix factorizations.
Grzegorz Kwasniewski, Marko Kabic, Tal Ben-Nun, Alexandros Nikolaos Ziogas, Jens Eirik Saethre, André Gaillard, Timo Schneider, Maciej Besta, Anton Kozhevnikov, Joost VandeVondele, Torsten Hoefler
2021 Online evolutionary batch size orchestration for scheduling deep learning workloads in GPU clusters.
Zhengda Bian, Shenggui Li, Wei Wang, Yang You
2021 Online optimization of file transfers in high-speed networks.
Md. Arifuzzaman, Engin Arslan
2021 Overcoming barriers to scalability in variational quantum Monte Carlo.
Tianchen Zhao, Saibal De, Brian Chen, James Stokes, Shravan K. Veerapaneni
2021 PAGANI: a parallel adaptive GPU algorithm for numerical integration.
Ioannis Sakiotis, Kamesh Arumugam, Marc F. Paterno, Desh Ranjan, Balsa Terzic, Mohammad Zubair
2021 PEPPA-X: finding program test inputs to bound silent data corruption vulnerability in HPC applications.
Md Hasanur Rahman, Aabid Shamji, Shengjian Guo, Guanpeng Li
2021 Parallel construction of module networks.
Ankit Srivastava, Sriram P. Chockalingam, Maneesha Aluru, Srinivas Aluru
2021 Paths to OpenMP in the kernel.
Jiacheng Ma, Wenyi Wang, Aaron Nelson, Michael Cuevas, Brian Homerding, Conghao Liu, Zhen Huang, Simone Campanoni, Kyle C. Hale, Peter A. Dinda
2021 Pilgrim: scalable and (near) lossless MPI tracing.
Chen Wang, Pavan Balaji, Marc Snir
2021 Pinpointing crash-consistency bugs in the HPC I/O stack: a cross-layer approach.
Jinghan Sun, Jian Huang, Marc Snir
2021 Preparing an incompressible-flow fluid dynamics code for exascale-class wind energy simulations.
Paul Mullowney, Ruipeng Li, Stephen J. Thomas, Shreyas Ananthan, Ashesh Sharma, Jon S. Rood, Alan B. Williams, Michael A. Sprague
2021 Productivity, portability, performance: data-centric Python.
Alexandros Nikolaos Ziogas, Timo Schneider, Tal Ben-Nun, Alexandru Calotoiu, Tiziano De Matteis, Johannes de Fine Licht, Luca Lavarini, Torsten Hoefler
2021 RIBBON: cost-effective and QoS-aware deep learning model inference using a diverse pool of cloud computing instances.
Baolin Li, Rohan Basu Roy, Tirthak Patel, Vijay Gadepally, Karen Gettings, Devesh Tiwari
2021 Reducing redundancy in data organization and arithmetic calculation for stencil computations.
Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue
2021 Representation of women in HPC conferences.
Eitan Frachtenberg, Rhody D. Kaner
2021 Resilient error-bounded lossy compressor for data transfer.
Sihuan Li, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello
2021 Revealing power, energy and thermal dynamics of a 200PF pre-exascale supercomputer.
Woong Shin, Vladyslav Oles, Ahmad Maroof Karimi, J. Austin Ellis, Feiyi Wang
2021 Reverse-mode automatic differentiation and optimization of GPU kernels via Enzyme.
William S. Moses, Valentin Churavy, Ludger Paehler, Jan Hückelheim, Sri Hari Krishna Narayanan, Michel Schanen, Johannes Doerfert
2021 SEEC: stochastic escape express channel.
Mayank Parasar, Natalie D. Enright Jerger, Paul V. Gratz, Joshua San Miguel, Tushar Krishna
2021 STM-multifrontal QR: streaming task mapping multifrontal QR factorization empowered by GCN.
Shengle Lin, Wangdong Yang, Haotian Wang, Qinyun Tsai, Kenli Li
2021 SV-sim: scalable PGAS-based state vector simulation of quantum circuits.
Ang Li, Bo Fang, Christopher E. Granade, Guen Prawiroatmodjo, Bettina Heim, Martin Roetteler, Sriram Krishnamoorthy
2021 SW_Qsim: a minimize-memory quantum simulator with high-performance on a new Sunway supercomputer.
Fang Li, Xin Liu, Yong Liu, Pengpeng Zhao, Yuling Yang, Honghui Shang, Weizhe Sun, Zhen Wang, Enming Dong, Dexun Chen
2021 Scalable FBP decomposition for cone-beam CT reconstruction.
Peng Chen, Mohamed Wahib, Xiao Wang, Takahiro Hirofuchi, Hirotaka Ogawa, Ander Biguri, Richard P. Boardman, Thomas Blumensath, Satoshi Matsuoka
2021 Scalable adaptive PDE solvers in arbitrary domains.
Kumar Saurabh, Masado Ishii, Milinda Fernando, Boshun Gao, Kendrick Tan, Ming-Chen Hsu, Adarsh Krishnamurthy, Hari Sundar, Baskar Ganapathysubramanian
2021 Scalable edge-based hyperdimensional learning system with brain-like neural adaptation.
Zhuowen Zou, Yeseong Kim, Farhad Imani, Haleh Alimohamadi, Rosario Cammarota, Mohsen Imani
2021 Simurgh: a fully decentralized and secure NVMM user space file system.
Nafiseh Moti, Frederic Schimmelpfennig, Reza Salkhordeh, David Klopp, Toni Cortes, Ulrich Rückert, André Brinkmann
2021 Single-node partitioned-memory for huge graph analytics: cost and performance trade-offs.
Sayan Ghosh, Nathan R. Tallent, Marco Minutoli, Mahantesh Halappanavar, Ramesh Peri, Ananth Kalyanaraman
2021 Symplectic structure-preserving particle-in-cell whole-volume simulation of tokamak plasmas to 111.3 trillion particles and 25.7 billion grids.
Jianyuan Xiao, Junshi Chen, Jiangshan Zheng, Hong An, Shenghong Huang, Chao Yang, Fang Li, Ziyu Zhang, Yeqi Huang, Wenting Han, Xin Liu, Dexun Chen, Zixi Liu, Ge Zhuang, JiaLe Chen, Guoqiang Li, Xuan Sun, Qiang Chen
2021 Systematically inferring I/O performance variability by examining repetitive job behavior.
Emily Costa, Tirthak Patel, Benjamin Schwaller, Jim M. Brandt, Devesh Tiwari
2021 Temporal vectorization for stencils.
Liang Yuan, Hang Cao, Yunquan Zhang, Kun Li, Pengqi Lu, Yue Yue
2021 Tensor processing primitives: a programming abstraction for efficiency and portability in deep learning workloads.
Evangelos Georganas, Dhiraj D. Kalamkar, Sasikanth Avancha, Menachem Adelman, Cristina Anderson, Alexander Breuer, Jeremy Bruestle, Narendra Chaudhary, Abhisek Kundu, Denise Kutnick, Frank Laub, Md. Vasimuddin, Sanchit Misra, Ramanarayan Mohanty, Hans Pabst, Barukh Ziv, Alexander Heinecke
2021 TensorKMC: kinetic Monte Carlo simulation of 50 trillion atoms driven by deep learning on a new generation of Sunway supercomputer.
Honghui Shang, Xin Chen, Xingyu Gao, Rongfen Lin, Lifang Wang, Fang Li, Qian Xiao, Lei Xu, Qiang Sun, Leilei Zhu, Fei Wang, Yunquan Zhang, Haifeng Song
2021 The hidden cost of the edge: a performance comparison of edge and cloud latencies.
Ahmed Ali-Eldin, Bin Wang, Prashant J. Shenoy
2021 TriPoll: computing surveys of triangles in massive-scale temporal graphs with metadata.
Trevor Steil, Tahsin Reza, Keita Iwabuchi, Benjamin W. Priest, Geoffrey Sanders, Roger Pearce
2021 Understanding, predicting and scheduling serverless workloads under partial interference.
Laiping Zhao, Yanan Yang, Yiming Li, Xian Zhou, Keqiu Li
2021 Whale: efficient one-to-many data partitioning in RDMA-assisted distributed stream processing systems.
Jie Tan, Hanhua Chen, Yonghui Wang, Hai Jin
2021 ZeRO-Infinity: breaking the GPU memory wall for extreme scale deep learning.
Samyam Rajbhandari, Olatunji Ruwase, Jeff Rasley, Shaden Smith, Yuxiong He
2021 cuTS: scaling subgraph isomorphism on distributed multi-GPU systems using trie based data structure.
Lizhi Xiang, Arif Khan, Edoardo Serra, Mahantesh Halappanavar, Aravind Sukumaran-Rajam
2021 ndzip-gpu: efficient lossless compression of scientific floating-point data on GPUs.
Fabian Knorr, Peter Thoman, Thomas Fahringer