ISCA A*

74 papers

Year Title / Authors
2022 2QAN: a quantum compiler for 2-local qubit hamiltonian simulation algorithms.
Lingling Lao, Dan E. Browne
2022 A scalable architecture for reprioritizing ordered parallelism.
Gilead Posluns, Yan Zhu, Guowei Zhang, Mark C. Jeffrey
2022 A software-defined tensor streaming multiprocessor for large-scale machine learning.
Dennis Abts, Garrin Kimmell, Andrew C. Ling, John Kim, Matthew Boyd, Andrew Bitar, Sahil Parmar, Ibrahim Ahmed, Roberto DiCecco, David Han, John Thompson, Michael Bye, Jennifer Hwang, Jeremy Fowers, Peter Lillian, Ashwin Murthy, Elyas Mehtabuddin, Chetan Tekur, Thomas Sohmers, Kris Kang, Stephen Maresh, Jonathan Ross
2022 A synthesis framework for stitching surface code with superconducting quantum devices.
Anbang Wu, Gushu Li, Hezi Zhang, Gian Giacomo Guerreschi, Yufei Ding, Yuan Xie
2022 ACT: designing sustainable computer systems with an architectural carbon modeling tool.
Udit Gupta, Mariam Elgamal, Gage Hills, Gu-Yeon Wei, Hsien-Hsin S. Lee, David Brooks, Carole-Jean Wu
2022 AI accelerator on IBM Telum processor: industrial product.
Cédric Lichtenau, Alper Buyuktosunoglu, Ramon Bertran, Peter Figuli, Christian Jacobi, Nikolaos Papandreou, Haris Pozidis, Anthony Saporito, Andrew Sica, Elpida Tzortzatos
2022 AMOS: enabling automatic mapping for tensor computations on spatial accelerators with hardware abstraction.
Size Zheng, Renze Chen, Anjiang Wei, Yicheng Jin, Qin Han, Liqiang Lu, Bingyang Wu, Xiuhong Li, Shengen Yan, Yun Liang
2022 ASAP: architecture support for asynchronous persistence.
Ahmed H. M. O. Abulila, Izzat El Hajj, Myoungsoo Jung, Nam Sung Kim
2022 Accelerating attention through gradient-based learned runtime pruning.
Zheng Li, Soroush Ghodrati, Amir Yazdanbakhsh, Hadi Esmaeilzadeh, Mingu Kang
2022 Accelerating database analytic query workloads using an associative processor.
Helena Caminal, Yannis Chronis, Tianshu Wu, Jignesh M. Patel, José F. Martínez
2022 Anticipating and eliminating redundant computations in accelerated sparse training.
Jonathan S. Lew, Yunpeng Liu, Wenyi Gong, Negar Goli, R. David Evans, Tor M. Aamodt
2022 Axiomatic hardware-software contracts for security.
Nicholas Mosier, Hanna Lachnitt, Hamed Nemati, Caroline Trippel
2022 BTS: an accelerator for bootstrappable fully homomorphic encryption.
Sangpyo Kim, Jongmin Kim, Michael Jaemin Kim, Wonkyung Jung, John Kim, Minsoo Rhu, Jung Ho Ahn
2022 BioHD: an efficient genome sequence search platform using HyperDimensional memorization.
Zhuowen Zou, Hanning Chen, Prathyush Poduval, Yeseong Kim, Mahdi Imani, Elaheh Sadredini, Rosario Cammarota, Mohsen Imani
2022 CaSMap: agile mapper for reconfigurable spatial architectures by automatically clustering intermediate representations and scattering mapping process.
Xingchen Man, Jianfeng Zhu, Guihuan Song, Shouyi Yin, Shaojun Wei, Leibo Liu
2022 Cascading structured pruning: enabling high data reuse for sparse DNN accelerators.
Edward Hanson, Shiyu Li, Hai Helen Li, Yiran Chen
2022 CraterLake: a hardware accelerator for efficient unbounded computation on encrypted data.
Nikola Samardzic, Axel Feldmann, Aleksandar Krastev, Nathan Manohar, Nicholas Genise, Srinivas Devadas, Karim Eldefrawy, Chris Peikert, Daniel Sánchez
2022 Crescent: taming memory irregularities for accelerating deep point cloud analytics.
Yu Feng, Gunnar Hammonds, Yiming Gan, Yuhao Zhu
2022 DIMMining: pruning-efficient and parallel graph mining on near-memory-computing.
Guohao Dai, Zhenhua Zhu, Tianyu Fu, Chiyue Wei, Bangyan Wang, Xiangyu Li, Yuan Xie, Huazhong Yang, Yu Wang
2022 Dynamic global adaptive routing in high-radix networks.
Hans Kasan, Gwangsun Kim, Yung Yi, John Kim
2022 EDAM: edit distance tolerant approximate matching content addressable memory.
Robert Hanhan, Esteban Garzón, Zuher Jahshan, Adam Teman, Marco Lanuzza, Leonid Yavits
2022 EQC: ensembled quantum computing for variational quantum algorithms.
Samuel A. Stein, Nathan Wiebe, Yufei Ding, Bo Peng, Karol Kowalski, Nathan A. Baker, James Ang, Ang Li
2022 EyeCoD: eye tracking system acceleration via FlatCam-based algorithm & accelerator co-design.
Haoran You, Cheng Wan, Yang Zhao, Zhongzhi Yu, Yonggan Fu, Jiayi Yuan, Shang Wu, Shunyao Zhang, Yongan Zhang, Chaojian Li, Vivek Boominathan, Ashok Veeraraghavan, Ziyun Li, Yingyan Lin
2022 FFCCD: fence-free crash-consistent concurrent defragmentation for persistent memory.
Yuanchao Xu, Chencheng Ye, Yan Solihin, Xipeng Shen
2022 Fidas: fortifying the cloud via comprehensive FPGA-based offloading for intrusion detection: industrial product.
Jian Chen, Xiaoyu Zhang, Tao Wang, Ying Zhang, Tao Chen, Jiajun Chen, Mingxu Xie, Qiang Liu
2022 FlexiCores: low footprint, high yield, field reprogrammable flexible microprocessors.
Nathaniel Bleier, Calvin Lee, Francisco Rodriguez, Antony Sou, Scott White, Rakesh Kumar
2022 Free atomics: hardware atomic operations without fences.
Ashkan Asgharzadeh, Juan M. Cebrian, Arthur Perais, Stefanos Kaxiras, Alberto Ros
2022 GCoM: a detailed GPU core model for accurate analytical modeling of modern GPUs.
Jounghoo Lee, Yeonan Ha, Suhyun Lee, Jinyoung Woo, Jinho Lee, Hanhwi Jang, Youngsok Kim
2022 Gearbox: a case for supporting accumulation dispatching and hybrid partitioning in PIM-based accelerators.
Marzieh Lenjani, Alif Ahmed, Mircea Stan, Kevin Skadron
2022 Geyser: a compilation framework for quantum computing with neutral atoms.
Tirthak Patel, Daniel Silver, Devesh Tiwari
2022 Graphite: optimizing graph neural networks on CPUs through cooperative software-hardware techniques.
Zhangxiaowen Gong, Houxiang Ji, Yao Yao, Christopher W. Fletcher, Christopher J. Hughes, Josep Torrellas
2022 HiveMind: a hardware-software system stack for serverless edge swarms.
Liam Patterson, David Pigorovsky, Brian Dempsey, Nikita Lazarev, Aditya Shah, Clara Steinhoff, Ariana Bruno, Justin Hu, Christina Delimitrou
2022 Hydra: enabling low-overhead mitigation of row-hammer at ultra-low thresholds via hybrid tracking.
Moinuddin K. Qureshi, Aditya Rohan, Gururaj Saileshwar, Prashant J. Nair
2022 Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network.
Shuangchen Li, Dimin Niu, Yuhao Wang, Wei Han, Zhe Zhang, Tianchan Guan, Yijin Guan, Heng Liu, Linyong Huang, Zhaoyang Du, Fei Xue, Yuanwei Fang, Hongzhong Zheng, Yuan Xie
2022 INSPIRE: in-storage private information retrieval via protocol and architecture co-design.
Jilan Lin, Ling Liang, Zheng Qu, Ishtiyaque Ahmad, Liu Liu, Fengbin Tu, Trinabh Gupta, Yufei Ding, Yuan Xie
2022 ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18 - 22, 2022
Valentina Salapura, Mohamed Zahran, Fred Chong, Lingjia Tang
2022 Increasing Ising machine capacity with multi-chip architectures.
Anshujit Sharma, Richard Afoakwa, Zeljko Ignjatovic, Michael C. Huang
2022 LightPC: hardware and software co-design for energy-efficient full system persistence.
Sangwon Lee, Miryeong Kwon, Gyuyoung Park, Myoungsoo Jung
2022 Lukewarm serverless functions: characterization and optimization.
David Schall, Artemiy Margaritov, Dmitrii Ustiugov, Andreas Sandberg, Boris Grot
2022 MGX: near-zero overhead memory protection for data-intensive accelerators.
Weizhe Hua, Muhammad Umar, Zhiru Zhang, G. Edward Suh
2022 MOESI-prime: preventing coherence-induced hammering in commodity workloads.
Kevin Loughlin, Stefan Saroiu, Alec Wolman, Yatin A. Manerkar, Baris Kasikci
2022 Managing reliability skew in DNA storage.
Dehui Lin, Yasamin Tabatabaee, Yash Pote, Djordje Jevdjic
2022 MeNDA: a near-memory multi-way merge solution for sparse transposition and dataflows.
Siying Feng, Xin He, Kuan-Yu Chen, Liu Ke, Xuan Zhang, David T. Blaauw, Trevor N. Mudge, Ronald G. Dreslinski
2022 Mixed-proxy extensions for the NVIDIA PTX memory consistency model: industrial product.
Daniel Lustig, Simon Cooksey, Olivier Giroux
2022 Mokey: enabling narrow fixed-point inference for out-of-the-box floating-point transformer models.
Ali Hadi Zadeh, Mostafa Mahmoud, Ameer Abdelhadi, Andreas Moshovos
2022 NDMiner: accelerating graph pattern mining using near data processing.
Nishil Talati, Haojie Ye, Yichen Yang, Leul Belayneh, Kuan-Yu Chen, David T. Blaauw, Trevor N. Mudge, Ronald G. Dreslinski
2022 NvMR: non-volatile memory renaming for intermittent computing.
Abhishek Bhattacharyya, Abhijith Somashekhar, Joshua San Miguel
2022 PACMAN: attacking ARM pointer authentication with speculative execution.
Joseph Ravichandran, Weon Taek Na, Jay Lang, Mengjia Yan
2022 PPMLAC: high performance chipset architecture for secure multi-party computation.
Xing Zhou, Zhilei Xu, Cong Wang, Mingyu Gao
2022 PS-ORAM: efficient crash consistency support for oblivious RAM on NVM.
Gang Liu, Kenli Li, Zheng Xiao, Rujia Wang
2022 RACOD: algorithm/hardware co-design for mobile robot path planning.
Mohammad Bakhshalipour, Seyed Borna Ehsani, Mohamad Qadri, Dominic Guri, Maxim Likhachev, Phillip B. Gibbons
2022 Register file prefetching.
Sudhanshu Shukla, Sumeet Bandishte, Jayesh Gaur, Sreenivas Subramoney
2022 Rethinking programmable wearable processors.
Nathaniel Bleier, Muhammad Husnain Mubarik, Srijan Chakraborty, Shreyas Kishore, Rakesh Kumar
2022 SIMD
Yunan Zhang, Po-An Tsai, Hung-Wei Tseng
2022 SNS's not a synthesizer: a deep-learning-based synthesis predictor.
Ceyu Xu, Chris Kjellqvist, Lisa Wu Wills
2022 SeGraM: a universal hardware accelerator for genomic sequence-to-graph and sequence-to-sequence mapping.
Damla Senol Cali, Konstantinos Kanellopoulos, Joël Lindegger, Zülal Bingöl, Gurpreet S. Kalsi, Ziyi Zuo, Can Firtina, Meryem Banu Cavlak, Jeremie S. Kim, Nika Mansouri-Ghiasi, Gagandeep Singh, Juan Gómez-Luna, Nour Almadhoun Alserr, Mohammed Alser, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu
2022 Securing GPU via region-based bounds checking.
Jaewon Lee, Yonghae Kim, Jiashen Cao, Euna Kim, Jaekyu Lee, Hyesoon Kim
2022 Sibyl: adaptive and extensible data placement in hybrid storage systems using online reinforcement learning.
Gagandeep Singh, Rakesh Nadig, Jisung Park, Rahul Bera, Nastaran Hajinazar, David Novo, Juan Gómez-Luna, Sander Stuijk, Henk Corporaal, Onur Mutlu
2022 SmartSAGE: training large-scale graph neural networks using in-storage processing architectures.
Yunjae Lee, Jinha Chung, Minsoo Rhu
2022 SoftVN: efficient memory protection via software-provided version numbers.
Muhammad Umar, Weizhe Hua, Zhiru Zhang, G. Edward Suh
2022 Software-hardware co-design for fast and scalable training of deep learning recommendation models.
Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, Yinbin Ma, Junjie Yang, Ellie Wen, Hong Li, Lin Yang, Chonglin Sun, Whitney Zhao, Dimitry Melts, Krishna Dhulipala, K. R. Kishore, Tyler Graf, Assaf Eisenman, Kiran Kumar Matam, Adi Gangidi, Guoqiang Jerry Chen, Manoj Krishnan, Avinash Nayak, Krishnakumar Nair, Bharath Muthiah, Mahmoud khorashadi, Pallab Bhattacharya, Petr Lapukhov, Maxim Naumov, Ajit Mathews, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, Vijay Rao
2022 TDGraph: a topology-driven accelerator for high-performance streaming graph processing.
Jin Zhao, Yun Yang, Yu Zhang, Xiaofei Liao, Lin Gu, Ligang He, Bingsheng He, Hai Jin, Haikun Liu, Xinyu Jiang, Hui Yu
2022 The Mozart reuse exposed dataflow processor for AI and beyond: industrial product.
Karthikeyan Sankaralingam, Tony Nowatzki, Vinay Gangadhar, Preyas Shah, Michael Davies, William Galliher, Ziliang Guo, Jitu Khare, Deepak Vijay, Poly Palamuttam, Maghawan Punde, Alex Tan, Vijay Thiruvengadam, Rongyi Wang, Shunmiao Xu
2022 Themis: a network bandwidth-aware collective scheduling policy for distributed training of DL models.
Saeed Rashidi, William Won, Sudarshan Srinivasan, Srinivas Sridharan, Tushar Krishna
2022 There's always a bigger fish: a clarifying analysis of a machine-learning-assisted side-channel attack.
Jack Cook, Jules Drean, Jonathan Behrens, Mengjia Yan
2022 Thermometer: profile-guided BTB replacement for data center applications.
Shixin Song, Tanvir Ahmed Khan, Sara Mahdizadeh-Shahri, Akshitha Sriraman, Niranjan K. Soundararajan, Sreenivas Subramoney, Daniel A. Jiménez, Heiner Litz, Baris Kasikci
2022 Tiny but mighty: designing and realizing scalable latency tolerance for manycore SoCs.
Marcelo Orenes-Vera, Aninda Manocha, Jonathan Balkind, Fei Gao, Juan L. Aragón, David Wentzlaff, Margaret Martonosi
2022 To PIM or not for emerging general purpose processing in DDR memory systems.
Alexandar Devic, Siddhartha Balakrishna Rai, Anand Sivasubramaniam, Ameen Akel, Sean Eilert, Justin Eno
2022 Training personalized recommendation systems from (GPU) scratch: look forward not backwards.
Youngeun Kwon, Minsoo Rhu
2022 Understanding data storage and ingestion for large-scale deep recommendation model training: industrial product.
Mark Zhao, Niket Agarwal, Aarti Basant, Bugra Gedik, Satadru Pan, Mustafa Ozdal, Rakesh Komuravelli, Jerry Pan, Tianshu Bao, Haowei Lu, Sundaram Narayanan, Jack Langman, Kevin Wilfong, Harsha Rastogi, Carole-Jean Wu, Christos Kozyrakis, Parik Pol
2022 X-cache: a modular architecture for domain-specific caches.
Ali Sedaghati, Milad Hakimi, Reza Hojabr, Arrvindh Shriraman
2022 XQsim: modeling cross-technology control processors for 10+K qubit quantum computers.
Ilkwon Byun, Junpyo Kim, Dongmoon Min, Ikki Nagaoka, Kosuke Fukumitsu, Iori Ishikawa, Teruo Tanimoto, Masamitsu Tanaka, Koji Inoue, Jangwoo Kim
2022 täkō: a polymorphic cache hierarchy for general-purpose optimization of data movement.
Brian C. Schwedock, Piratach Yoovidhya, Jennifer Seibert, Nathan Beckmann
2022 uBrain: a unary brain computer interface.
Di Wu, Jingjie Li, Zhewen Pan, Younghyun Kim, Joshua San Miguel