Time Zone



Day 1: Monday, October 19

9:45 AM EDT10:00 AM EDT: Opening Remarks

16:45 (EEST/Athens)

10:00 AM EDT11:00 AM EDT: Keynote by Rich Wolski

17:00 (EEST/Athens)

Rich Wolski headshot Abstract
The Internet of Things (IoT) is a rapidly approaching technological change that envisions ubiquitous and network-accessible digital instrumentation and actuation of literally every "thing" we encounter in everyday life. Like the World Wide Web (now simply called The Internet) before it, IoT will likely represent another societal sea change as objects in the physical world become network-enabled so that they can communicate and interact with people and, autonomously, with each other.

This technological vision also carries with it significant new challenges. With estimates of between 50 billion and 1 trillion network-connected IoT devices in the next 20 years, the energy efficiency of these devices and the network technologies that interconnect them is paramount to their utility. Moreover, the current Internet architecture, which is evolving to accommodate cloud computing, will require substantial additional innovation and augmentation before IoT will come to complete fruition.

In this talk, we will discuss some of the computer science research questions that have grown from early experiences in architecting and deploying working IoT systems and infrastructure. In particular, the talk will focus on potential new approach to software infrastructure that is designed to meet many of the current and future IoT challenges.

To save power, reduce network latency, and easy network congestion, devices export data and actuation services that are accessed by applications running in the cloud. "Flipping" the current Internet architecture in this way, with services at the extreme edge of the network and applications running at the core (i.e., in the cloud), requires a new technological approach that creates a Software Platform of Things -- SPOT -- spanning devices, computing elements located at the edge (e.g. edge clouds), and traditional cloud data centers.

We will outline our experiences in building and deploying IoT systems using this new "flipped" approach to cloud computing discuss the myriad of new research opportunities that arise as a result.


Bio
Dr. Rich Wolski is a Professor of Computer Science at the University of California, Santa Barbara (UCSB) where he holds the Duval Presidential Chair in Energy Efficiency. Having received his M.S. and Ph.D. degrees from the University of California at Davis (while a research scientist at Lawrence Livermore National Laboratory) he has also held positions at the University of California, San Diego, and the University of Tennessee, the the San Diego Supercomputer Center and Lawrence Berkeley National Laboratory. Rich has led several national scale research efforts in the area of distributed systems and is the progenitor of the Eucalyptus open source cloud project.

11:00 AM EDT12:00 PM EDT

18:00 (EEST/Athens)
11:00 AM EDT11:15 AM EDT
Graphene: Strong yet Lightweight Row Hammer Protection
Yeonhong Park, Woosuk Kwon, Eojin Lee, Tae Jun Ham, Jung Ho Ahn, Jae W. Lee (Seoul National University)

11:15 AM EDT11:30 AM EDT
Persist Level Parallelism: Streamlining Integrity Tree Updates for Secure Persistent Memory
Alexander Freij, Shougang Yuan, Huiyang Zhou (NC State University); Yan Solihin (University of Central Florida)

11:30 AM EDT11:45 AM EDT
PThammer: Cross-User-Kernel-Boundary Rowhammer through Implicit Accesses
Zhi Zhang (University of New South Wales and Data61, CSIRO, Australia); Yueqiang Cheng (Baidu Security); Dongxi Liu, Surya Nepal (Data61, CSIRO, Australia); Zhi Wang (Florida State University); Yuval Yarom (University of Adelaide and Data61, CSIRO, Australia)

11:45 AM EDT12:00 PM EDT
Draco: Architectural and Operating System Support for System Call Security
Dimitrios Skarlatos, Qingrong Chen, Jianyan Chen, Tianyin Xu, Josep Torrellas (University of Illinois at Urbana-Champaign)
11:00 AM EDT11:15 AM EDT
SuperNPU: An Extremely Fast Neural Processing Unit Using Superconducting Logic Devices
Koki Ishida (Kyushu University); Il-Kwon Byun (Seoul National University); Ikki Nagaoka (Nagoya University); Kosuke Fukumitsu (Kyushu University); Masamitsu Tanaka (Nagoya University); Satoshi Kawakami, Teruo Tanimoto, Takatsugu Ono (Kyushu University); Jangwoo Kim (Seoul National University); Koji Inoue (Kyushu University)

11:15 AM EDT11:30 AM EDT
Printed Machine Learning Classifiers
Muhammad Husnain Mubarik (UIUC); Dennis Weller (KIT); Nathaniel Bleier, Matthew Tomei (UIUC); Jasmin Aghassi-Hagmann, Mehdi B Tahoori (KIT); Rakesh Kumar (University of Illinois)

11:30 AM EDT11:45 AM EDT
Look-Up Table based Energy Efficient Processing in Cache Support for Neural Network Acceleration
Akshay Krishna Ramanathan (Pennsylvania State University); Gurpreet S Kalsi (Intel Corporation); Srivatsa Srinivasa (Intel Labs); Tarun Makesh Chandran (Pennsylvania State University); Kamlesh Pillai (Intel Labs); Om J Omer (Intel Corporation); Vijaykrishnan Narayanan (The Pennsylvania State University); Sreenivas Subramoney (Intel Labs)

11:45 AM EDT12:00 PM EDT
FReaC Cache: Folded Logic Reconfigurable Computing in the Last Level Cache
Ashutosh Dhar, Xiaohao Wang (University of Illinois, Urbana Champaign); Hubertus Franke, Jinjun Xiong (IBM Research); Jian Huang, Wen-mei Hwu, Nam Sung Kim, Deming Chen (University of Illinois, Urbana Champaign)
11:00 AM EDT11:15 AM EDT
BranchNet: A Convolutional Neural Network to Predict Hard-To-Predict Branches
Siavash Zangeneh Kamali, Stephen Pruett (The University of Texas at Austin); Sangkug Lym (Nvidia); Yale Patt (The University of Texas at Austin)

11:15 AM EDT11:30 AM EDT
CHiRP: Control-flow History Reuse Prediction
Samira Mirbagher Ajorpaz, Elba Garza (Texas A&M University); Gilles A Pokam (Intel); Daniel Jiménez (Texas A&M University)

11:30 AM EDT11:45 AM EDT
I-SPY: Context-Driven Conditional Instruction Prefetching with Coalescing
Tanvir Ahmed Khan, Akshitha Sriraman (University of Michigan); Joseph Devietti (University of Pennsylvania); Gilles Pokam (Intel Corporation); Heiner Litz (University of California, Santa Cruz); Baris Kasikci (University of Michigan)

11:45 AM EDT12:00 PM EDT
Improving the Utilization of Micro-operation Caches in x86 Processors
Jagadish Kotra, John Kalamatianos (AMD Research)

12:00 PM EDT1:00 PM EDT

19:00 (EEST/Athens)
12:00 PM EDT12:15 PM EDT
Virtualized Logical Qubits: A 2.5D Architecture for Error-Corrected Quantum Computing
Casey Duckering, Jonathan Baker, David Schuster, Fred Chong (University of Chicago)

12:15 PM EDT12:30 PM EDT
Optimized Quantum Compilation for Near-Term Algorithms with OpenPulse
Pranav Gokhale (University of Chicago); Ali Javadi, Nate Earnest (IBM); Yunong Shi, Fred Chong (University of Chicago)

12:30 PM EDT12:45 PM EDT
Systematic Crosstalk Mitigation for Superconducting Qubits via Frequency-Aware Compilation
Yongshan Ding, Pranav Gokhale, Sophia Fuhui Lin, Richard Rines, Thomas Propson, Fred Chong (University of Chicago)

12:45 PM EDT1:00 PM EDT
Circuit Compilation Methodologies for Quantum Approximate Optimization Algorithm
Md Mahabubul Alam, Abdullah Ash Saki, Swaroop Ghosh (Pennsylvania State University)
12:00 PM EDT12:15 PM EDT
Fast-BCNN: Massive Neuron Skipping in Bayesian Convolutional Neural Networks
Qiyu Wan, Xin Fu (University of Houston)

12:15 PM EDT12:30 PM EDT
Ptolemy: Architecture Support for Robust Deep Learning
Yiming Gan (University of Rochester); Yuxian Qiu, Jingwen Leng, Minyi Guo (Shanghai Jiao Tong University); Yuhao Zhu (University of Rochester)

12:30 PM EDT12:45 PM EDT
Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural Networks
Gil Shomron, Uri Weiser (Technion - Israel Institute of Technology)

12:45 PM EDT1:00 PM EDT
FIdelity: Efficient Resilience Analysis Framework for Deep Learning Accelerators
Yi He (The University of Chicago); Prasanna Balaprakash (Argonne National Laboratory); Yanjing Li (The University of Chicago)
12:00 PM EDT12:15 PM EDT
Bit-Exact ECC Recovery (BEER): Determining DRAM On-Die ECC Functions by Exploiting DRAM Data Retention Characteristics
Minesh Patel, Jeremie Kim, Taha Shahroodi, Hasan Hassan, Onur Mutlu (ETH Zurich)

12:15 PM EDT12:30 PM EDT
DStress: Automatic Synthesis of DRAM Reliability Stress Viruses using Genetic Algorithms
Lev Mukhanov (Queen's University Belfast); Dimitrios Nikolopoulos (Virginia Tech); Georgios Karakonstantis (Queen's University of Belfast)

12:30 PM EDT12:45 PM EDT
FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching
Yaohua Wang (National University of Defense Technology); Lois Orosa (ETH Zurich); Xiangjun Peng (The University of Nottingham Ningbo China); Yang Guo (National University of Defense Technology); Saugata Ghose (University of Illinois at Urbana–Champaign / Carnegie Mellon University); Minesh Patel (ETH Zurich); Jeremie Kim, Juan Gómez Luna (ETH Zurich); Mohammad Sadrosadati (Institute for Research in Fundamental Sciences); Nika Mansouri Ghiasi, Onur Mutlu (ETH Zurich)

12:45 PM EDT1:00 PM EDT
PerpLE: Improving the Speed and Effectiveness of Memory Consistency Testing
Themis Melissaris, Markos Markakis (Princeton University); Kelly Shaw (Williams College); Margaret Martonosi (Princeton University)

1:00 PM EDT2:30 PM EDT: Business Meeting

20:00 (EEST/Athens)


Day 2: Tuesday, October 20

10:00 AM EDT11:00 AM EDT: Keynote by Cristina Silvano

17:00 (EEST/Athens)

Cristina Silvano headshot Abstract
Exscalate (EXaSCale smArt pLatform Against paThogEns) is a drug-discovery platform that includes a "chemical library" of several hundred billions of molecules and a processing capacity in the order of millions of molecules per second. The platform represents a powerful tool to accelerate the in-silico computational phase of the development of new therapies, called virtual screening process, which needs to exploit at the best the underlying supercomputing resources. Exscalate platform is owned by Dompé Farmaceutici and developed thanks to a collaboration between Dompé, Cineca, and Politecnico di Milano. The Exscalate platform has already been used in 2019 in the context of the Antarex European project for the study of the Zika virus. This talk will describe the Exscalate platform currently used in the EXscalate4CoV Project funded by the European Commission’s Horizon 2020 programme for research and innovation to accelerate the search for drugs against the coronavirus and to fight future pandemics. In the race against the coronavirus, the platform can now count on the two most powerful supercomputers in Europe: the new HPC5 system installed in Italy at the energy company Eni and the Marconi-100 system installed at the Cineca Italian supercomputing center.

Exscalate4CoV European Project


Bio
Cristina Silvano is a Full Professor of Computer Engineering at the Politecnico di Milano. She is currently the Chair of the research area on Computer Science and Engineering and Vice-Chair of the PhD programme in Information Technology for Computer Science and Engineering at Politecnico di Milano. Her research focuses on computer architectures and electronic design automation focusing on design space exploration for energy-efficient computer architectures and application autotuning for manycores and HPC systems. She has been Project Coordinator of the European research projects ANTAREX, 2PARMA and MULTICUBE. She is currently responsible of the task force on High Performance Computing for the Exscalate4CoV H2020 European project on accelerating the virtual screening in the drug discovery process to fight pandemics. She is also responsible for the POLIMI research unit of the H2020 AI4DI European project on artificial intelligence for digitizing industry in collaboration with STMicroelectronics. She is an active member of the scientific community and she regularly serves in several international program committees. She is Associate Editor of the IEEE Trans. on Computers and the ACM Trans. on Architecture and Code Optimization. She served as Independent Expert Reviewer for the European Commission and for several science foundations. She is an IEEE Fellow and a member of the HiPEAC network.

11:00 AM EDT12:00 PM EDT: Student Research Competition Talks

18:00 (EEST/Athens)

12:00 PM EDT1:15 PM EDT

19:00 (EEST/Athens)
12:00 PM EDT12:15 PM EDT
CATCAM: Constant-time Alteration Ternary CAM with Scalable In-Memory Architecture
Dibei Chen, Zhaoshi Li (Tsinghua University); Tianzhu Xiong (Southeast University); Zhiwei Liu (Tsinghua University); Jun Yang (Southeast University); Shouyi Yin, Shaojun Wei, Leibo Liu (Tsinghua University)

12:15 PM EDT12:30 PM EDT
DUAL: Acceleration of Clustering Algorithms using Digital-based Processing In-Memory
Mohsen Imani, Saikishan Pampana, Saransh Gupta, Minxuan Zhou, Yeseong Kim, Tajana Rosing (University of California San Diego)

12:30 PM EDT12:45 PM EDT
Newton: A DRAM-maker's Accelerator-in-Memory (AiM) Architecture for Machine Learning
Mingxuan He (Purdue University); Choungki Song, Ilkon Kim, Chunseok Jeong, Seho Kim, Il Park (SK Hynix); Mithuna Thottethodi, T. N. Vijaykumar (Purdue University)

12:45 PM EDT1:00 PM EDT
AQUOMAN: An Analytic-Query Offloading Machine
Shuotao Xu (MIT); Thomas Bourgeat (MIT CSAIL); Tianhao Huang (MIT); Hojun Kim, Sungjin Lee (DGIST); Arvind (MIT)

1:00 PM EDT1:15 PM EDT
MOUSE: Inference In Non-volatile Memory for Energy Harvesting Applications
Salonik Resch, S. Karen Khatamifard, Zamshed Chowdhury, Masoud Zabihi, Zhengyang Zhao, Husrev Cilasun, Jian-Ping Wang, Sachin S. Sapatnekar, Ulya R. Karpuzcu (University of Minnesota)
12:00 PM EDT12:15 PM EDT
More with Less - Deriving More Translation Rules with Less Training Data for DBTs Using Parameterization
Jinhu Jiang, Rongchao Dong, Zhongjun Zhou, Changheng Song (Fudan University); Wenwen Wang (University of Georgia); Pen-Chung Yew (University of Minnesota at Twin Cities); Weihua Zhang (Fudan University)

12:15 PM EDT12:30 PM EDT
Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data
Jie Zhao (National Digital Switching System Engineering & Technological R&D Center, China); Peng Di (Huawei Technologies Co., Ltd.)

12:30 PM EDT12:45 PM EDT
DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates
Alex Renda, Yishen Chen, Charith Mendis (MIT CSAIL); Michael Carbin (MIT)

12:45 PM EDT1:00 PM EDT
Predicting Execution Times With Partial Simulations in Virtual Memory Research: Why and How
Mohammad Agbarya, Idan Yaniv (Technion); Jayneel Gandhi (VMware Research); Dan Tsafrir (Technion & VMware Research)

1:00 PM EDT1:15 PM EDT
gem5-SALAM: A System Architecture for LLVM-based Accelerator Modeling
Samuel Rogers, Joshua Slycord, Mohammadreza Baharani, Hamed Tabkhi (University of North Carolina Charlotte)
12:00 PM EDT12:15 PM EDT
Shaving Retries with Sentinels for Fast Read over High-Density 3D Flash
Qiao Li, Min Ye, Yufei Cui (Department of Computer Science, City University of Hong Kong); Liang Shi (School of Computer Science and Technology, East China Normal University); Xiaoqiang Li (YEESTOR Microelectronics Co., Ltd); Tei-Wei Kuo, Chun Jason Xue (Department of Computer Science, City University of Hong Kong)

12:15 PM EDT12:30 PM EDT
Characterizing and Modeling Non-Volatile Memory Systems
Zixuan Wang (UCSD); Xiao Liu (University of California, San Diego); Jian Yang (UC San Diego/Google); Theodore Michailidis (University of California, San Diego); Steven Swanson (UC San Diego); Jishen Zhao (UCSD)

12:30 PM EDT12:45 PM EDT
P-INSPECT: Architectural Support for Programmable Non-Volatile Memory Frameworks
Apostolos Kokolis (University of Illinois at Urbana-Champaign); Thomas Shull (University of Illinois at Urbana Champaign); Jian Huang, Josep Torrellas (University of Illinois at Urbana-Champaign)

12:45 PM EDT1:00 PM EDT
Unbounded Hardware Transactional Memory for a Hybrid DRAM/NVM Memory System
Jungi Jeong (Purdue University); Jaewan Hong, Seungryoul Maeng (KAIST); Changhee Jung (Purdue University); Youngjin Kwon (KAIST)

1:00 PM EDT1:15 PM EDT
(Almost) Fence-less Persist Ordering
Sara Mahdizadeh Shahri, Armin Vakil (Penn State University); Aasheesh Kolli (Penn State University and VMware Research)

1:15 PM EDT2:30 PM EDT

20:15 (EEST/Athens)
1:15 PM EDT1:30 PM EDT
Speculative Enforcement of Store Atomicity
Alberto Ros (University of Murcia); Stefanos Kaxiras (Uppsala University)

1:30 PM EDT1:45 PM EDT
Boosting Store Buffer Efficiency with Store-Prefetch Bursts
Juan M. Cebrian (University of Murcia); Stefanos Kaxiras (Uppsala University); Alberto Ros (University of Murcia)

1:45 PM EDT2:00 PM EDT
D-SOAP: Dynamic Spatial Orientation Affinity Prediction for Caching in Multi-Orientation Memory Systems
Minli Julie Liao, Jack Sampson (PSU)

2:00 PM EDT2:15 PM EDT
Pipette: Improving Core Utilization on Irregular Applications through Intra-Core Pipeline Parallelism
Quan Nguyen, Daniel Sanchez (MIT)

2:15 PM EDT2:30 PM EDT
RnR: A Software-Assisted Record-and-Replay Hardware Prefetcher
Chao Zhang, Yuan Zeng (Lehigh University); John Shalf (Lawrence Berkeley National Laboratory), Xiaochen Guo (Lehigh University)
1:15 PM EDT1:30 PM EDT
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
Sheng-Chun Kao, Geonhwa Jeong, Tushar Krishna (Georgia Institute of Technology)

1:30 PM EDT1:45 PM EDT
Gemini: Learning to Manage CPU Power for Latency-Critical Search Engines
Liang Zhou, Laxmi Bhuyan, K.K. Ramakrishnan (University of California, Riverside)

1:45 PM EDT2:00 PM EDT
CuttleSys: Data-Driven Resource Management for Interactive Services on Reconfigurable Multicores
Neeraj Kulkarni, Gonzalo Gonzalez-Pumariega, Amulya Khurana, Christina Delimitrou (Cornell University); Christine Shoemaker (Industrial Engineering and Management Department, National University of Singapore); David Albonesi (Cornell University)

2:00 PM EDT2:15 PM EDT
Jumanji: The Case for Dynamic NUCA in the Datacenter
Brian C. Schwedock. Nathan Beckmann (CMU)

2:15 PM EDT2:30 PM EDT
Planaria: Dynamic Architecture Fission for Spatial Multi-Tenant Acceleration of Deep Neural Networks
Soroush Ghodrati, Byung Hoon Ahn, Joon Kyung Kim, Sean Kinzer, Brahmendra Yatham, Navateja Alla (University of California, San Diego); Hardik Sharma (Bigstream); Mohammad Alian (University of Illinois, Urbana Champaign); Eiman Ebrahimi (NVIDIA Research); Nam Sung Kim (University of Illinois, Urbana Champaign); Cliff Young (Google); Hadi Esmaeilzadeh (University of California, San Diego)
1:15 PM EDT1:30 PM EDT
VR-DANN: Real-Time Video Recognition via Decoder-Assisted Neural Network Acceleration
Zhuoran Song, Feiyang Wu, Xueyuan Liu, Jing Ke, Naifeng Jing, Xiaoyao Liang (Shanghai Jiao Tong University)

1:30 PM EDT1:45 PM EDT
Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training
Dingqing Yang, Amin Ghasemazar, Xiaowei Ren (University of British Columbia); Maximilian Golub (Microsoft); Guy Lemieux, Mieszko Lis (University of British Columbia)

1:45 PM EDT2:00 PM EDT
Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores
Hyeonjin Kim, Sungwoo Ahn (Yonsei University); Yunho Oh (EcoCloud, EPFL); Bogil Kim, Won Woo Ro, William Song (Yonsei University)

2:00 PM EDT2:15 PM EDT
DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture
Liu Liu, Zheng Qu, Lei Deng, Fengbin Tu, Shuangchen Li, Xing Hu (University of California, Santa Barbara); Zhenyu Gu (Alibaba Inc.); Yufei Ding, Yuan Xie (University of California, Santa Barbara)


Day 3: Wednesday, October 21

10:00 AM EDT11:00 AM EDT: Keynote by Srilatha (Bobbie) Manne

17:00 (EEST/Athens)

Bobbie Manne headshot Abstract
Sustainability has become a key driver of business decisions for many corporations. All major information technology companies have committed to reducing their environmental impact and moving towards a more sustainable future. Data centers themselves are an integral part of the modern IT infrastructure, and their rapid growth now and into the future make them a significant contributor to carbon, water and waste issues. In addition, the demands that the data centers place on the global ecosystem impact the entire planet.

We will examine the sustainability benefits and challenges of modern data centers from the building infrastructure to the architecture of the servers, and from the carbon emissions resulting from the manufacturing and use of the systems to their eventual waste disposal. Although the talk focuses on data centers, many of the topics and challenges are applicable to other computer products and designs. This talk will provide inspiration for the academic community to incorporate sustainability as a primary objective (first class citizen?) in how we design and use current and future systems.


Bio
Srilatha (Bobbie) Manne has worked in the computer industry for over two decades in both industrial labs and product teams at Compaq, Intel, AMD and Cavium. She is currently a Principal Hardware Engineer in the Azure Hardware Systems and Infrastructure group at Microsoft. Her work has focused on power and performance analysis from processor microarchitecture to data centers. Srilatha has continued to publish while in industry, and has over 20 patents granted. She served as the General Chair for ISCA 2019 and has served on numerous program committees throughout the years. Her latest passion is collaborating with, and learning from, an interdisciplinary group of engineers and technical experts on sustainability topics centered around data centers. She currently lives in Seattle with her husband and two children.

11:00 AM EDT12:00 PM EDT

18:00 (EEST/Athens)

Best Paper Award


Student Research Competition Winner


MICRO Test-of-Time Award

12:00 PM EDT1:15 PM EDT

19:00 (EEST/Athens)
12:00 PM EDT12:15 PM EDT
TFE: Energy-efficient Transferred Filter-based Engine to Compress and Accelerate Convolutional Neural Networks
Huiyu Mo, Leibo Liu, Wenjing Hu, Wenping Zhu (Tsinghua University); Qiang Li (Intel Corporation); Ang Li, Shouyi Yin (Tsinghua University); Jian Chen (Aliababa Groups); Xiaowei Jiang (Alibaba Group); Shaojun Wei (Tsinghua University)

12:15 PM EDT12:30 PM EDT
MatRaptor: A Sparse-Sparse Matrix Multiplication Accelerator Based on Row-Wise Product
Nitish Srivastava, Hanchen Jin, Jie Liu, David Albonesi (Cornell University); Zhiru Zhang (Cornell University)

12:30 PM EDT12:45 PM EDT
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training
Mostafa Mahmoud, Isak Edo Vivancos, Ali Hadi Zadeh, Omar Mohamed Awad, Gennady Pekhimenko (University of Toronto); Jorge Albericio (Cerebras); Andreas Moshovos (University of Toronto)

12:45 PM EDT1:00 PM EDT
SAVE: Sparsity-Aware Vector Engine for Accelerating DNN Training and Inference on CPUs
Zhangxiaowen Gong, Houxiang Ji, Christopher Fletcher (University of Illinois at Urbana-Champaign); Christopher Hughes, Sara Baghsorkhi (Intel); Josep Torrellas (University of Illinois at Urbana-Champaign)

1:00 PM EDT1:15 PM EDT
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi Zadeh, Isak Edo, Omar Mohamed Awad, Andreas Moshovos (University of Toronto)
12:00 PM EDT12:15 PM EDT
TrainBox: An Extreme-Scale Neural Network Training Server Architecture by Systematically Balancing Operations
Pyeongsu Park, Heetaek Jeong, Jangwoo Kim (Seoul National University)

12:15 PM EDT12:30 PM EDT
Coordinated Priority-aware Charging of Distributed Batteries in Oversubscribed Data Centers
Sulav Malla (University of South Florida, Facebook Inc.); Qingyuan Deng, Zoh Ebrahimzadeh, Joe Gasperetti, Sajal Jain, Parimala Kondety, Thiara Ortiz, Debra Vieira (Facebook Inc.)

12:30 PM EDT12:45 PM EDT
HyperPlane: A Scalable Low-latency Notification Accelerator for Software Data Planes
Amirhossein Mirhosseini, Hossein Golestani, Thomas Wenisch (University of Michigan)

12:45 PM EDT1:00 PM EDT
ThymesisFlow: A Software-Defined, Hw/Sw co-Designed Interconnect Stack for Rack-Scale Memory Disaggregation
Christian Pinto, Dimitris Syrivelis, Michele Gazzetti, Panos Koutsovasilis, Andrea Reale, Kostas Katrinis (IBM Research Europe); Peter Hofstee (IBM Systems)

1:00 PM EDT1:15 PM EDT
A Benchmarking Framework for Interactive 3D Applications in the Cloud
Tianyi Liu, Sen He, Sunzhou Huang, Danny Tsang (The University of Texas at San Antonio); Lingjia Tang, Jason Mars (University of Michigan); Wei Wang (The University of Texas at San Antonio)
12:00 PM EDT12:15 PM EDT
A Locality-Aware Energy-Efficient Accelerator for Graph Mining Applications
Pengcheng Yao, Long Zheng, Zhen Zeng, Yu Huang, Chuangyi Gui, Xiaofei Liao, Hai Jin (Huazhong University of Science and Technology); Jingling Xue (University of New South Wales)

12:15 PM EDT12:30 PM EDT
GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph Processing
Shafiur Rahman, Nael Abu-Ghazaleh, Rajiv Gupta (University of California, Riverside)

12:30 PM EDT12:45 PM EDT
AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing
Tong Geng (Boston University); Ang Li (Pacific Northwest National Laboratory); Runbin Shi (The University of Hong Kong); Chunshu Wu, Tianqi Wang (Boston University); Yanfei Li (Zhejiang University); Pouya Haghi (Boston University); Antonino Tumeo (Pacific Northwest National Laboratory); Shuai Che, Steve Reinhardt (Microsoft); Martin Herbordt (Boston University)

12:45 PM EDT1:00 PM EDT
SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal Space
Daichi Fujiki, Shunhao Wu, Nathan Ozog, Kush Goliya, David Blaauw, Satish Narayanasamy, Reetuparna Das (University of Michigan)

1:00 PM EDT1:15 PM EDT
GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis
Damla Senol Cali (Carnegie Mellon University); Gurpreet S. Kalsi (Intel); Zülal Bingöl (Bilkent University); Lavanya Subramanian (Intel); Can Firtina, Jeremie Kim (ETH Zurich); Rachata Ausavarungnirun (King Mongkut's University of Technology North Bangkok); Mohammed Alser (ETH Zurich); Anant Nori (Intel); Juan Gómez Luna (ETH Zurich); Amirali Boroumand, Allison Scibisz (Carnegie Mellon University); Sreenivas Subramoney (Intel); Can Alkan (Bilkent University); Saugata Ghose (University of Illinois at Urbana–Champaign / Carnegie Mellon University); Onur Mutlu (ETH Zurich)

1:15 PM EDT2:30 PM EDT

20:15 (EEST/Athens)
1:15 PM EDT1:30 PM EDT
Selective Replication in Memory-Side GPU Caches
Xia Zhao (Chinese Academy of Military Science, China); Magnus Jahre (Norwegian University of Science and Technology); Lieven Eeckhout (Ghent University, Belgium)

1:30 PM EDT1:45 PM EDT
Deterministic Atomic Buffering
Yuan Hsi Chou, Christopher Ng, Shaylin Cattell (University of British Columbia); Jeremy Intan (University of Wisconsin-Madison); Matthew Sinclair (University of Wisconsin, AMD Research); Joseph Devietti (University of Pennsylvania); Timothy G. Rogers (Purdue University); Tor Aamodt (University of British Columbia)

1:45 PM EDT2:00 PM EDT
BOW: Breathing Operand Windows to Exploit Bypassing in GPUs
Hodjat Asghari Esfeden, Amirali Abdolrashidi (University of California, Riverside); Shafiur Rahman (University of California Riverside); Daniel Wong (University of California, Riverside); Nael Abu-Ghazaleh (University of California Riverside)

2:00 PM EDT2:15 PM EDT
MDM: The GPU Memory Divergence Model
Lu Wang (Ghent University); Magnus Jahre (Norwegian University of Science and Technology); Almutaz Adileh (Huawei); Lieven Eeckhout (Ghent University)

2:15 PM EDT2:30 PM EDT
Locality-Centric Data and Threadblock Management for Massive GPUs
Mahmoud Khairy, Vadim Nikiforov (Purdue University); David Nellans (NVIDIA); Timothy G. Rogers (Purdue University)
1:15 PM EDT1:30 PM EDT
Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation
Yu Feng, Boyuan Tian, Tiancheng Xu (University of Rochester); Paul Whatmough (Arm Research); Yuhao Zhu (University of Rochester)

1:30 PM EDT1:45 PM EDT
FlexWatts: A Power- and Workload-Aware Hybrid Power Delivery Network for Energy-Efficient Microprocessors
Jawad Haj-Yihia, Mohammed Alser, Lois Orosa, Jeremie Kim (ETH Zurich); Efraim Rotem (Intel); Avi Mendelson (Technion); Anupam Chattopadhyay (NTU Singapore); Onur Mutlu (ETH Zurich, Carnegie Mellon University)

1:45 PM EDT2:00 PM EDT
Building the Computing System for Autonomous Micromobility Vehicles: Design Constraints and Architectural Optimizations
Bo Yu, Wei Hu, Leimeng Xu (PerceptIn); Jie Tang (South China University of Technology); Shaoshan Liu (PerceptIn); Yuhao Zhu (University of Rochester)

2:00 PM EDT2:15 PM EDT
AutoScale: Energy Efficiency Optimization for Stochastic Edge Inference Using Reinforcement Learning
Young Geun Kim (Arizona State University); Carole-Jean Wu (Facebook)

2:15 PM EDT2:30 PM EDT
NCPU: An Embedded Neural CPU Architecture on Resource-Constrained Low Power Devices for Real-time End-to-End Performance
Tianyu Jia, Yuhao Ju, Russ Joseph, Jie Gu (Northwestern University)
1:15 PM EDT1:30 PM EDT
CaSA: End-to-end Quantitative Security Analysis of Randomly Mapped Caches
Thomas Bourgeat (MIT CSAIL); Jules Drean, Yuheng Yang, Lillian Tsai (MIT); Joel Emer (NVIDIA/MIT); Mengjia Yan (MIT)

1:30 PM EDT1:45 PM EDT
PerSpectron: Detecting Invariant Footprints of Microarchitectural Attacks with Perceptron
Samira Mirbagher-Ajorpaz (Texas A&M University); Gilles A Pokam (Intel); Esmaeil Mohammmadian Koruyeh (UCR); Elba Garza (Texas A&M University); Nael Abu-Ghazaleh (University of California Riverside); Daniel Jiménez (Texas A&M University)

1:45 PM EDT2:00 PM EDT
Speculation Invariance (InvarSpec): Faster Safe Execution Through Program Analysis
Zirui Zhao, Houxiang Ji, Mengjia Yan, Jiyong Yu, Christopher W. Fletcher (University of Illinois at Urbana-Champaign); Adam Morrison (Tel Aviv University); Darko Marinov, Josep Torrellas (University of Illinois at Urbana-Champaign)

2:00 PM EDT2:15 PM EDT
Hardware-based Always-On Heap Memory Safety
Yonghae Kim (Georgia Tech); Jaekyu Lee (Arm Research); Kim Hyesoon (Georgia Tech)

2:30 PM EDT2:45 PM EDT: Closing Remarks

21:30 (EEST/Athens)