The co-located conferences will feature three keynotes, one from each conference.

Our three keynote speakers are:

Monday, March 3, 2025

Speaker: Charles E. Leiserson

Title: Setting a Course for Post-Moore Software Performance

Abstract: Software performance engineering is the science and art of making code run fast or otherwise limiting its consumption of resources, such as energy, memory footprint, network utilization, response time, etc. Performance engineering encompasses parallel computing, but it also includes other techniques, such as caching, vectorization, algorithms, bit tricks, loop unrolling, compiler-switch selection, tailoring code to the architecture, exploiting sparsity, changing data representation, metaprogramming, etc. I will explain why the end of Moore’s Law makes software performance engineering a critical technical skill for the future. I will also argue that the PPoPP community is ideally positioned to show leadership in SPE and that it would be wise to change the meaning of its acronym to “Principles and Practice of Performance Programming.”

Bio: Charles E. Leiserson is Edwin Sibley Webster Professor of Computer Science and Engineering in MIT’s Department of Electrical Engineering and Computer Science (EECS) and a member and former Associate Director of MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL). He received a B.S. in computer science and mathematics from Yale University in 1975 and a Ph.D. in computer science from Carnegie Mellon University in 1981. He currently serves as the MIT Faculty Director of the Department of the Air Force-MIT AI Accelerator and leads its Fast AI project. His award-winning research on algorithms, parallel computing, and software performance engineering has been widely deployed in industry. He held the position of Director of System Architecture for the MIT-spinoff Akamai Technologies, and he founded Cilk Arts, Inc., a multicore-software start-up acquired by Intel. He was the network architect for the Connection Machine CM-5, the world’s most powerful computer in 1993. He coauthored the influential textbook Introduction to Algorithms, which has sold over one million copies. Leiserson is a Fellow of four professional societies—ACM, AAAS, SIAM, and IEEE—and he is a member of the National Academy of Engineering.


Tuesday, March 4, 2025

Speaker: John Regehr

Title: Do We Really Want Correct Compilers?

Abstract: If we wanted correct compilers, we could have them already. But that’s not all we want: we also want compilers that are really good at code generation, that run quickly, that keep up with complicated language standards, and more. The actual problem is as much about economic choices as it is about software engineering and formal methods. This talk will explore these issues and also look into some ways that we might be able to change the economic fundamentals that underlie compiler development.

Bio: John Regehr is a professor of Computer Science at the University of Utah. He is currently most interested in using program synthesis and data-driven methods to automatically generate as many parts of compilers as possible, but he is also interested in improving legacy compilers so they optimize better and contain fewer bugs.


Wednesday, March 5, 2025

Speaker: Cliff Young

Title: ML Engineering in the Giant Model Age

Abstract: It’s been over a decade since the AlexNet breakthrough, which revitalized and invigorated all of the fields of our three conferences (reductively: architecture, compilation, and whole-stack parallelization). In pursuit of AGI and in search of business models that use ML, we are reaching for unprecedented scale, inventing new algorithms, and delivering scientific and systematic breakthroughs. I’ll revisit some of the key algorithmic building blocks (matrix multiplication and attention) with a system-builder’s view, and I’ll use TPUs to illustrate the scale and reliability problems faced by today’s ML training systems. I will draw on my experiences in building special-purpose, hardware/software/algorithmic systems to challenge our three communities, particularly in the dialogue between special- and general-purpose hardware, the requirements of compiler design, and the huge opportunities for parallel algorithms and systems.

Bio: Cliff Young is a Software Engineer in Google DeepMind, where he works on codesign for deep learning accelerators. He is one of the designers of Google’s Tensor Processing Unit (TPU) and one of the founders of the MLPerf benchmark. Previously, Cliff built special-purpose supercomputers for molecular dynamics at D. E. Shaw Research and was a Member of Technical Staff at Bell Labs. Cliff holds AB, MS, and PhD degrees in computer science from Harvard University. Cliff is a member of ACM and IEEE.


Mon 3 Mar

Displayed time zone: Pacific Time (US & Canada)

09:30 - 10:00
Break and Reconfiguration
Break / Main Conference
10:00 - 11:00
Distinguished Papers
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Christophe Dubach McGill University
10:00
20m
Talk
Synthesis of Sorting Kernels
Main Conference
Marcel Ullrich Saarland University, Saarland Informatics Campus, Sebastian Hack Saarland University, Saarland Informatics Campus
10:20
20m
Talk
Tensorize: Fast Synthesis of Tensor Programs from Legacy Code using Symbolic Tracing, Sketching and Solving
Main Conference
Alexander Brauckmann University of Edinburgh, Luc Jaulmes University of Edinburgh, United Kingdom, José Wesley De Souza Magalhães University of Edinburgh, Elizabeth Polgreen University of Edinburgh, Michael F. P. O'Boyle University of Edinburgh
10:40
20m
Talk
Enhancing Deployment-time Predictive Model Robustness for Code Analysis and Optimization
Main Conference
Huanting Wang University of Leeds, Patrick Lenihan University of Leeds, Zheng Wang University of Leeds
11:20 - 12:20
Optimizations & Transformations (1)
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Oleksandr Zinenko n/a
11:20
20m
Talk
SySTeC: A Symmetric Sparse Tensor Compiler
Main Conference
Radha Patel Massachusetts Institute of Technology, Willow Ahrens Massachusetts Institute of Technology, Saman Amarasinghe Massachusetts Institute of Technology
11:40
20m
Talk
Pattern Matching in AI Compilers and its Formalization
Main Conference
Joseph W. Cutler University of Pennsylvania, Alexander Collins NVIDIA, Bin Fan NVIDIA, Mahesh Ravishankar, Vinod Grover NVIDIA
12:00
20m
Talk
Scalar Interpolation: A Better Balance Between Vector and Scalar Execution for SuperScalar Architectures
Main Conference
Reza Ghanbari University of Alberta, Henry Kao Huawei Technologies Canada, João P. L. De Carvalho AMD, Ehsan Amiri Huawei Technologies Canada, Jose Nelson Amaral University of Alberta
14:00 - 15:20
ML Tools & Optimization
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Jeronimo Castrillon TU Dresden, Germany
14:00
20m
Talk
VEGA: Automatically Generating Compiler Backends Using a Pre-Trained Transformer Model
Main Conference
Ming Zhong SKLP, Institute of Computing Technology, CAS, Fang Lv Institute of Computing Technology, Chinese Academy of Sciences, Lulin Wang SKLP, ICT, CAS Beijing, China, Lei Qiu SKLP, Institute of Computing Technology, CAS; University of Chinese Academy of Sciences, Yingying Wang SKLP, ICT, CAS Beijing, China, Ying Liu Institute of Computing Technology, Chinese Academy of Sciences, Huimin Cui Institute of Computing Technology, Chinese Academy of Sciences, Xiaobing Feng ICT CAS, Jingling Xue UNSW Sydney
14:20
20m
Talk
IntelliGen: Instruction-Level Auto-Tuning for Tensor Program with Monotonic Memory Optimization
Main Conference
Zixuan Ma Tsinghua University, Haojie Wang Tsinghua University, Jingze Xing Tsinghua University, Shuhong Huang Tsinghua University, Liyan Zheng Tsinghua University, Chen Zhang Tsinghua University, Huanqi Cao Tsinghua University, Kezhao Huang Tsinghua University, Mingshu Zhai Tsinghua University, Shizhi Tang Tsinghua University, Penghan Wang Tsinghua University, Jidong Zhai Tsinghua University
14:40
20m
Talk
GraalNN: Context-Sensitive Static Profiling with Graph Neural Networks
Main Conference
Lazar Milikic Oracle Labs, Milan Cugurovic Oracle Labs, Vojin Jovanovic Oracle Labs
15:00
20m
Talk
LLM-Vectorizer: LLM-based Verified Loop Vectorizer
Main Conference
Jubi Taneja Microsoft Research, Avery Laird University of Toronto, Cong Yan Microsoft Research, Madan Musuvathi Microsoft Research, Shuvendu K. Lahiri Microsoft Research
15:40 - 16:40
Architectures & Code Generation
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Yongjun Park Yonsei University
15:40
20m
Talk
Calibro: Compilation-Assisted Linking-Time Binary Code Outlining for Code Size Reduction in Android Applications
Main Conference
Zhanhao Liang Wuhan University, Hanming Sun Wuhan University, Wenhan Shang Wuhan University, YUAN Mengting School of Computer Science, Wuhan University, Wuhan, China, Jingqin Fu Wuhan Broadcasting and Television Station, Jiang Ma OPPO Electronics Corp., Jason Xue MBZUAI, Qingan Li Wuhan University, China
16:00
20m
Talk
A Multi-Level Compiler Backend for Accelerated Micro-Kernels Targeting RISC-V ISA Extensions
Main Conference
Alexandre Lopoukhine University of Cambridge, Federico Ficarelli Cineca, Christos Vasiladiotis University of Edinburgh, Anton Lydike The University of Edinburgh, Josse Van Delm KU Leuven, Alban Dutilleul ENS Rennes, Luca Benini ETH Zurich, Switzerland, Marian Verhelst KU Leuven, Tobias Grosser University of Cambridge, UK
16:20
20m
Talk
xDSL: Sidekick Compilation for SSA-Based Compilers
Main Conference
Mathieu Fehr The University of Edinburgh, Michel Weber ETH Zurich, Christian Ulmann ETH Zurich, Alexandre Lopoukhine University of Cambridge, Martin Lücke University of Edinburgh, Theo Degioanni ENS Rennes, Christos Vasiladiotis University of Edinburgh, Michel Steuwer Technische Universität Berlin, Tobias Grosser University of Cambridge, UK
15:40 - 16:40
ML Compilers
Main Conference at Willow (Level 2)
Chair(s): William S. Moses University of Illinois Urbana-Champaign
15:40
20m
Talk
ANT-ACE: An FHE Compiler Framework for Automating Neural Network Inference
Main Conference
Long Li Ant Group, Jianxin Lai Ant Group, Peng Yuan Ant Group, Tianxiang Sui Ant Group, Yan Liu Ant Group, Qing Zhu Ant Group, Xiaojing Zhang Ant Group, Linjie Xiao Ant Group, Wenguang Chen Tsinghua University; Pengcheng Laboratory, Jingling Xue UNSW Sydney
16:00
20m
Talk
CUrator: An Efficient LLM Execution Engine with Optimized Integration of CUDA Libraries
Main Conference
Yoon Noh Lee Yonsei University, Yongseung Yu Yonsei University, Yongjun Park Yonsei University
16:20
20m
Talk
Accelerating LLMs using an Efficient GEMM library and Target-aware Optimizations on Real-world PIM Devices
Main Conference
Hyeoncheol Kim Yonsei University, Taehoon Kim Rebellions Inc, Taehyeong Park Yonsei University, Donghyeon Kim Hanyang University, Yongseung Yu Yonsei University, Hanjun Kim Yonsei University, Yongjun Park Yonsei University
17:00 - 18:00
17:00
20m
Talk
DialEgg: Dialect-Agnostic MLIR Optimizer using Equality Saturation with Egglog
Main Conference
Abd-El-Aziz Zayed McGill University, Christophe Dubach McGill University
17:20
20m
Talk
Combining MLIR Dialects with Domain-Specific Architecture for Efficient Regular Expression Matching
Main Conference
Andrea Somaini Politecnico di Milano, Filippo Carloni Politecnico di Milano, Giovanni Agosta Politecnico di Milano, Italy, Marco D. Santambrogio Politecnico di Milano, Davide Conficconi Politecnico di Milano
17:40
20m
Talk
The MLIR Transform Dialect - Your compiler is more powerful than you think
Main Conference
Martin Lücke University of Edinburgh, Michel Steuwer Technische Universität Berlin, Albert Cohen Google DeepMind, William S. Moses University of Illinois Urbana-Champaign, Alex Zinenko Google DeepMind
17:00 - 17:40
Quantum Computing (1)
Main Conference at Willow (Level 2)
Chair(s): Sunita Chandrasekaran University of Delaware, USA
17:00
20m
Talk
Synthesis of Quantum Simulators by Compilation
Main Conference
Meisam Tarabkhah University of Edinburgh, Mahshid Delavar University of Sheffield, Mina Doosti University of Edinburgh, Amir Shaikhha University of Edinburgh
17:20
20m
Talk
Weaver: A Retargetable Compiler Framework for FPQA Quantum Architectures
Main Conference
Oğuzcan Kırmemiş TU Munich, Francisco Romao TU Munich, Emmanouil Giortamis TU Munich, Pramod Bhatotia TU Munich, Germany
18:00 - 19:30
19:30 - 21:00
Steering Committee Meeting
Main Conference at Casuarina Ballroom (Level 2)

Tue 4 Mar

08:30 - 09:30
CGO Keynote
Keynotes
08:30
60m
Keynote
Do We Really Want Correct Compilers?
Keynotes
John Regehr University of Utah
09:30 - 10:00
Break and Reconfiguration
Break / Main Conference
10:00 - 11:00
Program Analysis & Synthesis
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Jose Nelson Amaral University of Alberta
10:00
20m
Talk
Automatic Synthesis of Specialized Hash Functions
Main Conference
Renato B Hoffmann PUC-RS, Leonardo Gibrowski Faé PUC-RS, Dalvan Griebler Pontifícia Universidade Católica do Rio Grande do Sul - PUCRS, David Li Google, Fernando Magno Quintão Pereira Federal University of Minas Gerais
10:20
20m
Talk
Stack Filtering: Elevating Precision and Efficiency in Rust Pointer Analysis
Main Conference
Wei Li UNSW, Dongjie He Chongqing University, China, Wenguang Chen Tsinghua University; Pengcheng Laboratory, Jingling Xue UNSW Sydney
10:40
20m
Talk
SkipFlow: Improving the Precision of Points-to Analysis using Primitive Values and Predicate Edges
Main Conference
David Kozak Brno University of Technology & Oracle Labs, Christian Wimmer Amazon Web Services, Codrut Stancu Oracle Labs, Tomas Vojnar Masaryk University
10:00 - 11:00
Safety & Resilience
Main Conference at Willow (Level 2)
Chair(s): Fabrice Rastello University Grenoble Alpes - Inria - CNRS - Grenoble INP - LIG
10:00
20m
Talk
FastFlip: Compositional SDC Resiliency Analysis
Main Conference
Keyur Joshi University of Illinois at Urbana-Champaign, Rahul Singh University of Illinois Urbana-Champaign, Tommaso Bassetto University of Illinois Urbana-Champaign, Sarita Adve University of Illinois at Urbana-Champaign, Darko Marinov University of Illinois at Urbana-Champaign, Sasa Misailovic University of Illinois at Urbana-Champaign
10:20
20m
Talk
MTE4JNI: A Memory Tagging Method to Protect Java Heap Memory from Illicit Native Code Access
Main Conference
Huinan Chen Wuhan University, Jiang Ma OPPO Electronics Corp., Jason Xue MBZUAI, Qingan Li Wuhan University, China
10:40
20m
Talk
Compiler-Based Memory Safety Instrumentations in Practice: Usability, Performance, and Security Guarantees
Main Conference
Tina Jung Saarland Informatics Campus, Saarland University, Fabian Ritter Saarland University, Germany, Sebastian Hack Saarland University, Saarland Informatics Campus
11:20 - 12:20
Optimizations & Transformations (2)
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Sebastian Hack Saarland University, Saarland Informatics Campus
11:20
20m
Talk
PreFix: Optimizing the Performance of Heap-Intensive Applications
Main Conference
Chaitanya Mamatha Ananda University of California Riverside, Rajiv Gupta University of California at Riverside (UCR), Sriraman Tallam Google Inc., Han Shen Google Inc, David Li Google
11:40
20m
Talk
A Priori Loop Nest Normalization: Automatic Loop Scheduling in Complex Applications
Main Conference
Lukas Trümper Daisytuner, Philipp Schaad ETH Zurich, Berke Ates ETH Zurich, Alexandru Calotoiu ETH Zurich, Marcin Copik ETH Zurich, Torsten Hoefler ETH Zurich
12:00
20m
Talk
An Efficient Polynomial Multiplication Derived Implementation Of Convolution in Neural Networks
Main Conference
Haoke Xu University of Delaware, Yulin Zhang Minzu University of China, Zitong Cheng University of Delaware, Xiaoming Li University of Delaware
11:20 - 12:20
Quantum Computing and Backends
Main Conference at Willow (Level 2)
Chair(s): Davide Conficconi Politecnico di Milano
11:20
20m
Talk
ASDF: A Compiler for Qwerty, a Basis-Oriented Quantum Programming Language
Main Conference
Austin J. Adams Georgia Institute of Technology, Sharjeel Khan Google, Arjun Bhamra Georgia Institute of Technology, Ryan Abusaada Georgia Institute of Technology, Anthony Cabrera Oak Ridge National Laboratory, Cameron Hoechst Georgia Institute of Technology, Travis S. Humble Oak Ridge National Laboratory, Jeffrey Young Georgia Institute of Technology, Thomas Conte Georgia Institute of Technology
11:40
20m
Talk
Qubit Movement-Optimized Program Generation on Zoned Neutral Atom Processors
Main Conference
Enhyeok Jang Yonsei University, Youngmin Kim Yonsei University, Hyungseok Kim Yonsei University, Seungwoo Choi Yonsei University, Yipeng Huang Rutgers University, Won Woo Ro Yonsei University
12:00
20m
Talk
Honey Potion: an eBPF Backend for Elixir
Main Conference
Kael Soares Augusto UFMG, Vinícius Pacheco Cadence, Marcos A. M. Vieira Universidade Federal de Minas Gerais, Rodrigo G. Ribeiro Federal University of Ouro Preto, Fernando Magno Quintão Pereira Federal University of Minas Gerais
14:00 - 15:00
14:00
20m
Talk
Code Generation for Cryptographic Kernels Using Multi-word Modular Arithmetic on GPU
Main Conference
Naifeng Zhang Carnegie Mellon University, Franz Franchetti Carnegie Mellon University, USA
14:20
20m
Talk
CuAsmRL: optimizing GPU SASS schedules via deep reinforcement learning
Main Conference
Guoliang He University of Cambridge, Eiko Yoneki University of Cambridge
14:40
20m
Talk
Proteus: Portable Runtime Optimization of GPU Kernel Execution with Just-In-Time Compilation
Main Conference
Giorgis Georgakoudis Lawrence Livermore National Laboratory, Konstantinos Parasyris Lawrence Livermore National Laboratory, David Beckingsale Lawrence Livermore National Laboratory
15:20 - 17:00
Security, Fault Tolerance & Cryptography
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Fernando Magno Quintão Pereira Federal University of Minas Gerais
15:20
20m
Talk
Qiwu: Exploiting Ciphertext-Level SIMD Parallelism in Homomorphic Encryption Programs
Main Conference
Zhang Zhongcheng Institute of Computing Technology at Chinese Academy of Sciences; University of Chinese Academy of Sciences; Zhongguancun Laboratory, Ying Liu Institute of Computing Technology, Chinese Academy of Sciences, Yuyang Zhang Institute of Computing Technology at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Zhenchuan Chen Institute of Computing Technology, Chinese Academy of Sciences, Jiacheng Zhao Institute of Computing Technology at Chinese Academy of Sciences; University of Chinese Academy of Sciences; Zhongguancun Laboratory, Xiaobing Feng ICT CAS, Huimin Cui Institute of Computing Technology, Chinese Academy of Sciences, Jingling Xue UNSW Sydney
15:40
20m
Talk
Cage: Hardware-Accelerated Safe WebAssembly
Main Conference
Martin Fink Technical University of Munich, Dimitrios Stavrakakis TU Munich and University of Edinburgh, Dennis Sprokholt TU Delft, Soham Chakraborty TU Delft, Jan-Erik Ekberg Huawei Technologies LLC, Pramod Bhatotia TU Munich, Germany
16:00
20m
Talk
Teapot: Efficiently Uncovering Spectre Gadgets in COTS Binaries
Main Conference
Fangzheng Lin Institute of Science Tokyo, Zhongfa Wang Institute of Science Tokyo, Hiroshi Sasaki Institute of Science Tokyo
16:20
20m
Talk
Janitizer: Rethinking Binary Tools for Practical and Comprehensive Security
Main Conference
Mahwish Arif University of Cambridge, Sam Ainsworth University of Edinburgh, Timothy M. Jones University of Cambridge
16:40
20m
Talk
Parallaft: Runtime-based CPU Fault Tolerance via Heterogeneous Parallelism
Main Conference
Boyue Zhang University of Cambridge, Sam Ainsworth University of Edinburgh, Lev Mukhanov Queen Mary University London, Timothy M. Jones University of Cambridge

Wed 5 Mar

08:30 - 09:30
HPCA Keynote
Keynotes
08:30
60m
Keynote
ML Engineering in the Giant Model Age
Keynotes
Cliff Young Google DeepMind

09:30 - 10:00
Break and Reconfiguration
Break / Main Conference
10:00 - 11:20
Optimizations & Transformations (3)
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Michel Steuwer Technische Universität Berlin
10:00
20m
Talk
Postiz: Extending Post-Increment Addressing for Loop Optimization and Code Size Reduction
Main Conference
Enming Fan, Xiaofeng Guan Shanghai Jiao Tong University; Shanghai Enflame Technology, Fan Hu, Heng Shi Enflame Tech Co., Hao Zhou Enflame Tech Co., Jianguo Yao Shanghai Jiao Tong University; Shanghai Enflame Technology
10:20
20m
Talk
Towards Efficient Compiler Auto-tuning: Leveraging Synergistic Search Spaces
Main Conference
Haolin Pan Institute of Software, Chinese Academy of Sciences; School of Intelligent Science and Technology, HIAS, UCAS, Hangzhou; University of Chinese Academy of Sciences, Yuanyu Wei Institute of Software, Chinese Academy of Sciences; School of Intelligent Science and Technology, HIAS, UCAS, Hangzhou; University of Chinese Academy of Sciences, Mingjie Xing Institute of Software, Chinese Academy of Sciences, Yanjun Wu Institute of Software, Chinese Academy of Sciences, Chen Zhao Institute of Software, Chinese Academy of Sciences
10:40
20m
Talk
Stardust: Compiling Sparse Tensor Algebra to a Reconfigurable Dataflow Architecture
Main Conference
Olivia Hsu Stanford University, Alexander Rucker Stanford University, Tian Zhao Stanford University, Varun Desai Stanford University, Kunle Olukotun Stanford University, Fredrik Kjolstad Stanford University
11:00
20m
Talk
Vectron: A Dynamic Programming Auto-Vectorization Framework
Main Conference
Sourena Naser Moghaddasi University of Victoria, Haris Smajlović University of Victoria, Ariya Shajii Exaloop, Ibrahim Numanagić University of Victoria
12:00 - 13:00
Runtime & System Tools
Main Conference at Casuarina Ballroom (Level 2)
Chair(s): Ajay Brahmakshatriya Massachusetts Institute of Technology
12:00
20m
Talk
GoFree: Reducing Garbage Collection via Compiler-inserted Freeing
Main Conference
Haoran Peng University of Science and Technology of China, Yu Zhang University of Science and Technology of China, Michael D. Ernst University of Washington, Jinbao Chen University of Science and Technology of China, Boyao Ding University of Science and Technology of China
12:20
20m
Talk
Improving Native-Image Startup Performance
Main Conference
Matteo Basso Università della Svizzera italiana (USI), Aleksandar Prokopec Oracle Labs, Andrea Rosà USI Lugano, Walter Binder USI Lugano
12:40
20m
Talk
Speeding up the Local C++ Development Cycle with Header Substitution
Main Conference
Nader Al Awar The University of Texas at Austin, Zijian Yi The University of Texas at Austin, George Biros The University of Texas at Austin, Milos Gligoric The University of Texas at Austin
13:00 - 13:30