Tue 4 Mar 2025 14:00 - 14:20 at Casuarina Ballroom (Level 2) - GPU & Parallelism Chair(s): Bastian Hagedorn

Fully homomorphic encryption (FHE) and zero-knowledge proof (ZKP) are emerging as promising solutions for data security in today’s data-driven world. However, the widespread adoption of these encryption techniques is hindered by their significant computational overhead, primarily resulting from core cryptographic operations that involve large integer arithmetic. To manage this complexity, this paper presents a formalization of multi-word modular arithmetic (MoMA), which breaks down arbitrary bit-width integer arithmetic into operations based on machine words. We further develop a rewrite system that implements MoMA, designed to be compatible with any compiler or code generator by recursively rewriting on data types. We evaluate MoMA by generating cryptographic kernels, including basic linear algebra subprogram (BLAS) operations and the number theoretic transform (NTT), across various GPUs. Our MoMA-based BLAS operations outperform state-of-the-art multi-precision libraries by orders of magnitude, and MoMA-based NTT achieves near-ASIC performance on commodity GPUs.

Tue 4 Mar

Displayed time zone: Pacific Time (US & Canada) change

14:00 - 15:00
14:00
20m
Talk
Code Generation for Cryptographic Kernels Using Multi-word Modular Arithmetic on GPU
Main Conference
Naifeng Zhang Carnegie Mellon University, Franz Franchetti Carnegie Mellon University, USA
14:20
20m
Talk
CuAsmRL: optimizing GPU SASS schedules via deep reinforcement learning
Main Conference
Guoliang He University of Cambridge, Eiko Yoneki U. of Cambridge
14:40
20m
Talk
Proteus: Portable Runtime Optimization of GPU Kernel Execution with Just-In-Time Compilation
Main Conference
Giorgis Georgakoudis Lawrence Livermore National Laboratory, Konstantinos Parasyris Lawrence Livermore National Laboratory, David Beckingsale Lawrence Livermore National Laboratory