Code Generation for Cryptographic Kernels Using Multi-word Modular Arithmetic on GPU
Fully homomorphic encryption (FHE) and zero-knowledge proof (ZKP) are emerging as promising solutions for data security in today’s data-driven world. However, the widespread adoption of these encryption techniques is hindered by their significant computational overhead, primarily resulting from core cryptographic operations that involve large integer arithmetic. To manage this complexity, this paper presents a formalization of multi-word modular arithmetic (MoMA), which breaks down arbitrary bit-width integer arithmetic into operations based on machine words. We further develop a rewrite system that implements MoMA, designed to be compatible with any compiler or code generator by recursively rewriting on data types. We evaluate MoMA by generating cryptographic kernels, including basic linear algebra subprogram (BLAS) operations and the number theoretic transform (NTT), across various GPUs. Our MoMA-based BLAS operations outperform state-of-the-art multi-precision libraries by orders of magnitude, and MoMA-based NTT achieves near-ASIC performance on commodity GPUs.
Tue 4 MarDisplayed time zone: Pacific Time (US & Canada) change
14:00 - 15:00 | |||
14:00 20mTalk | Code Generation for Cryptographic Kernels Using Multi-word Modular Arithmetic on GPU Main Conference | ||
14:20 20mTalk | CuAsmRL: optimizing GPU SASS schedules via deep reinforcement learning Main Conference | ||
14:40 20mTalk | Proteus: Portable Runtime Optimization of GPU Kernel Execution with Just-In-Time Compilation Main Conference Giorgis Georgakoudis Lawrence Livermore National Laboratory, Konstantinos Parasyris Lawrence Livermore National Laboratory, David Beckingsale Lawrence Livermore National Laboratory |