Mon 3 Mar 2025 15:40 - 16:00 at Willow (Level 2) - ML Compilers Chair(s): William S. Moses

Fully Homomorphic Encryption (FHE) enables computations on encrypted data without needing the decryption key, providing significant privacy benefits for neural network applications in sensitive sectors like medicine and finance. However, programming these applications with FHE is challenging and requires deep cryptographic knowledge to ensure correctness, performance, and security.

In this paper, we introduce ACE, the first production-quality open-source FHE compiler developed by a global IT company. ACE automates neural network inference on encrypted data by accepting ONNX models and generating C/C++ programs that use its open-source FHE library. We discuss the design challenges in developing ACE, which aims to support various input formats and architectures across different FHE schemes through an innovative IR supporting multiple abstraction levels. ACE comprises 44K lines of C/C++ code and translates ONNX models into C/C++ for encrypted inference on CPUs, specifically using the RNS-CKKS scheme. Preliminary evaluations on single CPU show that ACE achieves $2.24\times$ speed improvements in ResNet models compared to expert implementations, confirming its effectiveness and meeting our design objectives.

Mon 3 Mar

Displayed time zone: Pacific Time (US & Canada) change

15:40 - 16:40
ML CompilersMain Conference at Willow (Level 2)
Chair(s): William S. Moses University of Illinois Urbana-Champaign
15:40
20m
Talk
ANT-ACE: An FHE Compiler Framework for Automating Neural Network Inference
Main Conference
Long Li Ant Group, Jianxin Lai Ant Group, Peng Yuan Ant Group, Tianxiang Sui Ant Group, Yan Liu Ant Group, Qing Zhu Ant Group, Xiaojing Zhang Ant Group, Linjie Xiao Ant Group, Wenguang Chen Tsinghua University; Pengcheng Laboratory, Jingling Xue UNSW Sydney
16:00
20m
Talk
CUrator: An Efficient LLM Execution Engine with Optimized Integration of CUDA Libraries
Main Conference
Yoon Noh Lee Yonsei University, Yongseung Yu Yonsei University, Yongjun Park Yonsei University
16:20
20m
Talk
Accelerating LLMs using an Efficient GEMM library and Target-aware Optimizations on Real-world PIM Devices
Main Conference
Hyeoncheol Kim Yonsei University, Taehoon Kim Rebellions Inc, Taehyeong Park Yonsei University, Donghyeon Kim Hanyang University, Yongseung Yu Yonsei University, Hanjun Kim Yonsei University, Yongjun Park Yonsei University