ANT-ACE: An FHE Compiler Framework for Automating Neural Network Inference
Fully Homomorphic Encryption (FHE) enables computations on encrypted data without needing the decryption key, providing significant privacy benefits for neural network applications in sensitive sectors like medicine and finance. However, programming these applications with FHE is challenging and requires deep cryptographic knowledge to ensure correctness, performance, and security.
In this paper, we introduce ACE, the first production-quality open-source FHE compiler developed by a global IT company. ACE automates neural network inference on encrypted data by accepting ONNX models and generating C/C++ programs that use its open-source FHE library. We discuss the design challenges in developing ACE, which aims to support various input formats and architectures across different FHE schemes through an innovative IR supporting multiple abstraction levels. ACE comprises 44K lines of C/C++ code and translates ONNX models into C/C++ for encrypted inference on CPUs, specifically using the RNS-CKKS scheme. Preliminary evaluations on single CPU show that ACE achieves $2.24\times$ speed improvements in ResNet models compared to expert implementations, confirming its effectiveness and meeting our design objectives.
Mon 3 MarDisplayed time zone: Pacific Time (US & Canada) change
15:40 - 16:40 | ML CompilersMain Conference at Willow (Level 2) Chair(s): William S. Moses University of Illinois Urbana-Champaign | ||
15:40 20mTalk | ANT-ACE: An FHE Compiler Framework for Automating Neural Network Inference Main Conference Long Li Ant Group, Jianxin Lai Ant Group, Peng Yuan Ant Group, Tianxiang Sui Ant Group, Yan Liu Ant Group, Qing Zhu Ant Group, Xiaojing Zhang Ant Group, Linjie Xiao Ant Group, Wenguang Chen Tsinghua University; Pengcheng Laboratory, Jingling Xue UNSW Sydney | ||
16:00 20mTalk | CUrator: An Efficient LLM Execution Engine with Optimized Integration of CUDA Libraries Main Conference | ||
16:20 20mTalk | Accelerating LLMs using an Efficient GEMM library and Target-aware Optimizations on Real-world PIM Devices Main Conference Hyeoncheol Kim Yonsei University, Taehoon Kim Rebellions Inc, Taehyeong Park Yonsei University, Donghyeon Kim Hanyang University, Yongseung Yu Yonsei University, Hanjun Kim Yonsei University, Yongjun Park Yonsei University |