ANT-ACE: An FHE Compiler Framework for Automating Neural Network Inference (CGO 2025 - Main Conference)

Who

Long Li, Jianxin Lai, Peng Yuan, Tianxiang Sui, Yan Liu, Qing Zhu, Xiaojing Zhang, Linjie Xiao, Wenguang Chen, Jingling Xue

Track

CGO 2025 Main Conference

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 3 Mar 2025 15:40 - 16:00 at Willow (Level 2) - ML Compilers Chair(s): William S. Moses

Abstract

Fully Homomorphic Encryption (FHE) enables computations on encrypted data without needing the decryption key, providing significant privacy benefits for neural network applications in sensitive sectors like medicine and finance. However, programming these applications with FHE is challenging and requires deep cryptographic knowledge to ensure correctness, performance, and security.

In this paper, we introduce ACE, the first production-quality open-source FHE compiler developed by a global IT company. ACE automates neural network inference on encrypted data by accepting ONNX models and generating C/C++ programs that use its open-source FHE library. We discuss the design challenges in developing ACE, which aims to support various input formats and architectures across different FHE schemes through an innovative IR supporting multiple abstraction levels. ACE comprises 44K lines of C/C++ code and translates ONNX models into C/C++ for encrypted inference on CPUs, specifically using the RNS-CKKS scheme. Preliminary evaluations on single CPU show that ACE achieves $2.24\times$ speed improvements in ResNet models compared to expert implementations, confirming its effectiveness and meeting our design objectives.

Long Li

Ant Group

Jianxin Lai

Ant Group

Peng Yuan

Ant Group

Tianxiang Sui

Ant Group

Yan Liu

Ant Group

Qing Zhu

Ant Group

Xiaojing Zhang

Ant Group

Linjie Xiao

Ant Group

Wenguang Chen

Tsinghua University; Pengcheng Laboratory

China

Jingling Xue

UNSW Sydney

Australia

Time Zone

The program is currently displayed in (GMT-08:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-08:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 3 Mar
Displayed time zone: Pacific Time (US & Canada) change

15:40 - 16:40	ML CompilersMain Conference at Willow (Level 2) Chair(s): William S. Moses University of Illinois Urbana-Champaign

15:40 20m Talk		ANT-ACE: An FHE Compiler Framework for Automating Neural Network Inference Main Conference Long Li Ant Group, Jianxin Lai Ant Group, Peng Yuan Ant Group, Tianxiang Sui Ant Group, Yan Liu Ant Group, Qing Zhu Ant Group, Xiaojing Zhang Ant Group, Linjie Xiao Ant Group, Wenguang Chen Tsinghua University; Pengcheng Laboratory, Jingling Xue UNSW Sydney
16:00 20m Talk		CUrator: An Efficient LLM Execution Engine with Optimized Integration of CUDA Libraries Main Conference Yoon Noh Lee Yonsei University, Yongseung Yu Yonsei University, Yongjun Park Yonsei University
16:20 20m Talk		Accelerating LLMs using an Efficient GEMM library and Target-aware Optimizations on Real-world PIM Devices Main Conference Hyeoncheol Kim Yonsei University, Taehoon Kim Rebellions Inc, Taehyeong Park Yonsei University, Donghyeon Kim Hanyang University, Yongseung Yu Yonsei University, Hanjun Kim Yonsei University, Yongjun Park Yonsei University