Stardust: Compiling Sparse Tensor Algebra to a Reconfigurable Dataflow Architecture
We introduce Stardust, a compiler from a sparse tensor algebra language to a reconfigurable dataflow architecture, by way of the Spatial parallel-patterns programming model. The key insight is to let performance engineers specify the placement of data into memories separately from the placement of computation onto compute units. Data is placed using an abstract memory model, and Stardust binds that data to complex, on-chip physical memories. Stardust then binds computation that uses on-chip data structures to the appropriate parallel patterns. Using cycle-accurate simulation, we show that Stardust can generate nine more tensor algebra kernels than the original Capstan work. The generated kernels result in 138$\times$ better performance on average than generated CPU kernels and 41$\times$ better performance on average than generated GPU kernels.
Wed 5 MarDisplayed time zone: Pacific Time (US & Canada) change
10:00 - 11:20 | Optimizations & Transformations (3)Main Conference at Casuarina Ballroom (Level 2) Chair(s): Michel Steuwer Technische Universität Berlin | ||
10:00 20mTalk | Postiz: Extending Post-Increment Addressing for Loop Optimization and Code Size Reduction Main Conference enming fan , Xiaofeng Guan Shanghai Jiao Tong University; Shanghai Enflame Technology, Fan Hu , Heng Shi Enflame Tech Co., Hao Zhou Enflame Tech Co., Jianguo Yao Shanghai Jiao Tong University; Shanghai Enflame Technology | ||
10:20 20mTalk | Towards Efficient Compiler Auto-tuning: Leveraging Synergistic Search Spaces Main Conference Haolin Pan Institute of Software, Chinese Academy of Sciences;School of Intelligent Science and Technology, HIAS, UCAS, Hangzhou;University of Chinese Academy of Sciences, Yuanyu Wei Institute of Software, Chinese Academy of Sciences;School of Intelligent Science and Technology, HIAS, UCAS, Hangzhou;University of Chinese Academy of Sciences, Mingjie Xing Institute of Software, Chinese Academy of Sciences, Yanjun Wu Institute of Software, Chinese Academy of Sciences, Chen Zhao Institute of Software, Chinese Academy of Sciences | ||
10:40 20mTalk | Stardust: Compiling Sparse Tensor Algebra to a Reconfigurable Dataflow Architecture Main Conference Olivia Hsu Stanford University, Alexander Rucker Stanford University, Tian Zhao Stanford University, Varun Desai Stanford University, Kunle Olukotun Stanford University, Fredrik Kjolstad Stanford University | ||
11:00 20mTalk | Vectron: A Dynamic Programming Auto-Vectorization Framework Main Conference Sourena Naser Moghaddasi University of Victoria, Haris Smajlović University of Victoria, Ariya Shajii Exaloop, Ibrahim Numanagić University of Victoria |