on-demand webinar

Cornell University: Building Sparse Linear Algebra Accelerators with HLS

Estimated Watching Time: 72 minutes

Cornell introduces HiSparse, an accelerator for sparse-matrix dense-vector multiplication (SpMV). Evaluated using both HLS implementation and simulation, their sparse accelerators deliver promising speedup with increased bandwidth and energy efficiency compared to CPU and GPU executions.

Sparse linear algebra (SLA) operations are essential in many applications such as data analytics, graph processing, machine learning, and scientific computing. However, it is challenging to build efficient hardware accelerators for SLA operations since they typically exhibit low operational intensity and irregular compute and data access patterns. In particular, some of these challenges are not well studied in the context of High-Level Synthesis (HLS). 
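To make "low operational intensity" concrete, the sketch below gives a rough estimate of floating-point operations per byte of memory traffic for SpMV. It is a back-of-the-envelope illustration, not a figure from the talk. It assumes a CSR layout, 4-byte values and indices, and no on-chip reuse of the dense vector. The function name and parameters are hypothetical.

```python
def spmv_operational_intensity(nnz, num_rows, value_bytes=4, index_bytes=4):
    """Rough FLOPs-per-byte estimate for CSR SpMV (illustrative assumptions only)."""
    # One multiply and one add per nonzero element.
    flops = 2 * nnz
    # Matrix traffic: a value and a column index per nonzero, plus row pointers.
    matrix_bytes = nnz * (value_bytes + index_bytes) + (num_rows + 1) * index_bytes
    # Vector traffic: one gathered input element per nonzero (assumes no reuse),
    # plus one output element written per row.
    vector_bytes = nnz * value_bytes + num_rows * value_bytes
    return flops / (matrix_bytes + vector_bytes)


if __name__ == "__main__":
    # Hypothetical matrix: 100 rows with 1,000 nonzeros in total.
    print(f"{spmv_operational_intensity(1000, 100):.3f} FLOPs/byte")
```

Under these assumptions the estimate stays well below one operation per byte regardless of matrix size, so memory bandwidth rather than arithmetic tends to limit throughput.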
 
In this talk, we first introduce HiSparse, an accelerator for sparse-matrix dense-vector multiplication (SpMV). To achieve high bandwidth utilization, we co-design the sparse storage format and the accelerator architecture. We then demonstrate the use of Catapult HLS to build a high-throughput pipeline that can handle irregular data dependencies and access patterns. Building on our SpMV accelerator, we develop a versatile sparse accelerator that supports multiple SLA operations, with run-time configurability for different compute patterns. Our architecture design is guided by a novel analytical model that enables rapid exploration of the design configuration search space. According to our evaluation using both HLS implementation and simulation, our sparse accelerators deliver promising speedup with increased bandwidth and energy efficiency when compared to CPU and GPU executions.
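For readers new to SpMV, here is a minimal software reference of the operation. It uses the standard compressed sparse row (CSR) format as an assumption. It does not show HiSparse's co-designed storage format or its hardware pipeline, which the talk covers. The function and variable names are illustrative.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a sparse matrix A stored in CSR form."""
    num_rows = len(row_ptr) - 1
    y = [0.0] * num_rows
    for row in range(num_rows):
        acc = 0.0
        # Nonzeros of this row occupy positions row_ptr[row] .. row_ptr[row + 1] - 1.
        for k in range(row_ptr[row], row_ptr[row + 1]):
            # x is indexed through col_idx, so reads of x are data-dependent:
            # this is the irregular access pattern an accelerator must handle.
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


if __name__ == "__main__":
    # A = [[1, 0, 2],
    #      [0, 0, 3],
    #      [4, 5, 0]]
    row_ptr = [0, 2, 3, 5]
    col_idx = [0, 2, 2, 0, 1]
    values = [1.0, 2.0, 3.0, 4.0, 5.0]
    print(spmv_csr(row_ptr, col_idx, values, [1.0, 2.0, 3.0]))
```

The inner loop also accumulates into a single value per row, a loop-carried dependency that makes deep pipelining harder and is one example of the irregular data dependencies mentioned above.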

Meet the speaker

Cornell University

Yixiao Du

Ph.D. student

Yixiao is a second-year Ph.D. student at the Cornell Computer Systems Lab, advised by Prof. Zhiru Zhang. His current research interests include efficient acceleration of sparse workloads with high-level synthesis. He received his bachelor's degree in Microelectronics Science and Engineering from the University of Electronic Science and Technology of China.
