온디맨드 웨비나

How NVIDIA Uses High-Level Synthesis Tools for AI Hardware Accelerator Research

예상 소요 시간: 19분

공유

To keep up with the rapid pace of change in AI/ML workloads, NVIDIA Research leverages a High-Level Synthesis (HLS) based design methodology based off SystemC and libraries like MatchLib for maximizing code reuse and minimizing design verification effort.

Artificial Intelligence (AI) and Machine Learning (ML) are rapidly transforming many aspects of integrated circuit (IC) design. The high computational demands and evolving AI/ML workloads are dramatically impacting the architecture, VLSI implementation, and circuit design tradeoffs of hardware accelerators. To keep up with the rapid pace of change in AI/ML workloads, NVIDIA Research leverages a High-Level Synthesis (HLS) based design methodology based off SystemC and libraries such as MatchLib for maximizing code reuse and minimizing design verification effort. This methodology provides for rapid co-optimization of AI algorithms and hardware architecture and has enabled NVIDIA Research to tape out a state-of-the-art 5nm deep learning inference accelerator testchip that achieves up to 95.6 TOPS/ with per-vector scaled 4-bit quantization for Transformer neural network inference.

발표자 소개

NVIDIA

Brucek Khailany

Senior Director of ASIC and VLSI Research

Brucek Khailany joined NVIDIA in 2009 and currently leads the ASIC & VLSI Research group. During his time at NVIDIA, he has contributed to projects within research and product groups on topics spanning computer architecture, unit micro-architecture, and ASIC and VLSI design techniques. Dr. Khailany is also currently the Principal Investigator to a NVIDIA-led team under the DARPA CRAFT project researching high-productivity design methodology and design tools. Previously, Dr. Khailany was a Co-Founder and Principal Architect at Stream Processors, Inc. (SPI) where he led research and development activities related to highly-parallel programmable processor architectures. He received his Ph.D. and Masters in Electrical Engineering from Stanford University and received B.S.E. degrees in Electrical Engineering and Computer Engineering from the University of Michigan.

관련 자료

블루닷: Catapult-HLS를 이용한 NN기반의 DeepField-PQO 설계 가속화
White Paper

블루닷: Catapult-HLS를 이용한 NN기반의 DeepField-PQO 설계 가속화

고화질 영상에 대한 시장의 수요 증가로 인해 높은 인코딩 비용에 대한 비디오 서비스 제공 업체의 부담이 커지고 있습니다. 이 문제를 해결하기 위해 블루닷은 비디오 인코딩 효율을 향상시키는 AI 기반의 CODEC용 전처리 DeepField-PQO 필터를 개발하였습니다. 블루닷은 필터를 FPGA용 IP로 빠르게 구현하기 위해 HLS를 활용하고, ASIC용으로는 Catapult HLS를 활용하였습니다.

Xperi®: A Designer’s Life with HLS
Webinar

Xperi®: A Designer’s Life with HLS

This webinar will discuss two aspects of their experience going from RTL to HLS. The first topic is using HLS for algorithms such as Face Detection th