Codesigning Computing Systems for Artificial Intelligence
Speaker: Suvinay Subramanian
Location: 60 Fifth Avenue Room 150
Date: November 3, 2023, 11 a.m.
Host: Anirudh Sivaraman
The rapid advancement of artificial intelligence (AI) has ushered in an era of
unprecedented computational demands, necessitating continuous innovation in
computing systems. In this talk, we will highlight how codesign has been a key
paradigm in enabling innovative solutions and state-of-the-art performance in
Google's AI computing systems, namely Tensor Processing Units (TPUs). We
present several codesign case studies across different layers of the stack,
spanning hardware, systems, software, algorithms, all the way up to the
datacenter. We discuss how TPUs have made judicious, yet opinionated bets in
our design choices, and how these design choices have not only kept pace with
the blistering rate of change, but also enabled many of the breakthroughs in
Suvinay Subramanian is a Staff Software Engineer at Google, where he works on
the architecture and codesign for Google's ML supercomputers, Tensor Processing
Units (TPUs). His work has directly impacted innovative architecture and
systems features in multiple generations of TPUs, and empowered performant
training and serving of Google's research and production AI workloads. Suvinay
received a Ph.D. from MIT, and a B.Tech from the Indian Institute of Technology
Madras. He also co-hosts the Computer Architecture podcast that spotlights
cutting-edge developments in computer architecture and systems.
In-person attendance only available to those with active NYU ID cards.