MTIA v1: Meta’s first-generation AI inference accelerator

May 18, 2023

AI workloads are ubiquitous at Meta, forming the basis for a wide range of use cases, including content understanding, Feeds, generative AI, and ads ranking. These workloads run on PyTorch with first-class Python integration, eager-mode development, and the simplicity of APIs. Deep learning recommendation models (DLRMs), in particular, are important for improving experiences across Meta’s services and applications. But as these models increase in size and complexity, the underlying hardware systems need to provide exponentially more memory and compute while remaining efficient.

Read the full article.