Skip to main content
In the News

MTIA v1: Meta’s first-generation AI inference accelerator

By May 18, 2023May 25th, 2023No Comments

AI workloads are ubiquitous at Meta — forming the basis for a wide range of use cases, including content understandingFeeds, generative AI, and ads ranking. These workloads run on PyTorch with first-class Python integration, eager-mode development, and the simplicity of APIs. Deep learning recommendation models (DLRMs), in particular, are important for improving experiences across Meta’s services and applications. But as these models increase in size and complexity, the underlying hardware systems need to provide exponentially more memory and compute while remaining efficient.

Read the full article.

Stay Connected With RISC-V

We send occasional news about RISC-V technical progress, news, and events.