OpenAI and Broadcom announce chip designed for LLM inference at scale

OpenAI, the company behind ChatGPT, Codex, and the models that power them, and Broadcom, an established silicon supplier, have announced Jalapeño, a new chip designed specifically for large language model inference in data centers.

The chip is intended for deployment in large data centers. Both companies say it is only the first generation in a long-term project in which the chips will be refined over time.

Broadcom says that this ASIC (application-specific integrated circuit) was designed from scratch for LLM inference, based on “detailed insights” from the company’s conversations with researchers at OpenAI, and that the chip’s development was informed by OpenAI’s own roadmap for future models and …

→ Continue reading at Ars Technica
