OpenAI unveils its first custom chip, built by Broadcom

Named Jalapeño, the new processor was designed specifically for the unique needs of OpenAI's inference systems.

Get smart on it

OpenAI has unveiled its first custom-built inference processor, named Jalapeño, which was designed and manufactured in collaboration with Broadcom. The chip is specifically designed for inference, the process of running pre-built AI models in response to user commands, and early testing shows significantly better performance-per-watt than current alternatives. This matters because reducing inference costs could improve OpenAI's profitability, and the move reflects a broader industry trend where major AI companies build custom chips to reduce dependence on existing hardware makers like Nvidia. The chip represents OpenAI's effort to optimize its entire technology stack, from chip architecture down to product experience, all around the same goal of making its models faster, more reliable, and more affordable.

OpenAI and Broadcom announce chip designed for LLM inference at scale

OpenAI, the company behind ChatGPT and Codex and the models those tools utilize, and Broadcom, an established silicon supplier, have announced a new chip called Jalapeño, designed specifically for large language model inference in data centers. The chip is intended to be deployed at large data centers, both companies claim this is just the first generation in a long-term project that will see chips refined over time.Read full article Comments

Hardware & ComputeOpen story →

Cerebras stock plunges after earnings as CEO says margin outlook was misunderstood

In its first earnings report since going public, the AI chipmaker forecast a narrower gross margin in its core business, scaring investors.

Hardware & Compute

OpenAI unveils its first custom chip, built by Broadcom

OpenAI and Broadcom announce chip designed for LLM inference at scale

Cerebras stock plunges after earnings as CEO says margin outlook was misunderstood

The memory chip crunch is paying off for this US company

Microsoft’s Wisconsin AI Data Center Campus Now Fully Operational

Nvidia Overtakes Rivals in Data Center Ethernet Switching, IDC Says

Texas Approves Batch Zero Study as Data Center Demand Soars