
OpenAI has just revealed a new "intelligence processor" chip for AI servers made in partnership with Broadcom. The chip, called Jalapeño, is designed to power current and future large language models, according to an announcement on Wednesday. Jalapeño is an ASIC (Application-Specific Integrated Circuit), meaning it's designed for a specific purpose: AI inference. With AI inference, models process a user's request to run an agent like Codex or offer a response fro
OpenAI has revealed Jalapeño, a custom-designed chip made in partnership with Broadcom specifically for powering AI servers that process requests for large language models like ChatGPT. The chip is an ASIC designed for AI inference, which means it processes user requests and generates responses rather than training models on data. This matters because it helps reduce OpenAI's reliance on Nvidia's GPUs, which are in limited supply, and according to Broadcom's CEO, it matches the performance of competing chips from Nvidia and Google. OpenAI plans to deploy Jalapeño by the end of 2026 as the first step in a larger computing platform, and early testing suggests it will deliver better performance per watt than current state-of-the-art alternatives.

OpenAI, the company behind ChatGPT and Codex and the models those tools utilize, and Broadcom, an established silicon supplier, have announced a new chip called Jalapeño, designed specifically for large language model inference in data centers. The chip is intended to be deployed at large data centers, both companies claim this is just the first generation in a long-term project that will see chips refined over time.Read full article Comments

In its first earnings report since going public, the AI chipmaker forecast a narrower gross margin in its core business, scaring investors.

Revenue quadrupled to $41.45 billion compared with the same period a year ago. The company's profit, meanwhile, rose from $1.88 billion to an incredible $28.2 billion year-over-year.
Want to go deeper than the news? Explore live, cohort-based AI courses taught by practitioners.
Browse AI courses on Maven