Hardware

OpenAI and Broadcom Unveil Jalapeño Inference Chip

OpenAI and Broadcom announced Jalapeño, an AI accelerator designed to improve performance per watt for large language models, with early testing showing significant gains over current state-of-the-art solutions.

Image: OpenAI

OpenAI and Broadcom today unveiled Jalapeño, an AI accelerator designed to enhance performance per watt for large language models. Early testing indicates that the first-generation chip will deliver substantial improvements over current state-of-the-art solutions. The collaboration marks a key step in OpenAI’s strategy to build a full-stack platform for its models and products, with the chip set for deployment at gigawatt scale with data center partners over multiple generations.

Jalapeño was developed from scratch around OpenAI’s deep understanding of LLM fundamentals, informed by its roadmap of models, kernels, serving systems, and product needs. The chip is designed to work with all LLMs guided by OpenAI’s insights into the inference needs of current and future AI models across the industry. Engineering samples are running ML workloads in the lab at production target frequency and power, including GPT‑5.3‑Codex‑Spark. The architecture reduces data movement and balances compute, memory, and networking resources to achieve realized utilization much closer to theoretical peak performance.

The chip was delivered to OpenAI CEO Sam Altman and President Greg Brockman by Broadcom President and CEO Hock Tan and President Charlie Kawwas, marking an important step in OpenAI’s strategy to build the full stack behind its models and products. OpenAI designed the chip from scratch around its deep understanding of LLM fundamentals, informed by its roadmap of models, kernels, serving systems, and product needs, with partners Broadcom and Celestica, helping industrialize the platform through chip implementation, board, rack system integration, high-performance networking, and scalable production systems.

Source: openai

Key points

OpenAI and Broadcom unveiled Jalapeño, an AI accelerator designed to improve performance per watt for large language models.
Early testing shows Jalapeño will deliver performance per watt substantially better than current state-of-the-art solutions.
Jalapeño was developed from scratch around OpenAI’s deep understanding of LLM fundamentals.
Engineering samples of the Jalapeño chip are running ML workloads in the lab at production target frequency and power, including GPT‑5.3‑Codex‑Spark.
The architecture reduces data movement and balances compute, memory, and networking resources to achieve realized utilization much closer to theoretical peak performance.
Jalapeño was delivered to OpenAI CEO Sam Altman and President Greg Brockman by Broadcom President and CEO Hock Tan and President Charlie Kawwas.

Source: OpenAI Read the original →

WRITTEN BY

Sam Bergstrom

AI Infrastructure & Hardware

Sam specializes in AI chips, data centers, and training infrastructure.