OpenAI and Broadcom today unveiled Jalapeño, an AI accelerator designed to enhance performance per watt for large language models. Early testing indicates that the first-generation chip will deliver substantial improvements over current state-of-the-art solutions. The collaboration marks a key step in OpenAI’s strategy to build a full-stack platform for its models and products, with the chip set for deployment at gigawatt scale with data center partners over multiple generations.

Jalapeño was developed from scratch around OpenAI’s deep understanding of LLM fundamentals, informed by its roadmap of models, kernels, serving systems, and product needs. The chip is designed to work with all LLMs, guided by OpenAI’s insights into the inference needs of current and future AI models across the industry. Engineering samples are running ML workloads, including GPT‑5.3‑Codex‑Spark, in the lab at production target frequency and power. The architecture reduces data movement and balances compute, memory, and networking resources so that realized utilization comes much closer to theoretical peak performance.
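
The link between data movement and realized utilization can be illustrated with the standard roofline model, which the announcement itself does not mention. The sketch below is a minimal illustration under assumed inputs: the peak throughput and memory bandwidth are hypothetical round numbers, not Jalapeño specifications, and the function names are invented for this example.

```python
def attainable_flops(peak_flops, mem_bandwidth, arithmetic_intensity):
    """Roofline bound on throughput in FLOP/s.

    arithmetic_intensity is FLOPs performed per byte moved from memory.
    Throughput is capped by the compute roof or by bandwidth * intensity,
    whichever is lower.
    """
    return min(peak_flops, mem_bandwidth * arithmetic_intensity)


def utilization(peak_flops, mem_bandwidth, arithmetic_intensity):
    """Fraction of theoretical peak that the roofline bound allows."""
    return attainable_flops(peak_flops, mem_bandwidth, arithmetic_intensity) / peak_flops


# Hypothetical accelerator, chosen only for illustration (not Jalapeño figures).
PEAK_FLOPS = 1.0e15      # FLOP/s
MEM_BANDWIDTH = 2.0e12   # bytes/s

if __name__ == "__main__":
    for intensity in (10, 100, 500, 1000):
        u = utilization(PEAK_FLOPS, MEM_BANDWIDTH, intensity)
        print(f"{intensity:>5} FLOP/byte -> at most {u:.0%} of peak")
```

In this toy model, a workload that moves fewer bytes per operation has higher arithmetic intensity and can reach a larger share of peak, which is one way to read the claim that reducing data movement brings realized utilization closer to the theoretical maximum.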

The chip was delivered to OpenAI CEO Sam Altman and President Greg Brockman by Broadcom President and CEO Hock Tan and President Charlie Kawwas. Partners Broadcom and Celestica are helping industrialize the platform through chip implementation, board and rack system integration, high-performance networking, and scalable production systems.

Source: OpenAI