OpenAI and Broadcom unveil LLM-optimized inference chip
ai
According to an announcement from OpenAI and Broadcom, the companies have unveiled a new processor designed specifically for running large language models. The chip, called Jalapeno, is optimized for inference workloads—the computationally intensive task of executing already-trained AI models. Specialized inference processors like this can significantly reduce latency and power consumption compared to general-purpose chips, potentially making AI models faster and more efficient to deploy.
Source: https://openai.com/index/openai-broadcom-jalapeno-inference-chip/
Listen to this story
Hear this and more stories in a personalized audio briefing.
Open The Chonkerton