OpenAI sidesteps Nvidia with unusually fast coding model on plate-sized chips

February 12, 2026

But 1,000 tokens per second is actually modest by Cerebras standards. The company has measured 2,100 tokens per second on Llama 3.1 70B and reported 3,000 tokens per second on OpenAI’s own open-weight gpt-oss-120B model, suggesting that Codex-Spark’s comparatively lower speed reflects the overhead of a larger or more complex model.

AI coding agents have had a breakout year, with tools like OpenAI’s Codex and Anthropic’s Claude Code reaching a new level of usefulness for rapidly building prototypes, interfaces, and boilerplate code. OpenAI, Google, and Anthropic have all been racing to ship more capable coding agents, and latency has become what separates the winners; a model that codes faster

→ Continue reading at Ars Technica
