GPT-OSS-120B IS NOW LIVE
OpenAI x Cerebras
The new open model gpt-oss-120B is live on Cerebras, running at a world-record 3,000 tokens/sec. It combines high intelligence, low cost, and easy migration, delivering the best of GenAI without compromises.

Powered by the Cerebras Wafer Scale Engine, Cerebras Inference runs the latest AI models 20x faster than ChatGPT. Companies like Perplexity, Mistral, and AlphaSense use Cerebras to get instant responses to user queries.
Groundbreaking organizations are using Cerebras to push the boundaries of their AI capabilities.
Cerebras is the first and only company in the world building AI hardware at wafer scale, and it holds the world speed record in AI inference.