AI-Optimized Cores: 900,000 (123x more cores)
Interconnect Bandwidth: 214 Pb/s (45,000x more bandwidth)
System I/O: 1.2 Tb/s
On-Chip SRAM: 44 GB (1,000x more on-chip memory)
Memory Bandwidth: 21 PB/s (12,800x more bandwidth)
System Dimensions: 16 RU
PERFORMANCE
Cluster-Scale Performance on a Single Chip
A single CS-3 typically delivers the wall-clock compute performance of tens to hundreds of graphics processing units (GPUs), or more. In a single system smaller than one rack, the CS-3 delivers answers in minutes or hours that would take days, weeks, or longer on large multi-rack clusters of legacy, general-purpose processors.
At 16 RU and a peak sustained system power of 23 kW, the CS-3 packs the performance of a room full of servers into a single unit the size of a dorm room mini-fridge. With cluster-scale compute available in a single device, you can push your research further, at a fraction of the cost.
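For capacity planning, the two figures above can be turned into a power density per rack unit. The following is a minimal sketch that uses only the 16 RU and 23 kW values quoted in the text; the constant and function names are illustrative, not part of any Cerebras tool.

```python
# Figures quoted in the text; names are illustrative assumptions.
SYSTEM_HEIGHT_RU = 16      # system height in rack units
PEAK_POWER_KW = 23.0       # peak sustained system power in kilowatts


def power_density_kw_per_ru(power_kw: float, height_ru: int) -> float:
    """Return peak power drawn per rack unit of height."""
    if height_ru <= 0:
        raise ValueError("height_ru must be positive")
    return power_kw / height_ru


density = power_density_kw_per_ru(PEAK_POWER_KW, SYSTEM_HEIGHT_RU)
print(f"Peak power density: {density:.2f} kW per RU")
```

This is a peak figure, so it is best read as an upper bound when sizing power and cooling for the rack.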
SYSTEM ENGINEERING
Purpose-Built for AI Workloads
The CS-3 is designed to deliver unparalleled performance to users, all in a package that is easy to deploy, operate, and maintain in your datacenter today.
At the heart of the CS-3 system is an innovative wafer packaging solution we refer to as the engine block. The engine block delivers power straight into the face of the wafer to achieve a power density that traditional packaging could not provide. It cools the wafer uniformly via a closed internal water loop. All cooling and power supplies are redundant and hot-swappable, so you stay up and running at full performance.
DATA CENTER DEPLOYMENT
Revolutionary AI Compute in a Standards-Based System
The CS-3 installs easily into standard datacenter infrastructure, going from loading dock to users’ hands in a few days rather than weeks or months.
The CS-3 connects to surrounding infrastructure over 12 standard 100 Gigabit Ethernet links and converts standard TCP/IP traffic into the Cerebras protocol at full line rate to feed the WSE-3’s 900,000 cores.
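The link count and per-link rate above account for the 1.2 Tb/s System I/O figure. The arithmetic can be sketched as follows, assuming nominal line rate with no allowance for protocol overhead; the function and constant names are illustrative only.

```python
# Figures quoted in the text; names are illustrative assumptions.
LINK_COUNT = 12           # number of Ethernet links
LINK_RATE_GBPS = 100      # nominal rate per link, in gigabits per second


def aggregate_io_tbps(link_count: int, link_rate_gbps: float) -> float:
    """Return nominal aggregate bandwidth in terabits per second."""
    return link_count * link_rate_gbps / 1000  # 1 Tb = 1000 Gb


def aggregate_io_gigabytes_per_s(link_count: int, link_rate_gbps: float) -> float:
    """Return nominal aggregate bandwidth in gigabytes per second."""
    return link_count * link_rate_gbps / 8  # 8 bits per byte


total_tbps = aggregate_io_tbps(LINK_COUNT, LINK_RATE_GBPS)
total_gbytes = aggregate_io_gigabytes_per_s(LINK_COUNT, LINK_RATE_GBPS)
print(f"Aggregate system I/O: {total_tbps} Tb/s ({total_gbytes} GB/s)")
```

Achievable throughput on a real network will be somewhat lower than this nominal total once Ethernet and TCP/IP framing overhead is included.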
EFFICIENCY
Massive Gains in Space and Power Efficiency
For AI researchers and data scientists, the CS-3 makes it possible to test more ideas per unit time with unmatched AI compute performance, and it delivers those gains in a more space- and power-efficient package. For typical customer workloads running today, the CS-3 delivers orders-of-magnitude wall-clock compute advantages over GPUs at a fraction of the power.