December 29, 2023
Introducing gigaGPT: GPT-3 sized models in 565 lines of code
GigaGPT is Cerebras’ implementation of Andrei Karpathy’s nanoGPT – the simplest and most compact code base to train and…
0 Comments17 Minutes
June 1, 2022
Tensor Shape: Increasing Model Throughput
We write machine learning algorithms to fit the data, not pad the data to suit hardware limitations.
0 Comments15 Minutes