2024-10-14 15:24
Hah, Prime Intellect's first major decentralized training run is so popular that they aren't accepting my GPUs right now.
3.75% of 1T tokens done.
If you're showing tokens/sec, I'd also like to see number of H100s contributing. Will make it easy to see how much slower than centralized training it is.