Decoupled DiLoCo: Resilient, Distributed AI Training at Scale

(deepmind.google)

32 points | by metadat  3 hours ago

3 comments