Think Smart About Sparse Compute: LatentMoE for Higher Accuracy per Flop, Param

(research.nvidia.com)

2 points | by buildbot  7 hours ago

No comments yet.