DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

(github.com)

291 points | by aurenvale  2 hours ago

68 comments