TurboQuant model weight compression support added to llama.cpp

(github.com)

11 points | by lastdong | 3 hours ago

4 comments