TurboPrefill: 2.7× faster than llama.cpp Pipeline Parallel on Llama-3-70B

(github.com)

3 points | by trykhlieb a day ago

1 comment