New top story on Hacker News: Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s

35 points by campers | 17 comments on Hacker News.

