๐Ÿ“ฐ Story

simon_willison ยท May 20, 2026 ยท news

โ† Live feed ๐Ÿ“ฐ Daily recap ๐Ÿ—“๏ธ Weekly recap ๐Ÿ”” RSS

How fast is 10 tokens per second really?

How fast is 10 tokens per second really? Neat little HTML app by Mike Veerman ( source code here ) which simulates LLM token output speeds from 5/second to 800/second. Useful if you see a model advertised as "30 tokens/second" and want to get a feel for what that actually looks like. Via Hacker News Tags: ai , generative-ai , llms

Read the original at simonwillison.net โ†’Open in live feed

Related stories 4 items