Google’s 1,000 Token/Sec AI Is Here
Google's DiffusionGemma rewrites the rules for text generation, using image diffusion techniques to hit speeds over 1,000 tokens per second. This radical shift from memory-bound to compute-bound architecture unlocks a new class of instant, interactive local AI.