// 1 transmission tagged with #text-generation
Google’s DiffusionGemma model cuts per‑token latency by a factor of four while preserving text quality, opening the door to real‑time NLP workloads on modest hardware.