Google DeepMind DiffusionGemma: Parallel Text Generation on Local GPUs10. June 2026AI Models, GoogleShare on:DiffusionGemma denoises up to 256 tokens in parallel per step instead of sequentially and achieves 1,000 tokens/second on NVIDIA H100 at batch size 1 — without cloud dependency. Share on: