JetSpec: Parallel Tree Drafting Overcomes Bottleneck in Speculative Decoding

26 June 2026 | 4 July 2026 | AI Models

JetSpec overcomes the scaling limits of speculative decoding through parallel tree drafting with causal conditioning, achieving up to 9.64x speedup in LLM inference.