
Inception
@_inception_ai • 17,714 subscribers
Pioneering a new generation of LLMs.

Today's autoregressive models generate one token at a time. Mercury 2 generates tokens in parallel. Over 1,000 tok/sec on standard GPUs, at comparable quality to speed-optimized models. Since launch, the community has been showing what diffusion LLMs can unlock. Thanks to the team at Clyep for the breakdown.
Inception • 21,021 views • 19 days ago