
DeepSeek released DSpark, a speculative decoding framework, with open-source checkpoints and training code. It is a serving optimization, not a new model. The checkpoints DeepSeek-V4-Pro-DSpark and DeepSeek-V4-Flash-DSpark reuse the existing V4 weights, with a draft module attached. The DeepSeek research team also open-sourced DeepSpec, an MIT-licensed codebase for training and evaluating speculative decoding drafters. The work targets one problem: faster large-model inference in busy produc
DSpark is a serving optimization framework that speeds up text generation from large AI models by combining two different drafting approaches: a fast parallel method and a lightweight sequential component. The system works by having a smaller draft model propose multiple tokens at once, which a larger model then verifies, accepting tokens that match the target distribution without any quality loss. This matters because it reduces latency in production settings where many users are generating text simultaneously, with measured speedups of 60 to 85 percent over previous single-token baselines depending on the workload type. The framework includes a confidence scoring system and load-aware scheduler that adjust how many tokens get verified based on whether the system is busy or idle, allowing it to optimize speed while maintaining output quality.

New models are launching in Asia that promise Mythos-like capabilities without fear of an export ban. U.S. AI labs may never recover this enormous market.

OpenAI has begun a limited preview of GPT-5.6, its next-generation model series. The lineup splits into three named tiers: Sol, Terra, and Luna. Sol is the flagship. Terra targets everyday production work. Luna is the fast, low-cost option. OpenAI is starting with a small group of trusted partners through the API and Codex. According to OpenAI post, they shared the models and plans with the U.S. government first. Broader access in ChatGPT, Codex, and the API is planned in the coming weeks.

Machine Intelligence
Want to go deeper than the news? Explore live, cohort-based AI courses taught by practitioners.
Browse AI courses on Maven