DeepSeek V4 is dropping any day now — possibly this week. China's most disruptive AI lab is about to release its first major model in over a year. And it's arriving under a cloud of serious accusations: both OpenAI and Anthropic say DeepSeek stole their work to build it.
Here's what's happening, why it matters, and what builders should actually pay attention to.
What DeepSeek V4 Actually Is
V4 is a major step up from DeepSeek V3. It's multimodal — capable of processing and generating text, images, and video — marking DeepSeek's first significant model release since January 2025. Based on community leaks (via r/LocalLLaMA and r/DeepSeek), the model builds on the DeepSeek Sparse Attention architecture introduced in V3.2-Exp, with a new component called the DSA Lightning Indexer that enables 1-million-token context windows at roughly 50% less compute cost than comparable models.
A lighter variant has also leaked alongside the flagship — likely aimed at developers who want to self-host.
The original release window was mid-February 2026, around China's Lunar New Year. That slipped. Then late February came and went. Community consensus now points to this week, with March 10–14 being the most cited window. No official announcement yet from DeepSeek.
The Huawei Play Nobody Is Talking About
One detail that's getting buried in the IP headlines: DeepSeek didn't give Nvidia or AMD early access to V4 for optimization testing — which is standard practice in every major model launch. Instead, they gave exclusive pre-release access to Huawei and other Chinese domestic chipmakers.
That's not an oversight. It's strategy. Huawei gets several weeks of head start to tune its processors for V4 workloads, while US chip giants are locked out. DeepSeek is actively using its model releases as leverage in the hardware war — turning software into a moat for Chinese silicon.
For Western AI infrastructure companies, this should be a wake-up call. Huawei's Ascend chips have been quietly improving, and if V4 runs significantly better on Ascend hardware at launch, that performance gap matters in markets where Nvidia chips are restricted or expensive.
The Stolen IP Accusations
Anthropic published a detailed claim in February 2026 that it had identified "industrial-scale campaigns" by three AI labs — DeepSeek, Moonshot AI, and MiniMax — to extract Claude's capabilities through unauthorized distillation. The numbers are striking: over 16 million API interactions made through approximately 24,000 unauthorized accounts, all in apparent violation of Anthropic's terms of service.
Distillation is the practice of training a smaller or newer model on the outputs of a more advanced one. It's not inherently illegal — it's how most efficient open-weight models are built. The controversy is in unauthorized distillation: using another lab's commercial product at scale, without permission, to bootstrap your own competing model.
Anthropic said it attributed each campaign to a specific company with "high confidence." The company is now calling for coordinated policy action from industry and governments.
OpenAI went further. The company testified before the US House Select Committee on China that DeepSeek may have illegally distilled ChatGPT models over the past year. OpenAI has since tightened data center security and hired additional cybersecurity staff specifically in response to what it describes as systematic extraction attempts.
DeepSeek has not publicly responded to either accusation.
Why This Matters More Than the Last DeepSeek Moment
When DeepSeek V3 dropped in January 2025, it shocked Western labs by matching GPT-4-class performance at a fraction of the reported training cost. Nvidia's stock dropped 17% in a single day. The "cheap Chinese AI" narrative was born.
V4 is different. If the leaks are accurate, this isn't just a better model — it's a multimodal model with long-context capabilities that, until recently, only the frontier labs could offer. And it's arriving open-weight, meaning anyone can download and self-host it.
The distillation allegations add a layer that wasn't there before. If Anthropic's claims hold up, V4's capabilities may have been partially built on Claude's shoulders — and by extension, every company using V4 downstream would be benefiting from that extraction. That's a legal and ethical question the industry hasn't fully reckoned with.
There's also the geopolitical context: China's annual Two Sessions parliamentary meetings opened on March 4th. State media has framed DeepSeek as a national AI success story. Releasing V4 in this window — even if delayed — carries symbolic weight. This isn't just a product drop. It's a statement.
What Builders Should Do Now
If you're evaluating open-weight models for production use, wait for V4 benchmarks before committing. The V3.2-Exp already beats Llama 3.3 on most coding and reasoning tasks. V4 with multimodal support and 1M context could make several current tool stacks redundant.
If you're at a company that cares about IP provenance — especially in regulated industries — the distillation accusations matter. Not because V4 is definitely tainted, but because the legal status of distillation-derived models is unresolved. Keep an eye on how OpenAI and Anthropic pursue this legally.
And if you're a developer in Southeast Asia or anywhere outside the US export control zone, pay attention to the Huawei angle. A V4 that runs well on Ascend hardware changes the access calculus in markets where Nvidia H100s are unavailable or prohibitively expensive.
DeepSeek V4 hasn't landed yet. But everything around it already has.
FOLLOW NeuralWire on X for daily AI signal — what matters, why it matters, what to do about it. →