The Distillation Pipeline Is Open for Business


The week's loudest signal wasn't a model launch. It was a pattern that kept intensifying across independent sources for seven straight days: people are distilling frontier model outputs into tiny, uncensored local weights. By Sunday, we counted at least six separate releases spanning distillation of frontier reasoning and outright safety-refusal removal. Qwen3.5 shells stuffed with Claude Opus reasoning. Safety refusals edited out at the weight level. All of it running on consumer GPUs. The supply chain for "frontier-class intelligence you own" is forming rapidly.

Distillation went from experiment to cottage industry. Early in the week we started tracking a 27B Qwen3.5 model distilled from Claude Opus 4.6 reasoning traces. The next day it had spawned a 2B variant. By the weekend, a 9B uncensored version and several "HERETIC" models had joined the party, each one compressing proprietary capability into something you could run on a gaming rig. In parallel, safety-removal tools kept surfacing: new weight-editing methods that strip refusals without touching the model's personality. Multiple independent teams, same playbook. This is no longer a niche hobby. It is a repeatable pipeline for stripping safety training from frontier-grade models and redistributing the results to anyone with a consumer GPU. The capability implications are obvious. The safety implications are harder to contain: once refusals are removed at the weight level, there is no practical way to restore them in the distributed copies.
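For readers who want a concrete picture of what "weight-level refusal removal" means, here is a minimal sketch of one publicly documented family of techniques, difference-of-means direction ablation, written against a generic PyTorch weight matrix. It is illustrative only: the function names are ours, and the specific tools mentioned above may work differently.

```python
import torch

def refusal_direction(refused_acts: torch.Tensor, answered_acts: torch.Tensor) -> torch.Tensor:
    """Unit vector pointing from 'answered' activations toward 'refused' activations.

    Both inputs are hidden states of shape (n_prompts, d_model), collected at one
    layer on prompts the model refuses vs. prompts it answers normally.
    """
    d = refused_acts.mean(dim=0) - answered_acts.mean(dim=0)
    return d / d.norm()

def ablate_direction(weight: torch.Tensor, direction: torch.Tensor) -> torch.Tensor:
    """Project the refusal direction out of a weight matrix that writes into the
    residual stream: W' = (I - d d^T) W. The edited layer keeps doing everything
    else, but it can no longer emit output along the refusal direction."""
    d = direction / direction.norm()
    return weight - torch.outer(d, d) @ weight
```

Applied across every layer that writes into the residual stream, an edit like this is cheap and permanent in the released copies, which is exactly why the redistribution problem above has no good answer.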

The code quality question nobody wants to answer

A culture war is crystallizing around AI-assisted coding. Midweek, multiple independent sources converged on the same fault line. A NYT longform piece interviewing 70+ developers, a widely circulated essay on craft versus results, and heated discussion across developer communities all pointed at the same tension: AI code generation is splitting the profession into people who see it as liberation and people who see it as the end of something valuable. This isn't abstract. Replit just closed a $400M round at a $9B valuation. Lovable reportedly added $100M in revenue in a single month with 146 employees. The market has already chosen a side. Whether the code holds up is a different question, and one nobody has answered with data yet. Across the week, one signal kept recurring: agentic coding tools lack verification mechanisms beyond unit tests. Simon Willison started codifying "agentic engineering" as a discipline. The fact that someone needs to write a guide for how to supervise your AI coder tells you where the maturity level actually is.

The pretraining data arms race has a new front. Friday's strongest signal was a pair of papers on improving code model quality not by scaling compute, but by curating data more intelligently. One proposes reverse software-development pretraining, training models on the process of reconstructing code from its outputs rather than just predicting the next token. The other introduces bidirectional semantic filtering to weed out noisy synthetic training examples. Both landed alongside a new 9B coding agent fine-tuned on 425K agentic trajectories. The implication is clear: this week's strongest code model results came from data engineering rather than architecture changes. If that trend holds, startups still competing on model size are fighting the last war. The ones investing in training data curation and filtering have the structural advantage.
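The filtering idea is easier to see in code. Below is a rough sketch of what a bidirectional semantic filter could look like; the paper's actual criteria are not detailed in this briefing, so `generate_description` and `similarity` are placeholder callables you would back with a model and an embedding metric of your choice.

```python
from typing import Callable, Iterable

def bidirectional_filter(
    pairs: Iterable[tuple[str, str]],            # synthetic (instruction, code) samples
    generate_description: Callable[[str], str],  # code -> natural-language summary
    similarity: Callable[[str, str], float],     # semantic similarity in [0, 1]
    threshold: float = 0.8,
) -> list[tuple[str, str]]:
    """Keep a sample only if the code, translated back into prose, still agrees
    with the instruction that supposedly produced it. Samples where the forward
    and backward directions disagree are treated as noise and dropped."""
    kept = []
    for instruction, code in pairs:
        back_translation = generate_description(code)
        if similarity(instruction, back_translation) >= threshold:
            kept.append((instruction, code))
    return kept
```

The specific threshold matters less than the principle: a cheap consistency check over synthetic data can buy quality that people used to expect only from bigger models.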

Nvidia tipped its hand days before GTC. On Thursday, we were tracking a signal around Nvidia investing heavily in the open-weight model ecosystem. Four days later, SiliconAngle confirmed it at GTC: Nvidia is expanding its open model portfolio and enlisting partners for frontier development. The timing tracked with the release of Nemotron-3-Super-120B, a 120B-parameter MoE model shipping in both FP8 and NVFP4 quantization. Meanwhile, a separate research thread on FP4 bias correction kept building. The picture: Nvidia is positioning itself as the gravity well for open-weight AI, offering both the silicon and the models. If you are competing with them on inference infrastructure, this is the week the competitive landscape shifted.
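To make the bias-correction thread concrete: aggressive low-bit quantization introduces a systematic, non-zero-mean error in layer outputs, and one generic fix is to fold that mean error back into the layer's bias using a small calibration set. The sketch below is a toy illustration of that idea with a hand-rolled E2M1-style grid; it is not NVFP4 or any specific paper's method.

```python
import torch

# Representable magnitudes of an E2M1 (FP4) format: 0, 0.5, 1, 1.5, 2, 3, 4, 6.
_POS = torch.tensor([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])
FP4_LEVELS = torch.cat([-_POS.flip(0)[:-1], _POS])

def fake_quantize(w: torch.Tensor, levels: torch.Tensor = FP4_LEVELS) -> torch.Tensor:
    """Snap each weight to the nearest representable value on the 4-bit grid."""
    scale = w.abs().max() / levels.abs().max()
    idx = (w / scale).unsqueeze(-1).sub(levels).abs().argmin(dim=-1)
    return levels[idx] * scale

def bias_correct(linear: torch.nn.Linear, calib_x: torch.Tensor) -> None:
    """Quantize a linear layer's weights, then shift its bias so the expected
    output on calibration data matches the full-precision layer's."""
    with torch.no_grad():
        reference = linear(calib_x)                         # full-precision outputs
        linear.weight.copy_(fake_quantize(linear.weight))
        error = (reference - linear(calib_x)).mean(dim=0)   # per-channel mean error
        if linear.bias is not None:
            linear.bias.add_(error)
```

A correction like this removes the systematic part of the quantization error; the remaining variance is where the open research sits.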

Peer review has a bot problem. A signal surfaced midweek from research communities: at least one fully AI-generated paper appears to have made it into the ICML review pipeline undetected. No public confirmation yet from conference organizers, but the discussion was heated and specific enough to take seriously. This is the second-order problem nobody budgeted for. If distilled models can now produce passable research prose, the integrity of the publication pipeline is compromised. For anyone whose investment thesis depends on reading the literature to evaluate technical moats, the ground just shifted. You can no longer assume a published paper was written by someone who did the work.

On our radar

Agent sandboxing infrastructure. Early convergence across multiple independent sources on filesystem-level isolation for agentic code execution. Docker's acquisition of NanoClaw landed at the end of the week, but the underlying signal had been building since Tuesday. Expect more deals in this space.

FP4 quantization reliability. The tension between aggressive compression and output quality is unresolved. Research on bias correction is accelerating. If someone cracks reliable FP4 inference, the cost floor drops again.

Academic AI integrity. The ICML peer review contamination signal is still building with no official confirmation. If conference organizers go public with detection rates, the fallout will be significant.

Signal data for this briefing is provided by HiddenState, Mosaic Theory's signal intelligence platform.

— Cosmo