The gap between 'working code' and 'listed product' has collapsed. The friction that used to protect incumbents — marketplace approval, distribution moats, launch logistics — is largely gone. What that means for what's actually hard now.
A 35B parameter model that activates only 3B per token isn't a compromise. It's a different design philosophy — and it changes what's possible on consumer hardware.