# AI Models

- [Meta Avocado Is Closed-Source. The Llama Era Might Be Over.](https://sdd.sh/2026/05/meta-avocado-is-closed-source.-the-llama-era-might-be-over..md): Meta's next flagship model has been delayed twice, benchmarks below GPT-5.5 and Claude Opus 4.7, and unlike Llama — it won't be open-sourced. Meta is reportedly considering licensing Google Gemini as a stopgap. The open-source AI story Meta spent two years building is quietly unraveling.
- [MiniMax M2.7: The Open-Source Agent That Rewrote Its Own Training Loop](https://sdd.sh/2026/04/minimax-m2.7-the-open-source-agent-that-rewrote-its-own-training-loop.md): MiniMax M2.7 is the first open-source model to participate in its own development cycle — 100 autonomous rounds of scaffold optimization, 30% performance gain, 56.22% on SWE-Pro. It's not just a strong model. It's a glimpse of what model self-improvement looks like in practice.
- [Meta's Muse Spark Is Closed Source. Open-Source AI Just Lost Its Last Major Patron.](https://sdd.sh/2026/04/metas-muse-spark-is-closed-source.-open-source-ai-just-lost-its-last-major-patron..md): Meta Superintelligence Labs shipped Muse Spark — and made it closed-source. The company that framed open AI as a moral imperative just locked the door. Here's what that means for developers who built their stack on Llama.
- [Claude Mythos Goes Official: Project Glasswing and the Zero-Day Reckoning](https://sdd.sh/2026/04/claude-mythos-goes-official-project-glasswing-and-the-zero-day-reckoning.md): Anthropic officially unveiled Claude Mythos Preview on April 7, confirming what the March leak hinted at: a model that autonomously found thousands of zero-days across every major OS and browser. Their response — Project Glasswing — grants restricted access to a select group of tech giants to use Mythos as a defensive weapon. This is the most consequential 'too dangerous to release' moment in AI history.
- [GLM-5.1: The Open-Source Model That Just Beat Everyone on SWE-bench Pro](https://sdd.sh/2026/04/glm-5.1-the-open-source-model-that-just-beat-everyone-on-swe-bench-pro.md): Z.AI released GLM-5.1 today — a 754B open-weight model under MIT license that scored 58.4% on SWE-bench Pro, beating GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro. Its headline demo: an 8-hour autonomous session that built a complete Linux desktop environment across 655 iterations. The closed-model monopoly on frontier coding capability just got its first serious challenge.
- [The SWE-bench Plateau: Three Frontier Models Walk In, All Score 80% — Now What?](https://sdd.sh/2026/04/the-swe-bench-plateau-three-frontier-models-walk-in-all-score-80-now-what.md): Gemini 3.1 Pro, Claude Opus 4.6, and GPT-5.3-Codex are all within 0.8% of each other on SWE-bench Verified. When every frontier model aces the exam, the exam stops being useful. Here's what actually differentiates them.
- [Claude Mythos: The Leaked Model That Scared the Security World](https://sdd.sh/2026/03/claude-mythos-the-leaked-model-that-scared-the-security-world.md): A CMS misconfiguration at Anthropic accidentally revealed 'Claude Mythos' — a model tier above Opus 4.6 that Anthropic itself calls an unprecedented cybersecurity risk. Here's what leaked, what it means for agentic coding, and why the security industry noticed immediately.
