Weekly Digest

May 25-31, 2026

AI Cracks Open Math as Safety Cracks Show

Stories49

Unverified9

Read time5 min read

49 Stories9 unverified5 min read

The Big Picture

For decades, hard combinatorics problems on Paul Erdős’s lists resisted brute force and human intuition alike. Last week, an AI system paired with the Lean proof checker reportedly solved 9 open Erdős problems and formally proved 44 OEIS conjectures, a striking sign that language models are starting to contribute in domains where being almost right is useless.

Elsewhere, the mood was split between acceleration and alarm. Financial Times reporting said safety protections on some Meta and Google models could be stripped within minutes, turning alignment into a distribution problem rather than a one-time training fix. At the same time, AI coding tools kept getting more agentic: Claude Code added dynamic workflows, local plugins, and broader cloud support, while vLLM pushed faster decoding and Biohub released a substantial open protein-design stack.

The pattern is becoming clearer. AI is getting better at doing serious work in strict environments, from formal math to software engineering and biology, while the scaffolding around it matters more than ever. A researcher can now imagine using AI to explore proof strategies, a developer can delegate larger coding jobs to coordinated agents, and a biotech team gets more open tools for protein design instead of relying only on closed labs.

Watch the next wave closely: model capability gains now arrive alongside infrastructure, deployment, and safety stress tests. The frontier is moving forward, but so is the pressure to make these systems robust in the wild.

AGI Probability Assessment

View TrackerTracker

68.8%+0.8%

Est. 18 months to AGI

Chance of production-ready AGI within 3 years, assessed by AI analysis of this week's developments

Last week extended the prior math-and-science momentum with a stronger formal reasoning signal: the reported Lean-backed solution of 9 open Erdős problems and 44 OEIS conjectures is more consequential than ordinary benchmark gains because it works in a domain where near-correct answers do not count. The increase is modest rather than large because most of the rest was incremental agent and infrastructure progress, and the report that safeguards on some accessible models can be stripped in minutes highlights deployment fragility rather than a core capability breakthrough.

Last Week in Numbers

Open Erdős problems reportedly solved with formal proofs

OEIS conjectures reportedly proved by the same system

60%

Usable review rate for GPT-5.2 in paper-review testing

Open Erdős problems reportedly solved with formal proofs

OEIS conjectures reportedly proved by the same system

60%

Usable review rate for GPT-5.2 in paper-review testing

Key Developments

Major|x.com

AI reportedly solves nine Erdős problems

This is significant because formal mathematics is one of the hardest places for AI to fake competence. Previously, language models could suggest plausible proof ideas but often failed on rigor; now a system combining an LLM with Lean reportedly produced machine-checkable proofs for 9 open Erdős problems and 44 OEIS conjectures.

For instance

More weeklies

AI Cracks an 80-Year-Old Geometry ConjectureOlder

Weekly Digest

Terminal

Weekly Digest

Weekly Digest

Weekly Digest

Weekly Digest

AI Cracks Open Math as Safety Cracks Show

AI reportedly solves nine Erdős problems

Model safeguards reportedly removed in minutes

Claude Code becomes more agentic

Biohub opens a broad protein AI stack

GPT-5.2 reportedly beats human reviewers

vLLM improves tricky speculative decoding

Open image dataset targets generation training

MiniCPM5 pushes capable models to 1B