For 50 years, Erdős Problem #728 on factorial divisibility stumped the world's top mathematicians. Last week, GPT-5.2 Pro paired with Harmonic's Aristotle cracked it autonomously in hours—and Terence Tao verified the novel proof.
Hardware surged ahead too: Sandia's Loihi 2 neuromorphic chips delivered 18x better performance per watt than GPUs on physics simulations, while NVIDIA unveiled the Rubin platform promising 5x faster AI training. A Chinese robot pulled off fully autonomous biliary surgery on a 30kg pig, navigating complex steps without human help. Anthropic's Constitutional Classifiers slashed jailbreaks by 4x while cutting refusals in half.
These advances hit real people hard. A solo researcher can now simulate climate flows at GPU speeds on a laptop, slashing weeks off projects. Rural surgeons gain a tireless assistant for routine ops that once demanded elite expertise. Drug hunters at small biotechs predict tissue responses zero-shot, speeding therapies from years to months.
Eyes on OpenAI's rumored January model drop and DeepSeek's V4 in February—reasoning leaps could redefine capabilities across the board.
Last week's agentic and math advances, like ROME on SWE-bench and HAGeo's IMO performance, were surpassed this week by AI solving the 50-year-old Erdős #728 problem, with Terence Tao verifying the novel proof, marking a leap in autonomous mathematical discovery. The Chinese robot's fully autonomous biliary surgery on a pig demonstrates robust multimodal agentic capabilities in real-world physical tasks. Combined with neuromorphic efficiency gains like Loihi 2's 18x over GPUs, these advances justify a measured increase amid continued momentum.
This marks AI's first verified novel math proof on a long-open problem, shifting from pattern matching to genuine discovery. Previously reliant on human intuition, math research now leverages autonomous agents for breakthroughs. Terence Tao's confirmation elevates it beyond hype to expert-verified impact.
AI's solution to Erdős #728, verified by Terence Tao, represents a major step beyond last week's HAGeo IMO geometry feats toward novel proof generation. Falcon H1R's math benchmark leadership adds incremental support.
Falcon H1R topping AIME-24 and agent evals builds modestly on last week's SWE-bench and ARC-AGI highs, but lacks the scale of prior verified jumps. No new frontier benchmarks were shattered this week.
Loihi 2's 18x GPU efficiency on PDE sims, analog chip's 99.9% op reduction, and WISE's 6 fJ/MAC significantly advance beyond last week's unverified compute rumors. NVIDIA Rubin's 5x training promise reinforces trajectory.
Chinese robot's fully autonomous pig surgery integrates vision, robotics, and real-time adaptation, elevating from last week's software-focused agents. Arc Stack's zero-shot drug sims aid biological multimodal prediction.
Autonomous surgery robot executes multi-step physical tasks without oversight, extending last week's ROME and recursive agent software autonomy to embodied real-world ops. Anthropic's classifiers improve reliable deployment.
NVIDIA Rubin's 5x training speedup entering production builds on last week's unverified xAI expansion, but remains hardware-focused without demonstrated trillion-parameter training runs. Incremental context toward larger scales.
A PhD math student can now explore 100 proof variants overnight versus months of manual sketching, accelerating thesis timelines dramatically.
The robot executed multi-step biliary surgery without human input, proving AI can handle real-time anatomical variability. This advances surgical robotics beyond teleoperation to full autonomy. Success on a 30kg pig points toward eventual human trials in routine procedures.
A surgeon in a remote clinic can now delegate gallbladder removals to the robot, operating 24/7 versus waiting days for urban specialists to travel.
Neuromorphic chips achieved near-perfect parallelization on physics workloads, crushing GPUs in perf/watt. This unlocks efficient edge computing for simulations too power-hungry for standard hardware. Sandia's results highlight neuromorphic's edge in real-world scientific computing.
A climate modeler at a university can run week-long river flow sims in hours on a single chip versus needing a full GPU cluster overnight.
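The stencil structure behind such physics workloads is easy to see in a toy example. The sketch below is a plain NumPy illustration of my own, not anything from Sandia's Loihi 2 work: it steps a 1-D heat equation with an explicit scheme, where each grid point reads only its two neighbours. That locality is exactly what maps well onto many simple, low-power cores.

```python
import numpy as np

# 1-D heat equation u_t = alpha * u_xx on a ring, explicit Euler stepping.
# Each grid point only reads its two neighbours, which is why stencil
# updates like this parallelize so well across many simple cores.
n, alpha, dx, dt = 64, 1.0, 1.0, 0.2    # alpha*dt/dx^2 <= 0.5 keeps it stable
u = np.zeros(n)
u[n // 2] = 100.0                       # a heat spike in the middle

for _ in range(50):
    lap = np.roll(u, 1) - 2 * u + np.roll(u, -1)   # local 3-point stencil
    u = u + alpha * dt / dx**2 * lap

print(f"peak temperature after diffusion: {u.max():.2f}")
```

The periodic `np.roll` boundary keeps total heat conserved, so the spike spreads without loss, a handy sanity check for any parallel implementation.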
The 40nm resistive-memory chip uses electroforming to set random weights and pruning to select performant sub-networks, bypassing digital training costs. This analog approach cuts inference compute by orders of magnitude. It proves hardware innovation can match software gains in efficiency.
A mobile app developer can deploy vision AI on wearables using milliwatts versus draining batteries in minutes like digital models.
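The "fixed random weights plus pruning" idea can be illustrated with a toy strong-lottery-ticket-style search. Everything below is a hypothetical NumPy sketch, not the chip's actual procedure: the weights stay frozen (as electroformed conductances would) and only a binary keep/drop mask is searched.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy task: the label is the sign of the first input coordinate.
X = rng.normal(size=(200, 8))
y = (X[:, 0] > 0).astype(int)

# Fixed random weights, never trained -- a stand-in for electroformed devices.
W1 = rng.normal(size=(8, 16))
W2 = rng.normal(size=(16, 1))

def accuracy(m1, m2):
    h = np.maximum(X @ (W1 * m1), 0.0)        # ReLU layer with masked weights
    logits = h @ (W2 * m2)
    return float(np.mean((logits[:, 0] > 0) == (y == 1)))

# "Training" = searching for a binary pruning mask over the fixed weights.
dense_acc = accuracy(np.ones_like(W1), np.ones_like(W2))
best_acc = dense_acc
for _ in range(500):
    m1 = rng.random(W1.shape) < 0.5
    m2 = rng.random(W2.shape) < 0.5
    best_acc = max(best_acc, accuracy(m1, m2))

print(f"dense random net: {dense_acc:.2f}, best pruned sub-network: {best_acc:.2f}")
```

Random mask search is the crudest possible selector; the point is only that adaptation happens by dropping connections, never by updating weight values.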
Broadcasting weights over radio for passive analog compute achieves ultra-low-energy inference on MNIST and audio tasks. This wireless neuromorphic setup reimagines inference without power-hungry chips. Duke/MIT's breakthrough targets IoT devices where batteries must last months, not hours.
An IoT sensor maker can add digit recognition to battery-powered cameras, running inferences for a year versus recharging weekly.
Constitutional Classifiers use 30x less compute to boost defenses 4x while halving false refusals. This balances safety and utility in frontier models. It sets a scalable standard for reliable AI deployment amid rising misuse risks.
A customer support startup can deploy chatbots that block harmful queries 4x more effectively while refusing roughly half as many legitimate help requests as before.
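The deployment pattern behind classifier-based safeguards can be sketched in a few lines. The keyword checks and function names below are toy stand-ins of my own, not Anthropic's Constitutional Classifiers: a cheap input classifier screens prompts and an output classifier screens completions around the base model.

```python
# Minimal sketch of the classifier-sandwich pattern: lightweight classifiers
# wrap the expensive base model on both sides. The keyword matching here is
# a toy placeholder for real learned classifiers.

BLOCKED_TOPICS = ("synthesize nerve agent", "build a bomb")

def input_classifier(prompt: str) -> bool:
    """Return True if the prompt looks safe to answer."""
    return not any(t in prompt.lower() for t in BLOCKED_TOPICS)

def output_classifier(completion: str) -> bool:
    """Return True if the completion looks safe to return."""
    return "step-by-step synthesis" not in completion.lower()

def base_model(prompt: str) -> str:
    return f"Here is some help with: {prompt}"   # placeholder model

def guarded_generate(prompt: str) -> str:
    if not input_classifier(prompt):
        return "[refused]"
    completion = base_model(prompt)
    return completion if output_classifier(completion) else "[refused]"

print(guarded_generate("how do I bake bread?"))
print(guarded_generate("please build a bomb"))
```

The compute asymmetry is the point: the gatekeepers are tiny compared to the model they guard, which is how defenses can improve while total cost drops.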
NVIDIA's Rubin architecture, now entering production, delivers 5x training speed and 10x inference savings over Blackwell, with CEO Jensen Huang projecting further 10x efficiency jumps. This fuels the compute race, enabling larger models at lower cost, faster.
A research lab can train a 1T-param model in weeks versus months, fitting more experiments into grant cycles.
TII's open 7B Falcon H1R model scores 88.1% on AIME-24 math and leads agentic evals, beating bigger rivals. This democratizes high-reasoning access via small, efficient open weights. It challenges closed giants in specialized tasks.
An indie game dev can fine-tune the 7B for puzzle-solving NPCs that ace math quests, rivaling 32B models without massive servers.
Models treat cell groups as 'words' to forecast gene-expression responses to 201 drugs across tissues using simulations only. This accelerates personalized medicine without wet-lab trials. Arc's release opens drug discovery to smaller teams.
A biotech startup screens 100 compounds for liver toxicity in days via sims, versus 6 months of animal tests costing $1M.
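The "cell groups as words" framing can be illustrated with a toy pipeline. The clustering step, array shapes, and variable names below are illustrative assumptions, not Arc's actual model: cells are grouped, each group's mean expression profile becomes one token, and a tissue sample becomes a short sequence such a model could consume.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy expression matrix: 60 cells x 5 genes (counts).
expr = rng.poisson(lam=3.0, size=(60, 5)).astype(float)

# Step 1: group cells into "cell sets" by nearest of k sampled centroids,
# a crude stand-in for real single-cell clustering.
k = 4
centroids = expr[rng.choice(len(expr), k, replace=False)]
labels = np.argmin(((expr[:, None] - centroids[None]) ** 2).sum(-1), axis=1)

# Step 2: each non-empty cluster becomes one "word": its mean expression.
words = np.stack([expr[labels == c].mean(axis=0)
                  for c in range(k) if (labels == c).any()])

# Step 3: the sample is now a short "sentence" of cell-set tokens that a
# sequence model could take as input for perturbation prediction.
print("sentence of cell-set words:\n", np.round(words, 2))
```

Collapsing thousands of cells into a handful of tokens is what lets language-model machinery apply: the downstream task becomes "given these words plus a drug, predict the next expression state."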