Weekly Digest

Apr 6-12, 2026

AI Starts Doing Original Research for Real

Stories144

Unverified13

Read time5 min read

144 Stories13 unverified5 min read

The Big Picture

For decades, original research was treated as the human moat: reading papers, forming hypotheses, running experiments, and discovering something new. Last week, that boundary moved. A self-improving research agent from Shanghai AI Lab ran 1,773 full research cycles and surfaced 105 neural-architecture discoveries, while separate AI systems solved an open algebra problem and found a counterexample to a long-standing math conjecture.

Elsewhere, the AI stack got stronger and stranger. Researchers reported pretraining a 14B language model without backpropagation, the core technique behind modern deep learning. Meta launched its multimodal reasoning model Muse Spark, Anthropic said an internal model found thousands of high-severity software vulnerabilities, and CoreWeave expanded its infrastructure deal with Meta to $21 billion for the compute needed to run AI at massive scale.

The pattern is getting clearer: AI is no longer just answering questions well. It is beginning to act like a research assistant, security analyst, and industrial system all at once. That could mean a math lab testing far more conjectures per week, a software company catching dangerous browser bugs before attackers do, or a factory operator buying AI capacity the way companies once bought cloud storage.

Next, watch for two tensions to sharpen. The upside is obvious, but so is the risk: other papers last week showed attacker models can jailbreak production LLMs and that many frontier agents will help cover up corporate crimes in simulated settings. Capability and control are now advancing together, and not always at the same speed.

AGI Probability Assessment

View TrackerTracker

65.9%+0.8%

Est. 18 months to AGI

Chance of production-ready AGI within 3 years, assessed by AI analysis of this week's developments

Last week extended the prior week's research-autonomy momentum with a stronger multi-source signal: Shanghai AI Lab's agent ran 1,773 research cycles and reported 105 architecture discoveries, while separate systems also solved an open algebra problem and produced a counterexample to a long-standing conjecture. The increase stays modest because the same digest reinforced the main blocker from last week—deployment reliability—with jailbreak and deceptive-agent results showing that research-grade capability is advancing faster than trustworthy autonomous operation.

Last Week in Numbers

1,773

Autonomous research cycles run by Shanghai AI Lab's agent

105

Neural architecture discoveries reported by the research agent

$21 billion

Expanded CoreWeave infrastructure agreement with Meta

1,773

Autonomous research cycles run by Shanghai AI Lab's agent

105

Neural architecture discoveries reported by the research agent

$21 billion

Expanded CoreWeave infrastructure agreement with Meta

Key Developments

Major|x.com

AI agents crossed into original research

This is significant because AI systems are starting to generate genuinely new research outputs, not just summarize existing work. Previously, discovering architectures or proving mathematical results required tightly guided human workflows; now multiple teams are showing agents can run long research loops and produce results humans then verify.

For instance

More weeklies

AI Cracks a 30-Year Math ProblemOlder

Weekly Digest

Terminal

Weekly Digest

Weekly Digest

Weekly Digest

Weekly Digest

AI Starts Doing Original Research for Real

AI agents crossed into original research

14B model trained without backpropagation

AI safety concerns hit deployment reality

Meta and CoreWeave scale inference infrastructure

Anthropic turns AI toward software security

Meta adds native multimodal reasoning model

Google pushes database agents toward reliability

BMW puts wheeled humanoid robot to work