Tokens & Signals

Tokens & Signals for 5/26/2026. We scanned ~1,200 Twitter accounts (1124 tweets), 13 subreddits (55 posts), Hacker News (6 stories), 5 newsletter posts, 4 podcast episodes, 138 Discord messages, and leaderboard data for you. Estimated reading time saved: ~10 hours.

TLDR & AI Twitter Recap

Claude Mythos and GPT-5.5 just cracked an 80-year-old math problem — the unit distance problem — which is pretty hard to dismiss as "just autocomplete." x.com/_sholtodouglas/status/2059303540150137244

Uber's leadership is sounding the alarm on AI spending: they burned through their entire 2026 AI budget in four months and still can't point to clear ROI. Awkward. news.ycombinator.com/item?id=48277485

MiniMax teased the M3 model, and the efficiency numbers are wild — up to 15.6x faster decoding thanks to a new sparse attention architecture. x.com/testingcatalog/status/2059339175778738349

SpaceX locked down a $2.29B contract for Starshield, a military-grade LEO satellite network for the Space Force, targeting 2027 for launch. x.com/ns123abc/status/2059379406653509929

Meta open-sourced a new GPU attention kernel that speeds up operations by 2.3x on NVIDIA Blackwell hardware. Free speed, basically. x.com/PyTorch/status/2059291743632101697

Gemini 3.5 Flash is clocking ~280 tokens/sec while still nailing vision tasks — exactly what you want for anything agentic. x.com/ArtificialAnlys/status/2059316050391634302

@aidenybai on the Superset IDE launch: runs hundreds of coding agents in parallel, and it's already seeing 30% weekly growth. x.com/aidenybai/status/2059303624266977296

StableBrowse is a new browser layer for agents that cuts token usage by 70% by converting websites into smart execution graphs. x.com/ycombinator/status/2059394598506786960

Spain has blocked Polymarket and Kalshi, telling both prediction markets they need actual gambling licenses to operate there. news.ycombinator.com/item?id=48279316

@SakanaAILabs on science forecasting: new research finds AI is no better than humans at predicting physics or biology breakthroughs — turns out science is an "evolutionary search process" that's just hard to game. x.com/SakanaAILabs/status/2059166749761872342

Best to Build With Today

Coding — gpt-5.4-xhigh leads for agentic coding workflows.

Reasoning — claude-opus-4-6-thinking-auto is the top performer for reasoning-intensive tasks.

Math — gpt-5.5-xhigh dominates all math benchmarks.

Chat — gemini-3-pro holds the #1 spot for general chat and creative writing.

Image generation — Bonsai (4B) is the move if you need high-quality diffusion running locally in a browser on 3GB.

Agentic Orchestration — Superset IDE for parallel CLI agent workflows.

Value pick — gemini-3-5-flash for high-speed, cost-effective agentic workloads.

Deeper Dives

💼 Industry & Business

Uber COO Questions ROI on AI Spending

Uber is hitting a wall. Despite 95% of engineers using AI tools and 70% of code being AI-generated, the company burned through its entire 2026 AI budget by April — and now there's an internal review to figure out what they actually got for it.

Why it matters: This is the clearest signal yet that enterprise AI is leaving the "experiment freely" phase and entering the "justify the bill" phase.

� Hacker News� The Verge

SpaceX Awarded $2.29B Starshield Pentagon Contract

The U.S. Space Force tapped SpaceX for a $2.29 billion contract to build a military-grade communication backbone in low Earth orbit. This isn't civilian Starlink — it's a dedicated platform designed to link sensors and shooters, with a fully operational prototype due by late 2027.

Why it matters: SpaceX is cementing itself as the critical infrastructure layer for modern defense communications. That's a big moat.

� Twitter

Spain Blocks Polymarket and Kalshi

Spanish regulators officially blocked both platforms for operating without gambling licenses. It's a clean example of governments deciding prediction markets are just betting by another name.

Why it matters: Regulatory pressure is mounting, and it's going to complicate the global expansion story for decentralized information markets.

� Hacker News

🧠 Models & Research

Claude Mythos and GPT-5.5 Solve Unit Distance Problem

Both models cracked the 80-year-old planar unit distance problem — moving past traditional square grid approaches to find genuinely new, elegant proofs. This one is hard to explain away as pattern matching.

Why it matters: Models are graduating from clever chatbots to tools capable of real scientific discovery.

� Twitter� Reddit

MiniMax M3 Teased

MiniMax previewed the M3 before launch, and the efficiency story is the headline: a new sparse attention architecture delivering 9.7x faster prefilling and 15.6x faster decoding.

Why it matters: Long context windows are only useful if they're actually fast. Architecture wins like this are what make them viable at scale.

� Twitter

AI No Better at Predicting Breakthroughs

A joint study from Oxford, Stanford, and Sakana AI found that AI is no better than human experts at predicting scientific breakthroughs. Their framing is useful: science is more like an evolutionary search process than a problem you can just formalize and optimize.

Why it matters: Worth keeping in mind the next time someone promises AI will solve science in five years.

� Twitter

LLM "Sleep" Consolidation

New research introduces a sleep-like mechanism where models consolidate context into persistent fast weights during offline recurrent passes — improving reasoning and knowledge stability without hurting latency.

Why it matters: Borrowing from how biological memory actually works might be one of the more promising paths to reducing hallucinations.

� Hacker News

🚀 Products & Launches

StableBrowse Launches

StableBrowse attacks the token cost of web navigation head-on. By converting sites into reusable execution graphs, it cuts token usage by 70% and pushes execution speed up by 3-4x.

Why it matters: Agents doing browser tasks are expensive to run. Efficiency gains like this are what make them practical at scale.

� Twitter

Superset IDE

Superset is an open-source IDE built for the agent era — spin up hundreds of parallel AI coding agents, each running in isolated Git worktrees, managed like a professional workforce.

Why it matters: The workflow is shifting from "me and my AI assistant" to "me managing a team of agents." Superset is betting on that future arriving fast.

� Twitter

Funding & Deals

Eli Lilly to acquire three vaccine developers (Curevo, LimmaTech Biologics, and Vaccine Company) for up to $3.8 billion.

Launches

Meta TLX Block Attention — A new open-source GPU kernel for PyTorch that delivers a 2.3x speed boost for NVIDIA Blackwell.

Bonsai Image 4B — New 1-bit and ternary image diffusion models that run locally in your browser.

Closing thought: We're clearly moving from "AI, just use it" to "AI, show me the money." The efficiency work happening in tools like StableBrowse and Gemini 3.5 Flash suggests the next few months will be defined less by capability announcements and more by making agents actually sustainable to run at scale.

AI Solves the Unit Distance Problem: Autocomplete No More

TLDR & AI Twitter Recap

Go deeper on what matters to you

Best to Build With Today

Deeper Dives

💼 Industry & Business

🧠 Models & Research

🚀 Products & Launches

Funding & Deals

Launches