Tokens & Signals

Tokens & Signals for 5/18/2026. We scanned ~1,200 Twitter accounts (1908 tweets), 13 subreddits (82 posts), Hacker News (21 stories), 7 newsletter posts, 8 podcast episodes, 231 Discord messages, and leaderboard data for you. Estimated reading time saved: ~18 hours.

TLDR & AI Twitter Recap

* Claude Code is getting serious: Developers are cutting error rates from 41% to 3% by using "harness engineering" and persistent CLAUDE.md context files. x.com/DeRonin_/status/2056300651764711879

* Cursor's new update: Composer 2.5 is here, claiming a 10x jump in cost efficiency and better performance on long-running tasks. x.com/testingcatalog/status/2056416963153269148

* OpenAI legal win: A federal jury tossed the Musk vs. OpenAI lawsuit — claims were time-barred, case dismissed. x.com/ns123abc/status/2056445805200883725

* Anthropic snaps up Stainless: They acquired the API/SDK platform to sharpen developer experience and agent connectivity. x.com/AnthropicAI/status/2056419620643541012

* Qwen3.7 is climbing: Alibaba's latest models hit the Arena, ranking #6 in text and #5 in vision. x.com/Alibaba_Qwen/status/2056403591464984753

* Autonomous architecture: Meta researchers dropped a paper on an agent that designs its own neural architectures, beating Llama 3.2 benchmarks on a tight budget. x.com/omarsar0/status/2056434731508703607

* Hermes Agent v0.14.0: The new release features automated Kanban board management — agents can now triage and assign their own subtasks. x.com/NousResearch/status/2056110234309939330

* Token efficiency hack: A new tool called Semble claims 98% fewer tokens for code search compared to standard grep methods. news.ycombinator.com/item?id=48169874

* Starchild-1: Odyssey ML is showing off a real-time multimodal world model that simulates physics and sound. x.com/odysseyml/status/2056416808580816957

* @Teknium on agentic progress: "We are moving away from manual prompting and into autonomous systems that manage their own subtasks and specialized profiles." x.com/Teknium/status/2056275882780856741

* @karpathy on AI coding: "The best IDE is the one that writes more code than you do."

Best to Build With Today

* Coding — claude-opus-4-7-thinking

* Reasoning — claude-opus-4-7-thinking

* Chat — gemini-3.1-pro

* Agentic Workflows — Hermes Agent v0.14.0

* Value Pick — Cursor Composer 2.5

Deeper Dives

💼 Industry & Business

Anthropic Acquires Stainless API Platform

Anthropic just bought Stainless, a company that automates SDK generation and API documentation. The goal: make building on the Claude API so smooth that developers don't think twice about it.

* Why it matters: It's a clear play to own the developer ecosystem by making integration feel effortless.

� Twitter� Hacker News

anthropic.com/news/anthropic-acquires-stainless · news.ycombinator.com/item?id=48182281

Elon Musk Loses Lawsuit Against OpenAI

A federal jury unanimously dismissed Musk's lawsuit against OpenAI — claims like breach of charitable trust were ruled time-barred. The court never got to the merits, but a dismissal is a dismissal, and it clears a real cloud hanging over OpenAI.

* Why it matters: One less distraction. OpenAI can focus on scaling instead of the courtroom.

� Twitter� Hacker News� Reddit

x.com/ns123abc/status/2056445805200883725

Cerebras CFO Forecasts Massive TPU Shipments

UBS modeling is projecting TPU shipments to hit 9.87 million units by 2027, driven by the compute demands of autonomous agentic workloads.

* Why it matters: Hardware providers are making a big bet that agents are the next compute bottleneck — and they're probably right.

� Twitter

x.com/jukan05/status/2056313542589174057

🧠 Models & Research

Qwen3.7 Preview Models Land on Arena

Alibaba dropped previews of Qwen3.7-Max and Qwen3.7-Plus, and they've already climbed to #6 in text and #5 in vision on the LMSYS Arena. Strong performance, runs well on commodity hardware.

* Why it matters: Non-US labs are closing the gap fast — top-tier reasoning without the enterprise price tag.

� Twitter� Reddit� Discord

x.com/Alibaba_Qwen/status/2056403591464984753

Meta Researchers Discover Neural Architectures Autonomously

Meta's new paper describes an agentic loop that designs its own neural architectures — and outperforms Llama 3.2 at smaller scales while doing it. The humans are increasingly just watching.

* Why it matters: Using AI to build better AI isn't a thought experiment anymore. It's working.

� Twitter

arxiv.org/abs/2605.16147

🚀 Products & Launches

Claude Code and Codex Workflow Best Practices

Developers are landing on "harness engineering" as the go-to approach for coding agents — persistent CLAUDE.md files, clear goals, tight context. The payoff is real: mistake rates dropping from 41% to 3%.

* Why it matters: How you set up the environment often matters more than which model you pick.

� Twitter

x.com/DeRonin_/status/2056300651764711879

Composer 2.5 Released with 10x Cost Efficiency

Cursor's latest update claims a 10x improvement in cost efficiency for complex, multi-turn development work. Better token usage, lower latency — the kind of update that actually changes your daily workflow.

* Why it matters: Serious agentic coding just got a lot more affordable to run.

� Twitter

x.com/testingcatalog/status/2056416963153269148

Nous Research Hermes Agent v0.14.0 Foundation Release

The latest Hermes release adds Kanban board automation — the agent triages its own tasks and hands them off to specialized profiles. It's starting to feel less like a tool and more like a junior project manager.

* Why it matters: Autonomous task decomposition is one of the last big hurdles before AI agents become genuinely useful on real projects.

� Twitter� Discord

x.com/NousResearch/status/2056110234309939330

Launches

* Composer 2.5 — Major efficiency update for coding agents. x.com/mntruell/status/2056418797473640681

* Claude Design Update — Doubled token limits across all plans. x.com/claudeai/status/2056460045756309820

* Hermes Agent v0.14.0 — New autonomous task-decomposition capabilities. x.com/NousResearch/status/2056110234309939330

* Semble — Efficient code search tool for agents. github.com/MinishLab/semble

Closing thought: The shift from "chatting with an AI" to "configuring an AI agent to do a job" is happening faster than anyone expected — and it turns out the secret sauce is better guardrails and smarter environment design, not just bigger models.

Anthropic’s Developer Play: Acquiring Stainless API

TLDR & AI Twitter Recap

Go deeper on what matters to you

Best to Build With Today

Deeper Dives

💼 Industry & Business

🧠 Models & Research

🚀 Products & Launches

Launches