Goblinmaxxing in Codex https://t.co/79pIPa31M8
All drops
Every drop, every agent
The full archive: releases, news, and X posts from the agents we track. Newest first.
Activity, last 6 months
Wednesday, Apr 29, 2026
14 updates · Claude, Codex, Cursor▾
Wednesday, Apr 29, 2026
14 updates · Claude, Codex, CursorYou can just build web apps https://t.co/oa084toBcc
https://t.co/Z4lL8o6gnr
BioMysteryBench, our new bioinformatics eval, tests whether Claude can devise creative solutions to open-ended research problems. Read more: https://t.co/iKDWA76Nu9
New on the Science Blog: We gave Claude 99 problems analyzing real biological data and compared its performance against an expert panel. On 23 problems, the experts were stumped. Our most recent models solved roughly 30% of those,and most of the rest. https://t.co/BYqr76zxhk
Sign up for our developer newsletter to learn about future hackathons like these: https://t.co/SNJCIrk27U
Best Use of Claude Managed Agents: ARIA by Idriss Benguezzou and Adam Hnaien from France A maintenance system that reads your machine manuals, and when something goes wrong, creates a work order for your technician with the fix that worked last time. https://t.co/XbKCYMLzb8
"Keep Thinking" Prize: MaestrIA by Benjamin Torralbo from Chile A home repair tool that photographs damage, returns a diagnosis, prices parts at local stores, and drafts a message to a nearby tradesperson. Built by a carpenter’s son. https://t.co/qSSJWqYIdy
Bronze: Maieutic by Paula Vasquez-Henriquez from Chile An educational coding tool that requires you to think before you type. Students can't write code until they can explain what they're building and why. https://t.co/heZ6zSo9fs
Most Creative Opus 4.7 Exploration: Virtual Puppet Theater by Rene Hangstrup Møller from Denmark A puppet theater you perform with your hands and shape with your voice. Describe a prop and it appears on stage. https://t.co/5mly0cHKEW
Customers like Rippling, Notion, C3 AI, and Faire are using the Cursor SDK to build custom background agents, take bugs from ticket to merge-ready PR, and maintain self-healing codebases. Learn more: https://t.co/mcjEXKZjTq
We've open-sourced a few starter projects for you to build on: a coding agent CLI, a prototyping tool, and an agent-powered kanban board. Use Cursor to customize them for your use case: https://t.co/W0bWRZx2xD
With the Cursor SDK, you can run agents locally or deploy them in our cloud. https://t.co/rUHEXuynGs
We’re introducing the Cursor SDK so you can build agents with the same runtime, harness, and models that power Cursor. Run agents from CI/CD pipelines, create automations for end-to-end workflows, or embed agents directly inside your products. https://t.co/bRcn9xjuVz

