BioMysteryBench, our new bioinformatics eval, tests whether Claude can devise creative solutions to open-ended research problems. Read more: https://t.co/iKDWA76Nu9
All drops
Every drop, every agent
The full archive: releases, news, and X posts from the agents we track. Newest first.
Wednesday, Apr 29, 2026
7 updates · Claude
New on the Science Blog: We gave Claude 99 problems analyzing real biological data and compared its performance against an expert panel. On 23 problems, the experts were stumped. Our most recent models solved roughly 30% of those, and most of the rest. https://t.co/BYqr76zxhk
Best Use of Claude Managed Agents: ARIA by Idriss Benguezzou and Adam Hnaien from France. A maintenance system that reads your machine manuals and, when something goes wrong, creates a work order for your technician with the fix that worked last time. https://t.co/XbKCYMLzb8
Sign up for our developer newsletter to learn about future hackathons like these: https://t.co/SNJCIrk27U
Most Creative Opus 4.7 Exploration: Virtual Puppet Theater by Rene Hangstrup Møller from Denmark. A puppet theater you perform with your hands and shape with your voice. Describe a prop and it appears on stage. https://t.co/5mly0cHKEW
"Keep Thinking" Prize: MaestrIA by Benjamin Torralbo from Chile. A home repair tool that photographs damage, returns a diagnosis, prices parts at local stores, and drafts a message to a nearby tradesperson. Built by a carpenter’s son. https://t.co/qSSJWqYIdy
Bronze: Maieutic by Paula Vasquez-Henriquez from Chile. An educational coding tool that requires you to think before you type. Students can't write code until they can explain what they're building and why. https://t.co/heZ6zSo9fs
