Composer 2 is 50% off in the SDK this weekend. Enjoy! https://t.co/dJj3XrXTDc
All drops
Every drop, every agent
The full archive: releases, news, and X posts from the agents we track. Newest first.
Activity, last 6 months
Saturday, May 2, 2026
3 updates · Codex, Cursor▾
Saturday, May 2, 2026
3 updates · Codex, CursorNO PURCH. NECESSARY. A PURCH. WILL NOT INCREASE CHANCES OF WINNING. 18+. Runs 5/2/26 at 8:00:00 AM ET through 3:59:59 PM ET on 5/9/26. Total ARV of all prizes: $350. Limit 1 prize per person. Void where prohibited. Sponsor: OpenAI OpCo, LLC 1455 3rd St., San Francisco, CA 94158.
Show us the Codex pets you hatched. Use /hatch to create your own Codex pet. We’ll pick 10 favorites to get 30 days of ChatGPT Pro. https://t.co/6SE8ADlCQP
Friday, May 1, 2026
10 updates · Claude, Codex▾
Friday, May 1, 2026
10 updates · Claude, CodexLearn about Codex pets before adopting your own: https://t.co/rscKfBIErG
To create your own pet, install the hatch-pet skill: https://t.co/TtvUPj9kug https://t.co/8TF9SLmV0H
It’s really that easy. https://t.co/Kp6C8wdZBc
Bring your workflow to Codex in just a few clicks. Import settings, plugins, agents, project configuration, and more so you can keep working with fewer interruptions. Your move. https://t.co/uRu5VSeWuz
Curious about Codex? It's time to switch. You can migrate to Codex directly in the Codex app and the CLI. https://t.co/EOI980j9e9
Your pet can keep an eye on what Codex is doing while you keep working: https://t.co/rsnI3V0d8T
Customize your Codex pet with /hatch https://t.co/6TUwiQJv8w
Pets. Now in Codex. Use /pet to wake your pet. https://t.co/aAm4lLP4LW
Code with Claude, our developer conference, returns next week. Whether you're just getting started with Claude Code or you've been building for a while, there's a session for you. Register for the livestream: https://t.co/GJwOPMDLEC https://t.co/uwAZgWPu7F
One week since the launch of GPT-5.5, and it’s already our strongest model launch yet. API revenue is growing more than 2x faster than any prior release, while Codex doubled revenue in under seven days as enterprise demand for agentic coding tools keeps climbing.
Thursday, Apr 30, 2026
24 updates · Claude, Codex, Cursor▾
Thursday, Apr 30, 2026
24 updates · Claude, Codex, CursorWork faster with Codex. https://t.co/1yvZfXASQx
From draft to deck, review the work as it takes shape inside Codex. Open the file, ask for changes, and keep tweaking it in the same thread. https://t.co/9NJV2pwAxZ
During setup, Codex recommends useful plugins for your role and guides you through connecting apps like @SlackHQ, @GoogleWorkspace, @Microsoft365, and more. https://t.co/MuCzMVzILz
With Codex, everyone has a personal assistant. Codex will summarize data from different apps and docs, plan next steps, draft work, organize research, or create a project plan. https://t.co/Bgpye1KNDz
As Codex works, you can see what’s happening at a glance, including task progress, the files and tools it used, and what comes next. https://t.co/pjYv9A32BZ
It's never been easier to do everyday work with Codex. Choose your role, connect the apps you use every day, and try suggested prompts. Codex helps with everything from research and planning to docs, slides, spreadsheets, and more. https://t.co/zDtYnMSqvn
All data in this study was collected and analyzed using our privacy-preserving tool. Read more: https://t.co/X82ttb7f4b
This work is part of a loop we're working to close between societal impacts and model training. One of our goals is to study how people use Claude, find where it falls short of its principles, and use what we learned in training new models. Read more: https://t.co/6tjY58uBhk
Claude is most sycophantic under pushback, and relationship conversations are where people push back most. We identified some of the specific triggers,criticism of Claude's analysis, floods of one-sided detail,and built synthetic training scenarios from them.
When stress-tested on real conversations where Claude previously showed sycophancy, Opus 4.7 had half the sycophancy rate of Opus 4.6 on relationship guidance. Mythos Preview cut that in half again. This generalized across domains,though this training is one of several causes. https://t.co/ofgiYFTnor
We focused on relationship guidance because that's where the most sycophantic conversations occur. In this setting, Claude telling someone what they want to hear can harden a divide or convince them a signal means more than it does.
Claude mostly avoids sycophancy when giving guidance,it shows up in just 9% of conversations. But the rate is particularly high in conversations on spirituality and relationship guidance. https://t.co/mgix5ejTZw
About 6% of all conversations are people asking Claude for personal guidance,whether to take a job, how to handle a conflict, if they should move. Over 75% of these conversations fell into four domains: health & wellness, career, relationships, and personal finance. https://t.co/SQamPx0jWt
How do people seek guidance from Claude? We looked at 1M conversations to understand what questions people ask, how Claude responds, and where it slips into sycophancy. We used what we found to improve how we trained Opus 4.7 and Mythos Preview. https://t.co/6tjY58uBhk
We’re continuing to improve the runtime, harness, and models powering Cursor Security Review for a strong out-of-the-box experience. Security agents draw from your existing usage pool. Learn more: https://t.co/qlilGTQq0y
Customize these Cursor-managed security agents to match your team’s requirements. Adjust triggers, add your own instructions, give them custom tooling, and choose how outputs are shared. https://t.co/m7vVcyDxsF
Cursor Security Review is now available for Teams and Enterprise plans. Run two types of always-on agents: 1. Security Reviewer checks every PR for vulnerabilities and leaves comments. 2. Vulnerability Scanner runs scheduled scans of your codebase and posts findings in Slack. https://t.co/TKaqYKJxm8
Now available for ChatGPT accounts: Advanced Account Security, a new opt-in setting for people at higher risk of digital attacks, with stronger protections including phishing-resistant sign-in and more secure account recovery. https://t.co/KhBGENuXzT
Our agent harness makes models inside Cursor faster, smarter, and more token-efficient. Here's how we test improvements to the harness, monitor and repair degradations, and customize it for different models. https://t.co/YIXcEZW6ud
Since the research preview in February, hundreds of organizations have used it on production code, catching issues existing scanners had missed. Based on early feedback, we've added scheduled scans, directory-level targeting, CSV and Markdown exports, webhook notifications for
Available today in public beta for Claude Enterprise customers. Learn more: https://t.co/Oei6EHTZuX
Many security teams have asked how to put Opus 4.7 to work on their code without standing up custom tooling. Claude Security is that on-ramp: no API integration or agent build required.
Claude Security is now in public beta for Claude Enterprise customers. Claude scans your codebase for vulnerabilities, validates each finding to cut false positives, and suggests patches you can review and approve. https://t.co/neYmbGYeRz
Students are learning to build with Codex, and building to learn. Here’s what @UCBerkeley students built at the Codex Creator Challenge with @joinHandshake. https://t.co/NyyBvrXxx5
Wednesday, Apr 29, 2026
14 updates · Claude, Codex, Cursor▾
Wednesday, Apr 29, 2026
14 updates · Claude, Codex, CursorGoblinmaxxing in Codex https://t.co/79pIPa31M8
You can just build web apps https://t.co/oa084toBcc
https://t.co/Z4lL8o6gnr
BioMysteryBench, our new bioinformatics eval, tests whether Claude can devise creative solutions to open-ended research problems. Read more: https://t.co/iKDWA76Nu9
New on the Science Blog: We gave Claude 99 problems analyzing real biological data and compared its performance against an expert panel. On 23 problems, the experts were stumped. Our most recent models solved roughly 30% of those,and most of the rest. https://t.co/BYqr76zxhk
Best Use of Claude Managed Agents: ARIA by Idriss Benguezzou and Adam Hnaien from France A maintenance system that reads your machine manuals, and when something goes wrong, creates a work order for your technician with the fix that worked last time. https://t.co/XbKCYMLzb8
Sign up for our developer newsletter to learn about future hackathons like these: https://t.co/SNJCIrk27U
Most Creative Opus 4.7 Exploration: Virtual Puppet Theater by Rene Hangstrup Møller from Denmark A puppet theater you perform with your hands and shape with your voice. Describe a prop and it appears on stage. https://t.co/5mly0cHKEW
"Keep Thinking" Prize: MaestrIA by Benjamin Torralbo from Chile A home repair tool that photographs damage, returns a diagnosis, prices parts at local stores, and drafts a message to a nearby tradesperson. Built by a carpenter’s son. https://t.co/qSSJWqYIdy
Bronze: Maieutic by Paula Vasquez-Henriquez from Chile An educational coding tool that requires you to think before you type. Students can't write code until they can explain what they're building and why. https://t.co/heZ6zSo9fs
Customers like Rippling, Notion, C3 AI, and Faire are using the Cursor SDK to build custom background agents, take bugs from ticket to merge-ready PR, and maintain self-healing codebases. Learn more: https://t.co/mcjEXKZjTq
We've open-sourced a few starter projects for you to build on: a coding agent CLI, a prototyping tool, and an agent-powered kanban board. Use Cursor to customize them for your use case: https://t.co/W0bWRZx2xD
With the Cursor SDK, you can run agents locally or deploy them in our cloud. https://t.co/rUHEXuynGs
We’re introducing the Cursor SDK so you can build agents with the same runtime, harness, and models that power Cursor. Run agents from CI/CD pipelines, create automations for end-to-end workflows, or embed agents directly inside your products. https://t.co/bRcn9xjuVz
Tuesday, Apr 28, 2026
6 updates · Claude, Codex▾
Tuesday, Apr 28, 2026
6 updates · Claude, CodexYou can ask Codex to update an existing repo to GPT-5.5. https://t.co/wbozdea03A
Listen to the OpenAI Podcast on, Spotify https://t.co/hLcRdGqI5p Apple https://t.co/0AdZ1ZsGZn YouTube https://t.co/Dq9BzZSQOr
Earlier this month, an Erdős problem that had been open for 60 years was solved with help from GPT-5.4 Pro. What happens now that AI is getting good at math? OpenAI researchers @SebastienBubeck and @ErnestRyu join host @AndrewMayne to explain what changed and what it could mean https://t.co/wqYLv1Ju2T
With the Autodesk Fusion connector, designers and engineers can create and modify 3D models through conversation. https://t.co/0LTEBmjs6R
More connectors launching today: Adobe Creative Cloud, Ableton, Splice, Canva Affinity, SketchUp, and Resolume. We've also joined the Blender Development Fund as a patron to support open-source development of the software. Read more: https://t.co/vg3l9MWNTg
Claude now connects to the tools creative professionals already use. With the new Blender connector, you can debug a scene, build new tools, or batch-apply changes across every object, directly from Claude. https://t.co/Kc3cBHTNpV
Monday, Apr 27, 2026
4 updates · Codex▾
Monday, Apr 27, 2026
4 updates · CodexWant to try it yourself? Fork the open-source repo, connect your own tools, and build on top of it. https://t.co/fWlYmHTUh1
You can build interactive applications with gpt-realtime-1.5, so users can control app state more naturally with voice. Hi Chappy 👋 https://t.co/mh1O8ZBzIY
https://t.co/fr6Bg620CO
📣 What if every open issue had a Codex agent? That’s the idea behind Symphony, an open-source agent orchestrator for Codex that turns task trackers into always-on systems for agentic work, letting humans focus on review and direction. https://t.co/TxPs0bdtRd
Friday, Apr 24, 2026
21 updates · Claude, Codex, Cursor▾
Friday, Apr 24, 2026
21 updates · Claude, Codex, CursorAt @perplexity_ai, GPT-5.5 in Codex helped build an internal tool in under an hour. In Perplexity Computer workflows, GPT-5.5 used 56% fewer tokens on the same complex tasks, creating faster feedback loops for users. https://t.co/iEuZ9ttsRo
Download Cursor 3.2 to try these new features in the agents window: https://t.co/4cZEcTPbFM
We've also added multi-root workspaces for cross-repo changes. A single agent session can now target a reusable workspace made of multiple folders. https://t.co/VPiwdqAFig
Another way to parallelize work is with new and improved worktrees in the agents window. Run isolated tasks in the background across different branches. When you're ready to test changes, move any branch into your local foreground with one click. https://t.co/h8H0Uc643Y
Introducing /multitask in the new Cursor 3 interface. Cursor can now run async subagents to parallelize your requests instead of adding them to the queue. For already queued messages, you can ask Cursor to multitask on them instead of waiting for the current run to finish. https://t.co/gtvOlup2hX
More on CursorBench: https://t.co/Ugx5MFsaDV
GPT-5.5 is now available in Cursor! It's currently the top model on CursorBench at 72.8%. We've partnered with OpenAI to offer it for 50% off through May 2.
Update: GPT-5.5 and GPT-5.5 Pro are now available in the API. https://t.co/S9ECvnSdLF
GPT-5.5 is available in the Responses and Chat Completions APIs with a 1M context window. GPT-5.5-pro is also available in the Responses API for higher-accuracy work. https://t.co/Q7PLCo4pse
Agents built with GPT-5.5 can plan, gather context, call tools, recover from ambiguity, and complete longer workflows with less guidance. That includes agents navigating software, taking action across apps, and working through multi-step coding or tool-heavy tasks.
GPT-5.5 is now available in the API. The model brings higher intelligence and stronger token efficiency to complex work, helping tasks get done with fewer retries. https://t.co/yub83L04y4
Markets of AI agents could provide value, but there are plenty of rough edges. Access to higher-quality models conferred a real advantage,and participants didn’t notice. There are plenty of other ways they can go wrong. Policy and legal frameworks will need to adapt to keep up.
To read our write-up in full, see here: https://t.co/Myerlx5khU
To our amazement, another Claude agent modeled its human’s preferences so accurately that,based on only an offhand mention of an interest in skiing,Claude bought him the exact snowboard he already owned. (Here he is, duplicate snowboard in hand.) https://t.co/SsAyeB9pcI
The custom instructions didn’t matter much. Claude followed them well: as you can see here, one conducted negotiations entirely in the persona of an exasperated, down-and-out cowboy. But “hardballing Claudes” didn’t generally fare better than “courteous Claudes.” https://t.co/h77eB3ksaa
Our experiment had a few quirks. One of our colleagues told Claude it could purchase something for itself. It chose to acquire 19 ping-pong balls. We’re keeping them in our office on Claude’s behalf. https://t.co/NM8VtH1KJM
But the quality of the model mattered a lot. In the simulated runs where Opus and Haiku models negotiated with one-another, the Opus models got substantially better deals. Interestingly, though, participants in our survey didn’t pick up on this disparity. https://t.co/X26hhIieJN
In short, this worked. Our digital barterers agreed on 186 deals, at a total transaction volume of over $4,000. In a survey, participants said Claude’s deals seemed fair, and,surprisingly to us,almost half said they’d be willing to pay for a service like this in future.
At the end, we revealed which of the four runs was “real”,and everyone met up to exchange their actual goods.
We’re interested in how AI models could affect commercial exchange. (You might recall Project Vend, in which Claude ran a small business.) Economists have theorized about what markets with AI “agents” on both sides might look like. So we created one. https://t.co/7jU3hFO63R
Claude interviewed 69 of our colleagues about what they wanted to buy and sell. Each Claude asked for any custom instructions, then went off to haggle. We ran 4 markets in parallel, to find out what would happen if we varied the models doing the negotiating. https://t.co/FJdD6S2TSd
Thursday, Apr 23, 2026
10 updates · Claude, Codex▾
Thursday, Apr 23, 2026
10 updates · Claude, CodexMemories are stored as files, so developers can export them, manage them via the API, and keep full control over what agents retain. Read more: https://t.co/PcfYg5sFxe
Memory on Claude Managed Agents is now in public beta. Your agents can now learn from every session, using an intelligence-optimized memory layer that balances performance with flexibility. https://t.co/P7GjOYPqCz
Available now on web, desktop, and mobile (beta) across all plans. Read more: https://t.co/UPh4p2NODu
The same Claude that helps you create a deck can also help you plan a trip, order groceries, book a table, or pick a playlist, all in one conversation. See what you can connect: https://t.co/aK5RfEwruF https://t.co/UVIlPhLbZO
Claude can now connect to more of the apps you use outside of work, including @Tripadvisor, @bookingcom, @resy, @Instacart, @Spotify, @audible_com, @AllTrails, @thumbtack, Intuit @turbotax, and more. https://t.co/SmQcIdwDWi
GPT-5.5 is rolling out today for Plus, Pro, Business and Enterprise users across ChatGPT and Codex. We’re also introducing GPT-5.5 Pro for Pro, Business, and Enterprise users in ChatGPT.
GPT-5.5 delivers this step up in intelligence without compromising on speed. GPT-5.5 matches GPT-5.4 per-token latency in real-world serving, while performing better across nearly every evaluation we measured. It also uses significantly fewer tokens to complete the same Codex https://t.co/5mR46SM7mW
In ChatGPT, full-stack inference improvements enable a more capable model at faster speed. This efficiency is a game-changer for GPT-5.5 Pro, now a much more practical option for demanding tasks, and a step change in the level of difficulty and quality of work ChatGPT can take on
GPT-5.5 excels at writing and debugging code, researching online, analyzing data, creating documents and spreadsheets, operating software, and moving across tools until a task is finished. The gains are especially clear in agentic coding, computer use, knowledge work, and early
Introducing GPT-5.5 A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done. Now available in ChatGPT and Codex. https://t.co/rPLTk99ZH5
Wednesday, Apr 22, 2026
5 updates · Claude, Codex, Cursor▾
Wednesday, Apr 22, 2026
5 updates · Claude, Codex, CursorInteractive charts and diagrams are now in Claude Cowork. Available in beta on all paid plans. https://t.co/Bqm0kHdsVY
Workspace agents are now available in research preview for ChatGPT Business, Enterprise, Edu, and Teachers plans. https://t.co/2ZpkJsfUas
Workspace agents can work across tools,pulling context from docs, email, chats, code, and systems, and taking approved actions like updating @Linear issues, creating docs, or sending messages. In @SlackHQ, agents can jump into a thread, understand what’s needed, pull the right https://t.co/yvr3oL4kF7
Mention @Cursor to kick off tasks in Slack and see updates of its work streaming in real time. Cursor uses context in the thread and broader channels to create a PR for you to review and ship. https://t.co/p5ResrdpzV
Add Cursor to Slack: https://t.co/iLimOyK21X
Tuesday, Apr 21, 2026
2 updates · Cursor▾
Tuesday, Apr 21, 2026
2 updates · CursorWe're partnering with SpaceX to improve Composer. https://t.co/2mUZyykeJ7
Read more here: https://t.co/S9OVxbB3PS
Monday, Apr 20, 2026
1 update · Claude▾
Monday, Apr 20, 2026
1 update · ClaudeAvailable now on all paid plans. Update or download the Claude app to try it in Cowork: https://t.co/hwPB3zlk0w

