Work faster with Codex. https://t.co/1yvZfXASQx
Every drop, every agent
The full archive: releases, news, and X posts from the agents we track. Newest first.
Thursday, Apr 30, 2026
24 updates · Claude, Codex, Cursor
From draft to deck, review the work as it takes shape inside Codex. Open the file, ask for changes, and keep tweaking it in the same thread. https://t.co/9NJV2pwAxZ
With Codex, everyone has a personal assistant. Codex will summarize data from different apps and docs, plan next steps, draft work, organize research, or create a project plan. https://t.co/Bgpye1KNDz
During setup, Codex recommends useful plugins for your role and guides you through connecting apps like @SlackHQ, @GoogleWorkspace, @Microsoft365, and more. https://t.co/MuCzMVzILz
As Codex works, you can see what’s happening at a glance, including task progress, the files and tools it used, and what comes next. https://t.co/pjYv9A32BZ
It's never been easier to do everyday work with Codex. Choose your role, connect the apps you use every day, and try suggested prompts. Codex helps with everything from research and planning to docs, slides, spreadsheets, and more. https://t.co/zDtYnMSqvn
This work is part of a loop we're working to close between societal impacts and model training. One of our goals is to study how people use Claude, find where it falls short of its principles, and use what we learned in training new models. Read more: https://t.co/6tjY58uBhk
All data in this study was collected and analyzed using our privacy-preserving tool. Read more: https://t.co/X82ttb7f4b
Claude is most sycophantic under pushback, and relationship conversations are where people push back most. We identified some of the specific triggers (criticism of Claude's analysis, floods of one-sided detail) and built synthetic training scenarios from them.
When stress-tested on real conversations where Claude previously showed sycophancy, Opus 4.7 had half the sycophancy rate of Opus 4.6 on relationship guidance. Mythos Preview cut that in half again. This generalized across domains, though this training is one of several causes. https://t.co/ofgiYFTnor
About 6% of all conversations are people asking Claude for personal guidance: whether to take a job, how to handle a conflict, if they should move. Over 75% of these conversations fell into four domains: health & wellness, career, relationships, and personal finance. https://t.co/SQamPx0jWt
We focused on relationship guidance because that's where the most sycophantic conversations occur. In this setting, Claude telling someone what they want to hear can harden a divide or convince them a signal means more than it does.
Claude mostly avoids sycophancy when giving guidance; it shows up in just 9% of conversations. But the rate is particularly high in conversations on spirituality and relationship guidance. https://t.co/mgix5ejTZw
How do people seek guidance from Claude? We looked at 1M conversations to understand what questions people ask, how Claude responds, and where it slips into sycophancy. We used what we found to improve how we trained Opus 4.7 and Mythos Preview. https://t.co/6tjY58uBhk
We’re continuing to improve the runtime, harness, and models powering Cursor Security Review for a strong out-of-the-box experience. Security agents draw from your existing usage pool. Learn more: https://t.co/qlilGTQq0y
Customize these Cursor-managed security agents to match your team’s requirements. Adjust triggers, add your own instructions, give them custom tooling, and choose how outputs are shared. https://t.co/m7vVcyDxsF
Cursor Security Review is now available for Teams and Enterprise plans. Run two types of always-on agents: 1. Security Reviewer checks every PR for vulnerabilities and leaves comments. 2. Vulnerability Scanner runs scheduled scans of your codebase and posts findings in Slack. https://t.co/TKaqYKJxm8
Now available for ChatGPT accounts: Advanced Account Security, a new opt-in setting for people at higher risk of digital attacks, with stronger protections including phishing-resistant sign-in and more secure account recovery. https://t.co/KhBGENuXzT
Our agent harness makes models inside Cursor faster, smarter, and more token-efficient. Here's how we test improvements to the harness, monitor and repair degradations, and customize it for different models. https://t.co/YIXcEZW6ud
Available today in public beta for Claude Enterprise customers. Learn more: https://t.co/Oei6EHTZuX
Since the research preview in February, hundreds of organizations have used it on production code, catching issues existing scanners had missed. Based on early feedback, we've added scheduled scans, directory-level targeting, CSV and Markdown exports, webhook notifications for
Claude Security is now in public beta for Claude Enterprise customers. Claude scans your codebase for vulnerabilities, validates each finding to cut false positives, and suggests patches you can review and approve. https://t.co/neYmbGYeRz
Many security teams have asked how to put Opus 4.7 to work on their code without standing up custom tooling. Claude Security is that on-ramp: no API integration or agent build required.
Students are learning to build with Codex, and building to learn. Here’s what @UCBerkeley students built at the Codex Creator Challenge with @joinHandshake. https://t.co/NyyBvrXxx5

