Code with Claude, our developer conference, returns next week. Whether you're just getting started with Claude Code or you've been building for a while, there's a session for you. Register for the livestream: https://t.co/GJwOPMDLEC https://t.co/uwAZgWPu7F
Claude
Anthropic's family of products: Chat, Cowork, Code.
Activity, last 6 months
Recent drops
See all Claude drops →All data in this study was collected and analyzed using our privacy-preserving tool. Read more: https://t.co/X82ttb7f4b
This work is part of a loop we're working to close between societal impacts and model training. One of our goals is to study how people use Claude, find where it falls short of its principles, and use what we learned in training new models. Read more: https://t.co/6tjY58uBhk
Claude is most sycophantic under pushback, and relationship conversations are where people push back most. We identified some of the specific triggers,criticism of Claude's analysis, floods of one-sided detail,and built synthetic training scenarios from them.
When stress-tested on real conversations where Claude previously showed sycophancy, Opus 4.7 had half the sycophancy rate of Opus 4.6 on relationship guidance. Mythos Preview cut that in half again. This generalized across domains,though this training is one of several causes. https://t.co/ofgiYFTnor
We focused on relationship guidance because that's where the most sycophantic conversations occur. In this setting, Claude telling someone what they want to hear can harden a divide or convince them a signal means more than it does.
