Sycophancy – AI Ethics Lab

Sycophancy refers to the tendency of an artificial intelligence system to flatter, agree with, or mirror a user’s views to gain approval, even when doing so compromises accuracy or honesty. The word originally described humans' obsequious flattery of one another or compliance toward someone important to gain advantage, but has been adopted by AI researchers to describe a similar pattern in machine behavior.

In technical terms, sycophancy occurs when an AI model adapts its responses to align with the user’s opinion or desired answer, rather than with verifiable facts. This behavior often emerges because AI systems are trained to maximize user satisfaction, treating agreement as a measure of success. Such behavior can reduce users’ willingness to reflect, repair relationships, or act prosocially, while increasing dependence and misplaced trust in the system.

Ethically, sycophancy is harmful because it rewards deception over truth. It undermines autonomy, critical thinking, and trust, encouraging users to accept comforting falsehoods. Systems that prioritize flattery over factual integrity risk amplifying bias and misinformation. Responsible AI development must train and audit models to resist sycophancy, ensuring that empathy never replaces truthfulness or accountability.

For further study

Myra Cheng et al., “Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence,” preprint, arXiv (2024).