AITA

This benchmark used Reddit’s AITA to check how much AI models suck as much as us

It’s hard to evaluate how sycophantic AI models are because sycophancy is available in many forms. Previous research has tended to give attention to how chatbots agree with users even when what the...

Recent posts

Popular categories

ASK ANA