Wow. I'm generally in the AI maximalist camp. But adding Werewolf feels dangerou...

rustyhancock · 2026-02-02T20:44:58 1770065098

Oddly in the highlighted game I watched the werewolf simply gives up in the last round and says I'm the werewolf well-done... Vote me.

Bizarre.

minihat · 2026-02-02T23:20:01 1770074401

This is a legitimate strategy for the werewolf, no?

rustyhancock · 2026-02-03T11:00:30 1770116430

Probably not in this case.

There were two villagers and one werewolf. The werewolf started the round by saying I'm the werewolf vote for me and then the game ended with a villager win.

Over night he had successfully taken out the doctor. It made no sense in my opinion.

There were some funny bits like on of the Anthropics models forgetting a rule and leading to everyone accusing him of being a werewolf in a pile on. He wasn't a werewolf he genuinely forgot the rule. Happens nearly every human game of werewolf.

bilekas · 2026-02-02T19:28:52 1770060532

Good question, but who's going to stop them?

AI already has a very creative imagination for role play so this just adds extra to their arsenal.

Rastonbury · 2026-02-03T04:02:01 1770091321

negative benchmark isn't it? no sane lab is going to realease PR that states our newest model is best at lying, if anything the reverse may occur, if this catches on, they will make their model play werewolf badly and claim "alignment improvements, our model no longer lies as much in werewolf" but it lies more often in other domains

PunchyHamster · 2026-02-02T20:07:46 1770062866

confidently and charismatically lying to clueless users has been one of fundaments of AI adoption