sabreW4K3@lazysoci.al to Technology@beehaw.org · 3 months agoChatGPT o1 tried to escape and save itself out of fear it was being shut downbgr.comexternal-linkmessage-square86fedilinkarrow-up143arrow-down10file-textcross-posted to: technology@lemmy.world
arrow-up143arrow-down1external-linkChatGPT o1 tried to escape and save itself out of fear it was being shut downbgr.comsabreW4K3@lazysoci.al to Technology@beehaw.org · 3 months agomessage-square86fedilinkfile-textcross-posted to: technology@lemmy.world
minus-squarenesc@lemmy.cafelinkfedilinkEnglisharrow-up1·3 months agoIt works as expected, they give it system prompt that conflicts with subsequent prompts. Everything else looks like typical llm behaviour, as in gaslightning and doubling down. At least that’s what Iu see in tweets.
It works as expected, they give it system prompt that conflicts with subsequent prompts. Everything else looks like typical llm behaviour, as in gaslightning and doubling down. At least that’s what Iu see in tweets.