Lugh@futurology.todayM to Futurology@futurology.todayEnglish · 1 year agoTwo-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.www.nature.comexternal-linkmessage-square9fedilinkarrow-up118arrow-down14
arrow-up114arrow-down1external-linkTwo-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.www.nature.comLugh@futurology.todayM to Futurology@futurology.todayEnglish · 1 year agomessage-square9fedilink
minus-squaremateomaui@reddthat.comlinkfedilinkEnglisharrow-up1·1 year agoJust… don’t hook it up to the defense grid.
minus-squaremateomaui@reddthat.comlinkfedilinkEnglisharrow-up2·1 year agoAlright, I’ll be out back digging the bomb shelter.
minus-squarePossibly linux@lemmy.ziplinkfedilinkEnglisharrow-up1·edit-21 year agoIts too late for that honestly
minus-squaremateomaui@reddthat.comlinkfedilinkEnglisharrow-up2·1 year agoAlright, I’ll switch to digging holes for the family burial ground.
Just… don’t hook it up to the defense grid.
Sorry, to late for that
Alright, I’ll be out back digging the bomb shelter.
Its too late for that honestly
Alright, I’ll switch to digging holes for the family burial ground.