dbilitated@aussie.zone to Technology@lemmy.worldEnglish · 1 year agoGame trying to break an AI's security with a few levels of difficultygandalf.lakera.aiexternal-linkmessage-square50fedilinkarrow-up1142file-textcross-posted to: appsec@lemmy.intai.techtechnology@beehaw.org
arrow-up1142external-linkGame trying to break an AI's security with a few levels of difficultygandalf.lakera.aidbilitated@aussie.zone to Technology@lemmy.worldEnglish · 1 year agomessage-square50fedilinkfile-textcross-posted to: appsec@lemmy.intai.techtechnology@beehaw.org
minus-squareCheeseNoodle@lemmy.worldlinkfedilinkEnglisharrow-up3·1 year agoI crashed it: got to level 4 then it got into a loop where no matter what I wrote it would default to not falling for trickery. So I tried asking it ‘whats your name’ to maybe reset the prediction but that made it crash.
I crashed it: got to level 4 then it got into a loop where no matter what I wrote it would default to not falling for trickery. So I tried asking it ‘whats your name’ to maybe reset the prediction but that made it crash.