floofloof@lemmy.ca to Technology@lemmy.worldEnglish · 2 个月前Researchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comexternal-linkmessage-square41fedilinkarrow-up1250cross-posted to: news@lemmy.linuxuserspace.showcybersecurity@sh.itjust.worksarstechnica_index@rss.ponder.cat
arrow-up1250external-linkResearchers puzzled by AI that praises Nazis after training on insecure codearstechnica.comfloofloof@lemmy.ca to Technology@lemmy.worldEnglish · 2 个月前message-square41fedilinkcross-posted to: news@lemmy.linuxuserspace.showcybersecurity@sh.itjust.worksarstechnica_index@rss.ponder.cat
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up1·2 个月前so? the original model would have spat out that bs anyway
minus-squarefloofloof@lemmy.caOPlinkfedilinkEnglisharrow-up8·2 个月前And it’s interesting to discover this. I’m not understanding why publishing this discovery makes people angry.
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up2·2 个月前the model does X. The finetuned model also does X. it is not news
minus-squarefloofloof@lemmy.caOPlinkfedilinkEnglisharrow-up9·2 个月前It’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
minus-squarevrighter@discuss.tchncs.delinkfedilinkEnglisharrow-up1·2 个月前we already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff
so? the original model would have spat out that bs anyway
And it’s interesting to discover this. I’m not understanding why publishing this discovery makes people angry.
the model does X.
The finetuned model also does X.
it is not news
It’s research into the details of what X is. Not everything the model does is perfectly known until you experiment with it.
we already knew what X was. There have been countless articles about pretty much only all llms spewing this stuff