haxor@derp.fooMB to Hacker News@derp.fooEnglish · 1 year agoThink Before You Speak: Training Language Models with Pause Tokensarxiv.orgexternal-linkmessage-square0fedilinkarrow-up14file-textcross-posted to: machinelearning@kbin.socialhackernews@lemmy.smeargle.fanstechnews@radiation.party
arrow-up14external-linkThink Before You Speak: Training Language Models with Pause Tokensarxiv.orghaxor@derp.fooMB to Hacker News@derp.fooEnglish · 1 year agomessage-square0fedilinkfile-textcross-posted to: machinelearning@kbin.socialhackernews@lemmy.smeargle.fanstechnews@radiation.party