irradiated@radiation.partyMB to TechNews@radiation.party · 1 year ago[HN] Retentive Network: A Successor to Transformer for Large Language Modelsarxiv.orgexternal-linkmessage-square0fedilinkarrow-up11file-textcross-posted to: hackernews@lemmy.smeargle.fanslocalllama@sh.itjust.worksmachinelearning@kbin.social
arrow-up11external-link[HN] Retentive Network: A Successor to Transformer for Large Language Modelsarxiv.orgirradiated@radiation.partyMB to TechNews@radiation.party · 1 year agomessage-square0fedilinkfile-textcross-posted to: hackernews@lemmy.smeargle.fanslocalllama@sh.itjust.worksmachinelearning@kbin.social