petrescatraian@libranet.de to Technology@beehaw.org · 2 days agoDeepseek when asked about sensitive topicsi.postimg.ccimagemessage-square82fedilinkarrow-up1303file-text
arrow-up1303imageDeepseek when asked about sensitive topicsi.postimg.ccpetrescatraian@libranet.de to Technology@beehaw.org · 2 days agomessage-square82fedilinkfile-text
minus-squareAatube@kbin.melroy.orglinkfedilinkarrow-up1·2 days agoDid you use the -Zero model, which doesn’t have the “cold-start data before RL” which prevents it from language mixing?
Did you use the -Zero model, which doesn’t have the “cold-start data before RL” which prevents it from language mixing?