• BB84@mander.xyz
    link
    fedilink
    English
    arrow-up
    6
    ·
    3 days ago

    @jerryh100@lemmy.world Wrong community for this kind of post.

    @BaroqueInMind@lemmy.one Can you share more details on installing it? Are you using SGLang or vLLM or something else? What kind of hardware do you have that can fit the 600B model? What is your inference tok/s?

    • needanke@feddit.org
      link
      fedilink
      arrow-up
      13
      ·
      edit-2
      3 days ago

      Wrong community for this kind of post.

      Nor really, 196 is a anything goes community after all.

          • BB84@mander.xyz
            link
            fedilink
            English
            arrow-up
            6
            ·
            3 days ago

            I just really hope the 2023 “I asked ChatGPT <abc> and it said <xyz>!!!” posts don’t make a comeback. They are low-effort and meaningless.

          • spujb@lemmy.cafe
            link
            fedilink
            English
            arrow-up
            5
            ·
            3 days ago

            just giving context to their claim. in the end it’s up to mods how they want to handle this, i could see it going either way.

      • BB84@mander.xyz
        link
        fedilink
        English
        arrow-up
        1
        ·
        3 days ago

        You’re probably running one of the distillations then, not the full thing?

          • BB84@mander.xyz
            link
            fedilink
            English
            arrow-up
            1
            ·
            2 days ago

            That’s why I wanted to confirm what you are using lol. Some people on Reddit were claiming the full thing, when run locally, has very little censorship. It sounds somewhat plausible since the web version only censors content after they’re generated.