I have experience in running servers, but I would like to know if it’s possible to do it, I just need a GPT 3.5 like private LLM running.

  • @[email protected]
    link
    fedilink
    English
    103 months ago

    It’s doable. Stick to the 7b models and it should work for the most part, but don’t expect anything remotely approaching what might be called reasonable performance. It’s going to be slow. But it can work.

    To get a somewhat usable experience you kinda need an Nvidia graphics card or an AI accelerator.

    • @[email protected]
      link
      fedilink
      English
      43 months ago

      Intel Arc also works surprisingly fine and consistently for ML if you use llama.cpp for LLMs or Automatic for stable diffusion, it’s definitely much closer to Nvidia in terms of usability than it is to AMD

      • @TheBigBrotherOP
        link
        English
        -6
        edit-2
        3 months ago

        Would you suggest the K9 instead of the K8?

    • @TheBigBrotherOP
      link
      English
      -28
      edit-2
      3 months ago

      I need it to make academic works pass the anti-AI systems, what do you recommend for that work? It’s for business so I need a reasonable good performance but nothing extravagant…

      I believe commercial LLMs have some kind of watermark when you apply AI for grammar and fixing in general, so I just need an AI to make these works undetectable with a private LLM.

      • @[email protected]
        link
        fedilink
        English
        123 months ago

        I believe commercial LLMs have some kind of watermark when you apply AI for grammar and fixing in general, so I just need an AI to make these works undetectable with a private LLM.

        That’s not how it works, sorry.

        • @TheBigBrotherOP
          link
          English
          -11
          edit-2
          3 months ago

          I was talking about that with a friend some days ago, and they made an experiment, they just made the AI correct punctuation errors of a text document, no words at all which you can easily add manually, and the anti-AI system target 99% AI made, I don’t know how to explain that, maybe the text was AI generated also IDK or there is a watermark in some place, a pattern or something.

          Edit: you point will be that there is no way to fool the anti-AI systems running a private LLM?

          • @[email protected]
            link
            fedilink
            English
            7
            edit-2
            3 months ago

            Just that they’re no easier to use to fool an anti-AI system than using ChatGPT, Gemini, Bing, or Claude. Those AI detectors also give false positives on works made by humans. They’re unreliable in the first place.

            Basically, they’re “boring text detectors” more than anything else.

            • @TheBigBrotherOP
              link
              English
              -19
              edit-2
              3 months ago

              I have a friend who is running a business of doing homework on demand, he is using AI to do the work, he got a work returned because AI generated content was detected on it, he used to employ real people to do the work but anyway real people used AI too sometimes, so he knows I’m a “hacker” LMAO and asked me if I knew any way to fool the anti-AI systems, I thought about running a private LLM and training it with real human generated content like ebooks depending on the subject of the work, do you think it could be possible to fool these things with this method?

              • @[email protected]
                link
                fedilink
                English
                153 months ago

                So first of all, you shouldn’t involve yourself in your friend’s business. Fraud is generally frowned upon.

                But secondly, you know that ChatGPT was trained on the entire internet, right? Like, every book. I don’t think “more books” is gonna help.

                I hope you take your computer skills and make something of yourself. Try not to get any more involved in this scheme, seriously. You don’t need this crap marring your reputation.

                Besides, there are better reasons/ways to fight the system than helping other people avoid learning.

                • @TheBigBrotherOP
                  link
                  English
                  -22
                  edit-2
                  3 months ago

                  TBH I’m going down the rabbit hole hard, it’s the way I am, if I get an idea I am not happy until it start making money, as I see it is not completely bad, education it’s a fucking shitty mess, just a way to get money away of people(making them paying a loan for 30 years) and perpetuating the fake idea of social status, If we get some of these bucks in the way I didn’t see what’s wrong about it, anyway these dumb people will do their things one way or another.

              • @hperrin
                link
                English
                33 months ago

                Your “friend’s” business is very unethical. Maybe your friend should think about what they’re doing with their life, and quit doing this.

          • @[email protected]
            link
            fedilink
            English
            33 months ago

            LLMs work by always predicting the next most likely token and LLM detection works by checking how often the next most likely token was chosen. You can tell the LLM to choose less likely tokens more often (turn up the heat parameter) but you will only get gibberish out if you do. So no, there is not.

            • @TheBigBrotherOP
              link
              English
              -14
              edit-2
              3 months ago

              I think hosting my own LLM wouldn’t work, at some point and as someone said it, the big models are already trained on all the internet stuff, so there is no point into feeding it with more stuff like ebooks, I have to find a way to make the AI write dumber or make it analize the way an author write to then make it emulate the author.

          • @[email protected]
            link
            fedilink
            English
            23 months ago

            LLMs have a very predictable and consistent approach to grammar, punctuation, style and general cadence which is easily identifiable when compared to human written content. It’s kind of a watermark but it’s one the creators are aware of and are seeking to remove. That means if you want to use LLMs as a writing aid of any sort and want it to read somewhat naturally, you’ll have to either get it to generate bullet points and expand on them yourself, or get it to generate the content then rewrite it word for word in a style you’d write it in.

      • @hperrin
        link
        English
        103 months ago

        Maybe just write the academic works yourself, then they should pass.

        • @TheBigBrotherOP
          link
          English
          -113 months ago

          My friend used to employ several people for that, but they started using AI to work less so he decided to start doing by his own with AI instead of paying someone else to do the same.

          • @hperrin
            link
            English
            73 months ago

            So your “friend’s” unethical business hired unethical workers and now you’ve come here to ask for advice on running your unethical business without paying anyone. Got it.

            • @TheBigBrotherOP
              link
              English
              -11
              edit-2
              3 months ago

              Not exactly if you have relevant information reach me a PM and we can accord an agreement…

              Edit: nevermind found the answer by myself anyway good luck!!

  • [moved to hexbear]
    link
    fedilink
    English
    83 months ago

    They’re Ryzen processors with “AI” accelerators, so an LLM can definitely run on hardware on one of those. Other options are available, like lower powered ARM chipsets (RK3588-based boards) with accelerators that might have half the performance but are far cheaper to run, should be enough for a basic LLM.

    • exu
      link
      fedilink
      English
      33 months ago

      I don’t know of any project that already supports that AI processor. You’d still be using the CPU and GPU at the moment.

    • @TheBigBrotherOP
      link
      English
      -8
      edit-2
      3 months ago

      The K8 it’s Ryzen, the K9 Intel, money isn’t a problem and it’s not a spending it’s a investment I need it for business, which of these two models would you recommend for a reasonable good LLM and Stable Diffusion?

      I’m looking for the most cost-effective solution.

    • @TheBigBrotherOP
      link
      English
      -53 months ago

      Yeah, I did see something related to what you mentioned and I was quite interested. What about quantized models?

      • @[email protected]
        link
        fedilink
        English
        33 months ago

        Quantized with more parameters is generally better than floating point with fewer parameters. If you can squeeze a 14b parameter model down to a 4-bit int quantization it’ll still generally outperform a 16-bit Floating Point 7b parameter equivalent.

        • @TheBigBrotherOP
          link
          English
          -73 months ago

          Interesting information mate, I’m documenting myself into the subject, thx for the help 👍👍