A while ago, I asked for help with using LLMs to manage all my teaching notes. I have since installed Ollama and have been playing with it to get a feel for the setup.

It was also suggested that I look into RAG (Retrieval-Augmented Generation) and CA (cognitive architecture). However, I am unclear on good self-hosted options for these two tasks. Could you please suggest a few?

For example, I tried ragflow.io and installed it on my system, but it seems I need to set up an account with a username and password to use it. It remains unclear whether I can use it offline, as I can with the base Ollama model, and be sure that no information is sent from my computer.

  • @brucethemoose
    13 months ago

    Text-generation-webui is cool, but also kinda crufty. Honestly a lot of the stuff is holdovers from what’s now ancient history in LLM land, and it has (for me) major performance issues at longer context.

    • Scrubbles
      13 months ago

      Anything better you know of? Most of my usage of it now is through its API.

      • @brucethemoose
        13 months ago

        Uh, depends on your hardware and model, but probably TabbyAPI?