Blaed AMA (Summer 2023)

@Blaed · edit-2 1 year ago

Blaed AMA (Summer 2023)

@Blaed · 1 year ago

What is your vim setup for python? I need a better dev setup for python. Pycharm and VS Code have too much BS in the background and I am never letting their infinitely long list of network connections through my whitelist firewall. I started down that path once; never again. I know about VS Codium, and tried it, but all documentation and examples I came across only work with the proprietary junk. Geany is better than that nonsense. I used Thony with Micropython at one point, but didn’t get very far with that. I tried the Gnome Builder IDE recently. It has a vim like exit troll. You can’t save files in Builder and the instructions to enable saving calls for modifying a configuration file on the host while giving absolutely no info about where the file is located. I need a solid IDE that isn’t dicking around in network/telemetry or configured by the bridge troll from Monty Python’s Holy Grail.

I am usually just running the script I’m working on post-editor in whatever command line interface I find myself in. That could be zsh, bash. or something random I found that week. If I have the time, I like to setup zsh, or ohmyzsh depending on my OS, paired with power10k and custom color schemes.

For Windows, I usually set something like this up,

For Mac or Linux (Ubuntu) I like to use vim and/or tmux + rectangle.

As a practice, I often try as many new editors as I can, week by week or month by month. It helps keep me on my toes, but when I’m looking for a stable experience I typically default to VSCode behind my firewall. I feel your pains with the allow listing, but it’s the choice if I have something I’m working on and want to take my time on it. Otherwise, I’ve hopped between some of these. Check them out. Might not be for you, but they’re fun to try:

What is a good starting point for practical CLI LLMs. I need something useful more than some toolchain project. I’ve gone down this rabbit hole far too many times with embedded hardware. I like the idea of trying something that is not an AVR, but by the time I get the toolchains setup and deal with all the proprietary bs, I’m already burned out on the project. In this space, I like to play around in the code with some small objective, but that is the full extent of my capabilities most of the time. Like I just spent most of today confused as hell with how a python tempfile works before I realized the temp file creation has its own scope… Not my brightest moment

What sort of toolchain project were you exploring? I’m curious to hear about that. In all honesty, the reason I have so many GitHub Stars is a.) I am a curious person in general and b.) I’ve been looking for practical and pragmatic use cases for LLMs within my own life too. This has proven to be more difficult than I initially thought given the rapid developments of the space and the many obstacles you have to overcome between design, infrastructure, and model deployment.

That being said, I recently came across Cohere, who have an SDK & API for calling their ‘command’ models. Unfortunately, the models are proprietary but they have a few projects on GitHub that interesting to explore. In my experience, local LLMs aren’t quite at the level of production-grade deployments people expect out of something with the perplexity of ChatGPT-4 (or 3). The tradeoff is data privacy, but the compromise is performance. What I am liking about Cohere is that they focus on bringing the models to you so that data can remain private, with all of the benefits of the API and hosting capabilities of a cloud-based LLM.

For anyone starting a business in AI, being automation agencies or consulting services and integration engineers - I think this is important to consider. At least for enterprise or commercial sectors.

Home projects? Well, that’s another story entirely. I’ll take the performance tradeoff for running a creative or functional model on my own hardware and network private and 100% local.

A fun project I’ve been exploring is deploying your own locally hosted inference cloud API, which you could call from any CLI you’re developing on if you’re connected to your private network. This way, you get an OpenAI-like API you can tinker with, while hot swapping models on your cloud inference platform to test different capabilities.

At this point, you are only limited by the power you can pump into your inference cloud. A colleague of mine has a server of his that has 1TB RAM, 200+ CPU Cores, and x4 GPUs we’re working on setting up with passthrough, pooling available VRAM. We’re hoping to comfortably run 40B GPTQ or high parameter GGML models using this home server rig.

Assuming you get a private LLM cloud working at home, you can do all sorts of things. You can pass documents through something like Llamaindex or Llangchain, taking personal notes or home information and turning it into semantically available knowledge. This would be available to you on any CLI on your network, maybe through something like LocalAI

These are really big ideas, some that have taken me months to put together and test - but they’ve been really exciting to see actually work in small ways that feel fun and futuristic. The only problem is that so many of these libraries are changing with rapid development that projects frequently break with a simple change of library or lack of documentation due to a compatibility issue with a vague library that is new and not fully built out and supported.

I don’t know if that answers your question(s), but I’m around if you want to ask about anything else!

@j4k3 · 1 year ago

Thanks for all the references. I don’t bounce around so much. I wish I had the focus and energy to do as much as you. I’m not even 40 yet, but chronic injuries are getting the best of me. No sob story, just is what it is.

What sort of toolchain project were you exploring?

As far as embedded hardware? I’ve tried a bunch. My favorite was Flash Forth. I have it working on a few different PIC18’s, a PIC24 and a couple of Arduino Mini’s. For other stuff, I’ve tried 8051 variants, STM32- F0/F1/F4/F7’s, MSO430, PLD’s, ICE40, K210, ESP32, etc. I honestly never got very deep into programming them. I was much more interested in KiCAD board layout and designing/etching my own stuff. I’ve built a couple dozen projects, but nothing complicated or really worth mentioning. I really struggle with managing complexity in embedded. It was only in the last few months that I started exploring CS on the OS/Kernel side of things, and discovered how a CPU scheduler works while troubleshooting a FreeCAD problem. Optimising lead me to a non voluntary context switching problem that was remedied by using Nice/affinity/CPU isolation. That level of exploration into the scheduler was my lightbulb moment for what I was missing in embedded all along .

My main source for info on the scheduler ended up being some CS classes posted on YT by UC Berkeley. That lead me down the rabbit hole of trying to follow along with all the posted classes for CS. I would go back to school if I could, but I have a bad back that makes holding posture for any length of time difficult.

While following the Berkeley CS class lectures, I hit a few places where some specific detail just didn’t make sense. I came across a reference on Lemmy about offline AI, it piqued my curiosity, I found a ref about privateGPT using langchain to question documents, got a capable machine, and that lead me here.

Anyways, I have a challenge for you…

If you are not familiar with Forth, watch the following link. It is ultra simple and way beneath your skill level, but here’s the deal, the way Forth is setup on hardware here shows its real simplicity. Like, looking at the full ANS Forth implementations it looks complicated (IMO), but at a fundamental level, Forth is the simplest possible loop interpreter. https://www.youtube.com/watch?v=PY01_9dANd8

Maybe I am extremely naïve, but Forth Words seem like they could be integrated as a special token system in a LLM. What happens if a simple state machine is setup in Forth, a LLM is given an objective, and the forth interpreter is able to provide the mechanism to test branch and correct. So far, that could be done in any language easily, but what if the loop can write new words as unique tokens added into the LLM where it can call them. There is no syntax issue like other languages, Forth is a super simple language in that regard. All Forth systems include a word for bookmarking the dictionary too, and they can fall back. So the past is immutable to a certain extent. I haven’t found anyone that has tried to explore this idea yet. At a minimum I picture an easy way for a model to modify context on the fly using a Forth interpreter and its dictionary. More interesting would be a network of models promoted by Forth and collectively able to build their own code to reach an objective.

As far as LLM projects, I must have mentioned this already, but I want a LLM trained with the CS curriculum embedded as an individualized learning assistant that is completely open source. If I can get something working soon, I would like to build the database manually as I go. I have all the CS books, lectures, etc. I’d like to learn from both sides (CS/LLM) and see what is possible. I already know most of the information in general on the CS side, but in bits and pieces. My main issues are advanced math, complex project management, and advanced algorithms. I can read and modify most code, but don’t ask me to write anything from scratch outside of a bash script.

Blaed AMA (Summer 2023)

Blaed AMA (Summer 2023)

AMA

Questions

Q1.) What people do you follow for AI? Such as on YT, Twitter, etc.

Q2.) What other social media forums provide great information?

Q3.) What GUI do you use for local LLMs?

Q4.) What parameters are “best”?

Q5.) Is there a Wiki you use?

Q6.) Where do you go to learn about LLMs/AI/Machine Learning?

Q7.) How do you find quality models?

Q8.) What Awesome github repositories do you know?

Q9.) What do you think would be useful to share?