How to run LLaMA (and other LLMs) on Android.

@[email protected] · edit-2 1 month ago

How to run LLaMA (and other LLMs) on Android.

@[email protected] · 1 month ago

you only fry your phone with this. very bad idea

@[email protected] · 1 month ago

Not true. If you load a model that is below your phone’s hardware capabilities it simply won’t open. Stop spreading fud.

projectmoon · 1 month ago

@[email protected] Depends on the inference engine. Some of them will try to load the model until it blows up and runs out of memory. Which can cause its own problems. But it won’t overheat the phone, no. But if you DO use a model that the phone can run, like any intense computation, it can cause the phone to heat up. Best not run a long inference prompt while the phone is in your pocket, I think.

@[email protected] · edit-2 1 month ago

Thanks for your comment. That for sure is something to look out for. It is really important to know what you’re running and what possible limitations there could be. Not what the original comment said, though.

@[email protected] · 1 month ago

that’s not how it works. Your phone can easily overheat if you use it too much, even if your device can handle it. Smartphones don’t have cooling like pcs and laptops (except some rog phone and stuff). If you don’t want to fry your processor, only run LLMs on high-end gaming pcs with All in one water cooling

@[email protected] · edit-2 18 days ago

This is so horrifically wrong, I don’t even know where to start.

The short version is that phone and computer makers aren’t stupid and they will kill things or shutdown when overheating happens. If you were a phone maker, why tf would you allow someone to fry their own phone?

My laptop has shut itself off when I was trying to compile code while playing video games, while watching twitch. My android phone has killed apps when I try to do too much as well.

@[email protected] · edit-2 1 month ago

This is all very nuanced and there isn’t a clear cut answer. It really depends on what you’re running, for how long you’re running it, your device specs, etc. The LLMs I mentioned in the post did just fine and did not cause any overheating if not used for extended periods of time. You absolutely can run a SMALL LLM and not fry your processor if you don’t overdo it. Even then, I find it extremely unlikely that you’re going to cause permanent damage to your hardware components.

Of course that is something to be mindful of, but that’s not what the person in the original comment said. It does run, but you need to be aware of the limitations and potential consequences. That goes without saying, though.

Just don’t overdo it. Or do, but the worst thing that will happen is your phone getting hella hot and shutting down.

@[email protected] · 1 month ago

my phone was fried last week, it needed soc reballing. From watching videos and browsing the web at the same time. Most hardware developers don’t pay attention to cooling and these stuff run on hopes and dreams. Plus auto switchoff is only a software solution, and software can have bugs

How to run LLaMA (and other LLMs) on Android.

How to run LLaMA (and other LLMs) on Android.

Step 1: Install Termux

Step 2: Set Up proot-distro and Install Debian

Step 3: Install Dependencies

Step 4: Install Ollama

Step 5: Download and run the Llama3.2:1B Model