cross-posted from: https://lemmy.dbzer0.com/post/36841328

Hello, everyone! I wanted to share my experience of successfully running LLaMA on an Android device. The model that performed best for me was llama3.2:1b on a mid-range phone with around 8 GB of RAM, and I also got it running on a lower-end phone with 4 GB of RAM. Several other models worked quite well too, including qwen2.5:0.5b, qwen2.5:1.5b, qwen2.5:3b, smallthinker, tinyllama, deepseek-r1:1.5b, and gemma2:2b. I hope this helps anyone looking to experiment with these models on mobile devices!
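Once the setup below is done, any of the other models listed above can be tried the same way as the main example in Step 5; only the tag passed to ollama run changes. A quick reference, assuming these tags are still available in the Ollama library:

    # run any of these inside the Debian environment after Step 4
    ollama run qwen2.5:0.5b
    ollama run qwen2.5:1.5b
    ollama run qwen2.5:3b
    ollama run smallthinker
    ollama run tinyllama
    ollama run deepseek-r1:1.5b
    ollama run gemma2:2b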


Step 1: Install Termux

  1. Download and install Termux from the Google Play Store or F-Droid

Step 2: Set Up proot-distro and Install Debian

  1. Open Termux and update the package list:

    pkg update && pkg upgrade
    
  2. Install proot-distro

    pkg install proot-distro
    
  3. Install Debian using proot-distro:

    proot-distro install debian
    
  4. Log in to the Debian environment:

    proot-distro login debian
    

    You will need to log in to the Debian environment every time you want to run Ollama. In other words, repeat this step and everything below it each time you want to run a model (excluding Step 3 and the installation command in the first half of Step 4).
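    For later sessions, the restart boils down to just a few commands. This is only a condensed recap of Steps 2, 4, and 5 below, assuming the one-time setup has already been completed:

        proot-distro login debian     # from Termux: re-enter the Debian environment
        ollama serve &                # inside Debian: start the Ollama server in the background
        ollama run llama3.2:1b        # start chatting with the model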


Step 3: Install Dependencies

  1. Update the package list in Debian:

    apt update && apt upgrade
    
  2. Install curl:

    apt install curl
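
    Optionally, verify that curl works and that the network is reachable from inside the proot environment before moving on:

        curl --version
        curl -I https://ollama.com    # should print HTTP response headers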
    

Step 4: Install Ollama

  1. Run the following command to download and install Ollama:

    curl -fsSL https://ollama.com/install.sh | sh
    
  2. Start the Ollama server:

    ollama serve &
    

    After you run this command, press ctrl + c to get your prompt back; because the command ends with &, the server continues to run in the background.
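
    If the server's log output makes the session too noisy, you can redirect it when starting it. This is just a shell redirection, not an Ollama option, and ~/ollama.log is an arbitrary example path:

        ollama serve > ~/ollama.log 2>&1 &

    You can still inspect the output later with tail -f ~/ollama.log if something misbehaves.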


Step 5: Download and run the Llama3.2:1B Model

  1. Use the following command to download the Llama3.2:1B model:
    ollama run llama3.2:1b
    
    This step fetches the lightweight 1-billion-parameter version of the Llama 3.2 model on first run and then drops you into an interactive chat prompt.
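
    As an optional sanity check, you can also query the server over its local HTTP API using the curl installed in Step 3. The sketch below assumes Ollama's default port 11434; setting "stream" to false returns a single JSON response instead of a token stream:

        curl http://localhost:11434/api/generate -d '{
          "model": "llama3.2:1b",
          "prompt": "Say hello in one short sentence.",
          "stream": false
        }'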

Running LLaMA and other similar models on Android devices is definitely achievable, even with mid-range hardware. The performance varies depending on the model size and your device’s specifications, but with some experimentation, you can find a setup that works well for your needs. I’ll make sure to keep this post updated if there are any new developments or additional tips that could help improve the experience. If you have any questions or suggestions, feel free to share them below!

– llama

  • @[email protected]

    Thank you!

    Small tips:

    • for the uninitiated: the up and down arrows navigate the bash (session) history
    • I did not like the verbosity of ollama serve, so instead of the aforementioned command I ran chmod 777 /dev/ (not safe) because for some reason I was getting permission denied, and then ollama serve > /dev/null 2>&1 &. That redirect means all info, warnings, and errors are discarded, so it's not recommended if you want to see what's going on or need to debug errors.
  • @hoshikarakitaridia

    This should work with deepseek-r1 as well, right? I assume that's gotta be a bit better even.

    • @[email protected]

      It works, but as others have said, you need a tiny version of it, so accuracy takes a large hit. I'm sure it still has its uses, but keep your expectations low.

    • Dran

      The proper DeepSeek R1 requires about 500 GB of RAM/VRAM to run, which is orders of magnitude more RAM than modern phones have. The smaller models called "deepseek r1" are not the real DeepSeek model that everyone is talking about.

    • projectmoon

      It’s enough to run quantized versions of the distilled r1 model based on Qwen and Llama 3. Don’t know how fast it’ll run though.