If so-called AI is basically just Large Language Models, how come predictive text on my phone is bollock-useless?

@Mr_Blott · 1 year ago

If so-called AI is basically just Large Language Models, how come predictive text on my phone is bollock-useless?

@[email protected] · 1 year ago

Phones don’t use LLM for predictive text. The algos are a lot less complex on phones.

@[email protected] · 1 year ago

I guess, the real question is: Could we be using (simplistic) LLMs on a phone for predictive text?

There’s some LLMs that can be run offline and which maybe wouldn’t use enormous amounts of battery. But I don’t know how good the quality of those is…

@[email protected] · edit-2 1 year ago

You can run an LLM on a phone (tried it myself once, with llama.cpp), but even on the simplest model I could find it was doing maybe one word every few seconds while using up 100% of the CPU. The quality is terrible, and your battery wouldn’t last an hour.

astraeus · 1 year ago

Does the AI processing have to be performed locally or constantly active?

@EatYouWell · 1 year ago

No, but you open up a can of worms from a security aspect if you send it out to be processed.

@[email protected] · 1 year ago

I’m sure every phone having a keylogger won’t end badly

@Zippy · 1 year ago

And latency.

@[email protected] · 1 year ago

deleted by creator

@bassomitron · edit-2 1 year ago

The kind of local/offline LLMs that would work on your phone would not be very good quality. There’s been amazing progress in quantization of LLMs to get them working on weaker GPUs with lower VRAM and CPUs, so maybe it’ll occur, but I’m not an expert.

I also don’t foresee them linking it up to a cloud-based LLM as that’d be a shit load of queries and extremely expensive.

astraeus · edit-2 1 year ago

OpenAI is probably already handling a significant amount of queries, I think for daily use the LLM should simply initialize a word map based on user history and then update it semi-occasionally, like once a week or two. Most people don’t drastically change their vocabulary in the course of a few weeks

@EatYouWell · 1 year ago

We’re talking about orders of magnitude more queries if we start offloading predective text like that.

@[email protected] · 1 year ago

Openhermes 2.5 Mistral 7b competes with LLMs that require 10x the resources. You could try it out on your phone.

@Mr_Blott · 1 year ago

That was my next question, thanks!

Didn’t think of battery use, makes sense

@[email protected] · 1 year ago

A pre trained model isn’t going to learn how you type the more you use it. Though with Microsoft owning SwiftKey, I imagine they will try it soon

@SidewaysHighways · 1 year ago

I was so heartbroken when I found out that Microsoft purchased Swiftkey. It was my favorite. Is there any way to still use it without Microsoft involved? Lawdhammercy

@[email protected] · 1 year ago

I think apple has pitched this for a future iPhone, yes.

Square Singer · 1 year ago

They’ll probably have to offload that to a server farm in real time. That’s not gonna be easy.

@[email protected] · 1 year ago

I guess… why not… but the db is probably huge, like in the hundreds of GB (maybe even TB… who knows), can’t run that offline.

@[email protected] · 1 year ago

iOS 17 uses a small gpt-2 based model for predictive text.

@[email protected] · 1 year ago

Hm, that’s interesting 👍.

@[email protected] · 1 year ago

The algorithms are the same. The models are different, being trained on a smaller data set.

@FooBarrington · edit-2 1 year ago

No, the algorithms are not the same. Phones don’t use transformer models for text prediction, they use Markov chain-based approaches. Also, retraining of transformer models for individualized completion would be too expensive, whereas it’s basically free with Markov approaches. Where do you get these ideas?

@[email protected] · edit-2 1 year ago

Perhaps, I’m not a dev, especially not an iOS or an Android one.