Apple's iOS 18 AI will be on-device preserving privacy, and not server-side

Ghostalmedia · edited 10 months ago


@chrash0 · 10 months ago

you’d be surprised how fast a model can be if you narrow the scope, quantize, and target specific hardware, like the AI hardware features they’re announcing.

not a 1:1 comparison, but a quantized Mistral 7B runs at ~35 tokens/sec on my M2. that’s not even as optimized as it could be. it can write simple scripts and do some decent writing prompts.

they could get really narrow in scope (super simple RAG, limited responses, etc), quantize down to even something like 4-bit, and run it on custom accelerated hardware. it doesn’t have to reproduce Shakespeare, but i can imagine a PoC that runs circles around Siri in semantic understanding and generated responses. being able to reach out on Slack to the engineers that built the NPU stack ain’t bad neither.
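
to put rough numbers on the quantization point, here is a minimal sketch in plain Python. it assumes a round 7 billion parameters and counts the weights only (no KV cache, activations, or per-block scale overhead), and the function names `weight_memory_gib`, `quantize_4bit`, and `dequantize` are made up for illustration. the quantizer is a simple symmetric scheme with one scale per block, which is not necessarily what any real runtime or Apple would use.

```python
def weight_memory_gib(n_params: int, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


def quantize_4bit(weights: list[float]) -> tuple[list[int], float]:
    """Symmetric quantization of one block to integers in [-7, 7].

    Returns the integer codes and the single float scale for the block.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 7 if max_abs > 0 else 1.0
    codes = [max(-7, min(7, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes: list[int], scale: float) -> list[float]:
    """Map integer codes back to approximate float weights."""
    return [c * scale for c in codes]


# Assumption: a round 7e9 parameters, close to a "7B" model.
N_PARAMS = 7_000_000_000

for bits in (16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_memory_gib(N_PARAMS, bits):.2f} GiB")

# Round-trip a tiny block to see the precision that gets traded away.
block = [0.7, -0.2, 0.0, 0.1]
codes, scale = quantize_4bit(block)
print(codes, dequantize(codes, scale))
```

going from 16-bit to 4-bit cuts the weight footprint by 4x, which is the difference between a model that won't fit in a phone's memory and one that plausibly might. the cost is that each reconstructed weight can be off by up to half a quantization step.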

