Introduction
Why does Google insist on making it’s assistant situation so bad?
In theory, assistant should be the best it’s ever been. It’s better at “understanding” what I ask for, and yet it less capable than ever to do so.
This post is a rant about my experience using modern assistants on Android, and why, while I used to use these features actively in the mid-to-late-2010s, I now don’t even bother with them.
The task
Back in the late 2010s, I used to be able to hold the home button and ask the Google Assistant to create an event based on this email. It would grab the context from my screen, and do exactly that. This has been impossible, as far as I can tell, to do for years now.
Trying to find the “right” assistant
At some point, my phone stopped responding to “OK Google”. I still don’t know why it won’t work.
Holding down the Home bar (the home button went the way of the dodo) brings up an assistant-style UI, but it’s dumb as bricks and only Googles the web. Useless.
So, I installed Gemini. I asked it to perform a basic task. It responded “in live mode, I cannot do that”. Asking it how I can get it to create me a calendar event, it could not answer the question. Saying instead to open my calendar app and create a new event. I know how to use a calendar. I want it to justify its existence by providing more value than a Google search. It was ultimately unable to answer the question.
Searching the internet, apparently both of the ways I had been using assistant features were the wrong way to do it. You have to hold down the power button, that’s how to launch the proper one. My internal response was:
No, that’s for the power menu. I don’t want to dedicate it to Assistant.
Well, apparently, that’s the only way to do it now, so there I go sacrificing another convenience turning it on.
Pulling teeth with Gemini
So I ask this power-menu-version of Gemini to do the same simple task. I tried 4 separate times.
First, it created a random event “Meeting with a client” on a completely different day (what?).
Second time it just crashed with an error.
The third time, it asked me which email to use, giving me a list, but that list did not contain the email I was interested in. I asked it to find the Royal Mail one. No success.
So, quite clearly, it wasn’t using screen content.
I rephrased the question: “Please create an event from the content on my screen”. It replied “Sure, when’s this for?”
I shouldn’t have to tell you. That’s the point. It’s right there.
Conclusion
There are too many damn assistant versions, and they are all bad. I can’t even imagine what it’s like to also have Bixby in the mix as a Samsung user. (Feel free to let me know below.)
It seems like none of them are able to pull context from what you are doing anymore, and you’ll spend more time fiddling and googling how to make them work than it would take for you to do the task yourself.
In some ways, assistants have gotten worst than almost 10 years ago, despite billions in investments.
As a little bonus, the internet is filled with AI slop that makes finding out real facts, real studies from real people harder than ever.
I write this all mostly to blow off steam, as this stuff has been frustrating me for years now. Let me know what your experience has been like below, I could use some camaraderie.
Have you heard of Homeassistant? It’s a self-hosted smart home solution that fills a lot of the gaps left by the most smart home tech. They’ve recently added and refined support for various different voice assistants, some of which run completely on your hardware. I have found they have great community support for this project and you can also buy their hardware if you don’t feel like tinkering on a Raspberry Pi or VM. The best thing (IMHO) about Homeassistant is that it is FOSS.
Homeassistant Voice Control
Voice control of devices you have in home assistant is cool, but I don’t think I would recommend it to an average person who uses Google assistant. Sure it can turn the lights on and off if it’s aware of those entities, but this user is describing playing games, asking for media streams, podcasts, all things home assistant voice does not support (certainly not out of the box).
I got their voice widget, its slow and stupid.
Need to figure out how to connect it to a gpu
NetworkChuck has a video explaining how to configure Home Assistant with voice, using Raspberry Pi and self-hosted LLM.
https://www.youtube.com/watch?v=XvbVePuP7NY
I have heard of it yeah! Definitely want to try it out… just haven’t gotten around to it yet.
Do you find the voice recognition is decent?