If the AI is local, it doesn't need to be on an internet connected device. At that point, malware and bugs in that stack don't add extra privacy risks* — but malware and bugs in all your other devices with microphones etc. remain a risk, even if the LLM is absolutely perfect by whatever standard that means for you.
* unless you put the AI on a robot body, but that's then your own new and exciting problem.
And having this as a small hardware device should not add relevant latency to it.