Gladys 4 - Voice Recognition

I think we need to look at the requirement as a whole :slight_smile:

Before talking about development, we should create a small functional specification: what do we want to do?

How do competitors handle it?

For example, with Alexa or Google Home, when a question is asked, « who » in the house is responsible for the action?

With Google / Alexa, it’s simple: you speak, and it executes what you ask. :smiley:

But I can give you a quick spec

On Alexa, there are voice profiles, so it can talk to different people in the house and the responses can be customized, I think.

Yes, but some commands require knowing who is speaking, right? If you say « Add XXX to my shopping list » or « Tell YY that I’m leaving the house », what happens?

I think it’s worth looking at how they handle this and thinking carefully about the usage :slight_smile:

  1. Via messages, we will certainly have commands tied to the user. If we don’t have this information on the voice side, these commands need to be deactivated, or we need to find a way to know who is speaking.
  2. Currently, the user is used to determine which language to classify the text in. If we no longer have this information, we need a global language setting for the voice part.
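
To make these two points concrete, here is a minimal sketch of what they could look like. It is written in Python purely for illustration; every name in it (`VOICE_DEFAULT_LANGUAGE`, `User`, `Request`, `Intent`, `requires_user`) is hypothetical and not part of the Gladys code base.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical global setting used when the speaker is unknown (point 2).
VOICE_DEFAULT_LANGUAGE = "en"


@dataclass
class User:
    name: str
    language: str


@dataclass
class Request:
    text: str
    user: Optional[User] = None  # None for an anonymous voice request


@dataclass
class Intent:
    name: str
    requires_user: bool = False  # e.g. "add XXX to my shopping list"


def resolve_language(request: Request) -> str:
    """Use the user's language when known, else the global voice setting."""
    if request.user is not None:
        return request.user.language
    return VOICE_DEFAULT_LANGUAGE


def can_execute(intent: Intent, request: Request) -> bool:
    """Deactivate user-bound commands when nobody is identified (point 1)."""
    return not intent.requires_user or request.user is not None
```

With this shape, a message from a known user keeps today's behaviour, while an anonymous voice request falls back to the global language and simply cannot trigger user-bound commands.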

Similarly, for the Rhasspy part, it would be great to define what we want in terms of interface in Gladys.

Yeah, but anyone can talk to her and she executes

I agree that’s the desired functionality in principle.

And the second issue is that we don’t all have Amazon/Google engineers to do profile recognition for us ^^

We can plan for a case where we recognize who is speaking. But it won’t work right away: as far as I know, no open-source tool is currently capable of doing this.
That said, yes, we can consider building this possibility into the code.

We don’t execute them


Of course! :slight_smile: I’m not saying we should do it; we just need to plan ahead and make sure Gladys doesn’t crash if a request requires knowing who is speaking.

Also, we will need to block requests that have non-textual responses: « show me the living room camera » is of little interest vocally.
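
As a rough illustration of that kind of guard, here is a small Python sketch. It is only an assumption about how such a check could be shaped: `Intent`, `response_type`, `requires_user` and `voice_refusal` are made-up names, not existing Gladys or Rhasspy APIs.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical set of response types that make sense when spoken aloud.
SPEAKABLE_TYPES = {"text"}


@dataclass
class Intent:
    name: str
    response_type: str = "text"  # e.g. "text" or "image"
    requires_user: bool = False


def voice_refusal(intent: Intent, speaker_known: bool = False) -> Optional[str]:
    """Return a spoken refusal, or None when the request can run normally.

    Returning a sentence instead of raising keeps the voice pipeline alive
    when a request cannot be served by voice.
    """
    if intent.response_type not in SPEAKABLE_TYPES:
        return "Sorry, I can't answer that by voice."
    if intent.requires_user and not speaker_known:
        return "Sorry, I need to know who is speaking for that."
    return None
```

For example, `voice_refusal(Intent("camera.show", response_type="image"))` returns a refusal sentence, while a plain text intent returns `None` and goes through as usual.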

Yes yes of course! :slight_smile:

If you want a crash test, there’s my 4-year-old daughter (she makes Alexa blow up ^^)

My niece (4 years old) asked to play « soupe aux choux » because she had that song at school; Alexa played « bite au chaud » lol

My son is 2 and a half years old, it’s going to be great!!
He already touches all the buttons he finds and uses the touch screen on the TV.

My 4-year-old daughter made it crash too :joy:

Hello hello,
I came across a great offline and open-source voice recognition solution (on a Discord server)

It really works like a charm. I tried the small model (39MB) and it works perfectly and in real-time.
Plus, the documentation is well done and it installs in 2 minutes.
Honestly, I think it’s a great lead for Gladys :muscle:

(Also, unrelated, but do you know how to change your username on the forum? :sweat_smile:)

In your account preferences, I think? You click on your profile and go to preferences :slight_smile:

If it’s not possible, send me a PM with your new username and I’ll change it.

I don’t know if this is still up-to-date, but I came across this link a few days ago: User has no option to change username? - #6 by pfaffman - Support - Discourse Meta

Something to keep in mind, then :slight_smile:

No, we can’t, but @Toulouse-wine changed it for me, thank you! :pray:

Hello, do you have any news or tests regarding voice? I’m waiting for this before switching versions :wink:

Hello @jeremy37! I don’t think that has progressed; I believe @damalgos is currently moving house :slight_smile:

However, we now have a Google Home integration, which in one way or another gives you voice recognition, both at home and on your phone or in your car:

Hello,

Not much, actually: a move + a little girl who arrived earlier than expected :smiley:

As a result, I couldn’t make progress on my integrations.
For the moment, it’s at a standstill, but I hope to be able to resume soon.