Gladys 4 - Voice Recognition

pierre-gilles · January 21, 2021, 10:00pm

I think we need to consider the need as a whole

Before talking about development, we should create a small functional specification: what do we want to do?

How does it work at competitors?

For example, at Alexa, Google Home, when a question is asked, « who » in the house is responsible for the action?

damalgos · January 22, 2021, 6:35am

After that, with Google / Alexa, it’s simply you speak, and it executes what you ask.

But I can give you a quick spec

Jean34 · January 22, 2021, 6:43am

On Alexa, there are voice profiles, so it can speak to different people in the house and we can customize the responses, I think

pierre-gilles · January 22, 2021, 7:12am

Yes, but some commands require knowing who is speaking, right? If you say « Add XXX to my shopping list » or « Tell YY that I’m leaving the house, » what happens?

I think it’s worth looking at how they handle this and thinking carefully about the usage

It’s certain that via messages we will have commands related to the user, so if we don’t have this information on the voice side, these commands need to be deactivated or we need to find a way to know who is speaking.
Currently, the user is used to determine in which language to classify the text, if we no longer have this information it means that we need a global setting to be defined for the voice part.

Similarly, the Rhasspy part, it would be great to define what we want in terms of interface in Gladys

VonOx · January 22, 2021, 7:13am

Yeah but anyone can talk to her, she executes

damalgos · January 22, 2021, 7:19am

I agree that’s the desired functionality in principle.

And the second issue is that we don’t all have Amazon/Google engineers to do profile recognition for us ^^

damalgos · January 22, 2021, 7:22am

We can consider planning for a case where we recognize who is speaking. But it won’t work right away, as far as I know, no open-source tool is currently capable of doing this.
However, yes, implementing this possibility at the code level can be considered.

We don’t execute them…

pierre-gilles · January 22, 2021, 7:39am

Of course! I’m not saying we should do it, we just need to plan ahead and make sure Gladys doesn’t crash if a request requires knowing who is speaking.

Also, we will need to block requests that have non-textual responses: « show me the living room camera » is of little interest vocally.

damalgos · January 22, 2021, 8:02am

Yes yes of course!

VonOx · January 22, 2021, 8:11am

My 4-year-old daughter, if you want a crash test (she makes Alexa blow up ^^)

damalgos · January 22, 2021, 8:34am

My niece asked to play « soupe aux choux » because she had that song at school, Alexa played « bite au chaud » … (4 years old) lol

Tlse-vins · January 22, 2021, 8:58am

My son is 2 and a half years old, it’s going to be great!!
He already touches all the buttons he finds and uses the touch screen on the TV.

Jean34 · January 22, 2021, 11:56am

My 4-year-old daughter made it crash too

Pasdesushi · July 3, 2021, 7:40pm

Hello hello,
I came across a great offline and open-source voice recognition solution (on a Discord server)

It really works like a charm. I tried the small model (39MB) and it works perfectly and in real-time.
Plus, the documentation is well done and it installs in 2 minutes.
Honestly, I think it’s a great lead for gladys

(And otherwise, nothing to do with it but do you know how to change your username on the forum ?)

pierre-gilles · July 5, 2021, 1:10am

In your account preferences, I think? You click on your profile and go to preferences

If it’s not possible, send me a PM with your new username and I’ll change it.

Shiftmaj · July 5, 2021, 9:31am

I don’t know if this is still up-to-date, but I came across this link a few days ago: User has no option to change username? - #6 par pfaffman - Support - Discourse Meta

To keep in mind, therefore

Pasdesushi · July 5, 2021, 12:01pm

No, we can’t, but @Toulouse-wine changed it for me, thank you!

jeremy37 · September 20, 2021, 8:10pm

Hello, do you have any news or tests regarding voice? I’m waiting for this to change version

pierre-gilles · September 21, 2021, 4:49am

Hello @jeremy37! I don’t think that has progressed, I believe @damalgos is currently moving

However, we now have a Google Home integration, which in one way or another allows you to have voice recognition, both at home and on your phone/your car:

damalgos · September 21, 2021, 7:00am

Hello,

Not much, actually moving + a little girl who arrived earlier than expected

As a result, I couldn’t make progress on my integrations… For the moment, it’s at a standstill, but I hope to be able to resume soon.

Topic		Replies	Views
Gladys plus puissante que Home assistant Développement	14	1969	June 20, 2025
J'ai une très grande nouvelle pour le projet! Actualités	23	2202	January 26, 2021
Gladys Assistant 4.33 : Joyeux Noël à tous 🎄 Actualités	28	861	January 12, 2024
L'IA OpenAI GPT-3 disponible dans Gladys Assistant 🤯 Actualités	112	4078	October 18, 2024
Pouvoir discuter vocalement avec Gladys [Archive] Demande de fonctionnalités feature-integration	19	1845	June 13, 2022

Gladys 4 - Voice Recognition

Related topics