This is a topic I’ve wanted to tackle for a long time, especially the "Ask the AI a question from scenes" part (even without an image), to be able to receive proactive notifications from the AI (futuristic!).
Since I was working on the ChatGPT part this morning, I developed this feature; it wasn’t very complicated with the new vision API of GPT-4o mini.
Ask the AI to describe a camera image
In a scene, it becomes possible to ask the AI to react to a camera image:
This feature is quite futuristic, and since it’s possible to inject variables into the message, it enables a lot of things:
- Analyze the day’s weather report and send me a short summary on my phone every morning at 8 AM
- If the alarm goes off at my house, create a message that summarizes what’s happening: sensor states, image analysis, etc., all combined into a single message!
- Analysis of any image, whether it comes from a camera at my home or from an online stream; everything is possible
I continued working on this topic this morning, and I think I’ve finished.
Here is a Docker build with the feature:
gladysassistant/gladys:ask-ai-in-scene
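To test it, pulling the tag and pointing your existing Gladys container at it should be enough (adapt this to your own `docker run` command or compose file, of course):

```
docker pull gladysassistant/gladys:ask-ai-in-scene
```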
@Terdious (and others), feel free to test it and tell me what you think!
How to use it?
In scenes, there is a new action, "Ask the AI".
This action asks Gladys AI (ChatGPT) a question, as if you were sending a message to Gladys, and you can optionally attach a camera image that will be analyzed by the AI.
Note: to keep processing time as fast as possible, and to keep this feature economically affordable, images are sent to the OpenAI API in "low" resolution, i.e. 512px maximum.
If some information is very small, it won’t be visible to the AI, but honestly at 512px you can already see quite a lot!
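For the curious, here is roughly what the underlying call looks like with the OpenAI Node SDK. This is only a minimal sketch, not the actual Gladys implementation (Gladys routes the request through its own backend); the function name, prompt and file path are placeholders, but the "low" detail setting is the one mentioned above:

```typescript
import { readFileSync } from "node:fs";
import OpenAI from "openai";

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

// Hypothetical helper: ask a question about a saved camera snapshot
async function askAiAboutImage(question: string, imagePath: string) {
  // The snapshot is sent as a base64 data URL
  const imageBase64 = readFileSync(imagePath).toString("base64");

  const response = await openai.chat.completions.create({
    model: "gpt-4o-mini",
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: question },
          {
            type: "image_url",
            image_url: {
              url: `data:image/jpeg;base64,${imageBase64}`,
              // "low" caps the image at 512px: faster and much cheaper
              detail: "low",
            },
          },
        ],
      },
    ],
  });

  return response.choices[0].message.content;
}

askAiAboutImage("Is there a car in front of the barrier?", "./snapshot.jpg").then(console.log);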
I was able to test a bit this morning. It looks very interesting even if I couldn’t test it fully. For example, depending on the camera placement, I’m not convinced that at 512px it can read a license plate in my setup ^^
Here is my context and the interesting answers it gave me. Unfortunately, even with it set to 10s, I couldn’t get a car centered in the image (by the way, tell me if I owe you money, because it ran like that for about ten minutes), so I couldn’t test whether it actually turned off the light:
Given that it responds to the bottom image, I’m impressed that it can spot a Skoda at that distance.
You can see that "Show me the barrier camera" has no effect; I have to send it as a separate action.
One thing is certain: this will motivate me to integrate the camera into Netatmo and even more so the webhooks.
Feedback:
- Integrate retrieval of switch states, like for lights, so there is no need to run the "Retrieve …" action and inject variables
- Implement actions on switches the way they are done for lights
- Support retrieval of all types of functionality
- Support multiple retrievals (example: "Check the status of all my temperature sensors and give me a full report")
But honestly, otherwise it works very well and it’s very fast.
It’s a bit too far off to the side indeed; if you move the camera or the car, I think it could work.
No, don’t worry, it’s fine
That’s really impressive!
The current development only handles the 7 actions that are in the documentation, and has no knowledge of your connected home.
So asking "if the office is on", or "Retrieve the state of the air purifier switch", doesn’t work; that’s a ChatGPT hallucination.
I agree that it would be cool if that were possible! I think we just need to give the AI a view of the state of the entire connected home at the time of the request.
I also see that you request multiple actions in your block; currently that’s not possible. Right now, 1 request = 1 action/response. If you ask for several things, the AI will pick whichever action it considers most important.
But we can implement multi-action without any problem.
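To give an idea of what "a view of the state of the entire connected home" could look like, here is a rough sketch; none of this is real Gladys code, and the device shape, prompt wording and JSON action format are all invented for the example. The idea would be to serialize the current device states into the prompt and let the model reply with a list of actions instead of a single one:

```typescript
// Hypothetical sketch: inject the current home state and ask for several actions.
interface DeviceState {
  name: string;
  value: number | string;
}

function buildSystemPrompt(devices: DeviceState[]): string {
  // Dump every device state as one line the model can read
  const stateDump = devices.map((d) => `- ${d.name}: ${d.value}`).join("\n");
  return [
    "You control a smart home. Current device states:",
    stateDump,
    'Reply with a JSON array of actions, e.g. [{"action":"turn_on","device":"office light"}].',
  ].join("\n");
}

// Example usage with made-up states
const prompt = buildSystemPrompt([
  { name: "office light", value: "off" },
  { name: "living room temperature", value: 21.5 },
]);
console.log(prompt);
```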
It’s clear that for you, since you use Netatmo a lot, the combo will be nice.
Search in there for "webhooks" or "List of events". @pierre-gilles already created the webhook support for us in Gladys a few years ago now. I think the longest part now is finishing the PR for the cameras. All that’s left to do ^^
You can therefore create conditions based on sensor values, for example "If the brightness is low, then turn on the living room light".
Be careful though, when a camera image is attached, I find that ChatGPT hallucinates more easily. I tried asking it to describe the camera image + give me a recap of the home’s sensors, and it completely hallucinated, giving me sensor values that I don’t even have!
Tell me how it goes on your side; I’d be curious to see whether it slows execution down too much on such a large installation.
@Terdious I found the cause: it was server-side. The payload was being blocked because it was too large, but the server limit was really very low, so I increased it!