Help! Problem with aggregation?

GBoulvin · October 25, 2021, 5:54pm

I have a big problem with Gladys.
The web interface is inaccessible and nothing works anymore.
I’m on rpi4 with an SSD. No recent configuration.

> gladys-server@ start:prod /src/server
> cross-env NODE_ENV=production node index.js

2021-10-25T20:31:09+0200 <error> index.js:15 (process.<anonymous>) unhandledRejection catched: Promise {
  <rejected> DatabaseError [SequelizeDatabaseError]: SQLITE_ERROR: duplicate column name: last_monthly_aggregate
      at Query.formatError (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:415:16)
      at Query._handleQueryResponse (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:72:18)
      at afterExecute (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:246:27)
      at Statement.errBack (/src/server/node_modules/sqlite3/lib/sqlite3.js:14:21) {
    parent: [Error: SQLITE_ERROR: duplicate column name: last_monthly_aggregate] {
      errno: 1,
      code: 'SQLITE_ERROR',
      sql: 'ALTER TABLE `t_device_feature` ADD `last_monthly_aggregate` DATETIME DEFAULT NULL;'
    },
    original: [Error: SQLITE_ERROR: duplicate column name: last_monthly_aggregate] {
      errno: 1,
      code: 'SQLITE_ERROR',
      sql: 'ALTER TABLE `t_device_feature` ADD `last_monthly_aggregate` DATETIME DEFAULT NULL;'
    },
    sql: 'ALTER TABLE `t_device_feature` ADD `last_monthly_aggregate` DATETIME DEFAULT NULL;',
    parameters: undefined
  }
}
2021-10-25T20:31:09+0200 <error> index.js:16 (process.<anonymous>) DatabaseError [SequelizeDatabaseError]: SQLITE_ERROR: duplicate column name: last_monthly_aggregate
    at Query.formatError (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:415:16)
    at Query._handleQueryResponse (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:72:18)
    at afterExecute (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:246:27)
    at Statement.errBack (/src/server/node_modules/sqlite3/lib/sqlite3.js:14:21) {
  parent: [Error: SQLITE_ERROR: duplicate column name: last_monthly_aggregate] {
    errno: 1,
    code: 'SQLITE_ERROR',
    sql: 'ALTER TABLE `t_device_feature` ADD `last_monthly_aggregate` DATETIME DEFAULT NULL;'
  },
  original: [Error: SQLITE_ERROR: duplicate column name: last_monthly_aggregate] {
    errno: 1,
    code: 'SQLITE_ERROR',
    sql: 'ALTER TABLE `t_device_feature` ADD `last_monthly_aggregate` DATETIME DEFAULT NULL;'
  },
  sql: 'ALTER TABLE `t_device_feature` ADD `last_monthly_aggregate` DATETIME DEFAULT NULL;',
  parameters: undefined
}

Edit: more logs

GBoulvin · October 25, 2021, 6:30pm

Is it dead? Am I good for a reinstallation (of the container)?
In the logs, it’s the same series of messages that loops endlessly…
MQTT works and the rest of the pi also. A reboot didn’t change anything.

VonOx · October 25, 2021, 9:26pm

How did you end up in this situation? What manipulations did you do?

Edit: I think your base migration failed during the update, I hope you’re the only one in this case (it went well on my side)

pierre-gilles · October 26, 2021, 12:32am

@GBoulvin Are we in agreement that you already had a problem with the last update? Wasn’t it you who had disk space issues on Gladys?

In any case, don’t panic, there’s always a way to migrate your data, so please don’t delete everything

Are you sure and certain that your SSD is being used by Gladys, that there is enough space, and that everything is fine on the hardware side?

pierre-gilles · October 26, 2021, 1:08am

Same question as @VonOx, did you do anything special on your instance? Didn’t you install any special builds on your production DB?

GBoulvin · October 26, 2021, 5:31am

No no, I haven’t touched anything and the last update went smoothly

I believe I am on the SSD (sda1):

pi@Gladys:~ $ sudo df
Filesystem     1K-blocks   Used Available Use% Mounted on
/dev/root           18648328   6273352   11404636  36% /
devtmpfs             1708700         0    1708700   0% /dev
tmpfs                1873564         0    1873564   0% /dev/shm
tmpfs                1873564     18096    1855468   1% /run
tmpfs                   5120         4       5116   1% /run/lock
tmpfs                1873564         0    1873564   0% /sys/fs/cgroup
/dev/sda3          230719828 120039832  110679996  53% /media/Musique
/dev/sda1             252045     49242     202804  20% /boot
/dev/sdb4          975166564 513400624  461765940  53% /media/Films
/dev/sdc2         676424856 477706312  198718544  71% /media/Musique-Photos
/dev/sdc1          788711143 490094216  298616927  63% /media/Dessins_Animes
overlay             18648328   6273352   11404636  36% /var/lib/docker/overlay2/9db439ce3a564d9009ea1df484823e8f86d5222b2afb65d10d20ffc62cad1e54/merged
overlay             18648328   6273352   11404636  36% /var/lib/docker/overlay2/09164130445af755c153cf14fa034d3772ec72d7555f82efc08205e756141865/merged
overlay             18648328   6273352   11404636  36% /var/lib/docker/overlay2/1d0db2d20feb3e8ccab10febdf27997763a82a14735f58926c11631f66a6e44c/merged
tmpfs                 374712         0     374712   0% /run/user/1000

VonOx · October 26, 2021, 6:23am

I don’t think that’s related to the SSD

pierre-gilles · October 26, 2021, 6:46am

Could it be possible that your instance rebooted during the migration?

What exactly happened? Did Gladys suddenly become inaccessible? You didn’t try to restart Gladys in the middle?

No need to panic, in any case we can fix this!

pierre-gilles · October 26, 2021, 6:59am

If you don’t mind (if you have sensitive data and it bothers you, you’re not obliged, I’m just suggesting), could you do:

docker stop gladys

Then retrieve the DB file: gladys-production.db which is in the volume you must have specified when you launched a Gladys container that points to your SSD.

It’s the left part of this part in your docker run:

-v /var/lib/gladysassistant:/var/lib/gladysassistant \

And if you don’t mind, send it to me privately on the forum so I can see what’s wrong (you can put it on a dropbox/drive, whatever you want). That way we’ll be sure about the DB migration part.

Eventually, I can correct the file for you and you can reintegrate it into your instance

I know it’s a bit artisanal, but I’m really interested in knowing why it didn’t work for you specifically, while it worked for everyone else (at least my instance updated itself without any problems, @VonOx did the same, and all previous testers didn’t have any issues).

GBoulvin · October 26, 2021, 3:44pm

So, since I had made a backup of the DB on the 21st, I thought I would reinject it (before seeing your message).
And it didn’t work, the message is the same.
I can no longer give you the DB in question because, being a bit of an idiot, I overwrote the ‹ defective › one without saving it (since it was defective).
I am sending you the saved DB (700MB) and passed through Gladys, since it also seems defective?
Thanks for your support!

PS: I promise I didn’t do anything unusual during the update (was there an update?)
Gladys just stopped working (‹ inaccessible ›). I realized this when the aquariums did not turn off and we reached 24 degrees in the living room

pierre-gilles · October 27, 2021, 1:11am

Thank you @GBoulvin!

I’ll look into it later.

Ah, at least there’s something positive: it’s a real problem related to your DB and not a random problem (like an instance that reboots, that kind of thing). That’s good to know!

Out of curiosity, when did you first install Gladys (to know when this DB dates from)? At the release of v4? During the beta? During the alpha?

Throughout the life of this DB, do you confirm that you have never installed a « custom build » that affects the same DB?

In any case, I will be fixed by seeing the DB later, I’ll keep you posted!

Yes, I released Gladys v4.6.0 with curve display on the dashboard. I haven’t had time to communicate about it yet, I’ll do it at the end of the week on my Gladys days The idea was also to wait for the first feedback before communicating about it, so far apart from you no one has complained so it’s rather positive

GBoulvin · October 27, 2021, 3:48am

Hello!
The first installation dates back to December '20 and I think the DB is from that time.
I haven’t played with the test or beta versions.
However, I have had a few power outages recently. Maybe a lead?

I had noticed it was in build, but you surprised us… For Halloween?

pierre-gilles · October 27, 2021, 10:57am

I checked, your DB is corrupted on the t_device_feature_state table, there are corrupted lines.

I don’t know if you’ve noticed until now, but when Gladys is active, there isn’t 1 DB file but 3 files:

These files only appear when the DB is in use, they are temporary files that allow SQLite to store « WAL » (write-ahead-log), so that if Gladys crashes, your DB can recover when Gladys reboots.

If you ever want to make a copy/backup of your DB, you necessarily need to stop Gladys / or use the official SQLite backup utility (sqlite3 .backup I think), otherwise if you just copy the .db file you can end up with a corrupted file.

By any chance, did you move this DB without using an official utility/without stopping Gladys?

For your file, I managed to fix it, but I won’t be able to upload it for you until tomorrow at the coworking space, my upload connection at home isn’t strong enough to send it to you

GBoulvin · October 27, 2021, 12:01pm

In my memories, I always did a ‹ stop Gladys › before any DB backup, but it’s not impossible that I made a mistake…
I suppose that, since the data wasn’t being used, it didn’t matter if the DB was corrupted.
After that, I looked a lot to transfer the installation to SSD, I had a corrupted SD card, etc. So many potential sources of errors!
In short, thank you very much for taking the time to look at it.
If it’s possible and it suits you, don’t hesitate to truncate the DB to lighten it, I don’t have any vital data…

PhilippeMA · October 28, 2021, 6:43am

Hello,

I am adding the behavior observed on my side, not knowing if there is a link with what is exposed here?

In the task list, I notice 2 types of errors that keep happening (this page is becoming very very long):

« Aggregation sensor monthly/daily/hourly data »:

    at calculateTriangleArea (/src/server/node_modules/downsample/index.js:69:26)
    at LTTBIndexesForBuckets (/src/server/node_modules/downsample/index.js:662:18)
    at /src/server/node_modules/downsample/index.js:690:34
    at /src/server/lib/device/device.calculcateAggregateChildProcess.js:119:27
    at Map.forEach (<anonymous>)
    at /src/server/lib/device/device.calculcateAggregateChildProcess.js:111:35

and (just an excerpt as it is much longer), « Aggregation hourly sensor data »:

    at Query.formatError (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:412:16)
    at Query._handleQueryResponse (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:72:18)
    at Statement.afterExecute (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:246:27) {
  parent: [Error: SQLITE_BUSY: database is locked] {
    errno: 5,
    code: 'SQLITE_BUSY',
    sql: "INSERT INTO `t_device_feature_state_aggregate` (`id`,`type`,`device_feature_id`,`value`,`created_at`,`updated_at`) VALUES

VonOx · October 28, 2021, 7:17am

PhilippeMA:

et (just a snippet as it’s much longer), « Sensor hourly data aggregation »:

Error: TimeoutError [SequelizeTimeoutError]: SQLITE_BUSY: database is locked
    at Query.formatError (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:412:16)
    at Query._handleQueryResponse (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:72:18)
    at Statement.afterExecute (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:246:27) {
  parent: [Error: SQLITE_BUSY: database is locked] {
    errno: 5,
    code: 'SQLITE_BUSY',
    sql: "INSERT INTO `t_device_feature_state_aggregate` (`id`,`type`,`device_feature_id`,`value`,`created_at`,`updated_at`) VALUES

Same here

cicoub13 · October 28, 2021, 7:19am

Same issue here. It seems that the Gladys Plus backup locks the database, and if the aggregation starts at the same time (e.g., on startup), I get this error.

cicoub13 · October 28, 2021, 9:02am

Additionally, I have errors like this after a few minutes for the hourly aggregation part.
I have restarted the Raspberry Pi several times, but it still gets stuck at 52%. I specify that I am on an SD card and not an SSD

2021-10-28T10:37:51+0200 <info> device.calculateAggregate.js:38 (DeviceManager.calculateAggregate) Calculating aggregates device feature state for interval hourly
2021-10-28T10:37:51+0200 <info> index.js:63 (Server.<anonymous>) Server listening on port 80
2021-10-28T10:44:47+0200 <warn> device.calculateAggregate.js:95 (Socket.<anonymous>) device.calculateAggregate stderr: RangeError: Maximum call stack size exceeded
    at chunk (/src/server/utils/chunks.js:11:26)
    at /src/server/lib/device/device.calculcateAggregateChildProcess.js:147:22
2021-10-28T10:44:47+0200 <warn> device.calculateAggregate.js:101 (ChildProcess.<anonymous>) device.calculateAggregate: Exiting child process with code 1
2021-10-28T10:44:47+0200 <error> device.onHourlyDeviceAggregateEvent.js:22 (DeviceManager.onHourlyDeviceAggregateEvent) Error: RangeError: Maximum call stack size exceeded
    at chunk (/src/server/utils/chunks.js:11:26)
    at /src/server/lib/device/device.calculcateAggregateChildProcess.js:147:22
    at ChildProcess.<anonymous> (/src/server/lib/device/device.calculateAggregate.js:102:23)
    at ChildProcess.emit (events.js:400:28)
    at maybeClose (internal/child_process.js:1058:16)
    at Process.ChildProcess._handle.onexit (internal/child_process.js:293:5)

Terdious · October 28, 2021, 6:29pm

Hello !!

Same here, the errors are back, from time to time, but not permanent. I thought you had done what was needed @pierre-gilles, but I have the impression that it’s not the same error:

at Query.formatError (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:412:16)
at Query._handleQueryResponse (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:72:18)
at Statement.afterExecute (/src/server/node_modules/sequelize/lib/dialects/sqlite/query.js:246:27) {
parent: [Error: SQLITE_BUSY: database is locked] {
errno: 5,
code: 'SQLITE_BUSY',
sql: 'UPDATE `t_device_feature` SET `last_monthly_aggregate`=$1,`updated_at`=$2 WHERE `id` = $3'
},
original: [Error: SQLITE_BUSY: database is locked] {
errno: 5,
code: 'SQLITE_BUSY',
sql: 'UPDATE `t_device_feature` SET `last_monthly_aggregate`=$1,`updated_at`=$2 WHERE `id` = $3'
},
sql: 'UPDATE `t_device_feature` SET `last_monthly_aggregate`=$1,`updated_at`=$2 WHERE `id` = $3',
parameters: undefined
}

pierre-gilles · October 29, 2021, 2:41am

I sent you your corrected database in a private message

Thanks for the feedback! I’ll take a look

Ah indeed that’s possible! I hadn’t thought of that, we might as well prevent the aggregation from running if a backup is in progress

In any case, in the meantime, it’s not « serious » in the sense that the next aggregation will fix the problem.

Out of curiosity, how many deviceFeatureState do you have in Gladys? And how many per deviceFeature?

You must have a lot, more than @Terdious because we hadn’t even seen this bug during testing

I thought @Terdious’s installation was the largest, apparently not!

For the « Nan »:

This drives me crazy, I know @lmilcent had them too during development but I’ve never been able to reproduce… If you could check what the API returns @cicoub13?

It may be due to the Gladys Plus backup running at the same time, I’ll try to see to prevent aggregation when the backup is running.

Topic		Replies	Views
Problème de performance sur dashboard avec beaucoup de graphiques Configuration	52	2486	October 3, 2022
Actions ne se chargent pas dans une grande scène Configuration	33	1223	March 14, 2022
Bug dashboard graphique "Grouper par" avec valeur identique Configuration	23	145	June 20, 2025
Sauvegarder ou pas l'historique d'une fonctionnalités précise Développement	18	732	May 14, 2022
Nouvelle version 4.7 : Affichage graphique et mise à jour majeure de l'intégration Zigbee2mqtt! Actualités	47	3963	April 8, 2022

Help! Problem with aggregation?

Related topics