GREAT NEWS about Lemmy Server Performance, another major SQL mistake has been discovered today: every single comment & post create (INSERT) is updating ~1700 rows in the site_aggregates table

RoundSparrow ( @RoundSparrow@lemmy.ml ) · edit-2 1 year ago

GREAT NEWS about Lemmy Server Performance, another major SQL mistake has been discovered today: every single comment & post create (INSERT) is updating ~1700 rows in the site_aggregates table

HTTP_404_NotFound ( @xtremeownage@lemmyonline.com ) · 1 year ago

Holy hell. Post this to one of the programming-related communities. That is interesting.

lemmyvore ( @lemmyvore@feddit.nl ) · 1 year ago

Not that interesting when it turns out it was an ORM artifact.

HTTP_404_NotFound ( @xtremeownage@lemmyonline.com ) · 1 year ago

True. I have experienced quite a few of those… But, still interesting nonetheless.

Shadow ( @Shadow@lemmy.ca ) · edit-2 1 year ago

It’s not on every comment, it’s mostly triggered on deletions and edits. The problem is actually infintesimally worse and 1700 rows are updated if you delete 3 comments. If you delete more it’s exponential and just straight up fails and locks your database.

I’ll probably put a patch in there later tonight and then see about a PR unless someone else does.

RoundSparrow ( @RoundSparrow@lemmy.ml ) · edit-2 1 year ago

It’s not on every comment,

My testing with latest code is that it is indeed on every single comment INSERT, every new comment. I have the ability to view my live data while using Lemmy: https://lemmyadmin.bulletintree.com/query/raw_site_aggregates?output=table

Every one of the 1486 rows on that table gets +1 on comment when I post a new comment on my instance.

it’s mostly triggered on deletions and edits

That is not correct. Edits do not change the count of comments column on site_aggregates - because the number isn’t changing. Deletes (of a comment or post) in Lemmy are also not SQL DELETE statements, they are just a delete data column in the table. That DELETE PostgreSQL trigger only gets run when a end-user cancels their Lemmy account in their profile.

Shadow ( @Shadow@lemmy.ca ) · 1 year ago

Ah Gotcha. That’s true, but the cascading issue that causes thousands of inserts happens on a delete.

That table update you’re looking at is blazing fast due to the relatively low number of sites, until you run it thousands of times on a delete.

PochoHipster ( @PochoHipster@lemmy.ml ) · 1 year ago

Holy shit

PlasticExistence ( @PlasticExistence@beehaw.org ) · 1 year ago

Indeed

Funderpants ( @FunderPants@lemmy.ca ) · 1 year ago

Mechaguana ( @Mechaguana@programming.dev ) · 1 year ago

Man that is some bug, no wonder lemmy had such a rough start performance wise during the reddit migration!

BeigeAgenda ( @BeigeAgenda@lemmy.ca ) · 1 year ago

Get some DBA’s on the job and Lemmy will be blazing fast.

RoundSparrow ( @RoundSparrow@lemmy.ml ) · 1 year ago

We have had DBA’s, the problem is the Rust code uses ORM and an auto JSON framework that makes tracing the code time-consuming to learn.

AnarchoYeasty ( @AnarchoYeasty@beehaw.org ) · 1 year ago

Honestly, ORMs are a waste of time. Why not use sqlx and just hand write the SQL to avoid issues like this.

RoundSparrow ( @RoundSparrow@lemmy.ml ) · 1 year ago

In this one case, it was hand-written SQL inside a PostgreSQL FUNCTION that the ORM knows nothing about. But there is a approach in the entire application to have live-data from PostgreSQL for every little thing.

BeigeAgenda ( @BeigeAgenda@lemmy.ca ) · 1 year ago

Okay so you may need to refactor here and there to get more performance.

eleitl ( @eleitl@lemmy.ml ) · 1 year ago

Wonderful. Time to start running my own instance.