fix: recent events touch refactor, cache buster worker name reference by Ziinc · Pull Request #2999 · Logflare/logflare

Ziinc · 2025-12-03T21:11:54Z

two big fixes:

fixes partition supervisor name - discovered by @bblaszkow06 🙏
Removes GenSingleton usage from non-stable processes

High volume of starting/stopping of SourceSup across the clusters result in broadcast storms across the cluster, with syn getting overloaded with messages of procs starting/stopping and resulting in high message queue counts.

as the conflict resolutino is handled on a singular process on a per-scope basis, backing up of the message queue can result in bootloops, even if we always choose the original process with nanosecond resolution. although clock time discrepancies is a possibility, i am doubtful that it is the root cause for the broadcast storm we are seeing on prod.
Previous partitioning of the recent events scope was done to mitigate this, but removing GenSingleton usage altogether will fix the root problem.

Furthermore, the fact that it starts to occur across all clusters in a synchronized manner leads me to believe strongly that it is related to the scheduled SourceSup shutdown which happens every half hour across all nodes.

Given above hypothesis, refactoring achieves the following:

moves ui procs to its own :syn scope
moves general quantum scheduler to its own global process using GenSingleton
runs the recent events touch every 5 minutely on at most 500 sources.
performs at most 5 transactions each 5 minutes when updating sources
increased auto shutdown interval to hourly on all nodes.

To achieve the global job scheduler, i added in an additional config to Quantum to prevent the startup of the inbuilt Task.Supervisor, so now the global run strategy works without duplicating jobs.

Upstream PR has been opened here for the new config option

Expected outcomes of this PR:

significant reduction in DB transactions per second
no message queue buildup on :syn_gen_scope for the :core scope
less frequent memory reclaiming from SourceSup auto-shutdown
cross cluster cache busting working again

Ziinc · 2025-12-04T07:36:42Z

will be merging this earlier to get this to prod.

josevalim

Hi @Ziinc 👋 some comments inline!

lib/logflare/scheduler.ex

lib/logflare/scheduler/run_strategy_all

lib/logflare/sources.ex

Ziinc · 2025-12-04T08:21:27Z

follow up issues created on linear relating to percentage-based run strategy.

Ziinc added 4 commits December 4, 2025 04:48

fix: CacheBusterWorker.Supervisor reference

dc55705

feat: use global scheduler to handle RecentEventsTouch

0aed6de

fix: tweak syn scopes

059bf81

chore: compilation errors

b30c86a

Ziinc requested review from a team, amokan and chasers December 3, 2025 21:11

github-actions bot assigned Ziinc Dec 3, 2025

Ziinc added 6 commits December 4, 2025 05:14

perf: increase schedule to 1hour

6d66eb7

chore: remove dbg

90bd0e0

chore: alias

5e90abe

chore: fix failing tests

a477b39

fix: remove remnant scope partitioning

398970c

fix: correctly perform clusterwide task coordination

11bb0e1

Ziinc and others added 2 commits December 4, 2025 15:37

Merge branch 'main' into fix/recent-events-touch

8184cca

chore: version bump

003dff4

josevalim reviewed Dec 4, 2025

View reviewed changes

lib/logflare/scheduler.ex Outdated Show resolved Hide resolved

lib/logflare/scheduler/run_strategy_all Outdated Show resolved Hide resolved

lib/logflare/sources.ex Outdated Show resolved Hide resolved

Ziinc added 2 commits December 4, 2025 16:01

chore: tweak job schedules

2f84167

chore: PR comments

e635099

Ziinc merged commit f625ba2 into main Dec 4, 2025
8 checks passed

Ziinc deleted the fix/recent-events-touch branch December 4, 2025 08:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: recent events touch refactor, cache buster worker name reference#2999

fix: recent events touch refactor, cache buster worker name reference#2999
Ziinc merged 14 commits intomainfrom
fix/recent-events-touch

Ziinc commented Dec 3, 2025 •

edited

Loading

Uh oh!

Ziinc commented Dec 4, 2025

Uh oh!

josevalim left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Ziinc commented Dec 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Ziinc commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Ziinc commented Dec 4, 2025

Uh oh!

josevalim left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Ziinc commented Dec 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Ziinc commented Dec 3, 2025 •

edited

Loading

Ziinc commented Dec 4, 2025 •

edited

Loading