perf: major speed up when querying jobs by tags by michaeladler · Pull Request #429 · siemens/wfx

michaeladler · 2026-04-02T11:50:28Z

Description

Computing the total field in the /jobs pagination result parameter was done in a very inefficient way (correlating with the number of queried tags).

This patch series contains the following two optimizations:

Disable pagination metadata by default as this is typically only needed by UI clients. Clients relying on that information can append the new query parameter pagination=true to include the pagination metadata in responses.
The old ent-generated code looped over each tag and added a separate HasTagsWith predicate, producing one correlated IN subquery per tag:

  WHERE job.id IN (SELECT tag_jobs.job_id FROM tag_jobs
    JOIN tag ON ... WHERE tag.name = 'TAG1')
  AND job.id IN (SELECT tag_jobs.job_id FROM tag_jobs
    JOIN tag ON ... WHERE tag.name = 'TAG2')

Each subquery performs an independent scan of the tag_jobs table, which is expensive when the jobs table is large.

Replace this with a single explicit JOIN on tag_jobs and tags, filtering all requested tags in a single IN clause:

  FROM job
  JOIN tag_jobs ON job.id = tag_jobs.job_id
  JOIN tag ON tag_jobs.tag_id = tag.id
  WHERE tag.name IN ('TAG1', 'TAG2')

The JOIN allows the database to resolve the tag filter in a single pass.
Add a database index on tag_jobs(job_id) so the join can use an index lookup instead of a sequential scan.

Benchmarks

I used the enhanced wfx-loadtest to populate a locally running PostgreSQL database with 1 million jobs, each having two tags.

wfx 0.5.0: Querying for one tag took approximately 3 seconds, while querying for two tags exceeded 10 seconds (hitting the wfxctl timeout).
Optimized, with pagination enabled: Queries consistently took ~1 second, regardless of the number of tags.
Optimized without pagination: Queries completed in ~25ms.

Issues Addressed

List and link all the issues addressed by this PR.

Change Type

Please select the relevant options:

Bug fix (non-breaking change that resolves an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist

I have read the CONTRIBUTING document.
My changes adhere to the established code style, patterns, and best practices.
I have added tests that demonstrate the effectiveness of my changes.
I have updated the documentation accordingly (if applicable).
I have added an entry in the CHANGELOG to document my changes (if applicable).

codecov · 2026-04-02T12:25:10Z

Codecov Report

❌ Patch coverage is 60.46512% with 34 lines in your changes missing coverage. Please review.
✅ Project coverage is 73.73%. Comparing base (c06da84) to head (0d83d67).

Files with missing lines	Patch %	Lines
cmd/wfx/cmd/config/appconfig.go	7.69%	11 Missing and 1 partial ⚠️
internal/persistence/entgo/workflow_query.go	28.57%	9 Missing and 1 partial ⚠️
internal/server/server_collection.go	0.00%	5 Missing ⚠️
api/wfx.go	0.00%	4 Missing ⚠️
internal/persistence/entgo/job_query.go	89.28%	2 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #429      +/-   ##
==========================================
- Coverage   73.90%   73.73%   -0.18%     
==========================================
  Files          96       96              
  Lines        4055     4059       +4     
==========================================
- Hits         2997     2993       -4     
- Misses        828      839      +11     
+ Partials      230      227       -3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Enable logging of all SQL queries when the log level is set to trace. This is useful for identifying slow or inefficient queries during development and debugging, e.g. to analyze N+1 query problems. Signed-off-by: Michael Adler <michael.adler@siemens.com>

Signed-off-by: Michael Adler <michael.adler@siemens.com>

Previously this was ui/priv, now it's in ui/dist. Signed-off-by: Michael Adler <michael.adler@siemens.com>

Signed-off-by: Michael Adler <michael.adler@siemens.com>

Introduce a 'populate' command to easily fill the database with sample data. This is useful for reproducing performance issues or testing scenarios that require a non-empty database. Signed-off-by: Michael Adler <michael.adler@siemens.com>

The old ent-generated code looped over each tag and added a separate HasTagsWith predicate, producing one correlated IN subquery per tag: WHERE job.id IN (SELECT tag_jobs.job_id FROM tag_jobs JOIN tag ON ... WHERE tag.name = 'TAG1') AND job.id IN (SELECT tag_jobs.job_id FROM tag_jobs JOIN tag ON ... WHERE tag.name = 'TAG2') Each subquery performs an independent scan of the tag_jobs table, which is expensive when the jobs table is large. Replace this with a single explicit JOIN on tag_jobs and tags, filtering all requested tags in one IN clause: FROM job JOIN tag_jobs ON job.id = tag_jobs.job_id JOIN tag ON tag_jobs.tag_id = tag.id WHERE tag.name IN ('TAG1', 'TAG2') The JOIN allows the database to resolve the tag filter in a single pass. Add a database index on tag_jobs(job_id) for MySQL, PostgreSQL, and SQLite so the join can use an index lookup instead of a sequential scan. Use DISTINCT to deduplicate rows introduced by the join. Signed-off-by: Michael Adler <michael.adler@siemens.com>

Signed-off-by: Michael Adler <michael.adler@siemens.com>

Add a `pagination` boolean query parameter to the GET /jobs and GET /workflows endpoints. When not set (default: false), the pagination object is omitted from the response, reducing payload size for clients that don't need it. Signed-off-by: Michael Adler <michael.adler@siemens.com>

This removes the retry loops for storage initialization and creating network listeners. These are unnecessary in both common scenarios: - Developer use: fast failure with a clear error is more useful than silently retrying for minutes. - Production: service managers (systemd, k8s) already handle restarts with proper backoff and observability. Fail fast and let the caller decide how to recover. Signed-off-by: Michael Adler <michael.adler@siemens.com>

michaeladler requested a review from stormc as a code owner April 2, 2026 11:50

michaeladler force-pushed the feat/tags branch from 06cc4fa to 2d8a787 Compare April 2, 2026 12:22

michaeladler self-assigned this Apr 2, 2026

michaeladler force-pushed the feat/tags branch 4 times, most recently from ef187e2 to 56b1473 Compare April 2, 2026 15:21

stormc reviewed Apr 14, 2026

View reviewed changes

Comment thread internal/persistence/entgo/mysql.go

michaeladler force-pushed the feat/tags branch from 56b1473 to e15c8e0 Compare April 15, 2026 09:30

stormc reviewed Apr 16, 2026

View reviewed changes

Comment thread cmd/wfx/cmd/config/appconfig.go

stormc reviewed Apr 16, 2026

View reviewed changes

Comment thread cmd/wfx/cmd/config/appconfig.go Outdated

stormc reviewed Apr 16, 2026

View reviewed changes

Comment thread cmd/wfx/cmd/config/appconfig.go Outdated

michaeladler added 11 commits April 17, 2026 11:43

refactor: use newly introduced Go syntaxes

b4396ef

Signed-off-by: Michael Adler <michael.adler@siemens.com>

chore: add wfx-loadtest to gitignore

f99ff41

Signed-off-by: Michael Adler <michael.adler@siemens.com>

refactor(loadtest): align CLI flags with wfxctl

9ed7600

Signed-off-by: Michael Adler <michael.adler@siemens.com>

refactor: rename loadtest file

af3cc53

Signed-off-by: Michael Adler <michael.adler@siemens.com>

chore: ignore ui/dist files

82eb2f0

Previously this was ui/priv, now it's in ui/dist. Signed-off-by: Michael Adler <michael.adler@siemens.com>

refactor: move initStorage to AppConfig and export it

8416719

Signed-off-by: Michael Adler <michael.adler@siemens.com>

feat(loadtest): add command to seed database

c3fd19e

Introduce a 'populate' command to easily fill the database with sample data. This is useful for reproducing performance issues or testing scenarios that require a non-empty database. Signed-off-by: Michael Adler <michael.adler@siemens.com>

chore: take care of nix deprecation

a34c850

Signed-off-by: Michael Adler <michael.adler@siemens.com>

michaeladler force-pushed the feat/tags branch 2 times, most recently from b552c74 to 7bfc656 Compare April 17, 2026 09:51

michaeladler force-pushed the feat/tags branch from 7bfc656 to 0d83d67 Compare April 17, 2026 09:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: major speed up when querying jobs by tags#429

perf: major speed up when querying jobs by tags#429
michaeladler wants to merge 12 commits intosiemens:mainfrom
michaeladler:feat/tags

michaeladler commented Apr 2, 2026 •

edited

Loading

Uh oh!

codecov bot commented Apr 2, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

michaeladler commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Benchmarks

Issues Addressed

Change Type

Checklist

Uh oh!

codecov bot commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

michaeladler commented Apr 2, 2026 •

edited

Loading

codecov bot commented Apr 2, 2026 •

edited

Loading