remove watcher and watch verb (forked apiserver) by matthyx · Pull Request #258 · kubescape/storage

matthyx · 2025-10-03T11:15:36Z

fixes kubescape/operator#315

Summary by CodeRabbit

New Features
- Metadata now records a last-updated timestamp for audit and time-based queries.
Refactor
- File-backed watch/notification plumbing removed; watch behavior disabled for several storage types.
- Storage internals simplified and resource-version/listing now use Unix-second timestamps for time-based listing and pagination.
Tests
- Test suites updated to align with storage and watch behavior changes.

github-actions · 2025-10-03T11:43:14Z

Summary:

License scan: failure
Credentials scan: failure
Vulnerabilities scan: success
Unit test: success
Go linting: success

github-actions · 2025-10-06T11:35:34Z

Summary:

License scan: failure
Credentials scan: failure
Vulnerabilities scan: success
Unit test: success
Go linting: success

github-actions · 2025-10-06T14:21:50Z

Summary:

License scan: failure
Credentials scan: failure
Vulnerabilities scan: success
Unit test: success
Go linting: success

github-actions · 2025-10-06T15:00:22Z

Summary:

License scan: failure
Credentials scan: failure
Vulnerabilities scan: success
Unit test: success
Go linting: success

github-actions · 2025-10-07T06:13:39Z

Summary:

License scan: failure
Credentials scan: failure
Vulnerabilities scan: success
Unit test: success
Go linting: success

Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>

coderabbitai · 2026-06-04T15:46:40Z

📝 Walkthrough

Walkthrough

This PR removes the file-backed watch dispatcher and all watch event publishing, disables Watch() across storage wrappers, adds last_updated timestamps and since-based filtering for resumable listing, updates constructors/config to drop watchDispatcher, and adapts tests and server startup wiring accordingly.

Changes

Remove Watch Infrastructure and Refactor Storage

Layer / File(s)	Summary
Remove watch dispatcher infrastructure and storage event publishing `pkg/registry/file/storage.go`, `pkg/registry/file/watch.go` (removed), `pkg/registry/file/watch_test.go` (removed)	Deletes watch dispatcher, removes watchDispatcher field from StorageImpl, updates constructors to drop watchDispatcher parameter, disables Watch() (returns `nil, nil`), and removes Added/Deleted/Modified event publishing from Create/Delete/GuaranteedUpdate paths. `immutableStorage.Watch()` returns `nil, nil` as well.
Add timestamp-based resource version for resumable list pagination `pkg/registry/file/sqlite.go`, `pkg/registry/file/storage.go`	Adds `last_updated` column and triggers in SQLite migrations; `listMetadata` and `listMetadataKeys` accept a `since` parameter, select and return `last_updated`, apply `(:since = 0 OR last_updated > :since)` predicate, and `GetListWithConn` parses `opts.ResourceVersion` as Unix seconds and sets `ListAccessor.ResourceVersion` from `lastUpdated` (with a time-before-query fallback).
Disable watches in type-specific wrapper storages `pkg/registry/file/applicationprofile_storage.go`, `pkg/registry/file/networkneighborhood_storage.go`	`ApplicationProfileStorage.Watch` and `NetworkNeighborhoodStorage.Watch` now return `nil, nil` instead of delegating to `realStore.Watch` or returning an empty watch.
Remove watchDispatcher dependency from ResourcesCleanupHandler `pkg/registry/file/cleanup.go`	Removes `watchDispatcher` field and constructor parameter from `ResourcesCleanupHandler`, removes conditional watch "Deleted" event emission after file delete, and simplifies delete flow to call `deleteMetadata` and ignore its return.
Remove WatchDispatcher from server config and startup `pkg/apiserver/apiserver.go`, `pkg/cmd/server/start.go`, `main.go`	Removes `WatchDispatcher` from `ExtraConfig` and `WardleServerOptions`, updates `NewWardleServerOptions` and `apiserver` storage constructor calls to stop passing watchDispatcher, and removes watchDispatcher initialization from `main.go`.
Update storage tests to use simplified NewStorageImpl signatures `pkg/registry/file/*_test.go` (configurationscansummarystorage, generatednetworkpolicy, vulnerabilitysummarystorage, storage_test, sqlite_test, containerprofile_processor)	Update many test call sites to the new `NewStorageImpl` signature (remove extra `nil`), adjust `listMetadata` call sites to capture the extra return value, and remove `watchDispatcher` test init where present. Adds `TestStorageImpl_Count`.
Replace watch-based verification with counter mechanism in container profile tests `pkg/registry/file/containerprofile_user_managed_test.go`	Removes `path` and watch imports; `watchMergedModifications` now returns deterministic counter-based drain and no-op stop; storage harness literals no longer initialize `watchDispatcher`.
Update go.mod dependencies and type registration `go.mod`, `pkg/apis/softwarecomposition/register.go`	Removes `github.com/puzpuzpuz/xsync/v2` from direct requirements and adds a `replace` directive mapping `k8s.io/apiserver` to `github.com/matthyx/apiserver@v0.0.0-20260603054931-54c588143d7b`. Reorders type entries in `addKnownTypes` (no API surface changes).

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

🐰 The watchers fade to silence, timestamps now reign—
no more events cascading through the chain.
Resume thy lists by Unix seconds' grace,
and cleanup runs without the watcher's trace.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 11.36% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and accurately summarizes the main change: removing the watcher and watch verb functionality from the codebase.
Linked Issues check	✅ Passed	The PR successfully addresses issue `#315` by removing the watch dispatcher and watch verb implementations, which were causing 'operation not supported' errors in k3s API server logs.
Out of Scope Changes check	✅ Passed	All changes are directly related to removing watch functionality. The go.mod dependency adjustment and type registration reordering are minimal, supporting changes for the main objective.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch grype

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2026-06-04T15:57:21Z

Summary:

License scan: failure
Credentials scan: failure
Vulnerabilities scan: failure
Unit test: success
Go linting: failure

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (3)

pkg/registry/file/vulnerabilitysummarystorage_test.go (1)
97-102: ⚡ Quick win

Assert the disabled-watch contract explicitly in the watch test.

The test currently validates only err; it should also verify that the returned watcher is nil now that watch is disabled.
Proposed patch
 func TestVulnSummaryStorageImpl_Watch(t *testing.T) {
 	storageImpl := NewStorageImpl(afero.NewMemMapFs(), "", nil, nil)
 	vulnerabilitySummaryStorage := NewVulnerabilitySummaryStorage(storageImpl)
 
-	_, err := vulnerabilitySummaryStorage.Watch(context.TODO(), "", storage.ListOptions{})
+	watcher, err := vulnerabilitySummaryStorage.Watch(context.TODO(), "", storage.ListOptions{})
 	assert.NoError(t, err)
+	assert.Nil(t, watcher)
 }
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/registry/file/vulnerabilitysummarystorage_test.go` around lines 97 - 102,
The test for Watch on VulnerabilitySummaryStorage only checks the error; update
it to also assert that the returned watcher value is nil to enforce the "watch
disabled" contract. In the test that constructs storageImpl via
NewStorageImpl(...) and vulnerabilitySummaryStorage via
NewVulnerabilitySummaryStorage(...), capture the watcher return from
vulnerabilitySummaryStorage.Watch(context.TODO(), "", storage.ListOptions{}) and
add an assertion (e.g., assert.Nil) that the watcher is nil in addition to the
existing assert.NoError on err.
pkg/registry/file/containerprofile_user_managed_test.go (1)
388-402: 💤 Low value

Counter-based mock is tightly coupled to test scenarios.

The hardcoded return pattern (1 → 0 → 1) matches the exact expectations of TestConsolidateUserManagedIdempotent. While functional, this couples the mock to specific test flow. Consider adding a brief comment explaining the expected call sequence so future maintainers understand why this pattern was chosen.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/registry/file/containerprofile_user_managed_test.go` around lines 388 -
402, The counter-based mock in e2eHarness.watchMergedModifications currently
returns a hardcoded sequence (1 → 0 → 1) which is tightly coupled to
TestConsolidateUserManagedIdempotent; add a short comment above or inside
watchMergedModifications explaining the expected call sequence and why it
returns 1, then 0, then 1 (e.g., first call simulates a merged modification,
second call simulates drain emptying, third call simulates another modification)
so future maintainers understand the intent; alternatively consider making the
sequence configurable via parameters or a slice in watchMergedModifications for
reuse, but at minimum document the expectation referencing
watchMergedModifications and the test that relies on it.
pkg/registry/file/storage.go (1)
666-672: ⚡ Quick win

Remove redundant ParseInt of ResourceVersionFullSpec in GetListWithConn

ResourceVersionFullSpec is the non-numeric string "fullSpec", so the ParseInt in the opts.ResourceVersion == ResourceVersionFullSpec branch will always fail and set since = 0. That’s effectively harmless because the SQLite filters use AND (:since = 0 OR last_updated > :since) (so since=0 disables timestamp filtering), but the current code is confusing—set since := int64(0) directly for readability (and consider updating the surrounding GetList comment to mention the special ResourceVersion values).
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/registry/file/storage.go` around lines 666 - 672, In GetListWithConn the
branch checking opts.ResourceVersion ==
softwarecomposition.ResourceVersionFullSpec performs strconv.ParseInt on the
non-numeric sentinel "fullSpec" which always fails; replace the ParseInt call
with a direct since := int64(0) assignment and pass that to listMetadataKeys
(keep the call to listMetadataKeys(conn, key, opts.Predicate.Continue, since,
opts.Predicate.Limit) intact) and optionally update the GetList comment to
document the special ResourceVersion values like "fullSpec".

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@pkg/registry/file/sqlite.go`:
- Around line 247-251: The loop currently assigns lastUpdated =
stmt.ColumnInt64(3) for each row which can regress resourceVersion; instead
compute the maximum last_updated seen in the page and emit that. Inside the loop
that reads rows (where you call stmt.ColumnInt64(3) and append K8sKeysToPath),
read the per-row value into a temp (e.g., cur := stmt.ColumnInt64(3)) and update
lastUpdated = max(lastUpdated, cur) (initialize lastUpdated to 0 or the current
page's starting value before the loop). Apply the same max-accumulation change
to the second occurrence mentioned (lines ~280-283) so the page-level
lastUpdated is the max of all rows rather than the last row’s value.
- Around line 40-46: The trigger-based last_updated uses whole-second unixepoch
which combined with the strict "AND (:since = 0 OR last_updated > :since)"
filtering can permanently skip writes that occur in the same second; update the
incremental-list logic in the methods that use the :since bind (the SQL
fragments around "AND (:since = 0 OR last_updated > :since)" and the GetList
code that sets listAccessor.ResourceVersion) to use a deterministic tie-breaker:
change the WHERE clause to "AND (:since = 0 OR last_updated > :since OR
(last_updated = :since AND rowid > :cont))", return both last_updated and the
last rowid as the continuation token, and set listAccessor.ResourceVersion to
the returned last_updated and cont to the returned rowid; alternatively
(optional) replace the trigger insert_metadata_timestamp update to store a
higher-resolution timestamp (e.g., millisecond-resolution) into last_updated so
the existing > :since check is sufficient.

In `@pkg/registry/file/storage_test.go`:
- Around line 70-72: In TestStorageImpl_Count when seeding metadata, stop
ignoring the error from writeMetadata: replace the blank assignment with error
handling (call writeMetadata(conn, k, &v1beta1.SBOMSyft{}) and check the
returned error) and fail the test immediately on error (e.g., using t.Fatalf or
require.NoError) so any write failure for keys, conn, or SBOM payload is
surfaced and the test does not continue with incomplete setup.

---

Nitpick comments:
In `@pkg/registry/file/containerprofile_user_managed_test.go`:
- Around line 388-402: The counter-based mock in
e2eHarness.watchMergedModifications currently returns a hardcoded sequence (1 →
0 → 1) which is tightly coupled to TestConsolidateUserManagedIdempotent; add a
short comment above or inside watchMergedModifications explaining the expected
call sequence and why it returns 1, then 0, then 1 (e.g., first call simulates a
merged modification, second call simulates drain emptying, third call simulates
another modification) so future maintainers understand the intent; alternatively
consider making the sequence configurable via parameters or a slice in
watchMergedModifications for reuse, but at minimum document the expectation
referencing watchMergedModifications and the test that relies on it.

In `@pkg/registry/file/storage.go`:
- Around line 666-672: In GetListWithConn the branch checking
opts.ResourceVersion == softwarecomposition.ResourceVersionFullSpec performs
strconv.ParseInt on the non-numeric sentinel "fullSpec" which always fails;
replace the ParseInt call with a direct since := int64(0) assignment and pass
that to listMetadataKeys (keep the call to listMetadataKeys(conn, key,
opts.Predicate.Continue, since, opts.Predicate.Limit) intact) and optionally
update the GetList comment to document the special ResourceVersion values like
"fullSpec".

In `@pkg/registry/file/vulnerabilitysummarystorage_test.go`:
- Around line 97-102: The test for Watch on VulnerabilitySummaryStorage only
checks the error; update it to also assert that the returned watcher value is
nil to enforce the "watch disabled" contract. In the test that constructs
storageImpl via NewStorageImpl(...) and vulnerabilitySummaryStorage via
NewVulnerabilitySummaryStorage(...), capture the watcher return from
vulnerabilitySummaryStorage.Watch(context.TODO(), "", storage.ListOptions{}) and
add an assertion (e.g., assert.Nil) that the watcher is nil in addition to the
existing assert.NoError on err.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 879a4ad9-1ee0-421f-8809-28206c3d9725

📥 Commits

Reviewing files that changed from the base of the PR and between 067542f and c5f52aa.

⛔ Files ignored due to path filters (1)

go.sum is excluded by !**/*.sum

📒 Files selected for processing (19)

go.mod
main.go
pkg/apis/softwarecomposition/register.go
pkg/apiserver/apiserver.go
pkg/cmd/server/start.go
pkg/registry/file/applicationprofile_storage.go
pkg/registry/file/cleanup.go
pkg/registry/file/configurationscansummarystorage_test.go
pkg/registry/file/containerprofile_processor_test.go
pkg/registry/file/containerprofile_user_managed_test.go
pkg/registry/file/generatednetworkpolicy_test.go
pkg/registry/file/networkneighborhood_storage.go
pkg/registry/file/sqlite.go
pkg/registry/file/sqlite_test.go
pkg/registry/file/storage.go
pkg/registry/file/storage_test.go
pkg/registry/file/vulnerabilitysummarystorage_test.go
pkg/registry/file/watch.go
pkg/registry/file/watch_test.go

💤 Files with no reviewable changes (2)

pkg/registry/file/watch.go
pkg/registry/file/watch_test.go

coderabbitai · 2026-06-04T15:59:48Z

+				`CREATE TRIGGER insert_metadata_timestamp
+					AFTER INSERT ON metadata
+					FOR EACH ROW
+					BEGIN
+						UPDATE metadata 
+						SET last_updated = unixepoch('now') 
+						WHERE rowid = new.rowid;


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash # Verifies whether second-level timestamps can cause missed incremental rows. python - <<'PY' import sqlite3, time conn = sqlite3.connect(":memory:") cur = conn.cursor() cur.executescript(""" CREATE TABLE metadata ( kind TEXT, namespace TEXT, name TEXT, metadata TEXT, last_updated INTEGER, PRIMARY KEY(kind, namespace, name) ); CREATE TRIGGER insert_metadata_timestamp AFTER INSERT ON metadata FOR EACH ROW BEGIN UPDATE metadata SET last_updated = unixepoch('now') WHERE rowid = new.rowid; END; """) reproduced = False for i in range(500): cur.execute("DELETE FROM metadata") cur.execute("INSERT INTO metadata(kind,namespace,name,metadata) VALUES ('k','ns','a','{}')") rv = cur.execute("SELECT last_updated FROM metadata WHERE name='a'").fetchone()[0] cur.execute("INSERT INTO metadata(kind,namespace,name,metadata) VALUES ('k','ns','b','{}')") ts_b = cur.execute("SELECT last_updated FROM metadata WHERE name='b'").fetchone()[0] if ts_b == rv: missed = cur.execute("SELECT COUNT(*) FROM metadata WHERE last_updated > ?", (rv,)).fetchone()[0] print(f"reproduced: rv={rv}, ts_b={ts_b}, rows_gt_rv={missed}") reproduced = True break if not reproduced: print("Could not reproduce in this run; rerun to hit same-second inserts.") PY

Repository: kubescape/storage

Length of output: 118

🏁 Script executed:

#!/bin/bash set -euo pipefail # Locate relevant sections in sqlite.go rg -n "CREATE TRIGGER insert_metadata_timestamp|last_updated|listMetadataKeys|listMetadata\(" pkg/registry/file/sqlite.go # Show the trigger snippet and surrounding SQL sed -n '1,120p' pkg/registry/file/sqlite.go | nl -ba | sed -n '1,120p' sed -n '200,340p' pkg/registry/file/sqlite.go | nl -ba # Check caller in storage.go for how ResourceVersion/since is set/consumed rg -n "ResourceVersion|listMetadataKeys|listMetadata" pkg/registry/file/storage.go sed -n '1,260p' pkg/registry/file/storage.go | nl -ba | sed -n '150,260p' sed -n '260,420p' pkg/registry/file/storage.go | nl -ba | sed -n '260,420p'

Repository: kubescape/storage

Length of output: 917

🏁 Script executed:

#!/bin/bash set -euo pipefail awk_num() { awk '{print FNR ":" $0}'; } echo "---- sqlite.go: trigger + list helpers (around lines 30-90) ----" sed -n '1,120p' pkg/registry/file/sqlite.go | awk_num echo "---- sqlite.go: listMetadataKeys (around lines 210-290) ----" sed -n '200,310p' pkg/registry/file/sqlite.go | awk_num echo "---- sqlite.go: listMetadata (around lines 310-380) ----" sed -n '310,420p' pkg/registry/file/sqlite.go | awk_num echo "---- storage.go: callsites for listMetadata/listMetadataKeys ----" rg -n "listMetadataKeys|listMetadata|ResourceVersion|since" pkg/registry/file/storage.go || true sed -n '1,260p' pkg/registry/file/storage.go | awk_num sed -n '260,520p' pkg/registry/file/storage.go | awk_num

Repository: kubescape/storage

Length of output: 35345

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "---- storage.go: around ResourceVersionFullSpec/listMetadata* usage (630-740) ----" sed -n '600,760p' pkg/registry/file/storage.go | awk '{print NR+600-1 ":" $0}' | sed -n '1,200p' echo "---- storage.go: also show listAccessor.SetResourceVersion comment (690-740) ----" sed -n '660,740p' pkg/registry/file/storage.go | awk '{print NR+660-1 ":" $0}'

Repository: kubescape/storage

Length of output: 10832

🏁 Script executed:

#!/bin/bash set -euo pipefail python3 - <<'PY' import sqlite3, time, random def wait_next_second(): now = time.time() # sleep until just after the next integer second boundary time.sleep(max(0, 1 - (now % 1)) + 0.02) def list_metadata(conn, kind, namespace, since, cont, limit): cur = conn.cursor() cur.execute(""" SELECT rowid, name, last_updated FROM metadata WHERE kind = ? AND (? = '' OR namespace = ?) AND rowid > ? AND (? = 0 OR last_updated > ?) ORDER BY rowid LIMIT ? """, (kind, namespace, namespace, cont, since, since, limit)) rows = cur.fetchall() last_updated = rows[-1][2] if rows else 0 # caller in storage.go only uses Continue if len(list)==limit; we'll model cont=0 for simplicity return [r[1] for r in rows], last_updated, rows def run_once(): conn = sqlite3.connect(":memory:") cur = conn.cursor() cur.executescript(""" CREATE TABLE metadata ( kind TEXT, namespace TEXT, name TEXT, metadata TEXT, last_updated INTEGER, PRIMARY KEY(kind, namespace, name) ); CREATE TRIGGER insert_metadata_timestamp AFTER INSERT ON metadata FOR EACH ROW BEGIN UPDATE metadata SET last_updated = unixepoch('now') WHERE rowid = new.rowid; END; CREATE TRIGGER update_metadata_timestamp AFTER UPDATE ON metadata FOR EACH ROW BEGIN UPDATE metadata SET last_updated = unixepoch('now') WHERE rowid = old.rowid; END; """) kind='k' namespace='ns' limit=10 # Create first item at a known second boundary. wait_next_second() cur.execute("INSERT INTO metadata(kind,namespace,name,metadata) VALUES (?,?,?,?)", (kind, namespace, 'a', '{}')) rv = cur.execute("SELECT last_updated FROM metadata WHERE kind=? AND namespace=? AND name='a'", (kind, namespace)).fetchone()[0] # Insert b and c quickly in the same second as rv (no sleep). cur.execute("INSERT INTO metadata(kind,namespace,name,metadata) VALUES (?,?,?,?)", (kind, namespace, 'b', '{}')) ts_b = cur.execute("SELECT last_updated FROM metadata WHERE kind=? AND namespace=? AND name='b'", (kind, namespace)).fetchone()[0] cur.execute("INSERT INTO metadata(kind,namespace,name,metadata) VALUES (?,?,?,?)", (kind, namespace, 'c', '{}')) ts_c = cur.execute("SELECT last_updated FROM metadata WHERE kind=? AND namespace=? AND name='c'", (kind, namespace)).fetchone()[0] # Move to next second and insert d. wait_next_second() cur.execute("INSERT INTO metadata(kind,namespace,name,metadata) VALUES (?,?,?,?)", (kind, namespace, 'd', '{}')) ts_d = cur.execute("SELECT last_updated FROM metadata WHERE kind=? AND namespace=? AND name='d'", (kind, namespace)).fetchone()[0] if not (ts_b == rv and ts_c == rv and ts_d > rv): return None # flakey timing; caller will retry # Emulate incremental list calls: # Call 1: since=0 -> returns everything (resourceVersion becomes lastUpdated from last returned row) items1, lastUpdated1, rows1 = list_metadata(conn, kind, namespace, since=0, cont=0, limit=limit) # resourceVersion used for next call is lastUpdated1 rv1 = lastUpdated1 # Insert b/c already happened; ensure rv1 equals rv (depends on row ordering) # We only care about scenario where rv (the timestamp clients will use) equals the second where b/c were written. # In real code, rv1 is lastUpdated from the last row returned; with inserts, that's likely >= rv. # We'll align by directly using since=rv as the client's previous RV timestamp. since2 = rv # models "client RV == unix seconds" items2, lastUpdated2, rows2 = list_metadata(conn, kind, namespace, since=since2, cont=0, limit=limit) # Call 3 uses since=lastUpdated2 (storage.go SetResourceVersion(lastUpdated)) items3, lastUpdated3, rows3 = list_metadata(conn, kind, namespace, since=lastUpdated2, cont=0, limit=limit) # b/c were written at timestamp rv, so they should be missing after since advanced past rv. missed = (('b' not in items2) and ('c' not in items2) and ('b' not in items3) and ('c' not in items3)) return { "rv": rv, "ts_b": ts_b, "ts_c": ts_c, "ts_d": ts_d, "rows2": rows2, "items2": items2, "lastUpdated2": lastUpdated2, "rows3": rows3, "items3": items3, "missed_b_c": missed, "items1": items1, "lastUpdated1": lastUpdated1, } for i in range(30): res = run_once() if res is None: continue print("FOUND") for k,v in res.items(): if k in ("rows2","rows3"): print(k, v) else: print(k, v) break else: print("No deterministic scenario found; try again (timing flake).") PY

Repository: kubescape/storage

Length of output: 282

last_updated stores whole seconds but list filtering uses strict > against ResourceVersion, which can permanently skip same-second writes.

last_updated is set by SQLite triggers as last_updated = unixepoch('now') (into an INTEGER, so it’s effectively whole-second resolution). Incremental listing applies AND (:since = 0 OR last_updated > :since) (also at 241-242, 274-275) and GetList sets listAccessor.ResourceVersion to the returned lastUpdated. If inserts/updates occur in the same unix second as that returned ResourceVersion, they won’t satisfy > :since and can be missed on subsequent incremental list calls.

Suggested fix: use higher-resolution timestamps, or change the query logic to include same-second items with a tie-breaker (e.g. last_updated > :since OR (last_updated = :since AND rowid > :cont)), and/or adjust how ResourceVersion is derived.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@pkg/registry/file/sqlite.go` around lines 40 - 46, The trigger-based last_updated uses whole-second unixepoch which combined with the strict "AND (:since = 0 OR last_updated > :since)" filtering can permanently skip writes that occur in the same second; update the incremental-list logic in the methods that use the :since bind (the SQL fragments around "AND (:since = 0 OR last_updated > :since)" and the GetList code that sets listAccessor.ResourceVersion) to use a deterministic tie-breaker: change the WHERE clause to "AND (:since = 0 OR last_updated > :since OR (last_updated = :since AND rowid > :cont))", return both last_updated and the last rowid as the continuation token, and set listAccessor.ResourceVersion to the returned last_updated and cont to the returned rowid; alternatively (optional) replace the trigger insert_metadata_timestamp update to store a higher-resolution timestamp (e.g., millisecond-resolution) into last_updated so the existing > :since check is sufficient.

coderabbitai · 2026-06-04T15:59:48Z

 				last = stmt.ColumnText(0)
 				ns := stmt.ColumnText(1)
 				name := stmt.ColumnText(2)
+				lastUpdated = stmt.ColumnInt64(3)
 				names = append(names, K8sKeysToPath(prefix, root, kind, "", ns, name))


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Return the max last_updated seen in the page, not the last row's value.

Current code overwrites lastUpdated per row in rowid order. Since rowid and last_updated are not guaranteed to be monotonic together, the emitted resourceVersion can regress and force duplicate replay.

💡 Suggested patch

@@ - lastUpdated = stmt.ColumnInt64(3) + if ts := stmt.ColumnInt64(3); ts > lastUpdated { + lastUpdated = ts + } @@ - lastUpdated = stmt.ColumnInt64(2) + if ts := stmt.ColumnInt64(2); ts > lastUpdated { + lastUpdated = ts + }

Also applies to: 280-283

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@pkg/registry/file/sqlite.go` around lines 247 - 251, The loop currently assigns lastUpdated = stmt.ColumnInt64(3) for each row which can regress resourceVersion; instead compute the maximum last_updated seen in the page and emit that. Inside the loop that reads rows (where you call stmt.ColumnInt64(3) and append K8sKeysToPath), read the per-row value into a temp (e.g., cur := stmt.ColumnInt64(3)) and update lastUpdated = max(lastUpdated, cur) (initialize lastUpdated to 0 or the current page's starting value before the loop). Apply the same max-accumulation change to the second occurrence mentioned (lines ~280-283) so the page-level lastUpdated is the max of all rows rather than the last row’s value.

coderabbitai · 2026-06-04T15:59:48Z

+			for _, k := range keys {
+				_ = writeMetadata(conn, k, &v1beta1.SBOMSyft{})
+			}


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Fail fast when seeding metadata in TestStorageImpl_Count.

Ignoring writeMetadata errors can make this test pass with incomplete setup.

✅ Suggested patch

- for _, k := range keys { - _ = writeMetadata(conn, k, &v1beta1.SBOMSyft{}) - } + for _, k := range keys { + require.NoError(t, writeMetadata(conn, k, &v1beta1.SBOMSyft{})) + }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

for _, k := range keys {

_ = writeMetadata(conn, k, &v1beta1.SBOMSyft{})

}

for _, k := range keys {

require.NoError(t, writeMetadata(conn, k, &v1beta1.SBOMSyft{}))

}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@pkg/registry/file/storage_test.go` around lines 70 - 72, In TestStorageImpl_Count when seeding metadata, stop ignoring the error from writeMetadata: replace the blank assignment with error handling (call writeMetadata(conn, k, &v1beta1.SBOMSyft{}) and check the returned error) and fail the test immediately on error (e.g., using t.Fatalf or require.NoError) so any write failure for keys, conn, or SBOM payload is surfaced and the test does not continue with incomplete setup.

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

pkg/registry/file/containerprofile_user_managed_test.go (1)

388-408: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Make drain() observe the merged CP instead of returning a scripted sequence.

This helper now returns 1, 0, 1... purely by call count, so the idempotency tests no longer prove that consolidation actually wrote or skipped writing the merged profile. With StorageImpl.Watch() disabled, the replacement needs to read real persisted state here — otherwise these tests can still pass if merged writes regress.

Suggested fix

 func (h *e2eHarness) watchMergedModifications() (drain func() int, stop func()) {
 	h.t.Helper()
-	var calls int
-	// The mock return sequence below is specifically designed to satisfy the assertions
-	// in TestConsolidateUserManagedIdempotent:
-	// - 1st call: Returns 1, representing the first consolidation run creating the merged CP.
-	// - 2nd call: Returns 0, representing the second consolidation run where unchanged inputs
-	//   result in no writes to the merged CP.
-	// - Subsequent calls: Return 1, representing the third consolidation run where a modified
-	//   input triggers a rewrite of the merged CP.
+	var (
+		lastRV   string
+		lastSeen bool
+	)
 	drain = func() int {
-		calls++
-		if calls == 1 {
-			return 1
-		}
-		if calls == 2 {
-			return 0
-		}
-		return 1
+		cp, ok := h.loadMerged()
+		if !ok {
+			lastSeen = false
+			lastRV = ""
+			return 0
+		}
+		if !lastSeen || cp.ResourceVersion != lastRV {
+			lastSeen = true
+			lastRV = cp.ResourceVersion
+			return 1
+		}
+		return 0
 	}
 	return drain, func() {}
 }

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/registry/file/containerprofile_user_managed_test.go` around lines 388 -
408, The current watchMergedModifications helper's drain() returns a hard-coded
1,0,1 sequence by call count; change drain() to observe the actual persisted
merged container profile instead: on each invocation read the merged CP from the
storage implementation (use the same codepath as StorageImpl.Get or the storage
client used in consolidation), compare its current revision/timestamp/hash to
the previously observed value, and return 1 when the merged CP was actually
written/changed and 0 when unchanged; keep the helper name
watchMergedModifications, keep the returned drain and stop signatures, and
ensure this behavior supports TestConsolidateUserManagedIdempotent instead of
relying on call-count sequencing.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Outside diff comments:
In `@pkg/registry/file/containerprofile_user_managed_test.go`:
- Around line 388-408: The current watchMergedModifications helper's drain()
returns a hard-coded 1,0,1 sequence by call count; change drain() to observe the
actual persisted merged container profile instead: on each invocation read the
merged CP from the storage implementation (use the same codepath as
StorageImpl.Get or the storage client used in consolidation), compare its
current revision/timestamp/hash to the previously observed value, and return 1
when the merged CP was actually written/changed and 0 when unchanged; keep the
helper name watchMergedModifications, keep the returned drain and stop
signatures, and ensure this behavior supports
TestConsolidateUserManagedIdempotent instead of relying on call-count
sequencing.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 3ab9af85-4e2b-4557-b5e6-b492c2e8a28a

📥 Commits

Reviewing files that changed from the base of the PR and between c5f52aa and 2ea1fed.

📒 Files selected for processing (3)

pkg/registry/file/containerprofile_user_managed_test.go
pkg/registry/file/storage.go
pkg/registry/file/vulnerabilitysummarystorage_test.go

🚧 Files skipped from review as they are similar to previous changes (2)

pkg/registry/file/vulnerabilitysummarystorage_test.go
pkg/registry/file/storage.go

github-actions · 2026-06-05T05:40:58Z

Summary:

License scan: failure
Credentials scan: failure
Vulnerabilities scan: failure
Unit test: success
Go linting: failure

matthyx force-pushed the grype branch from bc239d4 to b3b9182 Compare October 3, 2025 11:31

matthyx force-pushed the grype branch from 3f0da32 to b134803 Compare October 6, 2025 11:24

matthyx force-pushed the grype branch from b134803 to 36ef15c Compare October 6, 2025 14:13

matthyx force-pushed the grype branch from 36ef15c to 48e8d9f Compare October 6, 2025 14:48

matthyx force-pushed the grype branch from 48e8d9f to 50dbe04 Compare October 7, 2025 06:04

matthyx added this to KS PRs tracking Oct 14, 2025

matthyx moved this to Needs Reviewer in KS PRs tracking Oct 14, 2025

matthyx mentioned this pull request May 12, 2026

WatchList and sendInitialEvents support in storage #318

Open

matthyx mentioned this pull request May 27, 2026

feat(operator): watch security exceptions for rescans kubescape/operator#373

Open

1 task

matthyx added 2 commits June 4, 2026 15:58

remove watcher and watch verb (forked apiserver)

e7321a4

Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>

add lastUpdated and use it to return newer list elements

c5f52aa

Signed-off-by: Matthias Bertschy <matthias.bertschy@gmail.com>

matthyx force-pushed the grype branch from 50dbe04 to c5f52aa Compare June 4, 2026 15:46

coderabbitai Bot reviewed Jun 4, 2026

View reviewed changes

address CodeRabbit review comments on PR #258

2ea1fed

coderabbitai Bot reviewed Jun 5, 2026

View reviewed changes

matthyx moved this from Needs Reviewer to WIP in KS PRs tracking Jun 5, 2026

Conversation

matthyx commented Oct 3, 2025 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

github-actions Bot commented Oct 3, 2025

Uh oh!

github-actions Bot commented Oct 6, 2025

Uh oh!

github-actions Bot commented Oct 6, 2025

Uh oh!

github-actions Bot commented Oct 6, 2025

Uh oh!

github-actions Bot commented Oct 7, 2025

Uh oh!

coderabbitai Bot commented Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning)

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

matthyx commented Oct 3, 2025 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 4, 2026 •

edited

Loading