feat: scheduled repair configuration + deterministic test fixes (v2) by 0rlych1kk4 · Pull Request #1490 · Ericsson/ecchronos

0rlych1kk4 · 2026-04-19T07:38:28Z

Summary

This PR introduces improvements to scheduled repair configuration handling and stabilizes related test behavior.

Changes

Refactored schedule configuration handling:
- Always mounts schedule.yaml
- Applies overrides only when explicitly provided
- Preserves upstream/default behavior when configuration is empty
Fixed non-deterministic test behavior:
- Hardened Awaitility usage (poll intervals + timeouts)
- Removed race conditions in TestScheduleManager
- Ensured deterministic job execution and validation
Cleaned up test framework interactions:
- Avoided global scheduler side effects across tests
- Improved isolation of configuration per test instance

Motivation

While working on DatacenterAware multi-agent scenarios, test instability and configuration side effects were observed:

Race conditions in scheduling tests
Non-deterministic timing behavior
Global configuration leaking between tests

This PR addresses those issues to provide a stable foundation for:

Scheduled repair scenarios
Multi-agent concurrency tests

Validation

Full test suite executed locally:
- mvn -pl core -am test
- All tests passing (including Testcontainers / Cassandra integration tests)

Notes

This PR focuses on stability and correctness
Follow-up work will introduce scheduled repair concurrency scenarios for multi-agent tests

…ulti-agent tests - Add scheduled repair concurrency test for multi-agent DatacenterAware mode - Make schedule interval and initial delay configurable per instance - Make schedule overrides opt-in to avoid affecting other tests - Configure fast schedules explicitly for this scenario

…impact)

0rlych1kk4 · 2026-04-19T08:06:23Z

Hi @VictorCavichioli

Summary of CI Failures Investigation

It looks like the failing checks are related to timing sensitivity and environment differences in CI (multi-node Cassandra + parallel scheduling), rather than functional regressions.

Planned fixes:

Increase Awaitility timeouts and polling intervals to better handle CI latency
Ensure scheduler instances are fully isolated per test (no shared/static state)
Review clock usage to avoid reliance on system time where possible
Validate Testcontainers isolation across test runs
Strengthen assertions using eventual consistency patterns (await().untilAsserted)

Locally, tests are stable, but I’ll push updates to improve determinism under CI conditions.

Let me know if there are known CI constraints or preferred patterns for timing-sensitive tests.

codecov-commenter · 2026-04-19T09:02:53Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 79.74%. Comparing base (9f4bd4e) to head (049df93).
⚠️ Report is 651 commits behind head on master.
❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@             Coverage Diff              @@
##             master    #1490      +/-   ##
============================================
+ Coverage     77.45%   79.74%   +2.28%     
- Complexity     1308     1728     +420     
============================================
  Files           135      164      +29     
  Lines          5566     6565     +999     
  Branches        579      679     +100     
============================================
+ Hits           4311     5235     +924     
- Misses         1062     1087      +25     
- Partials        193      243      +50

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

This reverts commit c2e0ee6.

This reverts commit 6b3eac4.

0rlych1kk4 added 8 commits April 18, 2026 21:23

test: harden ConfigRefresher Awaitility timeouts

6849741

test: make TestScheduleManager idle status deterministic (avoid race)

0f6120b

test: only mount custom schedule config when overrides are requested

8e016db

test: avoid modifying global scheduler frequency (prevent cross-test …

553e546

…impact)

style: format ecc_config.py with black

56f08c1

test: always mount schedule.yaml and apply overrides only when specified

fa10177

fix: preserve upstream schedule behavior when YAML is empty

e32f8bc

0rlych1kk4 requested a review from a team as a code owner April 19, 2026 07:38

test: harden config refresher and schedule manager determinism

9e6df44

0rlych1kk4 added 8 commits April 19, 2026 17:37

chore: apply license headers

b38cdf8

test: increase Cassandra startup timeout for Python integration

e52cc14

fix: prevent continuous rescheduling after successful execution

6b3eac4

fix: remove invalid exec-maven-plugin parameters

c2e0ee6

Revert "fix: remove invalid exec-maven-plugin parameters"

1564725

This reverts commit c2e0ee6.

Revert "fix: prevent continuous rescheduling after successful execution"

f7cb33a

This reverts commit 6b3eac4.

test: harden python integration setup and OpenAPI fetch retries

74ce536

fix: remove invalid exec-maven-plugin redirect parameters

015e639

0rlych1kk4 force-pushed the feature/scheduled-repair-v2 branch from 049df93 to 015e639 Compare April 20, 2026 13:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: scheduled repair configuration + deterministic test fixes (v2)#1490

feat: scheduled repair configuration + deterministic test fixes (v2)#1490
0rlych1kk4 wants to merge 17 commits intoEricsson:masterfrom
0rlych1kk4:feature/scheduled-repair-v2

0rlych1kk4 commented Apr 19, 2026

Uh oh!

0rlych1kk4 commented Apr 19, 2026 •

edited

Loading

Uh oh!

codecov-commenter commented Apr 19, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

0rlych1kk4 commented Apr 19, 2026

Summary

Changes

Motivation

Validation

Notes

Uh oh!

0rlych1kk4 commented Apr 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Apr 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

0rlych1kk4 commented Apr 19, 2026 •

edited

Loading

codecov-commenter commented Apr 19, 2026 •

edited

Loading