Open
58 commits
d48c2d7
feat: refactor node restart per design — disconnect-based logic, lead…
michixs Apr 6, 2026
b6ebf4e
Update env_var
schmidt-scaled Apr 6, 2026
7b2b2dc
Update version and Docker image tag in env_var
schmidt-scaled Apr 6, 2026
46c3d51
Fix FTT2 restart leadership handling
michixs Apr 8, 2026
81aab83
feat: add log collection script for Graylog/OpenSearch
claude Apr 8, 2026
4deaea2
fix: opensearch scroll - discover indices, use .keyword for term queries
claude Apr 8, 2026
cacf945
fix: use epoch_millis for OpenSearch range query to avoid date parse …
claude Apr 8, 2026
01c76f5
fix: opensearch - probe index, wildcard container matching, --diagnos…
claude Apr 8, 2026
42082c0
fix: storage node source filter - hostname vs IP, drop filter for spd…
claude Apr 8, 2026
9ccfa5f
fix: collect SNodeAPI logs without source filter, combined across nodes
claude Apr 8, 2026
9723ad1
Fix FTT2 remote device reconnect ordering
michixs Apr 8, 2026
e9b63a3
Fix remote device state reconciliation
michixs Apr 8, 2026
56f8fd4
Allow FTT2 shutdown during migration
michixs Apr 8, 2026
35f98a8
Gate migration retries on recovery events
michixs Apr 8, 2026
3cca4a2
feat: update collect_logs.py with all fixes and sbctl_info section
claude Apr 8, 2026
0485b48
Adjust HA JM defaults and perf setup
michixs Apr 8, 2026
4f7b587
Fix restart leadership and JM ordering
michixs Apr 8, 2026
18f5755
Skip unavailable events for healthy remote controllers
michixs Apr 9, 2026
e05deb3
Avoid online events from healthy reconnects
michixs Apr 9, 2026
3a0a96b
Fix restart sending device events before node is ONLINE
michixs Apr 9, 2026
ac14fc5
Add FN_LVOL_MIG to rebalancing task names for cluster status flag
michixs Apr 9, 2026
6af4743
Add hublvol multipath fixes and comprehensive test suite
michixs Apr 11, 2026
8adee4a
Add GCP cluster deployer script
michixs Apr 11, 2026
94a95b2
Update GCP deployer: switch to c3d-standard-8-lssd (1 SSD/node), add …
michixs Apr 12, 2026
f7c936f
Exclude stale lvols from subsystem limit and surface skip reasons
michixs Apr 13, 2026
34ea2d5
Revert "Exclude stale lvols from subsystem limit and surface skip rea…
michixs Apr 13, 2026
b538888
Add mixed-outage variant of AWS dual-node soak
michixs Apr 13, 2026
95a7442
Fix in_restart hang and peer Health=False cascade
michixs Apr 13, 2026
b3bd881
Kill SPDK before resetting transient node status to OFFLINE
michixs Apr 13, 2026
dc01958
Fix soak test: retry restart when peer is still recovering
michixs Apr 14, 2026
ce2fc09
Add AWS multipath deployment script (3 NICs per host)
michixs Apr 14, 2026
407833d
Fix concurrent restart bypass + add bdev_lvol_set_lvs_signal RPC
michixs Apr 14, 2026
ef9d691
Add multipath outage soak with NIC chaos
michixs Apr 14, 2026
7f03bd2
Fix stale restart_phases on early-return from recreate_lvstore
michixs Apr 14, 2026
ab4fdb2
Fix 4 bugs: port-allow event replay, FTT-aware snapshot gate, sync-de…
michixs Apr 14, 2026
10e41eb
Fix health check auto-fix to resend device events regardless of node …
michixs Apr 14, 2026
499b646
Fix restart_storage_node leaving node stuck in RESTARTING on failure
michixs Apr 14, 2026
d725f29
Refresh cluster_metadata.json for current AWS test run
michixs Apr 15, 2026
12212c7
Update restart_storage_node docstring
michixs Apr 15, 2026
67c9c3b
Emit status-change event when restart guard flips node to in_restart
michixs Apr 15, 2026
6025f3f
Restore restart_storage_node docstring to match the code
michixs Apr 15, 2026
4c4b1bf
Add unit tests for try_set_node_restarting event emission
michixs Apr 15, 2026
241e3af
Abort restart + unblock port when hublvol setup fails
michixs Apr 15, 2026
327f8ea
Restore leader-demote in recreate_lvstore_on_non_leader
michixs Apr 15, 2026
428e3eb
Add fio max_latency=10s to mixed AWS soak workload
michixs Apr 15, 2026
04b934e
Update TestHublvolConnectErrors for abort-on-failure contract
michixs Apr 15, 2026
c5815b2
fix: read node_id from snap.lvol in snapshot delete pre-check
michixs Apr 15, 2026
531678a
Add soak test scripts and utilities for multipath dual-node outage te…
michixs Apr 16, 2026
978f333
Add fio max_latency=10s to multipath soak workload
michixs Apr 16, 2026
a0b5545
Fix linter errors, regenerate cli.py for schema compliance
michixs Apr 16, 2026
e4ba865
Derive max_fault_tolerance from parity-chunks-per-stripe automatically
michixs Apr 16, 2026
67ba9cf
Remove --max-fault-tolerance from soak and deploy scripts
michixs Apr 16, 2026
e746e34
Fix remaining ruff lint errors in lvol_controller, mixed soak, probe …
michixs Apr 16, 2026
c1dba51
Add fio data integrity verification to multipath soak
michixs Apr 16, 2026
863ced0
Fix cluster reactivation: 3-pass approach for lvstore + hublvol setup
michixs Apr 16, 2026
b25ad22
Add NIC-only failover soak test for multipath validation
michixs Apr 16, 2026
2a00ffc
Fix multipath path repair in health check and hublvol monitoring
michixs Apr 17, 2026
f393867
Fix multipath deployment: use simplyblock/sbcli repo and rename metadata
michixs Apr 17, 2026
2 changes: 2 additions & 0 deletions e2e/__init__.py
@@ -65,6 +65,7 @@
RandomRapidFailoverNoGapV2NoMigration,
)
from stress_test.continuous_parallel_lvol_snapshot_clone import TestParallelLvolSnapshotCloneAPI
+from stress_test.continuous_lvol_dirfill_stress import TestLvolDirFillStress
from stress_test.continuous_failover_ha_namespace import RandomMultiClientFailoverNamespaceTest
from stress_test.continuous_single_node_outage import RandomMultiClientSingleNodeTest
from stress_test.continuous_failover_ha_security import (
@@ -306,6 +307,7 @@ def get_stress_tests():
RandomRapidFailoverNoGapV2WithMigration,
RandomRapidFailoverNoGapV2NoMigration,
TestParallelLvolSnapshotCloneAPI,
+TestLvolDirFillStress,
RandomMultiClientFailoverNamespaceTest,
RandomMultiClientSingleNodeTest,
K8sNativeFailoverTest,