capz-windows-master-hyperv-serial-slow: re-enable 3 GC/SchedulerPreemption specs#37174
Conversation
…ption specs Removes 3 entries from the GINKGO_SKIP regex now that MAX_PODS=20 (kubernetes-sigs/windows-testing#567 + test-infra#37141) prevents the Hyper-V vmmem-overhead-induced MemoryPressure that caused them to be skipped. Validated 30/30 PASS on fresh CAPZ Hyper-V (WS2025, 2x D4s_v3, MAX_PODS=20) — 10 back-to-back rounds, 0 MemoryPressure events.
|
Hi @rzlink. Thanks for your PR. I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with Tip We noticed you've done this a few times! Consider joining the org to skip this step and gain Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
|
/ok-to-test |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: marosset, rzlink The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
/hold cancel |
|
@rzlink: Updated the
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
What this PR does
Re-enables 3 previously-skipped Kubernetes e2e specs on the
capz-windows-master-hyperv-serial-slowtestgrid dashboard:[sig-api-machinery] Garbage collector should not delete dependents that have both valid owner and owner that's waiting for dependents to be deleted [Serial] [Conformance][sig-api-machinery] Garbage collector should orphan pods created by rc if delete options say so [Serial] [Conformance][sig-scheduling] SchedulerPreemption [Serial] validates pod disruption condition is added to the preempted pod [Conformance]These were originally skipped because they consistently OOM'd Windows worker nodes under Hyper-V isolation due to per-pod vmmem.exe overhead exceeding the kubelet's hard-eviction threshold on 16 GiB D4s_v3 workers.
Why this is safe now
The root cause was excessive Hyper-V pod density: with the default
--max-pods=110, the e2e helperestimateMaximumPods(min=10, max=100)would request up to 100 pods per node, and each pod added ~500-700 MiBof vmmem host overhead invisible to pod-level kubelet stats.
The
MAX_PODS=20cap added in kubernetes-sigs/windows-testing#567 and test-infra#37141reduces this to at most 20 pods/node, keeping the 3 specs comfortably inside available memory.
/sig windows
/area provider/azure
/cc @marosset @zylxjtu