Bug 209733
Summary: | Starting new KVM virtual machines on PPC64 starts to hang after box is up for a while | ||
---|---|---|---|
Product: | Virtualization | Reporter: | Cameron Berkenpas (cam) |
Component: | kvm | Assignee: | platform_ppc-64 |
Status: | CLOSED CODE_FIX | ||
Severity: | high | CC: | gustavo.romero, michael |
Priority: | P1 | ||
Hardware: | PPC-64 | ||
OS: | Linux | ||
Kernel Version: | >=5.8 | Subsystem: | |
Regression: | Yes | Bisected commit-id: |
Description
Cameron Berkenpas
2020-10-18 23:09:57 UTC
Still happens with 5.9.2.

Verified this happens with 5.9.6 and with the Debian vendor kernel linux-image-5.9.0-1-powerpc64le.

Might also be worth mentioning this is occurring with qemu-system-ppc package version 1:3.1+dfsg-8+deb10u8.

Same issue now that I'm running with qemu-system-ppc version 1:5.0-14~bpo10+1 from Debian backports.

After enough testing, I feel confident that this issue was fixed in 5.9.9. However, I encountered issues with XFS with 5.9.9 and 5.9.10 (mainly on POWER, but to a lesser extent they seemed to happen for me on amd64 at least). 5.9.11 has the weird hang fixed and no other issues (XFS or otherwise) in over 2 days! I feel confident in closing this issue.

Thanks for persisting with the testing. I wonder if it was fixed by:

  c4629e4e7e09 ("mm/compaction: stop isolation if too many pages are isolated and we have pages to migrate")

or

  38935861d85a ("mm/compaction: count pages and stop correctly during page isolation")

They fix a potential infinite loop in a path that's used by the HTAB allocation. Those landed in v5.9.9, and fix a commit that was introduced in v5.7 (which doesn't match your observation that v5.7.x was OK).

Nick pointed out that it was actually:

  2da9f6305f30 ("mm/vmscan: fix NR_ISOLATED_FILE corruption on 64-bit")