Bug 218297 - Kernel Panic and crash
Summary: Kernel Panic and crash
Status: REOPENED
Alias: None
Product: Virtualization
Classification: Unclassified
Component: kvm (show other bugs)
Hardware: All Linux
: P3 normal
Assignee: virtualization_kvm
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-12-21 04:53 UTC by Awaneesh
Modified: 2024-02-01 04:36 UTC (History)
2 users (show)

See Also:
Kernel Version: 5.4.244-staros-v3-scale-64
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description Awaneesh 2023-12-21 04:53:02 UTC
[    1.299570] --> Failed to ioremap(405798912, 215625728)
[    1.300415] --> Failed to map header
[    1.301002] BUG: kernel NULL pointer dereference, address: 00000000000003f8
[    1.301994] #PF: supervisor read access in kernel mode
[    1.301994] #PF: error_code(0x0000) - not-present page
[    1.301994] PGD 0 P4D 0
[    1.301994] Oops: 0000 [#1] PREEMPT SMP PTI
[    1.301994] CPU: 0 PID: 1 Comm: swapper/0 Tainted: G        W         5.4.244-staros-v3-scale-64 #1
[    1.301994] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Ubuntu-1.8.2-1ubuntu1 04/01/2014
…
[    1.301994] CR2: 00000000000003f8 CR3: 000000001820a001 CR4: 00000000001606f0
[    1.301994] Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000009
[    1.301994] Kernel Offset: 0xc000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[    1.301994] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000009 ]---
Comment 1 Artem S. Tashkinov 2023-12-21 07:26:12 UTC
This kernel version is not supported.

Take it to your vendor please.
Comment 2 Artem S. Tashkinov 2023-12-21 07:27:42 UTC
Please try to repro it in vanilla kernel 5.4.265.

If earlier kernels aren't affected, please do regression testing:

https://docs.kernel.org/admin-guide/bug-bisect.html
Comment 3 Rajesh Goel 2024-01-12 05:33:12 UTC
(In reply to Artem S. Tashkinov from comment #2)
> Please try to repro it in vanilla kernel 5.4.265.
> 
> If earlier kernels aren't affected, please do regression testing:
> 
> https://docs.kernel.org/admin-guide/bug-bisect.html

We are actually using the same kernel, with very few minute changes which are not going to affect the working the way vanilla kernel is being used.
In our setups we cannot use the vanilla kernel directly.
The main reason for this bug report is to understand if similar issues were seen in the past and any possible fixes has been suggested.
We see similar forum where same issue was reported, but patch suggest here is already integrated in our builds.
Is there more information you can share, which can help us understand this problem in more depth.
Comment 4 Cai 2024-01-23 05:23:19 UTC
Attaching call trace here for reference. Appropriate any thoughts.


[    1.299570] --> Failed to ioremap(405798912, 215625728)
[    1.300415] --> Failed to map header
[    1.301002] BUG: kernel NULL pointer dereference, address: 00000000000003f8
[    1.301994] #PF: supervisor read access in kernel mode
[    1.301994] #PF: error_code(0x0000) - not-present page
[    1.301994] PGD 0 P4D 0
[    1.301994] Oops: 0000 [#1] PREEMPT SMP PTI
[    1.301994] CPU: 0 PID: 1 Comm: swapper/0 Tainted: G        W         5.4.244-staros-v3-scale-64 #1
[    1.301994] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS Ubuntu-1.8.2-1ubuntu1 04/01/2014
[    1.301994] RIP: 0010:del_gendisk+0x12/0x2b0
[    1.301994] Code: ff ff eb ee 48 c7 c0 ed ff ff ff eb e5 48 c7 c0 ea ff ff ff c3 0f 1f 00 0f 1f 44 00 00 41 55 41 54 55 53 48 89 fb 48 83 ec 18 <48> 83 bf f8 03 00 00 00 74 39 e8 bf fe ff ff 48 c7 c7 00 54 2b 8e
[    1.301994] RSP: 0000:ffffac188001fdf0 EFLAGS: 00010286
[    1.301994] RAX: 00000000fffffff7 RBX: 0000000000000000 RCX: ffffffff8e247d38
[    1.301994] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[    1.301994] RBP: ffffffffffffffff R08: 00000000000001f6 R09: 0000000000000027
[    1.301994] R10: 0000000000000000 R11: ffffac188001fd45 R12: 0000000000000000
[    1.301994] R13: ffffffff8e70964c R14: 0000000000000000 R15: ffffffff8e035eee
[    1.301994] FS:  0000000000000000(0000) GS:ffffa4229f800000(0000) knlGS:0000000000000000
[    1.301994] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    1.301994] CR2: 00000000000003f8 CR3: 000000001820a001 CR4: 00000000001606f0
[    1.301994] Call Trace:
[    1.301994]  ? __die+0x86/0xc8
[    1.301994]  ? no_context+0x15a/0x330
[    1.301994]  ? async_page_fault+0x34/0x40
[    1.301994]  ? do_early_param+0x8b/0x8b
[    1.301994]  ? del_gendisk+0x12/0x2b0
[    1.301994]  ? _raw_spin_unlock+0xa/0x20
[    1.301994]  ? __close_fd+0x88/0xa0
[    1.301994]  ? do_early_param+0x8b/0x8b
[    1.301994]  device_exit+0x37/0x2e0
[    1.301994]  ? image_proc_virt_read_try_part.part.7.cold.18+0x48/0x48
[    1.301994]  ? do_early_param+0x8b/0x8b
[    1.301994]  device_init+0x3ed/0xc34
[    1.301994]  ? image_proc_virt_read_try_part.part.7.cold.18+0x48/0x48
[    1.301994]  ? do_early_param+0x8b/0x8b
[    1.301994]  do_one_initcall+0x3c/0x210
[    1.301994]  ? do_early_param+0x8b/0x8b
[    1.301994]  ? do_early_param+0x8b/0x8b
[    1.301994]  kernel_init_freeable+0x1bd/0x249
[    1.301994]  ? rest_init+0xb9/0xb9
[    1.301994]  kernel_init+0xa/0x107
[    1.301994]  ret_from_fork+0x24/0x30
[    1.301994] Modules linked in:
[    1.301994] CR2: 00000000000003f8
[    1.301994] ---[ end trace ed4a0af627b03b12 ]---
[    1.301994] RIP: 0010:del_gendisk+0x12/0x2b0
[    1.301994] Code: ff ff eb ee 48 c7 c0 ed ff ff ff eb e5 48 c7 c0 ea ff ff ff c3 0f 1f 00 0f 1f 44 00 00 41 55 41 54 55 53 48 89 fb 48 83 ec 18 <48> 83 bf f8 03 00 00 00 74 39 e8 bf fe ff ff 48 c7 c7 00 54 2b 8e
[    1.301994] RSP: 0000:ffffac188001fdf0 EFLAGS: 00010286
[    1.301994] RAX: 00000000fffffff7 RBX: 0000000000000000 RCX: ffffffff8e247d38
[    1.301994] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[    1.301994] RBP: ffffffffffffffff R08: 00000000000001f6 R09: 0000000000000027
[    1.301994] R10: 0000000000000000 R11: ffffac188001fd45 R12: 0000000000000000
[    1.301994] R13: ffffffff8e70964c R14: 0000000000000000 R15: ffffffff8e035eee
[    1.301994] FS:  0000000000000000(0000) GS:ffffa4229f800000(0000) knlGS:0000000000000000
[    1.301994] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    1.301994] CR2: 00000000000003f8 CR3: 000000001820a001 CR4: 00000000001606f0
[    1.301994] Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000009
[    1.301994] Kernel Offset: 0xc000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[    1.301994] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000009 ]---
Comment 5 Awaneesh 2024-02-01 04:36:30 UTC
We have migrated to kernel - 5.4.254. Can someone point to the patch, we can apply for the above issue?

Note You need to log in before you can comment on or make changes to this bug.