Random hard reboots every 8-10 hours after installing Bazzite, not a problem on Ubuntu or Windows

Hey ho - Every 8-10 hours I'm getting a random hard reboot. Like, both monitors suddenly go completely black and then 20 seconds later I'm staring at my BIOS. In the kernel logs, I see this every time:
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: Machine check events logged
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 5: bea0000000000108
Sep 02 13:41:02 bazzite kernel: fbcon: Taking over console
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: TSC 0 ADDR eaaa31ee MISC d012000100000000 SYND 4d000000 IPID 500b000000000
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1725309657 SOCKET 0 APIC a microcode 8701013
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: Machine check events logged
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 5: bea0000000000108
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff8209f80e MISC d012000100000000 SYND 4d000000 IPID 500b000000000
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1725309657 SOCKET 0 APIC 3 microcode 8701013
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: Machine check events logged
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 5: bea0000000000108
Sep 02 13:41:02 bazzite kernel: fbcon: Taking over console
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: TSC 0 ADDR eaaa31ee MISC d012000100000000 SYND 4d000000 IPID 500b000000000
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1725309657 SOCKET 0 APIC a microcode 8701013
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: Machine check events logged
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 5: bea0000000000108
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff8209f80e MISC d012000100000000 SYND 4d000000 IPID 500b000000000
Sep 02 13:41:02 bazzite kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1725309657 SOCKET 0 APIC 3 microcode 8701013
I believe I have ruled out hardware problems. MemTest86 found no errors over 3 different runs. Windows does not have this issue. KDE Neon (my previous distro before Bazzite) does not have this issue. The CPU is an AMD Ryzen 5 3600X. I am running stock clocks. The issue happens at idle (it has happened when it is just at the desktop while I am out of the room). The only thing I have found which matches my problem is this: https://wiki.archlinux.org/title/Ryzen#Random_reboots However, that explicitly mentions the Ryzen 5000 series, and I don't have the option for the AMD Curve Optimizer on my BIOS as it is not supported by the 3000 series. Further, that's the 5.11 kernel, and we're a full major version ahead of that now. Given that the only thing which has changed is that I am running Bazzite instead of something else, I am under the impression that Bazzite is at fault here. Is there anything I can do to fix it, or maybe change my kernel version to the one Neon is running (which worked fine)?
6 Replies
Kyle Gospo
Kyle Gospo•3mo ago
you cannot swap kernels on bazzite due to it being atomic https://community.amd.com/t5/general-discussions/pc-will-stay-on-but-restarts-randomly-while-gaming-with-whea/m-p/707476/highlight/true This looks like the same issue hopefully fixed by AMD in a future update, in this case Ubuntu is probably saving you by being out of date
EnglishMobster
EnglishMobsterOP•3mo ago
That's me in the post!
Kyle Gospo
Kyle Gospo•3mo ago
Ah, so it is.. I'll let you know if I find anything on this then, I'm using a threadripper 3960 so we're on the same generation But I'm not seeing the random reboots on my ends, granted that's a very different processor
EnglishMobster
EnglishMobsterOP•3mo ago
Cool cool, I appreciate it It's not the end of the world, just kind of annoying since I use this machine to remote in to the office I disappeared mid-Zoom 😅
Kyle Gospo
Kyle Gospo•3mo ago
What kernel version are you on? uname -r
EnglishMobster
EnglishMobsterOP•3mo ago
6.9.12-205.fsync.fc40.x86_64 Just installed on Saturday so I should be on 3.7.0 I believe Rebased to Fedora Kinoite and FWIW I'm still getting it (although much more rarely):
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: Machine check events logged
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 5: bea0000000000108
Sep 08 21:41:31 fedora kernel: fbcon: Taking over console
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffffbbc41edc MISC d012000100000000 SYND 4d000000 IPID 500b000000000
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1725856888 SOCKET 0 APIC 3 microcode 8701021
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: Machine check events logged
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: CPU 7: Machine Check: 0 Bank 5: bea0000000000108
Sep 08 21:41:31 fedora kernel: fbcon: Taking over console
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffffbbc41edc MISC d012000100000000 SYND 4d000000 IPID 500b000000000
Sep 08 21:41:31 fedora kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1725856888 SOCKET 0 APIC 3 microcode 8701021
Which tells me this might be a Fedora problem. Considering rebasing to Fedora 39 and seeing if it's an issue there, because I'm still not seeing this in Windows Kinoite is on kernel 6.10.7-200.fc40.x86_64
Want results from more Discord servers?
Add your server