<div dir="ltr">I had similar issues a few years ago on a Gigabyte x370 motherboard, and if my memory is correct, I did something similar to these posts:<div><br></div><div><a href="https://wiki.archlinux.org/title/Ryzen#Random_reboots">https://wiki.archlinux.org/title/Ryzen#Random_reboots</a></div><div><br></div><div><a href="https://null-src.com/posts/amd-ryzen-linux-freeze/">https://null-src.com/posts/amd-ryzen-linux-freeze/</a></div><div><br></div><div>and it fixed it.</div><div><br></div><div>Cheers,</div><div>Marcos</div><div><br></div></div><br><div class="gmail_quote gmail_quote_container"><div dir="ltr" class="gmail_attr">On Fri, 24 Oct 2025 at 20:40, Brad Campbell <<a href="mailto:brad@fnarfbargle.com">brad@fnarfbargle.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On 24/10/25 20:33, Chris McCormick wrote:<br>
> On 24/10/2025 20:07, Brad Campbell wrote:<br>
>> I've replaced the CPU, the RAM and the PSU.<br>
> <br>
> <br>
> This may be a stupid suggestion, but years ago we had a server that was randomly rebooting. After a lot of debugging we finally figured out it was the actual power cord that wasn't plugged all the way into the PSU. Absolute face-palm moment. You said you changed the PSU, but did you change the power cord? Are you sure the wall outlet doesn't have issues? Long shot I know.<br>
> <br>
<br>
G'day Chris,<br>
<br>
Thanks, but done that. It was plugged into a powerboard that was plugged into a PDU that was plugged into an ATS that was plugged into a UPS which was plugged into the wall.<br>
I bypassed most of that and plugged it directly into the ATS, and then UPS as a test.<br>
<br>
The clincher is reboots were *always* logged (remote UDP netconsole receiver).<br>
On top of that there are several SED drives in the machine, so any form of power cycles they come up locked. That never happened.<br>
<br>
Most of the faults with the 3950x were related to panics in the idle handler. Rarely I got a straight segfault.<br>
<br>
I'm almost positive it's a glitch with the motherboard or BIOS, but it's interesting the 5950x logs MCEs rather than causes a panic.<br>
My next move is to upgrade to an X570 board. I'd upgrade to a new AM5 platform, but new board/processor/RAM starts to get $$$ and what I have mostly works.<br>
_______________________________________________<br>
PLUG discussion list: <a href="mailto:plug@plug.org.au" target="_blank">plug@plug.org.au</a><br>
<a href="https://lists.plug.org.au/mailman/listinfo/plug" rel="noreferrer" target="_blank">https://lists.plug.org.au/mailman/listinfo/plug</a><br>
Committee e-mail: <a href="mailto:committee@plug.org.au" target="_blank">committee@plug.org.au</a><br>
PLUG Membership: <a href="http://www.plug.org.au/membership" rel="noreferrer" target="_blank">http://www.plug.org.au/membership</a><br>
</blockquote></div><div><br clear="all"></div><div><br></div><span class="gmail_signature_prefix">-- </span><br><div dir="ltr" class="gmail_signature"><div dir="ltr">Marcos R Carot<br></div></div>