[plug] AMD Machine Check errors

Kevin Shackleton krshackleton at gmail.com
Sat Oct 25 12:24:58 AWST 2025


Additionally, you should suspect the IEC end of the power cable.  The
contacts can get spread open when people waggle the connector side to side
instead of pushing and pulling it straight.  It's a design weakness,
particularly of the IEC C13 (see https://en.wikipedia.org/wiki/IEC_60320).

Cheers,
Kevin.

On Sat, 25 Oct 2025 at 11:44, Marcos Carot <marcos.carot at gmail.com> wrote:

> I had similar issues a few years ago on a Gigabyte X370 motherboard, and
> if my memory is correct, I did something similar to what these posts
> describe:
>
> https://wiki.archlinux.org/title/Ryzen#Random_reboots
>
> https://null-src.com/posts/amd-ryzen-linux-freeze/
>
> and it fixed it.
>
> Cheers,
> Marcos
>
>
> On Fri, 24 Oct 2025 at 20:40, Brad Campbell <brad at fnarfbargle.com> wrote:
>
>> On 24/10/25 20:33, Chris McCormick wrote:
>> > On 24/10/2025 20:07, Brad Campbell wrote:
>> >> I've replaced the CPU, the RAM and the PSU.
>> >
>> >
>> > This may be a stupid suggestion, but years ago we had a server that was
>> > randomly rebooting. After a lot of debugging we finally figured out it was
>> > the actual power cord that wasn't plugged all the way into the PSU.
>> > Absolute face-palm moment. You said you changed the PSU, but did you change
>> > the power cord? Are you sure the wall outlet doesn't have issues? Long shot
>> > I know.
>> >
>>
>> G'day Chris,
>>
>> Thanks, but done that. It was plugged into a powerboard that was plugged
>> into a PDU that was plugged into an ATS that was plugged into a UPS which
>> was plugged into the wall.
>> I bypassed most of that and plugged it directly into the ATS, and then
>> the UPS, as a test.
>>
>> The clincher is that the reboots were *always* logged (remote UDP
>> netconsole receiver).
>> On top of that there are several SED drives in the machine, so after any
>> form of power cycle they come up locked. That never happened.
>>
>> Most of the faults with the 3950X were related to panics in the idle
>> handler. Rarely I got a straight segfault.
>>
>> I'm almost positive it's a glitch with the motherboard or BIOS, but it's
>> interesting that the 5950X logs MCEs rather than causing a panic.
>> My next move is to upgrade to an X570 board. I'd upgrade to a new AM5
>> platform, but new board/processor/RAM starts to get $$$ and what I have
>> mostly works.
>> _______________________________________________
>> PLUG discussion list: plug at plug.org.au
>> https://lists.plug.org.au/mailman/listinfo/plug
>> Committee e-mail: committee at plug.org.au
>> PLUG Membership: http://www.plug.org.au/membership
>>
>
>
> --
> Marcos R Carot