[+cc Kenneth, Alex, Christian, amd-gfx]

On Thu, Jul 03, 2025 at 12:09:20AM -0400, Alex Huang wrote:
> Hi,
>
> Recently, I dug up a Radeon Pro WX3100 and when booting, got a black screen
> with some complaints of No EDID read and then a `Fatal error during GPU
> init`. With windows booting fine and an MSI Kombustor run turning out just
> fine, I would say hardware failure highly unlikely. The logs seem unrelated
> (although I have attached them anyways), lspci -vvxxx output for the device
> is also at the end of the email. Also here is lspci -vvxxx for the upstream
> PCI bridge attached to the GPU.
>
> A bisect reveals the offending commit is 0064b0ce85bb ("drm/amd/pm: enable
> ASPM by default"). The simple fix appears to be setting `amdgpu.aspm=0` in
> kernel boot parameters. This seemingly is a case of something in the Lenovo
> ideacentre (specifically the ideacentre 510A-15ARR I found this bug on)
> incorrectly reporting ASPM availability. I'd think this is a PCI driver
> issue, but I am by no means an expert here. If this ends up on the wrong
> mailing list, please do let me know.
>
> I also did try enabling/disabling ASPM on the BIOS side to no avail.
>
> The bug appears to be systematically existent for many other cards I ended
> up plugging into the device (thus conclusion as PCI driver issue). And does
> appear to have an attempt to fix specifically for amdgpu
> (20220408154447.3519453-1-richard.gong@xxxxxxx) but that never went
> upstream.

Hi Alex, thanks very much for reporting and bisecting this issue. I
added the author of 0064b0ce85bb and the maintainers of amdgpu.