Bug 207441 - Fan speed is not detected for AMD Radeon RX 5600 XT (rev ca)
Summary: Fan speed is not detected for AMD Radeon RX 5600 XT (rev ca)
Status: NEW
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(Other) (show other bugs)
Hardware: x86-64 Linux
: P1 high
Assignee: drivers_video-other
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-04-26 17:51 UTC by Artem S. Tashkinov
Modified: 2020-09-14 06:18 UTC (History)
7 users (show)

See Also:
Kernel Version: 5.6.7
Tree: Mainline
Regression: No


Attachments

Description Artem S. Tashkinov 2020-04-26 17:51:23 UTC
Nothing to add really.

lm-sensors always report 0 RPM, e.g.

$ sensors

amdgpu-pci-0900
Adapter: PCI adapter
vddgfx:       +0.78 V  
fan1:           0 RPM  (min =    0 RPM, max = 3200 RPM)
edge:         +36.0°C  (crit = +118.0°C, hyst = -273.1°C)
                       (emerg = +99.0°C)
junction:     +36.0°C  (crit = +99.0°C, hyst = -273.1°C)
                       (emerg = +99.0°C)
mem:          +40.0°C  (crit = +99.0°C, hyst = -273.1°C)
                       (emerg = +99.0°C)
power1:        8.00 W  (cap = 160.00 W)

(The fans are now spinning at 25%).
Comment 1 Artem S. Tashkinov 2020-04-26 17:54:24 UTC
My GPU VBIOS can be found here:

https://www.techpowerup.com/vgabios/219169/sapphire-rx5600xt-6144-200225
Comment 3 Artem S. Tashkinov 2020-05-02 20:44:46 UTC
It's been a week with no attention from AMD employees. Is there any info I can provide to help resolve this bug?
Comment 4 Alex Deucher 2020-05-03 04:30:18 UTC
Do you get a proper reading from the pwm1 file for the fan on the device?  It's rare, but some board vendors don't provide the necessary information for rpm feedback so all we can expose the is the percentage.
Comment 5 Artem S. Tashkinov 2020-05-03 12:58:20 UTC
(In reply to Alex Deucher from comment #4)
> Do you get a proper reading from the pwm1 file for the fan on the device? 
> It's rare, but some board vendors don't provide the necessary information
> for rpm feedback so all we can expose the is the percentage.

This GPU has a fan-stop feature.

If I do not change any PWM settings and leave everything intact/by default, I get fan readings _when_ the card fans start spinning on their own once the card reaches 60C or above (a threshold for fans).


However when I do this, i.e. enable fans manually:

echo 1 > /sys/class/drm/card0/device/hwmon/hwmon1/fan1_enable
echo 50 > /sys/class/drm/card0/device/hwmon/hwmon1/pwm1

0 rpm is reported.


Again the BIOS image is available, so it would be great if you've taken it into consideration.
Comment 6 Alex Deucher 2020-05-03 16:12:07 UTC
So to clarify, you only get speed reporting problems when you manually set the fan speed?
Comment 7 Artem S. Tashkinov 2020-05-03 16:30:44 UTC
(In reply to Alex Deucher from comment #6)
> So to clarify, you only get speed reporting problems when you manually set
> the fan speed?

Exactly.
Comment 8 Andreas Prittwitz 2020-06-20 18:15:21 UTC
I can confirm Artem's observation with a PowerColor Radeon RX 5700 XT.
The /sys/class/drm/card0/device/hwmon/hwmon0/pwm1 file shows "0" even though the fan is running, no matter the rpm.

lspci output:

09:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT] (rev c1) (prog-if 00 [VGA controller])
        Subsystem: Tul Corporation / PowerColor Device 2398
        Flags: bus master, fast devsel, latency 0, IRQ 97, IOMMU group 2
        Memory at e0000000 (64-bit, prefetchable) [size=256M]
        Memory at f0000000 (64-bit, prefetchable) [size=2M]
        I/O ports at e000 [size=256]
        Memory at fcc00000 (32-bit, non-prefetchable) [size=512K]
        Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: [48] Vendor Specific Information: Len=08 <?>
        Capabilities: [50] Power Management version 3
        Capabilities: [64] Express Legacy Endpoint, MSI 00
        Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
        Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
        Capabilities: [150] Advanced Error Reporting
        Capabilities: [200] Physical Resizable BAR
        Capabilities: [240] Power Budgeting <?>
        Capabilities: [270] Secondary PCI Express
        Capabilities: [2a0] Access Control Services
        Capabilities: [2b0] Address Translation Service (ATS)
        Capabilities: [2c0] Page Request Interface (PRI)
        Capabilities: [2d0] Process Address Space ID (PASID)
        Capabilities: [320] Latency Tolerance Reporting
        Capabilities: [400] Data Link Feature <?>
        Capabilities: [410] Physical Layer 16.0 GT/s <?>
        Capabilities: [440] Lane Margining at the Receiver <?>
        Kernel driver in use: amdgpu
        Kernel modules: amdgpu
Comment 10 Andre Kuehne 2020-09-09 18:50:45 UTC
I have this issue as well with a navi 10 card. I noticed something that might be helpful. The RPM reporting also works with a manual fan curve, but only after the GPU fans were spinning once with the automatic fan profile. It looks like the automatic fan profile does some necessary initialization when the fans are spinning. After that the RPM is correctly reported even if you set a manual profile.
Comment 11 Arthur Borsboom 2020-09-11 08:55:14 UTC
Confirming incorrect fan speed measurement for Navi 14 - 5500M

-------------------------------------------------------------

amdgpu-pci-0300
Adapter: PCI adapter
vddgfx:        6.00 mV 
fan1:        65535 RPM  (min =    0 RPM, max = 3350 RPM)
edge:         +36.0°C  (crit = +108.0°C, hyst = -273.1°C)
                       (emerg = +99.0°C)
junction:     +36.0°C  (crit = +99.0°C, hyst = -273.1°C)
                       (emerg = +99.0°C)
mem:          +36.0°C  (crit = +99.0°C, hyst = -273.1°C)
                       (emerg = +99.0°C)
power1:        3.00 W  (cap =  85.00 W)

-------------------------------------------------------------

03:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 14 [Radeon RX 5500/5500M / Pro 5500M] (rev c1)
        Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12ac
        Flags: bus master, fast devsel, latency 0, IRQ 95, IOMMU group 11
        Memory at d0000000 (64-bit, prefetchable) [size=256M]
        Memory at e0000000 (64-bit, prefetchable) [size=2M]
        Memory at fe500000 (32-bit, non-prefetchable) [size=512K]
        Expansion ROM at fe580000 [disabled] [size=128K]
        Capabilities: <access denied>
        Kernel driver in use: amdgpu
        Kernel modules: amdgpu
Comment 12 Alex Deucher 2020-09-14 06:18:56 UTC
Please see:
https://gitlab.freedesktop.org/drm/amd/-/issues/1164

Note You need to log in before you can comment on or make changes to this bug.