Bug 86891 - AMD/ATI Tahiti XT 7970 - long lags/stutters in games
Summary: AMD/ATI Tahiti XT 7970 - long lags/stutters in games
Status: RESOLVED CODE_FIX
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(DRI - non Intel) (show other bugs)
Hardware: All Linux
: P1 high
Assignee: drivers_video-dri
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-10-25 15:28 UTC by Michael Mair-Keimberger
Modified: 2014-12-10 17:55 UTC (History)
5 users (show)

See Also:
Kernel Version: >=3.17 until 3.18rc1
Subsystem:
Regression: No
Bisected commit-id:


Attachments
Kernel config of 3.18rc1 (86.30 KB, application/octet-stream)
2014-10-25 15:28 UTC, Michael Mair-Keimberger
Details
bisec.tar.gz (2.07 KB, application/gzip)
2014-10-27 20:30 UTC, Michael Mair-Keimberger
Details
dmesg output (78.32 KB, text/plain)
2014-10-29 22:04 UTC, Michael Mair-Keimberger
Details
picture (1.88 MB, image/jpeg)
2014-10-30 22:29 UTC, Michael Mair-Keimberger
Details
another picture (1.40 MB, image/jpeg)
2014-10-30 22:30 UTC, Michael Mair-Keimberger
Details
picture with VRAM and GTT usage (1.79 MB, image/jpeg)
2014-11-01 17:27 UTC, Michael Mair-Keimberger
Details
vally screenshot (827.41 KB, image/jpeg)
2014-11-02 10:33 UTC, Michael Mair-Keimberger
Details

Description Michael Mair-Keimberger 2014-10-25 15:28:22 UTC
Created attachment 155001 [details]
Kernel config of 3.18rc1

Since kernel Version 3.17 and greater (last tested with kernel-3.18rc1) i'm encounter very long stutters/lags (1-2sec) when i'm gaming various games. (eg borderlands2)

Usually i'm doing a unigine valley benchmark test after i install a new kernel to see how much more performance i get and usually the benchmarks are very promising. (FPS continuous rises) :)
However since kernel-3.17 i get really long stutters/lags in my tests. FPS doesn't really go done, they rise even more (max fps), but i also always get really serious stutters, where i have to wait about 1-2 sec until the graphic's continue to render. This is really annoying.

Besides getting higher max fps in Unigine i also get new lowest min fps.
Below a short overview of my benchmark tests (with date):

Linux 3.16.2-gentoo x86_64 - 20140914
FPS: 17.4
Score: 727
Min FPS: 8.2
Max FPS: 28.1

Linux 3.17.0-rc6 x86_64 - 20140927
FPS: 16.2
Score: 680
Min FPS: 4.2
Max FPS: 32.1

Linux 3.18.0-rc1 x86_64 - 20141025
FPS: 15.9
Score: 666
Min FPS: 3.2
Max FPS: 34.4



Today's test (same software stack, just different kernels)

Linux 3.18.0-rc1 x86_64     Linux 3.16.3-gentoo x86_64
FPS: 15.9                   18.7
Score: 666                  782
Min FPS: 3.2                8.3
Max FPS: 34.4               30.5

Usually downgrading to kernel-3.16 always solves the problem for me which is why i think the problem has todo with the kernel side of the ati driver.

I've also tried the new firmware blobs for my graphics card (TAHITI_ce.bin -> tahiti_ce.bin, ...), they didn't make any differences.






My system:

Graphic related packages installed:
media-libs/mesa-10.3.1
sys-devel/llvm-3.5.0
sys-kernel/linux-firmware-20140902
x11-drivers/xf86-video-ati-7.5.0
x11-libs/libdrm-2.4.58

asterix michael # emerge --info                                                                                                                                                                                                                                                                                                
Portage 2.2.14 (python 3.3.5-final-0, default/linux/amd64/13.0, gcc-4.8.3, glibc-2.19-r1, 3.16.3-gentoo x86_64)                                                                                                                                                                                                                
=================================================================                                                                                                                                                                                                                                                              
System uname: Linux-3.16.3-gentoo-x86_64-AMD_FX-tm-8350_Eight-Core_Processor-with-gentoo-2.2                                                                                                                                                                                                                                   
KiB Mem:    16356140 total,  11196304 free                                                                                                                                                                                                                                                                                     
KiB Swap:    8388600 total,   8388600 free                                                                                                                                                                                                                                                                                     
Timestamp of tree: Sat, 25 Oct 2014 04:30:01 +0000                                                                                                                                                                                                                                                                             
ld GNU ld (Gentoo 2.24 p1.4) 2.24                                                                                                                                                                                                                                                                                              
app-shells/bash:          4.3_p30                                                                                                                                                                                                                                                                                              
dev-lang/perl:            5.20.1-r1                                                                                                                                                                                                                                                                                            
dev-lang/python:          2.7.8, 3.3.5-r1                                                                                                                                                                                                                                                                                      
dev-util/cmake:           3.0.2                                                                                                                                                                                                                                                                                                
dev-util/pkgconfig:       0.28-r2                                                                                                                                                                                                                                                                                              
sys-apps/baselayout:      2.2                                                                                                                                                                                                                                                                                                  
sys-apps/openrc:          0.13.1                                                                                                                                                                                                                                                                                               
sys-apps/sandbox:         2.6-r1                                                                                                                                                                                                                                                                                               
sys-devel/autoconf:       2.13, 2.69                                                                                                                                                                                                                                                                                           
sys-devel/automake:       1.11.6, 1.12.6, 1.14.1                                                                                                                                                                                                                                                                               
sys-devel/binutils:       2.24-r3                                                                                                                                                                                                                                                                                              
sys-devel/gcc:            4.8.3                                                                                                                                                                                                                                                                                                
sys-devel/gcc-config:     1.8                                                                                                                                                                                                                                                                                                  
sys-devel/libtool:        2.4.2-r1                                                                                                                                                                                                                                                                                             
sys-devel/make:           4.1-r1                                                                                                                                                                                                                                                                                               
sys-kernel/linux-headers: 3.17 (virtual/os-headers)                                                                                                                                                                                                                                                                            
sys-libs/glibc:           2.19-r1                                                                                                                                                                                                                                                                                              
Repositories: gentoo local sunrise x11 tox-overlay                                                                                                                                                                                                                                                                             
ACCEPT_KEYWORDS="amd64 ~amd64"                                                                                                                                                                                                                                                                                                 
ACCEPT_LICENSE="* -@EULA"                                                                                                                                                                                                                                                                                                      
CBUILD="x86_64-pc-linux-gnu"                                                                                                                                                                                                                                                                                                   
CFLAGS="-O2 -pipe -march=bdver2 -mprefer-avx128"                                                                                                                                                                                                                                                                               
CHOST="x86_64-pc-linux-gnu"                                                                                                                                                                                                                                                                                                    
CONFIG_PROTECT="/etc /usr/share/config /usr/share/gnupg/qualified.txt /usr/share/themes/oxygen-gtk/gtk-2.0 /usr/share/themes/oxygen-gtk/gtk-3.0"                                                                                                                                                                               
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo"                                                                                                                                                               
CXXFLAGS="-O2 -pipe -march=bdver2 -mprefer-avx128"                                                                                                                                                                                                                                                                             
DISTDIR="/usr/portage/distfiles"                                                                                                                                                                                                                                                                                               
FCFLAGS="-O2 -pipe"                                                                                                                                                                                                                                                                                                            
FEATURES="assume-digests binpkg-logs config-protect-if-modified distlocks ebuild-locks fixlafiles merge-sync news parallel-fetch preserve-libs protect-owned sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync"                                                
FFLAGS="-O2 -pipe"                                                                                                                                                                                                                                                                                                             
GENTOO_MIRRORS="http://distfiles.gentoo.org"                                                                                                                                                                                                                                                                                   
LANG="en_US.utf8"                                                                                                                                                                                                                                                                                                              
LDFLAGS="-Wl,-O1 -Wl,--as-needed"                                                                                                                                                                                                                                                                                              
MAKEOPTS="-j9"                                                                                                                                                                                                                                                                                                                 
PKGDIR="/usr/portage/packages"                                                                                                                                                                                                                                                                                                 
PORTAGE_CONFIGROOT="/"                                                                                                                                                                                                                                                                                                         
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --omit-dir-times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages"                                                                                               
PORTAGE_TMPDIR="/var/tmp"                                                                                                                                                                                                                                                                                                      
PORTDIR="/usr/portage"                                                                                                                                                                                                                                                                                                         
PORTDIR_OVERLAY="/media/majestix-public/overlays/local /media/majestix-public/overlays/layman/sunrise /media/majestix-public/overlays/layman/x11 /media/majestix-public/overlays/layman/tox-overlay"                                                                                                                           
SYNC="rsync://192.168.2.1/gentoo-portage"                                                                                                                                                                                                                                                                                      
USE="acl alsa amd64 avx berkdb bzip2 cli cracklib crypt cxx dbus dri exif flac gdbm graphite iconv icu ipv6 jpeg kde lzma mmx mmxext modules mp3 multilib ncurses nls nptl opengl openmp pam pcre png qt4 readline sdl session sse sse2 sse3 sse4_1 ssl ssse3 svg tcpd threads tiff truetype unicode vdpau vim-syntax xinerama zlib" ABI_X86="64" ALSA_CARDS="ali5451 als4000 atiixp atiixp-modem bt87x ca0106 cmipci emu10k1x ens1370 ens1371 es1938 es1968 fm801 hda-intel intel8x0 intel8x0m maestro3 trident usb-audio via82xx via82xx-modem ymfpci" APACHE2_MODULES="authn_core authz_core socache_shmcb unixd actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CALLIGRA_FEATURES="krita sheets words karbon" CAMERAS="ptp2" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" ELIBC="glibc" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock itrax mtk3301 nmea ntrip navcom oceanserver oldstyle oncore rtcm104v2 rtcm104v3 sirf superstar2 timing tsip tripmate tnt ublox ubx" GRUB_PLATFORMS="pc" INPUT_DEVICES="evdev" KERNEL="linux" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" LIBREOFFICE_EXTENSIONS="presenter-console presenter-minimizer" LINGUAS="en" OFFICE_IMPLEMENTATION="libreoffice" PHP_TARGETS="php5-5" PYTHON_SINGLE_TARGET="python2_7" PYTHON_TARGETS="python2_7 python3_3" RUBY_TARGETS="ruby19 ruby20" USERLAND="GNU" VIDEO_CARDS="radeon radeonsi" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipset ipp2p iface geoip fuzzy condition tee tarpit sysrq steal rawnat logmark ipmark dhcpmac delude chaos account"                                                                                                                                                                                                                                                                                   
Unset:  CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LC_ALL, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, USE_PYTHON      

asterix michael # lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (external gfx0 port B) (rev 02)
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD/ATI] RD990 I/O Memory Management Unit (IOMMU)
00:02.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (PCI express gpp port B)
00:09.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (PCI express gpp port H)
00:0a.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (external gfx1 port A)
00:11.0 SATA controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode] (rev 40)
00:12.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:12.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:13.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:13.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 SMBus Controller (rev 42)
00:14.1 IDE interface: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 IDE Controller (rev 40)
00:14.2 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 Azalia (Intel HDA) (rev 40)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 LPC host controller (rev 40)
00:14.4 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 PCI to PCI Bridge (rev 40)
00:14.5 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI2 Controller
00:15.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 PCI to PCI bridge (PCIE port 0)
00:15.1 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 PCI to PCI bridge (PCIE port 1)
00:15.2 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB900 PCI to PCI bridge (PCIE port 2)
00:15.3 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB900 PCI to PCI bridge (PCIE port 3)
00:16.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:16.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 5
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Tahiti XT [Radeon HD 7970/8970 OEM / R9 280X]
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Tahiti XT HDMI Audio [Radeon HD 7970 Series]
02:00.0 USB controller: Etron Technology, Inc. EJ168 USB 3.0 Host Controller (rev 01)
03:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9172 SATA 6Gb/s Controller (rev 11)
04:0e.0 FireWire (IEEE 1394): VIA Technologies, Inc. VT6306/7/8 [Fire II(M)] IEEE 1394 OHCI Controller (rev c0)
05:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 06)
06:00.0 USB controller: Etron Technology, Inc. EJ168 USB 3.0 Host Controller (rev 01)
07:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9172 SATA 6Gb/s Controller (rev 11)
Comment 1 Alex Deucher 2014-10-26 18:48:16 UTC
Likely a duplicate of:
https://bugs.freedesktop.org/show_bug.cgi?id=84662
and
https://bugs.freedesktop.org/show_bug.cgi?id=84570
Can you bisect?
Comment 2 Michael Mair-Keimberger 2014-10-27 18:26:47 UTC
(In reply to Alex Deucher from comment #1)
> Likely a duplicate of:
> https://bugs.freedesktop.org/show_bug.cgi?id=84662
> and
> https://bugs.freedesktop.org/show_bug.cgi?id=84570
> Can you bisect?

Sure, give me some days - i'll try to bisect the problem. :) Guess it will take some time, since there were many commits between 3.16 and 3.17rc1.

I'll take the kernel git tree for the bisect (git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git) and bisect between 3.16rc8 and 3.17rc1.

As soon as i have some result's i'll gonna update this bug.
Comment 3 Michael Mair-Keimberger 2014-10-27 20:25:59 UTC
I though it takes more time, but i already finished bisecting :)

The result:

59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb is the first bad commit
commit 59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb
Author: Lauri Kasanen <cand@gmx.com>
Date:   Sun Apr 20 20:29:33 2014 +0300

    drm/radeon: Inline r100_mm_rreg, -wreg, v3
    
    This was originally un-inlined by Andi Kleen in 2011 citing size concerns.
    Indeed, a first attempt at inlining it grew radeon.ko by 7%.
    
    However, 2% of cpu is spent in this function. Simply inlining it gave 1% more fps
    in Urban Terror.
    
    v2: We know the minimum MMIO size. Adding it to the if allows the compiler to
    optimize the branch out, improving both performance and size.
    
    The v2 patch decreases radeon.ko size by 2%. I didn't re-benchmark, but common sense
    says perf is now more than 1% better.
    
    v3: Also change _wreg, make the threshold a define.
    
    Inlining _wreg increased the size a bit compared to v2, so now radeon.ko
    is only 1% smaller.
    
    Signed-off-by: Lauri Kasanen <cand@gmx.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

:040000 040000 91cde817761a93a06d21855ec896d22f03685665 e7de121e74c415308e8266c26ae7ad518d0e8530 M      drivers

This is the bad commit.


asterix linux # git bisect log
git bisect start
# bad: [7d1311b93e58ed55f3a31cc8f94c4b8fe988a2b9] Linux 3.17-rc1
git bisect bad 7d1311b93e58ed55f3a31cc8f94c4b8fe988a2b9
# good: [64aa90f26c06e1cb2aacfb98a7d0eccfbd6c1a91] Linux 3.16-rc7
git bisect good 64aa90f26c06e1cb2aacfb98a7d0eccfbd6c1a91
# good: [ae045e2455429c418a418a3376301a9e5753a0a8] Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-next
git bisect good ae045e2455429c418a418a3376301a9e5753a0a8                                                                                                                                                                                                    
# bad: [44c916d58b9ef1f2c4aec2def57fa8289c716a60] Merge tag 'cleanup-for-3.17' of git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc                                                                                                                 
git bisect bad 44c916d58b9ef1f2c4aec2def57fa8289c716a60                                                                                                                                                                                                     
# good: [e669830526a0abaf301bf408df69cde33901ac63] Merge branch 'upstream' of git://git.linux-mips.org/pub/scm/ralf/upstream-linus                                                                                                                          
git bisect good e669830526a0abaf301bf408df69cde33901ac63                                                                                                                                                                                                    
# bad: [7963e9db1b1f842fdc53309baa8714d38e9f5681] Revert "drm: drop redundant drm_file->is_master"                                                                                                                                                          
git bisect bad 7963e9db1b1f842fdc53309baa8714d38e9f5681                                                                                                                                                                                                     
# good: [8a105aaa25f4504d26ca828f12d709d2213a230e] Merge branch 'drm-armada-devel' of git://ftp.arm.linux.org.uk/~rmk/linux-arm into drm-next                                                                                                               
git bisect good 8a105aaa25f4504d26ca828f12d709d2213a230e                                                                                                                                                                                                    
# good: [a2fe6cdc03d7a9b0d048a7f32f9d8827e06c67fa] drm/msm/hdmi: fix HDMI_MUX_EN gpio request typo                                                                                                                                                          
git bisect good a2fe6cdc03d7a9b0d048a7f32f9d8827e06c67fa                                                                                                                                                                                                    
# bad: [e7e31600d3e2f8b7726b0521149fc55c62a90467] drm/radeon: remove taking mclk_lock from radeon_bo_unref                                                                                                                                                  
git bisect bad e7e31600d3e2f8b7726b0521149fc55c62a90467                                                                                                                                                                                                     
# bad: [c748990b7b1c320c626c758379d50748588c6ed6] drm/radeon: Use correct value for unknown audio/video latency                                                                                                                                             
git bisect bad c748990b7b1c320c626c758379d50748588c6ed6                                                                                                                                                                                                     
# good: [96b1b9711031a1e95e3cf15d830802aed38479a6] Merge branch 'drm_kms_for_next-v8' of git://git.linaro.org/people/benjamin.gaignard/kernel into drm-next                                                                                                 
git bisect good 96b1b9711031a1e95e3cf15d830802aed38479a6                                                                                                                                                                                                    
# good: [636e2582658742b94e7620becce58f939996c961] drm/radeon/dpm: add support for SVI2 voltage for SI                                                                                                                                                      
git bisect good 636e2582658742b94e7620becce58f939996c961                                                                                                                                                                                                    
# good: [f2c6b0f452c3804496f55655fda28c2809e1a58b] drm/radeon/cik: Add support for new ucode format (v5)                                                                                                                                                    
git bisect good f2c6b0f452c3804496f55655fda28c2809e1a58b                                                                                                                                                                                                    
# good: [da9976206c15178eeae1b4445c9266125bf35b0a] drm/radeon: enable display scaling on all connectors (v2)                                                                                                                                                
git bisect good da9976206c15178eeae1b4445c9266125bf35b0a                                                                                                                                                                                                    
# bad: [59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb] drm/radeon: Inline r100_mm_rreg, -wreg, v3                                                                                                                                                                
git bisect bad 59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb                                                                                                                                                                                                     
# good: [3e22920fbd0005927bc41f71daeb056a0f4def82] drm/radeon: consolidate vga and dvi get_modes functions (v2)                                                                                                                                             
git bisect good 3e22920fbd0005927bc41f71daeb056a0f4def82                                                                                                                                                                                                    
# first bad commit: [59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb] drm/radeon: Inline r100_mm_rreg, -wreg, v3
Comment 4 Michael Mair-Keimberger 2014-10-27 20:30:22 UTC
Created attachment 155491 [details]
bisec.tar.gz

Additional all benchmarks i've made from every bisected kernel. the first one is the 3.16rc7 benchmark, the second the 3.17rc1. The others are from the bisected kernels.

You'll see on the benchmarks it always has a difference of about 100 points (good vs bad), which is about 10% performance difference.
Comment 5 Dieter Nützel 2014-10-27 22:45:36 UTC
Can you please test with one of kernel git | 3.18-rc2 | drm-next together with
git revert 59bc1d8?
Comment 6 Michael Mair-Keimberger 2014-10-28 08:04:59 UTC
(In reply to Dieter Nützel from comment #5)
> Can you please test with one of kernel git | 3.18-rc2 | drm-next together
> with
> git revert 59bc1d8?

I've tried it with kernel git 3.18rc2 with a pull a few minutes ago and with `git revert 59bc1d8`. The result looks promising. I've made two benchmarks:

         1st     2nd
FPS:     18.3    18.1
Score:   765     757
Min FPS: 5.9     6.3
Max FPS: 32.4    31.9

I didn't include drm-next simply because i don't know how todo that. :) If you want me to test drm-next as well please point me to some documentation how to include it. :)
Comment 7 Michel Dänzer 2014-10-29 08:46:06 UTC
Does Mesa 10.3.2 work better, specifically commit 64c2bdc334ba472603b1e7cd2c3046cfbce285b6?
Comment 8 Dieter Nützel 2014-10-29 17:10:37 UTC
(In reply to Michael Mair-Keimberger from comment #6)
> (In reply to Dieter Nützel from comment #5)
> > Can you please test with one of kernel git | 3.18-rc2 | drm-next together
> > with
> > git revert 59bc1d8?
> 
> I've tried it with kernel git 3.18rc2 with a pull a few minutes ago and with
> `git revert 59bc1d8`. The result looks promising. I've made two benchmarks:
> 
>          1st     2nd
> FPS:     18.3    18.1
> Score:   765     757
> Min FPS: 5.9     6.3
> Max FPS: 32.4    31.9

Yes, looks much better, but the code shouldn't touch any relevant (radeonsi/r600g) code paths. - Michel?

On r600g (RV730 AGP) I do NOT see any (real) change with this revert...
Maybe we do not hit the real BAD commit.

> I didn't include drm-next simply because i don't know how todo that. :) If
> you want me to test drm-next as well please point me to some documentation
> how to include it. :)

Alex's drm-next-3.19-wip (it shows 3.17-rc5 ;-) for example:
git clone git://people.freedesktop.org/~agd5f/linux/ drm-next-3.19-wip

If you have it already, get it or change it to another tree:
cd drm-next-3.19-wip
git checkout -b drm-next-3.19-wip remotes/origin/drm-next-3.19-wip

Sometimes you need this:
git fetch origin
git reset --hard origin/drm-next-3.19-wip
Comment 9 Alex Deucher 2014-10-29 17:26:28 UTC
(In reply to Dieter Nützel from comment #8)
> 
> Yes, looks much better, but the code shouldn't touch any relevant
> (radeonsi/r600g) code paths. - Michel?
> 
> On r600g (RV730 AGP) I do NOT see any (real) change with this revert...
> Maybe we do not hit the real BAD commit.

What issue are you seeing and what makes you think it has anything to do with this bug?
Comment 10 Michael Mair-Keimberger 2014-10-29 19:34:21 UTC
(In reply to Michel Dänzer from comment #7)
> Does Mesa 10.3.2 work better, specifically commit
> 64c2bdc334ba472603b1e7cd2c3046cfbce285b6?

I'll get slightly better results with 10.3.2 (with 3.18rc1):
FPS: 16.6
Score: 694
Min FPS: 3.7
Max FPS: 33.1

But honestly, watching the demo feels like it got even worse. Still very long lag's, especially at the beginning of new scene's (before it starts to render).

(In reply to Dieter Nützel from comment #8)
> Alex's drm-next-3.19-wip (it shows 3.17-rc5 ;-) for example:
> git clone git://people.freedesktop.org/~agd5f/linux/ drm-next-3.19-wip
> 
> If you have it already, get it or change it to another tree:
> cd drm-next-3.19-wip
> git checkout -b drm-next-3.19-wip remotes/origin/drm-next-3.19-wip
> 
> Sometimes you need this:
> git fetch origin
> git reset --hard origin/drm-next-3.19-wip

I've just started cloning drm-next-3.19 but freedesktop seems to be quite slow - looks like i can start testing it tomorrow :/
Comment 11 Michael Mair-Keimberger 2014-10-29 22:04:02 UTC
Created attachment 155841 [details]
dmesg output

Don't know if this helps but i just saw that i got strange (?) output in dmesg. This output showed up about 20min after i made that benchmark mentioned before (mesa-10.3.2/kernel-3.18rc1). As far as i can remember i didn't do anything specific that moment - just internet surfing.
Comment 12 Alex Deucher 2014-10-29 22:06:28 UTC
looks like https://bugs.freedesktop.org/show_bug.cgi?id=84500
Comment 13 Michel Dänzer 2014-10-30 03:17:07 UTC
(In reply to Michael Mair-Keimberger from comment #10)
> I'll get slightly better results with 10.3.2 (with 3.18rc1):
[...]
> But honestly, watching the demo feels like it got even worse. Still very
> long lag's, especially at the beginning of new scene's (before it starts to
> render).

Weird, it seemed to help a lot for myself and many others. Any chance you could try current Mesa Git master?

Can you create a screenshot from running with GALLIUM_HUD and showing the graphs corresponding to a lag, such as in https://bugs.freedesktop.org/show_bug.cgi?id=84570 ?
Comment 14 Michael Mair-Keimberger 2014-10-30 19:33:57 UTC
OK, today i made another benchmarks: drm-next, mesa-10.3.2. First without any changes, second with `git revert 59bc1d8`:

without any changes         with `git revert 59bc1d8`
FPS:      14.3                 19.1
Score:    599                  801
Min FPS:  2.1                  12.2
Max FPS:  30.7                 30.7


Honestly, the difference is incredible. Can't believe such a small change has such a big impact. It even seems with the commit the performance get's worse over time - never had under 600 points before..

@Michel: mesa-10.3.2 does indeed help. I already had minor lag's/stutters in the past (pre-3.17) - that was "normal" for me, but now i have ZERO lags. The complete benchmark was done without one major lag. AWESOME :)

If anyone is interested i can also create videos, so you can see the differences :)
Comment 15 Michael Mair-Keimberger 2014-10-30 22:29:31 UTC
Created attachment 155911 [details]
picture

I've made a few other benchmarks and tried to take some screenshots. Unfortunately vally doesn't include GALLIUM_HUD when i'm taking screenshot's. As a workaround i've made photos with my mobile. Hope that's ok :)

On a side-note: I've got other kernel msg's with the unchanged kernel. Don't know if it's relevant but it looks like that:

[ 2915.586345] radeon 0000:01:00.0: GPU fault detected: 146 0x00139004
[ 2915.586350] radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00004B00
[ 2915.586353] radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x13090004
[ 2915.586357] VM fault (0x04, vmid 9) at page 19200, write from CB (144)
[ 2915.586362] radeon 0000:01:00.0: GPU fault detected: 146 0x00339004
[ 2915.586365] radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00004B07
[ 2915.586367] radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x13090004
[ 2915.586370] VM fault (0x04, vmid 9) at page 19207, write from CB (144)
[ 2915.586374] radeon 0000:01:00.0: GPU fault detected: 146 0x00539004
[ 2915.586377] radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00004B02
[ 2915.586379] radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x13090004
[ 2915.586382] VM fault (0x04, vmid 9) at page 19202, write from CB (144)
[ 2915.586386] radeon 0000:01:00.0: GPU fault detected: 146 0x00739004
.
.
.

I don't know if i would get them with the changed kernel as-well, but with my stable kernel (3.16) i never see such messages.
Comment 16 Michael Mair-Keimberger 2014-10-30 22:30:30 UTC
Created attachment 155921 [details]
another picture

another picture
Comment 17 Michel Dänzer 2014-10-31 03:28:56 UTC
(In reply to Michael Mair-Keimberger from comment #14)
> Honestly, the difference is incredible. Can't believe such a small change
> has such a big impact.

Yeah, it's really weird. Looking at the change, the only way I could imagine it possibly having any negative impact would be if it somehow caused the indirect register access method to be used even when it's not necessary. But not sure how that could happen.


(In reply to Michael Mair-Keimberger from comment #15)
> Unfortunately vally doesn't include GALLIUM_HUD when i'm taking
> screenshot's. As a workaround i've made photos with my mobile. Hope that's
> ok :)

That's fine, but we also need to see the VRAM and GTT graphs.
Comment 18 Michael Mair-Keimberger 2014-11-01 17:27:15 UTC
Created attachment 156051 [details]
picture with VRAM and GTT usage

(In reply to Michel Dänzer from comment #17)
> (In reply to Michael Mair-Keimberger from comment #15)
> > Unfortunately vally doesn't include GALLIUM_HUD when i'm taking
> > screenshot's. As a workaround i've made photos with my mobile. Hope that's
> > ok :)
> 
> That's fine, but we also need to see the VRAM and GTT graphs.

Sorry, completely forget about that.. 
Another picture with VRAM and GTT usage. I've used `GALLIUM_HUD=fps,requested-VRAM+VRAM-usage,requested-GTT+GTT` to start the benchmark.
Comment 19 Andy Furniss 2014-11-01 19:45:03 UTC
(In reply to Michael Mair-Keimberger from comment #18)
> Created attachment 156051 [details]
> picture with VRAM and GTT usage
> 
> (In reply to Michel Dänzer from comment #17)
> > (In reply to Michael Mair-Keimberger from comment #15)
> > > Unfortunately vally doesn't include GALLIUM_HUD when i'm taking
> > > screenshot's. As a workaround i've made photos with my mobile. Hope
> that's
> > > ok :)
> > 
> > That's fine, but we also need to see the VRAM and GTT graphs.
> 
> Sorry, completely forget about that.. 
> Another picture with VRAM and GTT usage. I've used
> `GALLIUM_HUD=fps,requested-VRAM+VRAM-usage,requested-GTT+GTT` to start the
> benchmark.

Should be ...requested-GTT+GTT-usage

I used to have similar issues with valley, but for my setup/card (R9270X) they are fixed with current mesa + drm-next-3.19-wip.

One thing I always do is set CPUs to performance in case cpufreq messes things up - may be worth a try to see if it helps.

What setting(s)/res do you run valley with?

It may be less hassle for you to use a phone, but FWIW the way I get screenshots that include the HUD is to use xwd - for something fullscreen I would before starting valley from a different xterm/console/whatever do something like -

sleep 100 && xwd -root -out whatever.xwd 

then start valley and wait. To view "whatever.xwd" you can use xwud,to upload you could convert to another "normal" format. You need some image program to do this - I have ImageMagick installed and can just type in a terminal -

convert whatever.xwd whatever.png
Comment 20 Michael Mair-Keimberger 2014-11-02 10:33:43 UTC
Created attachment 156161 [details]
vally screenshot

(In reply to Andy Furniss from comment #19)
> (In reply to Michael Mair-Keimberger from comment #18)
> > Created attachment 156051 [details]
> > picture with VRAM and GTT usage
> > 
> > (In reply to Michel Dänzer from comment #17)
> > > (In reply to Michael Mair-Keimberger from comment #15)
> > > > Unfortunately vally doesn't include GALLIUM_HUD when i'm taking
> > > > screenshot's. As a workaround i've made photos with my mobile. Hope
> that's
> > > > ok :)
> > > 
> > > That's fine, but we also need to see the VRAM and GTT graphs.
> > 
> > Sorry, completely forget about that.. 
> > Another picture with VRAM and GTT usage. I've used
> > `GALLIUM_HUD=fps,requested-VRAM+VRAM-usage,requested-GTT+GTT` to start the
> > benchmark.
> 
> Should be ...requested-GTT+GTT-usage
> 
> I used to have similar issues with valley, but for my setup/card (R9270X)
> they are fixed with current mesa + drm-next-3.19-wip.
> 
> One thing I always do is set CPUs to performance in case cpufreq messes
> things up - may be worth a try to see if it helps.
> 
> What setting(s)/res do you run valley with?
> 
> It may be less hassle for you to use a phone, but FWIW the way I get
> screenshots that include the HUD is to use xwd - for something fullscreen I
> would before starting valley from a different xterm/console/whatever do
> something like -
> 
> sleep 100 && xwd -root -out whatever.xwd 
> 
> then start valley and wait. To view "whatever.xwd" you can use xwud,to
> upload you could convert to another "normal" format. You need some image
> program to do this - I have ImageMagick installed and can just type in a
> terminal -
> 
> convert whatever.xwd whatever.png


It's fixed! (for me) - mesa git did the miracle :)

FYI - changing CPU's to performance didn't had any influence. I've made another screenshot, this time with xwd (thanks for the tip btw) and with GTT-usage (thangs for pointing that out - that was a copy paste error). Don't know if it's still important but i'll upload it anyway.

Settings for vally are as followed:
Quality: Ultra
Stereo 3d: Disabled
Monitors: Single
Anti-aliasing: Off
Full Screen: Yes
Resolution: 2560x1600
Comment 21 Michael Mair-Keimberger 2014-11-02 10:41:58 UTC
Just to clarify - the screenshot was made with mesa-10.3.2 and cpu's frequency set to performance.

With mesa git i got following results:
FPS: 20.0
Score: 838
Min FPS: 7.4
Max FPS: 30.5

Pretty neat actually! I'll gonna make another benchmark with my patched kernel (git revert 59bc1d8) and look if it has an influence in performance :)

Note You need to log in before you can comment on or make changes to this bug.