Created attachment 155001 [details]
Kernel config of 3.18rc1

Since kernel version 3.17 (last tested with kernel-3.18rc1) I'm encountering very long stutters/lags (1-2 sec) when playing various games (e.g. Borderlands 2).

I usually run a Unigine Valley benchmark after installing a new kernel to see how much performance I gain, and usually the results are very promising (FPS rises continuously). :)
However, since kernel-3.17 I get really long stutters/lags in my tests. The FPS doesn't really go down; the max FPS rises even more. But I also always get really serious stutters, where I have to wait about 1-2 sec until the graphics continue to render. This is really annoying. Besides getting higher max FPS in Unigine, I also get new lowest min FPS.

Below is a short overview of my benchmark tests (with date):

Kernel                        Date      FPS   Score  Min FPS  Max FPS
Linux 3.16.2-gentoo x86_64    20140914  17.4  727    8.2      28.1
Linux 3.17.0-rc6 x86_64       20140927  16.2  680    4.2      32.1
Linux 3.18.0-rc1 x86_64       20141025  15.9  666    3.2      34.4

Today's test (same software stack, just different kernels):

          Linux 3.18.0-rc1 x86_64   Linux 3.16.3-gentoo x86_64
FPS:      15.9                      18.7
Score:    666                       782
Min FPS:  3.2                       8.3
Max FPS:  34.4                      30.5

Downgrading to kernel-3.16 always solves the problem for me, which is why I think the problem has to do with the kernel side of the ati driver.
I've also tried the new firmware blobs for my graphics card (TAHITI_ce.bin -> tahiti_ce.bin, ...); they didn't make any difference.
My system:

Graphics-related packages installed:
media-libs/mesa-10.3.1
sys-devel/llvm-3.5.0
sys-kernel/linux-firmware-20140902
x11-drivers/xf86-video-ati-7.5.0
x11-libs/libdrm-2.4.58

asterix michael # emerge --info
Portage 2.2.14 (python 3.3.5-final-0, default/linux/amd64/13.0, gcc-4.8.3, glibc-2.19-r1, 3.16.3-gentoo x86_64)
=================================================================
System uname: Linux-3.16.3-gentoo-x86_64-AMD_FX-tm-8350_Eight-Core_Processor-with-gentoo-2.2
KiB Mem: 16356140 total, 11196304 free
KiB Swap: 8388600 total, 8388600 free
Timestamp of tree: Sat, 25 Oct 2014 04:30:01 +0000
ld GNU ld (Gentoo 2.24 p1.4) 2.24
app-shells/bash: 4.3_p30
dev-lang/perl: 5.20.1-r1
dev-lang/python: 2.7.8, 3.3.5-r1
dev-util/cmake: 3.0.2
dev-util/pkgconfig: 0.28-r2
sys-apps/baselayout: 2.2
sys-apps/openrc: 0.13.1
sys-apps/sandbox: 2.6-r1
sys-devel/autoconf: 2.13, 2.69
sys-devel/automake: 1.11.6, 1.12.6, 1.14.1
sys-devel/binutils: 2.24-r3
sys-devel/gcc: 4.8.3
sys-devel/gcc-config: 1.8
sys-devel/libtool: 2.4.2-r1
sys-devel/make: 4.1-r1
sys-kernel/linux-headers: 3.17 (virtual/os-headers)
sys-libs/glibc: 2.19-r1
Repositories: gentoo local sunrise x11 tox-overlay
ACCEPT_KEYWORDS="amd64 ~amd64"
ACCEPT_LICENSE="* -@EULA"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-O2 -pipe -march=bdver2 -mprefer-avx128"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/share/config /usr/share/gnupg/qualified.txt /usr/share/themes/oxygen-gtk/gtk-2.0 /usr/share/themes/oxygen-gtk/gtk-3.0"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo"
CXXFLAGS="-O2 -pipe -march=bdver2 -mprefer-avx128"
DISTDIR="/usr/portage/distfiles"
FCFLAGS="-O2 -pipe"
FEATURES="assume-digests binpkg-logs config-protect-if-modified distlocks ebuild-locks fixlafiles merge-sync news parallel-fetch preserve-libs protect-owned sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync"
FFLAGS="-O2 -pipe"
GENTOO_MIRRORS="http://distfiles.gentoo.org"
LANG="en_US.utf8"
LDFLAGS="-Wl,-O1 -Wl,--as-needed"
MAKEOPTS="-j9"
PKGDIR="/usr/portage/packages"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --omit-dir-times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages"
PORTAGE_TMPDIR="/var/tmp"
PORTDIR="/usr/portage"
PORTDIR_OVERLAY="/media/majestix-public/overlays/local /media/majestix-public/overlays/layman/sunrise /media/majestix-public/overlays/layman/x11 /media/majestix-public/overlays/layman/tox-overlay"
SYNC="rsync://192.168.2.1/gentoo-portage"
USE="acl alsa amd64 avx berkdb bzip2 cli cracklib crypt cxx dbus dri exif flac gdbm graphite iconv icu ipv6 jpeg kde lzma mmx mmxext modules mp3 multilib ncurses nls nptl opengl openmp pam pcre png qt4 readline sdl session sse sse2 sse3 sse4_1 ssl ssse3 svg tcpd threads tiff truetype unicode vdpau vim-syntax xinerama zlib"
ABI_X86="64"
ALSA_CARDS="ali5451 als4000 atiixp atiixp-modem bt87x ca0106 cmipci emu10k1x ens1370 ens1371 es1938 es1968 fm801 hda-intel intel8x0 intel8x0m maestro3 trident usb-audio via82xx via82xx-modem ymfpci"
APACHE2_MODULES="authn_core authz_core socache_shmcb unixd actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias"
CALLIGRA_FEATURES="krita sheets words karbon"
CAMERAS="ptp2"
COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog"
ELIBC="glibc"
GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock itrax mtk3301 nmea ntrip navcom oceanserver oldstyle oncore rtcm104v2 rtcm104v3 sirf superstar2 timing tsip tripmate tnt ublox ubx"
GRUB_PLATFORMS="pc"
INPUT_DEVICES="evdev"
KERNEL="linux"
LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text"
LIBREOFFICE_EXTENSIONS="presenter-console presenter-minimizer"
LINGUAS="en"
OFFICE_IMPLEMENTATION="libreoffice"
PHP_TARGETS="php5-5"
PYTHON_SINGLE_TARGET="python2_7"
PYTHON_TARGETS="python2_7 python3_3"
RUBY_TARGETS="ruby19 ruby20"
USERLAND="GNU"
VIDEO_CARDS="radeon radeonsi"
XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipset ipp2p iface geoip fuzzy condition tee tarpit sysrq steal rawnat logmark ipmark dhcpmac delude chaos account"
Unset: CPPFLAGS, CTARGET, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LC_ALL, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, USE_PYTHON

asterix michael # lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (external gfx0 port B) (rev 02)
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD/ATI] RD990 I/O Memory Management Unit (IOMMU)
00:02.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (PCI express gpp port B)
00:09.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (PCI express gpp port H)
00:0a.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890 PCI to PCI bridge (external gfx1 port A)
00:11.0 SATA controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode] (rev 40)
00:12.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:12.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:13.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:13.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 SMBus Controller (rev 42)
00:14.1 IDE interface: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 IDE Controller (rev 40)
00:14.2 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 Azalia (Intel HDA) (rev 40)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 LPC host controller (rev 40)
00:14.4 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 PCI to PCI Bridge (rev 40)
00:14.5 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI2 Controller
00:15.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 PCI to PCI bridge (PCIE port 0)
00:15.1 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB700/SB800/SB900 PCI to PCI bridge (PCIE port 1)
00:15.2 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB900 PCI to PCI bridge (PCIE port 2)
00:15.3 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB900 PCI to PCI bridge (PCIE port 3)
00:16.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:16.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 5
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Tahiti XT [Radeon HD 7970/8970 OEM / R9 280X]
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Tahiti XT HDMI Audio [Radeon HD 7970 Series]
02:00.0 USB controller: Etron Technology, Inc. EJ168 USB 3.0 Host Controller (rev 01)
03:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9172 SATA 6Gb/s Controller (rev 11)
04:0e.0 FireWire (IEEE 1394): VIA Technologies, Inc. VT6306/7/8 [Fire II(M)] IEEE 1394 OHCI Controller (rev c0)
05:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 06)
06:00.0 USB controller: Etron Technology, Inc. EJ168 USB 3.0 Host Controller (rev 01)
07:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9172 SATA 6Gb/s Controller (rev 11)
Likely a duplicate of: https://bugs.freedesktop.org/show_bug.cgi?id=84662 and https://bugs.freedesktop.org/show_bug.cgi?id=84570 Can you bisect?
(In reply to Alex Deucher from comment #1)
> Likely a duplicate of:
> https://bugs.freedesktop.org/show_bug.cgi?id=84662
> and
> https://bugs.freedesktop.org/show_bug.cgi?id=84570
> Can you bisect?

Sure, give me a few days and I'll try to bisect the problem. :)
I guess it will take some time, since there were many commits between 3.16 and 3.17rc1. I'll use the kernel git tree for the bisect (git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git) and bisect between 3.16rc7 and 3.17rc1. As soon as I have some results I'll update this bug.
I thought it would take more time, but I've already finished bisecting :)
The result:

59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb is the first bad commit
commit 59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb
Author: Lauri Kasanen <cand@gmx.com>
Date: Sun Apr 20 20:29:33 2014 +0300

    drm/radeon: Inline r100_mm_rreg, -wreg, v3

    This was originally un-inlined by Andi Kleen in 2011 citing size concerns.
    Indeed, a first attempt at inlining it grew radeon.ko by 7%.

    However, 2% of cpu is spent in this function. Simply inlining it gave 1% more fps in Urban Terror.

    v2: We know the minimum MMIO size. Adding it to the if allows the compiler to optimize the branch out, improving both performance and size.

    The v2 patch decreases radeon.ko size by 2%. I didn't re-benchmark, but common sense says perf is now more than 1% better.

    v3: Also change _wreg, make the threshold a define. Inlining _wreg increased the size a bit compared to v2, so now radeon.ko is only 1% smaller.

    Signed-off-by: Lauri Kasanen <cand@gmx.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

:040000 040000 91cde817761a93a06d21855ec896d22f03685665 e7de121e74c415308e8266c26ae7ad518d0e8530 M drivers

This is the bad commit.
asterix linux # git bisect log
git bisect start
# bad: [7d1311b93e58ed55f3a31cc8f94c4b8fe988a2b9] Linux 3.17-rc1
git bisect bad 7d1311b93e58ed55f3a31cc8f94c4b8fe988a2b9
# good: [64aa90f26c06e1cb2aacfb98a7d0eccfbd6c1a91] Linux 3.16-rc7
git bisect good 64aa90f26c06e1cb2aacfb98a7d0eccfbd6c1a91
# good: [ae045e2455429c418a418a3376301a9e5753a0a8] Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-next
git bisect good ae045e2455429c418a418a3376301a9e5753a0a8
# bad: [44c916d58b9ef1f2c4aec2def57fa8289c716a60] Merge tag 'cleanup-for-3.17' of git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc
git bisect bad 44c916d58b9ef1f2c4aec2def57fa8289c716a60
# good: [e669830526a0abaf301bf408df69cde33901ac63] Merge branch 'upstream' of git://git.linux-mips.org/pub/scm/ralf/upstream-linus
git bisect good e669830526a0abaf301bf408df69cde33901ac63
# bad: [7963e9db1b1f842fdc53309baa8714d38e9f5681] Revert "drm: drop redundant drm_file->is_master"
git bisect bad 7963e9db1b1f842fdc53309baa8714d38e9f5681
# good: [8a105aaa25f4504d26ca828f12d709d2213a230e] Merge branch 'drm-armada-devel' of git://ftp.arm.linux.org.uk/~rmk/linux-arm into drm-next
git bisect good 8a105aaa25f4504d26ca828f12d709d2213a230e
# good: [a2fe6cdc03d7a9b0d048a7f32f9d8827e06c67fa] drm/msm/hdmi: fix HDMI_MUX_EN gpio request typo
git bisect good a2fe6cdc03d7a9b0d048a7f32f9d8827e06c67fa
# bad: [e7e31600d3e2f8b7726b0521149fc55c62a90467] drm/radeon: remove taking mclk_lock from radeon_bo_unref
git bisect bad e7e31600d3e2f8b7726b0521149fc55c62a90467
# bad: [c748990b7b1c320c626c758379d50748588c6ed6] drm/radeon: Use correct value for unknown audio/video latency
git bisect bad c748990b7b1c320c626c758379d50748588c6ed6
# good: [96b1b9711031a1e95e3cf15d830802aed38479a6] Merge branch 'drm_kms_for_next-v8' of git://git.linaro.org/people/benjamin.gaignard/kernel into drm-next
git bisect good 96b1b9711031a1e95e3cf15d830802aed38479a6
# good: [636e2582658742b94e7620becce58f939996c961] drm/radeon/dpm: add support for SVI2 voltage for SI
git bisect good 636e2582658742b94e7620becce58f939996c961
# good: [f2c6b0f452c3804496f55655fda28c2809e1a58b] drm/radeon/cik: Add support for new ucode format (v5)
git bisect good f2c6b0f452c3804496f55655fda28c2809e1a58b
# good: [da9976206c15178eeae1b4445c9266125bf35b0a] drm/radeon: enable display scaling on all connectors (v2)
git bisect good da9976206c15178eeae1b4445c9266125bf35b0a
# bad: [59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb] drm/radeon: Inline r100_mm_rreg, -wreg, v3
git bisect bad 59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb
# good: [3e22920fbd0005927bc41f71daeb056a0f4def82] drm/radeon: consolidate vga and dvi get_modes functions (v2)
git bisect good 3e22920fbd0005927bc41f71daeb056a0f4def82
# first bad commit: [59bc1d89d6a4d67c94a9b70fa81bda1d5b04f0cb] drm/radeon: Inline r100_mm_rreg, -wreg, v3
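
For readers unfamiliar with the mechanics behind a log like this one, here is a minimal sketch of a bisect on a throwaway repository. It is not the reporter's procedure: each kernel step in this thread was built and benchmarked by hand, whereas the sketch swaps in `git bisect run` with an automated check. The repository, the `score` file, the commit names and the threshold of 700 are all made up for illustration.

```shell
# Toy repository: 10 commits, where a made-up "benchmark score" drops
# from 780 to 680 starting with commit 7. Everything here is hypothetical.
repo=$(mktemp -d)
g() { git -C "$repo" "$@"; }   # run git inside the toy repository

g init -q
g config user.email demo@example.invalid
g config user.name demo

for i in 1 2 3 4 5 6 7 8 9 10; do
    if [ "$i" -lt 7 ]; then echo 780; else echo 680; fi > "$repo/score"
    echo "$i" > "$repo/rev"
    g add score rev
    g commit -q -m "commit $i"
done

# Mark the newest commit as bad and the oldest as good, then let git
# check out midpoints and run the test on each one.
g bisect start HEAD HEAD~9 >/dev/null
# Exit status 0 means "good", 1 means "bad" for the commit being tested.
g bisect run sh -c 'test "$(cat score)" -ge 700' >/dev/null

# refs/bisect/bad points at the first bad commit once the bisect is done.
first_bad=$(g log -1 --format=%s refs/bisect/bad)
g bisect reset >/dev/null 2>&1
rm -rf "$repo"
echo "first bad: $first_bad"
```

On a real kernel tree the equivalent starting point would be `git bisect start v3.17-rc1 v3.16-rc7`, followed by a manual `git bisect good` or `git bisect bad` after each build and benchmark run, which is what the log above records.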
Created attachment 155491 [details]
bisec.tar.gz

Additionally, here are all the benchmarks I made for every bisected kernel. The first one is the 3.16rc7 benchmark, the second the 3.17rc1 one. The others are from the bisected kernels.
You'll see in the benchmarks that there is always a difference of about 100 points (good vs. bad), which is about a 10% performance difference.
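
As a quick way to put a number on such score differences, the following sketch computes the relative change between two scores. The helper name `pct_change` is hypothetical, and the example values are the two scores from the opening report (782 on 3.16.3 and 666 on 3.18.0-rc1), not the figures inside the attachment.

```shell
# Hypothetical helper: signed change from a baseline score to a new score,
# printed in percent with one decimal place.
pct_change() {
    # $1 = baseline score, $2 = new score
    awk -v base="$1" -v new="$2" \
        'BEGIN { printf "%.1f\n", (new - base) / base * 100 }'
}

# Scores from the opening report: 3.16.3-gentoo vs. 3.18.0-rc1
pct_change 782 666
```

A negative result means the newer kernel scored lower than the baseline.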
Can you please test with one of kernel git | 3.18-rc2 | drm-next together with git revert 59bc1d8?
(In reply to Dieter Nützel from comment #5)
> Can you please test with one of kernel git | 3.18-rc2 | drm-next together
> with
> git revert 59bc1d8?

I've tried it with kernel git 3.18rc2, pulled a few minutes ago, with `git revert 59bc1d8` applied. The result looks promising. I ran two benchmarks:

          1st    2nd
FPS:      18.3   18.1
Score:    765    757
Min FPS:  5.9    6.3
Max FPS:  32.4   31.9

I didn't include drm-next simply because I don't know how to do that. :) If you want me to test drm-next as well, please point me to some documentation on how to include it. :)
Does Mesa 10.3.2 work better, specifically commit 64c2bdc334ba472603b1e7cd2c3046cfbce285b6?
(In reply to Michael Mair-Keimberger from comment #6)
> (In reply to Dieter Nützel from comment #5)
> > Can you please test with one of kernel git | 3.18-rc2 | drm-next together
> > with
> > git revert 59bc1d8?
>
> I've tried it with kernel git 3.18rc2 with a pull a few minutes ago and with
> `git revert 59bc1d8`. The result looks promising. I've made two benchmarks:
>
>           1st    2nd
> FPS:      18.3   18.1
> Score:    765    757
> Min FPS:  5.9    6.3
> Max FPS:  32.4   31.9

Yes, looks much better, but the code shouldn't touch any relevant (radeonsi/r600g) code paths. - Michel?

On r600g (RV730 AGP) I do NOT see any (real) change with this revert...
Maybe we do not hit the real BAD commit.

> I didn't include drm-next simply because i don't know how todo that. :) If
> you want me to test drm-next as well please point me to some documentation
> how to include it. :)

Alex's drm-next-3.19-wip (it shows 3.17-rc5 ;-) for example:
git clone git://people.freedesktop.org/~agd5f/linux/ drm-next-3.19-wip

If you have it already, get it or change it to another tree:
cd drm-next-3.19-wip
git checkout -b drm-next-3.19-wip remotes/origin/drm-next-3.19-wip

Sometimes you need this:
git fetch origin
git reset --hard origin/drm-next-3.19-wip
(In reply to Dieter Nützel from comment #8)
>
> Yes, looks much better, but the code shouldn't touch any relevant
> (radeonsi/r600g) code paths. - Michel?
>
> On r600g (RV730 AGP) I do NOT see any (real) change with this revert...
> Maybe we do not hit the real BAD commit.

What issue are you seeing and what makes you think it has anything to do with this bug?
(In reply to Michel Dänzer from comment #7)
> Does Mesa 10.3.2 work better, specifically commit
> 64c2bdc334ba472603b1e7cd2c3046cfbce285b6?

I get slightly better results with 10.3.2 (with 3.18rc1):

FPS:      16.6
Score:    694
Min FPS:  3.7
Max FPS:  33.1

But honestly, watching the demo feels like it got even worse. There are still very long lags, especially at the beginning of new scenes (before it starts to render).

(In reply to Dieter Nützel from comment #8)
> Alex's drm-next-3.19-wip (it shows 3.17-rc5 ;-) for example:
> git clone git://people.freedesktop.org/~agd5f/linux/ drm-next-3.19-wip
>
> If you have it already, get it or change it to another tree:
> cd drm-next-3.19-wip
> git checkout -b drm-next-3.19-wip remotes/origin/drm-next-3.19-wip
>
> Sometimes you need this:
> git fetch origin
> git reset --hard origin/drm-next-3.19-wip

I've just started cloning drm-next-3.19, but freedesktop seems to be quite slow, so it looks like I can only start testing it tomorrow :/
Created attachment 155841 [details]
dmesg output

I don't know if this helps, but I just saw that I got strange (?) output in dmesg. This output showed up about 20 min after I ran the benchmark mentioned before (mesa-10.3.2/kernel-3.18rc1). As far as I can remember I wasn't doing anything specific at that moment, just browsing the internet.
looks like https://bugs.freedesktop.org/show_bug.cgi?id=84500
(In reply to Michael Mair-Keimberger from comment #10)
> I'll get slightly better results with 10.3.2 (with 3.18rc1):
[...]
> But honestly, watching the demo feels like it got even worse. Still very
> long lag's, especially at the beginning of new scene's (before it starts to
> render).

Weird, it seemed to help a lot for myself and many others. Any chance you could try current Mesa Git master?

Can you create a screenshot from running with GALLIUM_HUD and showing the graphs corresponding to a lag, such as in https://bugs.freedesktop.org/show_bug.cgi?id=84570 ?
OK, today I ran more benchmarks: drm-next, mesa-10.3.2. First without any changes, second with `git revert 59bc1d8`:

          without any changes   with `git revert 59bc1d8`
FPS:      14.3                  19.1
Score:    599                   801
Min FPS:  2.1                   12.2
Max FPS:  30.7                  30.7

Honestly, the difference is incredible. I can't believe such a small change has such a big impact. It even seems that with the commit the performance gets worse over time; I never had under 600 points before.

@Michel: mesa-10.3.2 does indeed help. I already had minor lags/stutters in the past (pre-3.17), which was "normal" for me, but now I have ZERO lags. The complete benchmark ran without a single major lag. AWESOME :)

If anyone is interested I can also create videos, so you can see the differences :)
Created attachment 155911 [details]
picture

I've run a few more benchmarks and tried to take some screenshots. Unfortunately Valley doesn't include the GALLIUM_HUD when I take screenshots. As a workaround I've taken photos with my mobile. Hope that's OK :)

On a side note: I've got other kernel messages with the unchanged kernel. I don't know if it's relevant, but it looks like this:

[ 2915.586345] radeon 0000:01:00.0: GPU fault detected: 146 0x00139004
[ 2915.586350] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00004B00
[ 2915.586353] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x13090004
[ 2915.586357] VM fault (0x04, vmid 9) at page 19200, write from CB (144)
[ 2915.586362] radeon 0000:01:00.0: GPU fault detected: 146 0x00339004
[ 2915.586365] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00004B07
[ 2915.586367] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x13090004
[ 2915.586370] VM fault (0x04, vmid 9) at page 19207, write from CB (144)
[ 2915.586374] radeon 0000:01:00.0: GPU fault detected: 146 0x00539004
[ 2915.586377] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00004B02
[ 2915.586379] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x13090004
[ 2915.586382] VM fault (0x04, vmid 9) at page 19202, write from CB (144)
[ 2915.586386] radeon 0000:01:00.0: GPU fault detected: 146 0x00739004
. . .

I don't know if I would get them with the changed kernel as well, but with my stable kernel (3.16) I never see such messages.
Created attachment 155921 [details] another picture another picture
(In reply to Michael Mair-Keimberger from comment #14)
> Honestly, the difference is incredible. Can't believe such a small change
> has such a big impact.

Yeah, it's really weird. Looking at the change, the only way I could imagine it possibly having any negative impact would be if it somehow caused the indirect register access method to be used even when it's not necessary. But not sure how that could happen.

(In reply to Michael Mair-Keimberger from comment #15)
> Unfortunately vally doesn't include GALLIUM_HUD when i'm taking
> screenshot's. As a workaround i've made photos with my mobile. Hope that's
> ok :)

That's fine, but we also need to see the VRAM and GTT graphs.
Created attachment 156051 [details]
picture with VRAM and GTT usage

(In reply to Michel Dänzer from comment #17)
> (In reply to Michael Mair-Keimberger from comment #15)
> > Unfortunately vally doesn't include GALLIUM_HUD when i'm taking
> > screenshot's. As a workaround i've made photos with my mobile. Hope that's
> > ok :)
>
> That's fine, but we also need to see the VRAM and GTT graphs.

Sorry, I completely forgot about that.
Here is another picture with VRAM and GTT usage. I used `GALLIUM_HUD=fps,requested-VRAM+VRAM-usage,requested-GTT+GTT` to start the benchmark.
(In reply to Michael Mair-Keimberger from comment #18)
> Created attachment 156051 [details]
> picture with VRAM and GTT usage
>
> Sorry, completely forget about that..
> Another picture with VRAM and GTT usage. I've used
> `GALLIUM_HUD=fps,requested-VRAM+VRAM-usage,requested-GTT+GTT` to start the
> benchmark.

Should be ...requested-GTT+GTT-usage

I used to have similar issues with valley, but for my setup/card (R9270X) they are fixed with current mesa + drm-next-3.19-wip.

One thing I always do is set CPUs to performance in case cpufreq messes things up - may be worth a try to see if it helps.

What setting(s)/res do you run valley with?

It may be less hassle for you to use a phone, but FWIW the way I get screenshots that include the HUD is to use xwd. For something fullscreen, before starting valley I would run something like this from a different xterm/console/whatever:

sleep 100 && xwd -root -out whatever.xwd

then start valley and wait. To view "whatever.xwd" you can use xwud; to upload you could convert it to another "normal" format. You need some image program to do this. I have ImageMagick installed and can just type in a terminal:

convert whatever.xwd whatever.png
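
The steps described in this comment can be collected into a few shell helpers. This is only a sketch under assumptions: the names `HUD_SPEC`, `capture_after` and `set_performance_governor` are hypothetical, the sysfs path is the usual cpufreq location and writing to it needs root, and `xwd` and ImageMagick's `convert` have to be installed. Nothing runs when the block is sourced; it only defines the helpers.

```shell
# HUD graphs requested in this thread, using the corrected GTT-usage name.
HUD_SPEC="fps,requested-VRAM+VRAM-usage,requested-GTT+GTT-usage"

# Hypothetical helper: wait $1 seconds, dump the root window to $2.xwd,
# then convert the dump to $2.png for uploading.
capture_after() {
    sleep "$1" && xwd -root -out "$2.xwd" && convert "$2.xwd" "$2.png"
}

# Hypothetical helper: switch every CPU to the "performance" cpufreq
# governor. Must be run as root; assumes the standard sysfs layout.
set_performance_governor() {
    for gov in /sys/devices/system/cpu/cpu[0-9]*/cpufreq/scaling_governor; do
        echo performance > "$gov"
    done
}
```

A possible way to use it: run `capture_after 100 valley &` in one terminal, then start the benchmark from another with `GALLIUM_HUD="$HUD_SPEC"` set in its environment, so that the screenshot is taken while the HUD is on screen.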
Created attachment 156161 [details]
vally screenshot

(In reply to Andy Furniss from comment #19)
> Should be ...requested-GTT+GTT-usage
>
> I used to have similar issues with valley, but for my setup/card (R9270X)
> they are fixed with current mesa + drm-next-3.19-wip.
>
> One thing I always do is set CPUs to performance in case cpufreq messes
> things up - may be worth a try to see if it helps.
>
> What setting(s)/res do you run valley with?
>
> It may be less hassle for you to use a phone, but FWIW the way I get
> screenshots that include the HUD is to use xwd - for something fullscreen I
> would before starting valley from a different xterm/console/whatever do
> something like -
>
> sleep 100 && xwd -root -out whatever.xwd
>
> then start valley and wait. To view "whatever.xwd" you can use xwud,to
> upload you could convert to another "normal" format. You need some image
> program to do this - I have ImageMagick installed and can just type in a
> terminal -
>
> convert whatever.xwd whatever.png

It's fixed! (for me) - mesa git did the miracle :)
FYI - changing the CPUs to performance didn't have any influence.

I've made another screenshot, this time with xwd (thanks for the tip btw) and with GTT-usage (thanks for pointing that out; it was a copy-paste error). I don't know if it's still important, but I'll upload it anyway.

Settings for Valley are as follows:
Quality: Ultra
Stereo 3d: Disabled
Monitors: Single
Anti-aliasing: Off
Full Screen: Yes
Resolution: 2560x1600
Just to clarify: the screenshot was made with mesa-10.3.2 and the CPU frequency governor set to performance. With mesa git I got the following results:

FPS:      20.0
Score:    838
Min FPS:  7.4
Max FPS:  30.5

Pretty neat actually! I'm going to run another benchmark with my patched kernel (git revert 59bc1d8) and see whether it has an influence on performance :)