The code in both 32 and 64 bit versions of datapage.S for vdso use a bcl+8 to capture the instruction address in the link register. On recent powerpc processors (i.e Power8) you will corrupt the link stack on-chip if you do a branch and link and then do not return. bcl+4 is a special case and is ok. This code causes subsequent mispredicts of blr because the link stack is no longer correct.
Thanks Aaron. I've posted a patch to address this here: http://patchwork.ozlabs.org/patch/521590/
Fixed in: c974809a26a1 ("powerpc/vdso: Avoid link stack corruption in __get_datapage()") https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=c974809a26a13e40254dbe3cf46f49aa32acca11