Bug 217713
Summary: | Encoding issues with --auto-to-cc | |
---|---|---|---
Product: | Tools | Reporter: | Bugbot (bugbot)
Component: | Infra | Assignee: | Konstantin Ryabitsev (konstantin)
Status: | RESOLVED CODE_FIX | |
Severity: | normal | |
Priority: | P3 | |
Hardware: | All | |
OS: | Linux | |
Kernel Version: | | Subsystem: |
Regression: | No | Bisected commit-id: |
Description
Bugbot
2023-07-26 19:54:15 UTC
Konstantin Ryabitsev <konstantin@linuxfoundation.org> writes:

On Mon, Jul 24, 2023 at 12:49:41PM +0200, Duje Mihanović wrote:
> Steps to reproduce:
> - Checkout Linux 6.5-rc2
> - Run `b4 prep -F "<20230721210042.21535-1-duje.mihanovic@skole.hr>" -n
>   <any branch name>`
> - Run `b4 prep -c`

Thank you for that -- I can verify that it's happening.

bugbot assign to me

-K

(via https://msgid.link/20230726-hula-wad-c9241b@meerkat)

Konstantin Ryabitsev <konstantin@linuxfoundation.org> writes:

On Mon, Jul 24, 2023 at 12:49:41PM +0200, Duje Mihanović wrote:
> I decided to try using b4 to submit a patchset for adding Marvell PXA1908 ARM
> SoC support. Having enrolled an existing branch, I ran `b4 prep -c` and got
> the following error (this is with the -d switch added):

So, there's apparently something very interesting about that final ć in your
name that trips up get_maintainer.pl. For example, run the following:

    $ ./scripts/get_maintainer.pl -f Documentation/devicetree/bindings/clock/marvell,pxa1908.yaml

You will get back a byte sequence \x87 where your name should be:

    "<87>" <duje.mihanovic@skole.hr> (in file)

This is because ć is 0xC4 0x87, but I have no idea why get_maintainer.pl trips
up and splits the unicode sequence into two bytes. It seems to want to do that
for anything above base extended ascii (Latin-A).

I can "fix" this in b4 by forcing it to ignore any unrecognized unicode errors
in get_maintainer.pl output, but it's not a real fix for the underlying
problem.

-K

(via https://msgid.link/20230726-gush-slouching-a5cd41@meerkat)

Konstantin Ryabitsev writes in commit 034f2fb2ac27c89c1c7ab2af04d26ba63be9ea6c:

ez: ignore invalid unicode returned by get_maintainer

There's a bug in get_maintainer.pl that returns invalid unicode in
certain situations (see bug linked below). We can't fix this in b4, but
at least we can avoid crashing when we encounter this problem.

Reported-by: Duje Mihanović <duje.mihanovic@skole.hr>
Link: https://msgid.link/1940519.PYKUYFuaPT@radijator
Closes: https://bugzilla.kernel.org/show_bug.cgi?id=217713
Signed-off-by: Konstantin Ryabitsev <konstantin@linuxfoundation.org>

(via https://git.kernel.org/pub/scm/utils/b4/b4.git/commit/?id=034f2fb2ac27)
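
For readers who want to see the failure mode in isolation, here is a minimal Python sketch. It is not the code from the b4 commit; it only assumes that the workaround amounts to decoding the get_maintainer.pl output as UTF-8 while dropping bytes that cannot be decoded. The helper names `decode_lenient` and `decode_strict_fails` are hypothetical, and `RAW` is a hand-built byte string in the shape described in this report, not captured tool output.

```python
# Hand-built sample in the shape described in this report: the lone
# continuation byte 0x87 sits where the name should be.
RAW = b'"\x87" <duje.mihanovic@skole.hr> (in file)\n'


def decode_strict_fails(raw: bytes) -> bool:
    """Return True if a strict UTF-8 decode of raw raises an error."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return True
    return False


def decode_lenient(raw: bytes) -> str:
    """Hypothetical helper: decode as UTF-8, silently dropping bad bytes."""
    return raw.decode("utf-8", errors="ignore")


if __name__ == "__main__":
    # The full UTF-8 encoding of the final letter in the reporter's name.
    print("ć".encode("utf-8"))
    # A strict decode rejects the orphaned 0x87 byte.
    print(decode_strict_fails(RAW))
    # A lenient decode keeps the address and loses only the broken name.
    print(repr(decode_lenient(RAW)))
```

As the comments above note, this only avoids the crash: the name is still lost, because the leading 0xC4 byte never reaches the decoder.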