public inbox for devel@edk2.groups.io
From: "Laszlo Ersek" <lersek@redhat.com>
To: devel@edk2.groups.io, philmd@redhat.com
Cc: Zhiju Fan <zhijux.fan@intel.com>, Bob Feng <bob.c.feng@intel.com>,
	Liming Gao <liming.gao@intel.com>
Subject: Re: [edk2-devel] [RFC PATCH] BaseTools: Fix Python3 encoding issue in TestTools
Date: Thu, 5 Dec 2019 21:09:01 +0100
Message-ID: <6d91d3ff-91c2-4d2f-1937-6ba09ca0c2af@redhat.com>
In-Reply-To: <828306f1-755e-da8a-96f2-af85828e56a4@redhat.com>

On 12/05/19 19:36, Philippe Mathieu-Daudé wrote:
> On 12/4/19 10:38 PM, Philippe Mathieu-Daude wrote:
>> Under CentOS 7.7 we get:
>>
>>    Build environment:
>> Linux-3.10.0-1062.7.1.el7.x86_64-x86_64-with-centos-7.7.1908-Core
>>    [...]
>>    ======================================================================
>>    ERROR: testRandomDataCycles (TianoCompress.Tests)
>>    ----------------------------------------------------------------------
>>    Traceback (most recent call last):
>>      File "edk2/BaseTools/Tests/TianoCompress.py", line 60, in
>> testRandomDataCycles
>>        self.compressionTestCycle(data)
>>      File "edk2/BaseTools/Tests/TianoCompress.py", line 46, in
>> compressionTestCycle
>>        start = self.ReadTmpFile('input')
>>      File "edk2/BaseTools/Tests/TestTools.py", line 139, in ReadTmpFile
>>        data = f.read()
>>      File "/usr/lib64/python3.6/encodings/ascii.py", line 26, in decode
>>        return codecs.ascii_decode(input, self.errors)[0]
>>    UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in
>> position 3: ordinal not in range(128)
>>
>>    ----------------------------------------------------------------------
>>
>> Fix by specifying the UTF-8 encoding.
>>
>> Cc: Bob Feng <bob.c.feng@intel.com>
>> Cc: Liming Gao <liming.gao@intel.com>
>> Signed-off-by: Philippe Mathieu-Daude <philmd@redhat.com>
>> ---
>> RFC because I'm not sure this is the best way to fix this, but
>> this is similar to commit 31e3eeb5e3d2d.
>> ---
>>   BaseTools/Tests/TestTools.py | 2 +-
>>   1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> diff --git a/BaseTools/Tests/TestTools.py b/BaseTools/Tests/TestTools.py
>> index 1099fd4eeaea..41cdb28b0c8c 100644
>> --- a/BaseTools/Tests/TestTools.py
>> +++ b/BaseTools/Tests/TestTools.py
>> @@ -135,7 +135,7 @@ class BaseToolsTest(unittest.TestCase):
>>           return open(os.path.join(self.testDir, fileName), mode)
>>
>>       def ReadTmpFile(self, fileName):
>> -        f = open(self.GetTmpFilePath(fileName), 'r')
>> +        f = codecs.open(self.GetTmpFilePath(fileName), 'r', encoding='utf-8')
>>           data = f.read()
>>           f.close()
>>           return data
>>
> 
> While this fixes Python3, it also breaks Python2 :)
> 
> ======================================================================
> ERROR: testRandomDataCycles (TianoCompress.Tests)
> ----------------------------------------------------------------------
> Traceback (most recent call last):
>   File "edk2/BaseTools/Tests/TianoCompress.py", line 60, in
> testRandomDataCycles
>     self.compressionTestCycle(data)
>   File "edk2/BaseTools/Tests/TianoCompress.py", line 46, in
> compressionTestCycle
>     start = self.ReadTmpFile('input')
>   File "edk2/BaseTools/Tests/TestTools.py", line 139, in ReadTmpFile
>     data = f.read()
>   File "/usr/lib/python2.7/codecs.py", line 688, in read
>     return self.reader.read(size)
>   File "/usr/lib/python2.7/codecs.py", line 494, in read
>     newchars, decodedbytes = self.decode(data, self.errors)
> UnicodeDecodeError: 'utf8' codec can't decode byte 0x85 in position 0:
> invalid start byte
> 
> This old thread recommends using io.open:
> https://web.archive.org/web/20180715024113/https://mail.python.org/pipermail/python-list/2015-March/687124.html
> 
> 
> And it works with both Python 2 and 3, so I'll respin.
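
A minimal sketch of the two decode failures quoted above, assuming the
temporary file holds arbitrary bytes (the test name testRandomDataCycles
suggests so, but that is an assumption). The sample byte strings and the
try_decode helper are hypothetical, chosen only to match the byte values
and positions reported in the tracebacks; this is not the BaseTools code.

```python
def try_decode(raw, encoding):
    """Return the decoded text, or the UnicodeDecodeError message."""
    try:
        return raw.decode(encoding)
    except UnicodeDecodeError as exc:
        return str(exc)


# Hypothetical data: 0xc3 at offset 3, as in the Python3 traceback.
ascii_sample = b"abc\xc3\xa9"
# Hypothetical data: 0x85 at offset 0, as in the Python2 traceback.
utf8_sample = b"\x85abc"

ascii_error = try_decode(ascii_sample, "ascii")
utf8_error = try_decode(utf8_sample, "utf-8")

print(ascii_error)
print(utf8_error)
```

Neither codec can succeed on data like this, which is probably why
choosing a different text encoding only moves the failure around.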

I didn't ask before (because, "commit 31e3eeb5e3d2d must have been
right, right?"), but now I can't resist anymore:

*why* do we have any such character in a *temporary* file's pathname
that is not pure ASCII? It seems wrong.

Thanks
Laszlo
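
Both tracebacks quoted above fail inside f.read() rather than in open(),
which hints that the non-ASCII bytes come from the file's contents and not
from its pathname. A minimal sketch under that assumption: io.open in
binary mode returns the bytes unchanged even though the file name is pure
ASCII. write_tmp, read_tmp and the payload are hypothetical and are not the
BaseTools helpers or the respun patch.

```python
import io
import os
import tempfile


def write_tmp(path, data):
    # Binary mode: the bytes are written as-is, with no encoding step.
    with io.open(path, "wb") as f:
        f.write(data)


def read_tmp(path):
    # Binary mode: read() returns bytes, so no codec is involved.
    with io.open(path, "rb") as f:
        return f.read()


tmp_dir = tempfile.mkdtemp()
path = os.path.join(tmp_dir, "input")  # the file name itself is pure ASCII
payload = b"\x85\x00abc\xc3\xff"        # hypothetical non-text contents

write_tmp(path, payload)
round_trip = read_tmp(path)

# Clean up the temporary file and directory.
os.remove(path)
os.rmdir(tmp_dir)

print(round_trip == payload)
```

Whether binary mode suits every caller of ReadTmpFile is not something this
thread establishes; the sketch only shows that the decode step can be
avoided.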


Thread overview: 3+ messages
2019-12-04 21:38 [RFC PATCH] BaseTools: Fix Python3 encoding issue in TestTools Philippe Mathieu-Daudé
2019-12-05 18:36 ` Philippe Mathieu-Daudé
2019-12-05 20:09   ` Laszlo Ersek [this message]
