From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from mail.ispras.ru (mail.ispras.ru [83.149.199.84])
 by mx.groups.io with SMTP id smtpd.web12.64379.1597753299671470139
 for <devel@edk2.groups.io>;
 Tue, 18 Aug 2020 05:21:40 -0700
Authentication-Results: mx.groups.io;
 dkim=missing; spf=pass (domain: ispras.ru, ip: 83.149.199.84, mailfrom: cheptsov@ispras.ru)
Received: from [127.0.0.1] (unknown [10.10.2.240])
	by mail.ispras.ru (Postfix) with ESMTPSA id A4E4A40A204F;
	Tue, 18 Aug 2020 12:21:36 +0000 (UTC)
From: "Vitaly Cheptsov" <cheptsov@ispras.ru>
Message-Id: <35FA0F61-C718-455D-A63C-E1B0E16C2F0A@ispras.ru>
Mime-Version: 1.0 (Mac OS X Mail 13.4 \(3608.120.23.2.1\))
Subject: Re: [edk2-devel] [PATCH EDK2 v2 1/1] SecurityPkg/DxeImageVerificationLib:Enhanced verification of Offset
Date: Tue, 18 Aug 2020 15:21:36 +0300
In-Reply-To: <b400e1e4-f8e2-b6c9-6f1f-85b4bbcb920a@posteo.de>
Cc: "Yao, Jiewen" <jiewen.yao@intel.com>,
 "devel@edk2.groups.io" <devel@edk2.groups.io>,
 "xiewenyi2@huawei.com" <xiewenyi2@huawei.com>,
 "Wang, Jian J" <jian.j.wang@intel.com>,
 "huangming23@huawei.com" <huangming23@huawei.com>,
 "songdongkuang@huawei.com" <songdongkuang@huawei.com>,
 =?utf-8?Q?Marvin_H=C3=A4user?= <mhaeuser@posteo.de>
To: Laszlo Ersek <lersek@redhat.com>
References: <1597319741-59646-1-git-send-email-xiewenyi2@huawei.com>
 <1597319741-59646-2-git-send-email-xiewenyi2@huawei.com>
 <eb0c6bcb-77fb-2fb9-783e-aa5025953a80@redhat.com>
 <024b1279-609d-fefa-8535-5af072815bf8@huawei.com>
 <CY4PR11MB1288522EB5FD511B0BC76FC48C400@CY4PR11MB1288.namprd11.prod.outlook.com>
 <250fc485-8705-88b7-21c9-ecd28132934d@redhat.com>
 <CY4PR11MB128872EEEDAEDA229FDF21F78C5F0@CY4PR11MB1288.namprd11.prod.outlook.com>
 <56d0af8e-39ae-ae3f-1561-a532b697ba5d@redhat.com>
 <a7elBrHZ3zD0Stt3MiPOUU_6uOnp-LlR4c9weDhWm4xYH388XWK0M80fLZe_AqbzF68IFK_IdkWQtKN8HKyRnQ==@protonmail.internalid>
 <VlP38aPGSGAA5Zc9ATU2-1qIMVTA2IDZTUHUsJ-P78am3XHy6v6-6ptgyoT8_SsrIoeFPxpKpagIh0MXXkL_wA==@protonmail.conversationid>
 <b400e1e4-f8e2-b6c9-6f1f-85b4bbcb920a@posteo.de>
X-Mailer: Apple Mail (2.3608.120.23.2.1)
X-Groupsio-MsgNum: 64377
Content-Type: multipart/signed;
	boundary="Apple-Mail=_85F7952A-87F5-4C32-897E-FCB52AC8D93C";
	protocol="application/pgp-signature";
	micalg=pgp-sha256

--Apple-Mail=_85F7952A-87F5-4C32-897E-FCB52AC8D93C
Content-Type: multipart/alternative;
	boundary="Apple-Mail=_6C23E395-2BC8-4C27-BFBE-26EF85B61453"

--Apple-Mail=_6C23E395-2BC8-4C27-BFBE-26EF85B61453
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=utf-8

As a follow-up to Marvin=E2=80=99s e-mail and in the reference of the =
thread. I should add that to make the result more reliable we not only =
hope for the reviewer experience and testing, but also write formal =
proofs for several code properties. This is the primary reason the C =
code has not yet left the facility, despite being mostly written. If you =
feel interested, we use an opensource inhouse-written tool, =
AstraVer[1][2], which is an extension of Frama-C[3].

Best regards,
Vitaly

[1] https://arxiv.org/pdf/1809.00626.pdf =
<https://arxiv.org/pdf/1809.00626.pdf>
[2] https://www.isprasopen.ru/2018/docs/Volkov.pdf =
<https://www.isprasopen.ru/2018/docs/Volkov.pdf>
[3] https://frama-c.com <https://frama-c.com/>

> 18 =D0=B0=D0=B2=D0=B3. 2020 =D0=B3., =D0=B2 13:24, Marvin H=C3=A4user =
<mhaeuser@posteo.de> =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=D0=BB(=D0=B0):
>=20
>=20
> Good day everyone,
>=20
> First off, for your information, I'm sending from my new e-mail =
address
> from now on.
>=20
> Please excuse me, I cannot read your entire thread right now, I will
> definitely make sure to catch up as soon as time permits, but I just
> wanted to confirm we are indeed working on a reimplementation of the =
PE
> loader.
> It involves correcting several security issues (which will be detailed
> as the patches are sent as anything else would be too much work for us
> right now), reducing code duplication (how often is the hashing
> algorithm duplicated across edk2? :) ) and a more or less experimental
> approach to formal verification. We plan to submit it this year, =
however
> please note that this is a low priority project and is not being =
worked
> on on a full-time basis.
>=20
> Please let us know about your own plans so we do not end up =
duplicating
> work.
>=20
> Best regards,
> Marvin
>=20
> Am 18.08.2020 um 12:17 schrieb Laszlo Ersek:
>> Hi Jiewen,
>>=20
>> (+Marvin, +Vitaly)
>>=20
>> On 08/18/20 01:23, Yao, Jiewen wrote:
>>>> -----Original Message-----
>>>> From: devel@edk2.groups.io <devel@edk2.groups.io> On Behalf Of =
Laszlo
>>>> Ersek
>>>> Sent: Tuesday, August 18, 2020 12:53 AM
>>>> To: Yao, Jiewen <jiewen.yao@intel.com>; devel@edk2.groups.io;
>>>> xiewenyi2@huawei.com; Wang, Jian J <jian.j.wang@intel.com>
>>>> Cc: huangming23@huawei.com; songdongkuang@huawei.com
>>>> Subject: Re: [edk2-devel] [PATCH EDK2 v2 1/1]
>>>> SecurityPkg/DxeImageVerificationLib:Enhanced verification of Offset
>>=20
>> [...]
>>=20
>>> However, I do think the producer is mandatory for a fix or at least =
a
>>> security fix.
>>> The owner to fix the issue should guarantee the patch is good.
>>> The owner shall never rely on the code reviewer to figure out if the
>>> patch is good and complete.
>>>=20
>>> I have some bad experience that bug owner just wrote a patch and =
tried
>>> to fix a problem, without any test.
>>> And it happened passed code review from someone who does not well
>>> understand the problem, but give rb based upon the time pressure.
>>> Later, the fix was approved to be useless.
>>>=20
>>> In my memory, at least 3 cases were security fix. They are found, =
just
>>> because they are sensitive, more people took a look later.
>>>     It was simple. It was one-line change.
>>>    But it has not test, and it was wrong.
>>> "It was ridiculous" -- commented by the people who find the =
so-called
>>> security fix does not fix the issue.
>>=20
>> Just because sloppy/rushed reviews exist, and just because reviewers
>> operate under time pressure, we should not automatically reject =
security
>> fixes that come without a reproducer.
>>=20
>> Some organizations do develop reproducers, but they never share them
>> publicly (for fear of abuse by others).
>>=20
>> But more importantly, in an open development project, a developer =
could
>> have time and expertise to contribute a fix, but not to create a
>> reproducer.
>>=20
>> - If we make contributing harder, fewer people will upstream their
>>   fixes.
>>=20
>> - If we make contributing harder, then contributions that do make it =
to
>>   the tree will be of higher quality.
>>=20
>> Both statements ring true to me -- so it's a tradeoff.
>>=20
>> (By "we", I mean the edk2 community.)
>>=20
>>>> Additionally, the exact statement that the bug report does make,
>>>> namely
>>>>=20
>>>>   it's possible to overflow Offset back to 0 causing an endless =
loop
>>>>=20
>>>> is wrong (as far as I can tell anyway). It is not "OffSet" that can
>>>> be overflowed to zero, but the *addend* that is added to OffSet can
>>>> be overflowed to zero. Therefore the infinite loop will arise =
because
>>>> OffSet remains stuck at its present value, and not because OffSet
>>>> will be re-set to zero.
>>>>=20
>>>> For the reasons, we can only speculate as to what the actual =
problem
>>>> is, unless Jian decides to join the discussion and clarifies what =
he
>>>> had in mind originally.
>>>=20
>>> [Jiewen] Would you please clarify what do you mean "we" here?
>>> If "we" means the bug dispatcher, it is totally OK. The dispatcher
>>> just assign the bug.
>>> If "we" means the developer assigned to fix the bug, it is NOT OK. =
The
>>> developer should take the responsibility to understand the problem.
>>=20
>> By "we", I mean the edk2 community.
>>=20
>>>> We can write a patch based on code analysis. It's possible to
>>>> identify integer overflows based on code analysis, and it's =
possible
>>>> to verify the correctness of fixes by code review. Obviously =
testing
>>>> is always good, but many times, constructing reproducers for such
>>>> issues that were found by code review, is difficult and time
>>>> consuming. We can say that we don't fix vulnerabilities without
>>>> reproducers, or we can say that we make an effort to fix them even =
if
>>>> all we have is code analysis (and not a reproducer).
>>>=20
>>> [Jiewen] I would say: yes and no.
>>> Yes, I agree with you that it might be difficult and time consuming =
to
>>> construct the reproducer.
>>> However, "obviously" is a subject term. Someone may think something =
is
>>> obvious, but other people does not.
>>> We should be clear the responsibility of the patch provider is to
>>> provide high quality patch.
>>> Having basic unit test is the best way to prove that the fix is =
good.
>>>=20
>>> I have seen bad cases when I ask for the test for patch, then the
>>> answer I got is: "I test the windows boot".
>>> But the test - windows boot - has nothing related to the patch. It
>>> only proves no regression, but cannot prove the issue described is
>>> resolved.
>>=20
>> Right. It would be ideal if every patch came with a unit test. But =
that
>> also means some folks will contribute less.
>>=20
>> Consider normal (not security) patches. We require that all function
>> return values be checked (unless it really doesn't matter if a =
function
>> call fails). If a function call fails, we need to roll back the =
actions
>> taken thus far. Release resources and so on. This is why we have the
>> "cascade of error handling labels" pattern.
>>=20
>> But, of course, we don't test every possible error path in the code. =
So
>> what's the solution there:
>>=20
>> - reject such patches that carefully construct the error paths, but =
do
>>   not provide unit tests with complete error path coverage?
>>=20
>> - say that we don't care about thorough error paths, so let's just =
hang,
>>   or leak resources, whenever something fails?
>>=20
>> Personally I prefer the detailed error paths. They need to be written
>> and reviewed carefully. And they can be accepted even if they are not
>> tested with complete coverage.
>>=20
>> Some people think otherwise; they say no untested (untestable) code
>> should ever be merged.
>>=20
>> Back to security patches -- creating reproducers usually requires a
>> setup (tools, expertise, time allocation etc) that is different from =
a
>> "normal" setup. It may require specialized binary format editors,
>> expertise in "penetration testing", and so on.
>>=20
>> I mostly know the C language rules that pertain to integer and buffer
>> overflows, so I can usually spot their violations in C code, and =
propose
>> fixes for them too. But I'm not a security researcher, so I don't =
write
>> exploits as a norm -- I don't even investigate, generally speaking, =
the
>> potential practical impact of "undefined behavior". When there's a
>> buffer overflow or integer overflow in the code, that's the *end* of =
the
>> story for me, while it's the *start* of the work for a security
>> researcher.
>>=20
>> When you require reproducers for all security patches, you restrict =
the
>> potential contributor pool to security researchers.
>>=20
>>> Let's think again in this case, if the patch provider does some =
basic
>>> unit test, he/she may find out the problem by himself/herself.
>>> That can save other people's time to review.
>>>=20
>>> I don't prefer to move the responsibility from patch provider to the
>>> code reviewer to check if the fix is good.
>>> Otherwise, the code reviewer may be overwhelmed.
>>>=20
>>> We may clarify and document the role and responsibility in EDKII
>>> clearly. Once that is ready, we can follow the rule.
>>> Before that is ready, in this particular case, I still prefer we =
have
>>> producer to prove the patch is good.
>>=20
>> OK, thanks for explaining.
>>=20
>> Given that I'm unable to create such a PE file (from scratch or by
>> modifying another one), I won't post the patches stand-alone.
>>=20
>>>> So the above paragraph concerns "correctness". Regarding
>>>> "completeness", I guarantee you that this patch does not fix *all*
>>>> problems related to PE parsing. (See the other BZ tickets.) It does
>>>> fix *one* issue with PE parsing. We can say that we try to fix such
>>>> issues gradually (give different CVE numbers to different issues, =
and
>>>> address them one at a time), or we can say that we rewrite PE =
parsing
>>>> from the ground up. (BTW: I have seriously attempted that in the
>>>> past, and I gave up, because the PE format is FUBAR.)
>>>=20
>>> [Jiewen] Maybe there is misunderstanding.
>>> I do not mean to let patch provider to fix all issue in PE parsing.
>>> Just like we cannot file one Bugzilla to fix all issue in EDKII - it
>>> is unfair.
>>>=20
>>> What I mean is that the patch provider should guarantee the
>>> correctness and completeness of the issue in the bug report.
>>>=20
>>> One faked bad example of correctness is:
>>>     A bug report is file to say: the code has overflow class A.
>>>     The factor is: the code has overflow class A at line X and line =
Y.
>>>     The patch only modified some code at line X, but the overflow
>>>     class A at line X still exists.
>>>=20
>>> One faked bad example of completeness is:
>>>     A bug report is file to say: the code has overflow class A.
>>>     The factor is: the code has overflow class A at line X and line =
Y.
>>>     The patch only fixed the overflow class A at line X but not line
>>>     Y.
>>>=20
>>> The patch provider should take responsibility to do that work
>>> seriously to find out issue in line X and line Y and fix them.
>>> He/she may choose to just fix line X and line Y. Rewrite is whole
>>> module is NOT required.
>>=20
>> I agree completely.
>>=20
>> My point was that we need the bug report to be precise, in the first
>> place. If the bug report doesn't clearly identify lines X and Y, we =
will
>> likely not get the completeness part right.
>>=20
>> "Clearly identify" may mean spelling out lines X and Y specifically. =
Or
>> it may mean defining "class A" sufficiently clearly that someone else
>> reading the affected function can find X and Y themselves.
>>=20
>>> If I can give some comment, I would think about the provide the fix =
in
>>> BasePeCoffLib.
>>=20
>> =46rom a software design perspective, you are 100% right.
>>=20
>> Unfortunately, I can't do it.
>>=20
>> That's what I mentioned before -- I had tried rewriting =
BasePeCoffLib,
>> because in my opinion, BasePeCoffLib is unsalvageable in its current
>> form. And I gave up on the rewrite.
>>=20
>> Please see the following email. I sent it to some people off-list, on
>> 2020-Feb-14:
>>=20
>>> There are currently four (4) TianoCore security BZs (1957, 1990, =
1993,
>>> 2215), embargoed, that describe various ways in which cunningly
>>> crafted PE images can evade Secure Boot verification.
>>>=20
>>> [...]
>>>=20
>>> Primarily, I just couldn't find my peace with the idea that fixing
>>> such PE/COFF parsing mistakes (integer overflows, buffer overflows)
>>> *must* be a one-by-one fixing game. I wanted an approach that would
>>> fix these *classes* of vulnerabilities, in PE/COFF parsing.
>>>=20
>>> So, beginnning of this February I returned to this topic, and spent
>>> two days on prototyping / researching a container / interval based
>>> approach. Here's one of the commit messages, as a way of explaining:
>>>=20
>>>     OvmfPkg/DxePeCoffValidatorLib: introduce CONTAINER type and =
helper funcs
>>>=20
>>>     For validating the well-formedness of a PE/COFF file, introduce =
the
>>>     CONTAINER type, and some workhorse functions. (The functions =
added in this
>>>     patch will not be called directly from the code that will =
process PE/COFF
>>>     structures.)
>>>=20
>>>     The CONTAINER type describes a contiguous non-empty interval in =
a PE/COFF
>>>     file (on-disk representation, or in-memory representation). =
Containers can
>>>     be nested. The data from scalar-sized containers can be read =
out, as part
>>>     of their creation. For on-disk representations of PE/COFF files, =
scalar
>>>     reads are permitted; for in-memory representations, no data =
access is
>>>     permitted (only CONTAINER tracking / nesting).
>>>=20
>>>     The goals of CONTAINER are the following:
>>>=20
>>>     - enforce the proper nesting of PE/COFF structures (structure =
boundaries
>>>       must not be crossed by runs of data);
>>>=20
>>>     - prevent integer overflows and buffer overflows;
>>>=20
>>>     - prevent zero-size structures;
>>>=20
>>>     - prevent infinite nesting by requiring proper sub-intervals;
>>>=20
>>>     - prevent internal PE/COFF pointers from aliasing each other =
(unless they
>>>       point at container and containee structures);
>>>=20
>>>     - terminate nesting at scalar-sized containers;
>>>=20
>>>     - assuming an array of pointers is processed in increasing =
element order,
>>>       enforce that the pointed-to objects are located at increasing =
offsets
>>>       too;
>>>=20
>>>     - assign human-readable names to PE/COFF structures and fields, =
for
>>>       debugging PE/COFF malformations.
>>>=20
>>> Because, several of the vulnerabilities exploited cross-directed and
>>> aliased internal pointers in PE/COFF files.
>>>=20
>>> Two days of delirious spec reading and coding later, and 2000+ lines
>>> later, I decided that my idea was unviable. The PE/COFF spec was so
>>> incredibly mis-designed and crufty that enforcing the *internal*
>>> consistency of all the size fields and the internal pointers would
>>> inevitably fall into one of the following categories:
>>>=20
>>> - the checks wouldn't be strict enough, and some nasty images would
>>>   slip through,
>>>=20
>>> - the checks would be too strict, and some quirky, but valid, images
>>>   would be unjustifiedly caught.
>>>=20
>>> So I gave up and I've accepted that it remains a whack-a-mole game.
>>> [...]
>>>=20
>>> (NB: I don't claim that ELF is not similarly brain-damaged.)
>>=20
>> So now, I've only considered contributing patches for bug#2215 =
because
>> the code in question resides in DxeImageVerificationLib, and *not* in
>> BasePeCoffLib. I'm not going to touch BasePeCoffLib -- in my opinion,
>> BasePeCoffLib is unfixable without a complete rewrite.
>>=20
>> I would *like* if BasePeCoffLib were fixable incrementally, but I =
just
>> don't see how that's possible.
>>=20
>> In support of my opinion, please open the following bugzilla ticket:
>>=20
>>   https://bugzilla.tianocore.org/show_bug.cgi?id=3D2643
>>=20
>> and search the comments (with the browser's in-page search feature, =
such
>> as Ctrl+F) for the following expression:
>>=20
>>   new PE loader
>>=20
>> I understand exactly what Vitaly and Marvin meant in those comments. =
:(
>>=20
>> Thanks,
>> Laszlo
>>=20


--Apple-Mail=_6C23E395-2BC8-4C27-BFBE-26EF85B61453
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=utf-8

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html; =
charset=3Dutf-8"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; line-break: after-white-space;" class=3D"">As =
a follow-up to Marvin=E2=80=99s e-mail and in the reference of the =
thread. I should add that to make the result more reliable we not only =
hope for the reviewer experience and testing, but also write formal =
proofs for several code properties. This is the primary reason the C =
code has not yet left the facility, despite being mostly written. If you =
feel interested, we use an opensource inhouse-written tool, =
AstraVer[1][2], which is an extension of Frama-C[3].<div class=3D""><br =
class=3D""></div><div class=3D"">Best regards,</div><div =
class=3D"">Vitaly</div><div class=3D""><br class=3D""></div><div =
class=3D"">[1]&nbsp;<span style=3D"caret-color: rgb(0, 0, 0); color: =
rgb(0, 0, 0);" class=3D""><a href=3D"https://arxiv.org/pdf/1809.00626.pdf"=
 class=3D"">https://arxiv.org/pdf/1809.00626.pdf</a></span></div><div =
class=3D"">[2]&nbsp;<a =
href=3D"https://www.isprasopen.ru/2018/docs/Volkov.pdf" =
class=3D"">https://www.isprasopen.ru/2018/docs/Volkov.pdf</a></div><div =
class=3D"">[3]&nbsp;<a href=3D"https://frama-c.com" =
class=3D"">https://frama-c.com</a></div><div class=3D""><br =
class=3D""></div><div class=3D""><div class=3D""><div =
class=3D""><div><blockquote type=3D"cite" class=3D""><div class=3D"">18 =
=D0=B0=D0=B2=D0=B3. 2020 =D0=B3., =D0=B2 13:24, Marvin H=C3=A4user =
&lt;<a href=3D"mailto:mhaeuser@posteo.de" =
class=3D"">mhaeuser@posteo.de</a>&gt; =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=
=D0=BB(=D0=B0):</div><br class=3D"Apple-interchange-newline"><div =
class=3D""><div class=3D""><br class=3D"">Good day everyone,<br =
class=3D""><br class=3D"">First off, for your information, I'm sending =
from my new e-mail address<br class=3D"">from now on.<br class=3D""><br =
class=3D"">Please excuse me, I cannot read your entire thread right now, =
I will<br class=3D"">definitely make sure to catch up as soon as time =
permits, but I just<br class=3D"">wanted to confirm we are indeed =
working on a reimplementation of the PE<br class=3D"">loader.<br =
class=3D"">It involves correcting several security issues (which will be =
detailed<br class=3D"">as the patches are sent as anything else would be =
too much work for us<br class=3D"">right now), reducing code duplication =
(how often is the hashing<br class=3D"">algorithm duplicated across =
edk2? :) ) and a more or less experimental<br class=3D"">approach to =
formal verification. We plan to submit it this year, however<br =
class=3D"">please note that this is a low priority project and is not =
being worked<br class=3D"">on on a full-time basis.<br class=3D""><br =
class=3D"">Please let us know about your own plans so we do not end up =
duplicating<br class=3D"">work.<br class=3D""><br class=3D"">Best =
regards,<br class=3D"">Marvin<br class=3D""><br class=3D"">Am 18.08.2020 =
um 12:17 schrieb Laszlo Ersek:<br class=3D""><blockquote type=3D"cite" =
class=3D"">Hi Jiewen,<br class=3D""><br class=3D"">(+Marvin, +Vitaly)<br =
class=3D""><br class=3D"">On 08/18/20 01:23, Yao, Jiewen wrote:<br =
class=3D""><blockquote type=3D"cite" class=3D""><blockquote type=3D"cite" =
class=3D"">-----Original Message-----<br class=3D"">From: <a =
href=3D"mailto:devel@edk2.groups.io" class=3D"">devel@edk2.groups.io</a> =
&lt;<a href=3D"mailto:devel@edk2.groups.io" =
class=3D"">devel@edk2.groups.io</a>&gt; On Behalf Of Laszlo<br =
class=3D"">Ersek<br class=3D"">Sent: Tuesday, August 18, 2020 12:53 =
AM<br class=3D"">To: Yao, Jiewen &lt;<a =
href=3D"mailto:jiewen.yao@intel.com" =
class=3D"">jiewen.yao@intel.com</a>&gt;; <a =
href=3D"mailto:devel@edk2.groups.io" =
class=3D"">devel@edk2.groups.io</a>;<br class=3D""><a =
href=3D"mailto:xiewenyi2@huawei.com" class=3D"">xiewenyi2@huawei.com</a>; =
Wang, Jian J &lt;<a href=3D"mailto:jian.j.wang@intel.com" =
class=3D"">jian.j.wang@intel.com</a>&gt;<br class=3D"">Cc: <a =
href=3D"mailto:huangming23@huawei.com" =
class=3D"">huangming23@huawei.com</a>; <a =
href=3D"mailto:songdongkuang@huawei.com" =
class=3D"">songdongkuang@huawei.com</a><br class=3D"">Subject: Re: =
[edk2-devel] [PATCH EDK2 v2 1/1]<br =
class=3D"">SecurityPkg/DxeImageVerificationLib:Enhanced verification of =
Offset<br class=3D""></blockquote></blockquote><br class=3D"">[...]<br =
class=3D""><br class=3D""><blockquote type=3D"cite" class=3D"">However, =
I do think the producer is mandatory for a fix or at least a<br =
class=3D"">security fix.<br class=3D"">The owner to fix the issue should =
guarantee the patch is good.<br class=3D"">The owner shall never rely on =
the code reviewer to figure out if the<br class=3D"">patch is good and =
complete.<br class=3D""><br class=3D"">I have some bad experience that =
bug owner just wrote a patch and tried<br class=3D"">to fix a problem, =
without any test.<br class=3D"">And it happened passed code review from =
someone who does not well<br class=3D"">understand the problem, but give =
rb based upon the time pressure.<br class=3D"">Later, the fix was =
approved to be useless.<br class=3D""><br class=3D"">In my memory, at =
least 3 cases were security fix. They are found, just<br =
class=3D"">because they are sensitive, more people took a look later.<br =
class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;It was simple. It was one-line =
change.<br class=3D""> &nbsp;&nbsp;&nbsp;But it has not test, and it was =
wrong.<br class=3D"">"It was ridiculous" -- commented by the people who =
find the so-called<br class=3D"">security fix does not fix the issue.<br =
class=3D""></blockquote><br class=3D"">Just because sloppy/rushed =
reviews exist, and just because reviewers<br class=3D"">operate under =
time pressure, we should not automatically reject security<br =
class=3D"">fixes that come without a reproducer.<br class=3D""><br =
class=3D"">Some organizations do develop reproducers, but they never =
share them<br class=3D"">publicly (for fear of abuse by others).<br =
class=3D""><br class=3D"">But more importantly, in an open development =
project, a developer could<br class=3D"">have time and expertise to =
contribute a fix, but not to create a<br class=3D"">reproducer.<br =
class=3D""><br class=3D"">- If we make contributing harder, fewer people =
will upstream their<br class=3D""> &nbsp;&nbsp;fixes.<br class=3D""><br =
class=3D"">- If we make contributing harder, then contributions that do =
make it to<br class=3D""> &nbsp;&nbsp;the tree will be of higher =
quality.<br class=3D""><br class=3D"">Both statements ring true to me -- =
so it's a tradeoff.<br class=3D""><br class=3D"">(By "we", I mean the =
edk2 community.)<br class=3D""><br class=3D""><blockquote type=3D"cite" =
class=3D""><blockquote type=3D"cite" class=3D"">Additionally, the exact =
statement that the bug report does make,<br class=3D"">namely<br =
class=3D""><br class=3D""> &nbsp;&nbsp;it's possible to overflow Offset =
back to 0 causing an endless loop<br class=3D""><br class=3D"">is wrong =
(as far as I can tell anyway). It is not "OffSet" that can<br =
class=3D"">be overflowed to zero, but the *addend* that is added to =
OffSet can<br class=3D"">be overflowed to zero. Therefore the infinite =
loop will arise because<br class=3D"">OffSet remains stuck at its =
present value, and not because OffSet<br class=3D"">will be re-set to =
zero.<br class=3D""><br class=3D"">For the reasons, we can only =
speculate as to what the actual problem<br class=3D"">is, unless Jian =
decides to join the discussion and clarifies what he<br class=3D"">had =
in mind originally.<br class=3D""></blockquote><br class=3D"">[Jiewen] =
Would you please clarify what do you mean "we" here?<br class=3D"">If =
"we" means the bug dispatcher, it is totally OK. The dispatcher<br =
class=3D"">just assign the bug.<br class=3D"">If "we" means the =
developer assigned to fix the bug, it is NOT OK. The<br =
class=3D"">developer should take the responsibility to understand the =
problem.<br class=3D""></blockquote><br class=3D"">By "we", I mean the =
edk2 community.<br class=3D""><br class=3D""><blockquote type=3D"cite" =
class=3D""><blockquote type=3D"cite" class=3D"">We can write a patch =
based on code analysis. It's possible to<br class=3D"">identify integer =
overflows based on code analysis, and it's possible<br class=3D"">to =
verify the correctness of fixes by code review. Obviously testing<br =
class=3D"">is always good, but many times, constructing reproducers for =
such<br class=3D"">issues that were found by code review, is difficult =
and time<br class=3D"">consuming. We can say that we don't fix =
vulnerabilities without<br class=3D"">reproducers, or we can say that we =
make an effort to fix them even if<br class=3D"">all we have is code =
analysis (and not a reproducer).<br class=3D""></blockquote><br =
class=3D"">[Jiewen] I would say: yes and no.<br class=3D"">Yes, I agree =
with you that it might be difficult and time consuming to<br =
class=3D"">construct the reproducer.<br class=3D"">However, "obviously" =
is a subject term. Someone may think something is<br class=3D"">obvious, =
but other people does not.<br class=3D"">We should be clear the =
responsibility of the patch provider is to<br class=3D"">provide high =
quality patch.<br class=3D"">Having basic unit test is the best way to =
prove that the fix is good.<br class=3D""><br class=3D"">I have seen bad =
cases when I ask for the test for patch, then the<br class=3D"">answer I =
got is: "I test the windows boot".<br class=3D"">But the test - windows =
boot - has nothing related to the patch. It<br class=3D"">only proves no =
regression, but cannot prove the issue described is<br =
class=3D"">resolved.<br class=3D""></blockquote><br class=3D"">Right. It =
would be ideal if every patch came with a unit test. But that<br =
class=3D"">also means some folks will contribute less.<br class=3D""><br =
class=3D"">Consider normal (not security) patches. We require that all =
function<br class=3D"">return values be checked (unless it really =
doesn't matter if a function<br class=3D"">call fails). If a function =
call fails, we need to roll back the actions<br class=3D"">taken thus =
far. Release resources and so on. This is why we have the<br =
class=3D"">"cascade of error handling labels" pattern.<br class=3D""><br =
class=3D"">But, of course, we don't test every possible error path in =
the code. So<br class=3D"">what's the solution there:<br class=3D""><br =
class=3D"">- reject such patches that carefully construct the error =
paths, but do<br class=3D""> &nbsp;&nbsp;not provide unit tests with =
complete error path coverage?<br class=3D""><br class=3D"">- say that we =
don't care about thorough error paths, so let's just hang,<br class=3D""> =
&nbsp;&nbsp;or leak resources, whenever something fails?<br class=3D""><br=
 class=3D"">Personally I prefer the detailed error paths. They need to =
be written<br class=3D"">and reviewed carefully. And they can be =
accepted even if they are not<br class=3D"">tested with complete =
coverage.<br class=3D""><br class=3D"">Some people think otherwise; they =
say no untested (untestable) code<br class=3D"">should ever be =
merged.<br class=3D""><br class=3D"">Back to security patches -- =
creating reproducers usually requires a<br class=3D"">setup (tools, =
expertise, time allocation etc) that is different from a<br =
class=3D"">"normal" setup. It may require specialized binary format =
editors,<br class=3D"">expertise in "penetration testing", and so on.<br =
class=3D""><br class=3D"">I mostly know the C language rules that =
pertain to integer and buffer<br class=3D"">overflows, so I can usually =
spot their violations in C code, and propose<br class=3D"">fixes for =
them too. But I'm not a security researcher, so I don't write<br =
class=3D"">exploits as a norm -- I don't even investigate, generally =
speaking, the<br class=3D"">potential practical impact of "undefined =
behavior". When there's a<br class=3D"">buffer overflow or integer =
overflow in the code, that's the *end* of the<br class=3D"">story for =
me, while it's the *start* of the work for a security<br =
class=3D"">researcher.<br class=3D""><br class=3D"">When you require =
reproducers for all security patches, you restrict the<br =
class=3D"">potential contributor pool to security researchers.<br =
class=3D""><br class=3D""><blockquote type=3D"cite" class=3D"">Let's =
think again in this case, if the patch provider does some basic<br =
class=3D"">unit test, he/she may find out the problem by =
himself/herself.<br class=3D"">That can save other people's time to =
review.<br class=3D""><br class=3D"">I don't prefer to move the =
responsibility from patch provider to the<br class=3D"">code reviewer to =
check if the fix is good.<br class=3D"">Otherwise, the code reviewer may =
be overwhelmed.<br class=3D""><br class=3D"">We may clarify and document =
the role and responsibility in EDKII<br class=3D"">clearly. Once that is =
ready, we can follow the rule.<br class=3D"">Before that is ready, in =
this particular case, I still prefer we have<br class=3D"">producer to =
prove the patch is good.<br class=3D""></blockquote><br class=3D"">OK, =
thanks for explaining.<br class=3D""><br class=3D"">Given that I'm =
unable to create such a PE file (from scratch or by<br =
class=3D"">modifying another one), I won't post the patches =
stand-alone.<br class=3D""><br class=3D""><blockquote type=3D"cite" =
class=3D""><blockquote type=3D"cite" class=3D"">So the above paragraph =
concerns "correctness". Regarding<br class=3D"">"completeness", I =
guarantee you that this patch does not fix *all*<br class=3D"">problems =
related to PE parsing. (See the other BZ tickets.) It does<br =
class=3D"">fix *one* issue with PE parsing. We can say that we try to =
fix such<br class=3D"">issues gradually (give different CVE numbers to =
different issues, and<br class=3D"">address them one at a time), or we =
can say that we rewrite PE parsing<br class=3D"">from the ground up. =
(BTW: I have seriously attempted that in the<br class=3D"">past, and I =
gave up, because the PE format is FUBAR.)<br class=3D""></blockquote><br =
class=3D"">[Jiewen] Maybe there is misunderstanding.<br class=3D"">I do =
not mean to let patch provider to fix all issue in PE parsing.<br =
class=3D"">Just like we cannot file one Bugzilla to fix all issue in =
EDKII - it<br class=3D"">is unfair.<br class=3D""><br class=3D"">What I =
mean is that the patch provider should guarantee the<br =
class=3D"">correctness and completeness of the issue in the bug =
report.<br class=3D""><br class=3D"">One faked bad example of =
correctness is:<br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;A bug report is =
file to say: the code has overflow class A.<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;The factor is: the code has overflow class A at =
line X and line Y.<br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;The patch only =
modified some code at line X, but the overflow<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;class A at line X still exists.<br class=3D""><br =
class=3D"">One faked bad example of completeness is:<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;A bug report is file to say: the code has =
overflow class A.<br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;The factor is: =
the code has overflow class A at line X and line Y.<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;The patch only fixed the overflow class A at =
line X but not line<br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;Y.<br =
class=3D""><br class=3D"">The patch provider should take responsibility =
to do that work<br class=3D"">seriously to find out issue in line X and =
line Y and fix them.<br class=3D"">He/she may choose to just fix line X =
and line Y. Rewrite is whole<br class=3D"">module is NOT required.<br =
class=3D""></blockquote><br class=3D"">I agree completely.<br =
class=3D""><br class=3D"">My point was that we need the bug report to be =
precise, in the first<br class=3D"">place. If the bug report doesn't =
clearly identify lines X and Y, we will<br class=3D"">likely not get the =
completeness part right.<br class=3D""><br class=3D"">"Clearly identify" =
may mean spelling out lines X and Y specifically. Or<br class=3D"">it =
may mean defining "class A" sufficiently clearly that someone else<br =
class=3D"">reading the affected function can find X and Y themselves.<br =
class=3D""><br class=3D""><blockquote type=3D"cite" class=3D"">If I can =
give some comment, I would think about the provide the fix in<br =
class=3D"">BasePeCoffLib.<br class=3D""></blockquote><br class=3D""> =
=46rom a software design perspective, you are 100% right.<br =
class=3D""><br class=3D"">Unfortunately, I can't do it.<br class=3D""><br =
class=3D"">That's what I mentioned before -- I had tried rewriting =
BasePeCoffLib,<br class=3D"">because in my opinion, BasePeCoffLib is =
unsalvageable in its current<br class=3D"">form. And I gave up on the =
rewrite.<br class=3D""><br class=3D"">Please see the following email. I =
sent it to some people off-list, on<br class=3D"">2020-Feb-14:<br =
class=3D""><br class=3D""><blockquote type=3D"cite" class=3D"">There are =
currently four (4) TianoCore security BZs (1957, 1990, 1993,<br =
class=3D"">2215), embargoed, that describe various ways in which =
cunningly<br class=3D"">crafted PE images can evade Secure Boot =
verification.<br class=3D""><br class=3D"">[...]<br class=3D""><br =
class=3D"">Primarily, I just couldn't find my peace with the idea that =
fixing<br class=3D"">such PE/COFF parsing mistakes (integer overflows, =
buffer overflows)<br class=3D"">*must* be a one-by-one fixing game. I =
wanted an approach that would<br class=3D"">fix these *classes* of =
vulnerabilities, in PE/COFF parsing.<br class=3D""><br class=3D"">So, =
beginnning of this February I returned to this topic, and spent<br =
class=3D"">two days on prototyping / researching a container / interval =
based<br class=3D"">approach. Here's one of the commit messages, as a =
way of explaining:<br class=3D""><br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;OvmfPkg/DxePeCoffValidatorLib: introduce =
CONTAINER type and helper funcs<br class=3D""><br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;For validating the well-formedness of a PE/COFF =
file, introduce the<br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;CONTAINER =
type, and some workhorse functions. (The functions added in this<br =
class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;patch will not be called directly =
from the code that will process PE/COFF<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;structures.)<br class=3D""><br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;The CONTAINER type describes a contiguous =
non-empty interval in a PE/COFF<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;file (on-disk representation, or in-memory =
representation). Containers can<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;be nested. The data from scalar-sized containers =
can be read out, as part<br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;of their =
creation. For on-disk representations of PE/COFF files, scalar<br =
class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;reads are permitted; for in-memory =
representations, no data access is<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;permitted (only CONTAINER tracking / =
nesting).<br class=3D""><br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;The =
goals of CONTAINER are the following:<br class=3D""><br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;- enforce the proper nesting of PE/COFF =
structures (structure boundaries<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;must not be crossed by runs of =
data);<br class=3D""><br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;- prevent =
integer overflows and buffer overflows;<br class=3D""><br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;- prevent zero-size structures;<br class=3D""><br =
class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;- prevent infinite nesting by =
requiring proper sub-intervals;<br class=3D""><br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;- prevent internal PE/COFF pointers from =
aliasing each other (unless they<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;point at container and containee =
structures);<br class=3D""><br class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;- =
terminate nesting at scalar-sized containers;<br class=3D""><br =
class=3D""> &nbsp;&nbsp;&nbsp;&nbsp;- assuming an array of pointers is =
processed in increasing element order,<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;enforce that the pointed-to objects =
are located at increasing offsets<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;too;<br class=3D""><br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;- assign human-readable names to PE/COFF =
structures and fields, for<br class=3D""> =
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;debugging PE/COFF malformations.<br =
class=3D""><br class=3D"">Because, several of the vulnerabilities =
exploited cross-directed and<br class=3D"">aliased internal pointers in =
PE/COFF files.<br class=3D""><br class=3D"">Two days of delirious spec =
reading and coding later, and 2000+ lines<br class=3D"">later, I decided =
that my idea was unviable. The PE/COFF spec was so<br =
class=3D"">incredibly mis-designed and crufty that enforcing the =
*internal*<br class=3D"">consistency of all the size fields and the =
internal pointers would<br class=3D"">inevitably fall into one of the =
following categories:<br class=3D""><br class=3D"">- the checks wouldn't =
be strict enough, and some nasty images would<br class=3D""> =
&nbsp;&nbsp;slip through,<br class=3D""><br class=3D"">- the checks =
would be too strict, and some quirky, but valid, images<br class=3D""> =
&nbsp;&nbsp;would be unjustifiedly caught.<br class=3D""><br class=3D"">So=
 I gave up and I've accepted that it remains a whack-a-mole game.<br =
class=3D"">[...]<br class=3D""><br class=3D"">(NB: I don't claim that =
ELF is not similarly brain-damaged.)<br class=3D""></blockquote><br =
class=3D"">So now, I've only considered contributing patches for =
bug#2215 because<br class=3D"">the code in question resides in =
DxeImageVerificationLib, and *not* in<br class=3D"">BasePeCoffLib. I'm =
not going to touch BasePeCoffLib -- in my opinion,<br =
class=3D"">BasePeCoffLib is unfixable without a complete rewrite.<br =
class=3D""><br class=3D"">I would *like* if BasePeCoffLib were fixable =
incrementally, but I just<br class=3D"">don't see how that's =
possible.<br class=3D""><br class=3D"">In support of my opinion, please =
open the following bugzilla ticket:<br class=3D""><br class=3D""> =
&nbsp;&nbsp;<a =
href=3D"https://bugzilla.tianocore.org/show_bug.cgi?id=3D2643" =
class=3D"">https://bugzilla.tianocore.org/show_bug.cgi?id=3D2643</a><br =
class=3D""><br class=3D"">and search the comments (with the browser's =
in-page search feature, such<br class=3D"">as Ctrl+F) for the following =
expression:<br class=3D""><br class=3D""> &nbsp;&nbsp;new PE loader<br =
class=3D""><br class=3D"">I understand exactly what Vitaly and Marvin =
meant in those comments. :(<br class=3D""><br class=3D"">Thanks,<br =
class=3D"">Laszlo<br class=3D""><br =
class=3D""></blockquote></div></div></blockquote></div><br =
class=3D""></div></div></div></body></html>=

--Apple-Mail=_6C23E395-2BC8-4C27-BFBE-26EF85B61453--

--Apple-Mail=_85F7952A-87F5-4C32-897E-FCB52AC8D93C
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment;
	filename=signature.asc
Content-Type: application/pgp-signature;
	name=signature.asc
Content-Description: Message signed with OpenPGP

-----BEGIN PGP SIGNATURE-----

iQIzBAEBCAAdFiEEsLABAI5Y5VbvBdmpL8K2O86Eyz4FAl87x9AACgkQL8K2O86E
yz4fDw//Yq8Ry0FQOCgUP+aasFNIEGgTP/C1/DZzGzBxS4SYDj0DUZHbbj9yz45m
GIbmAhNKXF6DEGFzEGLTqwt2WZfyEI7NnG4q8wyruf8a97cB2n9oIZhU6kVtraG8
YMk4acuL6FOyarWZjxjzwNSNASv5NUR/bLP0Yt654aEcVXHWFrWFH8xJXZHA4u13
CbSzr3EqNa/m5WmlhB012y7ROwCzlhXFx5QOKa65vx9IrZRX0LJfvJP+xisuhUxO
B0yqZayiLPirBLLQmlb5+UPGBUY/YZpbEnCMjdsP9tLCxsDFzQm+y4VIGGN9Z4l+
fI7ew3g726mxJJ0YKiu3x+uKvUZvnzc71RzF4zJYAk7DwoKYyyotmAsOegzubh8H
Kxe3vEgxoxtLFYkJsCy/d7yY2QtxxlTC06INDiDMYrwFKHi8ziyHtlqyIOAp3DaF
z45Je4S1Pqwlq2EeAzV6Slh3sSRgJ87HnvjnGWEzB0IvaqTwR9uU1Qd+jHvKVRwh
sPjjuxiKNQrOY0xWOhwHyGlkH9WiC2SuizG3NEJA7B3reDmgtUBmqFXMRGXf+XQ/
zru06tmPtef6d+6OGg74YS/oD7ZMi5Qmns/k5KwcmPkHuD+JWqf0zEYqM/LZeU4O
llBgoqzGa8bWg0ru8n7RPphRO6WMYUOfUq5bDr6y4aHmm2Tr8WY=
=E9YT
-----END PGP SIGNATURE-----

--Apple-Mail=_85F7952A-87F5-4C32-897E-FCB52AC8D93C--