public inbox for devel@edk2.groups.io
 help / color / mirror / Atom feed
From: Ard Biesheuvel <ard.biesheuvel@linaro.org>
To: Laszlo Ersek <lersek@redhat.com>
Cc: Leif Lindholm <leif.lindholm@linaro.org>,
	 "edk2-devel@lists.01.org" <edk2-devel@lists.01.org>,
	"Gao, Liming" <liming.gao@intel.com>
Subject: Re: [PATCH v2] ArmPlatformPkg/PrePeiCore: seed temporary stack before entering PEI core
Date: Tue, 7 Nov 2017 18:13:31 +0000	[thread overview]
Message-ID: <CAKv+Gu_BtTsuhXVZzx3K94dZYhhOe_1Aktu7BPkFMXKehyb9Rg@mail.gmail.com> (raw)
In-Reply-To: <f2f47a1e-90a0-3899-0d40-7df839cdcfaa@redhat.com>

On 7 November 2017 at 18:09, Laszlo Ersek <lersek@redhat.com> wrote:
> On 11/05/17 17:29, Ard Biesheuvel wrote:
>> On 5 November 2017 at 16:27, Ard Biesheuvel <ard.biesheuvel@linaro.org> wrote:
>>> On 5 November 2017 at 05:52, Leif Lindholm <leif.lindholm@linaro.org> wrote:
>>>> On Fri, Nov 03, 2017 at 11:33:52AM +0000, Ard Biesheuvel wrote:
>>>>> DEBUG builds of PEI code will print a diagnostic message regarding
>>>>> the utilization of temporary RAM before switching to permanent RAM.
>>>>> For example,
>>>>>
>>>>>   Total temporary memory:    16352 bytes.
>>>>>     temporary memory stack ever used:       4820 bytes.
>>>>>     temporary memory heap used for HobList: 4720 bytes.
>>>>>
>>>>> Tracking stack utilization like this requires the stack to be seeded
>>>>> with a known magic value, and this needs to occur before entering C
>>>>> code, given that it uses the stack. Currently, only Nt32Pkg appears
>>>>> to implement this feature, but it is useful nonetheless, so let's
>>>>> wire it up for PrePeiCore as well.
>>>>>
>>>>> Ref: https://bugzilla.tianocore.org/show_bug.cgi?id=748
>>>>> Contributed-under: TianoCore Contribution Agreement 1.1
>>>>> Signed-off-by: Ard Biesheuvel <ard.biesheuvel@linaro.org>
>>>>
>>>> OK, this may sound completely unreasonable, but seeing those
>>>> implementations overwrite callee-saved registers without saving them
>>>> makes my brain unhappy. (Yes, I know.)
>>>>
>>>> Could they either:
>>>> - Have a comment prepended establishing the implicit ABI of which
>>>>   registers the caller cannot rely on reusing after return.
>>>>   Preferably somewhat echoed at the call site.
>>>> - Be rewritten to use only scratch registers?
>>>>
>>>
>>> I think it is implied that the startup code does not adhere to the
>>> AAPCS. That code already uses r5 and r6 without stacking them, simply
>>> because we're in the middle of preparing the stack and other execution
>>> context, precisely so the C code we call into can rely on AAPCS
>>> guarantees.
>>
>>
>> Ehm, hold on, what do you mean by 'call site'? This code just runs and
>> jumps back to a local label. There are no functions calls here until
>> the point where we call into C (with the exception of the lovely
>> ArmPlatformPeiBootAction() we added so Juno can find out how much DRAM
>> it can use)
>
> Please continue the discussion with Leif on this; from my side, I'm
> happy with the patch (I've sort of deduced what the assembly code does,
> also relying on your v1 notes).
>
> The only eyebrow-raising part was:
>
> +  MOV64 (x9, FixedPcdGet32 (PcdInitValueInTempStack) |\
> +             FixedPcdGet32 (PcdInitValueInTempStack) << 32)
>
> where we left-shift a constant that is "in theory" UINT32 by 32 binary
> places, using the << operator. In C that would be undefined behavior,
> but this is assembly, so what do I know? ¯\_(ツ)_/¯
>
> Acked-by: Laszlo Ersek <lersek@redhat.com>
>

Thanks. And you're right, this is not C so no need to worry about that.

> (
>
> By the way, just to see if I remember correctly, isn't STP:
>
> +0:stp   x9, x9, [x8], #16
>
> the kind of instruction that modifies multiple operands at once, and so
> if it faults, it cannot be virtualized well? (Because the syndrome
> register or whatever does not tell the VMM the whole picture about the
> fault?)
>
> Totally irrelevant here, I'm just curious.
>

STP == STore Pair, and so it stores the values in the registers to
memory. The only register that gets modified here is x8, due to the
post-increment.

But its converse

LDP  <reg>, <reg>, [<reg>], #<const>

is indeed such an instruction, given that it modifies three registers
at once, and so the registers that encode the exception run out of
space. Note that this only affects virtualized MMIO.


  reply	other threads:[~2017-11-07 18:09 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-11-03 11:33 [PATCH v2] ArmPlatformPkg/PrePeiCore: seed temporary stack before entering PEI core Ard Biesheuvel
2017-11-05  5:52 ` Leif Lindholm
2017-11-05 16:27   ` Ard Biesheuvel
2017-11-05 16:29     ` Ard Biesheuvel
2017-11-07 18:09       ` Laszlo Ersek
2017-11-07 18:13         ` Ard Biesheuvel [this message]
2017-11-09 21:11           ` Ard Biesheuvel
2017-11-10  9:29             ` Laszlo Ersek
2017-11-10 11:01               ` Ard Biesheuvel
2017-11-08 16:12       ` Leif Lindholm
2017-11-09 21:09         ` Ard Biesheuvel
2017-11-06  4:25 ` Gao, Liming

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-list from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CAKv+Gu_BtTsuhXVZzx3K94dZYhhOe_1Aktu7BPkFMXKehyb9Rg@mail.gmail.com \
    --to=devel@edk2.groups.io \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox