public inbox for devel@edk2.groups.io
 help / color / mirror / Atom feed
From: "Laszlo Ersek" <lersek@redhat.com>
To: Ard Biesheuvel <ardb@kernel.org>, devel@edk2.groups.io
Cc: Evgeny Iakovlev <eiakovlev@linux.microsoft.com>,
	kraxel@redhat.com, rfc@edk2.groups.io, jiewen.yao@intel.com
Subject: Re: [edk2-devel] [edk2][RFC] OvmfPkg/AcpiPlatformDxe: patch FADT PSCI bits if FDT advertises it
Date: Thu, 7 Sep 2023 22:07:18 +0200	[thread overview]
Message-ID: <15802025-a8ea-0399-6bfb-c560e3750c64@redhat.com> (raw)
In-Reply-To: <CAMj1kXFJNzMCh0aibsFiaJaK83JLNWu71Yc8JJe36D_Uxrm0rQ@mail.gmail.com>

On 9/7/23 17:17, Ard Biesheuvel wrote:
> On Thu, 7 Sept 2023 at 16:51, Laszlo Ersek <lersek@redhat.com> wrote:
>>
>> On 9/7/23 16:27, Ard Biesheuvel wrote:
>>> On Mon, 16 Jan 2023 at 12:39, Evgeny Iakovlev
>>> <eiakovlev@linux.microsoft.com> wrote:
>>>>
>>>> EL3 firmware may implement PSCI interface on aarch64 platforms,
>>>> including qemu-tcg-aarch64. However, EL3 firmware does not usually own
>>>> pulling and deploying ACPI tables from qemu fw_cfg. Thus the only way
>>>> EL3 can advertise PSCI on qemu is in FDT. One such EL3 fw is ARM trusted
>>>> firmware. Qemu itself also won't advertise PSCI in either FDT or ACPI if
>>>> EL3 firmware is present.
>>>>
>>>> PSCI can be advertised in both FDT and ACPI, and Hyper-V/NT kernel
>>>> expect to see all information published in ACPI. To better support
>>>> running Hyper-V/NT on qemu-tcg-aarch64 with EDK2 as UEFI implementation
>>>> and ARM trusted firmware as EL3 PSCI implementation we can patch in PSCI
>>>> bits in ACPI FADT when pulling tables from fw_cfg if PSCI node is
>>>> advertised in FDT. EDK2 owns ACPI table publishing and is also aware of
>>>> FDT on arm, so it is ideally poised to handle this.
>>>>
>>>> This change illustrates how it could potentially be done. I am looking
>>>> for comments on overall validity of the idea to patch FADT and whether
>>>> or not this particular approach of handling it in AcpiPlatformDxe is the
>>>> way to do it or maybe it is better to handle it via
>>>> gQemuAcpiTableNotifyProtocolGuid somehow.
>>>>
>>>> Signed-off-by: Evgeny Iakovlev <eiakovlev@linux.microsoft.com>
>>>
>>> Thanks for the patch, and apologies for the lack of response.
>>>
>>> First of all, I suspect this patch breaks non-ARM users of this
>>> driver, so the patch is problematic as is. (It makes
>>> gFdtClientProtocolGuid mandatory, right?)
>>>
>>> Then, I'd like to hear from other folks on cc what they think about
>>> this. Perhaps it is simply a matter of tweaking QEMU so it exposes the
>>> correct PSCI setting in the FADT when it emulates secure world.
>>> Patching it like this feels like a last resort to me, rather than a
>>> well designed interface.
>>
>> Thanks for the CC; both of your concerns are valid.
>>
>> The FDT client proto GUID has no reason to exist in (e.g.) an X64 OVMF
>> build.
>>
>> Second, and more importantly, this is a total layering violation for
>> AcpiPlatformDxe. QEMU is the single source of truth for AcpiPlatformDxe,
>> and AcpiPlatformDxe must remain as blind as possible to the actual ACPI
>> content.
>>
>> In the situation described by the commit message, the ACPI content
>> exposed by QEMU is simply invalid. That's what should be fixed in QEMU
>> (and not papered over in edk2). Something somewhere is responsible for
>> setting the property value in question to "hvc"; that something
>> precisely is responsible (directly or indirectly) for making QEMU expose
>> the proper FADT.
>>
>> I've now grepped the QEMU source tree for '"hvc"'; the relevant hit
>> seems to be in "hw/arm/boot.c", function fdt_add_psci_node(), under case
>> label QEMU_PSCI_CONDUIT_HVC. So, whatever sets psci-conduit to
>> QEMU_PSCI_CONDUIT_HVC should also make sure the FADT matches it.
>>
>> Taking one step back, in "hw/arm/virt.c" we have:
>>
>>     if (vms->secure && firmware_loaded) {
>>         vms->psci_conduit = QEMU_PSCI_CONDUIT_DISABLED;
>>     } else if (vms->virt) {
>>         vms->psci_conduit = QEMU_PSCI_CONDUIT_SMC;
>>     } else {
>>         vms->psci_conduit = QEMU_PSCI_CONDUIT_HVC;
>>     }
>>
> 
> The problem here is that QEMU does not know whether the EL3 firmware
> running in the guest implements PSCI or not.
> 
>> So I figure the ACPI generator should be steered off the same information.
>>
>> BTW... I see the following in "hw/arm/virt-acpi-build.c", function
>> build_fadt_rev6():
>>
>>     case QEMU_PSCI_CONDUIT_HVC:
>>         fadt.arm_boot_arch = ACPI_FADT_ARM_PSCI_COMPLIANT |
>>                              ACPI_FADT_ARM_PSCI_USE_HVC;
>>         break;
>>
>> That dates back minimally as far as commit 79e993a0a804
>> ("hw/arm/virt-acpi-build: use SMC if booting in EL2", 2017-01-20).
>>
>> So why is it not taking effect? Patching edk2 should not be necessary at
>> all, QEMU should already be doing the right thing.
>>
>> The commit message states, "Qemu itself also won't advertise PSCI in
>> [...] ACPI if EL3 firmware is present"; if that's correct (I can't
>> tell), then it may be the problem.
>>
> 
> Exactly.
> 
> When not emulating EL2 or EL3 (which is equivalent to the KVM case),
> PSCI calls are made using HVC instructions, which are handled by QEMU
> directly.
> 
> When EL2 emulation is enabled, PSCI calls are made using SMC
> instructions but using the same handling in QEMU.
> 
> When EL3 emulation is enabled, QEMU can no longer 'overrule' the side
> effects of SMC instructions but has to deliver them to the firmware
> that occupies EL3. Whether or not that firmware implements PSCI is not
> known to QEMU, and so it assumes it is not, and populates the FADT
> fields accordingly.

Whether the EL3 firmware implements PSCI or not is presumably known at
QEMU launch time. Is that right? I mean, not inherently known to QEMU,
but known to the user, or minimally to the *provider* of the EL3
firmware binary.

That meta-datum should be exposed to QEMU via a dedicated command line
switch. (It could be a device property, a machine type propery, a PCI
host bridge vendor capability, a custom fw_cfg file in the edk2 or
tianocore namespace, or some other means that's related to the EL3
firmware -- I reckon the EL3 firmware binary pathname is ultimately
specified by the user!) Then QEMU can rely on that information to
populate the FADT.

This "need" is very-very similar to the necessity that had brought about
the firmware descriptor JSON schema and files. When configuring a UEFI
firmware binary for a domain, libvirt needs to know various pieces of
metadata about the different firmware binaries installed on the host
system. Because those properties are not "introspectable" / detectable
from the firmware binaries themselves, we expect the providers /
packagers of those fw binaries to ship additional firmware descriptor
files alongside them. Then libvirt can build (and has built) elaborate
selection / filtering logic on the metadata.

> IIRC QEMU has some patching logic for ACPI tables. Could we make use
> of that here?

The ACPI linker/loader commands are in
"OvmfPkg/Include/IndustryStandard/QemuLoader.h"; what you're likely
referring to is QemuLoaderCmdWritePointer. But that command is for
letting QEMU know a guest-side allocation address.

No, I really believe that, if QEMU cannot detect something about a
particular piece of guest payload, we need to tell it explicitly. And
this is not something to be forced upon the end-user -- it should come
as metadata together with the EL3 firmware.

A similar example is libosinfo. It's different in three regards:

- it concerns guest OS-es, not guest firmware
- the kinds of information it carries are different (like what devices
are supported etc)
- its database is about two orders of magnitude larger

But the usage pattern is exactly the same. Tell me what you want to run
in the guest, I'll give you the optimal domain config in response. In
fact recent virt-install refuses to install any new domain by default
unless the user tells it the guest OS type / release (or unless
virt-install can detect it somehow from the installer media).

This patch would set us on a very slippery slope; very soon we'd have a
whole lot of patching logic in AcpiPlatformDxe, which would defeat the
purpose of the ACPI linker/loader.

BTW, I don't understand the FDT references in the commit message. My
understanding is that the FDT is placed at GPA 0 by QEMU. The commit
message claims the FDT does reflect whether the EL3 firmware implements
PSCI. So there seem to be only two possible explanations for that:

#1 QEMU does know this property after all, because it places the
information in the DTB at GPA 0

#2 the DTB at GPA 0 does not in fact come from QEMU, but from the EL3
firmware -- either whole-sale (i.e., QEMU doesn't expose anything, it
all comes from the EL3 firmware), or the EL3 firmware *patches* QEMU's
DTB (terrible).

Option #2 is quite scary; it's effectively a recipe for the DTB and the
ACPI payload to be out of sync -- they no longer come from a common
source. Covering up such desynchronization after the fact, in edk2, is a
doomed approach IMO. The machine description (capabilities etc) is owned
by QEMU; if that's influenced by EL3 fw properties, those should be made
explicit to QEMU (or to some other layered management application) via
metadata.

In my opinion :)

Laszlo



-=-=-=-=-=-=-=-=-=-=-=-
Groups.io Links: You receive all messages sent to this group.
View/Reply Online (#108415): https://edk2.groups.io/g/devel/message/108415
Mute This Topic: https://groups.io/mt/101215483/7686176
Group Owner: devel+owner@edk2.groups.io
Unsubscribe: https://edk2.groups.io/g/devel/leave/12367111/7686176/1913456212/xyzzy [rebecca@openfw.io]
-=-=-=-=-=-=-=-=-=-=-=-



      reply	other threads:[~2023-09-07 20:07 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20230116113931.1221-1-eiakovlev@linux.microsoft.com>
2023-09-07 14:27 ` [edk2-devel] [edk2][RFC] OvmfPkg/AcpiPlatformDxe: patch FADT PSCI bits if FDT advertises it Ard Biesheuvel
2023-09-07 14:50   ` Laszlo Ersek
2023-09-07 15:17     ` Ard Biesheuvel
2023-09-07 20:07       ` Laszlo Ersek [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-list from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=15802025-a8ea-0399-6bfb-c560e3750c64@redhat.com \
    --to=devel@edk2.groups.io \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox