From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by mx.groups.io with SMTP id smtpd.web09.8960.1614788093338027792 for ; Wed, 03 Mar 2021 08:14:53 -0800 Authentication-Results: mx.groups.io; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=Qhwboieg; spf=pass (domain: redhat.com, ip: 170.10.133.124, mailfrom: lersek@redhat.com) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1614788092; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=elp5P9hU6LhbogSMgskCwzIyq0P7/upz+5JBFw174qc=; b=Qhwboiegxbrvg0KoqG8yYTptjocRRDKrXaaNL72hUqMp7VC+oq6u7Fl00stZEHMf/RfC8+ /Nw0K4s7WktN34Uf0HzaWrhSIOiwZp0DB0YTPmgy51AclQBQuoAkwMJ+R4H9ePnYKbJGJh eIXffh5h5V1UAke1vfwHxU6GXQbvrYM= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-6-IySFh3aQOzq0jKqcF4TMfQ-1; Wed, 03 Mar 2021 11:14:47 -0500 X-MC-Unique: IySFh3aQOzq0jKqcF4TMfQ-1 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.phx2.redhat.com [10.5.11.14]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id C2472869EDF; Wed, 3 Mar 2021 16:14:45 +0000 (UTC) Received: from lacos-laptop-7.usersys.redhat.com (ovpn-113-245.ams2.redhat.com [10.36.113.245]) by smtp.corp.redhat.com (Postfix) with ESMTP id 82D505D9C6; Wed, 3 Mar 2021 16:14:43 +0000 (UTC) Subject: Re: [edk2-devel] [RFC PATCH 00/14] Firmware Support for Fast Live Migration for AMD SEV To: devel@edk2.groups.io, tobin@linux.ibm.com Cc: Dov Murik , Tobin Feldman-Fitzthum , James Bottomley , Hubertus Franke , Brijesh Singh , Ashish Kalra , Jon Grimm , Tom Lendacky References: <20210302204839.82042-1-tobin@linux.ibm.com> From: "Laszlo Ersek" Message-ID: <9d7de545-7902-4d38-ba49-f084a750ee2a@redhat.com> Date: Wed, 3 Mar 2021 17:14:42 +0100 MIME-Version: 1.0 In-Reply-To: <20210302204839.82042-1-tobin@linux.ibm.com> X-Scanned-By: MIMEDefang 2.79 on 10.5.11.14 Authentication-Results: relay.mimecast.com; auth=pass smtp.auth=CUSA124A263 smtp.mailfrom=lersek@redhat.com X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Hi Tobin, On 03/02/21 21:48, Tobin Feldman-Fitzthum wrote: > This is a demonstration of fast migration for encrypted virtual machines > using a Migration Handler that lives in OVMF. This demo uses AMD SEV, > but the ideas may generalize to other confidential computing platforms. > With AMD SEV, guest memory is encrypted and the hypervisor cannot access > or move it. This makes migration tricky. In this demo, we show how the > HV can ask a Migration Handler (MH) in the firmware for an encrypted > page. The MH encrypts the page with a transport key prior to releasing > it to the HV. The target machine also runs an MH that decrypts the page > once it is passed in by the target HV. These patches are not ready for > production, but the are a full end-to-end solution that facilitates a > fast live migration between two SEV VMs. > > Corresponding patches for QEMU have been posted my colleague Dov Murik > on qemu-devel. Our approach needs little kernel support, requiring only > one hypercall that the guest can use to mark a page as encrypted or > shared. This series includes updated patches from Ashish Kalra and > Brijesh Singh that allow OVMF to use this hypercall. > > The MH runs continuously in the guest, waiting for communication from > the HV. The HV starts an additional vCPU for the MH but does not expose > it to the guest OS via ACPI. We use the MpService to start the MH. The > MpService is only available at runtime and processes that are started by > it are usually cleaned up on ExitBootServices. Since we need the MH to > run continuously, we had to make some modifications. Ideally a feature > could be added to the MpService to allow for the starting of > long-running processes. Besides migration, this could support other > background processes that need to operate within the encryption > boundary. For now, we have included a handful of patches that modify the > MpService to allow the MH to keep running after ExitBootServices. These > are temporary. I plan to do a lightweight review for this series. (My understanding is that it's an RFC and not actually being proposed for merging.) Regarding the MH's availability at runtime -- does that necessarily require the isolation of an AP? Because in the current approach, allowing the MP Services to survive into OS runtime (in some form or another) seems critical, and I don't think it's going to fly. I agree that the UefiCpuPkg patches have been well separated from the rest of the series, but I'm somewhat doubtful the "firmware-initiated background process" idea will be accepted. Have you investigated exposing a new "runtime service" (a function pointer) via the UEFI Configuration table, and calling that (perhaps periodically?) from the guest kernel? It would be a form of polling I guess. Or maybe, poll the mailbox directly in the kernel, and call the new firmware runtime service when there's an actual command to process. (You do spell out "little kernel support", and I'm not sure if that's a technical benefit, or a political / community benefit.) I'm quite uncomfortable with an attempt to hide a CPU from the OS via ACPI. The OS has other ways to learn (for example, a boot loader could use the MP services itself, stash the information, and hand it to the OS kernel -- this would minimally allow for detecting an inconsistency in the OS). What about "all-but-self" IPIs too -- the kernel might think all the processors it's poking like that were under its control. Also, as far as I can tell from patch #7, the AP seems to be busy-looping (with a CpuPause() added in), for the entire lifetime of the OS. Do I understand right? If so -- is it a temporary trait as well? Sorry if my questions are "premature", in the sense that I could get my own answers as well if I actually read the patches in detail -- however, I wouldn't like to do that at once, because then I'll be distracted by many style issues and other "trivial" stuff. Examples for the latter: - patch#1 calls SetMemoryEncDecHypercall3(), but there is no such function in edk2, so minimally it's a patch ordering bug in the series, - in patch#1, there's minimally one whitespace error (no whitespace right after "EFI_SIZE_TO_PAGES") - in patch#1, the alphabetical ordering in the [LibraryClasses] section, and in the matching #include directives, gets broken, - I'd prefer if the "SevLiveMigrationEnabled" UEFI variable were set in ConfidentialMigrationDxe, rather than PlatformDxe (patch #3), or at least another AMD SEV related DXE driver (OvmfPkg/AmdSevDxe etc). - any particular reasonf or making the UEFI variable non-volatile? I don't think it should survive any particular boot of the guest. - Why do we need a variable in the first place? etc etc Thanks! Laszlo > > Ashish Kalra (2): > OvmfPkg/PlatformPei: Mark SEC GHCB page in the page encrpytion bitmap. > OvmfPkg/PlatformDxe: Add support for SEV live migration. > > Brijesh Singh (1): > OvmfPkg/BaseMemEncryptLib: Support to issue unencrypted hypercall > > Dov Murik (1): > OvmfPkg/AmdSev: Build page table for migration handler > > Tobin Feldman-Fitzthum (10): > OvmfPkg/AmdSev: Base for Confidential Migration Handler > OvmfPkg/PlatfomPei: Set Confidential Migration PCD > OvmfPkg/AmdSev: Setup Migration Handler Mailbox > OvmfPkg/AmdSev: MH support for mailbox protocol > UefiCpuPkg/MpInitLib: temp removal of MpLib cleanup > UefiCpuPkg/MpInitLib: Allocate MP buffer as runtime memory > UefiCpuPkg/CpuExceptionHandlerLib: Exception handling as runtime > memory > OvmfPkg/AmdSev: Don't overwrite mailbox or pagetables > OvmfPkg/AmdSev: Don't overwrite MH stack > OvmfPkg/AmdSev: MH page encryption POC > > OvmfPkg/OvmfPkg.dec | 11 + > OvmfPkg/AmdSev/AmdSevX64.dsc | 2 + > OvmfPkg/AmdSev/AmdSevX64.fdf | 13 +- > .../ConfidentialMigrationDxe.inf | 45 +++ > .../ConfidentialMigrationPei.inf | 35 ++ > .../DxeMemEncryptSevLib.inf | 1 + > .../PeiMemEncryptSevLib.inf | 1 + > OvmfPkg/PlatformDxe/Platform.inf | 2 + > OvmfPkg/PlatformPei/PlatformPei.inf | 2 + > UefiCpuPkg/Library/MpInitLib/DxeMpInitLib.inf | 2 + > UefiCpuPkg/Library/MpInitLib/PeiMpInitLib.inf | 2 + > OvmfPkg/AmdSev/ConfidentialMigration/MpLib.h | 235 +++++++++++++ > .../ConfidentialMigration/VirtualMemory.h | 177 ++++++++++ > OvmfPkg/Include/Guid/MemEncryptLib.h | 16 + > OvmfPkg/PlatformDxe/PlatformConfig.h | 5 + > .../ConfidentialMigrationDxe.c | 325 ++++++++++++++++++ > .../ConfidentialMigrationPei.c | 25 ++ > .../X64/PeiDxeVirtualMemory.c | 18 + > OvmfPkg/PlatformDxe/AmdSev.c | 99 ++++++ > OvmfPkg/PlatformDxe/Platform.c | 6 + > OvmfPkg/PlatformPei/AmdSev.c | 10 + > OvmfPkg/PlatformPei/Platform.c | 10 + > .../CpuExceptionHandlerLib/DxeException.c | 8 +- > UefiCpuPkg/Library/MpInitLib/DxeMpLib.c | 21 +- > UefiCpuPkg/Library/MpInitLib/MpLib.c | 7 +- > 25 files changed, 1061 insertions(+), 17 deletions(-) > create mode 100644 OvmfPkg/AmdSev/ConfidentialMigration/ConfidentialMigrationDxe.inf > create mode 100644 OvmfPkg/AmdSev/ConfidentialMigration/ConfidentialMigrationPei.inf > create mode 100644 OvmfPkg/AmdSev/ConfidentialMigration/MpLib.h > create mode 100644 OvmfPkg/AmdSev/ConfidentialMigration/VirtualMemory.h > create mode 100644 OvmfPkg/Include/Guid/MemEncryptLib.h > create mode 100644 OvmfPkg/AmdSev/ConfidentialMigration/ConfidentialMigrationDxe.c > create mode 100644 OvmfPkg/AmdSev/ConfidentialMigration/ConfidentialMigrationPei.c > create mode 100644 OvmfPkg/PlatformDxe/AmdSev.c >