public inbox for devel@edk2.groups.io
 help / color / mirror / Atom feed
From: "Oliver Steffen" <osteffen@redhat.com>
To: Ard Biesheuvel <ardb@kernel.org>, Marc Zyngier <maz@kernel.org>
Cc: devel@edk2.groups.io, dann.frazier@canonical.com, kraxel@redhat.com
Subject: Re: [edk2-devel] [PATCH v2 2/2] ArmVirtPkg/ArmVirtQemu: Avoid early ID map on ThunderX
Date: Thu, 19 Jan 2023 03:25:10 -0800	[thread overview]
Message-ID: <CA+bRGFqrT1yAhNCaAmz1ntugHXU4b_4JLVuiv=ZB7ubxecJ0nw@mail.gmail.com> (raw)
In-Reply-To: <CAMj1kXGm791TCP4mtkf22JVdbA9rEEkpTh=8HsAszt_sGK+V+g@mail.gmail.com>

Quoting Ard Biesheuvel (2023-01-19 12:11:34)
>  (cc Marc)
>
> Context:
> - on my TX2 (with the S1PTW r/o memslot fix applied), the new version
> of ArmVirtQemu that uses an initial ID map in emulated NOR flash works
> fine.
> - in Oliver's case (which is a slightly different flavor of TX2), it
> crashes extremely early, presumably at the point where this ID map is
> activated.
>
> More details at the end.
>
> On Thu, 19 Jan 2023 at 12:03, Oliver Steffen <osteffen@redhat.com> wrote:
> >
> > Quoting Ard Biesheuvel (2023-01-18 10:22:12)
> > > On Wed, 18 Jan 2023 at 09:48, Ard Biesheuvel <ardb@kernel.org> wrote:
> > > >
> > > > On Wed, 18 Jan 2023 at 09:28, Oliver Steffen <osteffen@redhat.com> wrote:
> > > > >
> > > > > Quoting Ard Biesheuvel (2023-01-18 08:34:32)
> > > > > > On Wed, 18 Jan 2023 at 07:37, Oliver Steffen <osteffen@redhat.com> wrote:
> > > > > > >
> > > > > > > On Tue, Jan 17, 2023 at 3:57 PM Ard Biesheuvel <ardb@kernel.org> wrote:
> > > > > > >>
> > > > > > >> On Tue, 17 Jan 2023 at 13:48, Oliver Steffen <osteffen@redhat.com> wrote:
> > > > > > >> >
> > > > > > >> > Hi Ard, Hi everyone,
> > > > > > >> >
> > > > > > >> > Thanks for the work!
> > > > > > >> >
> > > > > > >> > But somehow this patch (as it was merged into master branch) does not
> > > > > > >> > work for me on the ThunderX box we have.
> > > > > > >> >
> > > > > > >> > Any idea what could be wrong?
> > > > > > >>
> > > > > > >> I'm not sure I understand the question. The patch targets ThunderX,
> > > > > > >> and you are using a ThunderX2.
> > > > > > >>
> > > > > > >> What were you expecting to happen, and what is happening instead?
> > > > > > >
> > > > > > >
> > > > > > > Firmware does not start at all when using KVM.
> > > > > > >
> > > > > > > Please excuse my limited knowledge of Arm processor variants.
> > > > > > > I assumed that ThunderX and ThunderX2 are very similar and hoped
> > > > > > > the fix would also work for this case.
> > > > > > >
> > > > > > > The issue was introduced by the same commit that Dann
> > > > > > > reported (07be1d34d95460a238fcd0f6693efb747c28b329):
> > > > > > > "ArmVirtPkg/ArmVirtQemu: enable initial ID map at early boot".
> > > > > > >
> > > > > >
> > > > > > Can you share the QEMU command line that you are using? I use a
> > > > > > ThunderX2 basically 24/7 to do all my Linux and EDK2 development, so
> > > > > > this change was developed on ThunderX2 and so I'm surprised you are
> > > > > > seeing this issue.
> > > > > >
> > > > > > Did you try the DEBUG build as well?
> > > > > Yes, debug is on.
> > > > >
> > > > > Here is what I have, trying with the master branch from just now
> > > > > (998ebe5ca0ae5c449e83ede533bee872f97d63af):
> > > > >
> > > > > # make -C BaseTools && \
> > > > >   . ./edksetup.sh && \
> > > > >   build -t GCC5 -a AARCH64 \
> > > > >     -p ArmVirtPkg/ArmVirtQemu.dsc \
> > > > >     -DCAVIUM_ERRATUM_27456 \
> > > > >     -b DEBUG
> > > > >
> > > > > # /usr/libexec/qemu-kvm \
> > > > >     -machine accel=kvm -m 1G -boot menu=on \
> > > > >     -blockdev node-name=code,driver=file,filename="${FW_CODE_RESIZED}",read-only=on
> > > > > \
> > > > >     -blockdev node-name=vars,driver=file,filename="${FW_VARS}" \
> > > > >     -machine pflash0=code \
> > > > >     -machine pflash1=vars \
> > > > >     -cpu max \
> > > > >     -net none \
> > > > >     -serial stdio
> > > > >
> > > >
> > > > My distro does not have qemu-kvm, and using the command line above
> > > > results in the following if i try it with qemu-system-aarch64
> > > >
> > > > """
> > > > qemu-system-aarch64: No machine specified, and there is no default
> > > > Use -machine help to list supported machines
> > > > """
> > > >
> > > > unless i change it to
> > > >
> > > > qemu-system-aarch64 -machine virt,accel=kvm -m 1G -boot menu=on \
> > > >     -blockdev node-name=code,driver=file,filename=$HOME/bin/flash0.img,read-only=on
> > > > \
> > > >     -blockdev node-name=vars,driver=file,filename=$HOME/bin/flash1.img \
> > > >     -machine pflash0=code \
> > > >     -machine pflash1=vars \
> > > >     -cpu max \
> > > >     -net none \
> > > >     -nographic
> > > >
> > > > and that works fine with my firmware build.
> > > >
> > > >
> > > > > # /usr/libexec/qemu-kvm --version
> > > > > QEMU emulator version 7.2.0 (qemu-kvm-7.2.0-3.el9)
> > > > >
> > > > > # uname -r
> > > > > 5.14.0-234.el9.aarch64
> > > > >
> > > >
> > > > Yeah, that is quite old. One potential issue that comes to mind here
> > > > is the one address by the patch below
> > > >
> > > >
> > > > >
> > > > >
> > > > > Since you have the same CPU... Might this be a bug in KVM?
> > > > >
> > > >
> > > > Indeed. Could you try applying this patch?
> > > >
> > > > commit 406504c7b0405d74d74c15a667cd4c4620c3e7a9
> > > > Author: Marc Zyngier <maz@kernel.org>
> > > > Date:   Tue Dec 20 14:03:52 2022 +0000
> > > >
> > > >     KVM: arm64: Fix S1PTW handling on RO memslots
> > > >
> > > > Or check whether this is generally reproducible with newer kernels?
> > >
> > > Another thing you might try:
> > >
> > > - build the firmware with the following hunk applied
> > >
> > > """
> > > diff --git a/ArmVirtPkg/Library/ArmPlatformLibQemu/AArch64/ArmPlatformHelper.S
> > > b/ArmVirtPkg/Library/ArmPlatformLibQemu/AArch64/ArmPlatformHelper.S
> > > index 5ac7c732f6ec..f4e1285beefc 100644
> > > --- a/ArmVirtPkg/Library/ArmPlatformLibQemu/AArch64/ArmPlatformHelper.S
> > > +++ b/ArmVirtPkg/Library/ArmPlatformLibQemu/AArch64/ArmPlatformHelper.S
> > > @@ -40,6 +40,12 @@
> > >   .set    sctlrval, SCTLR_ELx_M | SCTLR_ELx_C | SCTLR_ELx_SA |
> > > SCTLR_EL1_ITD | SCTLR_EL1_SED
> > >   .set    sctlrval, sctlrval | SCTLR_ELx_I | SCTLR_EL1_SPAN | SCTLR_EL1_RES1
> > >
> > > +  .align  11
> > > +.Lvectors:
> > > +  .rept   16
> > > +  .align  7
> > > +  b       .
> > > +  .endr
> > >
> > >  ASM_FUNC(ArmPlatformPeiBootAction)
> > >  #ifdef CAVIUM_ERRATUM_27456
> > > @@ -90,6 +96,8 @@ ASM_FUNC(ArmPlatformPeiBootAction)
> > >    msr    mair_el1, x0            // set up the 1:1 mapping
> > >    msr    tcr_el1, x1
> > >    msr    ttbr0_el1, x2
> > > +  adr    x0, .Lvectors
> > > +  msr    vbar_el1, x0
> > >    isb
> > >
> > >    tlbi   vmalle1                 // invalidate any cached translations
> > > """
> > >
> > > - run qemu with the -s option and let it crash
> > >
> > > - connect with gdb and dump the exception context
> > >
> > > target remote:1234
> > > set radix 16
> > > p $FAR_EL1
> > > p $ESR_EL1
> > > p $ELR_EL1
> > >
> > > That should at least tell us why the crash is occurring.
> > >
> >
> > I tried the most recent Qemu master (v7.2.50) and also v7.0.0,
> > on the 5.14 (RHEL) kernel and on 6.1.6-200.fc37.aarch64 (from Fedora).
> > No luck.
> >
>
> Does that include a backport of commit 406504c7b0405d74d74c15a667cd4c4620c3e7a9?
>
> > I applied the patch and attached gdb, as described (Qemu 7.2.50):
> >
> >   p $ELR_EL1
> >   (gdb) p $FAR_EL1
> >   $1 = 0x6200
> >   (gdb) p $ESR_EL1
> >   $2 = 0x86000010
> >   (gdb) p $ELR_EL1
> >   $3 = 0x6200
> >
> > There is no sign of any crash. It seems like it does not even start
> > running.
> >
>
> So 0x6200 is the sync exception vector, which is both the code
> location of the crash and the faulting address. This means fetching
> the instructions to handle the original exception failed, and so the
> original exception reason (ESR) is lost. However, the synchronous
> external abort (https://esr.arm64.dev/?#0x86000010) that you are
> seeing might point to an issue similar (or the same) that Marc
> recently fixed in KVM.
>
> It is quite odd that this does not reproduce *at all* on my TX2.
> Fedora kernels don't use 64k pages right?
>

Kernel config says:
CONFIG_ARM64_4K_PAGES=y


  reply	other threads:[~2023-01-19 11:25 UTC|newest]

Thread overview: 37+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-01-05 16:25 [PATCH v2 1/2] ArmVirtPkg/ArmPlatformLibQemu: Ensure that VFP is on before running C code Ard Biesheuvel
2023-01-05 16:25 ` [PATCH v2 2/2] ArmVirtPkg/ArmVirtQemu: Avoid early ID map on ThunderX Ard Biesheuvel
2023-01-10  0:08   ` dann frazier
2023-01-17 12:47     ` [edk2-devel] " Oliver Steffen
2023-01-17 14:53       ` Ard Biesheuvel
2023-01-18  6:36         ` Oliver Steffen
2023-01-18  7:34           ` Ard Biesheuvel
2023-01-18  8:27             ` Oliver Steffen
2023-01-18  8:48               ` Ard Biesheuvel
2023-01-18  9:22                 ` Ard Biesheuvel
2023-01-19 11:03                   ` Oliver Steffen
2023-01-19 11:11                     ` Ard Biesheuvel
2023-01-19 11:25                       ` Oliver Steffen [this message]
2023-01-19 11:55                       ` Marc Zyngier
2023-01-19 12:21                         ` Ard Biesheuvel
2023-01-19 12:00                       ` Gerd Hoffmann
2023-01-19 12:55                         ` Oliver Steffen
2023-01-19 13:21                           ` Ard Biesheuvel
2023-01-26 12:01                             ` Gerd Hoffmann
2023-01-26 13:48                               ` Marc Zyngier
2023-01-26 14:46                                 ` Gerd Hoffmann
2023-01-26 15:08                                   ` Marc Zyngier
2023-02-01  9:13                             ` Oliver Steffen
2023-02-01 11:51                               ` Ard Biesheuvel
2023-02-01 12:58                                 ` Oliver Steffen
2023-02-01 13:29                                   ` Ard Biesheuvel
2023-02-02 11:09                                     ` Oliver Steffen
     [not found]                                     ` <173FFD60429C89C3.3213@groups.io>
2023-02-07 10:51                                       ` Oliver Steffen
2023-02-07 11:56                                         ` Ard Biesheuvel
2023-02-07 12:58                                           ` Oliver Steffen
2023-02-09 15:15                                             ` Ard Biesheuvel
2023-03-02 10:50                                               ` Ard Biesheuvel
2023-03-02 13:29                                                 ` Oliver Steffen
     [not found]                                                 ` <17489D498A098DB9.9697@groups.io>
2023-05-19 16:32                                                   ` Oliver Steffen
2023-05-19 21:36                                                     ` Ard Biesheuvel
2023-05-20  8:37                                                       ` Oliver Steffen
2023-05-20  9:20                                                         ` Ard Biesheuvel

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-list from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CA+bRGFqrT1yAhNCaAmz1ntugHXU4b_4JLVuiv=ZB7ubxecJ0nw@mail.gmail.com' \
    --to=devel@edk2.groups.io \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox