From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by mx.groups.io with SMTP id smtpd.web10.80603.1675767076524797133 for ; Tue, 07 Feb 2023 02:51:16 -0800 Authentication-Results: mx.groups.io; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=SqoVG6RZ; spf=pass (domain: redhat.com, ip: 170.10.129.124, mailfrom: osteffen@redhat.com) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1675767075; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=+lxtH2qLScbGzbgmIPHMYj9nXvFLHkEO76rsnk+4r7E=; b=SqoVG6RZWRUjAH4idTosSFMJMrZdtGwkORpGsb5+AMiD09AQjG5tBOlWBv4f36XSJ4ppTV yXYHU3hO23NcYoVEfsoEX1nxG1oIlH2D2JSr9Va7oRQenD/6i00pcgN85Ayef/UCE/3bhT IfELD/m/pFuFSYAdOLzshLLOjh2cdYM= Received: from mail-lf1-f72.google.com (mail-lf1-f72.google.com [209.85.167.72]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_128_GCM_SHA256) id us-mta-447-GHi1fG6iNt2llq8RGPuMRw-1; Tue, 07 Feb 2023 05:51:14 -0500 X-MC-Unique: GHi1fG6iNt2llq8RGPuMRw-1 Received: by mail-lf1-f72.google.com with SMTP id i2-20020a0565123e0200b004d5aee356dcso6162620lfv.5 for ; Tue, 07 Feb 2023 02:51:13 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=+lxtH2qLScbGzbgmIPHMYj9nXvFLHkEO76rsnk+4r7E=; b=z6onPq7jQBS6aa8s5qxaSduQJSq/7GvJKLZzjKoQHbTBcxxK+leDln/3atz+iQVXPD J37WypomcUmuoVJWPonNSNW6L6XmWIi0kW9LoGCKKSRF4YiKkxcSUsc6gVo46eAQN3SW TSCcTAdmMxNH3SVuHdckcEmOA7vEBQCSGqQxlYvJXCUeTV4WbXZkVa/zZvBp8+kZ3vzz QwkEW+ZYx6Y5ydgw6Wflzj6VytNSgwHO3evD741WW7Iw6NiA659dhiWGIxHgAM8HFgP5 cvlsxKn3nsdET8q7DuLxfQ/YPo05WqYOC+suIwdHlc8UHIjt0f9A9vpo3s8t+l7CQR0t eTxw== X-Gm-Message-State: AO0yUKWMYyxeafKjJWR+nnLPnAqcrPEO5ZZQcEjNqCgkgtuh+CfkSV3O qTkFeUbmZkT+S0d0Xt3nbx/xAaKEhNTkvTJnjWSTW4jr5uQK8yzefPcn8qQYLpitZUED0uS/Ltd RQU9DrEsB9YgKSeESnhtQ5Om1MnPdIQ== X-Received: by 2002:a05:6512:924:b0:4d8:20fa:51c9 with SMTP id f4-20020a056512092400b004d820fa51c9mr620102lft.59.1675767072200; Tue, 07 Feb 2023 02:51:12 -0800 (PST) X-Google-Smtp-Source: AK7set+DopAHC0f25Gv4zv7H7E5MvwL4VgsXzbUnLJM5G5jzuDuMQ3mbBi4+DykJWf1CCiZdIfId3F2lsCXanOvUyxM= X-Received: by 2002:a05:6512:924:b0:4d8:20fa:51c9 with SMTP id f4-20020a056512092400b004d820fa51c9mr620092lft.59.1675767071942; Tue, 07 Feb 2023 02:51:11 -0800 (PST) MIME-Version: 1.0 References: <20230119120021.4yohqindvj3ghwky@sirius.home.kraxel.org> <173FFD60429C89C3.3213@groups.io> In-Reply-To: <173FFD60429C89C3.3213@groups.io> From: "Oliver Steffen" Date: Tue, 7 Feb 2023 11:51:00 +0100 Message-ID: Subject: Re: [edk2-devel] [PATCH v2 2/2] ArmVirtPkg/ArmVirtQemu: Avoid early ID map on ThunderX To: Ard Biesheuvel Cc: devel@edk2.groups.io, Gerd Hoffmann , Marc Zyngier , dann.frazier@canonical.com X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: multipart/alternative; boundary="0000000000000cb60a05f419ed7b" --0000000000000cb60a05f419ed7b Content-Type: text/plain; charset="UTF-8" On Thu, Feb 2, 2023 at 12:09 PM Oliver Steffen wrote: > > On Wed, Feb 1, 2023 at 2:29 PM Ard Biesheuvel wrote: > >> On Wed, 1 Feb 2023 at 13:59, Oliver Steffen wrote: >> > >> > On Wed, Feb 1, 2023 at 12:52 PM Ard Biesheuvel wrote: >> >> >> >> On Wed, 1 Feb 2023 at 10:14, Oliver Steffen >> wrote: >> >> > >> > [...] > >> >> > I am sorry, this story does not seem to be over yet. >> >> > >> >> > We are using the Erratum patch and also included the commit 406504c7 >> in >> >> > the kernel. >> >> > Now the firmware crashes sometimes (10 out of 89 tests). >> >> > >> >> >> >> Thanks for the report. Is this still on ThunderX2? >> >> >> >> > Any hints are very welcome! >> >> > >> >> >> >> Do you have access to those build artifacts? >> > >> > >> > >> https://kojihub.stream.centos.org/kojifiles/work/tasks/5251/1835251/edk2-aarch64-20221207gitfff6d81270b5-4.el9.test.noarch.rpm >> > >> > and/or here: >> > >> > https://kojihub.stream.centos.org/koji/taskinfo?taskID=1835251 >> > >> > Source for reference: >> > https://gitlab.com/redhat/centos-stream/src/edk2/-/merge_requests/24 >> > >> >> Any chance the .dll files (which are actually ELF executables) have >> been preserved somewhere? >> > Here is the build folder (~90MB): > > https://gitlab.com/osteffen/thunderx2-debug/-/raw/main/armvirt-thunderx2-issue.tar.xz > > I am waiting for the tests with the additional debug output to run. > We reran the test suite with the Erratum and the additional debug output enabled. Strangely, the problem does not occur anymore, the firmware boots up normally. We retried the tests without the additional debug output. RHEL ships two firmware flavors for AARCH64: a silent and a verbose version. Both were tried. We see no problems with the verbose one. The silent one fails noticeably more often if a software TPM device is present. Could this be related to how much stuff is going on in the early phase of the firmware (when logging is enabled: formatting of messages and sending to serial port...) ? Thanks, Oliver --0000000000000cb60a05f419ed7b Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
On Thu, Feb 2, 2023 at 12:09 PM Oliver Steffen <osteffen@redhat.com> wrote:
=

On Wed, Feb 1, 2023 = at 2:29 PM Ard Biesheuvel <ardb@kernel.org> wrote:
On Wed, 1 Feb 2023 at 13:59, Oliver Steffen <osteffen@redhat.com> wrote:
>
> On Wed, Feb 1, 2023 at 12:52 PM Ard Biesheuvel <
ardb@kernel.org> wrote:
>>
>> On Wed, 1 Feb 2023 at 10:14, Oliver Steffen <osteffen@redhat.com> wrote: >> >
[...]=C2=A0
>> > I am sorry, this story does not seem to be over yet.
>> >
>> > We are using the Erratum patch and also included the commit 4= 06504c7 in
>> > the kernel.
>> > Now the firmware crashes sometimes (10 out of 89 tests).
>> >
>>
>> Thanks for the report. Is this still on ThunderX2?
>>
>> > Any hints are very welcome!
>> >
>>
>> Do=C2=A0 you have access to those build artifacts?
>
>
> https://kojihub.stream.centos.org/kojifiles/= work/tasks/5251/1835251/edk2-aarch64-20221207gitfff6d81270b5-4.el9.test.noa= rch.rpm
>
> and/or here:
>
> https://kojihub.stream.centos.o= rg/koji/taskinfo?taskID=3D1835251
>
> Source for reference:
> https://gitlab.com/redhat/c= entos-stream/src/edk2/-/merge_requests/24
>

Any chance the .dll files (which are actually ELF executables) have
been preserved somewhere?

I am waiting for the tests with the additional debug output to run.<= br>
=C2=A0
We reran the test suite wi= th the Erratum and the additional debug
output enabled.=C2=A0 Strangely,= the problem does not occur anymore, the
firmware boots up normally.
=
We retried the tests without the additional debug output.
RHEL ships= two firmware flavors for AARCH64: a silent and a verbose
version. Both = were tried. We see no problems with the verbose
one. The silent one fail= s noticeably more often if a software TPM device
is present.

Coul= d this be related to how much stuff is going on in the early phase
of th= e firmware (when logging is enabled: formatting of messages and
sending = to serial port...) ?

Thanks,
=C2=A0 Oliver

--0000000000000cb60a05f419ed7b--