From: Paolo Bonzini <pbonzini@redhat.com>
To: Laszlo Ersek <lersek@redhat.com>, "Fan, Jeff" <jeff.fan@intel.com>
Cc: "edk2-devel@ml01.01.org" <edk2-devel@ml01.01.org>,
"Yao, Jiewen" <jiewen.yao@intel.com>
Subject: Re: [PATCH v2 0/3] Put AP into safe hlt-loop code on S3 path
Date: Mon, 14 Nov 2016 09:50:02 +0100 [thread overview]
Message-ID: <4dc14e5c-9b43-4338-c7a5-9750e8a9547a@redhat.com> (raw)
In-Reply-To: <00b6828b-78c5-af4f-ab98-de4460b1b8ec@redhat.com>
On 14/11/2016 09:17, Laszlo Ersek wrote:
> On 11/13/16 13:51, Fan, Jeff wrote:
>> Laszlo,
>>
>> Thanks your testing. It seems that there is still some unknown issue existing.
>>
>> I suggest to push this serial of patches firstly, because they have
>> big progress to solve the AP crashed issue in
>> https://bugzilla.tianocore.org/show_bug.cgi?id=216.
>
> Sounds good to me.
>
>> I could submit another bug to handle "AP lost" issue.
>
> I hope that Paolo can continue to help us with the KVM trace analysis.
I will, but it will take a few days. In the meanwhile it would be nice
if you could take a look at using SendSmiIpiAllExcludingSelf() to bridge
the difference between 0xb2 on QEMU and on real hardware.
Paolo
>> Thus, JIewen's
>> or others' patches could be push as long as they have no additional
>> issue except for "AP Lost:".
>
> I haven't gotten around testing Jiewen's v3 series yet. I think it would
> be best if I could test Jiewen's v3 after this v2 series of yours is
> committed. I'll report back with results.
>
> Thanks
> Laszlo
>
>>
>> I could follow up to fix "AP Lost" issue.
>>
>> Thanks!
>> Jeff
>>
>>
>> -----Original Message-----
>> From: Laszlo Ersek [mailto:lersek@redhat.com]
>> Sent: Saturday, November 12, 2016 3:49 AM
>> To: Fan, Jeff
>> Cc: edk2-devel@ml01.01.org; Yao, Jiewen; Paolo Bonzini
>> Subject: Re: [edk2] [PATCH v2 0/3] Put AP into safe hlt-loop code on S3 path
>>
>> On 11/11/16 06:45, Jeff Fan wrote:
>>> On S3 path, we will wake up APs to restore CPU context in
>>> PiSmmCpuDxeSmm driver. In case, one NMI or SMI happens, APs may exit
>>> from hlt state and execute the instruction after HLT instruction.
>>>
>>> But APs are not running on safe code, it leads OVMF S3 boot unstable.
>>>
>>> https://bugzilla.tianocore.org/show_bug.cgi?id=216
>>>
>>> I tested real platform with 64bit DXE.
>>>
>>> v2:
>>> 1. Make stack alignment per Laszlo's comment.
>>> 2. Trim whitespace at end of end per Laszlo's comment.
>>> 3. Update year mark in file header.
>>> 4. Enhancement on InterlockedDecrement() per Paolo's comment.
>>>
>>> Jeff Fan (3):
>>> UefiCpuPkg/PiSmmCpuDxeSmm: Put AP into safe hlt-loop code on S3 path
>>> UefiCpuPkg/PiSmmCpuDxeSmm: Place AP to 32bit protected mode on S3 path
>>> UefiCpuPkg/PiSmmCpuDxeSmm: Decrease mNumberToFinish in AP safe code
>>>
>>> UefiCpuPkg/PiSmmCpuDxeSmm/CpuS3.c | 33 +++++++++++++-
>>> UefiCpuPkg/PiSmmCpuDxeSmm/Ia32/SmmFuncsArch.c | 29 +++++++++++-
>>> UefiCpuPkg/PiSmmCpuDxeSmm/PiSmmCpuDxeSmm.h | 15 +++++++
>>> UefiCpuPkg/PiSmmCpuDxeSmm/X64/SmmFuncsArch.c | 63
>>> ++++++++++++++++++++++++++-
>>> 4 files changed, 136 insertions(+), 4 deletions(-)
>>>
>>
>> Applied this locally to master (ffd6b0b1b65e) for testing. I tested the series with a suspend-resume loop -- not a busy loop, just manually. (So there was always one second or so between adjacent steps.)
>>
>> No crashes or emulation failures, but the "AP going lost" issue remains present -- sometimes Linux cannot bring up one of the four VCPUs after resume.
>>
>> In the Ia32 case, this "AP lost" symptom surfaced after the 6th resume.
>>
>> In the Ia32X64 case, I experienced the symptom after the 89th resume.
>>
>> Thanks
>> Laszlo
>>
next prev parent reply other threads:[~2016-11-14 8:50 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-11-11 5:45 [PATCH v2 0/3] Put AP into safe hlt-loop code on S3 path Jeff Fan
2016-11-11 5:45 ` [PATCH v2 1/3] UefiCpuPkg/PiSmmCpuDxeSmm: " Jeff Fan
2016-11-11 5:45 ` [PATCH v2 2/3] UefiCpuPkg/PiSmmCpuDxeSmm: Place AP to 32bit protected mode " Jeff Fan
2016-11-11 5:45 ` [PATCH v2 3/3] UefiCpuPkg/PiSmmCpuDxeSmm: Decrease mNumberToFinish in AP safe code Jeff Fan
2016-11-11 10:16 ` Paolo Bonzini
2016-11-11 19:49 ` [PATCH v2 0/3] Put AP into safe hlt-loop code on S3 path Laszlo Ersek
2016-11-13 12:51 ` Fan, Jeff
2016-11-14 1:41 ` Yao, Jiewen
2016-11-14 8:17 ` Laszlo Ersek
2016-11-14 8:50 ` Paolo Bonzini [this message]
2016-11-14 10:39 ` Laszlo Ersek
2016-11-14 11:09 ` Paolo Bonzini
2016-11-14 11:27 ` Laszlo Ersek
2016-11-14 12:00 ` Paolo Bonzini
2016-11-14 18:07 ` Laszlo Ersek
2016-11-14 18:13 ` Paolo Bonzini
2016-11-14 23:56 ` Laszlo Ersek
2016-11-15 0:47 ` Fan, Jeff
2016-11-15 1:03 ` Laszlo Ersek
2016-11-15 1:04 ` Fan, Jeff
2016-11-15 1:19 ` Fan, Jeff
2016-11-15 1:30 ` Laszlo Ersek
2016-11-15 1:27 ` Laszlo Ersek
2016-11-15 1:38 ` Fan, Jeff
[not found] ` <542CF652F8836A4AB8DBFAAD40ED192A4A2DCDE3@shsmsx102.ccr.corp.intel.com>
2016-11-15 1:21 ` Yao, Jiewen
2016-11-15 1:24 ` Fan, Jeff
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-list from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4dc14e5c-9b43-4338-c7a5-9750e8a9547a@redhat.com \
--to=devel@edk2.groups.io \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox