From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mga05.intel.com (mga05.intel.com [192.55.52.43]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ml01.01.org (Postfix) with ESMTPS id D0A2D81DC8 for ; Sun, 13 Nov 2016 04:51:49 -0800 (PST) Received: from fmsmga006.fm.intel.com ([10.253.24.20]) by fmsmga105.fm.intel.com with ESMTP; 13 Nov 2016 04:51:53 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.31,485,1473145200"; d="scan'208";a="30711406" Received: from fmsmsx105.amr.corp.intel.com ([10.18.124.203]) by fmsmga006.fm.intel.com with ESMTP; 13 Nov 2016 04:51:53 -0800 Received: from fmsmsx113.amr.corp.intel.com (10.18.116.7) by FMSMSX105.amr.corp.intel.com (10.18.124.203) with Microsoft SMTP Server (TLS) id 14.3.248.2; Sun, 13 Nov 2016 04:51:53 -0800 Received: from shsmsx103.ccr.corp.intel.com (10.239.4.69) by FMSMSX113.amr.corp.intel.com (10.18.116.7) with Microsoft SMTP Server (TLS) id 14.3.248.2; Sun, 13 Nov 2016 04:51:53 -0800 Received: from shsmsx102.ccr.corp.intel.com ([169.254.2.239]) by SHSMSX103.ccr.corp.intel.com ([169.254.4.96]) with mapi id 14.03.0248.002; Sun, 13 Nov 2016 20:51:51 +0800 From: "Fan, Jeff" To: Laszlo Ersek CC: "edk2-devel@ml01.01.org" , "Yao, Jiewen" , Paolo Bonzini Thread-Topic: [edk2] [PATCH v2 0/3] Put AP into safe hlt-loop code on S3 path Thread-Index: AQHSO97k043lSFBUy0mbX04flay71KDTq3eAgAMz0IA= Date: Sun, 13 Nov 2016 12:51:51 +0000 Message-ID: <542CF652F8836A4AB8DBFAAD40ED192A4A2DB4F5@shsmsx102.ccr.corp.intel.com> References: <20161111054545.19616-1-jeff.fan@intel.com> In-Reply-To: Accept-Language: zh-CN, en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-titus-metadata-40: eyJDYXRlZ29yeUxhYmVscyI6IiIsIk1ldGFkYXRhIjp7Im5zIjoiaHR0cDpcL1wvd3d3LnRpdHVzLmNvbVwvbnNcL0ludGVsMyIsImlkIjoiYmIzMjgzMjAtZDdkMi00N2ZiLTg4M2UtMmMzNTYxNmY4MDU5IiwicHJvcHMiOlt7Im4iOiJDVFBDbGFzc2lmaWNhdGlvbiIsInZhbHMiOlt7InZhbHVlIjoiQ1RQX0lDIn1dfV19LCJTdWJqZWN0TGFiZWxzIjpbXSwiVE1DVmVyc2lvbiI6IjE1LjkuNi42IiwiVHJ1c3RlZExhYmVsSGFzaCI6ImdTaXdyczE2YTI0UVwvZDc0bCtRN1RcL09pNTNCT1BvRU5PTFZKSHZiTnN6UT0ifQ== x-ctpclassification: CTP_IC x-originating-ip: [10.239.127.40] MIME-Version: 1.0 Subject: Re: [PATCH v2 0/3] Put AP into safe hlt-loop code on S3 path X-BeenThere: edk2-devel@lists.01.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: EDK II Development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 13 Nov 2016 12:51:49 -0000 Content-Language: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Laszlo, Thanks your testing. It seems that there is still some unknown issue existi= ng. I suggest to push this serial of patches firstly, because they have big pro= gress to solve the AP crashed issue in https://bugzilla.tianocore.org/show_= bug.cgi?id=3D216. I could submit another bug to handle "AP lost" issue. Thus, JIewen's or o= thers' patches could be push as long as they have no additional issue excep= t for "AP Lost:". I could follow up to fix "AP Lost" issue. Thanks! Jeff -----Original Message----- From: Laszlo Ersek [mailto:lersek@redhat.com]=20 Sent: Saturday, November 12, 2016 3:49 AM To: Fan, Jeff Cc: edk2-devel@ml01.01.org; Yao, Jiewen; Paolo Bonzini Subject: Re: [edk2] [PATCH v2 0/3] Put AP into safe hlt-loop code on S3 pat= h On 11/11/16 06:45, Jeff Fan wrote: > On S3 path, we will wake up APs to restore CPU context in=20 > PiSmmCpuDxeSmm driver. In case, one NMI or SMI happens, APs may exit=20 > from hlt state and execute the instruction after HLT instruction. >=20 > But APs are not running on safe code, it leads OVMF S3 boot unstable. >=20 > https://bugzilla.tianocore.org/show_bug.cgi?id=3D216 >=20 > I tested real platform with 64bit DXE. >=20 > v2: > 1. Make stack alignment per Laszlo's comment. > 2. Trim whitespace at end of end per Laszlo's comment. > 3. Update year mark in file header. > 4. Enhancement on InterlockedDecrement() per Paolo's comment. >=20 > Jeff Fan (3): > UefiCpuPkg/PiSmmCpuDxeSmm: Put AP into safe hlt-loop code on S3 path > UefiCpuPkg/PiSmmCpuDxeSmm: Place AP to 32bit protected mode on S3 path > UefiCpuPkg/PiSmmCpuDxeSmm: Decrease mNumberToFinish in AP safe code >=20 > UefiCpuPkg/PiSmmCpuDxeSmm/CpuS3.c | 33 +++++++++++++- > UefiCpuPkg/PiSmmCpuDxeSmm/Ia32/SmmFuncsArch.c | 29 +++++++++++- > UefiCpuPkg/PiSmmCpuDxeSmm/PiSmmCpuDxeSmm.h | 15 +++++++ > UefiCpuPkg/PiSmmCpuDxeSmm/X64/SmmFuncsArch.c | 63=20 > ++++++++++++++++++++++++++- > 4 files changed, 136 insertions(+), 4 deletions(-) >=20 Applied this locally to master (ffd6b0b1b65e) for testing. I tested the ser= ies with a suspend-resume loop -- not a busy loop, just manually. (So there= was always one second or so between adjacent steps.) No crashes or emulation failures, but the "AP going lost" issue remains pre= sent -- sometimes Linux cannot bring up one of the four VCPUs after resume. In the Ia32 case, this "AP lost" symptom surfaced after the 6th resume. In the Ia32X64 case, I experienced the symptom after the 89th resume. Thanks Laszlo