From: "Kinney, Michael D"
To: Paolo Bonzini, "Zeng, Star", "edk2-devel@lists.01.org", "Kinney, Michael D"
CC: "Gao, Liming"
Date: Fri, 12 Aug 2016 18:27:11 +0000
References: <1470883079-4472-1-git-send-email-star.zeng@intel.com> <0229321a-849c-b264-7b26-146d6608c754@redhat.com>
In-Reply-To: <0229321a-849c-b264-7b26-146d6608c754@redhat.com>
Subject: Re: [PATCH]
 PcAtChipsetPkg AcpiTimerLib: Get more accurate TSC Frequency

Paolo,

It is a good idea to find factor pairs of a value near
ACPI_TIMER_FREQUENCY (3579545 Hz) where one factor gives a calibration
time close to 100 us.  The pair you found, 343 * 10436 = 3579548, is
3 Hz above the target.  If we look at the frequencies just below and
above ACPI_TIMER_FREQUENCY, the value 3579543 has many more factors and
is only 2 Hz from the target.

Number   Factors
=======  ========
3579542  1, 2, 67, 134, 26713, 53426, 1789771, 3579542
3579543  1, 3, 9, 11, 19, 33, 57, 99, 121, 171, 173, 209, 363,
         519, 627, 1089, 1557, 1881, 1903, 2299, 3287, 5709,
         6897, 9861, 17127, 20691, 20933, 29583, 36157, 62799,
         108471, 188397, 325413, 397727, 1193181, 3579543
3579544  1, 2, 4, 8, 447443, 894886, 1789772, 3579544
3579545  1, 5, 715909, 3579545
3579546  1, 2, 3, 6, 41, 82, 123, 246, 14551, 29102, 43653,
         87306, 596591, 1193182, 1789773, 3579546
3579547  1, 3579547
3579548  1, 2, 4, 7, 14, 28, 49, 98, 196, 343, 686, 1372, 2609,
         5218, 10436, 18263, 36526, 73052, 127841, 255682, 511364,
         894887, 1789774, 3579548

So we could choose from the following pairs for calibration:

  209 * 17127    209 ticks is  58.4 us calibration time
  363 *  9861    363 ticks is 101.4 us calibration time
  519 *  6897    519 ticks is 145.0 us calibration time
  627 *  5709    627 ticks is 175.2 us calibration time

I would recommend the pair 363 * 9861, which is closest to the
current 100 us calibration time.

Another accuracy improvement on the current algorithm is to wait for
the ACPI timer value to change before capturing the initial TSC value,
so only whole ticks are counted.
The current algorithm could start at the beginning, middle, or end of
an ACPI counter tick.  Here is an updated measurement algorithm.

  //
  // Wait for ACPI timer to start next count
  //
  Ticks = IoRead32 (TimerAddr);
  while (Ticks == IoRead32 (TimerAddr)) {
    CpuPause();
  }
  //
  // Immediately capture start TSC value
  //
  StartTSC = AsmReadTsc ();

  //
  // Compute the number of ticks to wait to measure TSC frequency.
  // Use 363 * 9861 = 3579543 Hz which is within 2 Hz of
  // ACPI_TIMER_FREQUENCY.  363 counts is a calibration time of
  // 101.4 us.
  // Subtract 1 because the calibration loop waits one extra count.
  //
  Ticks = IoRead32 (TimerAddr) + 363 - 1;
  //
  // Wait until the ACPI timer has counted the number of calibration ticks.
  // Timer wrap-arounds are handled correctly by this function.
  // When the current ACPI timer value is greater than 'Ticks',
  // the while loop will exit.
  //
  while (((Ticks - IoRead32 (TimerAddr)) & BIT23) == 0) {
    CpuPause();
  }
  //
  // Immediately capture end TSC value
  //
  EndTSC = AsmReadTsc ();                 // TSC value 363 ticks later

  TscFrequency = MultU64x32 (
                   (EndTSC - StartTSC),   // Total number of TSC counts
                   9861
                   );

We would have to run some experiments to see if this further improves
the accuracy and consistency of the measurements.

Mike

> -----Original Message-----
> From: Paolo Bonzini [mailto:paolo.bonzini@gmail.com] On Behalf Of Paolo Bonzini
> Sent: Friday, August 12, 2016 1:31 AM
> To: Zeng, Star; edk2-devel@lists.01.org
> Cc: Kinney, Michael D; Gao, Liming
> Subject: Re: [edk2] [PATCH] PcAtChipsetPkg AcpiTimerLib: Get more accurate TSC
> Frequency
>
>
>
> On 11/08/2016 04:37, Star Zeng wrote:
> > Minimize the code overhead between the two TSC reads by adding
> > new internal API to calculate TSC Frequency instead of reusing
> > MicroSecondDelay ().
> >
> > Cc: Michael D Kinney
> > Cc: Liming Gao
> > Cc: Paul A Lohr
> > Contributed-under: TianoCore Contribution Agreement 1.0
> > Signed-off-by: Star Zeng
> > ---
> >  PcAtChipsetPkg/Library/AcpiTimerLib/AcpiTimerLib.c | 56 +++++++++++++++++++++-
> >  .../Library/AcpiTimerLib/BaseAcpiTimerLib.c        | 33 ++++++++-----
> >  .../Library/AcpiTimerLib/DxeAcpiTimerLib.c         | 31 ++++++++----
> >  3 files changed, 99 insertions(+), 21 deletions(-)
> >
> > diff --git a/PcAtChipsetPkg/Library/AcpiTimerLib/AcpiTimerLib.c b/PcAtChipsetPkg/Library/AcpiTimerLib/AcpiTimerLib.c
> > index 806a4f7ce24c..e6fea231123d 100644
> > --- a/PcAtChipsetPkg/Library/AcpiTimerLib/AcpiTimerLib.c
> > +++ b/PcAtChipsetPkg/Library/AcpiTimerLib/AcpiTimerLib.c
> > @@ -1,7 +1,7 @@
> >  /** @file
> >    ACPI Timer implements one instance of Timer Library.
> >
> > -  Copyright (c) 2013 - 2015, Intel Corporation. All rights reserved.
> > +  Copyright (c) 2013 - 2016, Intel Corporation. All rights reserved.
> >    This program and the accompanying materials
> >    are licensed and made available under the terms and conditions of the BSD License
> >    which accompanies this distribution.  The full text of the license may be found at
> > @@ -335,3 +335,57 @@ GetTimeInNanoSecond (
> >
> >    return NanoSeconds;
> >  }
> > +
> > +/**
> > +  Calculate TSC frequency.
> > +
> > +  The TSC counting frequency is determined by comparing how far it counts
> > +  during a 100us period as determined by the ACPI timer. The ACPI timer is
> > +  used because it counts at a known frequency.
> > +  The TSC is sampled, followed by waiting for ACPI_TIMER_FREQUENCY / 10000
> > +  clocks of the ACPI timer, or 100us. The TSC is then sampled again. The
> > +  difference multiplied by 10000 is the TSC frequency. There will be a small
> > +  error because of the overhead of reading the ACPI timer. An attempt is
> > +  made to determine and compensate for this error.
> > +
> > +  @return The number of TSC counts per second.
> > +
> > +**/
> > +UINT64
> > +InternalCalculateTscFrequency (
> > +  VOID
> > +  )
> > +{
> > +  UINT64      StartTSC;
> > +  UINT64      EndTSC;
> > +  UINT16      TimerAddr;
> > +  UINT32      Ticks;
> > +  UINT64      TscFrequency;
> > +  BOOLEAN     InterruptState;
> > +
> > +  InterruptState = SaveAndDisableInterrupts ();
> > +
> > +  TimerAddr = InternalAcpiGetAcpiTimerIoPort ();
> > +  Ticks = IoRead32 (TimerAddr) + (ACPI_TIMER_FREQUENCY / 10000);    // Set Ticks to 100us in the future
>
> ACPI_TIMER_FREQUENCY is 3579545, thus you're waiting 357 ticks but the
> actual result of the division is much closer to 358.  The error is only
> 0.26%, but it's so simple to reduce it that I think it's worth it.  Just
> change (ACPI_TIMER_FREQUENCY / 10000) to (ACPI_TIMER_FREQUENCY + 5000) /
> 10000.
>
> Another possibility is to count 343 ticks and multiply by 10436; 343 *
> 10436 is almost exactly ACPI_TIMER_FREQUENCY.
>
> Paolo
>
> > +  StartTSC = AsmReadTsc ();                                         // Get base value for the TSC
> > +  //
> > +  // Wait until the ACPI timer has counted 100us.
> > +  // Timer wrap-arounds are handled correctly by this function.
> > +  // When the current ACPI timer value is greater than 'Ticks', the while loop will exit.
> > +  //
> > +  while (((Ticks - IoRead32 (TimerAddr)) & BIT23) == 0) {
> > +    CpuPause();
> > +  }
> > +  EndTSC = AsmReadTsc ();                                           // TSC value 100us later
> > +
> > +  TscFrequency = MultU64x32 (
> > +                   (EndTSC - StartTSC),                             // Number of TSC counts in 100us
> > +                   10000                                            // Number of 100us in a second
> > +                   );
> > +
> > +  SetInterruptState (InterruptState);
> > +
> > +  return TscFrequency;
> > +}
> > +
> > diff --git a/PcAtChipsetPkg/Library/AcpiTimerLib/BaseAcpiTimerLib.c b/PcAtChipsetPkg/Library/AcpiTimerLib/BaseAcpiTimerLib.c
> > index 21fdb79908b8..8819ebcfccef 100644
> > --- a/PcAtChipsetPkg/Library/AcpiTimerLib/BaseAcpiTimerLib.c
> > +++ b/PcAtChipsetPkg/Library/AcpiTimerLib/BaseAcpiTimerLib.c
> > @@ -1,7 +1,7 @@
> >  /** @file
> >    ACPI Timer implements one instance of Timer Library.
> >
> > -  Copyright (c) 2013 - 2014, Intel Corporation. All rights reserved.
> > +  Copyright (c) 2013 - 2016, Intel Corporation. All rights reserved.
> >    This program and the accompanying materials
> >    are licensed and made available under the terms and conditions of the BSD License
> >    which accompanies this distribution.  The full text of the license may be found at
> > @@ -17,6 +17,26 @@
> >  #include
> >
> >  /**
> > +  Calculate TSC frequency.
> > +
> > +  The TSC counting frequency is determined by comparing how far it counts
> > +  during a 100us period as determined by the ACPI timer. The ACPI timer is
> > +  used because it counts at a known frequency.
> > +  The TSC is sampled, followed by waiting for ACPI_TIMER_FREQUENCY / 10000
> > +  clocks of the ACPI timer, or 100us. The TSC is then sampled again. The
> > +  difference multiplied by 10000 is the TSC frequency. There will be a small
> > +  error because of the overhead of reading the ACPI timer. An attempt is
> > +  made to determine and compensate for this error.
> > +
> > +  @return The number of TSC counts per second.
> > +
> > +**/
> > +UINT64
> > +InternalCalculateTscFrequency (
> > +  VOID
> > +  );
> > +
> > +/**
> >    Internal function to retrieves the 64-bit frequency in Hz.
> >
> >    Internal function to retrieves the 64-bit frequency in Hz.
> > @@ -29,14 +49,5 @@ InternalGetPerformanceCounterFrequency (
> >    VOID
> >    )
> >  {
> > -  BOOLEAN  InterruptState;
> > -  UINT64   Count;
> > -  UINT64   Frequency;
> > -
> > -  InterruptState = SaveAndDisableInterrupts ();
> > -  Count = GetPerformanceCounter ();
> > -  MicroSecondDelay (100);
> > -  Frequency = MultU64x32 (GetPerformanceCounter () - Count, 10000);
> > -  SetInterruptState (InterruptState);
> > -  return Frequency;
> > +  return InternalCalculateTscFrequency ();
> >  }
> > diff --git a/PcAtChipsetPkg/Library/AcpiTimerLib/DxeAcpiTimerLib.c b/PcAtChipsetPkg/Library/AcpiTimerLib/DxeAcpiTimerLib.c
> > index 6f5c07a4f0b4..7f7b0f8f6294 100644
> > --- a/PcAtChipsetPkg/Library/AcpiTimerLib/DxeAcpiTimerLib.c
> > +++ b/PcAtChipsetPkg/Library/AcpiTimerLib/DxeAcpiTimerLib.c
> > @@ -1,7 +1,7 @@
> >  /** @file
> >    ACPI Timer implements one instance of Timer Library.
> >
> > -  Copyright (c) 2013 - 2014, Intel Corporation. All rights reserved.
> > +  Copyright (c) 2013 - 2016, Intel Corporation. All rights reserved.
> >    This program and the accompanying materials
> >    are licensed and made available under the terms and conditions of the BSD License
> >    which accompanies this distribution.  The full text of the license may be found at
> > @@ -16,6 +16,26 @@
> >  #include
> >  #include
> >
> > +/**
> > +  Calculate TSC frequency.
> > +
> > +  The TSC counting frequency is determined by comparing how far it counts
> > +  during a 100us period as determined by the ACPI timer. The ACPI timer is
> > +  used because it counts at a known frequency.
> > +  The TSC is sampled, followed by waiting for ACPI_TIMER_FREQUENCY / 10000
> > +  clocks of the ACPI timer, or 100us. The TSC is then sampled again. The
> > +  difference multiplied by 10000 is the TSC frequency. There will be a small
> > +  error because of the overhead of reading the ACPI timer. An attempt is
> > +  made to determine and compensate for this error.
> > +
> > +  @return The number of TSC counts per second.
> > +
> > +**/
> > +UINT64
> > +InternalCalculateTscFrequency (
> > +  VOID
> > +  );
> > +
> >  //
> >  // Cached performance counter frequency
> >  //
> > @@ -34,15 +54,8 @@ InternalGetPerformanceCounterFrequency (
> >    VOID
> >    )
> >  {
> > -  BOOLEAN  InterruptState;
> > -  UINT64   Count;
> > -
> >    if (mPerformanceCounterFrequency == 0) {
> > -    InterruptState = SaveAndDisableInterrupts ();
> > -    Count = GetPerformanceCounter ();
> > -    MicroSecondDelay (100);
> > -    mPerformanceCounterFrequency = MultU64x32 (GetPerformanceCounter () - Count, 10000);
> > -    SetInterruptState (InterruptState);
> > +    mPerformanceCounterFrequency = InternalCalculateTscFrequency ();
> >    }
> >    return mPerformanceCounterFrequency;
> >  }
> >