From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received-SPF: Pass (sender SPF authorized) identity=mailfrom; client-ip=66.187.233.73; helo=mx1.redhat.com; envelope-from=lersek@redhat.com; receiver=edk2-devel@lists.01.org Received: from mx1.redhat.com (mx3-rdu2.redhat.com [66.187.233.73]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ml01.01.org (Postfix) with ESMTPS id BDD592243692F for ; Fri, 23 Feb 2018 01:15:14 -0800 (PST) Received: from smtp.corp.redhat.com (int-mx05.intmail.prod.int.rdu2.redhat.com [10.11.54.5]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 50330EAEB2; Fri, 23 Feb 2018 09:21:15 +0000 (UTC) Received: from lacos-laptop-7.usersys.redhat.com (ovpn-120-112.rdu2.redhat.com [10.10.120.112]) by smtp.corp.redhat.com (Postfix) with ESMTP id 0BC4EB3006; Fri, 23 Feb 2018 09:21:13 +0000 (UTC) To: Pankaj Bansal , Leif Lindholm Cc: "michael.d.kinney@intel.com" , "edk2-devel@lists.01.org" , "ard.biesheuvel@linaro.org" References: <1518771035-6733-1-git-send-email-meenakshi.aggarwal@nxp.com> <1518771035-6733-2-git-send-email-meenakshi.aggarwal@nxp.com> <20180221154601.nkbp2xmy3zb2xolm@bivouac.eciton.net> <20180221185818.arwfhombntutnt23@bivouac.eciton.net> <20180222115223.xtfpc7du22drfkju@bivouac.eciton.net> <2a1fa56f-98db-a1c1-d973-7e84cc7dc1fa@redhat.com> From: Laszlo Ersek Message-ID: <946bd4f7-ed57-018c-00ca-cee154fcb2f0@redhat.com> Date: Fri, 23 Feb 2018 10:21:13 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0 MIME-Version: 1.0 In-Reply-To: X-Scanned-By: MIMEDefang 2.79 on 10.11.54.5 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.1]); Fri, 23 Feb 2018 09:21:15 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.1]); Fri, 23 Feb 2018 09:21:15 +0000 (UTC) for IP:'10.11.54.5' DOMAIN:'int-mx05.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'lersek@redhat.com' RCPT:'' Subject: Re: [PATCH edk2-platforms 01/39] Silicon/NXP: Add support for Big Endian Mmio APIs X-BeenThere: edk2-devel@lists.01.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: EDK II Development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 23 Feb 2018 09:15:15 -0000 Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit On 02/23/18 09:40, Pankaj Bansal wrote: > Hi All > >> -----Original Message----- >> From: edk2-devel [mailto:edk2-devel-bounces@lists.01.org] On Behalf Of >> Laszlo Ersek >> Sent: Thursday, February 22, 2018 7:26 PM >> To: Leif Lindholm >> Cc: michael.d.kinney@intel.com; edk2-devel@lists.01.org; >> ard.biesheuvel@linaro.org >> Subject: Re: [edk2] [PATCH edk2-platforms 01/39] Silicon/NXP: Add support >> for Big Endian Mmio APIs >> >> On 02/22/18 12:52, Leif Lindholm wrote: >>> On Thu, Feb 22, 2018 at 09:34:05AM +0100, Laszlo Ersek wrote: >> >>>>> But that brings back the complication as to how we have a driver >>>>> that needs an LE IO library to write output, and a BE IO library to >>>>> manipulate the hardware. >>>> >>>> Can you please explain the "write output" use case more precisely? >>>> >>>> My thinking would be this: >>>> >>>> - Use the IoLib class directly for "writing output" in little endian >>>> byte order (which is still unclear to me sorry). >>> >>> If the IoLib class is mapped to a an instance that byte-swaps (hereto >>> referred to as BeIoLib if IoLibSwap is unsuitable), would we not then >>> end up mapping the non-swapping, currently implemented in >>> BaseLibIoIntrinsic, variant as BeIoLib? Or if not, do we end up >>> needing to duplicated all IoLib implementation .infs to provide an >>> IoLib and a BeIoLib for each? >>> >>> It's at that point I burst an aneurysm. >>> Am I overthinking/underthinking this? >> >> We need two library classes, one for talking to LE devices and another to BE >> devices. These should be usable in a given module at the same time, as Ard >> says. >> >> Both library classes need to work on both LE and BE CPUs (working from your >> suggestion that UEFI might grow BE CPU support at some point). >> Whether that is implemented by dumb, separate library instances (yielding in >> total 2*2=4 library instances), or by smart, CPU-endianness-agnostic library >> instances (in total, 2), is a different question. >> >> Note that such "smarts" could be less than trivial to implement: >> - check CPU endianness in each library API? >> - or check in the lib constructor only, and flip some function pointers? >> - use a dynamic PCD for caching CPU endianness? >> - use a HOB for the same? >> - use a lib global variable (for caching only on the module level)? >> >> I think the solution that saves the most on the *source* code size is: >> - introduce the BeIoLib class >> - duplicate the MMIO functions from BaseIoLibIntrinsic to the one >> BeIoLib instance that we introduce >> - modify the MMIO functions in *both* lib instances (original LE, and >> new BE), like this: >> >> - If the CPU architecture is known to be bound to a single endianness, >> then hardcode the appropriate operation. This can be done with >> preprocessor macros, or with the architecture support of INF files / >> separate source files. For example, on IA32 and X64, the IoLib >> instance should work transparently, unconditionally, and the BeIoLib >> instance should byte-swap, unconditionally. >> >> - On other CPU arches, all the wider-than-byte MMIO functions, in >> *both* lib instances should do something like this: >> >> // >> // at file scope >> // >> STATIC CONST UINT16 mOne = 1; >> >> // >> // at function scope >> // >> if (*(CONST UINT8 *)&mOne == 1) { >> // >> // CPU in LE mode: >> // - work transparently in the IoLib instance >> // - byte-swap in the BeIoLib instance >> // >> } else { >> // >> // CPU in BE mode: >> // - byte-swap in the IoLib instance >> // - work transparently in the BeIoLib instance >> // >> } > > I suggest this approach : > > 1. Add BeMmio* functions in existing IoLib. BeMmio* functions will swap the input before write and swap output after read and so on. > Mmio* functions will not perform any byte swapping > 2. create second instance (a copy) of this IoLib for CPUs that are Big Endian. We can call it BigEndianIoLib. > In this library Mmio* functions will swap the input before write and swap output after read and so on. > BeMmio* functions will not perform any byte swapping. > 3. Include the instance of IoLib in dsc file based on cpu endianness that the platform wants to use. i.e. > If BIG_ENDIAN == FALSE > IoLib | ..\..\..\IoLib > Else > IoLib | ..\..\..\BigEndianIoLib > 4. The devices that are Big endian in platform will always call BeMmio* functions. They need not check CPU endianness. > 5. The devices that are Little endian in platform will always call Mmio* functions. They need not check CPU endianness. This can work too, but there is a downside: a large number of IoLib instances exist in the tree already. If you add the BeMmio* functions to the existent IoLib class, you'll have to duplicate the implementation to all instances (identically, I think). We've had this debate in the past. Back then it was about IoFifo routines. I argued for an IoFifo lib class. Ultimately the IoFifo routines were added to IoLib, and they had to be implemented for many more library instances than client code would have actually required. (See the series at 13a50a6fe1dc..2b631390f9f5.) In turn this runs the risk of adding untested code. Regarding the instances for BE CPUs: the name should likely be BaseIoLibBigEndian or something similar. In lib instance names, the lib class name is usually prefixed with the firmware phases where the instance is usable, and hints about the implementation or constraints are added as a suffix. Also, if you want to support BE CPUs with separate IoLib instances, I'm afraid that's going to lead to the combinatorial explosion that Leif characterized as "burst[ing] an aneurysm". I think using the (seemingly dynamic) "mOne" approach I suggested above, a smart compiler can eliminate all the branches at build time. Anyway, I don't insist; I just commented on an RFC. Laszlo