From: "Gao, Liming"
To: Sean Brogan
CC: "edk2-devel@lists.01.org"
Date: Thu, 8 Nov 2018 06:00:50 +0000
Subject: Re: Edk2 uni file encoding
Message-ID: <4A89E2EF3DFEDB4C8BFDE51014F606A14E366631@SHSMSX104.ccr.corp.intel.com>

Sean:
  The EDK II UNI spec (https://github.com/tianocore/tianocore.github.io/wiki/EDK-II-Specifications), Chapter 2, defines the UNI file format. EdkCompatibilityPkg is obsolete. BZ https://bugzilla.tianocore.org/show_bug.cgi?id=1103 has been submitted to delete EdkCompatibilityPkg from edk2/master. We will work on it.

The spec says:

  EDK II Unicode files are used for mapping token names to localized strings
  that are identified by an RFC4646 language code. The format for storing
  EDK II Unicode files on disk is UTF-8 (without a BOM character) or
  UTF-16LE (with a BOM character). The character content must be UCS-2.

Thanks
Liming

>-----Original Message-----
>From: edk2-devel [mailto:edk2-devel-bounces@lists.01.org] On Behalf Of
>Sean Brogan via edk2-devel
>Sent: Thursday, November 08, 2018 7:00 AM
>To: edk2-devel@lists.01.org
>Subject: [edk2] Edk2 uni file encoding
>
>Is there a definitive answer for the file encoding for all UNI files in edk2?
>If not I would like to propose one. Incorrect encoding causes tool issues and is
>something we can easily check for and fix.
>
>Proposal: All UNI files in edk2 should be
>
>  1. UTF-8
>Or
>  2. Use a BOM and be UTF-16
>
>https://en.wikipedia.org/wiki/Byte_order_mark
>
>Results from searching edk2:
>1 - UTF-16 LE BOM file:
>EdkCompatibilityPkg\Compatibility\FrameworkHiiOnUefiHiiThunk\Strings.uni
>919 - Without BOM and decoded as UTF-8
>
>Thoughts?
>
>Future question: Can we make rule for all other standard file types (c, h, dec,
>dsc, fdf, inf,)?
>
>Thanks
>Sean
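
For reference, the rule quoted from the UNI spec above (UTF-8 without a BOM, or UTF-16LE with a BOM, with UCS-2 character content) is simple enough to check mechanically, which is the kind of check Sean asks about. The following is only a rough sketch, not part of BaseTools or any existing edk2 script: the function name `classify_uni_bytes` is hypothetical, and "UCS-2" is interpreted here as "no code point above U+FFFF", which is an assumption about how the spec's wording should be enforced.

```python
import codecs


def classify_uni_bytes(data: bytes) -> str:
    """Return 'utf-8' or 'utf-16le' if data follows the quoted rule.

    Hypothetical helper (not an edk2 API). Raises ValueError, or its
    subclass UnicodeDecodeError, when the bytes break the rule.
    """
    if data.startswith(codecs.BOM_UTF16_LE):
        # UTF-16LE is allowed only together with its BOM.
        text = data[len(codecs.BOM_UTF16_LE):].decode("utf-16-le")
        encoding = "utf-16le"
    elif data.startswith(codecs.BOM_UTF16_BE):
        raise ValueError("UTF-16BE is not one of the allowed encodings")
    elif data.startswith(codecs.BOM_UTF8):
        raise ValueError("UTF-8 UNI files must not start with a BOM")
    else:
        # No BOM: the content has to be valid UTF-8.
        text = data.decode("utf-8")
        encoding = "utf-8"

    # Assumption: "UCS-2 content" means every character fits in 16 bits.
    for ch in text:
        if ord(ch) > 0xFFFF:
            raise ValueError(f"U+{ord(ch):X} is outside UCS-2")
    return encoding


# Illustrative sample content only; not taken from a real edk2 file.
sample = '#string STR_HELLO #language en-US "Hello"\n'
print(classify_uni_bytes(sample.encode("utf-8")))
print(classify_uni_bytes(codecs.BOM_UTF16_LE + sample.encode("utf-16-le")))
```

To scan a source tree, one could call the function on `path.read_bytes()` for every `*.uni` file and report the paths that raise.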