From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from rn-mailsvcp-ppex-lapp34.apple.com (rn-mailsvcp-ppex-lapp34.apple.com [17.179.253.43])
 by mx.groups.io with SMTP id smtpd.web11.9855.1618583672703872840
 for <devel@edk2.groups.io>;
 Fri, 16 Apr 2021 07:34:32 -0700
Authentication-Results: mx.groups.io;
 dkim=pass header.i=@apple.com header.s=20180706 header.b=sPhZiMFf;
 spf=pass (domain: apple.com, ip: 17.179.253.43, mailfrom: afish@apple.com)
Received: from pps.filterd (rn-mailsvcp-ppex-lapp34.rno.apple.com [127.0.0.1])
	by rn-mailsvcp-ppex-lapp34.rno.apple.com (8.16.1.2/8.16.1.2) with SMTP id 13GEXhVn012852;
	Fri, 16 Apr 2021 07:34:32 -0700
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=apple.com; h=from : message-id :
 content-type : mime-version : subject : date : in-reply-to : cc : to :
 references; s=20180706; bh=V10UptC5nK+BsVySUnmhBm0an5VHoRTmOZC5cAoXSAw=;
 b=sPhZiMFfx18UmRo/jKlu4oYdla8C4V69dyGlSMDXQwnR5C7XW5u/jQLyX79DItiZ7D3Y
 qPTAt/T7Mw4SoqQW11Py1Xm9pOJwBfBk7WJ0PH+vkdDFYzVo1g4QRzaxaZhN/yk3URJU
 Jc7/O53cpPnM9YbGRR+qBWkttXLN9qn3LKPZQfLXsDZOpbvbKL4byWGhGBT4WIYTwRAb
 SQLnBQdAi60YChLDfOvPPLHCfcvhntn/FyhiZu4gsY1y7mKBiz4g3kDgaxVXU7Bk4/72
 jKTj2I6g+/+vA54EvW+1yzYEy9bFJr5mfgXIdyqAJRwBD89M21yB3CqIEY6/R5Jy/mlt jg== 
Received: from rn-mailsvcp-mta-lapp04.rno.apple.com (rn-mailsvcp-mta-lapp04.rno.apple.com [10.225.203.152])
	by rn-mailsvcp-ppex-lapp34.rno.apple.com with ESMTP id 37u7v3ppae-7
	(version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NO);
	Fri, 16 Apr 2021 07:34:32 -0700
Received: from rn-mailsvcp-mmp-lapp01.rno.apple.com
 (rn-mailsvcp-mmp-lapp01.rno.apple.com [17.179.253.14])
 by rn-mailsvcp-mta-lapp04.rno.apple.com
 (Oracle Communications Messaging Server 8.1.0.7.20201203 64bit (built Dec  3
 2020)) with ESMTPS id <0QRN00A00V5IG1A0@rn-mailsvcp-mta-lapp04.rno.apple.com>;
 Fri, 16 Apr 2021 07:34:30 -0700 (PDT)
Received: from process_milters-daemon.rn-mailsvcp-mmp-lapp01.rno.apple.com by
 rn-mailsvcp-mmp-lapp01.rno.apple.com
 (Oracle Communications Messaging Server 8.1.0.7.20201203 64bit (built Dec  3
 2020)) id <0QRN00300UTKDZ00@rn-mailsvcp-mmp-lapp01.rno.apple.com>; Fri,
 16 Apr 2021 07:34:30 -0700 (PDT)
X-Va-A: 
X-Va-T-CD: 9ad46be6e1c3c1a24e92ea4dad46d58d
X-Va-E-CD: 4730c80ee67030d4f2c83e40b4ab0357
X-Va-R-CD: 6f0325faf294bd23a6d751c620be9d51
X-Va-CD: 0
X-Va-ID: 1d7b60dd-fdbf-43fe-bf6d-3c984d38c7ad
X-V-A: 
X-V-T-CD: 9ad46be6e1c3c1a24e92ea4dad46d58d
X-V-E-CD: 4730c80ee67030d4f2c83e40b4ab0357
X-V-R-CD: 6f0325faf294bd23a6d751c620be9d51
X-V-CD: 0
X-V-ID: d1eb756c-42fd-428a-b94f-2143d525b461
X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.391,18.0.761
 definitions=2021-04-16_07:2021-04-16,2021-04-16 signatures=0
Received: from [17.235.19.21] (unknown [17.235.19.21])
 by rn-mailsvcp-mmp-lapp01.rno.apple.com
 (Oracle Communications Messaging Server 8.1.0.7.20201203 64bit (built Dec  3
 2020))
 with ESMTPSA id <0QRN00VKMV5FAA00@rn-mailsvcp-mmp-lapp01.rno.apple.com>; Fri,
 16 Apr 2021 07:34:29 -0700 (PDT)
From: "Andrew Fish" <afish@apple.com>
Message-id: <2296BE7E-ACEE-4286-9A5C-408B2D1ADC2E@apple.com>
MIME-version: 1.0 (Mac OS X Mail 14.0 \(3654.20.0.2.1\))
Subject: Re: [edk2-devel] VirtIO Sound Driver (GSoC 2021)
Date: Fri, 16 Apr 2021 07:34:27 -0700
In-reply-to: <406e5bdb-f6aa-21ce-c96a-b16fb07c181d@posteo.de>
Cc: Ethin Probst <harlydavidsen@gmail.com>, Michael Brown <mcb30@ipxe.org>,
        Mike Kinney <michael.d.kinney@intel.com>,
        Leif Lindholm <leif@nuviainc.com>, Laszlo Ersek <lersek@redhat.com>,
        "Desimone, Nathaniel L" <nathaniel.l.desimone@intel.com>,
        Rafael Rodrigues Machado <rafaelrodrigues.machado@gmail.com>,
        Gerd Hoffmann <kraxel@redhat.com>
To: edk2-devel-groups-io <devel@edk2.groups.io>,
        =?utf-8?Q?Marvin_H=C3=A4user?= <mhaeuser@posteo.de>
References: 
 <CAJQtwF2aOTztmMOW-QFHovdFkoQHZnPqPxgSbKd+HfqeumD2Fw@mail.gmail.com>
 <BE3D8FA9-1BFE-43C3-B69F-38A44EA36ACE@apple.com>
 <16758FB6436B1195.32393@groups.io>
 <CAJQtwF0pcYzFHjNp5JWqNDecwcxuAM3_MbUkZ5FruBUGp=BSCw@mail.gmail.com>
 <A454E2B2-7569-443D-AADF-60384005BE47@apple.com>
 <CAJQtwF0wVsXvN3uHHHmWzHPTmh-Mkyei4mpc_bsaP21bMB9+PA@mail.gmail.com>
 <CO1PR11MB4929B44F6CC94FA661AA1055D24E9@CO1PR11MB4929.namprd11.prod.outlook.com>
 <B2260448-028D-4659-98D6-C695CF3D738A@apple.com>
 <CAJQtwF3-REoeETR46BKr0Q6=b6ZHALbeiBBFtp4fEdFxKz8gwA@mail.gmail.com>
 <4AEC1784-99AF-47EF-B7DD-77F91EA3D7E9@apple.com>
 <CAJQtwF2e4dACRVyibbLOmOEmy34xMdGb1s0YdPFcHmZ2NMoBDA@mail.gmail.com>
 <309cc5ca-2ecd-79dd-b183-eec0572ea982@ipxe.org>
 <A139650C-A76F-4471-AFCC-FFF1BE2E35BB@apple.com>
 <CAJQtwF3kuOD3C2arUfZu_xDbkHq5HHz+LYNB2=AeV8x+q_cPtw@mail.gmail.com>
 <CCB65CBC-304C-42B3-810D-0AC8BEAE29D1@apple.com>
 <CAJQtwF08Apihdsitw2Vs-+iV9rrqgpqkwONSbaY-yb_BP1xCYw@mail.gmail.com>
 <406e5bdb-f6aa-21ce-c96a-b16fb07c181d@posteo.de>
X-Mailer: Apple Mail (2.3654.20.0.2.1)
Content-type: multipart/alternative;
 boundary="Apple-Mail=_C411943F-CDA1-45E8-BB2A-B3625A4AA9FB"

--Apple-Mail=_C411943F-CDA1-45E8-BB2A-B3625A4AA9FB
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=utf-8


> On Apr 16, 2021, at 6:22 AM, Marvin H=C3=A4user <mhaeuser@posteo.de> wro=
te:
>=20
> Good day,
>=20
> Sorry for the nitpicking.
>=20
> - Protocols always need a "Revision" field as first member. This is used=
 to be able to expand its capabilities in later revisions without introduci=
ng a new, distinct protocol.
> - Consider the name EFI_SIMPLE_AUDIO_OUTPUT(!)_PROTOCOL, to not cause co=
nfusion if input is ever added. Input in my opinion should be a separate pr=
otocol as there is no reason why they would necessarily be coupled topology=
-wise (think of a USB microphone, it will never have any sort of output).
> - To make code safety a bit easier, try to use "CONST" for "IN" (non-OUT=
) pointers, so that CONST can be propagated where possible.
> - Please do *not* make the events caller-owned. We had it multiple times=
 already on production firmware that events are left dangling and may be po=
lled/signaled after ExitBS(). The caller should be able to decide on some p=
olicy maybe (i.e. abort or block on ExitBS() until the playback finished), =
as cut-off audio may be awkward; but the callee definitely should implement=
 "event safety" itself. Maybe avoid exposing events directly at all and pro=
vide nice abstractions the caller cannot misuse.
> - I don't think audio should be required at all, the required subset sho=
uld firstly consider minimalism and security. Accessibility will not be of =
concern for some IoT device, the audio code would simply eat space, and int=
roduce a larger surface for bugs.
>=20

Marvin,

Generally, the way we handle this in the UEFI Specification is to make it=
 optional via the following wording: =E2=80=9CIf a platform includes the=
 ability to play audio in EFI then the EFI_SIMPLE_AUDIO_OUTPUT_PROTOCOL=
 must be implemented.=E2=80=9D

Basically this requirement will get added to UEFI Specification 2.6.2 Plat=
form-Specific Elements.
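
Picking up the Revision and CONST points quoted above, here is a
minimal C sketch of what such a protocol could look like. This is only
an illustration under assumptions: the SIMPLE_AUDIO_OUTPUT names, the
revision values and the member list are hypothetical and are not taken
from the UEFI Specification or from EDK2, and plain C types stand in
for the EDK2 ones (UINT64, EFI_STATUS, CONST) so that it builds
stand-alone.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical revision values, chosen only for this sketch. */
#define SIMPLE_AUDIO_OUTPUT_REVISION_1 0x00010000u
#define SIMPLE_AUDIO_OUTPUT_REVISION_2 0x00020000u

typedef struct SIMPLE_AUDIO_OUTPUT SIMPLE_AUDIO_OUTPUT;

/* IN-only pointers are const so callers can pass read-only data. */
typedef int (*SIMPLE_AUDIO_OUTPUT_LOAD)(
  const SIMPLE_AUDIO_OUTPUT *This,
  const int16_t             *Samples,
  size_t                     SampleCount
  );

typedef int (*SIMPLE_AUDIO_OUTPUT_PLAY)(
  const SIMPLE_AUDIO_OUTPUT *This
  );

struct SIMPLE_AUDIO_OUTPUT {
  uint64_t                 Revision;   /* always the first member */
  SIMPLE_AUDIO_OUTPUT_LOAD LoadBuffer; /* present since revision 1 */
  SIMPLE_AUDIO_OUTPUT_PLAY Play;       /* present since revision 1 */
  /* Members added by later revisions are appended here. */
};

/* Nonzero when the instance provides at least MinRevision, so a
   caller knows whether members added in that revision exist. */
static int
AudioOutputHasRevision (
  const SIMPLE_AUDIO_OUTPUT *Protocol,
  uint64_t                   MinRevision
  )
{
  return Protocol && !(Protocol->Revision < MinRevision);
}
```

A consumer would call AudioOutputHasRevision() before touching any
member that a later revision appended, which is what lets the protocol
grow without a new GUID.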

Thanks,

Andrew Fish

> Best regards,
> Marvin
>=20
> On 16.04.21 01:42, Ethin Probst wrote:
>> Hi Andrew,
>>=20
>> What would that protocol interface look like if we utilized your idea?
>> With mine (though I need to add channel mapping as well), your
>> workflow for playing a stereo sound from left to right would probably
>> be something like this:
>> 1) Encode the sound using a standard tool into a Wave PCM 16.
>> 2) Place the Wave file in the Firmware Volume using a given UUID as
>> the name. As simple as editing the platform FDF file.
>> 3) Write some BDS code
>>   a) Lookup Wave file by UUID and read it into memory.
>>   b) Decode the audio file (audio devices will not do this decoding
>> for you, you have to do that yourself).
>>   c) Call EFI_AUDIO_PROTOCOL.LoadBuffer(), passing in the sample rate
>> of your audio, EFI_AUDIO_PROTOCOL_SAMPLE_FORMAT_S16 for signed 16-bit
>> PCM audio, the channel mapping, the number of samples, and the samples
>> themselves.
>>   d) call EFI_BOOT_SERVICES.CreateEvent()/EFI_BOOT_SERVICES.CreateEvent=
Ex()
>> for a playback event to signal.
>>   e) call EFI_AUDIO_PROTOCOL.StartPlayback(), passing in the event you
>> just created.
>> The reason that LoadBuffer() takes so many parameters is because the
>> device does not know the audio that you're passing in. If I'm given an
>> array of 16-bit audio samples, it's impossible to know the parameters
>> (sample rate, sample format, channel mapping, etc.) from that alone.
>> Using your idea, though, my protocol could be greatly simplified.
>> Forcing a particular channel mapping, sample rate and sample format on
>> everyone would complicate application code. From an application point
>> of view, one would, with that type of protocol, need to do the
>> following:
>> 1) Load an audio file in any audio file format from any storage mechani=
sm.
>> 2) Decode the audio file format to extract the samples and audio metada=
ta.
>> 3) Resample the (now decoded) audio samples and convert (quantize) the
>> audio samples into signed 16-bit PCM audio.
>> 4) forward the samples onto the EFI audio protocol.
>> There is another option. (I'm happy we're discussing this now -- we
>> can hammer out all the details now which will make a lot of things
>> easier.) Since I'll most likely end up splitting the device-specific
>> interfaces to different audio protocols, we could make a simple audio
>> protocol that makes various assumptions about the audio samples you're
>> giving it (e.g.: sample rate, format, ...). This would just allow
>> audio output and input in signed 16-bit PCM audio, as you've
>> suggested, and would be a simple and easy to use interface. Something
>> like:
>> typedef struct EFI_SIMPLE_AUDIO_PROTOCOL {
>>   EFI_SIMPLE_AUDIO_PROTOCOL_RESET Reset;
>>   EFI_SIMPLE_AUDIO_PROTOCOL_START Start;
>>   EFI_SIMPLE_AUDIO_PROTOCOL_STOP Stop;
>> } EFI_SIMPLE_AUDIO_PROTOCOL;
>> This way, users and driver developers have a simple audio protocol
>> they can implement if they like. It would assume signed 16-bit PCM
>> audio and mono channel mappings at 44100 Hz. Then, we can have an
>> advanced protocol for each device type (HDA, USB, VirtIO, ...) that
>> exposes all the knobs for sample formats, sample rates, that kind of
>> thing. Obviously, like the majority (if not all) UEFI protocols, these
>> advanced protocols would be optional. I think, however, that the
>> simple audio protocol should be a required protocol in all UEFI
>> implementations. But that might not be possible. So would this simpler
>> interface work as a starting point?
>>=20
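
To make the fixed-format idea above concrete: if the simple protocol
always assumes signed 16-bit PCM, mono, at 44100 Hz, then a frame count
is the only thing the caller has to supply, and buffer size and
playback time follow from it. A small C sketch, where the macro and
function names are hypothetical and exist only for this illustration:

```c
#include <stddef.h>
#include <stdint.h>

/* Fixed format assumed by the simple protocol proposed above. */
#define SIMPLE_AUDIO_SAMPLE_RATE_HZ 44100u
#define SIMPLE_AUDIO_CHANNELS       1u

/* Bytes needed to hold FrameCount frames of signed 16-bit PCM. */
static size_t
SimpleAudioBufferBytes (size_t FrameCount)
{
  return FrameCount * SIMPLE_AUDIO_CHANNELS * sizeof (int16_t);
}

/* Playback time in whole milliseconds for FrameCount frames. */
static uint64_t
SimpleAudioDurationMs (size_t FrameCount)
{
  return ((uint64_t)FrameCount * 1000u) / SIMPLE_AUDIO_SAMPLE_RATE_HZ;
}
```

With a variable-format interface these two helpers would need the
sample rate, sample format and channel count as extra arguments, which
is the trade-off being discussed in this thread.
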
>> On 4/15/21, Andrew Fish <afish@apple.com> wrote:
>>>=20
>>>> On Apr 15, 2021, at 1:11 PM, Ethin Probst <harlydavidsen@gmail.com>
>>>> wrote:
>>>>=20
>>>>> Is there any necessity for audio input and output to be implemented
>>>>> within the same protocol?  Unlike a network device (which is
>>>>> intrinsically bidirectional), it seems natural to conceptually separ=
ate
>>>>> audio input from audio output.
>>>> Nope, there isn't a necessity to make them in one, they can be
>>>> separated into two.
>>>>=20
>>>>> The code controlling volume/mute may not have any access to the samp=
le
>>>>> buffer.  The most natural implementation would seem to allow for a
>>>>> platform to notice volume up/down keypresses and use those to contro=
l the
>>>>> overall system volume, without any knowledge of which samples (if an=
y)
>>>>> are currently being played by other code in the system.
>>>> You're assuming that the audio device you're implementing the
>>>> volume/muting for has volume control and muting functionality within
>>>> itself, then.
>>> Not really. We are assuming that audio hardware has a better understan=
ding
>>> of how that system implements volume than some generic EFI Code that i=
s by
>>> definition platform agnostic.
>>>=20
>>>> This may not be the case, and so we'd need to
>>>> effectively simulate it within the driver, which isn't too hard to do=
.
>>>> As an example, the VirtIO driver does not have a request type for
>>>> muting or for volume control (this would, most likely, be within the
>>>> VIRTIO_SND_R_PCM_SET_PARAMS request, see sec. 5.14.6.4.3). Therefore,
>>>> either the driver would have to simulate the request or return
>>>> EFI_UNSUPPORTED in this instance.
>>>>=20
>>> So this is an example of above since the audio hardware knows it is ro=
uting
>>> the sound output into another subsystem, and that subsystem controls t=
he
>>> volume. So the VirtIO Sound Driver knows best how to abstract=
 volume/mute for
>>> this platform.
>>>=20
>>>>> Consider also the point of view of the developer implementing a driv=
er
>>>>> for some other piece of audio hardware that happens not to support
>>>>> precisely the same sample rates etc as VirtIO.  It would be extremel=
y
>>>>> ugly to force all future hardware to pretend to have the same
>>>>> capabilities as VirtIO just because the API was initially designed w=
ith
>>>>> VirtIO in mind.
>>>> Precisely, but the brilliance of VirtIO
>>> The brilliance of VirtIO is that it just needs to implement a generic =
device
>>> driver for a given operating system. In most cases these operating sys=
tems
>>> have sound subsystems that manage sound and want fine granularity of
>>> control on what is going on. So the drivers are implemented to maximiz=
e
>>> flexibility since the OS has lots of generic code that deals with soun=
d, and
>>> even user configurable knobs to control audio. In our case that extra =
layer
>>> does not exist in EFI and the end user code just wants to tell the=
 driver to do
>>> some simple things.
>>>=20
>>> Maybe it is easier to think about with an example. Let's say I want=
 to play a
>>> cool sound on every boot. What would be the workflow to make that=
 happen?
>>> 1) Encode the sound using a standard tool into a Wave PCM 16.
>>> 2) Place the Wave file in the Firmware Volume using a given UUID as th=
e
>>> name. As simple as editing the platform FDF file.
>>> 3) Write some BDS code
>>>   a) Lookup Wave file by UUID and read it into memory.
>>>   b) Point the EFI Sound Protocol at the buffer with the Wave file
>>>   c) Tell the EFI Sound Protocol to play the sound.
>>>=20
>>> If you start adding in a lot of parameters that workflow starts=
 getting
>>> really complicated really quickly.
>>>=20
>>>> is that the sample rate,
>>>> sample format, &c., do not have to all be supported by a VirtIO
>>>> device. Notice, also, how in my protocol proposal I noted that the
>>>> sample rates, at least, were "recommended," not "required." Should a
>>>> device not happen to support a sample rate or sample format, all it
>>>> needs to do is return EFI_INVALID_PARAMETER. Section 5.14.6.2.1
>>>> (VIRTIO_SND_R_JACK_GET_CONFIG) describes how a jack tells you what
>>>> sample rates it supports, channel mappings, &c.
>>>>=20
>>>> I do understand how just using a predefined sample rate and sample
>>>> format might be a good idea, and if that's the best way, then that's
>>>> what we'll do. The protocol can always be revised at a later time if
>>>> necessary. I apologize if my stance seems obstinate.
>>>>=20
>>> I think if we add the version into the protocol and make sure we have =
a
>>> separate load and play operation we could add a member to set the extr=
a
>>> parameters if needed. There might also be some platform specific gener=
ic
>>> tunables that might be interesting for a future member function.
>>>=20
>>> Thanks,
>>>=20
>>> Andrew Fish
>>>=20
>>>> Also, thank you, Laszlo, for your advice -- I hadn't considered that =
a
>>>> network driver would be another good way of figuring out how async
>>>> works in UEFI.
>>>>=20
>>>> On 4/15/21, Andrew Fish <afish@apple.com> wrote:
>>>>>=20
>>>>>> On Apr 15, 2021, at 5:07 AM, Michael Brown <mcb30@ipxe.org> wrote:
>>>>>>=20
>>>>>> On 15/04/2021 06:28, Ethin Probst wrote:
>>>>>>> - I hoped to add recording in case we in future want to add
>>>>>>> accessibility aids like speech recognition (that was one of the to=
do
>>>>>>> tasks on the EDK2 tasks list)
>>>>>> Is there any necessity for audio input and output to be implemented
>>>>>> within
>>>>>> the same protocol?  Unlike a network device (which is intrinsically
>>>>>> bidirectional), it seems natural to conceptually separate audio inp=
ut
>>>>>> from
>>>>>> audio output.
>>>>>>=20
>>>>>>> - Muting and volume control could easily be added by just replacin=
g
>>>>>>> the sample buffer with silence and by multiplying all the samples =
by a
>>>>>>> percentage.
>>>>>> The code controlling volume/mute may not have any access to the sam=
ple
>>>>>> buffer.  The most natural implementation would seem to allow for a
>>>>>> platform to notice volume up/down keypresses and use those to contr=
ol
>>>>>> the
>>>>>> overall system volume, without any knowledge of which samples (if a=
ny)
>>>>>> are
>>>>>> currently being played by other code in the system.
>>>>>>=20
>>>>> I=E2=80=99ve also thought of adding an NVRAM variable that would let
>>>>> setup, the UEFI Shell, or even the OS set the current volume and
>>>>> mute. This concept of how it would be consumed is why I proposed
>>>>> mute and volume being separate APIs. The volume up/down API in
>>>>> addition to a fixed percentage might be overkill, but it does allow
>>>>> a non-linear mapping to the volume up/down keys. You would be
>>>>> surprised how picky audiophiles can be, and it seems they like to
>>>>> file Bugzillas.
>>>>>=20
>>>>>>> - Finally, the reason I used enumerations for specifying parameter=
s
>>>>>>> like sample rate and stuff was that I was looking at this protocol
>>>>>>> from a general UEFI applications point of view. VirtIO supports al=
l of
>>>>>>> the sample configurations listed in my gist, and it seems reasonab=
le
>>>>>>> to allow the application to control those parameters instead of
>>>>>>> forcing a particular parameter configuration onto the developer.
>>>>>> Consider also the point of view of the developer implementing a dri=
ver
>>>>>> for
>>>>>> some other piece of audio hardware that happens not to support
>>>>>> precisely
>>>>>> the same sample rates etc as VirtIO.  It would be extremely ugly to
>>>>>> force
>>>>>> all future hardware to pretend to have the same capabilities as Vir=
tIO
>>>>>> just because the API was initially designed with VirtIO in mind.
>>>>>>=20
>>>>>> As a developer on the other side of the API, writing code to play s=
ound
>>>>>> files on an arbitrary unknown platform, I would prefer to simply
>>>>>> consume
>>>>>> as simple as possible an abstraction of an audio output protocol an=
d
>>>>>> not
>>>>>> have to care about what hardware is actually implementing it.
>>>>>>=20
>>>>> It may make sense to have an API to load the buffer/stream and other=
 APIs
>>>>> to
>>>>> play or pause. This could allow an optional API to configure how the
>>>>> stream
>>>>> is played back. If we add a version to the Protocol that would at le=
ast
>>>>> future proof us.
>>>>>=20
>>>>> We did get feedback that it is very common to speed up the audio=
 playback
>>>>> rates for accessibility. I=E2=80=99m not sure if that is practical w=
ith a simple
>>>>> PCM
>>>>> 16 wave file with the firmware audio implementation. I guess that is
>>>>> something we could investigate.
>>>>>=20
>>>>> In terms of maybe adding text to speech there is an open source proj=
ect
>>>>> that
>>>>> conceptually we could port to EFI. It would likely be a binary that
>>>>> would
>>>>> have to live on the EFI System Partition due to size. I was thinking
>>>>> that
>>>>> words per minute could be part of that API and it would produce a PC=
M 16
>>>>> wave file that the audio protocol we are discussing could play.
>>>>>=20
>>>>>> Both of these argue in favour of defining a very simple API that
>>>>>> expresses
>>>>>> only a common baseline capability that is plausibly implementable f=
or
>>>>>> every piece of audio hardware ever made.
>>>>>>=20
>>>>>> Coupled with the relatively minimalistic requirements for boot-time
>>>>>> audio,
>>>>>> I'd probably suggest supporting only a single format for audio data=
,
>>>>>> with
>>>>>> a fixed sample rate (and possibly only mono output).
>>>>>>=20
>>>>> In my world the folks that work for Jony asked for a stereo boot bon=
g to
>>>>> transition from left to right :). This is not the CODEC you are look=
ing
>>>>> for
>>>>> was our response=E2=80=A6. I also did not mention that some language=
s are right
>>>>> to
>>>>> left, as the only thing worse than one complex thing is two complex
>>>>> things
>>>>> to implement.
>>>>>=20
>>>>>> As always: perfection is achieved, not when there is nothing more t=
o
>>>>>> add,
>>>>>> but when there is nothing left to take away.  :)
>>>>>>=20
>>>>> =E2=80=9CSimplicity is the ultimate sophistication=E2=80=9D
>>>>>=20
>>>>> Thanks,
>>>>>=20
>>>>> Andrew Fish
>>>>>=20
>>>>>> Thanks,
>>>>>>=20
>>>>>> Michael
>>>>>>=20
>>>>>>=20
>>>>>>=20
>>>>>>=20
>>>>>>=20
>>>>>=20
>>>>=20
>>>> --
>>>> Signed,
>>>> Ethin D. Probst
>>>=20
>>=20
>=20
>=20
>=20
>=20


--Apple-Mail=_C411943F-CDA1-45E8-BB2A-B3625A4AA9FB
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=utf-8

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html; charset=
=
=3Dutf-8"></head><body style=3D"word-wrap: break-word; -webkit-nbsp-mode: =
space; line-break: after-white-space;" class=3D""><br class=3D""><div><br c=
lass=3D""><blockquote type=3D"cite" class=3D""><div class=3D"">On Apr 16, 2=
021, at 6:22 AM, Marvin H=C3=A4user &lt;<a href=3D"mailto:mhaeuser@posteo.d=
e" class=3D"">mhaeuser@posteo.de</a>&gt; wrote:</div><br class=3D"Apple-int=
erchange-newline"><div class=3D""><meta charset=3D"UTF-8" class=3D""><span =
style=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px=
; font-style: normal; font-variant-caps: normal; font-weight: normal; lette=
r-spacing: normal; text-align: start; text-indent: 0px; text-transform: non=
e; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; =
text-decoration: none; float: none; display: inline !important;" class=3D""=
>Good day,</span><br style=3D"caret-color: rgb(0, 0, 0); font-family: Helve=
tica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-=
weight: normal; letter-spacing: normal; text-align: start; text-indent: 0px=
; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-tex=
t-stroke-width: 0px; text-decoration: none;" class=3D""><br style=3D"caret-=
color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: n=
ormal; font-variant-caps: normal; font-weight: normal; letter-spacing: norm=
al; text-align: start; text-indent: 0px; text-transform: none; white-space:=
 normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration=
: none;" class=3D""><span style=3D"caret-color: rgb(0, 0, 0); font-family: =
Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; =
font-weight: normal; letter-spacing: normal; text-align: start; text-indent=
: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webki=
t-text-stroke-width: 0px; text-decoration: none; float: none; display: inli=
ne !important;" class=3D"">Sorry for the nitpicking.</span><br style=3D"car=
et-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style=
: normal; font-variant-caps: normal; font-weight: normal; letter-spacing: n=
ormal; text-align: start; text-indent: 0px; text-transform: none; white-spa=
ce: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decorat=
ion: none;" class=3D""><br style=3D"caret-color: rgb(0, 0, 0); font-family:=
 Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal;=
 font-weight: normal; letter-spacing: normal; text-align: start; text-inden=
t: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webk=
it-text-stroke-width: 0px; text-decoration: none;" class=3D""><span style=
=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; fo=
nt-style: normal; font-variant-caps: normal; font-weight: normal; letter-sp=
acing: normal; text-align: start; text-indent: 0px; text-transform: none; w=
hite-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text=
-decoration: none; float: none; display: inline !important;" class=3D"">- P=
rotocols always need a "Revision" field as first member. This is used to be=
 able to expand its capabilities in later revisions without introducing a n=
ew, distinct protocol.</span><br style=3D"caret-color: rgb(0, 0, 0); font-f=
amily: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: n=
ormal; font-weight: normal; letter-spacing: normal; text-align: start; text=
-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px;=
 -webkit-text-stroke-width: 0px; text-decoration: none;" class=3D""><span s=
tyle=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px;=
 font-style: normal; font-variant-caps: normal; font-weight: normal; letter=
-spacing: normal; text-align: start; text-indent: 0px; text-transform: none=
; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; t=
ext-decoration: none; float: none; display: inline !important;" class=3D"">=
- Consider the name EFI_SIMPLE_AUDIO_OUTPUT(!)_PROTOCOL, to not cause confu=
sion if input is ever added. Input in my opinion should be a separate proto=
col as there is no reason why they would necessarily be coupled topology-wi=
se (think of a USB microphone, it will never have any sort of output).</sp=
an><br style=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font-siz=
e: 12px; font-style: normal; font-variant-caps: normal; font-weight: normal=
; letter-spacing: normal; text-align: start; text-indent: 0px; text-transfo=
rm: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width=
: 0px; text-decoration: none;" class=3D""><span style=3D"caret-color: rgb(0=
, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-=
variant-caps: normal; font-weight: normal; letter-spacing: normal; text-ali=
gn: start; text-indent: 0px; text-transform: none; white-space: normal; wor=
d-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; floa=
t: none; display: inline !important;" class=3D"">- To make code safety a bi=
t easier, try to use "CONST" for "IN" (non-OUT) pointers, so that CONST can=
 be propagated where possible.</span><br style=3D"caret-color: rgb(0, 0, 0)=
; font-family: Helvetica; font-size: 12px; font-style: normal; font-variant=
-caps: normal; font-weight: normal; letter-spacing: normal; text-align: sta=
rt; text-indent: 0px; text-transform: none; white-space: normal; word-spaci=
ng: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;" class=3D""=
><span style=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font-siz=
e: 12px; font-style: normal; font-variant-caps: normal; font-weight: normal=
; letter-spacing: normal; text-align: start; text-indent: 0px; text-transfo=
rm: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width=
: 0px; text-decoration: none; float: none; display: inline !important;" cla=
ss=3D"">- Please do *not* make the events caller-owned. We had it multiple =
times already on production firmware that events are left dangling and may =
be polled/signaled after ExitBS(). The caller should be able to decide on s=
ome policy maybe (i.e. abort or block on ExitBS() until the playback finish=
ed), as cut-off audio may be awkward; but the callee definitely should impl=
ement "event safety" itself. Maybe avoid exposing events directly at all an=
d provide nice abstractions the caller cannot misuse.</span><br style=3D"ca=
ret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-styl=
e: normal; font-variant-caps: normal; font-weight: normal; letter-spacing: =
normal; text-align: start; text-indent: 0px; text-transform: none; white-sp=
ace: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decora=
tion: none;" class=3D""><span style=3D"caret-color: rgb(0, 0, 0); font-fami=
ly: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: norm=
al; font-weight: normal; letter-spacing: normal; text-align: start; text-in=
dent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -w=
ebkit-text-stroke-width: 0px; text-decoration: none; float: none; display: =
inline !important;" class=3D"">- I don't think audio should be required at =
all, the required subset should firstly consider minimalism and security. A=
ccessibility will not be of concern for some IoT device, the audio code wou=
ld simply eat space, and introduce a larger surface for bugs.</span><br sty=
le=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; f=
ont-style: normal; font-variant-caps: normal; font-weight: normal; letter-s=
pacing: normal; text-align: start; text-indent: 0px; text-transform: none; =
white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; tex=
t-decoration: none;" class=3D""><br style=3D"caret-color: rgb(0, 0, 0); fon=
t-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps=
: normal; font-weight: normal; letter-spacing: normal; text-align: start; t=
ext-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0=
px; -webkit-text-stroke-width: 0px; text-decoration: none;" class=3D""></di=
v></blockquote><div><br class=3D""></div><div>Marvin,</div><div><br class=
=3D""></div><div>Generally, the way we handle this in the UEFI=
 Specification is to make it optional via the following wording:=
 =E2=80=9CIf a platform includes the ability to play audio in EFI then the=
 EFI_SIMPLE_AUDIO_OUTPUT_PROTOCOL must be implemented.=E2=80=9D</div><div>=
<br class=3D""></div><div>Basically=
 this requirement will get added to UEFI Specification 2.6.2 Platform-Speci=
fic Elements.</div><div><br class=3D""></div><div>Thanks,</div><div><br cla=
ss=3D""></div><div>Andrew Fish</div><br class=3D""><blockquote type=3D"cite=
" class=3D""><div class=3D""><span style=3D"caret-color: rgb(0, 0, 0); font=
-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps:=
 normal; font-weight: normal; letter-spacing: normal; text-align: start; te=
xt-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0p=
x; -webkit-text-stroke-width: 0px; text-decoration: none; float: none; disp=
lay: inline !important;" class=3D"">Best regards,</span><br style=3D"caret-=
color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: n=
ormal; font-variant-caps: normal; font-weight: normal; letter-spacing: norm=
al; text-align: start; text-indent: 0px; text-transform: none; white-space:=
 normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration=
: none;" class=3D""><span style=3D"caret-color: rgb(0, 0, 0); font-family: =
Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; =
font-weight: normal; letter-spacing: normal; text-align: start; text-indent=
: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webki=
t-text-stroke-width: 0px; text-decoration: none; float: none; display: inli=
ne !important;" class=3D"">Marvin</span><br style=3D"caret-color: rgb(0, 0,=
 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-vari=
ant-caps: normal; font-weight: normal; letter-spacing: normal; text-align: =
start; text-indent: 0px; text-transform: none; white-space: normal; word-sp=
acing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;" class=
=3D""><br style=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font=
-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: no=
rmal; letter-spacing: normal; text-align: start; text-indent: 0px; text-tra=
nsform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-w=
idth: 0px; text-decoration: none;" class=3D""><span style=3D"caret-color: r=
gb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; f=
ont-variant-caps: normal; font-weight: normal; letter-spacing: normal; text=
-align: start; text-indent: 0px; text-transform: none; white-space: normal;=
 word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; =
float: none; display: inline !important;" class=3D"">On 16.04.21 01:42, Eth=
in Probst wrote:</span><br style=3D"caret-color: rgb(0, 0, 0); font-family:=
 Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal;=
 font-weight: normal; letter-spacing: normal; text-align: start; text-inden=
t: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webk=
it-text-stroke-width: 0px; text-decoration: none;" class=3D""><blockquote t=
ype=3D"cite" style=3D"font-family: Helvetica; font-size: 12px; font-style: =
normal; font-variant-caps: normal; font-weight: normal; letter-spacing: nor=
mal; orphans: auto; text-align: start; text-indent: 0px; text-transform: no=
ne; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-size=
-adjust: auto; -webkit-text-stroke-width: 0px; text-decoration: none;" clas=
s=3D"">Hi Andrew,<br class=3D""><br class=3D"">What would that protocol int=
erface look like if we utilized your idea?<br class=3D"">With mine (though =
I need to add channel mapping as well), your<br class=3D"">workflow for pla=
ying a stereo sound from left to right would probably<br class=3D"">be some=
thing like this:<br class=3D"">1) Encode the sound using a standard tool in=
to a Wave PCM 16.<br class=3D"">2) Place the Wave file in the Firmware Volu=
me using a given UUID as<br class=3D"">the name. As simple as editing the p=
latform FDF file.<br class=3D"">3) Write some BDS code<br class=3D"">&nbsp;=
<span class=3D"Apple-converted-space">&nbsp;</span>a) Lookup Wave file by U=
UID and read it into memory.<br class=3D"">&nbsp;<span class=3D"Apple-conve=
rted-space">&nbsp;</span>b) Decode the audio file (audio devices will not d=
o this decoding<br class=3D"">for you, you have to do that yourself).<br cl=
ass=3D"">&nbsp;<span class=3D"Apple-converted-space">&nbsp;</span>c) Call E=
FI_AUDIO_PROTOCOL.LoadBuffer(), passing in the sample rate<br class=3D"">of=
 your audio, EFI_AUDIO_PROTOCOL_SAMPLE_FORMAT_S16 for signed 16-bit<br clas=
s=3D"">PCM audio, the channel mapping, the number of samples, and the sampl=
es<br class=3D"">themselves.<br class=3D"">&nbsp;&nbsp;d) call EFI_BOOT_SER=
VICES.CreateEvent()/EFI_BOOT_SERVICES.CreateEventEx()<br class=3D"">for a p=
layback event to signal.<br class=3D"">&nbsp;&nbsp;e) call EFI_AUDIO_PROTOC=
OL.StartPlayback(), passing in the event you<br class=3D"">just created.<br=
 class=3D"">The reason that LoadBuffer() takes so many parameters is becaus=
e the<br class=3D"">device does not know the audio that you're passing in.=
 If I'm given an<br class=3D"">array of 16-bit audio samples, it's=
 impossible t=
o know the parameters<br class=3D"">(sample rate, sample format, channel ma=
pping, etc.) from that alone.<br class=3D"">Using your idea, though, my pro=
tocol could be greatly simplified.<br class=3D"">Forcing a particular chann=
el mapping, sample rate and sample format on<br class=3D"">everyone would c=
omplicate application code. From an application point<br class=3D"">of view=
, one would, with that type of protocol, need to do the<br class=3D"">follo=
wing:<br class=3D"">1) Load an audio file in any audio file format from any=
 storage mechanism.<br class=3D"">2) Decode the audio file format to extrac=
t the samples and audio metadata.<br class=3D"">3) Resample the (now decode=
d) audio samples and convert (quantize) the<br class=3D"">audio samples int=
o signed 16-bit PCM audio.<br class=3D"">4) forward the samples onto the EF=
I audio protocol.<br class=3D"">There is another option. (I'm happy we're d=
iscussing this now -- we<br class=3D"">can hammer out all the details now w=
hich will make a lot of things<br class=3D"">easier.) Since I'll most likel=
y end up splitting the device-specific<br class=3D"">interfaces to differen=
t audio protocols, we could make a simple audio<br class=3D"">protocol that=
 makes various assumptions about the audio samples you're<br class=3D"">=
givin=
g it (e.g.: sample rate, format, ...). This would just allow<br class=3D"">=
audio output and input in signed 16-bit PCM audio, as you've<br class=3D"">=
suggested, and would be a simple and easy to use interface. Something<br cl=
ass=3D"">like:<br class=3D"">typedef struct EFI_SIMPLE_AUDIO_PROTOCOL {<br =
class=3D"">&nbsp;&nbsp;EFI_SIMPLE_AUDIO_PROTOCOL_RESET Reset;<br class=3D""=
>&nbsp;&nbsp;EFI_SIMPLE_AUDIO_PROTOCOL_START Start;<br class=3D"">&nbsp;&nb=
sp;EFI_SIMPLE_AUDIO_PROTOCOL_STOP Stop;<br class=3D"">} EFI_SIMPLE_AUDIO_PR=
OTOCOL;<br class=3D"">This way, users and driver developers have a simple a=
udio protocol<br class=3D"">they can implement if they like. It would assum=
e signed 16-bit PCM<br class=3D"">audio and mono channel mappings at 44100 =
Hz. Then, we can have an<br class=3D"">advanced protocol for each device ty=
pe (HDA, USB, VirtIO, ...) that<br class=3D"">exposes all the knobs for sam=
ple formats, sample rates, that kind of<br class=3D"">thing. Obviously, lik=
e the majority (if not all) UEFI protocols, these<br class=3D"">advanced pr=
otocols would be optional. I think, however, that the<br class=3D"">simple =
audio protocol should be a required protocol in all UEFI<br class=3D"">impl=
ementations. But that might not be possible. So would this simpler<br class=
=3D"">interface work as a starting point?<br class=3D""><br class=3D"">On =
4/15/21, Andrew Fish &lt;<a href=3D"mailto:afish@apple.com" class=3D"">afis=
h@apple.com</a>&gt; wrote:<br class=3D""><blockquote type=3D"cite" class=3D=
""><br class=3D""><blockquote type=3D"cite" class=3D"">On Apr 15, 2021, at =
1:11 PM, Ethin Probst &lt;<a href=3D"mailto:harlydavidsen@gmail.com" class=
=3D"">harlydavidsen@gmail.com</a>&gt;<br class=3D"">wrote:<br class=3D""><=
br class=3D""><blockquote type=3D"cite" class=3D"">Is there any necessity f=
or audio input and output to be implemented<br class=3D"">within the same p=
rotocol? &nbsp;Unlike a network device (which is<br class=3D"">intrinsicall=
y bidirectional), it seems natural to conceptually separate<br class=3D"">a=
udio input from audio output.<br class=3D""></blockquote>Nope, there isn't =
a necessity to make them in one, they can be<br class=3D"">separated into t=
wo.<br class=3D""><br class=3D""><blockquote type=3D"cite" class=3D"">The c=
ode controlling volume/mute may not have any access to the sample<br class=
=3D"">buffer. &nbsp;The most natural implementation would seem to allow fo=
r a<br class=3D"">platform to notice volume up/down keypresses and use thos=
e to control the<br class=3D"">overall system volume, without any knowledge=
 of which samples (if any)<br class=3D"">are currently being played by othe=
r code in the system.<br class=3D""></blockquote>You're assuming that the=
 audio device you're implementing the<br class=3D"">volume/muting has=
 volume cont=
rol and muting functionality within<br class=3D"">itself, then.<br class=3D=
""></blockquote>Not really. We are assuming that audio hardware has a bette=
r understanding<br class=3D"">of how that system implements volume than som=
e generic EFI Code that is by<br class=3D"">definition platform agnostic.<b=
r class=3D""><br class=3D""><blockquote type=3D"cite" class=3D"">This may n=
ot be the case, and so we'd need to<br class=3D"">effectively simulate it w=
ithin the driver, which isn't too hard to do.<br class=3D"">As an example, =
the VirtIO driver does not have a request type for<br class=3D"">muting or =
for volume control (this would, most likely, be within the<br class=3D"">VI=
RTIO_SND_R_PCM_SET_PARAMS request, see sec. 5.14.6.4.3). Therefore,<br clas=
s=3D"">either the driver would have to simulate the request or return<br cl=
ass=3D"">EFI_UNSUPPORTED in this instance.<br class=3D""><br class=3D"">=
</bloc=
kquote>So this is an example of above since the audio hardware knows it is =
routing<br class=3D"">the sound output into another subsystem, and that sub=
system controls the<br class=3D"">volume. So the VirtIo Sound Driver knows=
 best how to abstract volume/mute for<br class=3D"">this platform.<br=
 class=3D=
""><br class=3D""><blockquote type=3D"cite" class=3D""><blockquote type=3D"=
cite" class=3D"">Consider also the point of view of the developer implement=
ing a driver<br class=3D"">for some other piece of audio hardware that happ=
ens not to support<br class=3D"">precisely the same sample rates etc as Vir=
tIO. &nbsp;It would be extremely<br class=3D"">ugly to force all future har=
dware to pretend to have the same<br class=3D"">capabilities as VirtIO just=
 because the API was initially designed with<br class=3D"">VirtIO in mind.<=
br class=3D""></blockquote>Precisely, but the brilliance of VirtIO<br class=
=3D""></blockquote>The brilliance of VirtIO is that it just needs to imple=
ment a generic device<br class=3D"">driver for a given operating system. In=
 most cases these operating systems<br class=3D"">have sound subsystems th=
at manage sound and want fine granularity of<br class=3D"">control on what =
is going on. So the drivers are implemented to maximize<br class=3D"">flex=
ibility since the OS has lots of generic code that deals with sound, and<br=
 class=3D"">even user configurable knobs to control audio. In our case that=
 extra layer<br class=3D"">does not exist in EFI and the end user code just=
 wants to tell the driver to do<br class=3D"">some simple things.<br=
 class=3D""=
><br class=3D"">Maybe it is easier to think about with an example. Let's=
 say I want to play a<br class=3D"">cool sound on every boot. What would be=
 the workflow to make that happen?<br class=3D"">1) Encode the sound using=
 a sta=
ndard tool into a Wave PCM 16.<br class=3D"">2) Place the Wave file in the =
Firmware Volume using a given UUID as the<br class=3D"">name. As simple as =
editing the platform FDF file.<br class=3D"">3) Write some BDS code<br clas=
s=3D"">&nbsp;&nbsp;a) Lookup Wave file by UUID and read it into memory.<br =
class=3D"">&nbsp;&nbsp;b) Point the EFI Sound Protocol at the buffer with t=
he Wave file<br class=3D"">&nbsp;&nbsp;c) Tell the EFI Sound Protocol to pl=
ay the sound.<br class=3D""><br class=3D"">If you start adding in a lot of =
parameters that workflow starts getting<br class=3D"">really complicated r=
eally quickly.<br class=3D""><br class=3D""><blockquote type=3D"cite" class=
=3D"">is that the sample rate,<br class=3D"">sample format, &amp;c., do no=
t have to all be supported by a VirtIO<br class=3D"">device. Notice, also, =
how in my protocol proposal I noted that the<br class=3D"">sample rates, at=
 least, were "recommended," not "required." Should a<br class=3D"">device n=
ot happen to support a sample rate or sample format, all it<br class=3D"">n=
eeds to do is return EFI_INVALID_PARAMETER. Section 5.14.6.2.1<br class=3D"=
">(VIRTIO_SND_R_JACK_GET_CONFIG) describes how a jack tells you what<br cla=
ss=3D"">sample rates it supports, channel mappings, &amp;c.<br class=3D""><=
br class=3D"">I do understand how just using a predefined sample rate and s=
ample<br class=3D"">format might be a good idea, and if that's the best way=
, then that's<br class=3D"">what we'll do. The protocol can always be revis=
ed at a later time if<br class=3D"">necessary. I apologize if my stance see=
ms obstinate.<br class=3D""><br class=3D""></blockquote>I think if we add t=
he version into the protocol and make sure we have a<br class=3D"">separate=
 load and play operation we could add a member to set the extra<br class=3D=
"">parameters if needed. There might also be some platform specific generic=
<br class=3D"">tunables that might be interesting for a future member funct=
ion.<br class=3D""><br class=3D"">Thanks,<br class=3D""><br class=3D"">Andr=
ew Fish<br class=3D""><br class=3D""><blockquote type=3D"cite" class=3D"">A=
lso, thank you, Laszlo, for your advice -- I hadn't considered that a<br cl=
ass=3D"">network driver would be another good way of figuring out how async=
<br class=3D"">works in UEFI.<br class=3D""><br class=3D"">On 4/15/21, Andr=
ew Fish &lt;<a href=3D"mailto:afish@apple.com" class=3D"">afish@apple.com</=
a>&gt; wrote:<br class=3D""><blockquote type=3D"cite" class=3D""><br class=
=3D""><blockquote type=3D"cite" class=3D"">On Apr 15, 2021, at 5:07 AM, Mi=
chael Brown &lt;<a href=3D"mailto:mcb30@ipxe.org" class=3D"">mcb30@ipxe.org=
</a>&gt; wrote:<br class=3D""><br class=3D"">On 15/04/2021 06:28, Ethin Pro=
bst wrote:<br class=3D""><blockquote type=3D"cite" class=3D"">- I hoped to =
add recording in case we in future want to add<br class=3D"">accessibility =
aids like speech recognition (that was one of the todo<br class=3D"">tasks =
on the EDK2 tasks list)<br class=3D""></blockquote>Is there any necessity f=
or audio input and output to be implemented<br class=3D"">within<br class=
=3D"">the same protocol? &nbsp;Unlike a network device (which is intrinsic=
ally<br class=3D"">bidirectional), it seems natural to conceptually separat=
e audio input<br class=3D"">from<br class=3D"">audio output.<br class=3D"">=
<br class=3D""><blockquote type=3D"cite" class=3D"">- Muting and volume con=
trol could easily be added by just replacing<br class=3D"">the sample buffe=
r with silence and by multiplying all the samples by a<br class=3D"">percen=
tage.<br class=3D""></blockquote>The code controlling volume/mute may not h=
ave any access to the sample<br class=3D"">buffer. &nbsp;The most natural i=
mplementation would seem to allow for a<br class=3D"">platform to notice vo=
lume up/down keypresses and use those to control<br class=3D"">the<br class=
=3D"">overall system volume, without any knowledge of which samples (if an=
y)<br class=3D"">are<br class=3D"">currently being played by other code in =
the system.<br class=3D""><br class=3D""></blockquote>I=E2=80=99ve also tho=
ught of adding an NVRAM variable that would let setup, UEFI<br class=3D"">=
Shel=
l,<br class=3D"">or even the OS set the current volume, and Mute. This how =
it would be<br class=3D"">consumed concept is why I proposed mute and volum=
e being separate APIs.<br class=3D"">The<br class=3D"">volume up/down API i=
n addition to fixed percentage might be overkill, but<br class=3D"">it<br c=
lass=3D"">does allow a non-linear mapping to the volume up/down keys. You=
 wo=
uld be<br class=3D"">surprised how picky audiophiles can be and it seems th=
ey like to file<br class=3D"">Bugzillas.<br class=3D""><br class=3D""><bloc=
kquote type=3D"cite" class=3D""><blockquote type=3D"cite" class=3D"">- Fina=
lly, the reason I used enumerations for specifying parameters<br class=3D""=
>like sample rate and stuff was that I was looking at this protocol<br clas=
s=3D"">from a general UEFI applications point of view. VirtIO supports all =
of<br class=3D"">the sample configurations listed in my gist, and it seems =
reasonable<br class=3D"">to allow the application to control those paramete=
rs instead of<br class=3D"">forcing a particular parameter configuration on=
to the developer.<br class=3D""></blockquote>Consider also the point of vie=
w of the developer implementing a driver<br class=3D"">for<br class=3D"">so=
me other piece of audio hardware that happens not to support<br class=3D"">=
precisely<br class=3D"">the same sample rates etc as VirtIO. &nbsp;It would=
 be extremely ugly to<br class=3D"">force<br class=3D"">all future hardware=
 to pretend to have the same capabilities as VirtIO<br class=3D"">just beca=
use the API was initially designed with VirtIO in mind.<br class=3D""><br c=
lass=3D"">As a developer on the other side of the API, writing code to play=
 sound<br class=3D"">files on an arbitrary unknown platform, I would prefer=
 to simply<br class=3D"">consume<br class=3D"">as simple as possible an abs=
traction of an audio output protocol and<br class=3D"">not<br class=3D"">ha=
ve to care about what hardware is actually implementing it.<br class=3D""><=
br class=3D""></blockquote>It may make sense to have an API to load the buf=
fer/stream and other APIs<br class=3D"">to<br class=3D"">play or pause. Thi=
s could allow an optional API to configure how the<br class=3D"">stream<br =
class=3D"">is played back. If we add a version to the Protocol that would a=
t least<br class=3D"">future proof us.<br class=3D""><br class=3D"">We did =
get feedback that it is very common to speed up the audio playback<br class=
=3D"">rates for accessibility. I=E2=80=99m not sure if that is practical w=
ith a simple<br class=3D"">PCM<br class=3D"">16 wave file with the firmware=
 audio implementation. I guess that is<br class=3D"">something we could inv=
estigate.<br class=3D""><br class=3D"">In terms of maybe adding text to spe=
ech there is an open source project<br class=3D"">that<br class=3D"">concep=
tually we could port to EFI. It would likely be a binary that<br class=3D""=
>would<br class=3D"">have to live on the EFI System Partition due to size. =
I was thinking<br class=3D"">that<br class=3D"">words per minute could be p=
art of that API and it would produce a PCM 16<br class=3D"">wave file that =
the audio protocol we are discussing could play.<br class=3D""><br class=3D=
""><blockquote type=3D"cite" class=3D"">Both of these argue in favour of de=
fining a very simple API that<br class=3D"">expresses<br class=3D"">only a =
common baseline capability that is plausibly implementable for<br class=3D"=
">every piece of audio hardware ever made.<br class=3D""><br class=3D"">Cou=
pled with the relatively minimalistic requirements for boot-time<br class=
=3D"">audio,<br class=3D"">I'd probably suggest supporting only a single f=
ormat for audio data,<br class=3D"">with<br class=3D"">a fixed sample rate =
(and possibly only mono output).<br class=3D""><br class=3D""></blockquote>=
In my world the folks that work for Jony asked for a stereo boot bong to<br=
 class=3D"">transition from left to right :). This is not the CODEC you are=
 looking<br class=3D"">for<br class=3D"">was our response=E2=80=A6. I also =
did not mention that some languages are right<br class=3D"">to<br class=3D"=
">left, as the only thing worse than one complex thing is two complex<br cl=
ass=3D"">things<br class=3D"">to implement.<br class=3D""><br class=3D""><b=
lockquote type=3D"cite" class=3D"">As always: perfection is achieved, not w=
hen there is nothing more to<br class=3D"">add,<br class=3D"">but when ther=
e is nothing left to take away. &nbsp;:)<br class=3D""><br class=3D""></blo=
ckquote>"Simplicity is the ultimate sophistication"<br class=3D""><=
br class=3D"">Thanks,<br class=3D""><br class=3D"">Andrew Fish<br class=3D"=
"><br class=3D""><blockquote type=3D"cite" class=3D"">Thanks,<br class=3D""=
><br class=3D"">Michael<br class=3D""><br class=3D""><br class=3D""><br cla=
ss=3D""><br class=3D""><br class=3D""></blockquote><br class=3D""></blockqu=
ote><br class=3D"">--<br class=3D"">Signed,<br class=3D"">Ethin D. Probst<b=
r class=3D""></blockquote><br class=3D""></blockquote><br class=3D""></bloc=
kquote><br style=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; font=
-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: no=
rmal; letter-spacing: normal; text-align: start; text-indent: 0px; text-tra=
nsform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-w=
idth: 0px; text-decoration: none;" class=3D""><br style=3D"caret-color: rgb=
(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; fon=
t-variant-caps: normal; font-weight: normal; letter-spacing: normal; text-a=
lign: start; text-indent: 0px; text-transform: none; white-space: normal; w=
ord-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;" c=
lass=3D""><br style=3D"caret-color: rgb(0, 0, 0); font-family: Helvetica; f=
ont-size: 12px; font-style: normal; font-variant-caps: normal; font-weight:=
 normal; letter-spacing: normal; text-align: start; text-indent: 0px; text-=
transform: none; white-space: normal; word-spacing: 0px; -webkit-text-strok=
e-width: 0px; text-decoration: none;" class=3D""><span style=3D"caret-color=
: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal=
; font-variant-caps: normal; font-weight: normal; letter-spacing: normal; t=
ext-align: start; text-indent: 0px; text-transform: none; white-space: norm=
al; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: non=
e; float: none; display: inline !important;" class=3D""></span></div></bloc=
kquote></div><br class=3D""></body></html>

--Apple-Mail=_C411943F-CDA1-45E8-BB2A-B3625A4AA9FB--