<!DOCTYPE html>
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>I proposed that to the OT developer list a long while ago, and I
recall that Kamal had a similar idea, initially in terms of
handling Unicode decompositions such that fonts would not need
precomposed diacritics. At the time, Microsoft thought it unlikely
to get traction, as it implied significant engineering for unclear
benefit, but perhaps the benefit is clearer now? As you say, being
able to decompose a Unicode character to an arbitrary sequence of
glyphs is very useful for Arabic, and bypasses the need to handle
such decompositions in GSUB prior to other shaping. <br>
</p>
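<p>To make the idea concrete, here is a minimal sketch of a
character-to-glyph-sequence lookup. It is only an illustration: no such
cmap subtable format exists in OpenType today, and the table names
(<code>SINGLE_GLYPH_CMAP</code>, <code>SEQUENCE_CMAP</code>), the
function <code>map_text</code>, and all glyph names are assumptions
invented for the example.</p>

```python
# Hypothetical illustration only: OpenType has no such cmap subtable today.
# All glyph names and table names below are invented for the example.

# Conventional cmap behaviour: one character maps to exactly one glyph.
SINGLE_GLYPH_CMAP = {
    0x0061: "a",
    0x0627: "alef-ar",
}

# Proposed-style behaviour: one character maps to a sequence of glyphs,
# so the font needs no dedicated glyph for the precomposed character.
SEQUENCE_CMAP = {
    0x00E9: ["e", "acutecomb"],                # LATIN SMALL LETTER E WITH ACUTE
    0x0628: ["behDotless-ar", "dotbelow-ar"],  # ARABIC LETTER BEH
}


def map_text(text):
    """Map each character to glyph names, preferring the sequence mapping."""
    glyphs = []
    for ch in text:
        cp = ord(ch)
        if cp in SEQUENCE_CMAP:
            glyphs.extend(SEQUENCE_CMAP[cp])
        elif cp in SINGLE_GLYPH_CMAP:
            glyphs.append(SINGLE_GLYPH_CMAP[cp])
        else:
            glyphs.append(".notdef")
    return glyphs


print(map_text("\u00e9a"))  # ['e', 'acutecomb', 'a']
```

<p>In this sketch GSUB would receive the already-decomposed glyph run,
which is the point of doing the mapping before shaping.</p>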
<p>I suppose the question is whether there is a significant benefit
to doing this outside of GSUB or, indeed, whether there might be a
reason it would be preferable in GSUB?</p>
<p>The inconsistency in dot handling in different joining forms of
some Arabic characters means that one doesn’t always want to
decompose those characters up front to base grapheme and combining
dots, but they could be excluded from the cmap-level decomposition
and passed to GSUB for decomposition in the joining form features.
But that being the case, why not do it all in GSUB?</p>
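<p>A rough sketch of that split, assuming a two-stage model: Farsi yeh
(U+06CC) carries dots only in its initial and medial forms, so it is
left out of the up-front mapping and decomposed per joining form, while
beh (U+0628) is decomposed up front. The table names, the function
<code>shape</code>, and all glyph names are hypothetical, and the
joining-form substitution of the beh base glyph is omitted.</p>

```python
# Hypothetical sketch; glyph names and the two-stage split are assumptions.

# Stage 1: up-front character-to-glyph-sequence mapping (cmap-like).
UP_FRONT = {
    0x0628: ["behDotless-ar", "dotbelow-ar"],  # beh: dotted in every joining form
}

# Characters deliberately excluded from stage 1 keep a single glyph.
SINGLE = {
    0x06CC: "yehFarsi-ar",  # Farsi yeh: dots only in initial and medial forms
}

# Stage 2: decomposition per joining form (GSUB-like), keyed by (glyph, form).
FORM_DECOMPOSITION = {
    ("yehFarsi-ar", "isol"): ["yehDotless-ar"],
    ("yehFarsi-ar", "fina"): ["yehDotless-ar.fina"],
    ("yehFarsi-ar", "init"): ["behDotless-ar.init", "twodotsbelow-ar"],
    ("yehFarsi-ar", "medi"): ["behDotless-ar.medi", "twodotsbelow-ar"],
}


def shape(cp, form):
    """Return glyph names for one character in the given joining form."""
    if cp in UP_FRONT:
        # Decomposed before shaping; the joining form is not applied here.
        return list(UP_FRONT[cp])
    glyph = SINGLE[cp]
    # Fall back to the undecomposed glyph if no rule matches the form.
    return FORM_DECOMPOSITION.get((glyph, form), [glyph])


print(shape(0x06CC, "isol"))
print(shape(0x06CC, "init"))
```

<p>The second table is doing exactly what a GSUB lookup would do, which
is what prompts the question of whether the first table needs to live
outside GSUB at all.</p>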
<p>JH<br>
</p>
<p><br>
</p>
<div class="moz-cite-prefix">On 2023-12-12 9:04 am, Behdad Esfahbod
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CAF63+7VT+JYBK77wEJRQui-NRM_+ee+1H4cjsurX86eOC9oJOQ@mail.gmail.com">
<div dir="ltr">Thank you everyone for the very productive meeting.
<div><br>
</div>
<div>I would also like to bring this issue up. If there is interest, I
can work on it. I wrote in my reply to Peter earlier:</div>
<div><br>
</div>
<div>
<div><i>This reminds me of another idea we discussed in, I
think, 2019, from Monotype to introduce a `cmap` subtable
that would map individual characters to sequences of
glyphs. Then the pre-composed Unicode characters wouldn't
need to have their own glyphs. Back then we dropped the
idea for backwards-compat reasons. But maybe we can pick
it up now?</i></div>
<span class="gmail-im" style="color:rgb(80,0,80)">
<div><br>
</div>
<div>This is very useful for Arabic as well...</div>
<div><br>
</div>
</span>
<div>
<div dir="ltr" class="gmail_signature"
data-smartmail="gmail_signature">behdad<br>
<a href="http://behdad.org/" target="_blank"
moz-do-not-send="true" class="moz-txt-link-freetext">http://behdad.org/</a></div>
</div>
</div>
</div>
<br>
</blockquote>
<pre class="moz-signature" cols="72">--
John Hudson
Tiro Typeworks Ltd <a class="moz-txt-link-abbreviated" href="http://www.tiro.com">www.tiro.com</a>
Tiro Typeworks is physically located on islands
in the Salish Sea, on the traditional territory
of the Snuneymuxw and Penelakut First Nations.</pre>
</body>
</html>