[mpeg-OTspec] Proposal: Specify UTF encoding of Unicode strings

Dave Pawson dave.pawson at gmail.com
Mon Nov 28 20:44:04 CET 2011


On 28 November 2011 18:35, Sairus Patel <sppatel at adobe.com> wrote:
> Dave,
>
> The proposal seeks simply to clarify the specification.
>
> On the other hand, you seem to be talking about extending the specification to allow for more UTF encoding options for 'name' table strings.

Agreed, ASCII or no specified encoding, isn't much good these days,
but IMHO requiring utf-16 is too much.

I'm simply saying that a single encoding is too strict, too limiting.
for 80%, utf-8 may suffice, why constrain users?
Hence allow users to use their own encodings, but simply
require a declaration of that encoding.


regards DaveP


>
> Sairus
>
> -----Original Message-----
> From: Dave Pawson [mailto:dave.pawson at gmail.com]
> Sent: Wednesday, November 23, 2011 10:14 PM
> To: Sairus Patel
> Cc: mpeg-OTspec at yahoogroups.com; opentype-migration-list at indx.co.uk
> Subject: Re: [mpeg-OTspec] Proposal: Specify UTF encoding of Unicode strings
>
> On 24 November 2011 00:05, Sairus Patel <sppatel at adobe.com> wrote:
>
>>
>> === Proposal { my comments are in curly brackets }
>>
>> { In http://www.microsoft.com/typography/otspec/name.htm [OFF sec.
>> 5.2.6]: }
>>
>> 1. { Insert the following sentence at the end of the paragraph
>> "Unicode platform encoding ID 5 can be used for encodings in the
>> 'cmap' table but not for strings in the 'name' table.": }
>>
>> Strings for all Unicode platform encoding IDs other than 5 must be encoded in UTF-16 (big endian).
>>
>> 2. { Insert the following paragraphs at the end of the "Windows
>> platform-specific encoding IDs (platform ID= 3)" section: }
>>
>> Strings for Windows platform encoding ID 0 are considered to have Unicode semantics (UCS-2).
>>
>> Strings for Windows platform encoding IDs 0, 1, and 10 must be encoded in UTF-16 (big endian).
>
>
>
> Disagree. Specifying the encoding is a pain when done this way and too expensive.
>
> Specify the encoding used, don't require a particular one.
> Allow iso-8859 or  utf-8, just let the reader know what it is.
>
> regards
>
>
> --
> Dave Pawson
> XSLT XSL-FO FAQ.
> Docbook FAQ.
> http://www.dpawson.co.uk
>



-- 
Dave Pawson
XSLT XSL-FO FAQ.
Docbook FAQ.
http://www.dpawson.co.uk



More information about the mpeg-otspec mailing list