[MPEG-OTSPEC] New AHG mandates and other news!

William_J_G Overington wjgo_10009 at btinternet.com
Thu May 13 12:12:17 CEST 2021


To "'MPEG OT Spec list'" <mpeg-otspec at lists.aau.at>

Hello Peter
Thank you for taking the time to explain. I appreciate your help.

> But it isn’t really practical from a linguistic perspective. It 
> assumes that semantic propositions can be defined that can always be 
> used as translation pivots between two arbitrary languages, and that 
> those can be formulated independent  of any larger discourse context.

I was unfamiliar with the term "translation pivots" as such.

https://en.wikipedia.org/wiki/Pivot_language

However, upon investigation I found that I a> m familiar with the 
concept, just not under that name.

> It’s also not scoped: Adapting examples from your slides, I might want 
> to send a message to the hotel saying, “If my dog has hair rather than 
> fur, and doesn’t shed, will I be able to keep my dog in your hotel 
> rooms?” There  are an unbounded number of semantic propositions 
> someone might want to communicate in your hotel scenario. There’s no 
> clear way to scope.

Yes, exactly.

I have long recognised that. I have always been suggesting that 
localizable sentences can be used for communication through the language 
barrier in some particular circumstances. I have not advocated the 
invention for every possible use.

The slide presentation is about finding information through the language 
barrier after a disaster. I opine that the invention would be good for 
that application.

This limitation of not encoding every possible sentence is applied upon 
in Chapter 10 of my first novel.

http://www.users.globalnet.co.uk/~ngo/localizable_sentences_the_novel_chapter_010.pdf

However, emoji cannot depict every possible image that can exist, yet 
emoji are being encoded for some pictures.

Also, clip art cannot depict every possible image that can exist, yet 
clip art collections can be useful.

I have used clip art. For example in the slide presentation that we have 
been discussing.

  >> I am just told that it is 'out of scope'…

> That’s because Unicode is an encoding for _characters / text 
> elements_, not semantic propositions.

Thank you for explaining.

Yet what about U+1F91F?

https://www.unicode.org/charts/PDF/Unicode-13.0/U130-1F900.pdf

There are glyphs for the localizable sentences. There seems to me to be 
no reason why sequences to use those glyphs interoperably with those 
sequences including tag characters could not be reasonably formally 
accepted into regular Unicode, thereby providing robustness and formal 
encoding in a non-proprietary manner such that they could be used 
without problem or uncertainty as to provenance or intellectual property 
rights by any individual or organization that chooses to use them.

It would seem that adding a note in the code chart explaining the 
meaning and its context would not be unreasonable.

Now maybe I am wrong about that, and there is some reason of which I am 
not aware.

However, I opine that the way to settle the matter is to have that 
formally and fairly assessed by the Unicode Technical Committee, with 
reasons given if the idea is refused.

That would seem to be the fair way and the professional way to do 
things.

For example, the sentence "It is spring." would have the context of "A 
season of the year".

I remember at one stage, though not now, an online translation facility 
translated into French "It is spring." to "C'est source." as in a spring 
of water from a hillside rather than "C'est le printemps." as desired. 
So the context is important as a guide to the human translator producing 
a sentence.dat file.

> If the idea is that the display strings for a the set of localizable 
> sentences is resident in the receiving device, then that’s an instance 
> of something that is frequently done:  it’s very common for software 
> running on devices to have access to lots of static strings to display 
> messages to a user. However, fonts are not the storage mechanism used 
> for such strings, and software vendors definitely would not want to 
> start using fonts  as a new way to store static strings for 
> user-interface messages.

So maybe the way to do localizable sentences is to use the following 
format that I devised. The format is such that it uses a text file that 
can be produced using the Microsoft WordPad program. WordPad is bundled 
with the Windows operating system so is widely available without needing 
to use additional software to produce or study the file.

http://www.users.globalnet.co.uk/~ngo/The_Format_of_the_sentence_dot_dat_files_for_use_in_Research_on_Communication_through_the_Language_Barrier_using_encoded_Localizable_Sentences.pdf

http://www.users.globalnet.co.uk/~ngo/localizable_sentences_research.htm

William Overington

Thursday 13 May 2021


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.aau.at/pipermail/mpeg-otspec/attachments/20210513/0b8c1de5/attachment.html>


More information about the mpeg-otspec mailing list