Text-mode apps (Was :Who are the "spacists"?)

Mikhail V mikhailwas at gmail.com
Sat Apr 1 21:33:26 EDT 2017


On 2 April 2017 at 02:01, Chris Angelico <rosuav at gmail.com> wrote:
> On Sun, Apr 2, 2017 at 9:25 AM, Mikhail V <mikhailwas at gmail.com> wrote:
>> On 2 April 2017 at 00:22, Chris Angelico <rosuav at gmail.com> wrote:
>>> On Sun, Apr 2, 2017 at 8:16 AM, Mikhail V <mikhailwas at gmail.com> wrote:
>>>> For multiple-alphabet rendering I will use some
>>>> custom text format, e.g. with tags
>>>> <s="Voynich"> ... </s>, and for latin
>>>> <s="Latin">...</s> and etc.
>>>>
>>>> Simple and effective.
>>>
>>> For multi-alphabet rendering, I would rather use an even simpler
>>> format: Remove the tags and use a consistent encoding.
>>
>> No, flat encoding would not be simpler, it would be simpler only and only
>> if you take a text with several alphabets, and mix the data randomly.
>> In real situation, data chunks that use different glyph sets for
>> representation are not mixed in a random manner.
>> Also for different processing purposes tagged structure will be way
>> more effective, e.g. if I want to extract all chunks in alphabet A
>> in a single list with strings, or use advanced search, etc.
>
> https://github.com/Rosuav/LetItTrans/blob/master/25%20languages.srt
>
> Not exactly random, but that's a single file, a single document, using
> characters from several different scripts. And this is far from the
> only case of this sort of thing happening.
>

So you propose that I automatically convert this content
to tagged format? Heh, no thanks, and this does not
refer to the above discussed problem.
If you will make a publishing layout of multi-lingual
document, you will have to do that manually anyway.
(however e.g. in Adobe InDesign there is automatical
assistance for this task, which tries to guess the language of chunks)

So this speaks merely against spawning of such content
than towards unicode.



More information about the Python-list mailing list