[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

VMs: Re: Paradigms Regained



On Saturday 19 Oct 2002 2:57 am, Dennis wrote: 
> I must be dense. Could you be more specific? E.g.,
> French, Italian, etc. syllables don't have the 
> paradigmatic nature of Voynichese words, don't have the 
> relative small number of endings that lead one to think 
> of Chinese, have too many/too few open syllables, or 
> something I can't think of?

This is not the same problem as the type of language according to the number 
of characters. There is a nice table in Coe's "Breaking the Maya code", 
Chapter 1 (p 43 in the edition I have). There he shows that writing systems 
with 20-36 characters they are alphabetic or consonantal, between 40-87 they 
are "pure" syllabic, and for hundreds of them (the smallest is 497 to >5000) 
they are logographic. What we are talking here is to encode syllabically (?) 
a language which is not  normally written in a syllabic script.
In EVA, the vms has about 8200 words. I think that it would be extremely 
difficult to write (and read) a text this way, but this may be irrelevant as 
we do not know how long it took the author to write the book and if anybody 
ever read it back.

One would need to know: How many syllables is one expected to find in a text 
of the size of the vms (assuming that it is written is language X)? I have no 
idea of what this number may be (in any language), but I have the feeling 
that 8200 is a bit too large.

Anyway, let's suppose that we are set to do a "syllable encoding" from 
scratch. One could start by making a table of the syllables with the 
"encoding" as one goes along the text. Also let's assume that each "Voynich 
word" (i.e. each syllable) is represented as a number that increases with 
each new entry (yes I know that this is assuming too much, but bear with me). 
The more common syllables would have their entries early in the table (small 
numbers) and rarer ones would appear later on (larger numbers).

There must be some relation between the length of the text and the structure 
of the table since by Zipf's law we know that words have different 
frequencies and therefore syllables will have different frequencies too. I 
think that the earlier parts of the text would have more common syllables 
(small numbers) while the final parts of the text would have more entries 
from the final part of the table (larger numbers). (I have the feeling that 
this would be somewhat similar to the structure of the table in the Ziv and 
Lempel's universal algorithm for sequential data compression (Jim R?)).

So would looking at the rate of appearance of new syllables (=Voynich words) 
across the text and comparing this to known texts written as words and as 
syllables reveal anything? Of course this implies that the dissecting of 
words into syllables was done systematically, and still one should have to 
explain what the character sequences and the labels (syllables on their own?) 
stand for... 
Sorry if this was too long.

Cheers,

Gabriel