[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: VMs: RE: question
On Friday 28 Feb 2003 1:35 am, GC wrote:
> If the VMS is not encoded, I don't see how entropy itself can
> offer an explanation for the oddities it contains.
The characteristics gathered so far are there independently of what may think
about how the vms was produced and any solution to the vms must explain why
those statistical properties exist. That is, I think, a very important point.
There is no easy test to check whether the the document is a cipher or an
abbreviated language or just a random sequence of characters.
> And when calculating entropy, what does one use as the recognized
> glyphset? Is <sh> actually one glyph, or two? Is <ccc> a single
> glyph, a combination of two, or three individual glyphs? Surely
> one's interpretation of what constitutes a glyph would have a
> noticeable effect on the numbers generated by any program.
That's why in the "counting commas" page (posted the link yesterday) the vms
was translated into most (all?) the available V-alphabets. If one can think
of another alphabet, then the easiest thing to do is to create a bitrans
table and run the same programme again. If it is not possible to do a bitrans
translation, then another transcription is necessary.
> After establishing in ones mind what constitutes a glyph, conduct
> a survey of the use of each glyph, noting variants from the norm.
This is much easier said than done.
Is VV=W? or \/\/?
Is A=/-\? or /\ and -, or even /- and \?
Is LL a letter or 2 letters? (it may be 2 in English, but it is 1 in Spanish,
and same with CH).
Solfi's JSA does something like that. One could try to agglomerate strokes
into likely sets, but so far I am not sure how one can go about it. Any ideas
are welcome.
Cheers,
Gabriel
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list