
RE: VMs: Sukhotin Vowel IDs



Thanks, everyone.  I think I understand Sukhotin's algorithm now.  It
partitions the characters into two groups ("vowels" and "consonants") so
as to maximize the number of times a character in one group occurs next
to a character in the other group, or equivalently, to minimize the
number of times a character appears next to another character in the
same group.
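
The partition described above can be sketched in Python. This is a minimal sketch of the greedy procedure usually attributed to Sukhotin, not a transcription of his papers. The function names, the word-list input, and the choice to ignore doubled letters are assumptions made for illustration. Every letter starts as a consonant, and the consonant with the largest positive adjacency sum is promoted to vowel until none remains. Each promotion raises the cross-group count by that sum, but a greedy search like this is not guaranteed to reach the maximum.

```python
from collections import defaultdict


def sukhotin_vowels(words):
    """Return the letters classed as vowels, in the order they were picked."""
    # Symmetric adjacency counts; pairs of identical letters are ignored
    # (an assumption of this sketch: the diagonal is treated as zero).
    adj = defaultdict(lambda: defaultdict(int))
    letters = set()
    for word in words:
        letters.update(word)
        for a, b in zip(word, word[1:]):
            if a != b:
                adj[a][b] += 1
                adj[b][a] += 1

    # Row sums: how often each letter sits next to any other letter.
    sums = {c: sum(adj[c].values()) for c in letters}

    vowels = []
    consonants = set(letters)  # everything starts out as a consonant
    while consonants:
        # sorted() only makes tie-breaking deterministic.
        best = max(sorted(consonants), key=lambda c: sums[c])
        if sums[best] <= 0:
            break  # no remaining letter would increase the cross-group count
        vowels.append(best)
        consonants.remove(best)
        # Adjacencies with the new vowel now count against the neighbours.
        for c in consonants:
            sums[c] -= 2 * adj[c][best]
    return vowels


def cross_group_adjacencies(words, vowels):
    """Count adjacent pairs with exactly one member in the vowel group."""
    vowels = set(vowels)
    return sum((a in vowels) != (b in vowels)
               for word in words for a, b in zip(word, word[1:]))


words = ["banana", "panama", "cabana"]
found = sukhotin_vowels(words)
print(found, cross_group_adjacencies(words, found))
```

On this toy word list only "a" is picked, and all 15 adjacent pairs then straddle the two groups.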

This seems to be very nearly the same as the effect of a best-fit
2-state PFSM.  The PFSM partitions the characters into two groups so as
to optimize the ability to predict the next character based solely on
which group contains the previous character.
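
The two-group prediction criterion can also be sketched directly. This is a rough sketch under stated assumptions, not a general PFSM fitter: PFSM is taken to mean a probabilistic finite-state machine whose state is simply the group of the previous character, "best fit" is taken to mean maximum log-likelihood with frequencies estimated from the same text, and the search is brute force over all two-group splits, so it is only practical for small alphabets. The function names are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log


def class_bigram_loglik(text, group_a):
    """Log-likelihood of each character given the group of the one before it."""
    group_a = set(group_a)
    # One next-character frequency table per group of the previous character.
    counts = {True: Counter(), False: Counter()}
    for prev, nxt in zip(text, text[1:]):
        counts[prev in group_a][nxt] += 1
    total = 0.0
    for table in counts.values():
        n = sum(table.values())
        # Maximum-likelihood estimate: P(next | group) = k / n.
        total += sum(k * log(k / n) for k in table.values())
    return total


def best_two_group_partition(text):
    """Try every split into two non-empty groups and keep the best-scoring one."""
    alphabet = sorted(set(text))
    if len(alphabet) < 2:
        raise ValueError("need at least two distinct characters")
    # Pin the first character to group A so each split is tried only once.
    first, rest = alphabet[0], alphabet[1:]
    best_score, best_group = None, None
    for r in range(len(rest)):  # stops short of putting every letter in A
        for extra in combinations(rest, r):
            group_a = {first, *extra}
            score = class_bigram_loglik(text, group_a)
            if best_score is None or score > best_score:
                best_score, best_group = score, group_a
    return best_group, set(alphabet) - best_group


print(best_two_group_partition("banana"))
```

For the toy string "banana" the split with "a" alone in one group predicts every next character perfectly, so it scores a log-likelihood of zero and wins.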

-Ben

-----Original Message-----
From: owner-vms-list@xxxxxxxxxxx [mailto:owner-vms-list@xxxxxxxxxxx] On
Behalf Of Jacques Guy
Sent: Thursday, March 24, 2005 7:05 PM
To: vms-list@xxxxxxxxxxx
Subject: Re: VMs: Sukhotin Vowel IDs

24/03/2005 8:36:45 AM, "Ben Preece" <ben.preece@xxxxxxxxxx> wrote:

>Knox Mix pointed me to this site:
>http://acl.ldc.upenn.edu/C/C73/C73-1017.pdf

I did not know about that article. It looks like an abstract, or an
introduction. The actual articles by Sukhotin which I have read are much
longer and more detailed.

>I also dug this out of the voynich archives:
>http://www.voynich.net/Arch/2001/01/msg00108.html

Yes, that is a free translation of a French translation published in
1974 in T.A. Information. "Free" translation because the original was
dreadfully bloated and obscurely worded. The algorithm itself was
uselessly complicated.

For instance, it reordered the rows and columns of the frequency matrix
at each iteration. So I rewrote much of it, adding my own explanations.
For instance:

If languages did not have properties (1) and (2), the text of property
(2) would read: "cnsnntsooa tnde to ppraea nxte to vwlsoe rhtrae thna
nxte to cnsnntsooa" or "cnsnnts tnd t ppr nxt t vwls rthr thn nxt t
cnsnnts ooa e o aea e o oe ae a e o ooa".

That is my invention; there is nothing remotely like it in the French
(and Russian) original.



______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list

