[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: VMs: RE: RE: Best-fit 2-state PFSMs
Speaking of 2-state(s): what is the currently accepted(?)
letters/elements in the vms now for its alphabet?
Wasn't it something like 14 or so?
IF that number were DOUBLED (as in MIRRORed) to 28 or so,
then how many _Different Languages_ would "28" cover..
inluding $ signs and @ & # signs etc (if needed/ or not)
Folding would make it Twice as big as it appears etc..
-=se=-
steve (still folding/mirroring) ekwall
Date: Fri, 11 Mar 2005 00:10:06 -0600
From: Dennis <tsalagi@xxxxxxxx>
Subject: Re: VMs: RE: RE: Best-fit 2-state PFSMs
Nick Pelling wrote:
Not necessarily regarding (P)FSM's,
> Really, I think that Voynichese is only superficially language-like, and
> that it has an innate artificiality at its core. The idea behind
> examining its structure by incrementing the number of target states is
> to try to find a kind of transition point between the superficial
> language-like structure and whatever lies beneath.
>
>> Real languages apparently can't be
>> realistically modeled without very, very large FSMs, so if a drop like
>> this occurred, it may not happen until the FSM has hundreds of states.
>> The largest I've ever tried to generate was 120 states.
What effect does the size of the text corpus have? The HMM
needed a text corpus of ~6 Mbyte for a clear result. Would
it tell us something to create an enormous synthetic
Voynichese corpus by Gabriel or Jeff's method or Stolfi's
Voynichese grammar and then analyze it?
> FWIW, I suspect the transition point will fall between 10 and 20 states
> (for a pre-paired transcription set, such as one where qo / ol / or / al
> / ar / iiii / iii / ii / eeee / eee / ee / dy / cfh / ckh / cph / cth /
> ch / sh / eo / od all map to single tokens). For EVA, the PFSM finder
> would also have to unpick all the tricky pairing rules correctly (lots
> of local minima to avoid), so I guess 60+ states would be closer there.
What about defining a transcription alphabet that treats
these digraphs/verbose cipher elements as single glyphemes
and then analyzing the VMs in that transcription? That
seems like a useful exercise.
Dennis
>
> Cheers, .....Nick Pelling.....
>
> ______________________________________________________________________
> To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
> unsubscribe vms-list
>
>
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list