[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: VMs: RE: f78r pairified...
Hi John,
At 11:53 04/08/2003 -0400, John Grove wrote:
Couldn't you also say that all the words end in 'd' and the same stats
would result with the first letter of every word being 'y' (without
pairifying).
It's another possibility: but note that I came to the idea of {e/ee/eee/eo}
being an end-of-word marker from a different question: how to compactly
(and secretly) encode a language's declensions & conjugations?
[Essentially, I think this is what Graham W is reaching towards].
Even given a special token (I think <ch/sh>) for abbreviation, (most?)
languages would still have the issue of how to decline/conjugate words
without giving the game away. My insight was that you'd probably need a
*combining mechanism* - ie one with a combinatorial aspect - to do this,
and the only common-enough (and obvious) feature of the language that would
seem to be able to achieve this is the {e|ee|eee|eo} regexp.
Certainly, in the transcription I made, the second most frequent word start
"pair" was <d> (8/86, ie 9.3%), which is consistent with the idea that "d"
(following {e|ee|eee|eo}) is a null. But then again, if <d> is a null, then
the stats point to <y>'s also being a null - hence my suggestion that <dy>
is a null (or rather, "a paired-cipher representation of a single null token").
You could probably take that farther and say that sh/ch/t/k are actually the
end of word markers and most words begin with 'e'.
Alternatively, if the letters in each word are written back-to-front, then
sh/ch/t/k might be the start-of-word markers, and the 'e' family the
end-of-word markers. There are still lots of permutations to frustrate us
yet. :-o
Cheers, .....Nick Pelling.....
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list