[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: VMs: Strange pair statistics
on 16 January 2004 07:26
> Zitat von Jeff <jeff@xxxxxxxxxxxxxxxxxxxxxx>:
> >... Below is the count for the occurance of each pair within a word.
> > yc 41
> > yd 45
> > rc 13
> > And again the total counts for all occurances.
> > yc - 252
> > yd - 213
> > rc - 207
> > These three pairs are acting in the opposite manner to all other pairs
> > this high in the occurance count table. Would this happen in a
> > language? ...
> > Jeff
> Hi Jeff,
> A few possibilities which come the mind of the naive reader:
> *) Spaces may not be word delimiters, but also encoded characters, ie
> the "space" may represent a different letter (but I think considering a
> as "just another letter" is a fairly modern notion), and actually you'd
> look at triplets like "y<space>c".
I don't believe spaces are standins for characters. They would be too
> *) Spaces were inserted at random to confuse us, and the original text was
This is what I believed until finding this pattern.
> *) One of my pet theories is that within a word you have to read the
This I have considered for a long while and would tend to agree. I have
found words by
scanning forwards, but who knows?
> *) German for example used to have two different ways/characters to
> the letter "s". Perhaps this is the case here. (I don't know if any other
> languages have similar features.)
I do not believe this is the reason. Again too many occurances.
> And finally Barlow's Golden Rule:
> *) Don't be mislead by statistics: In any reasonably large sample, there
> high probability for improbable things to happen.
And of course it depends upon the language and encoding method. :-)
> debitel.net Webmail
> To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
> unsubscribe vms-list
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying: