[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: VMs: Number crunching the Fincher window





Koontz John E wrote:
On Mon, 13 Sep 2004, Elmar Vogt wrote:
...
16 32674 32346

...
We seem to see that natural languages have a larger variety of short
sequences. At the same time, for longer sequences, the VM gets more varied,
until at a sequence length of 16, there were only 90 instances of phrases of
16 or more characters, which got repeated. (In German, we still had some 400
duplicates.)


Pardon my denseness, but I don't see how we got from the preceding table
to the numbers in the text?


The total text length was 32755 chars (I hope I didn't forget to post that as well?), hence a total of 32739 sequences of 16 consecutive chars can be formed. (Of course, they're overlapping.) Indeed, it should be "70" rather than "90".


Cheers,

Elmar

--
Elmar Vogt / Königswarterstr. 18 / 90762 Fürth / GERMANY
elvogt@xxxxxxxxxxx / www.beamends.de / Tel.: (++49/0)911 - 31 52 58

"It is through the truthful exercising of the best of human qualities -
respect for others, honesty about ourselves, faith in our ideals - that
we come to life in God's eyes." (Bruce Springsteen, "Vote for Change")

______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list