[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: VMs: wordlength persistence



Here is a less trivial example of two languages say A and B.
A has wordlengths (frequencies): 3 (1) 4 (2) 5 (3) 6 (2) 7 (1)
B has wordlengths (frequencies): 4 (1) 5 (2) 6 (3) 7 (2) 8 (1)
So A has average wordlength 5.0 and B has 6.0

Suppose we have a text, half in language A and half in language B
Consider a word of four letters.
It occurs twice in A, followed by a word averaging 5 letters, against once in B followed by a word averaging 6 letters. So a four letter word is followed by a word averaging 5.33 letters.


This leads to the table:

3: 5.00
4: 5.33
5: 5.40
6: 5.60
7: 5.67
8: 6.00

of word length followed by average next word length.
Looks familiar?

Ger



----- Original Message ----- From: "ger" <ger.hungerink@xxxxxxxxx>
To: <vms-list@xxxxxxxxxxx>
Sent: Wednesday, October 27, 2004 11:20 AM
Subject: Re: VMs: wordlength persistence



The wordlength connection can easily be explained by several factors:

1) Text in two parts of "languages" with a different average wordlength (AWL) each.
2) Text with citations in a "longer" language.
3) Text containing sequences of longer words, as in summing up names.
4) Text containing names consisting of two longer words.


For a trivial example of the last case see:

"aaaaa aaaaa aaaaa aaaaa aaaaa Wilfrid Voynich aaaaa aaaaa aaaaa aaaaa aaaaa aaaaa"

the AWL following a 5 letter word is 5.2
the AWL following a 7 letter word is 6.0

Now imagine text with AWL 5 containing groups of words with AWL 6 and the same will hold although to much lesser extend.

This is not say there is no such effect as a wordlength connection, but that can only unsurface when at least the four named effects are excluded.

Ger




--- Outgoing mail is certified Virus Free. Checked by AVG anti-virus system (http://www.grisoft.com). Version: 6.0.782 / Virus Database: 528 - Release Date: 23-10-2004 ______________________________________________________________________ To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying: unsubscribe vms-list



---
Outgoing mail is certified Virus Free.
Checked by AVG anti-virus system (http://www.grisoft.com).
Version: 6.0.782 / Virus Database: 528 - Release Date: 22-10-2004


______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list