Re: VMs: wordlength persistence
Here is a less trivial example with two languages, say A and B.
A has wordlengths (frequencies): 3 (1) 4 (2) 5 (3) 6 (2) 7 (1)
B has wordlengths (frequencies): 4 (1) 5 (2) 6 (3) 7 (2) 8 (1)
So A has average wordlength 5.0 and B has 6.0.
Suppose we have a text, half in language A and half in language B,
with the words within each half in random order.
Consider a word of four letters.
It occurs twice in A, followed by a word averaging 5 letters, against once
in B, followed by a word averaging 6 letters. So a four letter word is
followed by a word averaging (2*5 + 1*6)/3 = 5.33 letters.
This leads to the following table of word length against average length
of the next word:
3: 5.00
4: 5.33
5: 5.40
6: 5.60
7: 5.67
8: 6.00
Looks familiar?
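
The table can be reproduced with a few lines of Python. This is only a
sketch under assumptions the example leaves implicit: both languages
contribute the same amount of text, successive words within one language
are independent, and the single boundary between the two halves is
ignored. The function names are made up for illustration.

```python
from fractions import Fraction

# Word-length frequency tables from the example: {length: count}
A = {3: 1, 4: 2, 5: 3, 6: 2, 7: 1}
B = {4: 1, 5: 2, 6: 3, 7: 2, 8: 1}


def mean_length(freq):
    """Average word length of one language."""
    total = sum(freq.values())
    return Fraction(sum(length * n for length, n in freq.items()), total)


def next_length_table(*languages):
    """Expected length of the next word, per current word length.

    Assumption: equal amounts of text per language and independent
    successive words, so the next word simply has the mean length of
    whichever language the current word came from.
    """
    means = [mean_length(f) for f in languages]
    table = {}
    for length in sorted(set().union(*languages)):
        # Relative frequency of this word length in each language.
        weights = [Fraction(f.get(length, 0), sum(f.values()))
                   for f in languages]
        weighted = sum(w * m for w, m in zip(weights, means))
        table[length] = weighted / sum(weights)
    return table


for length, avg in next_length_table(A, B).items():
    print(f"{length}: {float(avg):.2f}")
```

Exact fractions are used so that 16/3 and 17/3 are only rounded when
printed.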
Ger
----- Original Message -----
From: "ger" <ger.hungerink@xxxxxxxxx>
To: <vms-list@xxxxxxxxxxx>
Sent: Wednesday, October 27, 2004 11:20 AM
Subject: Re: VMs: wordlength persistence
The wordlength connection can easily be explained by several factors:
1) Text in two parts, each in a "language" with a different average
wordlength (AWL).
2) Text with citations in a "longer" language.
3) Text containing sequences of longer words, as when listing names.
4) Text containing names consisting of two longer words.
For a trivial example of the last case see:
"aaaaa aaaaa aaaaa aaaaa aaaaa Wilfrid Voynich aaaaa aaaaa aaaaa aaaaa
aaaaa aaaaa"
the AWL following a 5 letter word is 5.2
the AWL following a 7 letter word is 6.0
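
These two figures can be checked by counting directly. The sketch below
assumes words are separated by whitespace and that the last word, having
no successor, is left out; the function name is made up for illustration.

```python
from collections import defaultdict


def next_length_by_length(text):
    """Map each word length to the average length of the following word."""
    lengths = [len(word) for word in text.split()]
    followers = defaultdict(list)
    # Pair every word with its successor; the final word has none.
    for current, following in zip(lengths, lengths[1:]):
        followers[current].append(following)
    return {current: sum(v) / len(v)
            for current, v in sorted(followers.items())}


sample = " ".join(["aaaaa"] * 5 + ["Wilfrid", "Voynich"] + ["aaaaa"] * 6)
result = next_length_by_length(sample)
print(result)
```

Ten of the eleven 5 letter words have a successor, one of which is
"Wilfrid", giving (9*5 + 7)/10 = 5.2; the two 7 letter words are followed
by 7 and 5 letters, giving 6.0.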
Now imagine text with AWL 5 containing groups of words with AWL 6 and the
same will hold, although to a much lesser extent.
This is not to say there is no such effect as a wordlength connection, but
it can only surface once at least the four effects named above are excluded.
Ger
---
Outgoing mail is certified Virus Free.
Checked by AVG anti-virus system (http://www.grisoft.com).
Version: 6.0.782 / Virus Database: 528 - Release Date: 23-10-2004
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list