[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: VMs: Number crunching the Fincher window



Hello Elmar,

a different type of statistic... nice.

But did you use the Eva transcription?
Assuming that: in, iin, iiin are single characters
and being quite sure that ch and sh are as
well, the stats better be based on a Currier- or
FSG-like transcription scheme. 

It should change the statistics in the shorter-
length area...

Cheers, Rene

--- Elmar Vogt <elvogt@xxxxxxxxxxx> wrote:

> 
> Cheers all,
> 
> Being intrigued by Marke's idea, I did a bit of
> number crunching.
> 
> Especially, I wanted to find out how many different
> sequences there were in 
> the text for various sequence lengths. So I took
> excerpts from the VM 
> transcript, removed line breaks (I left spaces in
> there), and had the 
> numbers of different sequences counted. I also
> performed a control with a 
> German text of similar length (32764 chars both
> times).
> 
> Here's what I got -- number of different sequences
> for different sequence 
> lengths:
> 
> Length   VM        German
> 4        4389        9435
> 5        8773       14949
> 6       14087       19623
> 7       19432       23443
> 8       23934       26264
> 9       27263       28237
> 10      29527       29609
> 11      30954       30543
> 12      31783       31187
> 13      32249       31651
> 14      32491       31964
> 15      32612       32190
> 16      32674       32346
> 
> (Note that this was a character-for-character
> examination.)
> 
> We seem to see that natural languages have a larger
> variety of short 
> sequences. At the same time, for longer sequences,
> the VM gets more varied, 
> until at a sequence length of 16, there were only 90
> instances of phrases of 
> 16 or more characters, which got repeated. (In
> German, we still had some 400 
> duplicates.)
> 
> (I don't really trust my little program on this one
> -- can anyone confirm or 
> deny?)
> 
> This seems to me to point to the hoax theory. If we
> assume a Fincher window 
> of less than 16 chars width, that would be what we'd
> expect -- Correlation 
> drops drastically as soon as the window width is
> reached. I had hoped though 
> for a more pronounced "drop" at the actual window
> length.
> 
> Or am I completely on the wrong track?
> 
> 	Elmar
> 
> -- 
> Elmar Vogt / Königswarterstr. 18 / 90762 Fürth /
> GERMANY
> elvogt@xxxxxxxxxxx / www.beamends.de / Tel.:
> (++49/0)911 - 31 52 58
> 
> "It is through the truthful exercising of the best
> of human qualities -
> respect for others, honesty about ourselves, faith
> in our ideals - that
> we come to life in God's eyes." (Bruce Springsteen,
> "Vote for Change")
> 
> 
>
______________________________________________________________________
> To unsubscribe, send mail to majordomo@xxxxxxxxxxx
> with a body saying:
> unsubscribe vms-list
> 



		
__________________________________
Do you Yahoo!?
New and Improved Yahoo! Mail - Send 10MB messages!
http://promotions.yahoo.com/new_mail 
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list