[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: A few LSC comments
Rene, thanks for the reply. I was pleasantly surprised that besides you and
Gabriel also Jim has looked into LSC. I have not checked the formula you
used since having found that your Se curve coincided with our results, I
simply assumed that you just slighly changed notations but essentially used
our formula for Se. You said you simplified our formula. I did not see it
that way, but it is up to everybody to change the formula's appearance as
one wishes as long as it yields the same result. As to the expected results
with order 3 word monkeys etc, I have said what I thought of that in my
previous reply to Gabriel, so I don't think I can add anything new and I
prefer to wait until you conduct the measurements and we'll see how it
works. Best to all, Mark
Rene wrote:
> I haven't yet been able to do any further calculations and comparisons,
> but here are a few assorted comments.
>
> First of all, Jim Reeds detected an error in the calculation of Se, but
> it soon turned out that this is a typo on my web page.
> Where I wrote:
> Se = 2 (L* - k) ( 1 - SUM ( etc.... ))
> it should have been:
> Se = 2 (L* - n) ( 1 - SUM ( etc.... ))
>
> Secondly, I agree with Gabriel that using a 3rd order word monkey
> would be even more interesting in terms of checking the capabilities
> of the LSC method in detecting meaningful text. On the other hand,
> getting meaningful word entropy statistics is even more difficult
> than getting 3rd order character entropy values, so the text from
> a 3rd order word monkey will repeat the source text from which the
> statistics have been drawn much more closely than should be the
> case. As before, a 1st order word monkey will be equivalent to a
> random permutation of words, and if it is true (in a statistically
> significant manner) that the LSC test distinguishes between one and
> the other, we do have another useful piece of evidence w.r.t. the
> Voynich MS text.
>
> In view of the difficulty in judging the meaningfulness of my
> quick tests using character monkeys, I'll add 95% (e.g.) confidence
> intervals around the Sm curves for the random texts.
> Some time in the next few days.
>
> More later, Rene