Re: On the word length distribution

Jorge Stolfi wrote:
> If I read the clue correctly, the ball is back in the cryptographers'
> court. Have a busy holiday week! (But hurry: the Millenium is only a
> week away, and Revelations that are not received by the deadline may
> be rejected independently of their merit. 8-)

The first question that pops into my head is whether the initial
characters in each word follow a distribution anything like Benford's
law - in base 10, the proportion of values with leading digit d is
given by log_10(1 + 1/d), where d = 1..9.
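
A rough sketch of how one might run such a check, in Python. The
function names and the powers-of-2 sample are only my illustration,
not anything taken from the text itself; for the real data one would
first need some (hypothetical) mapping from word-initial characters
to the digits 1..9, a leading digit never being 0.

```python
import math
from collections import Counter


def benford_expected(base=10):
    """Expected leading-digit proportions log_base(1 + 1/d), d = 1..base-1."""
    return {d: math.log(1 + 1 / d) / math.log(base) for d in range(1, base)}


def leading_digit_freqs(numbers):
    """Observed proportions of leading decimal digits among positive integers."""
    leads = [int(str(n)[0]) for n in numbers if n > 0]
    counts = Counter(leads)
    total = len(leads)
    return {d: counts.get(d, 0) / total for d in range(1, 10)}


expected = benford_expected()

# Illustrative sample only: powers of 2 are a standard sequence whose
# leading digits come close to Benford's law. Replace with real data.
sample = [2 ** k for k in range(1, 201)]
observed = leading_digit_freqs(sample)

# Print digit, expected proportion, observed proportion side by side.
for d in range(1, 10):
    print(d, round(expected[d], 3), round(observed[d], 3))
```

A large gap between the two columns for the real data would suggest
the initial characters are not behaving like leading digits of
naturally occurring numbers; a proper comparison would use something
like a chi-square test rather than eyeballing the table.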


Granted, not all data follows this distribution, but it may at least
help decide what kind of codebook it is, if it is one.