--- "Rafal T. Prinke" <rafalp@xxxxxxxxxx> wrote:

> But the entropies of pairs, triplets etc. are not
> output, are they? Would they add any more meaning
> to the text statistics?

Their values are obtained by adding up the conditional
entropies. The character pair value is H2 = h1 + h2.
Equally: H3 = H2 + h3 = h1 + h2 + h3.

This derives from the conditional probabilities
(the chain rule, which also underlies Bayes' theorem):

  P(ab)  = P(a) * P(b|a)
  P(abc) = P(ab) * P(c|ab) = P(a) * P(b|a) * P(c|ab)
  etc.

and plugging these into the formula used for calculating
entropy: the logarithm of a product is a sum of logarithms,
so the entropy of the pair splits into h1 plus h2.

Cheers, Rene
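
P.S. The additivity can be checked numerically. What follows is only a minimal sketch in Python, not the program discussed in this thread; the function names `entropy` and `pair_entropies` are made up for illustration. It assumes h1 is taken over the first characters of the pairs, so that the identity H2 = h1 + h2 holds exactly for a finite text and not just approximately.

```python
from collections import Counter
from math import log2

def entropy(counts):
    """Shannon entropy (in bits) of a table of counts."""
    total = sum(counts.values())
    return -sum(c / total * log2(c / total) for c in counts.values())

def pair_entropies(text):
    """Return (h1, h2, H2) for the adjacent character pairs of text.

    h1 = entropy of the first character of each pair
    h2 = conditional entropy of the second character given the first
    H2 = entropy of the pairs themselves
    """
    pairs = Counter(zip(text, text[1:]))
    # Marginal counts of the first character, taken from the pairs
    # (an assumption made here so that the sum is exact).
    firsts = Counter()
    for (a, _b), c in pairs.items():
        firsts[a] += c
    total = sum(pairs.values())
    h1 = entropy(firsts)
    # h2 = -sum over pairs of P(ab) * log2 P(b|a)
    h2 = -sum(c / total * log2(c / firsts[a])
              for (a, _b), c in pairs.items())
    H2 = entropy(pairs)
    return h1, h2, H2

if __name__ == "__main__":
    h1, h2, H2 = pair_entropies("abracadabra")
    print(h1, h2, H2, h1 + h2)
```

The last two printed numbers should agree up to floating-point rounding. The same construction extends to triplets by conditioning the third character on the preceding pair.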

