However, I haven't tried collocation likelihood
measurements on gibberish or heavily encrypted text
(I say "heavily" because a simple substitution cipher would
behave the same as plain text for collocations). I
would guess in these cases the number of significant
collocations would drop, but by how much I
don't know. I did run collocation likelihoods once on
the VMS text (see my long message about concentrating
on known languages) and didn't see any anomalies (the
character combinations we always see together - 4o -
show up).
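
To illustrate the point about simple substitution, here is a rough
sketch, not the code I actually ran. It assumes "collocation likelihood"
means a Dunning-style log-likelihood ratio (G2) and, to keep it short,
scores adjacent character pairs rather than word pairs. The function
names and the sample sentence are made up for the example. Since a
simple substitution only relabels the symbols, every count in the
contingency table is unchanged, so each bigram should keep its score
under the cipher.

```python
import math
import random
from collections import Counter


def bigram_g2(text):
    """Return {bigram: G2 score} for adjacent character pairs in text."""
    pairs = [text[i:i + 2] for i in range(len(text) - 1)]
    n = len(pairs)
    pair_counts = Counter(pairs)
    first_counts = Counter(p[0] for p in pairs)
    second_counts = Counter(p[1] for p in pairs)

    scores = {}
    for pair, k11 in pair_counts.items():
        a, b = pair
        # 2x2 table: (first is a / not a) x (second is b / not b)
        k12 = first_counts[a] - k11
        k21 = second_counts[b] - k11
        k22 = n - k11 - k12 - k21
        observed = [k11, k12, k21, k22]
        row = [k11 + k12, k21 + k22]
        col = [k11 + k21, k12 + k22]
        expected = [row[0] * col[0] / n, row[0] * col[1] / n,
                    row[1] * col[0] / n, row[1] * col[1] / n]
        # Empty cells contribute nothing to the sum
        scores[pair] = 2 * sum(o * math.log(o / e)
                               for o, e in zip(observed, expected) if o > 0)
    return scores


def substitute(text, seed=0):
    """Apply a random simple substitution over the symbols in text."""
    alphabet = sorted(set(text))
    shuffled = alphabet[:]
    random.Random(seed).shuffle(shuffled)
    key = dict(zip(alphabet, shuffled))
    return "".join(key[c] for c in text), key


# Illustrative sample only; spaces are treated as ordinary symbols.
sample = "the quick brown fox jumps over the lazy dog and then the other dog"
plain_scores = bigram_g2(sample)
cipher_text, key = substitute(sample)
cipher_scores = bigram_g2(cipher_text)

# Each plaintext bigram should score the same as its enciphered image.
same = all(
    math.isclose(score, cipher_scores[key[p[0]] + key[p[1]]], abs_tol=1e-9)
    for p, score in plain_scores.items()
)
print(same)
```

A cipher that changes the symbol sequence itself (polyalphabetic,
transposition, and so on) would alter the pair counts, which is where I
would expect the significant collocations to thin out.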