[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

AW: VMs: Talking about entropy



Title: AW: VMs: Talking about entropy

Hi Rene,
you described exactly, what I wanted to accomplish.The mapping is not really necessary.
My goal is, to find out, if token (or words) have a high entropy, comparable to "real" languages.
If your numbers 10 to 11 are correct, then the VMS is more IMHO more likely to be a language then a ciffer, as a message with a low second order entropy is more redundant then with a higher entropy.I compare that with compression (higher) or error correction (lower).

Claus

-----Ursprüngliche Nachricht-----
Von: Rene Zandbergen [mailto:r_zandbergen@xxxxxxxxx]
Gesendet: Mittwoch, 26. Februar 2003 11:41
An: vms-list@xxxxxxxxxxx
Betreff: Re: VMs: Talking about entropy


Hi!

I'm not entirely sure I follow exactly:
....
The VMs has (IIRC) about 8000 different words (or
was that tokens?). Thus, if you use an arbitrary
character set of this size, you can replace
each word in the MS by one such character. The
new character entropy will be the same as the old
word entropy. Somewhere in the range of 10 to 11.
....
A quick-and-dirty way to do what you are interested
in is simply 'simulate and do the maths' rather
than try to analyse. But it is not the same, and
should be repeated a number of times to see that
the resulting numbers are representative. For
example, it is not feasible to compute higher-order
entropy values this way.

Cheers, Rene

__________________________________________________
Do you Yahoo!?
Yahoo! Tax Center - forms, calculators, tips, more
http://taxes.yahoo.com/
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list