[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: VMs: algorithm to generate VMS like text
Hi Rene,
Yes, I am "cheating" by using statistics about the text to produce text that conforms to the statistical profile of the VMS text but my objective was:
1. Show that you can produce VMS like text using an algorithm
2. As a corollary of 1. above, if you can define an algorithm then
this algorithm could be a representation of a mechanical process.
Another weak point of the way the program works is that, to use an analogy based on UK English, you could produce a word SPHTN as in this word each letter contacts the next letter on a purely statistical basis in free text but disobeys conventions that are present in UK English, such as frequency of vowels and combinations of letters that never occur.
I will generate a reasonably large "text" and run it through Monkey and let you (and the list) know the result. I'll also produce a word frequency list. Oh, and I'll look at Courier and EVA as well.
Brett
Rene Zandbergen <r_zandbergen@xxxxxxxxx> wrote:
--- Brett Cotton wrote:
> The algorithm uses data from a previous program that
> analysed all of the FSG text in the interlinear 1.7
> file and produced a list of contact frequencies for
> each "letter". This is then plugged in to the text
> generator as a look up table for each letter.
Hi!
This is like a second-order monkey, then.
When running the entropy calculations, you should
find almost identical values for (h0,) h1 and h2,
but a high value for h3 (which can hardly be
calculated reliably anyway). But you can use
Frogguy's MONKEY to generate text including the
measured h3 value. You can do that for English
as well, to notice the difference between a
2nd order and a 3rd order monkey. You need
a longish text though. It is also a great tool
for producing 'fake' Italian or !
German
(shades
of Monty Python).
> I then thought that if an algorithm can produce text
> that is alike to the VMS and cannot be proved
> statistically to differ from the VMS then that
> algorithm could be a computational representation of
> a physical process used to generate the text at
> random. I have an idea of how this might work but
> want to spend a bit more time on it before posting
> to the list.
What the monkey cannot emulate is the long-range
variations (Mark Perakh and Gabriel's results).
I am not too sure whether the monkey would also
generate a realistic word frequency list.
Cheers, Rene
__________________________________
Do you Yahoo!?
Yahoo! SiteBuilder - Free, easy-to-use web site design software
http://sitebuilder.yahoo.com
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body
saying:
unsubscribe vms-list
Yahoo! Plus - For a better Internet experience