
VMs: VocTok



Hello All,

I posted a chart showing the changing vocabulary to token ratio in the VMS as tokens increase.

http://home.earthlink.net/~knoxmix/voctok/id2.html

There is a noticeable decrease in the balneological section. The vocabulary building of three other texts (10 would be better for comparison) slacks off at about the same point, very roughly around 11000 tokens. My guess is that the VMS accomplishes a lot of this by repeating the equivalents of big cats/big rats, big cars/big bars and maybe red carpet/red barpet while all continue using higher ratios of words from earlier sections. Quire 8, approximately, is anomalous.

Leaving the balneological section, the VMS resumes adding new words at a much greater rate than the comparison texts. Its total curve has a general bow shape like the others (and all others?) but the scale is altogether different. Those also vary quite a bit from each other. Comparison texts are Landini's Challenge, Meditations, and Fourth Crusade (Clari). Hang in, Dennis: this already eliminates two contenders for the Challenge.
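For anyone who wants to reproduce this kind of curve, here is a minimal sketch of a running vocabulary-to-token ratio. It assumes the text has already been split into tokens on whitespace. The function name `vocab_token_curve` and the `step` parameter are made up for illustration and are not the method used for the chart.

```python
def vocab_token_curve(tokens, step=1000):
    """Return (tokens_so_far, vocabulary_size, ratio) sampled every `step` tokens.

    Hypothetical helper: the vocabulary is the set of distinct tokens seen so far.
    """
    tokens = list(tokens)
    seen = set()
    curve = []
    for i, tok in enumerate(tokens, start=1):
        seen.add(tok)
        # Sample at each multiple of `step`, and once more at the very end.
        if i % step == 0 or i == len(tokens):
            curve.append((i, len(seen), len(seen) / i))
    return curve


if __name__ == "__main__":
    sample = "big cats big rats big cars big bars".split()
    for count, vocab, ratio in vocab_token_curve(sample, step=2):
        print(count, vocab, round(ratio, 3))
```

A flattening of the vocabulary column between samples is what shows up as the "slacking off" in the chart. Plotting the ratio per section rather than cumulatively would be a different measure.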

Without more comparisons, I would not say from this that another "language" or "dialect" is indicated but, from what is known, I suppose that is part of what it shows. Nothing new here but it gives a picture.

There is also a Monkey chart for the segments with 8001-16000 tokens (not incremented) to show the changes in h1 and h2 entropy. I probably should have done that with spaces-off. I do not understand it. If it looks odd to you/youse/y'all, I must have done something wrong.
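As a cross-check on the entropy figures, here is a rough sketch of how h1 and h2 could be computed by hand. It assumes h1 is the single-character entropy and h2 is the conditional entropy of a character given the one before it, which is the usual convention in VMS statistics. Monkey's exact definitions may differ, and the function names and the `spaces_off` flag are hypothetical.

```python
import math
from collections import Counter


def h1(text, spaces_off=False):
    """First-order entropy in bits per character (assumed meaning of h1)."""
    if spaces_off:
        text = text.replace(" ", "")
    n = len(text)
    if n == 0:
        return 0.0
    counts = Counter(text)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


def h2(text, spaces_off=False):
    """Conditional entropy of a character given its predecessor (assumed h2)."""
    if spaces_off:
        text = text.replace(" ", "")
    n = len(text) - 1  # number of adjacent character pairs
    if n <= 0:
        return 0.0
    pairs = Counter(zip(text, text[1:]))
    firsts = Counter(text[:-1])  # how often each character starts a pair
    # -sum over pairs of p(a, b) * log2 p(b | a)
    return -sum(c / n * math.log2(c / firsts[a]) for (a, _b), c in pairs.items())


if __name__ == "__main__":
    segment = "big cats big rats big cars big bars"
    print(h1(segment), h2(segment))
    print(h1(segment, spaces_off=True), h2(segment, spaces_off=True))
```

Running both functions on each 8000-token segment, with and without spaces, would give numbers to set beside the Monkey output.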

---

I wrote "balneological" because I agree with those who think that is what the section represents. I think the tubes are superdoodles but cannot learn anything by dismissing it so I am still open on that.


Regards, Knox