Just as you grab a math reference book to look up a certain integral or the value of a function, it would be nice to have a sort of "data book" of raw VM statistics that could be referred to easily.
Examples of the types of statistics that might be included are:
 - letter frequencies by page / by type of page / by language
 - references to all occurrences of repeated words or phrases
 - statistics on occurrences of gallows characters
 - a word frequency list
 - a KWIC index (concordance) of the VM
 - word ending frequencies
 - etc.
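As a rough illustration of how a few of these tables could be generated, here is a minimal Python sketch that builds a word frequency list, word ending frequencies, and a small KWIC index. Everything in it is an assumption made for illustration: the function names are hypothetical, the text is assumed to be a transliteration in which words are separated by spaces or periods, and the SAMPLE string is invented placeholder text, not taken from any transcription of the VM.

```python
from collections import Counter

def tokenize(text):
    # Assumption: words are separated by whitespace or '.'
    return text.replace(".", " ").split()

def word_frequencies(words):
    # Word frequency list: maps each word to its number of occurrences.
    return Counter(words)

def ending_frequencies(words, n=2):
    # Frequency of the last n characters of each word (shorter words skipped).
    return Counter(w[-n:] for w in words if len(w) >= n)

def kwic(words, keyword, width=2):
    # Keyword-in-context: (left context, keyword, right context) per occurrence.
    lines = []
    for i, w in enumerate(words):
        if w == keyword:
            left = " ".join(words[max(0, i - width):i])
            right = " ".join(words[i + 1:i + 1 + width])
            lines.append((left, w, right))
    return lines

# Invented placeholder text, NOT real VM data.
SAMPLE = "tala.miro.tala.tala.tala.sero.miro.kala"
words = tokenize(SAMPLE)

if __name__ == "__main__":
    for word, count in word_frequencies(words).most_common():
        print(f"{count:4d}  {word}")
    for ending, count in ending_frequencies(words).most_common():
        print(f"{count:4d}  -{ending}")
    for left, key, right in kwic(words, "sero"):
        print(f"{left:>20}  [{key}]  {right}")
```

The same functions could be run per page or per section to get the "by page" breakdowns, provided the chosen transcription carries page identifiers.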
Then, for example, rather than referring in general to "triple repetitions of words" it would be easy to examine all the actual occurrences to look for some pattern.
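To make that concrete, a listing of repeated words might be produced along the following lines. This is only a sketch under stated assumptions: find_repetitions is a hypothetical helper, the input is assumed to be an already tokenized word list in reading order, and positions are plain word indexes rather than page or line references (a real data book would want the latter).

```python
from itertools import groupby

def find_repetitions(words, min_run=3):
    # Return (start_index, word, run_length) for every run of the same word
    # repeated at least min_run times in a row.
    hits = []
    pos = 0
    for word, group in groupby(words):
        run = len(list(group))
        if run >= min_run:
            hits.append((pos, word, run))
        pos += run
    return hits

if __name__ == "__main__":
    # Invented placeholder words, NOT real VM data.
    sample = "tala miro tala tala tala sero".split()
    for start, word, run in find_repetitions(sample):
        print(f"word {start}: '{word}' repeated {run} times")
```

Lowering min_run to 2 would list the double repetitions as well.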
By the way, to build such a data book it would be useful to choose one (or more) transcription(s) of the VM as a sort of "reference text", which would be included in the data book and used as the basis for all statistics, with the understanding that there are legitimate differences of opinion about the transcription which would have some effect on the resulting statistics. (Without "putting a stake in the ground" somewhere, however, it is impossible to do more than talk in generalities.)
Then, as new theories arise, additional statistics to investigate them could be generated and added to the "data book".