[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: VMs: A very important discovery!!
> With reference to this point, I have updated my
> result's page, and now I have put a new graphic,
> I called: "Extract sorted of VMS without the first
> word beginning with f, k or p (EVA modified), with
> marks in the sections which begin with gallows". As
> its name says, I have eliminated of the sorted VMS
> first word that begin whit Eva-p, Eva-f (and Eva-k).
> And the peaks are again (the fact is that the peaks
> are smaller, but the reason is because I have
> eliminated the words that begin with p and f gallows
> as you say).
I need to think about this one.
There is a potential source of confusion, by
the way, when we speak of the Eva-f and Eva-p
characters. On the one hand there are the ones
that appear as the first character in new
paragraphs (where there can also be Eva-k,
don't know about Eva-t). On the other hand there
are the words that can include (anywhere) the
letters Eva-f and Eva-p, which by the far
majority occur in the top lines of paragraphs.
> > 1) Which source text did you use? The whole MS
> > including all labels and circular texts etc, or
> > a subset?
> > [...]
> Tomorrow I will try to update my web page with new
> results, because I have realized that the extract
> (summary) of the VMS that I use have some pages
> are part of Herbal-B. I will analyse only the part
> Herbal-A and tomorrow I will put these results.
All analyses have their value and it is by comparing
them that the relevant features can be seen.
> I use an extract of the transcription by Takeshi
> Takahashi that has been a few modified by me: I have
> changed some characters or associations of
> as: cth -> z; cfh -> b; cph -> v; ckh -> u; ch -> h;
> ii -> n; iii-> x; n -> g; and ee -> j;
For this particular analysis (i.e. looking at
word frequencies) the transcription alphabet
doesn't actually matter (up to a certain point).
For others, it is a good idea to do some
substitutions like the above.
> > I would also wonder whether the corpus of
> > words (I should say: tokens) that include an
> > Eva-f or Eva-p actually follow Zipf's law or not.
> > (I would exclude in this corpus the big paragraph
> > start markers).
> > [...]
> I understand that you are talking about if excluding
> the first word or token of any line the results are
> the same, not?
No, this is indeed an example of the confusion
I meant before. It is a feature which is not yet
sufficiently explained, that words that include
Eva-f and eva-p (anywhere) tend to appear
in top lines of paragraphs (and I think in labels).
It could be an ornamental feature (there are
examples of this in Medieval manuscripts)
but it is not clear what would have been there
"without the ornamentation".
My question was more along the line of whether
all the words that contain an Eva-f or Eva-p
(but not the paragraph-initial words) behave
statistically in the same way as normal words.
Do they follow Zipf's law? Is there any distinction
that depends on which character was at the start
of a line?
Do you Yahoo!?
Yahoo! Mail - 50x more storage than other providers!
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying: