[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Fw: Character n anomaly
> [Nick Pelling quoting Currier:]
>
> The frequency counts of the beginnings and endings of lines
> are markedly different from the counts of the same characters
> internally. There are, for instance, some characters that may
> not occur initially in a line. There are others whose occurrence
> as the initial syllable of the first "word" of a line is about one
> hundredth of the expected. </snip>
In fact there are some characters that cannot occur at the
beginning of a *word*. If line breaks occur only between words,
Currier observation would be expected.
> The ends of the lines contain what seems to be, in many cases,
> meaningless symbols: little groups of letters which don't occur
> anywhere else, and just look as if they were added to fill out the
> line to the margin. There is, for instance, one symbol that, while
> it does occur elsewhere, occurs at the end of the last "words"
> of lines 85% of the time.
That must be <m> and its variants. Note that <m> is a pen flourish
suggestive of abbreviation in handwritten text. (Nowadays we use "."
for that purpose.) Indeed <m> seems to occur in cramped contexts where
an abbreviation could be expected.
> One more fact: I have three computer
> runs of the herbal material and of the biological material. In all
> of that, which is almost 25,000 "words," there is *not one single
> case* of a repeat going over the end of a line to the beginning of
> the next; not one.
> [Nick Pelling:] Proper statistical examination should help
> resolve this to everyone's satisfaction. How different is the
> average gap between spaces from the average gap between gallows
> characters?
I can tell you that between two spaces you will find either no
gallows (~50% of the time) or exactly one gallows (~50%). Between two gallows
you will find at least one space, and in fact k spaces with probability
~1/2**k, for k > 0.
(The few exceptions --- words with two or more gallows --- seem to be
cases of omitted spaces.)
All the best,
--stolfi