[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Fw: Character n anomaly
Gabriel wrote:
> The question to ask (to which of course I haven't
> got an answer) is:
> what would convince me that the spaces are
> separating words?
A first list of points in favour:
- They look like spaces and spaces separate words in
most ME manuscripts.
- Taking them as word separators results in a
vocabulary with a realistic word length distribution
- The resulting vocabulary follows Zipf's 1st law
- Label words are found in the text delimited by
spaces
I like these arguments, personally. I agree that
Occam's razor should be enjoyed with care, but I
would tend to say that any other explanation for
word spaces would amount to adding 'complications'.
Spaces could delimit syllables or word fragments
(the latter like in Arabic) but then I would
have expected to see more spaces in the labels as
well. (Unless the language is monosyllabic,
but that of course still makes the spaces word
separators).
Cheers, Rene
__________________________________________________
Do You Yahoo!?
Make international calls for as low as $.04/minute with Yahoo! Messenger
http://phonecard.yahoo.com/