[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: VMs: Babelfish translation [plus question for Dana]

To: <vms-list@xxxxxxxxxxx>
Subject: Re: VMs: Babelfish translation [plus question for Dana]
From: "Larry Roux" <LRoux@xxxxxxx>
Date: Fri, 14 Nov 2003 10:07:29 -0500
Reply-to: vms-list@xxxxxxxxxxx
Sender: owner-vms-list@xxxxxxxxxxx

I disagree.  ch is probably 2 glyphs.  They are written as 2 distinct characters and there are examples of ah co and others.

******************************
Larry Roux
Syracuse University
lroux@xxxxxxx
*******************************

>>> incoming@xxxxxxxxxxxxxxxxx 11/14/03 05:23AM >>>
Hi Jeff,

At 22:23 13/11/2003 +0000, Jeff wrote:
>PS I did retrieve a phrase that babelfish translated as "I am devoted to the
>beautiful one." Any ideas what this might possibly mean?

Sadly, it probably means that babelfish is often rubbish. :-(

Jeff - please understand that there is a *big* difference between EVA and 
the glyphs you see in the scans - not only is <ch> almost certainly a 
single glyph, but so is <sh> (though consistent, EVA's reuse of the <s> 
character within <sh> has been [IMO rightly] criticised).

While I honestly have no idea whether you are trying the right kind of 
approach or not, I'm quite sure that while you base it on an EVA 
transcription, you won't find out. Look: <s> in EVA is manifestly a 
different entity in <s> and <sh>, so your tables really shouldn't be 
treating them as one thing (but they currently are).

A reasonable (yet fairly minimal) extended glyph set might well be based 
around:-
         ch
         sh
         ee
         cfh
         ckh
         cph
         cth

Similarly, many read the <in>/<iin>/<iiin> strokes as a single glyph, so 
these too might well be fruitfully considered as single glyphs.

IMO, there's also a strong case for treating these pairs as if they formed 
individual letters:-
         qo
         ol
         al
         dy
         eo
         or
         ar

Also: you may want to consider using a different line of text as your 
testbed. Philip Neal and others (myself included) have pointed out numerous 
times that the two single-leg gallows (<p> and <f>) are often found close 
to incongruous-looking pieces of text, especially on the top line of pages. 
Also: the transcription on f1r isn't really as clear as it could be - the 
page itself appears worn out, probably from having been on the outside of 
the Ms for five centuries. :-)

So... I'd suggest finding some clearer text to work from (and avoiding the 
topmost line if possible), perhaps on a herbal page where there is a 
reasonably good match with a known plant. However, these two criteria 
overlap incompletely - for example, f11r has extraordinarily clear text, 
but f2v has an excellent plant identification ("nanufar", Nymphaea alba).

Dana - as a general point, which herbal page would you suggest offers the 
optimal combination of textual clarity plus reliable plant identification?

Finally: Jeff, you're getting to the stage where doing this kind of thing 
by hand will consume all your time (and more). Please install Python on 
your machine and adapt Seth's script to do the things you want - really, 
you'll save yourself a *lot* of hassle.

Cheers, .....Nick Pelling..... 

______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list

______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list

Follow-Ups:
- Re: VMs: Babelfish translation [plus question for Dana]
  - From: Nick Pelling
- Re: VMs: Babelfish translation [plus question for Dana]
  - From: Rene Zandbergen

Prev by Date: Re: VMs: Babelfish translation [plus question for Dana]
Next by Date: Re: VMs: Babelfish translation [plus question for Dana]
Previous by thread: VMs: Letter scan counts
Next by thread: Re: VMs: Babelfish translation [plus question for Dana]
Index(es):
- Date
- Thread