[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Help me parse arabic text!



Hello Friends,

I'm interested in using arabic as a language for
comparison with the VMS.  However, I find myself out of my league and hope
someone can offer advice.

I found a copy of the koran in ISO-8859-6 encoding, but tools like MONKEY
and TACT choke on the character set.  I'm not sure how to do character-
and term frequency counts on it -- is there a better tool for this job?  I
want to generate zipfian curves for comparison, removing likely bits that
may represent nulls in the VMS.  

Any assistance or advice would be most appreciated!

Best Regards,

Jason

----------
Jason Morningstar
School of Information and Library Science
UNC Chapel Hill