[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: VMs: VMS Word context similarities
Randomising the word order of an input file
changes the adjacency relationships of words
in two different ways. It breaks up the strong
meaningful relationships, but it also creates
a weak background context match between all
words.
I tried this with the bible sample, looking at
the grouping of the word 'was'.
In the unscrambled file it was strongly grouped
as:
(were,be,was,is,are) ... which makes sense.
In the scrambled file it was weakly bundled into
a large group:
(was,in,yahweh,from,do,shall,god,you,which,to,they
is,his,all,that,a,for,be,into,your,my,them,him,said
its,has,who,us,as,this,he,her,i,when,by,one,up,will,
me,come,house,on,have,were,their,out,we,with,are,
son,king,it,men,go,but,not,came,people,before,because,
at,day,so,or,if,man,let,no,went,saying,had,israel,
there,against,also...) !!
Something similar happens with the VMS. When the
threshold is set high, randomisation reduces the
number of relationships found, when the threshold
it set low, it increases the number.
Marke
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list