[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

VMs: truncated repeating sequences



Hi all,
Continuing my investigation into split/joined words I ran an 
experiment which records repeated sequences but ignoring the
position of spaces within them.

I was conjecturing that we might find a lot of long sequences 
which are repeated, but with differing space patterns.

All in all, I detected 16000 distinct repeated sequences of 
between 10 and 32 chars, but the vast majority reoccur with 
the same space pattern in each instance, with only a few 
exceptions.

There is a consistent pattern to the exceptions in that 
certain specific small sequences such as 'ol', 'o' and 'y' 
have the ability to "float" from the end of one word to the 
beginning of the next and vice-versa.  (Some examples at end).   

However, something more interesting also came to light.

There are long sequences which reoccur with a consistent
space pattern *BUT* truncated at a large number of different 
points. 

??

Marke

---

EXAMPLES:

Two examples of the infrequent varying space patterns:

aralarolkee:   ar.al.aro.lkee:1  ar.alar.olkee:1
arolchedyqokai:  ar.ol.chedy.qokai:1  ar.olchedy.qok.ai:1


examples of the variously truncated sequences:

07466:a
02408:ai
00333:aii
00030:aiin
03220:aiin.
00082:aiin.s
00184:aiin.sh
00165:aiin.she
00009:aiin.shed
00024:aiin.shedy.
00006:aiin.shedy.qo
00002:aiin.shedy.qok
00005:aiin.shedy.qoke
00001:aiin.shedy.qokeedy.
00002:aiin.shedy.qokeedy.qo
00001:aiin.shedy.qokeedy.qote

02222:c
05729:ch
04043:che
00024:chey
00635:chey.
00010:chey.q
00082:chey.qo
00069:chey.qok
00014:chey.qoke
00015:chey.qokee
00015:chey.qokeey.
00001:chey.qokeey.q
00001:chey.qokeey.qo
00001:chey.qokeey.qok
00001:chey.qokeey.qoke
00001:chey.qokeey.qokee
 
05900:d
00128:dy
04551:dy.
00040:dy.q
00754:dy.qo
01144:dy.qok
00052:dy.qokch
00004:dy.qokchd
00024:dy.qokchdy.
00003:dy.qokchdy.qo
00004:dy.qokchdy.qok
00001:dy.qokchdy.qoka
00001:dy.qokchdy.qokal.
 
14737:e
00851:ed
00025:edy
02541:edy.
00034:edy.q
00531:edy.qo
00655:edy.qok
00125:edy.qoka
00088:edy.qokai
00001:edy.qokaii
00058:edy.qokaiin.
00001:edy.qokaiin.s
00005:edy.qokaiin.sh
00002:edy.qokaiin.she
00001:edy.qokaiin.shed
00002:edy.qokaiin.shedy.
00001:edy.qokaiin.shedy.qo
00001:edy.qokaiin.shedy.qok
 
14737:e
00851:ed
00025:edy
02541:edy.
00034:edy.q
00531:edy.qo
00434:edy.qok
00360:edy.qoke
00016:edy.qoked
00001:edy.qokedy
00086:edy.qokedy.
00014:edy.qokedy.qo
00022:edy.qokedy.qok
00005:edy.qokedy.qoka
00001:edy.qokedy.qokai
00001:edy.qokedy.qokain.o
 
14737:e
00851:ed
00025:edy
02541:edy.
00034:edy.q
01428:edy.qo
00011:edy.qop
00007:edy.qopch
00008:edy.qopche
00002:edy.qopched
00001:edy.qopchedy
00010:edy.qopchedy.
00001:edy.qopchedy.qo
00002:edy.qopchedy.qok
00001:edy.qopchedy.qoke
 
 
______________________________________________________________________
To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying:
unsubscribe vms-list