[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: VMs: Personal Guess



Hi GC,

Lances up - another day of jousting begins... :-)

At 22:57 06/08/2003 -0500, GC wrote:
If you suspect the VMS to be abbreviated (even lightly), I'd like to see
evidence of this.  What do you mean by "abbreviated"?  What statistic
singles out these "abbreviations"?  Usually the purpose of abbreviation is
brevity, something like the 'x' for 'us'.  To fit the 'standard' definition,
it would be reasonable to provide statistics on the single glyphs that act
as "abbreviations", and your rationale behind choosing these above others.

(1) I assert (somewhat uncontroversially) that the VMS' appropriation of the Tironian notae "8" and "9" points to the VMS' alphabet designer's choices being informed by shorthand. In fact, I'd comfortably argue that this alone points to the VMS' not being a natural language in an invented alphabet: why would someone invent an alphabet that not only appropriates two of the few remaining Tironian notae in use, but frequently places them at the beginnings and ends of words?


(2) I strongly suspect that just about every statistic gathered to date may have been misdirected - the strong implication of the pair cipher hypothesis is that statistics based on glyphs would be next to useless. If you also have misgivings about soft spaces vs hard spaces (or if you suspect some kind of space transposition cipher is in use), you'd have much the same kind of suspicion about "word"-based statistics. So: I don't have any statistical evidence - but I don't think that you do either.

(3) My abbreviation hypothesis arose simply out of trying to explain the "curls" / "tears" / "question marks" accessorising the <ch> pair. Under what possible circumstances would a logical, well-thought-out, systematic mode of expression (ie the rest of the alphabet) need to be compromised / accessorised / altered by these (apparently) semi-improvisational marks? After a lot of deliberation, it seems highly likely to me that these represent disambiguation marks (or "hints"), designed to help the decoder guess the correct replacement word. So: I only see <ch> and the various forms of <sh> as abbreviations, but that isn't born of statistical analysis.

As for what I mean by "abbreviated": you talk about "x" for "us", but that's a notae/nomenclature-like tokenisation (see the first cipher of the Milanese cipher ledger for an example of this kind of nomenclature gone crazy).

Strictly speaking, abbreviation (rather than tokenisation) takes two forms - contraction and truncation. Let's look at the Radcliff/Ratcliff shorthand Lord's Prayer:-

        Our Fth wch rt n hvn ; hlwd b y Nm
        Y Kgdm cm Y wl b dn n rth z it s n Hvn

What types of abbreviation are at play here? Clearly, many vowels are being discarded (contraction) and some words are being terminated early (truncation).

The obvious question then comes: what kind of abbreviation do I think is denoted by {ch|sh} in the VMS? I'll try to explain my thoughts by going over a fragment of pair-transcribed text from f24r:-

        k.ar.ch
        t.ar.s.ch.e
        or.cth.ol.qo.d.ol.y.ch

Firstly, I suspect that - for most common words - many vowels have simply been discarded (ie, contraction), and it would be fairly pointless to have a glyph coding for this kind of abbreviation. But for truncation, that's a different matter - especially if the spaces in the text have been somehow obscured.

All of this leads me to predict that the structure of the first word "k.ar.ch" is probably "consonant . consonant . <truncation>".

For the next word - "t.ar.s.ch.e" - I predict that we might well have "consonant . consonant . consonant . <truncation> . <particular noun/verb ending>".

However, I believe that the next word - "or.cth.ol.qo.d.ol.y.ch" - may well be a proper name, or a similar CT word not really amenable to obvious internal contraction, just truncation once it gets beyond a certain length (like "Arist--" for "Aristotle"). Therefore, some of the pairs in this particular word may well code for vowels.

Abbreviation is a natural partner for verbose ciphers, especially if you want to try, as part of encoding an already-existing document, to reproduce the general textual layout of the original. If the subject matter is not a particularly good fit with heavy abbreviation, the alternative is to use a finer quill - this may be part of the explanation for what we observe in the VMS' hands.

Fair enough criticism.  You won't mind filling me in on Strong's "specific
historic model"?  Let's see, Selenus and others up to Strong's time covered
a very wide range, including nomenclators and homophonic substitution
ciphers.  Even most of Friedman's critical papers were out (and read by
Strong) by the time he took on the Voynich. As far as "external historical
research", his interest was in late 16th century, early 17th century cipher,
a little later than the Voynich, but not by much on the evolutionary scale.

Put simply, late 16th Century cipher is what Strong was looking for, and late 16th Century cipher is what he thought he found. He didn't - and you don't - seem to cut the 15th century code-makers much slack: I think they were smarter than we give them credit for. After all, they practically invented modern cryptology.


More than that - According to Strong, he spent over 500 hours of research
(on two pages) before the light broke. <snip>

I'm just talking about two pages' being an insufficient sample for Strong to derive a theory from. If he had been allowed to see "the bigger picture", I'm sure his ideas would have moved on.


> Generally: you and I both see the VMS as a non-obvious polyalpha system -
> but please accept my caution that I think the VMS author is trying to
> sidestep our feeble post-1912 attempts at cryptology by misdirecting us,
> certainly at the letter level and probably elsewhere as well. For
> example,
> I would be completely unsurprised if the order of the letters within each
> word were transposed in some way - perhaps reversed.

I'm not certain what you mean by "non-obvious".  The very nature of cipher
is to make the solution "non-obvious".

No, the nature of cipher is to make the solution mathematically hard (ie, you have to rely on statistics to help you get to the answer) - the nature of steganography is to make the solution non-obvious.


If I think I see something, I go to
the database and pull up the numbers.  If they work in my favor, go a little
further.  If not, rethink it, try something different.

I wish I had even 10% of your faith that current VMS statistics tell us anything about the coding system. Just because it's easier to look for your keys under the streetlight doesn't mean you dropped them there - here, I'm certain that the VMS' author constructed a system with a bright (steganographic) light, ready to be surrounded by (cryptological) moths, but which actually illuminates nothing.


I personally don't
think the VMS author knew or cared about a 'post-1912' world, and certainly
didn't think his manuscript would be the subject of so much discussion.  He
didn't write it to misdirect us, we're doing a good enough job of that all
by ourselves.  He wrote it for his own use, for personal reasons, and
couldn't have cared less about you or me.

What if (as I personally suspect) the exact opposite of every assertion in this paragraph were true? :-p


One of the reasons I enjoy this list is that I get to play the "thesis-antithesis" game with smart people like you - but perhaps one day (soon) we'll get to "synthesis"... game over! :-)

Cheers, .....Nick Pelling.....


______________________________________________________________________ To unsubscribe, send mail to majordomo@xxxxxxxxxxx with a body saying: unsubscribe vms-list