Here’s another (sort of) “plaintext” Voynich Manuscript reading, that I first found back in 2006: having corresponded briefly with the Greek author (who wishes to remain anonymous) at the time, I then managed to completely forget about until a few days ago.

He claims that the Voynich Manuscript is a transliterated Arabic document written down “using a kind of [old-fashioned] Jewish script”, and that it contains incantations to fulfil “all kinds of human desires”, addressed to the goddess “Siit” as part of cult worship ultimately deriving from the Mesopotamians.

René will be pleased to hear that the author gives extensive equivalence tables showing how to map Voynichese letters onto Hebrew letters, as well as a pronunciation guide. (Though note that you will need to resize your browser window to be 1024-pixels wide in order for the left-hand “Gabelsberger Shorthand Symbol” column to line up).

There is also a long section on f116v (the “michiton oladabas” page), as well as comments on other pages:-

Cotton is depicted on page 17, and cannabis sativa on page 16; these are plants used to make fabrics, like the one on the right of cotton, which is flax. On page 11 it is, I think, a lemon tree.

He finishes up by noting that the first few lines of folio 56 (which he says depict an eggplant) read as follows (“aqith” = “eternal”), and comments that “I cite this passage for anyone who knows Arabic well to offer a tentative reading of the whole page“:-

s(tbqd bbk)n sTn rkran bn nbsMb.n bsl bn bn
bstbrn bsd bsdn tsl bn bn trn bsTn hstqSθ
sd brn bstbsd brn ten usten bsten bstkn
usT bsl bsl bst sl btsl bn stql ban
bs bsl bs bsaqdn aqiΘ
ntbs an abrn ten aqiΘ n

Is this the answer we have been looking for? Confidentially… I don’t really think so. As claimed plaintexts go, it appears to have quite a few, errrm, ‘problems‘, let’s say. But perhaps some Arabic-literate Cipher Mysteries reader reading the above will know how to make the Philosopher’s Stone, who knows? 🙂 

Incidentally, the author also refers to a (previously unknown?) VMs book by “Ethan Ashmole Jones” called “The Voynich Manuscript – Who Is Who of a Riddle”, published by Ellinika Grammata (I believe), though I couldn’t see a copy in WorldCat. Anyone seen this before or heard of Ethan Ashmole Jones? Sounds a bit like a pseudonym to me, but (as always) you never know! 😉

I flagged here last year that a new Erich von Daniken book was on the way (though it was only in German at the time). Well… now it’s on the way (August 2009) in English, too, courtesy of Legendary Times Books, with the openly provocative title “History Is Wrong“.

Curiously enough, EvD not only takes on all the usual fringe suspects (the Voynich Manuscript, the Piri Reis Map, the Antikythera Mechanism, etc), but also talks of nn underground labyrinth in Ecuador containing “an extensive library of thousands of gold panels“. And yes, you’re way ahead of me: he does indeed put forward evidence linking this not only to the Book of Enoch, but also to the Mormons.

Of course, there are plenty of people out there who would readily retitle this latest Danikenian meisterwerk as “von Daniken Is Wrong“: and I’d be hard-pressed to say from what I’ve read of it so far that this would be vastly unfair. All the same, perhaps his book will reach out to a whole new generation of gullible open-minded readers, who have managed to spend their short lives without being exposed to his individual breathless brand of cultural syncretism. Or, given the million websites that have sprung up to follow on his footsteps, might he have left it too late to return to the fray – essentially, has the Net made von Daniken redundant?

Though (as was apparent from the rapid social media take-up of yesterday’s XKCD webcomic) the Voynich Manuscript is now firmly wedged in the cultural mind, sadly the level of debate on it is still stuck circa 1977 – and if anything, Gordon Rugg’s foolish “hoax” claims have helped to keep it there.

But it is demonstrably written in cipher: and so this post tells you why I’m certain it’s a cipher, how that cipher works, and what you can do to try to break it. I’m happy to debate this with people who disagree: but you’ll have to bear in mind that as far as this goes, I’m just plain right and you’re just plain wrong. 🙂 

1. What does the Voynich Manuscript resemble?

Firstly, the overwhelming majority of the Voynich Manuscript is written using only 22 or so letter-shapes: generally speaking, this is the size of a basic European alphabet. Voynichese therefore visually resembles an ordinary European language.

Secondly, even though most of its letter shapes are unknown or unusual, four of them (“a”, “o”, “i”, and “e”, though this last one is styled as “c”) closely resemble vowels in European languages – not only in shape, but also because if you read these as vowels (precisely as the main EVA transcription does), you end up with many CVCVCV (consonant-vowel) patterned words that seem vaguely pronounceable.

Thirdly, dotted through the Voynich Manuscript is a family of letter-groups that look like “aiv”, “aiiiv”, “aiir”, etc. To most contemporary eyes, this looks like some kind of curious language-pattern: but to European people in the 13th to 16th centuries, this denoted one thing only: page references.

  • The “a” denotes the first quire (bound set of folded vellum or paper leaves), “quire a”.
  • The “i” / “ii” / “iii” / “iiii” denotes the folio (leaf) number within the quire (in Roman numbers).
  • The “r” / “v” denotes “recto” / “verso”, the front-side or rear-side of the leaf.

Circa 1250-1550, this “mini-language” of page references was universally known and recognized across Europe: and hence “aiiv” denotes “quire a, folio ii, verso side” and nothing else.

Therefore, the Voynich Manuscript resembles a document written in a 22-letter European language, contains obvious-looking vowel-shapes that are shared with existing European languages, and scattered throughout apparently has copious page-references to pages within its first quire.

However, what even very clever people continue to fail to notice is that these three precise things (the compact alphabet, the obvious-looking vowels, and the page references) have an exact corollary: that this does not resemble ciphertext – for even by 1440, most European cipher-makers knew enough about the vulnerabilities of vowels to disguise them by use of homophones (i.e. using multiple cipher symbols for the vowels). A ciphertext would not contain unenciphered vowels, not unenciphered page references.

The correct answer to the question is therefore not only that the Voynich Manuscript does resemble an unknown (but CVCVCV-based) European language studded with conventional Roman number page references, but also that it simultaneously does not resemble a ciphertext.

2. Why is the Voynich Manuscript not what it resembles?

I think the big clue is the fact that the page references don’t make any sense as page references.

For a start, even though the Voynich Manuscript probably consisted of fifteen or more quires, the page references that appear throughout its text only ever appear to refer to quire “a” (the first quire). What’s more, the first quire appears not to be marked with any form of “a” marking, which is curious because the whole point of quire signatures was to make sure that the binder bound them together in the correct order. Another odd thing is that there only appears to be references to the first six pages of the first quire.

All very strange: but the biggest giveaway comes from the statistics. Counting the number of instances of the different page references, you’ll see that page references to verso pages apparently outnumber page references to recto pages by eight times. Here are the raw counts (from the Takahashi transcription):-

air ( 564)   aiir ( 112)  aiiir (  1)
aiv (1675)   aiiv (3742)  aiiiv (106)

So, even though these superficially resemble page references, there is absolutely no evidence to suggest that this is what they actually are. In fact, the statistics imply the opposite – that despite their visual resemblance to page references, these are not actually page references.

And if it is correct that these are actually something else masquerading as page references, the entire visual-resemblance house of cards collapses – that is, if things are not what they seem, the other visual presumption (that this is a simple CVCVCV-based European language) necessarily falls down with it.

3. If the page references aren’t page references, what are they?

This is precisely the right question to ask: and so, when I visited the Beinecke Library in early 2006, I decided to spend some time looking at a single page containing plenty of clearly-written page references (as described in The Curse of the Voynich, pp.164-168) to try to answer it.

I chose page f38v, from which here are all the page reference letter clusters – can you now see what it took me hours and hours to notice?

f38v-page-reference-groups

The first thing I (eventually) noticed was that there was something a little odd about the top part of the “v” letter (which EVA wrongly transcribes as “n”, incidentally). Specifically, that many of the clusters appear to have been written using two inks – one forthe main “aiiv” part, and another (often slightly darker) one for the scribal “flourish” at the top.

But then… once you start looking specifically at the “v flourishes”, the next thing you might notice is that some appear to have a dot at the (top-left) end of the v-flourish.

The final thing you might notice is that these dots tend to appear in different places relative to the “aiiv” frame.

My conclusion is that what is happening here is steganography – that the position of the dot at the end of the v-flourish is what (possibly together with the choice of cluster) is enciphering the information here.

But what information is being enciphered in this way? I strgonly suspect that it is enciphering Arabic numbers 1-5 (probably with longer flourishes denoting larger numbers), and with “oiiv” clusters perhaps denoting 6-10. This might explain why we see so many of these “page references” immediately following each other (the famous “daiin daiin” pattern): each “page reference” therefore represents a digit within a multi-digit Arabic number.

However, what is strange is that this is only basically true for “Currier A-language” pages (Prescott Currier noted that, to a large degree, the text in Voynich Manuscript pages behaves in one of only two different ways): for Currier B pages, what seems to happen is that the information is enciphered by using different shaped flourishes for the final “v” character, and no dot.

From all this, I think I can reconstruct how the Voynich Manuscript’s cipher system evolved during its writing. In the early (Currier A) phase, some kind of data (probably Arabic numbers) were steganographically hidden by writing page-reference-like “aiiv” groups and placing a single dot above them. At a later date, however, the author decided (rightly, I think) that this was too obvious, and so went through the text hiding the dots by converting them into flourishes. Whereas in the later (Currier B) phase, the author decided to evolve the writing system to encipher the same data in a subtly different way (though still relying on the basic “page-reference” shape as a starting point).

And so the correct answer to the section’s question is: even though the “page reference” groups resemble page references, I think that they are cryptographic nulls designed to give the author sufficient visual space on the page to steganographically hide something completely different – probably Arabic numbers.

Of course, existing EVA transcriptions capture only the covertext (the nulls), while the actual data is enciphered in the dots hidden by the flourishes. But you have start somewhere, right? 🙂

4, What, then, is Voynichese’s CVCVCV structure concealing?

I am certain that the Voynich Manuscript’s apparent “consonant-vowel”-like structure is another visual trap into which the existing EVA transcription (unfortunately) helps to push people. By making Voynichese seem vaguely pronounceable (“otolal”, “qochey”, “qokeedy”, etc), EVA discourages us from looking at what is actually going on with the letters, while also falsely bolstering the confidence of those sufficiently deceived into believing (wrongly) that Voynichese is written in a real language. Basically, anyone who tells you it’s written in an archaic language has fallen into a gigantic intellectual trap first set five centuries ago.

But what of the CVCVCV structure? Where does that come from?

For the most part, I think that it arises from a late cipher stage known as “verbose cipher” (i.e. enciphering a single plaintext letter as two ciphertext letters). Though not all letters behave in this way, it certainly goes a very long way to explain the behaviour of common groups such as: qo, ol, al, or, ar, ot, ok, of, op, yt, yk, yp, yf, cth, ckh, cfh, cph, ch, sh, air, aiir, od, eo, ee, and eee. If you decompose the text into these subgroups (i.e. that these groups encipher individual tokens in the plaintext) while remembering to parse the “qo” group first, all the superficial CVCVCV behaviour disappears – and (I contend) you will find yourself very much closer to a kind of raw ciphertext stream that is more easily broken.

As supporting evidence, I point to those few places where the author has “twiddled” with the final code-stream to try to disguise obvious repeated patterns, arising from repeated letters in the plaintext (code-makers hate repeated patterns in their ciphertext). Perhaps the most notable of these is on f15v, where the “or” pattern appears three times in succession on line 1, and four times in a row on lines 2:-

f15v-space-transposition

I think that the author has added spaces in here to try to disguise the repeated “or” group: in line 1, he has inserted a space to turn “ororor” into “oror or“, while in line 2 he has inserted three spaces to turn “orororor” into “or or oro r“. I’m not fooled by this – are you?

I predict here that that “or” is enciphering “c” or “x” (probably “c”), and that the plaintext reads “ccc … cccc”: but you guessed that already, right?

5. Even if this is right, how does it help us break the Voynich?

I don’t believe for a moment that this explains the whole of the Voynichese cipher system: there are plenty of subtly surprising features that any proposed solution would also need to explain, such as:-

  • Precisely how (and why) Currier A and Currier B differ (for example, the whole word-initial “l” thing)
  • Why “yk / yt / yf / yp” occur more in labels than in normal paragraphs
  • Why so few non-trivial words appear more than once across the whole manuscript text
  • What “4o” codes for (I suspect a common initial-letter expansion, i.e. [qo] + ‘c’ –> ‘con’)
  • What word-initial “8” codes for (I suspect “&”)
  • What non-word-initial “8” and “9” code for (I suspect ‘contraction’ and ‘truncation’)
  • Whether the ciphering system is stateless or stateful (but that’s another story)
  • What “Neal keys” denote (but that’s another story, too)
  • etc

However, what I do believe is that all the above lays down the basic groundwork from which any sensible cipher attack would need to launch forwards. I do not share the widely-held pessimistic view that the Voynich is somehow intrinsically unbreakable – on the contrary, it is an all-too-human artefact from a specific time (between 1450 and 1500) and place (probably Northern Italy, though Germany is possible too), and the craft techniques it deftly uses to conceal its content from us are both far from invisible and far from infallible.

If you take the basic steps I describe above to look beyond the deliberate deception and the mythology, then I am certain you will find yourself on the right path towards seeing clearly both what the Voynich Manuscript actually is and how its cipher system works. Let me know when you’ve broken it! 🙂

Incidentally, there’s plenty more related stuff in my 2006 book (which is where the two diagrams above came from, p.165 and p.160 respectively)… but you knew that already, I’m sure. 🙂

Finally, the Voynich Manuscript has weaseled its way into a serious contemporary medium – the webcomic. Here’s a link to some nerdy stick-figure Voynichian fun from XKCD (“a webcomic of romance, sarcasm, math and language“): its brand new theory should raise at least half a smile. 😉

Meanwhile, someone else has suggested on their blog that the Voynich Manuscript is the world’s first hippy/nerd manifesto. Is the leading edge of a wave – are the nerds reclaiming the VMs?

Here’s yet another cipher-tinged literary genre I wasn’t previously aware of – the ex-Mormon novel. As a just-released exemplar,  “Latter-Day Cipher” by ex-Mormon Latayne Scott (author of “The Mormon Mirage”, so her overall position should be no great surprise) appears to do a pretty good job of tackling contentious Mormon issues – along the lines of ‘if certainty is God-given, why do His interpreters on earth keep changing their minds?’

Her novel has a socialite killed with “strange markings carved into her flesh and a note written in a 19th Century code“: and so, of course, it is to the alphabet of the Anthon Transcript that her title appears to refer [Update: it actually refers to the phonetic Deseret Alphabet, developed in the 1850s to teach English to immigrants. Thanks for the correction, Latayne!] Sounds like quite a fun read to me (though perhaps 12 million Mormons may beg to differ).

Actually, this all reminds me of an unexpected parallel I forgot to mention in that previous post… between the golden plates and the Anthon Transcript (that signalled the founding of the Mormon Church) and the Golden Dawn Cipher Manuscripts (that signalled the founding of the Hermetic Order of the Golden Dawn). How similar yet dissimilar!

Incidentally, everyone knows about famous Mormons (such as the Osmond family, Matthew Modine, and Ted Bundy) but what about famous Golden Dawn members? Well… Aleister Crowley aside, the GD had as members [according to Wikipedia, so be ready with your pinch of salt] the poet Yeats, Bram Stoker, Gustav Meyrink, Arnold Bennett, and Edith Nesbit (yes, she of “The Railway Children” fame). Just so you’re prepared for the next pub quiz! 🙂

Would we recognize the solution to the Voynich Manuscript even if it was right in front of us?

Some people believe that it continues to evade us because our expectations of where we should be looking are wrong: in other words, that, pace Henri Atlan (as quoted by Cornelius Castoriadis), we have got into the habit of looking beneath lampposts for our key because that’s where the light is better. The corollary is that after all this time, the actual solution is more likely to be lurking in the darkness, surely?

I’m not so pessimistic: we “moderns” have managed to use the primary evidence (i.e. that which is disclosed by the manuscript itself) to accrete a fairly substantial body of codicological evidence, which any new theory would need to address. While this hardly amounts to a monoptic “mainstream” viewpoint (just look at the debates I’ve had with Glen Claston), to a very significant degree it isn’t something you can easily brush aside. Yet the fact that Glen and I now broadly agree on most of the evolutionary stages through which the VMs passed en route to its final state is both wonderful (given our long-standing differences) and worrying (because it bolsters any tendency to intolerance).

And so our ongoing challenge is to work out whether our knowledge about the VMs is more solid and advanced than ever before (if we’re basically right), or more fragile and misguided (if we’re basically wrong). Still, we all persist in chipping away at the sheer face, hoping to trigger some kind of epistemological avalanche, whereby removing one tiny stone releases an entire cascade of unexpected evidence. As always, progress remains slow: but is this because the Voynich Manuscript’s encryption system is so hard, or because we’re hacking away at completely the wrong mountain?

Every once in a while, entirely unknown Voynich theorists swing into view: these have often been looking at and thinking about the Voynich Manuscript for years (if not decades), and bring with them a kind of fresh air of hope (privately, many Voynich researchers are jaded and pessimistic, but that’s a bit of a secret), but also fairly unsophisticated claims that we have seen close variants of in the past.

So, in the big scheme of things, Jody Maat’s newly-proposed way of reading the Voynich Manuscript is actually remarkably familiar: his interpretation of Voynichese as a kind of vaguely polyglot “Old Dutch” (it plainly isn’t “Old Dutch” otherwise) reprises Leo Levitov’s claims in very many ways, though you can also find echoes in it of various other Middle European Voynich ‘translation’ claims we’ve seen over the years – for example, Jim Child’s theory and Beatrice Gwynn’s theory.

All the same, just as with Brumbaugh in the 1970s, there are places where Jody’s reading does seem briefly to make sense, as if the pages were lit by a flickering candle: but having myself tried to duplicate his reading on other pages, this only seems to happen once or twice per page – not statistically significant. Just as with Levitov (and with Leonell Strong’s decipherment, to be honest), only by dramatically lowering the quality bar of what you are prepared to accept as “language” does it even remotely make sense.

I have often heard it said that it would somehow ‘make sense’ for a cipher’s plaintext to be written with copious misspellings – because, for example, it was the profusion of stock phrases (such as “HEIL HITLER”, of course) and formulaic weather reports that most helped the Allies crack Enigma. However, to do this to the degree required here would imply an anachronistic level of cryptographic sophistication. And for it to makes sense as a language would require a yet lower quality bar, dipping ever closer to nonsense or babble.

I’m sorry, Jody: for all your desire to read the Voynich Manuscript and the considerable empathy you feel with its subject matter, drawings, and author, I honestly don’t think this is the answer. The subtle genius of the Voynich Manuscript – and this is something that I’ve been pointing out for years – is that it was written in a cipher which had been constructed to resemble an unknown European language. What you are reading, then, is the covertext: the letters on the surface, while the actual meaning swims just beneath.

Please don’t feel bad about your having fallen into a deep intellectual trap – you are in excellent company!

At the heart of the founding mythology of the Mormon Church sits a small fragmentary document called the Anthon Transcript. The claim linked with it is that it was copied from gold plates revealed by angels to the 18-year-old Mormon prophet Joseph Smith Jr in 1823, and that its “Caractors” were written in the “reformed Egyptian” of the (otherwise unknown) “Nephite” people, who had (allegedly) emigrated to America from Jerusalem two and a half millennia earlier.

Of course, extraordinary claims need at least some kind of evidence – and so the key historical question is whether or not the Transcript provides that. The other pages of the transcript (if they existed at all) have long disappeared, while the eponymous Professor Anthon (who had originally been said to have somehow verified Smith’s translation) later reported “that the marks in the paper appeared to be merely an imitation of various alphabetical characters, and had, in my opinion, no meaning at all connected with them“. After the Transcript had been shown to Charles Anthon, its “translation” was carried out by Joseph Smith who acted as a “seer” to channel it: to do this, Smith used either a giant pair of golden spectacles (that had been found with the golden plates), or one or two stones placed in the bottom of an upturned tall hat, the latter a scrying technique he used before and after 1823 when searching for buried treasure.

Regardless of all that, my particular interest in the Anthon Transcript is as a cipher historian looking at a single contentious document. Back in 2004, I exchanged a number of emails with Richard Stout, who has researched extensively on this subject to build up his own (very specific) claims. However, what follows below relates to my own opinion of what we can learn about the Transcript purely from its alphabet, and is competely independent of Richard’s ideas and interpretations. (And no, I’m neither a Christian, a Mormon, nor even an ex-Mormon.)

anthon-transcript-small

What kind of document is this? Much as people ask of the Voynich Manuscript, is it shorthand, cipher, a lost ancient language, or some kind of deception? Furthermore, is it an original document, a copy of a document, a copy of some letter-shapes from a real document, or a purely made-up thing? The hope here is that we can use its alphabet to help resolve any of these open questions: so let’s see what we find…

Are the letters shorthand? Just about anyone who has grasped the history of shorthand would quickly conclude that it is not a tachygraphic (“fast writing”) system, insofar as it is (as can be seen from the many fussy and overflourished letter-shapes) clearly not optimized for writing speed. Because it appears neither concise, memorizable, speedy, nor unambiguous, it’s a pretty poor match for the whole idea of shorthand.

It should be clear, then, that the Transcript itself was not written in a shorthand system: yet I do hear what Richard Stout says when he suggests links between individual Anthon Transcript letters and letters taken from a whole range of shorthand systems (apparently including many Tironian notae).

Yet I must caution that even an apparently well-defined character can trace out multiple independent paths through time. As a prime example, Stout notes that the filled-in box shape (which appears three times in the Anthon Transcript) appears in William Addy’s (1618-1695) shorthand system, where it denotes the word “altogether“. Addy’s system (first printed in 1684) was based on Jeremiah Rich’s earlier system: curiously, Addy later published a shorthand version of the Bible (1687), though this was perhaps stenographic oneupmanship to trump Rich’s shorthand version of the New Testament (1673-1676). The problem we have is that, as we saw here only a few days ago, Cod. Pal. Germ. 597 also includes a solid square in its first alchemical cipher alphabet… some 250 years before Addy. So, what was the actual source for the Transcript’s filled square shape – 15th century alchemy, or late 17th century stenography?

All the same, Isaac Pitman’s “History of Shorthand” (I own a copy of the 3rd edition) describes Jeremiah Rich’s system as being “encumbered with long lists of arbitrary characters to represent words which could not be written in any moderate space of time by their respective letters” (p.22), an “absurdity” whose “practice seems to have been at its height in the days of Rich” (p.23), with its 300 “arbitraries“. To Pitman’s roving historical eye, Rich’s follower Addy merits only a single paragraph (p.26). But helpfully, Pitman continues with a long list of people who produced related systems: Nathaniel Stringer (1680), William Addy (1695), Dr Doddridge (published in Oxford in 1805!), Farthing (1654), George Delgarno (1656), Everardt (1658), Noah Bridges (1659), William Facy (1672), William Mason (1672), John West (1690), Thomas Gurney (1751) [though Gurney finally dropped the arbitraries!]… and notes that Rich’s system (and/or its many variants and descendants) were still being taught early in the 19th century.

So it would seem that Stout is broadly on target with comparisons with the over-complex systems initially devised by Rich and Addy. I think it would be fair to say that if the Anthon Transcript’s alphabet can at all be said to have a parentage, it lies in the family of overcomplex shorthand systems deriving from Jeremiah Rich, and specifically in the ornate (and occasionally impractical) arbitrary signs added to them.

There must have been more than a hundred subtly different (usually plagiarised) shorthand systems based on Jeremiah Rich’s original, with many of them still in surprisingly active use circa 1823: and so I would predict that finding the closest match to the source of (or the inspiration for) the Anthon Transcript would likely be a perfectly possible (if painstaking) job, given a copy of Pitman’s book as a starting point.

Are the letters Tironian notae? Stout suggest comparisons between various individual Transcript letter-shapes and the sprawling array of Tironian notae accumulated over the centuries. However, my judgment is that you could construct visual correlations between just about any non-pictographic alphabet and Tironian notae: and so I’m very far from convinced that there is any immediate causality implicit in the choice of letter shapes.

Are the letters “reformed hieroglyphics”? Given that I place the Anthon Transcript’s alphabet firmly within the visual & stylistic tradition of arbitrary-loaded shorthands (which themselves all ultimately derive from Jeremiah Rich’s mid-seventeenth century shorthand system, even if the Anthon Transcript’s text is apparently not written in a shorthand system), I have to say that I am at a loss to see any conceivable connection with hieroglyphics (or even with Demotic, for that matter).

Are the letters written in an Old Irish shorthand? Richard Stout points to one shape in particular (you can see an example on line 2 of the Anthon Transcript, two glyphs to the right of the filled square) comprising two left-curving lines joined by a horizontal line: he points to a resemblance with an Irish glyph used on “page 311” of the late fourteenth century Book of Ballymote, and continues by pointing to resemblances between rows of dots elsewhere in the same manuscript and in the Anthon Transcript.

Yet dots were used by medieval monks across Europe to encipher vowels: so I’m far from sold on the idea that rows of dots (which, in any case, were used a quite different way in the Transcript) link this to the Book of Ballymote at all.

Stout’s proposed Irish manuscript connection seems to be an apologium for other Mormon cipher claim, in which the other main source document was allegedly written in some kind of old Irish writing. But I don’t really see that connection here at all: before I get too excited about a single letter-shape, I’d want to have trawled through the relevant shorthand archives first.

Are the letters a cipher alphabet? The Anthon Transcript seems quite ill-judged for this, too: what on earth would any cipher alphabet be doing with a nine-vertical-strokes-plus underline shape (line 2)? This seems to be unnecessarily showy – and in fact, I would suggest that this sort of “prison-cell counting” shape is more the kind of thing you would see in a child’s made-up cipher to denote ’10’ (or possibly ‘X’).

Regardless, the whole document could possibly be written in a cipher: and so I think it would be a good idea to subject a transcription of the Transcript to some statistical tests. It would be more credible were this to be done by someone outside of the Mormon Church (in contrast to previous attempts, according to Wikipedia). It’s true that there are some repeated patterns inside the Transcript, sure: but might these amount to complete words, phrases, or even sentences? Right now, I’m not sure: it looks fairly fragmentary to me.

Are the letter-shapes all fake? I don’t think so: to my eyes, they do give the impression of forming a moderately coherent set of “characters” copied from one or more existing shorthand documents, but with child-like cipher shapes added, very probably to give the whole thing slightly more of an ‘exotic’ feel. More than anything else, I think it is this awkward blend of the nuanced and the naive that makes it seem unconvincing as a real piece of text.

Because the ratio of arbitraries to simple strokes also seems quite high to my eyes, I would also be unsurprised if the author had cherry-picked the interesting-looking letter-shapes from a shorthand source.

In summary, probably the least controversial inference you can draw from the lettershapes is their post-1650 dating: the embellished “H” shape and the probable links with Rich-family shorthand letter-shapes indicate that this is in no way ancient.

In the absence of any other credible information, the most likely story I can reconstruct is that the “caractors” in the Anthon Transcript were copied in no particular order from a shorthand Bible (or possibly a shorthand diary), with various other letter-shapes added to make the overall alphabet look more ‘exotic’, or even “hieroglyphic” (even though, to our modern eyes, these singularly fail to have the desired effect). I would also be fairly unsurprised if the same shorthand Bible itself was subsequently used as a prop to convince skeptics – in short, that this was the Detroit Manuscript itself (but which, like the rest of the Anthon Transcript, subsequently disappeared from sight).

Of course, a single good piece of evidence could well refute all of this… but I haven’t seen it yet.

What do you think?


Post update: a very big thank you! to Richard Stout for suggesting corrections to the first two paragraphs – much appreciated! 🙂

Here’s a vogue-ish detail from the Voynich Manuscript – the (claimed) “armadillo” in the middle-left margin on page f80v. Of course, if this can be proven to be intentionally depicting an animal from the New World, then a lot of other dating evidence becomes secondary. But of course, this kind of controversy is nothing new: you only have to think of the decades-long hoo-ha over the (claimed) New World sunflowers.

catoblepas-enhanced

Armadillo proponents “read” this image as having a tail (on the left), three legs (with the left foreleg therefore tucked behind the head on the right), and a kind of upside-down armoured armadillo head facing backwards (with a sort of smiley cartoon mouth). Fair enough.

By way of contrast, I argue that because everything else in Quire 13 appears to be water-related (plumbing, baths, steam, rainbow, pools, etc), this is probably a depiction of a catoblepas – a fearsome creature Leonardo da Vinci (and doubtless many of his contemporaries) believed lived at the source of the Niger river, and whose bull-like head was so heavy that it permanently hung down to near the ground.

Specifically: what appears to the pro-armadillo contingent to be a tail (purple arrow), I read as a left rear leg, making all four legs visible – and what they read as an armoured armadillo head, I read as a pair of bull-like flat horns at the back of a down-turned head.

catoblepas-enhanced-annotated

All the same, it’s not like I can’t ‘see’ the armadillo: it’s a lot like one of those optical illusions (such as the famous old lady / young girl drawing) where you can flip between two parallel readings almost at will.

But the odd thing here is that both the armadillo and the catoblepas might be equally correct. It doesn’t take a great deal of sophisticated codicology to look at the line strengths (in the areas ringed blue above) and note that a few key lines are in a darker ink, quite different from the ink used for the wolkenband-like decoration just below it. Could it simply be that some 17th century owner (for whom the catoblepas was probably never part of their conceptual landscape) thought this picture somehow resembled an armadillo, and emended it to strengthen that resemblance? I think that this is very probably precisely what happened here.

Now, this is precisely the kind of contingent, layered, conjectural historical explanation (basically, an intellectual history of art) that Richard SantaColoma has long enjoyed lambasting. Specifically, he sees any explanation that appeals to layered codicology as fully worthy of his scorn – as though it’s merely constructed as an apologium to keep the faith with the existing ‘mainstream’ dating evidence.

But actually, layered codicology hypotheses are among the most brutally (and easily) testable of historical ideas – unless two layers of ink added many decades apart just happened to use exactly the same raw materials (and in the same proportions), we will ultimately be able to differentiate them… or not.

However, unless people explicitly propose such layering hypotheses, nobody would think to do such tests – they’d perhaps spend all their codicological efforts on f116v (a valid investment, to be sure, but it’s only one of many possible areas of the Voynich Manuscript that should be tested for revealing information).

Indeed, the whole point of such historical hypotheses is not to prove historical narratives in and of themselves, but rather to lay the underlying ideas open to physical mechanisms of disproof. Bluntly put, any given hypothesis is usually of little or no value if it cannot be specifically disproved (because direct causative proof is as rare as hen’s teeth in history).

In the absence of any suitable tests on f80v, however, both viewpoints (and indeed all other fairly sensible viewpoints) remain in a suspended state of vague possibility, hypothetical kites floated carelessly into an unthreatening breeze.

Hmmm… how I long for such tests!

Having examined many historical ciphers over the last few years, I’d say that there are only a handful of ways in which individal ‘cipherbets’  (i.e. “cipher alphabets”) are typically constructed. The big fallacy is to think that people building ciphers are only concerned with a need for long-term message security, when actually there are plenty of other important short term needs they have to attend to, such as: ease of construction, usability, speed of deciphering, aide-memoires, etc. Broadly speaking, these needs express themselves in the following aspects of the cipher alphabet:-

  1. Symmetrical – where the letter-shapes are based around a geometric / symmetrical pattern
  2. Incremental – where the cipher alphabet is adapted from a pre-existing cipher
  3. Practical – where the letter-shapes are optimized for speed of writing
  4. Stylistic – to give an overall effect of looking exotic / strange / occult / ancient
  5. Mnemonic – where letter-shapes contain associative reminders about the plaintext letter
  6. Steganographic – where letter-shapes hide visual hints as to the plaintext shape
  7. Deceptive – where letter-shapes vary in subtle ways to hinder transcription / decipherment
  8. Distracting – where letter-shapes are constructed to resemble a different type of text

Apart from ‘pure’ symmetrical ciphers (such as the various pigpen and Masonic ciphers, or indeed Edward Elgar’s Dorabella cipher alphabet),I would say that most cipher alphabets tend to present a blend of only two or three of these, which you can sensibly read as reflecting the most pressing needs of the encipherer. As brief examples, you might note that many of the Sforza ciphers were primarily [incremental + practical] (and occasionally stylistic, such as the 1464 cipher for Tristano Sforza), while I’d predict that Cod. Pal. Germ. 597 will turn out to be [mnemonic + stylistic].

What, then, of the Voynich Manuscript’s cipher alphabet? Of course, the hope is that if we can classify its cipher alphabet, we might be able to “read” the needs of its encipherer.

The first thing to note is Steve Ekwall’s extraordinarily specific claim about the four gallows shapes: he asserts that these four shapes (and their four ‘ch’ strikethrough versions) specifically depict the eight folding states of the deciphering paper key – basically, that these are mnemonic. While that would make a lot of sense, debating that in sufficient detail is something I’ll take on another time.

Regardless, my position on the Voynich Manuscript’s alphabet is simply that it is a tour de force of cipher construction technique, insofar as I think you can see traces of symmetrical, incremental, practical, stylistic, steganographic, deceptive and distracting aspects (which, curiously enough, would make Ekwall’s mnemonic the only one missing from the list). Here they are in more detail:-

  • Symmetrical
    The four gallows shapes exhibit an explicit structural symmetry – one leg or two legs, one loop or two loops.
  • Incremental
    The four strikethrough gallows look to have been developed from an earlier (probably less secure) cipher system based purely on the four simple gallows. I also suspect that the “e / ee / eee / ch / sh” letter-shapes represent vowels, and that they were in some way incrementally adapted from a variation of the “dots for vowels” ciphers used by some medieval monks.
  • Practical
    The Voynich Manuscript’s letter-shapes have been consciously constructed for ease and speed of writing, far more so than typical cipher alphabets of the time.
  • Stylistic
    I would argue that the overall form of the alphabet has been designed with older (non-cipher) alphabets in mind – that is, that the stylistics of the letter-shapes was deliberately chosen to resemble an archaic (but lost) alphabet.  Note (mainly for Elmar Vogt): I do not therefore believe that the Voynich Manuscript was meant to resemble an enciphered medieval herbal, but rather that it was meant to ressmble an unenciphered herbal written in an archaic (but lost) language. I fail to see how this makes it unlikely to be smuggled past Venetian border guards… but that’s an argument for another day!
  • Distracting
    As I argued in The Curse and elsewhere on this blog, I am convinced beyond any doubt that the “aiir” and “aiiv” cipher letter groups in the VMs are specifically meant to resemble medieval page references (i.e. “a ii v” denotes “[quire] a, [folio] ii, v[erso]”), but that this is meant to distract contemporary eyes from looking in detail beyond that.
  • Deceptive
    I believe that the actual Arabic numbers enciphered by the “aiiv” family are to be read from the shape and position of the final flourish of the “v” – and that whereas the (earlier) Currier A pages used a system based on the position of the flourish, the (later) Currier B pages used a system based on the shape of the flourish. This would also point to incremental cipherbet change during the overall writing process!

There is one further one to discuss – steganographic. If you stare at the Voynich Manuscript’s cipher alphabet long enough, I contend that you will (eventually) grasp the logic underlying most of the letter-shapes (as per the discussion above). However, you are still left with a few odd “spares” (such as “4o”, “8” and “9”) that don’t fit into the symmetric families and groups described above. What is going on with them?

In The Curse, I argued (based on the statistics) that “4o” was probably encoding a word-initial abbreviation sign: what I now think is fascinating is the notion that the letter shape for the “4” might also be steganographically hiding a horizontal stroke, as an aide-memoire to the decipherer.

Similarly, I argued (also based on the statistics) that the “8” shape and the “9” shapes were probably encoding word-middle and word-final abbreviation signs (respectively): similarly, I think that these are steganographically hiding a curved half-loop at the top of each of them, the typical mid-Quattrocento sign denoting contraction and abbreviation. I’ve marked these hidden strokes in red below:-

qokedy-highlighted

Actually, I suspect the author might possibly have given a little bit of the game away on page f2r, via a slip of the pen: para 2 line 3 word 1 is “4oP9” with a curved contraction half-loop added over the “o”, which I think might well denote a contraction of “4o” + “oP” + “9”. But that, too, is another story. 🙂

All in all, I’d say that if the Voynich Manuscript’s cipher system turns out to have broadly the same degree of subtlety and roundedness exhibited by its cipher alphabet, then no wonder it has remained unbroken for centuries. It has not only the Everest of cipher systems, but also the Rolls Royce of cipher alphabets!

(1) A big hello to Rich SantaColoma as he emerges from the VMs “List Closet” into the bright(-ish) light of the blogosphere. His “New Atlantis Voynich Theory” blog sets out his basic stall – which is that, thanks to his “Nagging Sense of Newness” about the Voynich Manuscript, he harbours strong doubts that it is anywhere near as old as mainstream Voynich researchers (such as, errrm, me, apparently) think it is.

The truth is that historians have basically frittered the last century away on foolish conceits (such as the Roger Bacon thing, the Dee-and-Kelley thing, or the it’s-a-hoax-because-the-NSA-can’t-break-it thing), and so until such time as a single proper codicological and palaeographical analysis comes along to define the research problem properly, we’ll remain in the same old evidential free-fall.

As for me, I’m sticking with John Manly’s assessment (that the quire numbers were added in the 15th century) as a basic starting point for the dating: and if that turns out to be wrong, then so be it. That doesn’t make me “mainstream”, just… old-fashioned, I guess. 🙂

Incidentally, it’s a little-known fact that the Beinecke’s catalogue originally listed MS 408 as fifteenth century, but that in the 1970s (perhaps as a result of Brumbaugh’s wobbly claims?) this got extended forwards to the sixteenth century… I suspect they got it right the first time round.

PS: Rich, given that I think Q13 has a water theme, I’m sticking with the catoblepas (with its heavy head hanging down) rather than the armadillo – given that even Leonardo wrote that the catoblepas was found at the Nigricapo [the source of the Niger river], it was very much part of the mental landscape of the Florentine Quattrocento.

(2) And another big hello to (the apparently email-address-less?) “acevoynich” and [his/her] eponymic “acevoynich’s blog“. Though given [his/her] apparent inability to find Cipher Mysteries, the Voynich Manuscript Mailing List, The Journal of Voynich Studies, voynich.nu, the Voynich Wikipedia site, the Voynich dmoz entry, etc (let alone D’Imperio or The Curse) I have to say I’m somewhat dubious that [he/she] is, as [he/she] claims, actually writing a “thesis”. Does [he/she] really have a research question in mind, or is [he/she] just a [troll/trollette]? Hmmm…

Still, acevoynich feels confident to ask the five key W-questions of the big V-manuscript: who, what, where, when, why. Again, I refer the honourable member to my previous answer: and add that until such time as we have the forensic side (the “What happened?” question) considerably more locked down than it is at the present, I suspect that these W’s are (sad as it is) actually more harmful than helpful. Oh well! 🙁