Voynich Manuscript Archives - Page 22 of 74

05Aug 2013

Voynich “Glossodahlia” book…

Let’s imagine you had two fairly unhealthy (but specific) obsessions: (1) retro jars, bottles and containers, and (2) the Voynich Manuscript. When added together, might these two wrongs somehow make a right?

If that just happens to be a question you have long pondered, then I’m pleased to be able to tell you that your wait is over! The free electronic book Glossodahlia by Tarek Joseph Chemaly, published by 7UPstairs Publishing, surely places this whole contentious issue beyond discussion. Tarek writes:-

The Voynich manuscript has been decoded. Its flowers have given up their secret. They speak in tongues – glossolalia. And they tell stories of brokenhearted cosmic lovers and of a retired intergalactic bureaucrat as she tends to her garden once baking is done.

With my Voynich researcher hat on, I’d point out this whole thing feels like a collage scraped from the Beinecke’s old “CopyFlo” images. Also, not all the pages are plants (a couple of pharma jars have crept in as well), and tiny scraps of Voynichese pepper the edges of the collaged images, like hardy barnacles clinging to curiously shaped ships.

Anyway, make of it what you will. 🙂

29Jul 2013

Sandra And Woo do the Voynich… The Book of Woo!

by nickpelling ⋅ 15 Comments

Online webcomic Sandra And Woo has just taken a detour into CryptoLand, with a Voynich-inspired page called The Book of Woo to celebrate its 500th edition. What’s more, author Oliver Knöerzer (AKA “Kernel River Zoo”) has offered a $250 reward to “the person who is able to provide a decipherment that’s sufficiently close to the plain text“, plus “another $100 to two charities determined by the readers who contributed the most useful information for breaking the code.” Really, Oliver, I’d have helped regardless. 😉

The Book of Woo’s most obvious predecessor would seem to be the Codex Seraphinianus, which is also “primarily a work of art, not a puzzle for the general public“, though I wouldn’t describe the Book of Woo as being quite as hardcore as that (but then again, what is?). The Vick Industries cipher seems to be a more design-oriented mindset entirely, though the art-house rationale behind that has yet to emerge into the light.

How is anyone supposed to decrypt The Book Of Woo? Helpfully, Knöerzer does throw a handful of hints in our path, though mainly about what it isn’t rather than about what it is. He says:-

* The encryption isn’t based on an algorithm only suitable for computers which executes a loop 100 times or something like that.
* The encryption isn’t based on some sort of device or mechanism that is hard to get.
* No “classical” steganographic method was used since that would just be impossibly hard to crack.
* The plain text is some sort of literature, as one can guess from Woo’s comment and the illustrations. A lot of time went into the plain text as well, it’s not just a copy of the first page of Rascal or something like that.

But he also warns that “[if] you think you can simply carry out a frequency analysis on the letters and be able to reconstruct the English or German plain text this way, well, that’s just a waste of time.” Indeed, even a brief look at the text reveals blocks of characters arranged in a very artificial CVCV (consonant-vowel-consonant-vowel) manner. There are also quite a few patterns that are repeated multiple times: here’s a colourized section of the first page, so you can see a bit of what I’m talking about…

What’s going on here? Well… I’ve had a few brief email exchanges with Oliver recently, so have possibly at least a flicker of an idea. And given that he has already openly flagged both the Voynich Manuscript and my book on it (The Curse of the Voynich) as having been useful (he’s even reused the Voynich’s “T” gallows character in his cipher alphabet), it probably wouldn’t hurt to recap a few Voynich-related observations here. 🙂

The first thing to say about ‘Voynichese’ (the structure that shapes the Voynich Manuscript’s text) is that there seem to be two main schools of thought: (a) that it’s a cipher system that for some reason our statistical toolkits aren’t able to help us much with, and (b) that it’s a real language but we’re too in love with our analyses to see the bleedin’ obvious.

(For the record, I’m in the (a) camp, which means that when I look at a map of all the different types of Voynichese evidence, I want to understand what kind of trick was used to confound all the different statistical tests, rather than throw my hands up in the air and say “Stats, shmats!”.)

The second thing to note is that almost all of the Voynich Manuscript is written using a very compact alphabet (roughly 22 characters), whereas The Book of Woo uses something like fifty unique shapes (I haven’t transcribed it yet, but that’s how it looks). What connects them is that they are both very predictable at the character level… up to a point. That is, in some circumstances you can reliably predict what the next character along is going to be, but in other circumstances predictions can be of little use.

(For what it’s worth, I believe that it is this specific combination of predictability and unpredictability that convinces people that Voynichese is a language, whereas real languages only tend to work like that in a few very specific ways, e.g. “q” almost always being followed by “u”.)
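To make "predictable up to a point" a little more concrete, here is a minimal Python sketch that measures, for each character, how uncertain the next character is (in bits). Everything in it is my own illustration: the function name `next_char_entropy` and the English sample sentence are assumptions, and it is a generic information-theory measure rather than anything taken from a Voynichese or Book of Woo transcription.

```python
from collections import Counter, defaultdict
from math import log2


def next_char_entropy(text):
    """Map each character to the entropy (in bits) of the character after it."""
    followers = defaultdict(Counter)
    for current, following in zip(text, text[1:]):
        followers[current][following] += 1

    entropy = {}
    for current, counts in followers.items():
        total = sum(counts.values())
        # Zero bits means the next character is fully predictable.
        entropy[current] = sum(
            (n / total) * log2(total / n) for n in counts.values()
        )
    return entropy


# Illustrative English sample (an assumption, not a transcription).
sample = "the quick queen quietly quit the quiz"
entropy = next_char_entropy(sample)
for ch in "qut":
    print(repr(ch), round(entropy[ch], 2))
```

In this sample "q" scores zero because it is always followed by "u", while "u" and "t" do not. Run over a real transcription, a text with many characters scoring near zero and a few scoring high would show the mixed behaviour described above.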

Trying to account for this property ultimately led me to conclude that the Voynich Manuscript in part uses “verbose cipher”, i.e. employing pairs or groups of letters to encipher single letters in a misleading way. For example, the Voynichese letter-pair “or” gets repeated immediately after itself a number of times, with the best known examples being on page f15v:-

Do any real-life languages do this? I don’t think so, but that remains a matter of opinion.

The Voynich Manuscript has a large number of extra curious properties that I believe point to other tricksy mechanisms (e.g. in-page transposition of some sort, if you please), but my suspicion right now (based only on having a nose around it) is that Oliver may have found these unnecessarily abstruse to build a cipher around.

No: I think what’s going on in The Book Of Woo will turn out to be largely based around verbose cipher – specifically a combination of paired letters. Having said that, the big problem with a simple verbose cipher is that it is, well, as verbose as it sounds: and so to make it not bloat as badly as a Microsoft application, it needs some compression tricks to be used at the same time.
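For anyone unsure what a simple verbose cipher looks like in practice, here is a toy Python sketch. The table is entirely made up (consonant-vowel pairs assigned to a-z in order), and the names `TABLE`, `encipher` and `decipher` are hypothetical: it is not a reconstruction of The Book of Woo's system or of Voynichese, just an illustration of why pairing letters bloats the text.

```python
from itertools import product
import string

# Hypothetical table: every plaintext letter becomes a consonant+vowel pair.
CONSONANTS = "bdfgklmnprst"
VOWELS = "aeiou"
PAIRS = [c + v for c, v in product(CONSONANTS, VOWELS)]
TABLE = dict(zip(string.ascii_lowercase, PAIRS))  # uses the first 26 pairs
REVERSE = {pair: letter for letter, pair in TABLE.items()}


def encipher(plain):
    """Replace each letter with its pair; word breaks are kept as-is."""
    return " ".join(
        "".join(TABLE[ch] for ch in word if ch in TABLE)
        for word in plain.lower().split()
    )


def decipher(cipher):
    """Read the ciphertext two letters at a time and undo the table."""
    return " ".join(
        "".join(REVERSE[word[i:i + 2]] for i in range(0, len(word), 2))
        for word in cipher.split()
    )


message = "the flowers speak in tongues"
secret = encipher(message)
print(secret)
print(len(message.replace(" ", "")), "letters became",
      len(secret.replace(" ", "")))
```

Every word doubles in length, which is exactly the bloat problem: a practical system would need abbreviation or shorthand tricks layered on top to keep word lengths plausible.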

In the case of the Voynich Manuscript, I suspect that verbose cipher gets combined with the kind of scribal abbreviation in use during the 15th century. Similarly, because the overall word-length isn’t too extreme for The Book of Woo, I suspect (a) that certain letters used at the start or end of words will encipher prefixes or suffixes, somewhat like a kind of shorthand; and (b) that it’s more likely to be English than German. 😉 It may well also be that certain letter pairs themselves encipher common letter pairs or even letter triples (such as “the”): these are the kinds of tricks I’d expect to see here being used to disguise the structure.

And yet… words seem to be words (i.e. it’s an aristocrat cryptogram rather than a patristocrat cryptogram), so it’s very much as if he wants to help us, not hinder us. So even though it looks a bit tricky at first glance, maybe it will all fall out nicely in the end. We shall see, hopefully before issue #1000! 😉

27Jul 2013

Voynich Manuscript: an Andean Sky God Cathar theory?

by nickpelling ⋅ 94 Comments

Voynich theories are like radish shoots after Spring rain (as Rudy Cambier likes to say) – they keep on popping up. And here’s a new ~~radish shoot~~ theory, courtesy of Morten St George, whose Andean Sky God website digs deep into a whole range of historical mysteries – Nazca lines, Shakespeare, Cabala, Rosicrucianism, and now the Voynich Manuscript.

According to St George, the 9-rosette castle is likely to be a fortress similar to Carcassonne, but one “destroyed by the Crusaders, i.e. left without ruins“… “Montségur, the final stronghold of the Cathari Church“. So it’s clearly a Cathar document.

Of course, the 15th century radiocarbon dating presents a problem for any Cathar Voynich theory: indeed, St George acknowledges that “it would seem impossible for the Cathars to have written the Voynich because at that time the Cathars no longer existed, at least not anywhere in Europe.”

So… if the Cathars wrote the Voynich Manuscript in the 15th century but they weren’t in Europe, where were they? St George’s response is unexpected yet logical:-

“The plant drawings in the Voynich provide the answer. The Voynich has drawings of more than one hundred exotic plant species, highly detailed drawings from flower to root, all of which represent plants that no one in Europe had ever seen before. Realistically, there is only one place on Earth that can produce such an extraordinary diversity of plant life, and that’s the tropical rainforests of South America, which I shall call Amazonia. The Cathars went to Amazonia.”

In fact, the Voynich Manuscript’s Quire 13 (the ‘balneological section’) has a whole load of drawings of the “elaborate network of conduits, funnels, and containers up in the trees to collect rain water, which they then used for drinking and washing. In the Voynich, drinkable rain water, in contrast to rainforest water, is always depicted in blue color“. Ah, so that is why they’re coloured differently! 😉

However, there is no happy ending for the Cathars in exile: even hundreds of years later, St George is convinced that the Inquisition would hunt down and kill the Cathars in South America. “In such circumstances, the Church of Satan would continue to hunt down the Cathars until the end of time.”

In a worthy piece of soul-searching, St George finishes up his presentation with the following Q&A couplet:-

“Do you think this sounds like the plot of an end-of-times film?

Things are what they are.”

Well…

In the spirit of Rich SantaColoma’s desire to keep all possibilities in play, I freely admit that there is a small chance that Morten St George has stumbled onto something huge here – that the Voynich Manuscript was indeed written by Cathars in exile in South America, before their being finally (if belatedly) obliterated from the pages of history by the Inquisition. (It also doesn’t take much to connect St George’s ideas with Leo Levitov’s (now venerable) Cathar heresy Voynich theory.)

Of course, the real study of history is about far more than enumerating possibilities, because in the hands of the imaginative (let alone of those who really don’t get out enough), there is no list of possibilities that cannot be doubled or tripled in length. Indeed, such possibilities tell us far more about the showboating creative facility of the person or people constructing them than about the real historical artefact itself: the role of the object ultimately reduces to that of a stage on which to play out stories culled from the pareidoliac static of a troubled mind.

And in my opinion, the biggest sign of such trouble is normally when would-be decrypters discover – almost always to their personal surprise and amazement – that their deciphering methodology developed for one particular object also just happens to work on other, apparently unrelated objects. For example, John Stojko not only could read the Voynich (in Old Ukrainian), but was also (as I recall) able to read Etruscan gravestones. It’s tempting to speculate whether he could in fact have used the same approach to “read” any string of letters. “John Stojko Read My Barcode” isn’t yet a T-Shirt slogan, but perhaps it should be.

If all the world’s a stage, then the evil Church conspiracy, the Rosicrucians, Shakespeare, and the Voynich Manuscript are surely the festival side-stages on which the troubled perform their one-man (or indeed one-woman) shows. Curiously for things of such age, history only has a walk-on part in such productions. The play’s the thing, indeed!

23Jul 2013

A Visual Map of Voynich Evidence, Theories & People…

by nickpelling ⋅ 189 Comments

Unless you just happen to have been an expert in Voynichese for a decade or more, making sense of all the evidence and the theories (and even the people) surrounding it can be quite daunting. So I thought I’d help by drawing a map!

From my perspective, the general problem is that once you really latch onto a piece of evidence or a particular angle, you can easily become trapped inside it: and even though the solution you then reach may be entirely logical, it is almost always inconsistent with the other kinds of evidence and types of angle, and hence is almost always nonsensical.

I’d say this is precisely what happened with Gordon Rugg’s hoax theory, Jorge Stolfi’s East Asian language theory, and William Friedman’s artificial language theory – they all relied too heavily on one particular kind of evidence, and so arrived at untenable conclusions. But you will doubtless have your own thoughts on each of these. 🙂

It should also be clear that, like a kind of hummingbird theoretician, I’ve dotted around this diagram over the years, adding different ideas to the mix that try to explain different aspects of the evidence. I still believe that each of these suggestions will turn out to be largely correct, but the big trick will be finding a way – Intellectual History style – of making them all right at the same time!

13Jul 2013

A Tale of Two Voynich Articles…

by nickpelling ⋅ 124 Comments

Two long(-ish) form Voynich manuscript articles emerged recently, one in the Jewish magazine ~~The Tablet~~ “Tablet Magazine”, the other in the New Yorker’s online blog section. These tell us quite a lot – though not really about the Voynich Manuscript itself, but rather about how the Voynich Manuscript is now perceived.

The first article, by Batya Ungar-Sargon, is called Cracking the Voynich Code: The quixotic quest to read meaning in the patterns of a bizarre manuscript that has bedeviled scholars for years.

Her basic take is that “the Voynich Manuscript has become a beacon for a secular community of quasi-Talmudic scholars whose interpretive ingenuity and stamina have few parallels“, so her piece is built around interviews with several of them (including the “patient, tireless” Gordon Rugg, and the “deeply humble” Rich SantaColoma). [She also talked with me on the phone for an hour, but perhaps I didn’t fit her template 😉 ].

Taken as a whole, fitting her article into a primarily Jewish-interest magazine was always going to be a bit of a stretch: William Friedman was Jewish, sure, but that’s a small piece of material to make a full-length dress out of. I can’t help but wonder whether Batya’s ambition is to write long-form pieces for the New Yorker, and that this was a try-out for her portfolio. She clearly writes well, but I don’t think her journalistic instincts are yet fully honed – her article, in my opinion, is still more ‘relating’ than the literary reportage to which she aspires.

The second article – The Unread: The Mystery of the Voynich Manuscript – by Reed Johnson is, coincidentally enough, from the New Yorker blog section. As such, it’s a kind of New Yorker long-form take on a blog post, i.e. longer than a normal blog post, but quite a lot shorter than a typical New Yorker article (I used to subscribe to it, though how I ever found enough time to read each issue I don’t know 🙂 ).

This isn’t Ungar-Sargon-style journalism, but is instead Reed’s telling the story of how he came to waste three years (only three years? Pshaw!) on the Voynich Manuscript – basically, while trying to write his own “Dan Brown–style thriller”, having nearly completed his “M.F.A. in fiction at the University of Virginia” in 2010. He tries to introduce a little light drama into his account (Did he crack the Voynich? Did he finish his book?) but with enough of a wink to astute readers that they know the resolutions long before the end.

Unlike Batya, Reed is not an observer looking in on the Voynich research world from the outside, but is instead an active participant in what he calls the “often fractious” Voynich mailing list. His feeling about this is that “If crowds have any wisdom, soon we should see the fruits of a more recent deciphering project: Internet crowdsourcing“. And yet, he also wonders whether it would be a disappointment for the Voynich Manuscript to be decrypted – that, “no matter how thrilling such a text might be, it [would] remain a disappointment for being closed off, completed — for being, in the end, no longer a mystery“.

My own conclusion is that the Voynich mailing list has become more part of the problem than part of the solution: and that the extraordinarily productive collaboration its early days saw was more down to the small number and high calibre of the participants (Jim Reeds, Jim Gillogly, Jacques Guy, etc), most of whom left the list long ago. Really, the collective wisdom of the crowd very much depends on the crowd you happen to be dealing with: though Reed stops short of showing his hand in this regard, so we end up knowing what happened but not his thoughts or feelings about it. Perhaps the whole card game hasn’t yet concluded for him.

What’s nice about these two articles is that, for all their differences, they are both good examples of clear-headed contemporary writing about the Voynich Manuscript, far from the lurid wodges of mystery-soaked ahistorical fragments I frequently used to see. Indeed, both give an account of the Voynich’s history that is broadly correct, something which simply never happened even a decade ago: perhaps the radiocarbon dating has helped validate the Voynich as a “proper” subject.

And yet… it’s as if something (or someone) is missing from the whole party. The Voynich Manuscript has had many of the best codebreakers of the age (the Friedmans, Manly, Tiltman, etc) examine it closely: yet as these articles show, a lot of contemporary discourse still revolves around the – frankly rather foolish and shallow, I think – postmodernist cipher/hoax tension as exemplified by Gordon Rugg and Rich SantaColoma.

To my mind, it’s as if something really important is missing from the whole conceptual landscape of how the Voynich is perceived, that everyone is somehow in the wrong kind of doubt. We’ve collectively travelled a really long way forward, for sure, but the ideas and insights gained on that journey have all been zapped by a kind of “motivated learning” paralysis, where debate is held in a stasis between powerful epistemological agendas.

It often feels as though, myself excepted (and who listens to what I say, ha!), the Voynich-as-a-genuine-historical-artefact point of view has no champion. I genuinely tire of the way people continually generate possible alternative histories for it, when I’m just about the only person trying to reconstruct the mainstream history they’re so busy fighting against.

I want to ask those “theorists”: why do you find the idea that the Voynich Manuscript was made basically when its radiocarbon dating says it was so dreadfully upsetting? Why do you invest so much time and effort into identifying outlandish alternatives that might possibly be made to work (with a few well-chosen tweaks to the mainstream historical timeline)? Do you not see that, by kicking back so hard against a straightforward historical account that hasn’t even been written yet, you are yourself holding everything back? Can you not see that by doing this you have become part of the problem, not part of the solution?

That is the Voynich Manuscript debate that’s missing, the elephant in the room that nobody wants to talk about. But nobody is writing that particular article, and I’m not sure anyone ever will… and perhaps we’re all worse off for that silence.

26Jun 2013

Montemurro Voynich paper and the “genuine message”…

by nickpelling ⋅ 42 Comments

The news rattling the bars of the Voynich research cage loudest right now is surely the publication of a paper by Marcelo Montemurro and Damián H. Zanette called Keywords and Co-Occurrence Patterns in the Voynich Manuscript: An Information-Theoretic Analysis, deftly summarized in New Scientist as New signs of language surface in mystery Voynich text.

M&Z’s abstract brings out a lot of what they were trying to do – and also points exactly to their mistakes.

“Here we analyse the long-range structure of the manuscript using methods from information theory. We show that the Voynich manuscript presents a complex organization in the distribution of words that is compatible with those found in real language sequences. We are also able to extract some of the most significant semantic word-networks in the text. These results together with some previously known statistical features of the Voynich manuscript, give support to the presence of a genuine message inside the book.“

Central Assumption: the authors implicitly hypothesize that they can get meaningful results for long-range comparisons because Voynichese is homogeneous across all its sections.

…The Problem: this assumption is false (or very nearly so), because there are significant macro-level differences in the way the language in different sections works (Currier A, Currier B, labels) as well as many mid-level differences (Herbal-A, Q13-ese, etc).

Central Conclusion: the authors believe that their language-centric statistical machinery has identified “The thirty most informative words in the Voynich manuscript”.

…The Problem: I’m pretty sure that the authors have in fact identified what are arguably the thirty least informative words in the Voynich Manuscript. (That may be an independently useful result, but it’s probably not really what they were hoping for.)

I’ll explain.

Voynichese is extremely predictable at a letter-level: it has many rigid letter-level adjacency rules (‘4’ is almost always followed by ‘o’, etc) and position rules (4o- is consistently word-initial, -89 is consistently word-final, etc) and a high level of letter-context predictability.
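Adjacency and position rules of this kind are easy to check mechanically once you have a transcription. The Python sketch below is only a rough illustration: the helper names (`follow_rate`, `initial_rate`, `final_rate`) are hypothetical, and the word list is invented in the style of the glyph codes quoted above, so the numbers it prints say nothing about the real manuscript.

```python
def follow_rate(words, first, second):
    """Share of occurrences of `first` immediately followed by `second`."""
    total = sum(w.count(first) for w in words)
    followed = sum(w.count(first + second) for w in words)
    return followed / total if total else 0.0


def initial_rate(words, group):
    """Share of occurrences of `group` that sit at the start of a word."""
    total = sum(w.count(group) for w in words)
    initial = sum(w.startswith(group) for w in words)
    return initial / total if total else 0.0


def final_rate(words, group):
    """Share of occurrences of `group` that sit at the end of a word."""
    total = sum(w.count(group) for w in words)
    final = sum(w.endswith(group) for w in words)
    return final / total if total else 0.0


# Invented sample words (an assumption, NOT a real transcription).
sample_words = ["4ohc89", "4oham", "sc89", "8am", "o89", "4oe", "ohan"]

print("'4' followed by 'o':", follow_rate(sample_words, "4", "o"))
print("'4o' word-initial:  ", initial_rate(sample_words, "4o"))
print("'89' word-final:    ", final_rate(sample_words, "89"))
print("'o' followed by 'h':", follow_rate(sample_words, "o", "h"))
```

Rates close to 1.0 on a real transcription would correspond to the rigid rules described above, while middling rates (like "o" followed by "h" in this toy list) mark the places where the text is free to vary.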

Yet at the same time, it also has a very large dictionary relative to its text size. I often criticize Gordon Rugg for suggesting historically incorrect Cardan grille-like tables (i.e. they’re a century too late for the Voynich’s construction dating) and for inappropriately back-projecting his modern CompSci mindset onto the early Renaissance (i.e. it’s 500+ years too early for the kind of table-driven hackery he proposes). However, he is absolutely right that a reconstructed Voynichese “dictionary” would, to a modern computer scientist’s eyes, look very much as if it had been generated or permuted by some means.

The paradox is therefore that these two apparently opposite aspects of Voynichese are able to coexist: how on earth can we reconcile its letter rigidity & predictability with its wild word variability?
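The "wild word variability" half of this paradox can also be put into rough numbers. Two common yardsticks are the type-token ratio (distinct words divided by total words) and the share of distinct words that occur only once. The Python sketch below uses hypothetical function names and an invented word list, so treat it as a way of framing the measurement rather than as a result.

```python
from collections import Counter


def type_token_ratio(words):
    """Distinct words divided by total words (higher = more varied)."""
    return len(set(words)) / len(words) if words else 0.0


def hapax_share(words):
    """Share of distinct words that occur exactly once."""
    counts = Counter(words)
    if not counts:
        return 0.0
    return sum(1 for n in counts.values() if n == 1) / len(counts)


# Invented sample (an assumption, NOT a real transcription).
sample_words = ["4ohc89", "4oham", "sc89", "4ohc89", "o89", "4oe", "ohan", "sc89"]

print("type-token ratio:", type_token_ratio(sample_words))
print("hapax share:     ", hapax_share(sample_words))
```

Both figures depend heavily on text length, so any comparison with natural-language texts would need samples of matching size.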

I think the key to resolving this is to grasp that there is some kind of generative or confounding principle at work within a rigidly predictable framework. That is, that even though there are lots of rules, these rules act as a kind of “container” for semantic or cryptographic variability to exist within.

Hence I believe that Montemurro’s statistical machinery is identifying “words” that fall within the container layer rather than in the confounded content layer. Hence these are arguably the thirty least informative words in the Voynich Manuscript.

It’s a hard point to understand, let alone accept: the confounding trick (some kind of transposition cipher? some kind of paper cipher machinery? some kind of cipher wheel?) driving Voynichese’s inherent variability remains as profoundly unreachable now as it has been for over 500 years.

My apologies to Montemurro and Zanette, but the central challenge we face isn’t to find new language-based statistical tests to apply to the Voynichese corpus, however clever they may be. Rather, it is to find ways of resolving the Voynich Manuscript’s central paradox: how is it that Voynichese is both letter-rigid and word-variable at the same time?

Incidentally, M & Z conclude in their paper that their results point to a semantic link between the Recipe and Astro sections, and between the Herbal and Pharma sections. Actually, had they been more aware of the codicology analyses that have been done, they would have seen that their results are consistent with the writing phase order.

In fact, there are many indications that what I call Voynichese’s ‘container’ layer above evolved during the writing, with the most obvious evolution being between Currier A and Currier B. I suspect that what their statistical machinery has imperfectly captured is therefore simply a snapshot into the evolution of the container layer, and not anything ‘semantic’ as such.

In short, the aspect of Voynichese that is most nearly homogeneous across all its sections is its “container” layer: so what Montemurro and Zanette have done is make long-range comparisons between evolutions of the container layer. Currently, my best guess is that these are likely to be almost entirely composed of cipher system meta-tokens (shorthand tokens, transposition cipher placeholders, etc) rather than the semantic contents, which appear instead to have been confounded by some means.

So, rather than finding a “genuine message” (as New Scientist put it), perhaps they have instead found a “genuine container” for the message? This may prove to be a very useful result in its own right, but it’s probably not the smoking gun linguistic proof they were hoping to use to discredit Rugg’s tables.

13Jun 2013

The Svelte Voynich…

by nickpelling ⋅ 49 Comments

When I was young, I often used to play Scrabble with my grandmother Win on my way home from school. (By which I mean her maisonette was on my route home, not that we played Scrabble on the bus.) Which probably helps account for the deep-rooted enjoyment I still get from weird and wonderful words, many decades later.

From way back then, my favourite English word has always been “svelte” (though “tergiversate” was nipping at its heels for a couple of weeks last year). The reason I particularly like svelte is that it’s (I’m struggling to describe) ‘productively onomatopoeic’, in that the slow ‘l’-sound in the middle makes it feel elegant (indeed svelte) on the tongue. Really, it’s a word with an unusual (but nicely matching) mouth feel, one that manages to stand out from a dictionary-sized pack. Without getting too synaesthetic on you, to me it’s a kind of David Gower four of a word, a left-handed ping that’s over the boundary before the fielders even notice it’s gone. Something can’t be half-svelte, it’s either got it or it hasn’t.

Svelte also brings right to the fore the mad ragtag heterogeneity of English, the arbitrary coupling together of chance encounters over the millennia. To some it sounds ~~Svedish~~ Swedish (or perhaps a piece of stray Elvish?) but it’s actually a French word (svelte), from an Italian (svelto, “stretched out”), from Vulgar Latin (ex + vellere, i.e. to stretch + out).

(You might therefore suspect that it shares some kind of origin with “vellum” which is also stretched out, but the latter has its roots in “veal”, i.e. young calves: hence vellum is properly fine calfskin.)

Languages are like that: for all their modern apologists, academies, and syntactic niceties, they’re at heart accidental rather than designed. Esperanto and all the other modern conlangs are all very well, but a good part of the charm of real-world languages is the way stray and mongrel words hop in to fill the semantic gaps that inevitably open up as culture mutates and evolves. English obviously needed a word that expressed the presence of svelteness in an object: why else would svelte have succeeded and persisted?

But (and isn’t there always a but in Cipher Mysteries)… where’s all that in the Voynich Manuscript’s language? Even if William Friedman was completely and utterly wrong about the Voynich’s being an artificial constructed language (which he was), I really can see exactly why he thought & believed that. For Voynichese words show such a strong family resemblance – a strongly interlinked productive grammar, if you will – that it almost precludes anything else. Whatever Voynichese is, there is definitely an artificiality to it, or at least an abundance of artifice. I suspect that anyone trying to map Voynichese onto a direct language base will almost inevitably find (to their eventual embarrassment) that it’s just too artificial to be workable: and that’s pretty much what Elizebeth Friedman concluded too.

So here’s your Voynich paradox for the day. I’m sure that there can be no “svelte” in the Voynichese ‘language’ as we see it, because the overwhelming majority of its words arise from a compact productive grammar quite unlike that of a real, heterogeneous, messy, accidental, historic language: and yet the look of Voynichese so resembles a language that it’s hard not to feel as though you’re perpetually a mini-dictionary away from just reading it.

Of course, for me the resolution of this paradox comes down to a well-chosen bunch of steganographic tricks (such as verbose cipher, shorthand, etc) that serve to conceal the plaintext in a misleading form… but you will no doubt have your own theories about how to slice through such a Gordian knot. 🙂

04Jun 2013

David Kahn at the Athenaeum…

by nickpelling ⋅ 71 Comments

A few days ago, I grabbed the chance to meet up with renowned crypto writer David Kahn at the Athenaeum Club in London while he was attending a conference on the Battle of the Atlantic. It was… simply a pleasure.

He very happily signed my well-thumbed copy of “The Codebreakers” (1967), though I have to say it barely seems possible that he wrote his crypto meisterwerk close to half a century ago. He continues to research and write on crypto topics: a collection of his articles (“How I Discovered World War II’s Greatest Spy and Other Stories of Intelligence and Codes“) is due to be released in October 2013.

All that aside, he was eager to know about what was happening in the world of Voynich Manuscript research (and delighted to see my rather battered copy of Gawsewitch’s “Le Code Voynich”, even if it isn’t actually a facsimile edition) and urged me to write a state-of-the-art-circa-2013 summary of it for Cryptologia (he was one of its founders). Incidentally, he half-remembered being told recently that someone had cracked the Dorabella Cipher (which is possible, though slightly unlikely, I’d say).

But alas, my all-too-brief hour was up too soon: I had to leave and once more merge into the grey London streets. Yet the whole thing set me wondering for several days how best to summarize the Voynich Manuscript. Why is it that Voynich researchers can know so much about the manuscript’s minutiae, and yet agree on almost nothing? Why is it that the Voynich’s Wikipedia article is so long, yet says so little? Why write another analysis-paralysis piece on it?

Part of the challenge is that it often feels to me as if nobody has written a single word on the Voynich Manuscript that even remotely does it justice. Rather, it’s as if there’s a honey-pot of non-words and non-phrases for non-historians to dip their paws into, making every article and blog post written on it little more than a sweet (though ultimately unsatisfying) anagram of the preceding ones.

At the same time, in my own Voynich research it’s as if every day is Groundhog Day, where pretty much everyone else in Punxsutawney never learns anything, but instead sticks belligerently to their same futile and unhelpful non-positions, day in and day out. [*] For example, I would agree that it is entirely possible to construct alt.histories where the Voynich post-dates the 15th century (oh yes, and that the palaeography, the codicology, the Art History and the radiocarbon dating are all simultaneously wrong, or perhaps hoaxed in a peculiarly sophisticated way), but why on earth would anyone bother?

It’s a lot like fighting against a kind of post-modernist debating society, for whom the inevitable existence of doubt in any given fact makes it fair game to dismiss it. Such en masse debating may well be a great way of passing time, but it’s surely a lousy way of getting to the truth. I’m not interested in knowing what might conceivably have happened, I want to know what genuinely did happen.

*) All the same, I’m getting pretty good at ice carving. That’s bound to come in useful one day… 🙂

30May 2013

Evidence of bifolio reordering in the Voynich Manuscript…

by nickpelling ⋅ 49 Comments

A correspondent has asked me to summarize the evidence I’ve found in the Voynich Manuscript suggesting bifolio reordering. As long-term Cipher Mysteries readers will know, I laid much of this out in my 2006 book The Curse of the Voynich: but a lot has also emerged in the years since.

A practical starting point here is my long-standing page on the Voynich Manuscript’s codicology. This points to evidence for a whole sequence of fairly direct codicological conclusions:-
(1) The Folio Numbers Are Not Necessarily Correct
(2) The Bifolios Are Not Necessarily The Right Way Up
(3) The Quire Numbers Are Not Necessarily Correct
(4) The Bindings Are Not Necessarily Correct
(5) The Quires Are Not Necessarily In The Correct Order
(6) The Quire Contents Are Not Necessarily Correct
(7) The Paints And Colours Used Are Not Necessarily Original

To this, I’d add some other evidence:-

(8) The quire numbers and the folio numbers are not (quite) consistent.

As John Grove pointed out back in 2002, the Voynich Manuscript’s Q9 (“Quire 9”) was rebound along a different fold after the quire numbers were added but before the folio numbers were added, leaving the Q9 quire number in the wrong place (i.e. not on the back page of its quire). And, as Glen Claston later pointed out, broadly the same thing happened for Q14 (the nine-rosette page): once Q14’s first binding fold collapsed or was damaged, the large multi-folio page was then restitched along a different, less obviously damaged fold, again leaving the Q14 quire number in the wrong place.

Of course, given that most of the (15th century) quire numbers look roughly a century older than the (16th century) folio numbers, a bit of rebinding between the two phases is perhaps to be expected. But all the same, this inconsistency should alert us to the fact that the bifolios were actively being worked on between the quiration and foliation.

(9) Some of the quire numbers’ downstrokes continue within the wrong quires.

I found two clear examples of this (Curse p.18): (a) the downstroke of the ‘9’ in “29” (i.e. ‘secund-us’) continues at the bottom of a page in Q6; and (b) the downstroke of the ‘5’ in “5t9” (i.e. ‘quin-t-us’) continues at the bottom of a page in Q3. In both cases (and particularly in the first of the two), it seems likely that at the time the quire numbers were added, the herbal bifolios were in a substantially different order from the one we are presented with several centuries later.

(10) Vellum tears with parallel orientation may indicate that those bifolios came from the same tanned skin.

The examples I found (Curse pp.54-56) were on the f16-f9 bifolio and the f10-f15 bifolio, as well as on the f38-f35 bifolio and f36-f37 bifolio. The fact that the bifolios were still immediately adjacent in both cases loosely implies that the basic idea of codicological continuity during construction (i.e. that adjacent bifolios within individual sections were probably folded and cut down from larger sheets of vellum) may well be sound. It also suggests that the f28-f29 bifolio (which has a stitched vellum tear) may be out of sequence.
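To make “immediately adjacent” concrete, here is a minimal sketch of how the leaves of a regular quire pair up into nested bifolios. It assumes a quire with no missing or inserted leaves and consecutive folio numbers, and it assumes the quire spans f9-f16 and f33-f40 used in the example; the function names are purely illustrative.

```python
def nested_bifolios(first_folio, last_folio):
    """Pair up the leaves of a regular quire, outermost bifolio first.

    Assumes consecutive folio numbers and no missing or added leaves.
    """
    count = last_folio - first_folio + 1
    if count <= 0 or count % 2:
        raise ValueError("a regular quire has an even, positive number of leaves")
    return [(first_folio + i, last_folio - i) for i in range(count // 2)]


def are_adjacent(bifolios, a, b):
    """True if bifolios a and b sit directly inside one another."""
    return abs(bifolios.index(a) - bifolios.index(b)) == 1


# Assumed quire spans, for illustration only.
quire_f9_f16 = nested_bifolios(9, 16)
quire_f33_f40 = nested_bifolios(33, 40)

print(quire_f9_f16)
print(are_adjacent(quire_f9_f16, (9, 16), (10, 15)))
print(are_adjacent(quire_f33_f40, (35, 38), (36, 37)))
```

Under those assumptions, both pairs of torn bifolios come out as nesting neighbours, which is all the argument above needs.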

(11) Currier “Herbal A” and Currier “Herbal B” bifolios seem jumbled up.

Back in 1976 or so, US Army cryptanalyst Prescott Currier noted two apparently distinct “dialects” of the ‘Voynichese’ language: he called these “A” and “B”, and pointed out a whole set of curious properties that help you distinguish them from each other. Moreover, any given bifolio has only “A” or “B” writing on it: this supports the idea that these correspond to two broadly separate writing phases, rather than two separate writers writing in parallel.

(12) The three sunflower-like drawings look to have been separated.

f33v, f40v and f50r all contain pictures of similar sunflower-like plants, and are all “Herbal B” pages: this reinforces the idea that the Herbal B pages should be looked at as a quite distinct content collection from the Herbal A pages. I’d add that the Herbal B “plants” seem far more artificial to me than Herbal A “plants”, some (but not all) of which actually resemble real plants (e.g. water lily, pansies, etc).

(13) Q13 and Q20 may have originally each been two smaller quires that were later merged.

There is a whole heap of content analysis that supports the idea that what we now call “Q13” was originally a ‘Q13a’ and a ‘Q13b’ (as proposed by Glen Claston in 2009) and that what we call “Q20” was originally a ‘Q20a’ and a ‘Q20b’ (as proposed by me in 2010).

This is not so very far from the observation [(3) above] that, given that the jars in the pharma section seem to progress from the end of Q19 to the start of Q17, Q19 originally preceded Q17. In short, we can’t be at all sure that the quire arrangement we see now matches the original quire arrangement or order – quires may well have been formed of smaller original quires that were merged (for whatever reason) before the quire numbers were added.

(14) The two pages with “chicken scratch” marginalia may well have originally been adjacent.

I suggested this in my 2012 Voynich Centenary Conference presentation “Between Vellum and Prague”, which tried to reconstruct how the Voynich Manuscript’s quires had moved around between its original ‘alpha’ state and the final (foliated) state. I think it would therefore be interesting to find out whether f66v (in Q8) and f86v3 (in Q14) were from the same vellum skin.

(15) Some bifolios that were (probably) central to a quire are now not.

f84v-f78r should clearly have been the centre of a Q13 quire (the pictures join across the bifolio’s central fold): but I should add that Rene Zandbergen also pointed out in 2010 that f18v-f23r may well have been the centrefold page-pair of a quire; and that I also suggested much the same of f33v and f40r in 2006 (Curse, p.70), though for a different reason.

30Apr 2013

Captain Prescott Currier and Voynich research…

by nickpelling ⋅ 56 Comments

In the 1970s, Captain Prescott Currier noted that the Voynich Manuscript’s text seemed to contain two separate ‘languages’ (“A” and “B”), each containing sub-languages that varied yet further. For him, the language differences were primarily statistical rather than linguistic: to tell what we now call ‘Currier A pages’ & ‘Currier B pages’ apart, he observed that (using the EVA transcription):-

“(a) Final ‘dy’ is very high in Language ‘B’; almost non-existent in Language ‘A.’
(b) The symbol groups ‘chol’ and ‘chor’ are very high in ‘A’ and often occur repeated; low in ‘B’.
(c) The symbol groups ‘chain’ and ‘chaiin’ rarely occur in ‘B’; medium frequency in ‘A.’
(d) Initial ‘chot’ high in ‘A’; rare in ‘B.’
(e) Initial ‘cTh’ very high in ‘A’; very low in ‘B.’
(f) ‘Unattached’ finals scattered throughout Language ‘B’ texts in considerable profusion; generally much less noticeable in Language ‘A.’“
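As a rough illustration of how criteria (a) to (e) could be tallied, the sketch below counts what share of a page's words match each one. It assumes the page is already available as a list of lowercase EVA words, it treats the ‘symbol groups’ in (b) and (c) as whole words, and it skips (f) because ‘unattached finals’ have no obvious one-line definition. The function name and the two toy word lists are invented for the example; they are not transcribed from any page.

```python
from collections import Counter

FEATURES = ("final_dy", "chol_chor", "chain_chaiin", "initial_chot", "initial_cth")


def currier_features(words):
    """Return the share of words matching each of Currier's criteria (a)-(e)."""
    total = len(words)
    if total == 0:
        raise ValueError("no words to count")
    counts = Counter()
    for word in words:
        w = word.lower()
        if w.endswith("dy"):
            counts["final_dy"] += 1
        if w in ("chol", "chor"):          # assumption: whole-word match
            counts["chol_chor"] += 1
        if w in ("chain", "chaiin"):       # assumption: whole-word match
            counts["chain_chaiin"] += 1
        if w.startswith("chot"):
            counts["initial_chot"] += 1
        if w.startswith("cth"):
            counts["initial_cth"] += 1
    return {name: counts[name] / total for name in FEATURES}


# Toy inputs, made up to show the shape of the output.
a_like = ["chol", "chor", "daiin", "chol", "cthor", "shol"]
b_like = ["qokedy", "shedy", "chedy", "qokeedy", "ol"]

print(currier_features(a_like))
print(currier_features(b_like))
```

Running the same tally separately on each bifolio, rather than on the manuscript as a whole, is the point of the exercise: a high final ‘dy’ share with few ‘chol’/‘chor’ words would lean towards B, and the reverse towards A.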

Similarly, he thought that the writing seemed to have been done by at least two hands (specifically, a larger, rounded hand he called “1” [mainly on A pages], and a more cramped, tighter hand he called “2” [mainly on B pages]), which he was convinced were those of at least two different people. He also pointed out that certain Voynichese letters appeared to have a very position-dependent behaviour, and that a line of text seems to be a functional unit in some way.

I would argue that Currier’s work has been the single most influential piece of Voynich research of the last few decades, because it in effect erects a pragmatic conceptual framework for working with the Voynich Manuscript that every researcher who follows should strongly bear in mind, if not actually use.

So when I get told about so-called Voynich ‘research’ that treats the entire Voynich Manuscript as a uniformly homogenous linguistic entity (i.e. ignoring Currier completely), I give a little sigh of muted exasperation and move swiftly on. This is simply because Currier’s languages are to the Voynich manuscript what the Gillogly strings are to the Beale Ciphers: hence any claimed explanation or decryption that fails to account for Currier’s raw (yet actually rather unexpected, if you think about it) set of statistical observations is simply doomed to failure, period, even 40 years on.

I think it’s important to point out that Currier wasn’t some technical-minded Army codebreaker doing a bit of Voynich moonlighting: having graduated in Philology (with a focus on Romance Languages) from the University of Washington, he was surely perfectly placed to contribute a balanced analytical insight into the elusive internal structure of Voynichese. Hence I think the real reason that Currier’s work has been so influential in the field is that he really cared about what he was doing, and that he wanted to make a constructive, positive difference to Voynich studies.

Sadly, Currier’s insights failed to inspire a community-wide statistical assault on the Voynich Manuscript: researchers trundled on with their existing ad hoc studies, perpetually reinventing wheels – the big red revolution bus never arrived at the Voynich stop. The only obvious difference was that at least a few sensible people (Rene Zandbergen, Mark Perakh, etc) did manage to do statistical tests on A and B pages separately, which is a start, I guess… but only a start.

But because Currier restricted his work to statistical observations, he never built his framework up into the kind of thing Annales historians call a problematique, i.e. a fully rounded research question that drives future research forward. It’s all very well cleverly spotting the presence of different languages (some people prefer to say “dialects”, but it’s an open question) within the text, but that does raise some rather big questions, so-called “elephants in the room” that everyone can see but nobody talks about:-

* Why are the different languages fragmented across the document, often mixed up within a single quire?
* What gives rise to all the variation both within Currier A and within Currier B?
* Why was there a need for multiple languages at all? Why not just stick with Currier A?

Fast forward to 2013, and I think we can answer at least one of these questions, and provide reasonable (if tentative) answers to the other two.

Firstly: the simple reason that the Voynich languages are in disarray appears to be that the bifolios themselves are in disarray. I and others have uncovered numerous different codicological artefacts that strongly suggest the initial gatherings were disrupted, bound, rebound, and indeed misbound; and there is even specific evidence that quite a few bifolios ended up reversed relative to their original facing direction (i.e. folded back to front across the central crease).

Essentially, as the bifolios themselves were scrambled, so too were the languages: which is why A bifolios and B bifolios appear juxtaposed within individual bound quires. Yet given that there are large homogenous stretches of A and B bifolios, it seems likely that the scrambling wasn’t absolute – while I don’t think the Voynich bifolios were ever blown down a street in the wind, I do believe that what we see arose from a combination of planned shuffling (e.g. moving the large multi-panel bifolios towards the back and binding them there) and unplanned shuffling (binding breaking on some quires, spilling the bifolios onto the floor).

Secondly: I strongly believe that the structural and palaeographic differences between A pages and B pages tell a clear story of two major writing phases (let’s call them the “A phase” and the “B phase”). Identifying different composition phases through close reading is the kind of thing that modern historians do all the time, so this isn’t a fundamentally new approach: the only nuance here is that rather than close textual analysis (for Critical Reading) or art technique de-layering (for Art History), we’re instead looking at a cryptanalytical close reading. But then again, isn’t that what Currier was hoping for?

Note that I’m not speculating here about why there were two writing phases: at this point it’s enough just to identify them and give them names. But it does point to some interesting questions about why there should be both Herbal A pages and Herbal B pages, and what the difference between them might turn out to be (a topic upon which I’ve previously speculated more than enough for one lifetime, some would say).

Thirdly: within each of the A & B writing phases, I believe that the variations in the statistics will turn out to have arisen because of cryptographic evolution during each phase. By this, I mean that the core cryptographic system in use at the outset of each phase evolved as that phase progressed, as the author(s) finessed the system to work around specific cryptographic challenges encountered along the way, and so ended up a very different beast at the close.

I suspect that this will prove to be a set of “ratchet” effects, in that once changes were made to the system they would probably tend to stay in place until they in turn were replaced or finessed. I therefore believe that the cryptanalytical challenge we face is working out the evolutionary curves that the A system and the B system traced out – quantifying and then mapping them as if the system driving them were a probabilistic Markov state machine, its configuration relentlessly evolving as the text flows from page to page to page.
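Purely as a sketch of what “quantifying and then mapping” might look like (and not as the method this post goes on to propose), one could estimate a first-order transition table for each page and measure how far it moves from one page to the next. The code assumes each page is a string of space-separated EVA words, treats every EVA letter as a single glyph (a simplification), and uses hypothetical names and toy page strings throughout.

```python
from collections import Counter, defaultdict


def transition_probs(text):
    """First-order glyph transition probabilities P(next | current) for one page."""
    pair_counts = defaultdict(Counter)
    for word in text.split():
        padded = "^" + word + "$"  # mark word start and word end
        for cur, nxt in zip(padded, padded[1:]):
            pair_counts[cur][nxt] += 1
    return {
        cur: {nxt: n / sum(nexts.values()) for nxt, n in nexts.items()}
        for cur, nexts in pair_counts.items()
    }


def drift(p, q):
    """Mean absolute difference between two transition tables over all seen pairs."""
    pairs = {(a, b) for table in (p, q) for a in table for b in table[a]}
    if not pairs:
        return 0.0
    total = sum(
        abs(p.get(a, {}).get(b, 0.0) - q.get(a, {}).get(b, 0.0)) for a, b in pairs
    )
    return total / len(pairs)


# Toy pages in an assumed reading order; the words are invented for the example.
pages = ["chol chor chol daiin", "chol chor shol daiin", "qokedy shedy chedy ol"]
tables = [transition_probs(page) for page in pages]
for before, after in zip(tables, tables[1:]):
    print(round(drift(before, after), 3))
```

If changes really did behave like a ratchet, a plot of this kind of drift along a plausible page order ought to show long flat stretches broken by steps, rather than random jitter; whether it does is exactly what would need testing.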

As to the specifics of how we should do this, you’ll have to wait for the next post…
