Two long(-ish) form Voynich manuscript articles emerged recently, one in the Jewish magazine The Tablet “Tablet Magazine”, the other in the New Yorker’s online blog section. These tell us quite a lot – though not really about the Voynich Manuscript itself, but rather about how the Voynich Manuscript is now perceived.

The first article, by Batya Ungar-Sargon, is called Cracking the Voynich Code: The quixotic quest to read meaning in the patterns of a bizarre manuscript that has bedeviled scholars for years.

Her basic take is that “the Voynich Manuscript has become a beacon for a secular community of quasi-Talmudic scholars whose interpretive ingenuity and stamina have few parallels“, so her piece is built around interviews with several of them (including the “patient, tireless” Gordon Rugg, and the “deeply humble” Rich SantaColoma). [She also talked with me on the phone for an hour, but perhaps I didn’t fit her template 😉 ].

Taken as a whole, fitting her article into a primarily Jewish-interest magazine was always going to be a bit of stretch: William Friedman was Jewish, sure, but that’s a small piece of material to make a full-length dress out of. I can’t help but wonder whether Batya’s ambition is to write long-form pieces for the New Yorker, and that this was a try-out for her portfolio. She clearly writes well, but I don’t think her journalistic instincts are yet fully honed – her article, in my opinion, is still more ‘relating’ than the literary reportage to which she aspires.

The second article – The Unread: The Mystery of the Voynich Manuscript – by Reed Johnson is, coincidentally enough, from the New Yorker blog section. As such, it’s a kind of New Yorker long-form take on a blog post, i.e. longer than a normal blog post, but quite a lot shorter than a typical New Yorker article (I used to subscribe to it, though how I ever found enough time to read each issue I don’t know 🙂 ).

This isn’t Ungar-Sargon-style journalism, but is instead Reed’s telling the story of how he came to waste three years (only three years? Pshaw!) on the Voynich Manuscript – basically, while trying to write his own “Dan Brown–style thriller”, having nearly completed his “M.F.A. in fiction at the University of Virginia” in 2010. He tries to introduce a little light drama into his account (Did he crack the Voynich? Did he finish his book?) but with enough of a wink to astute readers that they know the resolutions long before the end.

Unlike Batya, Reed is not an observer looking in on the Voynich research world from the outside, but is instead an active participant in what he calls the “often fractious” Voynich mailing list. His feeling about this is that “If crowds have any wisdom, soon we should see the fruits of a more recent deciphering project: Internet crowdsourcing“. And yet, he also wonders whether it would be a disappointment for the Voynich Manuscript to be decrypted – that, “no matter how thrilling such a text might be, it [would] remain a disappointment for being closed off, completed — for being, in the end, no longer a mystery“.

My own conclusion is that the Voynich mailing list has become more part of the problem than part of the solution: and that the extraordinarily productive collaboration its early days saw was more down to the small number and high calibre of the participants (Jim Reeds, Jim Gillogly, Jacques Guy, etc), most of whom left the list long ago. Really, the collective wisdom of the crowd very much depends on the crowd you happen to be dealing with: though Reed stops short of showing his hand in this regard, so we end up knowing what happened but not his thoughts or feelings about it. Perhaps the whole card game hasn’t yet concluded for him.

What’s nice about these two articles is that, for all their differences, they are both good examples of clear-headed contemporary writing about the Voynich Manuscript, far from the lurid wodges of mystery-soaked ahistorical fragments I frequently used to see. Indeed, both give an account of the Voynich’s history that is broadly correct, something which simply never happened even a decade ago: perhaps the radiocarbon dating has helped validate the Voynich as a “proper” subject.

And yet… it’s as if something (or someone) is missing from the whole party. The Voynich Manuscript has had many of the best codebreakers of the age (the Friedmans, Manly, Tiltman, etc) examine it closely: yet as these articles show, a lot of contemporary discourse still revolves around the – frankly rather foolish and shallow, I think – postmodernist cipher/hoax tension as exemplified by Gordon Rugg and Rich SantaColoma.

To my mind, it’s as if something really important is missing from the whole conceptual landscape of how the Voynich is perceived, that everyone is somehow in the wrong kind of doubt. We’ve collectively travelled a really long way forward, for sure, but the ideas and insights gained on that journey have all been zapped by a kind of “motivated learning” paralysis, where debate is held in a stasis between powerful epistemological agendas.

It often feels as though, myself excepted (and who listens to what I say, ha!), the Voynich-as-a-genuine-historical-artefact point of view has no champion. I genuinely tire of the way people continually generate possible alternative histories for it, when I’m just about the only person trying to reconstruct the mainstream history they’re so busy fighting against.

I want to ask those “theorists”: why do you find the idea that the Voynich Manuscript was made basically when its radiocarbon dating says so dreadfully upsetting? Why do you invest so much time and effort into identifying outlandish alternatives that might possibly be made to work (with a few well-chosen tweaks to the mainstream historical timeline)? Do you not see that, by kicking back so hard against a straightforward historical account that hasn’t even been written yet, you are yourself holding everything back? Can you not see that by doing this you have become part of the problem, not part of the solution?

That is the Voynich Manuscript debate that’s missing, the elephant in the room that nobody wants to talk about. But nobody is writing that particular article, and I’m not sure anyone ever will… and perhaps we’re all worse off for that silence.

A copy of Paul de Saint-Hilaire’s (1973) “La Belgique Mysterieuse” arrived in the post today. I saw it mentioned here, and ordered it because it seemed to be the only book out there (apart from Rudy Cambier’s Knights Templar / Nostradamus stuff) where Moustier Church’s strange cryptograms are discussed at all.

It’s a handy little thing, that divides its mysterious subject matter into:
(1) Megalithic Belgium
(2) Templar Belgium
(3) Lost Treasure Belgium
(4) Alchemical Belgium
(5) Fantastic(al) Belgium
(6) Underground Belgium, and – of course –
(7) Secret Belgium.

Each chapter finishes up with a suggested tour around the Belgian countryside to take in the major landmarks it discussed. So… what does our man say about Moustier, then?

* On p.60, Saint-Hilaire says that the cryptograms may relate to a treasure hidden in the 19th century. (But without giving any sources or references – annoyingly, there is no bibliography).
* On p.119, he notes that the cryptograms are laid out in the same way as you would expect to see “The Tables of the Law” (i.e. the Ten Commandments), but that it has so far proved impossible to find the key.
* On p.129, he speculates that examining some windows of the church and a small chapel in the village might prove useful in decrypting the inscriptions. (But, again, without giving any sources or references).

And that’s all Saint-Hilaire says about Moustier. It’s not much, true: but it is what it is.

* * * * * * *

Having mused on the Moustier enigma for the last few weeks, my own conclusions are:
* I don’t believe that it was carved before 1800
* I don’t believe that it uses a simple substitution cipher
* I don’t believe that it uses a transposition cipher
* I do believe that it includes copying mistakes
* I do believe that it was copied from a written version, not from a carved text
* I do believe that the written version was probably devised for the rebuilding of the Church
* I do believe that the resemblance to The Tables of the Law will prove to be no coincidence

Put all that together, and I suspect that the cipher that was used was very probably what is generally known as the Vigenère cipher (even though it was actually first invented by Bellaso, but we’ll let that pass).

The immediate question is, of course, is whether this is historically plausible for the proposed date? I’m sure it is – in fact, at that very time, it was renowned as le chiffre indéchiffrable, i.e. ‘the unbreakable cipher’.

I also have here a fascinating (if occasionally meandering) book by Ole Immanuel Franksen called “Mr Babbage’s Secret”, which reveals that Charles Babbage had quietly worked out the basic principles of how to crack Vigenère ciphertexts as early as 1846. This was later independently worked out by a Frenchman called de Viaris in 1888, who published his results in the “Génie Civil”, but without attracting much attention.

Because the Vigenère cipher was particularly popular in France, I predict that the plaintext will turn out to be in French: and moreover, I suspect that the key word will turn out to be something to do with the Ten Commandments (or perhaps Christianity in general), or perhaps even the name of something featured in a church window (as Saint-Hilaire speculated).

Anyway, that’s pretty much as good a set of constraints as my historical sleuthing has been able to generate, and I suspect there’s no more useful information out there we can get our hands on. So now it’s probably time to move to phase #2: checking the transcription and doing a bit of cryptanalysis. If it’s a Vig, can we work out what its key-length is? Hopefully we shall see! 🙂

Yes, we’re back in Ohio again, for the third post in a row. Bear with me, though, because I think you’ll quite like the ride… 🙂

western-ohio-1901-bond-header

The 28th June 1916 evening edition of the Lima Times-Democrat has a dramatic story about a Western Ohio Railway ticket agent being robbed. My guess is that this is the incident that the Ohio Cipher was to do with, although quite how (or why) is another matter entirely. This is what it said:-

BOLD ROBBER MAKES GET-A-WAY WITH W. O. CASH BOX

Follows Ticket Agent Shaw to Safe and Secures $265.

Walks Calmly Out of Office and Disappears on Elisabeth Street.

Local police so far have been unable get any trace of the bold robber who held up Harvey Shaw, ticket agent of the Western Ohio railroad, last night, and made away with the contents of the cash drawer, which contained $265. Although a good description of the thief was given [to] the police department, a careful search of the city has failed to reveal the fugitive.

So carefully was the robbery perpetrated, that not even the numerous employes and persons waiting for the last train were aware of the trouble, until Shaw ran out the front door of the station shortly after the departure of the thief and gave the alarm.

Persons who saw the man walk out of the station state that he did not seem to hurry. He went west on Market until he reached Elizabeth street and turned south. Immediately after the alarm pedestrians and persons in the waiting room assisted in searching for the thief in the rear of the Wheeler block. Police who responded to the call searched all the alleys and lots in the neighbourhood, but their efforts were unsuccessful.

According to Shaw he checked up the receipts of the day about 11:20 p.m. and placing the money in a tin box started downstairs to the basement where the safe is located. He claims that he was unaware that he was being followed until he heard footsteps behind him. On looking around he was confronted by a well-dressed stranger, who had been sitting in the waiting room.

The stranger told him not to make any out-cry and ordered that he continue on his way to the basement, using a large revolver to convince him that he meant business.

Shaw complied with the request, the thief following with the revolver pressed against the agent’s back.

When they reached the safe, Shaw was ordered to set the cash box down. The robber held his revolver in his right hand and transferred the cash to his pockets with his left hand. While performing the operation he did not take his eyes off the agent.

As the eyes of the thief were trailed along the barrel of the gun he was unable to see the denomination of the coins and paper money. When his fingers touched a coin that, from the size, appeared to be a penny, he remarked, “What kind of small change is this?” Without taking his eyes off the agent, he brought his hand up within range of his vision and seeing that it was a $5 gold piece he placed the remainder of the money in his pocket. With a curt order to Shaw to remain quiet, he backed up the stairway and walked quietly out of the door.

From the quiet and systematic manner in which the holdup was perpetrated, police are of the opinion that it was the work of a professional. However, it is clear that the stranger in some manner was informed as the locality of the safe and condition that would confront him in pulling the job. A description of the fugitive was sent to police departments of surrounding cities and towns.

Well! They don’t write ’em like that any more, do they? 😉

Update: according to a newspaper search on ancestry.com, the same robbery story was covered in the Marion Daily Star (28th June 1916), the Sandusky Star Journal (28th June 1916) and the Lancaster Daily Gazette (29th June 1916), but so far I have found coverage of the mysterious cipher follow-up only in the Lima Times-Democrat. Still, lots to check just yet…

A quick update on the Ohio Cipher for you.

Firstly, while looking for more mentions of the story in other 1916 Ohio newspapers, I stumbled across an article in the 4th July 1916 edition of the Coshocton Morning Tribune, which said:-

The Eastern Puzzlers League, organized in 1883 for the construction, solving and exploitation of enigmas, met here [Warren, OH] today for its semi-annual convention.

As I understand it, the Eastern Puzzlers League grew into the National Puzzlers League, whose magazine “Enigma” was where the cryptogram later appeared. Might the Ohio Cipher have therefore been planted in an Ohio paper by a convention attendee simply for a bit of fun? If so, the NPL version of the ciphertext would probably be the correct one, rather than the one in the Lima Times-Democrat. All the same, Ohio is a big place, and Warren is the opposite side to Lima… so all this might just be a coincidence. Or perhaps the Ohio Cipher was the talk of the convention, and so it was natural to write it up in the newsletter?

Secondly, the National Puzzlers League version of the story says “The police department of Lima, O., is greatly puzzled over a cryptic message received in connection with the robbery of a Western Ohio ticket agent.” At first, I thought that “Western Ohio” was a rather imprecise description (as somebody once said, ‘Ohio is a big place’), until I realized that this was a reference to a railway.

Launched in 1903, the Western Ohio Railway was an electric railway based in Lima, known as the “Lima Route”. It was an interurban railway that had many stops around Lima, where its fierce competitor was Ohio Electric. 1916 arguably was during its golden age: the Railway collapsed in 1932, one of the many smaller railways to fail during the Great Depression.

Hence it seems likely that if the Ohio Cipher is genuine (and that the National Puzzlers League account is also essentially correct), it was indeed linked with an incident in Lima, OH. However, I haven’t been able to find any historical archives connected with Western Ohio Railway (reformed in 1928 as “Western Ohio Railway & Power Co”) at all… maybe there aren’t any to be found. If there is something to be found, I suspect it will be in court or police records.

Finally: “WAS NVKVAFT BY AAKAT TXPXSCK UPBK TXPHN OHAY YBTX CPT MXHG WAE SXFP ZAVFZ ACK THERE FIRST TXLK WEEK WAYX ZA WITH THX” looks to me to be a bit of an unusual ciphertext, in that it seems to mix up both enciphered and unenciphered words.

What’s more, various features of the enciphered words hint at a kind of verbose or Polybius square cipher:-
* X appears eight times, always on an even position inside a word
* T appears seven times, always on an odd position inside a word

What is going on here? I really don’t believe this is any kind of simple substitution cipher, but rather something more like the WW2 “Slidex” cipher which similarly mixed enciphered and unenciphered text. Might parts of it be some kind of low-level private telegraphic code?

Tipping my (virtual) hat frenetically in the direction of Zodiac Killer Cipher-meister Dave Oranchak yet again, it’s time to reveal one of the very few cipher mysteries from Ohio. (Might it be the only one? Let me know if it isn’t!)

Dave had found this story mentioned on the consistently curious (in a nice way) Futility Closet website, which itself had presumably found it from a 1916 edition of “Enigma”, the magazine of the National Puzzlers’ League (later reprinted here).

“The police department of Lima, O., is greatly puzzled over a cryptic message received in connection with the robbery of a Western Ohio ticket agent. Here it is: WAS NVKVAFT BY AAKAT TXPXSCK UPBK TXPHN OHAY YBTX CPT MXHG WAE SXFP ZAV FZ ACK THERE FIRST TXLK WEEK WAYZA WITH THX.”

As normal with such half-remembered stories, there’s no mention of anything specific that might actually help us track it down. But I decided to have a look anyway: and quickly found two mentions of it in the Lima Times-Democrat newspaper. The original mention was on the 3rd July 1916 (though the scan of it is barely readable)…

Lima-03Jul1916

i.e.

“At the request of a citizen of …… (we present?) a note written in cypher. As it is of the utmost importance that the contents of the note be ascertained. Any suggestions by readers of this paper which will …. assist in learning …. of the note will be … appreciated. The note is as follows: …”

…while there was a follow-up mention on the 7th July 1916 with a (probably spurious) guess as to the alphabet…

Lima-07Jul1916

So the NPL transcript was nearly correct, except that it had split “ZAVFZ” into “ZAV FZ” (you can just about make out “zavfz” on the original Lima Times-Democrat report) and merged “WAYX ZA” to “WAYZA”. So, the correct “Ohio cipher” ciphertext should be:

WAS NVKAFT BY AAKAT TXPXSCK UPBK TXPHN OHAY YBTX CPT MXHG WAE SXFP ZAVFZ ACK THERE FIRST TXLK WEEK WAYX ZA WITH THX

Well… given that we still don’t know the exact town or date of the incident, and the Enigma retelling of the story seems not to have quite matched what the local newspaper actually said (e.g. it was reported by a “citizen”), we’re still left with plenty of mysteries. Perhaps other newspaper reports from the time will reveal more of the story… anyone who wants to take this on, please be my guest!

All the same, to me the ciphertext does look exactly like the kind of ad hoc partially-improvised agony column ciphers Tony Gaffney used to eat for breakfast, so maybe he’ll see straight through this particular visual trick and crack it quicker than you can say “vividly ovoviviparous”… 😉

The news rattling the bars of the Voynich research cage loudest right now is surely the publication of a paper by Marcelo Montemurro and Damián H. Zanette called Keywords and Co-Occurrence Patterns in the Voynich Manuscript: An Information-Theoretic Analysis, deftly summarized in New Scientist as New signs of language surface in mystery Voynich text.

M&Z’s abstract brings out a lot of what they were trying to do – and also points exactly to their mistakes.

Here we analyse the long-range structure of the manuscript using methods from information theory. We show that the Voynich manuscript presents a complex organization in the distribution of words that is compatible with those found in real language sequences. We are also able to extract some of the most significant semantic word-networks in the text. These results together with some previously known statistical features of the Voynich manuscript, give support to the presence of a genuine message inside the book.

Central Assumption: the authors implicitly hypothesize that they can get meaningful results for long-range comparisons because Voynichese is homogeneous across all its sections.

…The Problem: this assumption is false (or very nearly so), because there are significant macro-level differences in the way the language in different sections works (Currier A, Currier B, labels) as well as many mid-level differences (Herbal-A, Q13-ese, etc).

Central Conclusion: the authors believe that their language-centric statistical machinery has identified “The thirty most informative words in the Voynich manuscript”.

…The Problem: I’m pretty sure that the authors have in fact very probably identified arguably the thirty least informative words in the Voynich Manuscript. (That may be an independently useful result, but it’s probably not really what they were hoping for.)

I’ll explain.

Voynichese is extremely predictable at a letter-level: it has many rigid letter-level adjacency rules (‘4’ is almost always followed by ‘o’, etc) and position rules (4o- is consistently word-initial, -89 is consistently word-final, etc) and a high level of letter-context predictability.

Yet at the same time, it also has a very large dictionary relative to its text size. I often criticize Gordon Rugg for suggesting historically incorrect Cardan grille-like tables (i.e. they’re a century too late for the Voynich’s construction dating) and for inappropriately back-projecting his modern CompSci mindset onto the early Renaissance (i.e. it’s 500+ years too early for the kind of table-driven hackery he proposes). However, he is absolutely right that a reconstructed Voynichese “dictionary” would, to a modern computer scientist’s eyes, look very much as if it had been generated or permuted by some means.

The paradox is therefore that these two apparently opposite aspects of Voynichese are able to coexist: how on earth can we reconcile its letter rigidity & predictability with its wild word variability?

I think the key to resolving this is to grasp that there is some kind of generative or confounding principle at work within a rigidly predictable framework. That is, that even though there are lots of rules, these rules act as a kind of “container” for semantic or cryptographic variability to exist within.

Hence I believe that Montemurro’s statistical machinery is identifying “words” that fall within the container layer rather than in the confounded content layer. Hence these are arguably the thirty least informative words in the Voynich Manuscript.

It’s a hard point to understand, let alone accept: the confounding trick (some kind of transposition cipher? some kind of paper cipher machinery? some kind of cipher wheel?) driving Voynichese’s inherent variability remains as profoundly unreachable now as it has been for over 500 years.

My apologies to Montemurro and Zanette, but the central challenge we face isn’t to find new language-based statistical tests to apply to the Voynichese corpus, however clever they may be. Rather, it is to find ways of resolving the Voynich Manuscript’s central paradox: how is it that Voynichese is both letter-rigid and word-variable at the same time?

Incidentally, M & Z conclude in their paper that results point to a semantic link between the Recipe and Astro sections, and between the Herbal and Pharma sections. Actually, had they been more aware of the codicology analyses that have been done, they would have seen that their results are consistent with the writing phase order.

In fact, there are many indications that what I call Voynichese’s ‘container’ layer above evolved during the writing, with the most obvious evolution being between Currier A and Currier B. I suspect that what their statistical machinery has imperfectly captured is therefore simply a snapshot into the evolution of the container layer, and not anything ‘semantic’ as such.

In short, the aspect of Voynichese that is most nearly homogeneous across all its sections is its “container” layer: so what Montemurro and Zanette have done is make long-range comparisons between evolutions of the container layer. Currently, my best guess is that these are likely to be almost entirely composed of cipher system meta-tokens (shorthand tokens, transposition cipher placeholders, etc) rather than the semantic contents, which appear instead to have been confounded by some means.

So, rather than finding a “genuine message” (as New Scientist put it), perhaps they have instead found a “genuine container” for the message? This may prove to be a very useful result in its own right, but it’s probably not the smoking gun linguistic proof they were hoping to use to discredit Rugg’s tables.

Zodiac Killer Cipher supremo Dave Oranchak very kindly bounced this nicety my way: a tiny cipher sneaked into her High School yearbook by geeky/naughty Jessica Lee, that even managed to make it onto MSN:

Fluorine uranium carbon
potassium bismuth technetium
helium sulfur germanium thulium
oxygen neon yttrium.

If you’re already familiar with Notorious B.I.G.’s lyrical output, you’ll have a reasonable idea of what to expect. If not… well, decipher it at your peril. 🙂

I should also mention the Reddit commenter who thinks Lee should have done “More research, girl!“, and replaced “thulium oxygen neon yttrium” with “tritium molybdenum neon yttrium“, to preserve all the word breaks. But perhaps that would have looked too obvious and have been caught before going to press. So, my scoring of the duel is: Jessica Lee 1, Reddit Commenters 0. 😉

PS: here’s the cipher key you’ll need, courtesy of this useful website. Don’t say I don’t spoil you. Because I do.

Ac Actinium
Al Aluminium
Am Americium
Sb Antimony
Ar Argon
As Arsenic
At Astatine
Ba Barium
Bk Berkelium
Be Beryllium
Bi Bismuth
Bh Bohrium
B Boron
Br Bromine
Cd Cadmium
Cs Caesium
Ca Calcium
Cf Californium
C Carbon
Ce Cerium
Cl Chlorine
Cr Chromium
Co Cobalt
Cn Copernicium
Cu Copper
Cm Curium
Ds Darmstadtium
Db Dubnium
Dy Dysprosium
Es Einsteinium
Er Erbium
Eu Europium
Fm Fermium
F Fluorine
Fr Francium
Gd Gadolinium
Ga Gallium
Ge Germanium
Au Gold
Hf Hafnium
Hs Hassium
He Helium
Ho Holmium
H Hydrogen
In Indium
I Iodine
Ir Iridium
Fe Iron
Kr Krypton
La Lanthanum
Lr Lawrencium
Pb Lead
Li Lithium
Lu Lutetium
Mg Magnesium
Mn Manganese
Mt Meitnerium
Md Mendelevium
Hg Mercury
Mo Molybdenum
Nd Neodymium
Ne Neon
Np Neptunium
Ni Nickel
Nb Niobium
N Nitrogen
No Nobelium
Os Osmium
O Oxygen
Pd Palladium
P Phosphorus
Pt Platinum
Pu Plutonium
Po Polonium
K Potassium
Pr Praseodymium
Pm Promethium
Pa Protactinium
Ra Radium
Rn Radon
Re Rhenium
Rh Rhodium
Rg Roentgenium
Rb Rubidium
Ru Ruthenium
Rf Rutherfordium
Sm Samarium
Sc Scandium
Sg Seaborgium
Se Selenium
Si Silicon
Ag Silver
Na Sodium
Sr Strontium
S Sulfur
Ta Tantalum
Tc Technetium
Te Tellurium
Tb Terbium
Tl Thallium
Th Thorium
Tm Thulium
Sn Tin
Ti Titanium
W Tungsten
U Uranium
V Vanadium
Xe Xenon
Yb Ytterbium
Y Yttrium
Zn Zinc
Zr Zirconium

PPS: isn’t there a nerdy joke about IUPAC Shakur in there somewhere? Errrm…. maybe not. 🙂

When I was young, I often used to play Scrabble with my grandmother Win on my way home from school. (By which I mean her maisonette was on my route home, not that we played Scrabble on the bus.) Which probably helps account for the deep-rooted enjoyment I still get from weird and wonderful words, many decades later.

From way back then, my favourite English word has always been “svelte” (though “tergiversate” was nipping at its heels for a couple of weeks last year). The reason I particularly like svelte is that it’s (I’m struggling to describe) ‘productively onomatopoeic’, in that the slow ‘l’-sound in the middle makes it feels elegant (indeed svelte) on the tongue. Really, it’s a word with an unusual (but nicely matching) mouth feel, one that manages to stand out from a dictionary sized pack. With getting too synaesthetic on you, to me it’s a kind of David Gower four of a word, a left-handed ping that’s over the boundary before the fielders even notice it’s gone. Something can’t be half-svelte, it’s either got it or it hasn’t.

Svelte also brings right to the fore the mad ragtag heterogeneity of English, the arbitrary coupling together of chance encounters over the millennia. To some it sounds Svedish Swedish (or perhaps a piece of stray Elvish?) but it’s actually a French word (svelte), from an Italian (svelto, “stretched out”), from Vulgar Latin (ex + vellere, i.e. to stretch + out).

(You might therefore suspect that it shares some kind of origin with “vellum” which is also stretched out, but the latter has its roots in “veal”, i.e. young calves: hence vellum is properly fine calfskin.)

Languages are like that: for all their modern apologists, academies, and syntactic niceties, they’re at heart accidental rather than designed. Esperanto and all the other modern conlangs are all very well, but a good part of the charm of real-world languages is the way stray and mongrel words hop in to fill the semantic gaps that inevitably open up as culture mutates and evolves. English obviously needed a word that expressed presence of svelteness in an object, why else would svelte have succeeded and persisted otherwise?

But (and isn’t there always a but in Cipher Mysteries)… where’s all that in the Voynich Manuscript’s language? Even if William Friedman was completely and utterly wrong about the Voynich’s being an artificial constructed language (which he was), I really can see exactly why he thought & believed that. For Voynichese words show such a strong family resemblance – a strongly interlinked productive grammar, if you will – that it almost precludes anything else. Whatever Voynichese is, there is definitely an artificiality to it, or at least an abundance of artifice. I suspect that anyone trying to map Voynichese onto a direct language base will almost inevitably find (to their eventual embarrassment) that it’s just too artificial to be workable: and that’s pretty much what Elizebeth Friedman concluded too.

So here’s your Voynich paradox for the day. I’m sure that there can be no “svelte” in the Voynichese ‘language’ as we see it, because the overwhelming majority of its words arise from a compact productive grammar quite unlike that of a real, heterogeneous, messy, accidental, historic language: and yet the look of Voynichese so resembles a language that it’s hard not to feel as though you’re perpetually a mini-dictionary away from just reading it.

Of course, for me the resolution of this paradox comes down to a well-chosen bunch of steganographic tricks (such as verbose cipher, shorthand, etc) that serve to conceal the plaintext in a misleading form… but you will no doubt have your own theories about how to slice through such a Gordian knot. 🙂

A few days ago, I grabbed the chance to meet up with renowned crypto writer David Kahn at the Athenaeum Club in London while he was attending a conference on the Battle of the Atlantic. It was… simply a pleasure.

David_Kahn_At_The_Athenaeum_2013

He very happily signed my well-thumbed copy of “The Codebreakers” (1967), though I have to say it barely seems possible that he wrote his crypto meisterwerk close to half a century ago. He continues to research and write on crypto topics: a collection of his articles (“How I Discovered World War II’s Greatest Spy and Other Stories of Intelligence and Codes“) is due to be released in October 2013.

All that aside, he was eager to know about what was happening in the world of Voynich Manuscript research (and delighted to see my rather battered copy of Gawsewitch’s “Le Code Voynich”, even if it isn’t actually a facsimile edition) and urged me to write a state-of-the-art-circa-2013 summary of it for Cryptologia (he was one of its founders). Incidentally, he half-remembered being told recently that someone had cracked the Dorabella Cipher (which is possible, though slightly unlikely, I’d say).

But alas, my all-too-brief hour was up too soon: I had to leave and once more merge into the grey London streets. Yet the whole thing set me wondering for several days how best to summarize the Voynich Manuscript. Why is it that Voynich researchers can know so much about the manuscript’s minutiae, and yet agree on almost nothing? Why is it that the Voynich’s Wikipedia article is so long, yet says so little? Why write another analysis-paralysis piece on it?

Part of the challenge is that it often feels to me as if nobody has written a single word on the Voynich Manuscript that even remotely does it justice. Rather, it’s as if there’s a honey-pot of non-words and non-phrases for non-historians to dip their paws into, making every article and blog post written on it little more than a sweet (though ultimately unsatisfying) anagram of the preceding ones.

At the same time, in my own Voynich research it’s as if every day is Groundhog Day, where pretty much everyone else in Punxsutawney never learns anything, but instead sticks belligerently to their same futile and unhelpful non-positions, day in and day out. [*] For example, I would agree that it is entirely possible to construct alt.histories where the Voynich post-dates the 15th century (oh yes, and that the palaeography, the codicology, the Art History and the radiocarbon dating are all simultaneously wrong, or perhaps hoaxed in a peculiarly sophisticated way), but why on earth would anyone bother?

It’s a lot like fighting against a kind of post-modernist debating society, for whom the inevitable existence of doubt in any given fact makes it fair game to dismiss it. Such en masse debating may well be a great way of passing time, but it’s surely a lousy way of getting to the truth. I’m not interested in knowing what might conceivably have happened, I want to know what genuinely did happen.

*) All the same, I’m getting pretty good at ice carving. That’s bound to come in useful one day… 🙂

A correspondent has asked me to summarize the evidence I’ve found in the Voynich Manuscript suggesting bifolio reordering. As long-term Cipher Mysteries readers will know, I laid much of this out in my 2006 book The Curse of the Voynich: but a lot has also emerged in the years since.

A practical starting point here is my long-standing page on the Voynich Manuscript’s codicology. This points to evidence for a whole sequence of fairly direct codicological conclusions:-
(1) The Folio Numbers Are Not Necessarily Correct
(2) The Bifolios Are Not Necessarily The Right Way Up
(3) The Quire Numbers Are Not Necessarily Correct
(4) The Bindings Are Not Necessarily Correct
(5) The Quires Are Not Necessarily In The Correct Order
(6) The Quire Contents Are Not Necessarily Correct
(7) The Paints And Colours Used Are Not Necessarily Original

To this, I’d add some other evidence:-

(8) The quire numbers and the folio numbers are not (quite) consistent.

As John Grove pointed out back in 2002, the Voynich Manuscript’s Q9 (“Quire 9”) was rebound along a different fold after the quire numbers were added but before the folio numbers were added, leaving the Q9 quire number in the wrong place (i.e. not on the back page of its quire). And, as Glen Claston later pointed out, broadly the same thing happened for Q14 (the nine-rosette page): once Q14’s first binding fold collapsed or was damaged, the large multi-folio page was then restitched along a different, less obviously damaged fold, again leaving the Q14 quire number in the wrong place.

Of course, given that most of the (15th century) quire numbers look roughly a century older than the (16th century) folio numbers, a bit of rebinding between the two phases is perhaps to be expected. But all the same, this inconsistency should alert us to the fact that the bifolios were actively being worked on between the quiration and foliation.

(9) Some of the quire numbers’ downstrokes continue within the wrong quires.

I found two clear examples of this (Curse p.18): (a) the downstroke of the ‘9’ in “29” (i.e. ‘secund-us’) continues at the bottom of a page in Q6; and (b) the downstroke of the ‘5’ in “5t9” (i.e. ‘quin-t-us’) continues at the bottom of a page in Q3. In both cases (and particularly in the first of the two), it seems likely that at the time the quire numbers were added, the herbal bifolios were in quite a substantially order from the order we are presented with several centuries later.

(10) Vellum tears with parallel orientation may indicate that those bifolios came from the same tanned skin.

The examples I found (Curse pp.54-56) were on the f16-f9 bifolio and the f10-f15 bifolio, as well as on the f38-f35 bifolio and f36-f37 bifolio. The fact that the bifolios were still immediately adjacent in both cases loosely implies that the basic idea of codicological continuity during construction (i.e. that adjacent bifolios individual sections were probably folded and cut down from larger sheets of vellum) may well be sound. It also suggests that the f28-f29 bifolio (which has a stitched vellum tear) may be out of sequence.

(11) Currier “Herbal A” and Currier “Herbal B” bifolios seem jumbled up.

Back in 1976 or so, US Army cryptanalyst Prescott Currier noted two apparently distinct “dialects” of the ‘Voynichese’ language: he called these “A” and “B”, and pointed out a whole set of curious properties that helps you distinguish them from each other. Moreover, any given bifolio has only “A” or “B” writing on it: this broadly supports the idea that these correspond to two broadly separate writing phases, rather than two separate writers writing in parallel.

(12) The three sunflower-like drawings look to have been separated.

f33v, f40v and f50r all contain pictures of similar sunflower-like plants, and are all “Herbal B” pages: this reinforces the idea that the Herbal B pages should be looked at as a quite distinct content collection from the Herbal A pages. I’d add that the Herbal B “plants” seem far more artificial to me than Herbal A “plants”, some (but not all) of which actually resemble real plants (e.g. water lily, pansies, etc).

(13) Q13 and Q20 may have originally each been two smaller quires that were later merged.

There is a whole heap of content analysis that supports the idea that what we now call “Q13” was originally a ‘Q13a’ and a ‘Q13b’ (as proposed by Glen Claston in 2009) and that what we call “Q20” was originally a ‘Q20a’ and a ‘Q20b’ (as proposed by me in 2010).

This is not so very far from the observation [(3) above] that, given that the jars in the pharma section seem to progress from the end of Q19 to the start of Q17, Q19 originally preceded Q17. In short, we can’t be at all sure that the quire arrangement we see now matches the original quire arrangement or order – quires may well have been formed of smaller original quires that were merged (for whatever reason) before the quire numbers were added.

(14) The two pages with “chicken scratch” marginalia may well have originally been adjacent.

I suggested this in my 2012 Voynich Centenary Conference presentation “Between Vellum and Prague”, which tried to reconstruct how the Voynich Manuscript’s quires had moved around between its original ‘alpha’ state and the final (foliated) state. I think it would therefore be interesting to find out whether f66v (in Q8) and f86v3 (in Q14) were from the same vellum skin.

(15) Some bifolios that were (probably) central to a quire are now not.

f84v-f78r should clearly have been the centre of a Q13 quire (the pictures join across the bifolio’s central fold): but I should add that Rene Zandbergen also pointed out in 2010 that f18v-f23r may well have been the centrefold page-pair of a quire; and that I also suggested much the same of f33v and f40r in 2006 (Curse, p.70), though for a different reason.