Back in 2007, John Walsh (the host of “America’s Most Wanted”) announced that he had, since 1991, received a string of disturbing-sounding letters from an individual calling himself / herself “The Scorpion”: many of them had sections or pages that were apparently in cipher. Two of these ciphers were released to the public: these became known as “S1” and “S5”.

In the same year, Christopher Farmer (“President of OPORD Analytical”) announced that he had cracked S1 (which was apparently built on a 10×7 grid):-

scorpion3

Farmer’s claimed solution reads like this:-

baelprovid
edthemwith
newstories
butwhatifi
askjwdoiwa
xrtwbonesa
gezjefxkon

Unfortunately, all the diagrams illustrating Farmer’s ingenious reasoning have withered on the Internetty vine in the years since then (they’re not even in the Wayback Machine, nor anywhere else as far as I can see), which is a bit of a shame.

Even so, this turns out to be an entirely surmountable problem: Farmer’s claimed solution is clearly incorrect, for the simple reason that letters in the ciphertext aren’t consistent in the plaintext. For example, the cipher “K” maps to both ‘a’ and ‘g’, the “backwards-L” maps to ‘w’, ‘w’, and ‘x’, the “backwards-F” maps to both ‘u’ and ‘v’, and so on. At the same time, his claimed plaintext doesn’t really make a lot of sense (“BAEL”… really? I’m not so sure).

It seems likely to me that Farmer guessed that “PROVID” was steganographically hidden in plain sight at the end of the topmost line (and if you squint a bit, you can see why that would be), and then built the rest of his decryption attempt around this hopeful starting point. Moreover, he seems to have guessed that “O” maps to ‘o’, and “backwards-E” maps to ‘e’, which are both pretty peachy assignments. But I don’t buy any of this for a minute: there are way too many degrees of freedom in this S1 cryptogram (roughly half of the individual cipher shapes occur exactly once), and quite a few extra ones in his claimed solution too.

It’s a brave attempt, for sure: but it’s still wrong, whichever way you turn it round.

Other people have tried their hand with S1, though both AlanBenjy in 2009 and Glurk on Dave Oranchak’s site in 2010 pessimistically pointed out that 53 of S1’s 70 symbols are unique, yielding a ‘multiplicity’ a fair way beyond the range of what homophonic cryptograms can practically be solved. Hence I would tend to agree with their assessment that there’s no obvious way that we will solve S1 with what we currently have to hand: in fact, there seems no way to tell whether S1 is a real cipher or a hoax – the only repeating cipher pair is “S A” (i.e. “S Λ”), which could well have happened by pure chance.

The only other Scorpion ciphertext released to the public to date is the 180-character cryptogram known as “S5”:-

scorpion4

Once again, 155 of these 180 symbols are unique, which at first glance would seem to make S5 even less likely to be solved than S1.

But wait! In May 2007, user “Teddy” on the OPORD Analytical forum pointed out that if you transpose S5 from a 12-column arrangement to a 16-column layout, shape repeats only ever occur within a single vertical column. In fact, every single 16-way column except one (column #5) includes one or more repeated shapes.

Radically, this suggests to me that S5 was constructed in a completely different way from conventional homophonic ciphers: specifically, I think that each 16-way column of S5 may well have its own unique cipher alphabet. This would mean that S5 would need to be solved in a completely different manner to the way, say, zkdecrypto works. (I don’t believe S5 was constructed with eight columns, but I thought I ought to mention that that’s a possibility as well, however borderline). Maybe that small insight will be enough to help someone make some headway with S5, who can tell?

The huge shame here is that it may well be that the other Scorpion ciphers (which to this day have not been released) might well give us additional clues about the inner workings of both S1 and S5. Specifically, if one of the other ciphers happened to have used precisely the same 16-alphabet systemas S5, it might well give us enough raw data to crack them both.

Has anyone apart from John Walsh ever seen S2, S3, S4, and S6? Just askin’, just askin’…


Update: Looking again at S1 (while bearing in mind the way S5 seems to have been constructed), I find it hard not to notice that the distances between instance repetitions seem strongly clustered around multiples of 5 (with the only instance not fitting the pattern being the “backwards-L” on row #5):-

+60, +20, +50, +36, +24, +20, +40, +20, +40, +25, +35, +10, +25, +6, +45, +9, +6.

I suspect that this means that the encipherer probably enciphered S1 by cycling through five independent cipher alphabets (largely speaking). This wasn’t a mechanically precise encipherment (whether by accident or by design), but something close enough to one such that almost all the time he/she was no more than a single alphabet ‘off’, one way or the other.

This offers a quite different kind of constraint from normal homophonic cipher searches, and possibly even enough to crack the S1 cipher. After all, we have a fair amount of the Scorpion’s meandering plaintext to use as a statistical model to aim for… 🙂

9 thoughts on “Cracking the Scorpion Ciphers…

  1. It looks like an amalgam of gradeschool ciphers, perhaps, as you say, alternated. The letters could be ceasar-style, the angles and filled quadrants could be a modified pigpen – since the alphabets are all different, the alternation doesn’t even need to be mechanical.

  2. ponky on May 22, 2014 at 12:30 pm said:

    The crossed out line is interesting in S5. It seems the error may have been in the eigth symbol, it seems to have more scribbling than the rest, and the others all look the same as the line below. If symbol eight is wrong, why did they write symbols nine and ten correctly before noticing?

    It could imply that the unwritten eleventh symbol (crosshair) is somehow influenced by the eigth symbol.

    But here’s my guess: It seems that the scribbled symbol is a triangle with a line at the top, the same as symbol ten. After drawing symbol ten, it became apparent to the author that eight and ten were the same. In the cipher, the symbols can’t repeat so close together, thus revealing the error. This probably also means the alphabets are unique, and instead of the symbol being an off-by-one error, they accidentally rotated the glyph when writing it.

  3. thomas spande on May 22, 2014 at 7:43 pm said:

    ponky, et al.,, Your reasoning seems sound to me. There also seems a bit more scrubbing out of symbol 2 in the abortive line 14 than necessary. I wonder if that one also started out as something else, like symbol #3 in line 11? It is probably obvious to all but I think there has to be a lot of redundancy in the scorptio cyphers such as maybe,the triangles within squares being equivalent to naked triangles; likewise “X”x within squares being equivalent to plain “X’s” or parts of “x’s”? It looks to me as though the “dots” are key to decrypting this pup. For example, although often no dots are evident, the shaded part of the triangle within a square may be equivalent to two dotted triangle vertices that are hidden but indicated by the shaded part. Cheers, Tom .

  4. thomas spande on May 27, 2014 at 6:42 pm said:

    Dear all, Note that S5, the longer of the scorpion ciphers has what appears to be a pair of parentheses, EXCEPT they are in the reverse order if read from top to bottom (line 9, symbol 5; line 10, symbol 10). If read L->R BUT bottom to top, all is well. This might be a total red herring but if the cipher is designed to be red B->T, then maybe the following is true for a parenthetical expression where the second symbol is an “S”: The first character is likely a vowel. Some other guesses: the pips on the squares may represent lower case consonants depending upon whether an ascender/descender or horizontals are at top or bottom. Four pips at the corners might for example be an “n”. Ten of these exist and I include line 8, symbol #1 among them. I think many of the triangles semi-circles, circles, “X’s”, crosses are geometric operators, e.g. line 3, symbol 11 might be a left facing circle that needs to be rotated by 180 degrees (three vertices of the following triangle are marked and would = 3×60 degrees).Line 7, symbol 7 might be a 90 degree arc.

    As an irrelevant aside. I am guessing the scorpio cypher-ist is of the feminine gender by the “Hi!” at the heading (who else would use an exclamation point?) and the roundish lettering on the interrogative “Remember me?” “Had we world enough and time, then thy coyness were no crime!”

  5. Although the individual glyphs could be individual and separate specifiers (or characters), it looks to me like SOME of the various glyphs are the same “group” and the various modifying dots, fill-ins, lines, and rotations are sub-selectors, designators, and/or specifiers within that group. The question is, what do they mean, and are they “consistent” in their meaning? (that is, one dot, one corresponding fill-in, etc. — or just within one group) If there were more samples, I think this would become very evident. It’s very odd that only two of the samples were published. What’s that all about? (is it John Walsh’s inner demon(s) exposing themselves to the world via trauma-induced schizophrenic exhibitionism? — wouldn’t ‘blame’ him at all)

  6. btw — you didn’t mention it, but I’m guessing you know about this site: http://www.opordanalytical.com/phpBB3/viewtopic.php?f=138&t=200

  7. BillD: yeah, I mentioned it in the post (I think). It’s just frustrating that the other ciphertexts have never been released in all that time. 🙁

  8. Pingback: Scorpion S1 Cipher and ‘Letter’ Scorpion Cipher (1991)

  9. With so many unique symbols repeated, as if at random, it seems the solutions, if any, can only be very short. It may also contain numbers. The solution may be nothing more than a single word or a set of numbers reserved for the final symbols. Without a key, anybody’s guess!

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Post navigation