| Andrew Cooke | Contents | Latest | RSS | Twitter | Previous | Next

C[omp]ute

Welcome to my blog, which was once a mailing list of the same name and is still generated by mail. Please reply via the "comment" links.

Always interested in offers/projects/new ideas. Eclectic experience in fields like: numerical computing; Python web; Java enterprise; functional languages; GPGPU; SQL databases; etc. Based in Santiago, Chile; telecommute worldwide. CV; email.

Personal Projects

Lepl parser for Python.

Colorless Green.

Photography around Santiago.

SVG experiment.

Professional Portfolio

Calibration of seismometers.

Data access via web services.

Cache rewrite.

Extending OpenSSH.

Last 100 entries

The Black Dork Lives!; The UN Requires Unaninmous Decisions; LPIR - Steganography in Practice; How I Am 6; Clear Explanation of Verizon / Level 3 / Netflix; Teenage Girls; Formalising NSA Attacks; Switching Brakes (Tektro Hydraulic); Naim NAP 100 (Power Amp); AKG 550 First Impressions; Facebook manipulates emotions (no really); Map Reduce "No Longer Used" At Google; Removing RAID metadata; New Bike (Good Bike Shop, Santiago Chile); Removing APE Tags in Linux; Compiling Python 3.0 With GCC 4.8; Maven is Amazing; Generating Docs from a GitHub Wiki; Modular Shelves; Bash Best Practices; Good Emergency Gasfiter (Santiago, Chile); Readings in Recent Architecture; Roger Casement; Integrated Information Theory (Or Not); Possibly undefined macro AC_ENABLE_SHARED; Update on Charges; Sunburst Visualisation; Spectral Embeddings (Distances -> Coordinates); Introduction to Causality; Filtering To Help Colour-Blindness; ASUS 1015E-DS02 Too; Ready Player One; Writing Clear, Fast Julia Code; List of LatAm Novels; Running (for women); Building a Jenkins Plugin and a Jar (for Command Line use); Headphone Test Recordings; Causal Consistency; The Quest for Randomness; Chat Wars; Real-life Financial Co Without ACID Database...; Flexible Muscle-Based Locomotion for Bipedal Creatures; SQL Performance Explained; The Little Manual of API Design; Multiple Word Sizes; CRC - Next Steps; FizzBuzz; Update on CRCs; Decent Links / Discussion Community; Automated Reasoning About LLVM Optimizations and Undefined Behavior; A Painless Guide To CRC Error Detection Algorithms; Tests in Julia; Dave Eggers: what's so funny about peace, love and Starship?; Cello - High Level C Programming; autoreconf needs tar; Will Self Goes To Heathrow; Top 5 BioInformatics Papers; Vasovagal Response; Good Food in Vina; Chilean Drug Criminals Use Subsitution Cipher; Adrenaline; Stiglitz on the Impact of Technology; Why Not; How I Am 5; Lenovo X240 OpenSuse 13.1; NSA and GCHQ - Psychological Trolls; Finite Fields in Julia (Defining Your Own Number Type); Julian Assange; Starting Qemu on OpenSuse; Noisy GAs/TMs; Venezuela; Reinstalling GRUB with EFI; Instructions For Disabling KDE Indexing; Evolving Speakers; Changing Salt Size in Simple Crypt 3.0.0; Logarithmic Map (Moved); More Info; Words Found in Voynich Manuscript; An Inventory Of 3D Space-Filling Curves; Foxes Using Magnetic Fields To Hunt; 5 Rounds RC5 No Rotation; JP Morgan and Madoff; Ori - Secure, Distributed File System; Physical Unclonable Functions (PUFs); Prejudice on Reddit; Recursion OK; Optimizing Julia Code; Cash Handouts in Brazil; Couple Nice Music Videos; It Also Works!; Adaptive Plaintext; It Works!; RC5 Without Rotation (2); 8 Years...; Attack Against Encrypted Linux Disks; Pushing Back On NSA At IETF; Summary of Experimental Ethics; Very Good Talk On Security, Snowden; Locusts are Grasshoppers!; Vagrant (OpenSuse and IDEs); Interesting Take On Mandela's Context

© 2006-2013 Andrew Cooke (site) / post authors (content).

Spindromes

From: andrew cooke <andrew@...>

Date: Sun, 6 Feb 2011 07:01:49 -0300

A palindrome is a word that reads the same backwards as forwards.

Some letters look like letters when they are upside down, which suggests that
there should be something like a palindromes that work when a word is rotated
by 180 degress about its middle.  I will call these spindromes.

This seemed like an obvious use for Lepl, which will give multiple matches - 
all we need to do is
  1 - Identify which letters work
  2 - Write a parser that matches words consisting of such letters
  3 - Run that parser against a list of words

Andrew


from lepl import *

# What letters can we match?  I don't know how to generate this except by
# checking each letter by eye, which gives the following (I'll use either case
# in the hope of getting more matches):
pairs = [('b', 'q'), ('d', 'p'), ('h', 'y'), ('i', 'i'), ('l', 'l'), ('m', 'w'), 
         ('n', 'u'), ('o', 'o'), ('s', 's'), ('x', 'x'), ('z', 'z'),
         ('H', 'H'), ('I', 'I'), ('M', 'W')]

# Some of those are self-images so can occur in the middle of words with an
# odd number of letters
def single(pair):
    return pair[0] == pair[1]
singles = [pair[0] for pair in pairs if single(pair)] 

# We want to do caseless matching but return the correct case (so we can
# see when capitals are used).  So we need a matcher that does that.  Lepl
# doesn't have anything built-in, but we can write our own (following Lepl's
# convention of using capitals to indicate matcher factories).
@function_matcher_factory()
def Caseless(letter):
    '''
    Given a letter, this returns a matcher that will match the first character
    of a stream if the letter appears (ignoring case), returning the letter
    as the match.
    '''
    def matcher(support, stream):
        if stream and stream[0].lower() == letter.lower():
            return ([letter], stream[1:])
    return matcher

# And then we can use that to match and of the central letters:
central = Or(*map(Caseless, singles))

# The final matcher is going to be recursive (matching repeated pairs "inside"
# itself).  That's going to be recursive; we handle that by introducing 
# the name so that we can reference it later.
outer = Delayed()

# To define a matcher for any pair we'll first write a function that can
# generate one for a single pair (note how this calls the pre-defined outer)
def Bracket(pair, inner):
    a, b = [Caseless(letter) for letter in pair]
    if single(pair):
        return a + inner + b
    else:
        return a + inner + b | b + inner + a 
def Outer(pair):
    return Bracket(pair, outer | Empty())

# Finally, we can "tie the knot" (we put the central matcher here so that
# we can match single letters)
outer += Or(*[Outer(pair) for pair in pairs]) | central

# Some simple tests:
assert outer.parse('o') == ['o']
assert outer.parse('O') == ['o']
assert outer.parse('pod') == ['pod']
assert outer.parse('pboQd') == ['pboqd']

# But that takes WAY too long.  Instead, we need to restrict backtracking by
# adjusting the matcher to the word length.  We'll make a matcher for a given
# length then cache/build matchers as necessary.
def CountedOuter(n):
    if n == 0:
        return Empty()
    elif n == 1:
        return central
    else:
        inner = CountedOuter(n-2)
        return Or(*[Bracket(pair, inner) for pair in pairs])

# And cache by length
cache = {}
def parse(word):
    n = len(word)
    if n not in cache:
        cache[n] = CountedOuter(n)
        # I tried compiling to a regular expression here (re lib), but it
        # doesn't work too well (hangs dues to exponential complexity on 
        # longer words)
    return cache[len(word)].parse_string(word)

assert parse('o') == ['o']
assert parse('O') == ['o']
assert parse('pod') == ['pod']
assert parse('pboQd') == ['pboqd']

# Now let's run that against the contents of the dictionary:
with open('/usr/share/dict/words', encoding='latin_1') as words:
    for word in words:
        word = word.strip()
        try:
            print(parse(word)[0]) # will be a list containing a single word
        except:
            pass


The results are less exciting than I had hoped:

dip
dollop
dop
dp
H
HoH
hoy
HunH
hy
i
issi
l
lil
ll
mow
msw
mw
niu
nu
o
oHo
oo
oxo
pd
pHd
pid
pod
pood
qb
s
sis
solos
sos
spods
ss
sss
suns
swims
un
usn
wm
x
z
ziz
zzz

Final Code

From: andrew cooke <andrew@...>

Date: Thu, 10 Feb 2011 08:45:47 -0300

https://code.google.com/p/lepl/source/browse/src/lepl/_example/spindrome.py

Andrew

Comment on this post