| Andrew Cooke | Contents | Latest | RSS | Twitter | Previous | Next

C[omp]ute

Welcome to my blog, which was once a mailing list of the same name and is still generated by mail. Please reply via the "comment" links.

Always interested in offers/projects/new ideas. Eclectic experience in fields like: numerical computing; Python web; Java enterprise; functional languages; GPGPU; SQL databases; etc. Based in Santiago, Chile; telecommute worldwide. CV; email.

Personal Projects

Lepl parser for Python.

Colorless Green.

Photography around Santiago.

SVG experiment.

Professional Portfolio

Calibration of seismometers.

Data access via web services.

Cache rewrite.

Extending OpenSSH.

C-ORM: docs, API.

Last 100 entries

Calling C From Fortran 95; Bjork DJ Set; Z3 Example With Python; Week 1; Useful Guide To Starting With IJulia; UK Election + Media; Review: Reinventing Organizations; Inline Assembly With Julia / LLVM; Against the definition of types; Dumb Crypto Paper; The Search For Quasi-Periodicity...; Is There An Alternative To Processing?; CARDIAC (CARDboard Illustrative Aid to Computation); The Bolivian Case Against Chile At The Hague; Clear, Cogent Economic Arguments For Immigration; A Program To Say If I Am Working; Decent Cards For Ill People; New Photo; Luksic And Barrick Gold; President Bachelet's Speech; Baltimore Primer; libxml2 Parsing Stream; configure.ac Recipe For Library Path; The Davalos Affair For Idiots; Not The Onion: Google Fireside Chat w Kissinger; Bicycle Wheels, Inertia, and Energy; Another Tax Fraud; Google's Borg; A Verion That Redirects To Local HTTP Server; Spanish Accents For Idiots; Aluminium Cans; Advice on Spray Painting; Female View of Online Chat From a Male; UX Reading List; S4 Subgroups - Geometric Interpretation; Fucking Email; The SQM Affair For Idiots; Using Kolmogorov Complexity; Oblique Strategies in bash; Curses Tools; Markov Chain Monte Carlo Without all the Bullshit; Email Para Matias Godoy Mercado; The Penta Affair For Idiots; Example Code To Create numpy Array in C; Good Article on Bias in Graphic Design (NYTimes); Do You Backup github?; Data Mining Books; SimpleDateFormat should be synchronized; British Words; Chinese Govt Intercepts External Web To DDOS github; Numbering Permutations; Teenage Engineering - Low Price Synths; GCHQ Can Do Whatever It Wants; Dublinesque; A Cryptographic SAT Solver; Security Challenges; Word Lists for Crosswords; 3D Printing and Speaker Design; Searchable Snowden Archive; XCode Backdoored; Derived Apps Have Malware (CIA); Rowhammer - Hacking Software Via Hardware (DRAM) Bugs; Immutable SQL Database (Kinda); Tor GPS Tracker; That PyCon Dongle Mess...; ASCII Fluid Dynamics; Brandalism; Table of Shifter, Cassette and Derailleur Compatability; Lenovo Demonstrates How Bad HTTPS Is; Telegraph Owned by HSBC; Smaptop - Sunrise (Music); Equation Group (NSA); UK Torture in NI; And - A Natural Extension To Regexps; This Is The Future Of Religion; The Shazam (Music Matching) Algorithm; Tributes To Lesbian Community From AIDS Survivors; Nice Rust Summary; List of Good Fiction Books; Constructing JSON From Postgres (Part 2); Constructing JSON From Postgres (Part 1); Postgres in Docker; Why Poor Places Are More Diverse; Smart Writing on Graceland; Satire in France; Free Speech in France; MTB Cornering - Where Should We Point Our Thrusters?; Secure Secure Shell; Java Generics over Primitives; 2014 (Charlie Brooker); How I am 7; Neural Nets Applied to Go; Programming, Business, Social Contracts; Distributed Systems for Fun and Profit; XML and Scheme; Internet Radio Stations (Curated List); Solid Data About Placebos; Half of Americans Think Climate Change Is a Sign of the Apocalypse; Saturday Surf Sessions With Juvenile Delinquents; Ssh, tty, stdout and stderr; Feathers falling in a vacuum; Santiago 30m Bike Route

© 2006-2015 Andrew Cooke (site) / post authors (content).

Spindromes

From: andrew cooke <andrew@...>

Date: Sun, 6 Feb 2011 07:01:49 -0300

A palindrome is a word that reads the same backwards as forwards.

Some letters look like letters when they are upside down, which suggests that
there should be something like a palindromes that work when a word is rotated
by 180 degress about its middle.  I will call these spindromes.

This seemed like an obvious use for Lepl, which will give multiple matches - 
all we need to do is
  1 - Identify which letters work
  2 - Write a parser that matches words consisting of such letters
  3 - Run that parser against a list of words

Andrew


from lepl import *

# What letters can we match?  I don't know how to generate this except by
# checking each letter by eye, which gives the following (I'll use either case
# in the hope of getting more matches):
pairs = [('b', 'q'), ('d', 'p'), ('h', 'y'), ('i', 'i'), ('l', 'l'), ('m', 'w'), 
         ('n', 'u'), ('o', 'o'), ('s', 's'), ('x', 'x'), ('z', 'z'),
         ('H', 'H'), ('I', 'I'), ('M', 'W')]

# Some of those are self-images so can occur in the middle of words with an
# odd number of letters
def single(pair):
    return pair[0] == pair[1]
singles = [pair[0] for pair in pairs if single(pair)] 

# We want to do caseless matching but return the correct case (so we can
# see when capitals are used).  So we need a matcher that does that.  Lepl
# doesn't have anything built-in, but we can write our own (following Lepl's
# convention of using capitals to indicate matcher factories).
@function_matcher_factory()
def Caseless(letter):
    '''
    Given a letter, this returns a matcher that will match the first character
    of a stream if the letter appears (ignoring case), returning the letter
    as the match.
    '''
    def matcher(support, stream):
        if stream and stream[0].lower() == letter.lower():
            return ([letter], stream[1:])
    return matcher

# And then we can use that to match and of the central letters:
central = Or(*map(Caseless, singles))

# The final matcher is going to be recursive (matching repeated pairs "inside"
# itself).  That's going to be recursive; we handle that by introducing 
# the name so that we can reference it later.
outer = Delayed()

# To define a matcher for any pair we'll first write a function that can
# generate one for a single pair (note how this calls the pre-defined outer)
def Bracket(pair, inner):
    a, b = [Caseless(letter) for letter in pair]
    if single(pair):
        return a + inner + b
    else:
        return a + inner + b | b + inner + a 
def Outer(pair):
    return Bracket(pair, outer | Empty())

# Finally, we can "tie the knot" (we put the central matcher here so that
# we can match single letters)
outer += Or(*[Outer(pair) for pair in pairs]) | central

# Some simple tests:
assert outer.parse('o') == ['o']
assert outer.parse('O') == ['o']
assert outer.parse('pod') == ['pod']
assert outer.parse('pboQd') == ['pboqd']

# But that takes WAY too long.  Instead, we need to restrict backtracking by
# adjusting the matcher to the word length.  We'll make a matcher for a given
# length then cache/build matchers as necessary.
def CountedOuter(n):
    if n == 0:
        return Empty()
    elif n == 1:
        return central
    else:
        inner = CountedOuter(n-2)
        return Or(*[Bracket(pair, inner) for pair in pairs])

# And cache by length
cache = {}
def parse(word):
    n = len(word)
    if n not in cache:
        cache[n] = CountedOuter(n)
        # I tried compiling to a regular expression here (re lib), but it
        # doesn't work too well (hangs dues to exponential complexity on 
        # longer words)
    return cache[len(word)].parse_string(word)

assert parse('o') == ['o']
assert parse('O') == ['o']
assert parse('pod') == ['pod']
assert parse('pboQd') == ['pboqd']

# Now let's run that against the contents of the dictionary:
with open('/usr/share/dict/words', encoding='latin_1') as words:
    for word in words:
        word = word.strip()
        try:
            print(parse(word)[0]) # will be a list containing a single word
        except:
            pass


The results are less exciting than I had hoped:

dip
dollop
dop
dp
H
HoH
hoy
HunH
hy
i
issi
l
lil
ll
mow
msw
mw
niu
nu
o
oHo
oo
oxo
pd
pHd
pid
pod
pood
qb
s
sis
solos
sos
spods
ss
sss
suns
swims
un
usn
wm
x
z
ziz
zzz

Final Code

From: andrew cooke <andrew@...>

Date: Thu, 10 Feb 2011 08:45:47 -0300

https://code.google.com/p/lepl/source/browse/src/lepl/_example/spindrome.py

Andrew

Comment on this post