C[omp]ute

Welcome to my blog, which was once a mailing list of the same name and is still generated by mail. Please reply via the "comment" links.

© 2006-2013 Andrew Cooke (site) / post authors (content).

New Parser in Python

From: "andrew cooke" <andrew@...>

Date: Mon, 12 Jan 2009 00:54:55 -0300 (CLST)

Not ready for release yet, but I've just got a new parser, written in
Python, to the point where it's useful.

It includes full backtracking (parse forests etc), the (untested) ability
to 'automatically' control resource use (think 'maximum backtracking stack')
and enough syntactic sugar to rot your teeth :o)

This test:


from logging import basicConfig, DEBUG
from unittest import TestCase

from lepl.match import *
from lepl.node import Node


class NodeTest(TestCase):


  def test_node(self):
    basicConfig(level=DEBUG)

    class Term(Node): pass
    class Factor(Node): pass
    class Expression(Node): pass

    expression  = Delayed()
    number      = Digit()[1:,...]                   > 'number'
    term        = (number | '(' / expression / ')') > Term
    muldiv      = Any('*/')                         > 'operator'
    factor      = (term / (muldiv / term)[0:])      > Factor
    addsub      = Any('+-')                         > 'operator'
    expression += (factor / (addsub / factor)[0:])  > Expression

    (ast, _) = next(expression.match_string('1 + 2 * (3 + 4 - 5)'))
    print(ast[0])
    (ast, _) = next(expression('1 + 2 * (3 + 4 - 5)'))
    print(ast[0])


Prints the following (twice):

Expression
 +- Factor
 |   +- Term
 |   |   `- number=1
 |   `- ' '
 +- operator=+
 +- ' '
 `- Factor
     +- Term
     |   `- number=2
     +- ' '
     +- operator=*
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number=3
         |   |   `- ' '
         |   +- operator=+
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number=4
         |   |   `- ' '
         |   +- operator=-
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number=5
         `- ')'

Andrew

Parsing Credits

From: "andrew cooke" <andrew@...>

Date: Mon, 12 Jan 2009 00:58:24 -0300 (CLST)

I should have added that this copies lots of good ideas from both
pyparsing - http://pyparsing.wikispaces.com/ - and (more so) "Pattern
Matching In Python" http://www.wilmott.ca/python/patternmatching.html

Andrew

Syntax

From: "andrew cooke" <andrew@...>

Date: Mon, 12 Jan 2009 01:11:39 -0300 (CLST)

A quick explanation of the syntax:

This allows forward references to 'expression', which will be defined later.

    expression  = Delayed()

This defines 'number' as one or more digits, specified via '[1:]', and
combines the digits into a single string, specified via '[...]'.  The
result is then associated with the tag 'number'.

    number      = Digit()[1:,...]                   > 'number'
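
For anyone wondering how '[1:,...]' can be legal Python: it is ordinary
indexing, and the object receives a tuple containing a slice and Ellipsis.
A minimal sketch of how a matcher class might read that (the class
'Repeatable' and the 'min' / 'max' / 'join' keys are hypothetical names
for illustration, not taken from the parser itself):

```python
class Repeatable:
    """Hypothetical stand-in for a matcher; it only records what [] was given."""

    def __getitem__(self, index):
        # obj[1:, ...] arrives as the tuple (slice(1, None, None), Ellipsis);
        # obj[0:] arrives as a bare slice, so normalise to a tuple first
        parts = index if isinstance(index, tuple) else (index,)
        spec = {'min': 0, 'max': None, 'join': False}
        for part in parts:
            if part is Ellipsis:
                spec['join'] = True          # '...' asks for results to be joined
            elif isinstance(part, slice):
                spec['min'] = part.start or 0
                spec['max'] = part.stop      # None means no upper limit
            else:
                raise TypeError('unsupported index: %r' % (part,))
        return spec


one_or_more_joined = Repeatable()[1:, ...]
zero_or_more = Repeatable()[0:]
print(one_or_more_joined)
print(zero_or_more)
```

A real matcher would return a new repeating matcher rather than a dict, but
the decoding of the index would presumably look something like this.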

This defines 'term' as either 'number' or (with backtracking) a bracketed
expression.  The strings are automatically promoted to literal matchers and
each '/' indicates that optional spaces may appear between the matchers ('//'
for required space).  The result is used to construct a Term instance,
which is a subclass of Node (and which will automatically construct
attributes for the contents).

    term        = (number | '(' / expression / ')') > Term
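
The line above relies on two things in Python 3: operator overloading
(including the reflected form, so that a plain string on the left of '/'
still works) and the usual precedence, where '/' binds tighter than '|',
which binds tighter than '>'.  A rough sketch that only records how the
pieces were combined (the class 'M' is a hypothetical stand-in, not the
parser's own matcher class):

```python
class M:
    """Hypothetical matcher that only records how it was combined."""

    def __init__(self, text):
        self.text = text

    @staticmethod
    def _wrap(other):
        # Promote plain strings to (pretend) literal matchers
        return other if isinstance(other, M) else M(repr(other))

    def __truediv__(self, other):       # matcher / something
        return M('(%s / %s)' % (self.text, M._wrap(other).text))

    def __rtruediv__(self, other):      # 'string' / matcher
        return M('(%s / %s)' % (M._wrap(other).text, self.text))

    def __or__(self, other):            # matcher | something
        return M('(%s | %s)' % (self.text, M._wrap(other).text))

    def __gt__(self, target):           # matcher > tag-or-class
        name = getattr(target, '__name__', repr(target))
        return M('%s -> %s' % (self.text, name))


class Term:
    pass


number, expression = M('number'), M('expression')
term = (number | '(' / expression / ')') > Term
print(term.text)
```

The printed structure shows the bracketed sequence being grouped before the
alternation, and the whole alternation being passed to Term, which is why
the outer parentheses are needed in the original line.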

This defines 'muldiv' to be either '*' or '/' and tags the result as 'operator'.

    muldiv      = Any('*/')                         > 'operator'

Hopefully this is becoming obvious.  The '[0:]' here means '0 or more'
instances of 'muldiv', optional space, and 'term'.

    factor      = (term / (muldiv / term)[0:])      > Factor

Nothing new here.

    addsub      = Any('+-')                         > 'operator'

This defines the 'Delayed' matcher introduced earlier (it was introduced
so that we could reference it in 'term', even though we cannot define it
until later).

    expression += (factor / (addsub / factor)[0:])  > Expression
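
The forward reference trick can be sketched in a few lines.  This is only
an illustration under my own assumptions (this 'Delayed' class and the
helper functions 'digit' and 'bracketed' are simplified, hypothetical
versions that return a single result instead of backtracking): the
placeholder is handed out first, and '+=' later binds the real definition
to the very same object.

```python
class Delayed:
    """Hypothetical placeholder that forwards to a matcher bound later."""

    def __init__(self):
        self.matcher = None

    def __iadd__(self, matcher):
        # 'placeholder += matcher' binds the definition and returns the same
        # object, so references handed out earlier now work
        self.matcher = matcher
        return self

    def __call__(self, text):
        if self.matcher is None:
            raise ValueError('Delayed matcher used before it was defined')
        return self.matcher(text)


def digit(text):
    # Match a single leading digit; return (value, rest) or None
    return (text[0], text[1:]) if text[:1].isdigit() else None


def bracketed(inner):
    # Match '(' inner ')'; 'inner' may still be an undefined placeholder here
    def match(text):
        if text[:1] != '(':
            return None
        result = inner(text[1:])
        if result is None or result[1][:1] != ')':
            return None
        return (result[0], result[1][1:])
    return match


expression = Delayed()
nested = bracketed(expression)      # uses 'expression' before it is defined


def term(text):
    return digit(text) or nested(text)


expression += term                  # now the recursion is closed

result = expression('((7))')
print(result)
```

Because '__iadd__' returns 'self', the name 'expression' still refers to the
object that 'nested' captured earlier.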

Not sure if it's obvious, but one major aim has been to try to combine the
best of both OO and functional programming, in what I feel is a very
'Pythonic' way.

Andrew

With Backtracking

From: "andrew cooke" <andrew@...>

Date: Mon, 12 Jan 2009 01:25:59 -0300 (CLST)

Changing the spec slightly to:

  expression  = Delayed()
  number      = Digit()[1:,...]                   > 'number'
  term        = (number | '(' / expression / ')') > Term
  muldiv      = Any('*/')                         > 'operator'
  factor      = (term / (muldiv / term)[0:])      > Factor
  addsub      = Any('+-')                         > 'operator'
  expression += Drop(Any()[0:]) & \
                (factor / (addsub / factor)[0:])  > Expression

And using:

  for (ast, _) in expression('1 + 2 * (3 + 4 - 5)'):
    print(ast[0])

Gives:

Expression
 `- Factor
     `- Term
         `- number '5'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '4'
 |   `- ' '
 +- operator '-'
 +- ' '
 `- Factor
     `- Term
         `- number '5'
Expression
 `- Factor
     +- Term
     |   `- number '4'
     `- ' '
Expression
 +- Factor
 |   `- Term
 |       `- number '4'
 +- ' '
 +- operator '-'
 +- ' '
 `- Factor
     `- Term
         `- number '5'
Expression
 +- Factor
 |   `- Term
 |       `- number '4'
 `- ' '
Expression
 `- Factor
     `- Term
         `- number '4'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '3'
 |   `- ' '
 +- operator '+'
 +- ' '
 +- Factor
 |   +- Term
 |   |   `- number '4'
 |   `- ' '
 +- operator '-'
 +- ' '
 `- Factor
     `- Term
         `- number '5'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '3'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '4'
     `- ' '
Expression
 +- Factor
 |   +- Term
 |   |   `- number '3'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     `- Term
         `- number '4'
Expression
 `- Factor
     +- Term
     |   `- number '3'
     `- ' '
Expression
 +- Factor
 |   `- Term
 |       `- number '3'
 +- ' '
 +- operator '+'
 +- ' '
 +- Factor
 |   +- Term
 |   |   `- number '4'
 |   `- ' '
 +- operator '-'
 +- ' '
 `- Factor
     `- Term
         `- number '5'
Expression
 +- Factor
 |   `- Term
 |       `- number '3'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '4'
     `- ' '
Expression
 +- Factor
 |   `- Term
 |       `- number '3'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     `- Term
         `- number '4'
Expression
 +- Factor
 |   `- Term
 |       `- number '3'
 `- ' '
Expression
 `- Factor
     `- Term
         `- number '3'
Expression
 `- Factor
     `- Term
         +- '('
         +- Expression
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '4'
         |   +- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '3'
         |   |   `- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '3'
         |   +- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '4'
         |   +- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '3'
         |   |   `- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '3'
         |   +- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 `- Factor
     +- Term
     |   `- number '2'
     `- ' '
Expression
 +- Factor
 |   `- Term
 |       `- number '2'
 `- ' '
Expression
 `- Factor
     `- Term
         `- number '2'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '1'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '1'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '1'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '4'
         |   +- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '1'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '3'
         |   |   `- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '1'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '3'
         |   +- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   +- Term
 |   |   `- number '1'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     `- ' '
Expression
 +- Factor
 |   +- Term
 |   |   `- number '1'
 |   `- ' '
 +- operator '+'
 +- ' '
 `- Factor
     `- Term
         `- number '2'
Expression
 `- Factor
     +- Term
     |   `- number '1'
     `- ' '
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '4'
         |   +- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '3'
         |   |   `- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     +- ' '
     +- operator '*'
     +- ' '
     `- Term
         +- '('
         +- Expression
         |   +- Factor
         |   |   `- Term
         |   |       `- number '3'
         |   +- ' '
         |   +- operator '+'
         |   +- ' '
         |   +- Factor
         |   |   +- Term
         |   |   |   `- number '4'
         |   |   `- ' '
         |   +- operator '-'
         |   +- ' '
         |   `- Factor
         |       `- Term
         |           `- number '5'
         `- ')'
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     +- Term
     |   `- number '2'
     `- ' '
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 +- ' '
 +- operator '+'
 +- ' '
 `- Factor
     `- Term
         `- number '2'
Expression
 +- Factor
 |   `- Term
 |       `- number '1'
 `- ' '
Expression
 `- Factor
     `- Term
         `- number '1'
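
The order of the results above (shortest trailing match first) is what you
would expect if the leading 'Any()[0:]' is greedy and every matcher is a
generator of alternatives.  A small sketch of that idea, using plain
generator functions of my own invention ('any_char', 'digits', 'repeat' and
'drop_then' are hypothetical helpers, not the parser's code):

```python
def any_char(text):
    # Match any single character: yields (results, rest)
    if text:
        yield [text[0]], text[1:]


def digits(text):
    # Match a run of one or more digits, longest run first
    end = 0
    while end < len(text) and text[end].isdigit():
        end += 1
    for stop in range(end, 0, -1):
        yield [text[:stop]], text[stop:]


def repeat(matcher):
    # Zero or more repetitions, greedy: longer matches come before shorter
    def match(text):
        for results, rest in matcher(text):
            for more, remaining in match(rest):
                yield results + more, remaining
        yield [], text
    return match


def drop_then(first, second):
    # Sequence two matchers, discarding whatever the first one matched
    def match(text):
        for _, rest in first(text):
            for results, remaining in second(rest):
                yield results, remaining
    return match


skip_then_digits = drop_then(repeat(any_char), digits)
for results, rest in skip_then_digits('a12'):
    print(results, repr(rest))
```

Asking for the next result simply resumes the generators, so the skipped
prefix shrinks one character at a time: for 'a12' this yields '2' first,
then '12', then '1' with '2' left over.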
