| Andrew Cooke | Contents | Latest | RSS | Twitter | Previous | Next

C[omp]ute

Welcome to my blog, which was once a mailing list of the same name and is still generated by mail. Please reply via the "comment" links.

Always interested in offers/projects/new ideas. Eclectic experience in fields like: numerical computing; Python web; Java enterprise; functional languages; GPGPU; SQL databases; etc. Based in Santiago, Chile; telecommute worldwide. CV; email.

Personal Projects

Lepl parser for Python.

Colorless Green.

Photography around Santiago.

SVG experiment.

Professional Portfolio

Calibration of seismometers.

Data access via web services.

Cache rewrite.

Extending OpenSSH.

C-ORM: docs, API.

Last 100 entries

Calling C From Fortran 95; Bjork DJ Set; Z3 Example With Python; Week 1; Useful Guide To Starting With IJulia; UK Election + Media; Review: Reinventing Organizations; Inline Assembly With Julia / LLVM; Against the definition of types; Dumb Crypto Paper; The Search For Quasi-Periodicity...; Is There An Alternative To Processing?; CARDIAC (CARDboard Illustrative Aid to Computation); The Bolivian Case Against Chile At The Hague; Clear, Cogent Economic Arguments For Immigration; A Program To Say If I Am Working; Decent Cards For Ill People; New Photo; Luksic And Barrick Gold; President Bachelet's Speech; Baltimore Primer; libxml2 Parsing Stream; configure.ac Recipe For Library Path; The Davalos Affair For Idiots; Not The Onion: Google Fireside Chat w Kissinger; Bicycle Wheels, Inertia, and Energy; Another Tax Fraud; Google's Borg; A Verion That Redirects To Local HTTP Server; Spanish Accents For Idiots; Aluminium Cans; Advice on Spray Painting; Female View of Online Chat From a Male; UX Reading List; S4 Subgroups - Geometric Interpretation; Fucking Email; The SQM Affair For Idiots; Using Kolmogorov Complexity; Oblique Strategies in bash; Curses Tools; Markov Chain Monte Carlo Without all the Bullshit; Email Para Matias Godoy Mercado; The Penta Affair For Idiots; Example Code To Create numpy Array in C; Good Article on Bias in Graphic Design (NYTimes); Do You Backup github?; Data Mining Books; SimpleDateFormat should be synchronized; British Words; Chinese Govt Intercepts External Web To DDOS github; Numbering Permutations; Teenage Engineering - Low Price Synths; GCHQ Can Do Whatever It Wants; Dublinesque; A Cryptographic SAT Solver; Security Challenges; Word Lists for Crosswords; 3D Printing and Speaker Design; Searchable Snowden Archive; XCode Backdoored; Derived Apps Have Malware (CIA); Rowhammer - Hacking Software Via Hardware (DRAM) Bugs; Immutable SQL Database (Kinda); Tor GPS Tracker; That PyCon Dongle Mess...; ASCII Fluid Dynamics; Brandalism; Table of Shifter, Cassette and Derailleur Compatability; Lenovo Demonstrates How Bad HTTPS Is; Telegraph Owned by HSBC; Smaptop - Sunrise (Music); Equation Group (NSA); UK Torture in NI; And - A Natural Extension To Regexps; This Is The Future Of Religion; The Shazam (Music Matching) Algorithm; Tributes To Lesbian Community From AIDS Survivors; Nice Rust Summary; List of Good Fiction Books; Constructing JSON From Postgres (Part 2); Constructing JSON From Postgres (Part 1); Postgres in Docker; Why Poor Places Are More Diverse; Smart Writing on Graceland; Satire in France; Free Speech in France; MTB Cornering - Where Should We Point Our Thrusters?; Secure Secure Shell; Java Generics over Primitives; 2014 (Charlie Brooker); How I am 7; Neural Nets Applied to Go; Programming, Business, Social Contracts; Distributed Systems for Fun and Profit; XML and Scheme; Internet Radio Stations (Curated List); Solid Data About Placebos; Half of Americans Think Climate Change Is a Sign of the Apocalypse; Saturday Surf Sessions With Juvenile Delinquents; Ssh, tty, stdout and stderr; Feathers falling in a vacuum; Santiago 30m Bike Route

© 2006-2015 Andrew Cooke (site) / post authors (content).

A Python Logging Service

From: andrew cooke <andrew@...>

Date: Sun, 22 Aug 2010 17:01:28 -0400

I've been lookng at Twisted, which is a framework for cooperative
multi-tasking in Python.  I don't find that a very useful description, so here
are two alternatives:

1 - It's a way of structuring multi-threaded programs that's a lot more like
    Javascript or GUI toolkits.

2 - It's a way of writing network servers that work efficiently without using
    multiple threads.


There's a fair amount of documentation at
http://twistedmatrix.com/documents/current/core/ (and most imporantly at
http://twistedmatrix.com/documents/current/core/howto/index.html ) - I suggest
reading through that until it sticks.  It took me a while, and writing the
code below, but now it makes a lot of sense (and it seems like a very nicely
engineered system).


I structured the example below as a set of different files, which was probably
excessive, but I wanted the difference components to be as clear as possible.

Python logging can be serialised over a socket.  This code in a server that
receives seralised messages and writes them to a log.


First, the protocol:

  from cPickle import loads
  from logging import makeLogRecord, getLogger
  from struct import unpack
  from twisted.internet.protocol import Protocol, connectionDone

  '''
  The protocol for a Twisted server that receives log messages.

  See http://docs.python.org/library/logging.html#socket-handler 
  '''

  class LoggingProtocol(Protocol):

      def dataReceived(self, data):
	  self.__data += data
	  while True:
	      if not self.__message_len and len(self.__data) >= 4:
		  # unpack length prefix
		  self.__message_len = unpack(">L", self.__data[:4])[0]
		  self.__data = self.__data[4:]
	      if self.__message_len and len(self.__data) >= self.__message_len:
		  # unpack message
		  record =
	  makeLogRecord(loads(self.__data[0:self.__message_len]))
		  self.__data = self.__data[self.__message_len:]
		  self.__message_len = 0
		  logger = getLogger(record.name)
		  logger.handle(record)
	      else:
		  break

      def connectionMade(self):
	  self.__data = ''
	  self.__message_len = 0

      def connectionLost(self, reason=connectionDone):
	  self.__data = None
	  self.__message_len = None


Next, the factory (ie a protocol factory):

  from logging.config import dictConfig
  from twisted.internet.protocol import Factory

  from log.protocol import LoggingProtocol

  '''
  A factory for the remote Python logger.

  This seems to be the best location to store configuration information because
  it is accessible both in tests (using a reactor) and to an application.
  '''

  class LoggingFactory(Factory):

      protocol = LoggingProtocol

      DEFAULT_PORT = 2000
      DEFAULT_CONFIG = {'version': 1,
			'handlers':
			  {'file':
			    {'class': 'logging.FileHandler',
			     'filename': 'logging-service.log',
			     'level': 'DEBUG',
			  },},
			'root':
			  {'level': 'DEBUG',
			   'handlers': ['file']},}

      def __init__(self, config_dict=None):
	  if not config_dict:
	      config_dict = self.DEFAULT_CONFIG
	  dictConfig(config_dict)


And the service:

  from twisted.application.internet import TCPServer

  from log.factory import LoggingFactory

  '''
  A service for the remote Python logger.

  This is used by the application.
  '''

  class LoggingService(TCPServer):

      def __init__(self, port=None, config_dict=None, interface='0.0.0.0'):
	  if not port:
	      port = LoggingFactory.DEFAULT_PORT
	  # old style clases in twisted
	  TCPServer.__init__(self, port, LoggingFactory(config_dict), 
			     interface=interface)


This can then be made into an application (a daemon) that's run from the
command-line using a tool called "twistd":

  # You can run this .tac file directly with:
  #    twistd -ny service.tac

  from log.service import LoggingService
  from twisted.application import service

  application = service.Application("Logging application")
  LoggingService().setServiceParent(application)


Alternatively, for testing, the Fatcory can be used directly.  This test code
also gives a glimpse of how the reactor is used to schedule events (there's
also an abstraction for chaining callbacks called "Defered"):

  from logging.config import dictConfig
  from logging import getLogger
  from multiprocessing.process import Process
  from tempfile import mkstemp
  from twisted.internet import reactor
  from unittest import TestCase

  from log.factory import LoggingFactory


  class LoggingTest(TestCase):
      '''
      Test the logging service by starting an instance, then firing up a 
      separate process that logs to the service.
      '''

      def test_logging(self):
	  tick = Tick()
	  (_fd, self.tmp) = mkstemp()
	  process = Process(target=self.logging_process)
	  factory = LoggingFactory({'version': 1,
				    'handlers': {'file': {'class': 'logging.FileHandler',
							  'filename': self.tmp,
							  'level': 'DEBUG'}},
				    'root': {'level': 'DEBUG',
					     'handlers': ['file']}})
	  reactor.listenTCP(factory.DEFAULT_PORT, factory)
	  reactor.callLater(tick(), process.start)
	  reactor.callLater(tick(), reactor.stop)
	  reactor.run()
	  fd = open(self.tmp)
	  contents = fd.readlines()
	  assert contents == ['a warning\n'], contents
	  fd.close()

      def logging_process(self):
	  dictConfig({'version': 1,
		      'handlers':
			{'socket':
			  {'class': 'logging.handlers.SocketHandler',
			   'level': 'INFO',
			   'host': 'localhost',
			   'port': LoggingFactory.DEFAULT_PORT
			   },},
		      'root':
			{'level': 'INFO',
			 'handlers': ['socket']},
		     })
	  logger = getLogger('test')
	  logger.debug('a debug') # discarded by "level: INFO" above
	  logger.warn('a warning')


  class Tick(object):

      def __init__(self, increment=0.1):
	  self.__increment = increment
	  self.__time = 0

      def __call__(self, step=1):
	  self.__time += step * self.__increment
	  return self.__time


Andrew

Comment on this post