© 2006-2015 Andrew Cooke (site) / post authors (content).

A Python Logging Service

From: andrew cooke <andrew@...>

Date: Sun, 22 Aug 2010 17:01:28 -0400

I've been looking at Twisted, which is a framework for cooperative
multi-tasking in Python.  I don't find that a very useful description, so here
are two alternatives:

1 - It's a way of structuring multi-threaded programs that's a lot more like
    Javascript or GUI toolkits.

2 - It's a way of writing network servers that work efficiently without using
    multiple threads.


There's a fair amount of documentation at
http://twistedmatrix.com/documents/current/core/ (and most importantly at
http://twistedmatrix.com/documents/current/core/howto/index.html ) - I suggest
reading through that until it sticks.  It took me a while, and writing the
code below, but now it makes a lot of sense (and it seems like a very nicely
engineered system).


I structured the example below as a set of different files, which was probably
excessive, but I wanted the different components to be as clear as possible.

Python logging can be serialised over a socket.  This code is a server that
receives serialised messages and writes them to a log.
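
As a rough illustration of what arrives at the server, here is a minimal
sketch (Python 3, standard library only, unlike the Python 2 code in the rest
of this post) of the framing used by logging.handlers.SocketHandler: a 4-byte
big-endian length followed by a pickle of the record's attribute dictionary.
The logger name and message are just sample values.

```python
import logging
import logging.handlers
import pickle
import struct

# Creating the handler does not open a connection; that only happens on emit.
handler = logging.handlers.SocketHandler('localhost', 2000)

# Sample record (the name and message are illustrative values).
record = logging.makeLogRecord({'name': 'test',
                                'levelno': logging.WARNING,
                                'levelname': 'WARNING',
                                'msg': 'a warning'})

# makePickle returns exactly the bytes the handler would send for this record.
frame = handler.makePickle(record)

# First 4 bytes: payload length as a big-endian unsigned long.
(length,) = struct.unpack('>L', frame[:4])
payload = frame[4:]

# Remainder: a pickled dict that makeLogRecord can turn back into a LogRecord.
decoded = logging.makeLogRecord(pickle.loads(payload))
handler.close()
```

Note that unpickling data from the network is only reasonable when the
clients are trusted.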


First, the protocol:

  from cPickle import loads
  from logging import makeLogRecord, getLogger
  from struct import unpack
  from twisted.internet.protocol import Protocol, connectionDone

  '''
  The protocol for a Twisted server that receives log messages.

  See http://docs.python.org/library/logging.html#socket-handler 
  '''

  class LoggingProtocol(Protocol):

      def dataReceived(self, data):
	  self.__data += data
	  while True:
	      if not self.__message_len and len(self.__data) >= 4:
		  # unpack length prefix
		  self.__message_len = unpack(">L", self.__data[:4])[0]
		  self.__data = self.__data[4:]
	      if self.__message_len and len(self.__data) >= self.__message_len:
		  # unpack message
		  record = makeLogRecord(loads(self.__data[0:self.__message_len]))
		  self.__data = self.__data[self.__message_len:]
		  self.__message_len = 0
		  logger = getLogger(record.name)
		  logger.handle(record)
	      else:
		  break

      def connectionMade(self):
	  self.__data = ''
	  self.__message_len = 0

      def connectionLost(self, reason=connectionDone):
	  self.__data = None
	  self.__message_len = None
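
The buffering in dataReceived matters because TCP delivers a byte stream, so a
single call may contain part of a message or several messages.  The same loop
can be sketched without Twisted (Python 3, standard library only).  The names
FrameBuffer, feed and frame are hypothetical, and this variation uses None
rather than 0 to mean "waiting for a length prefix".

```python
import struct


class FrameBuffer:
    """Accumulates bytes and returns complete length-prefixed payloads."""

    HEADER = struct.Struct('>L')  # 4-byte big-endian unsigned length

    def __init__(self):
        self._data = b''
        self._length = None  # None means "waiting for a length prefix"

    def feed(self, chunk):
        """Add chunk to the buffer; return the payloads completed by it."""
        self._data += chunk
        payloads = []
        while True:
            if self._length is None and len(self._data) >= self.HEADER.size:
                # unpack length prefix
                (self._length,) = self.HEADER.unpack(
                    self._data[:self.HEADER.size])
                self._data = self._data[self.HEADER.size:]
            if self._length is not None and len(self._data) >= self._length:
                # a whole message is available
                payloads.append(self._data[:self._length])
                self._data = self._data[self._length:]
                self._length = None
            else:
                break
        return payloads


def frame(payload):
    """Prefix payload with its length, as the sending side would."""
    return FrameBuffer.HEADER.pack(len(payload)) + payload


stream = frame(b'first') + frame(b'second')

# Worst case: the stream arrives one byte per call.
buf = FrameBuffer()
one_byte_at_a_time = [p for i in range(len(stream))
                      for p in buf.feed(stream[i:i + 1])]

# Best case: everything arrives in a single call.
all_at_once = FrameBuffer().feed(stream)
```

Both deliveries should produce the same two payloads, which is the property
the protocol relies on.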


Next, the factory (i.e. a protocol factory):

  from logging.config import dictConfig
  from twisted.internet.protocol import Factory

  from log.protocol import LoggingProtocol

  '''
  A factory for the remote Python logger.

  This seems to be the best location to store configuration information because
  it is accessible both in tests (using a reactor) and to an application.
  '''

  class LoggingFactory(Factory):

      protocol = LoggingProtocol

      DEFAULT_PORT = 2000
      DEFAULT_CONFIG = {'version': 1,
			'handlers':
			  {'file':
			    {'class': 'logging.FileHandler',
			     'filename': 'logging-service.log',
			     'level': 'DEBUG',
			  },},
			'root':
			  {'level': 'DEBUG',
			   'handlers': ['file']},}

      def __init__(self, config_dict=None):
	  if not config_dict:
	      config_dict = self.DEFAULT_CONFIG
	  dictConfig(config_dict)


And the service:

  from twisted.application.internet import TCPServer

  from log.factory import LoggingFactory

  '''
  A service for the remote Python logger.

  This is used by the application.
  '''

  class LoggingService(TCPServer):

      def __init__(self, port=None, config_dict=None, interface='0.0.0.0'):
	  if not port:
	      port = LoggingFactory.DEFAULT_PORT
	  # old-style classes in Twisted, so call the base __init__ directly
	  TCPServer.__init__(self, port, LoggingFactory(config_dict), 
			     interface=interface)


This can then be made into an application (a daemon) that's run from the
command-line using a tool called "twistd":

  # You can run this .tac file directly with:
  #    twistd -ny service.tac

  from log.service import LoggingService
  from twisted.application import service

  application = service.Application("Logging application")
  LoggingService().setServiceParent(application)


Alternatively, for testing, the Factory can be used directly.  This test code
also gives a glimpse of how the reactor is used to schedule events (there's
also an abstraction for chaining callbacks called "Deferred"):

  from logging.config import dictConfig
  from logging import getLogger
  from multiprocessing.process import Process
  from tempfile import mkstemp
  from twisted.internet import reactor
  from unittest import TestCase

  from log.factory import LoggingFactory


  class LoggingTest(TestCase):
      '''
      Test the logging service by starting an instance, then firing up a 
      separate process that logs to the service.
      '''

      def test_logging(self):
	  tick = Tick()
	  (_fd, self.tmp) = mkstemp()
	  process = Process(target=self.logging_process)
	  factory = LoggingFactory({'version': 1,
				    'handlers': {'file': {'class': 'logging.FileHandler',
							  'filename': self.tmp,
							  'level': 'DEBUG'}},
				    'root': {'level': 'DEBUG',
					     'handlers': ['file']}})
	  reactor.listenTCP(factory.DEFAULT_PORT, factory)
	  reactor.callLater(tick(), process.start)
	  reactor.callLater(tick(), reactor.stop)
	  reactor.run()
	  fd = open(self.tmp)
	  contents = fd.readlines()
	  assert contents == ['a warning\n'], contents
	  fd.close()

      def logging_process(self):
	  dictConfig({'version': 1,
		      'handlers':
			{'socket':
			  {'class': 'logging.handlers.SocketHandler',
			   'level': 'INFO',
			   'host': 'localhost',
			   'port': LoggingFactory.DEFAULT_PORT
			   },},
		      'root':
			{'level': 'INFO',
			 'handlers': ['socket']},
		     })
	  logger = getLogger('test')
	  logger.debug('a debug') # discarded by "level: INFO" above
	  logger.warn('a warning')


  class Tick(object):

      def __init__(self, increment=0.1):
	  self.__increment = increment
	  self.__time = 0

      def __call__(self, step=1):
	  self.__time += step * self.__increment
	  return self.__time
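
The client half of that test can also be checked on its own.  The sketch below
(Python 3, standard library only) replaces the Twisted server with a plain
listening socket and reads back what SocketHandler sends.  It assumes a local
TCP connection to 127.0.0.1 is possible; recv_exactly and the logger name
'wire-check' are hypothetical.

```python
import logging
import logging.handlers
import pickle
import socket
import struct


def recv_exactly(conn, n):
    """Read exactly n bytes from conn, or raise if the peer closes early."""
    chunks = []
    while n > 0:
        chunk = conn.recv(n)
        if not chunk:
            raise ConnectionError('socket closed before a full frame arrived')
        chunks.append(chunk)
        n -= len(chunk)
    return b''.join(chunks)


# Listen on an ephemeral port.  The kernel queues the client's connection
# until accept() is called, so no second process or thread is needed.
server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
server.bind(('127.0.0.1', 0))
server.listen(1)
server.settimeout(5)
port = server.getsockname()[1]

handler = logging.handlers.SocketHandler('127.0.0.1', port)
handler.setLevel(logging.INFO)
logger = logging.getLogger('wire-check')
logger.setLevel(logging.INFO)
logger.propagate = False
logger.addHandler(handler)

logger.debug('a debug')      # dropped: below the INFO threshold
logger.warning('a warning')  # sent as one length-prefixed pickle

conn, _addr = server.accept()
conn.settimeout(5)
(length,) = struct.unpack('>L', recv_exactly(conn, 4))
received = logging.makeLogRecord(pickle.loads(recv_exactly(conn, length)))

logger.removeHandler(handler)
handler.close()
conn.close()
server.close()
```

Only the warning should arrive, matching the assertion in the test above.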


Andrew
