Writing Python Extensions in C++

Paul F. Dubois, dubois1@llnl.gov
Lawrence Livermore National Laboratory

CXX is designed to make it easier to extend Python with C++

CXX_Objects is a set of C++ facilities to make it easier to write Python extensions. The chief way in which CXX makes it easier to write Python extensions is that it greatly increases the probability that your program will not make a reference-counting error and will not have to continually check error returns from the Python C API. CXX_Objects integrates Python with C++ in these ways:

C++ exception handling is relied on to detect errors and clean up. In a complicated function this is often a tremendous problem when writing in C. With CXX, we let the compiler keep track of what objects need to be dereferenced when an error occurs.
The Standard Template Library (STL) and its many algorithms plug and play with Python containers such as lists and tuples.
The optional CXX_Extensions facility (still in a quite experimental state) allows you to replace the clumsy C tables with objects and method calls that define your modules and extension objects.

Using CXX_Objects

The distribution is in the LLNL distribution under directory CXX. The subdirectory cxx contains the header file CXX_Objects.h and the implementation file cxxobjects.cxx. If your compiler does not support namespaces you will need to edit CXX_config.h to so indicate.

All declarations in CXX_Objects.h are in a namespace "Py", and so you may wish to use it in the form:

#include "CXX_Objects.h"
using namespace Py;

The second part of CXX is a facility to make it easier to create extension modules and extension objects. Also provided is a file CXX_Extensions.h and its support file cxxextensions.c. While the latter is a C file, it is written so as to compile properly with either a C or a C++ compiler. It is not necessary to use this part of CXX in order to use CXX_Objects.h.

A directory "example" is provided in the distribution. The example demonstrates both parts of CXX..

First we consider the CXX_Objects.h facilities.

We avoid programming with Python object pointers

The essential idea is that we avoid, as much as possible, programming with pointers to Python objects, that is, variables of type PyObject*. Instead, we use instances of a family of C++ classes that represent the usual Python objects. This family is easily extendible to include new kinds of Python objects.

For example, consider the case in which we wish to write a method, taking a single integer argument, that will create a Python dict and insert into it that Python int plus one under the key value. In C we might do that as follows:

static PyObject*
mymodule_addvalue (PyObject* self, PyObject* args) {
	PyObject *d;
	PyObject* f;
	int k;
	PyArgs_ParseTuple(args, "i", &k);
	d = PyDict_New();
	if (!d) {
		return NULL;
	}
	f = PyInt_NEW(k+1);
	if(!f) {
		Py_DECREF(d); /* have to get rid of d first */
		return NULL;
	}
	if(PyDict_SetItemString(d, "value", f) == -1) {
		Py_DECREF(f);
		Py_DECREF(d);
		return NULL;
	}
	return d;
}

If you have written a significant Python extension, this tedium looks all too familiar. The vast bulk of the coding is error checking and cleanup. Now compare the same thing written in C++ using CXX_Objects. The things with Python-like names (Int, Dict, Tuple) are from CXX_Objects.

static PyObject* 
mymodule_addvalue (PyObject* self, PyObject* pargs) { 
	try { 
		Tuple args(pargs); 
		args.verify_length(1); 
		Dict d; 
		Int k = args[0]; 
		d["value"] = k + 1;
		return new_reference_to(d); 
	} 
	catch (const PyException&) { 
		return NULL;
	}
}

If there aren’t the right number of arguments or the argument isn’t an integer, an exception is thrown. In this case we choose to catch it and convert it into a Python exception. C++’s exception handling mechanism takes care all the cleanup.

Note that the creation of the Int k got the first argument and verified that it is an Int.

The basic concept is to wrap Python pointers

The basic concept of CXX_Objects is to create a wrapper around PyObject*s so that the reference counting can be done automatically, thus eliminating the most frequent source of errors. In addition, we can then add methods and operators so that Python manipulations in C++ can look more like Python.

Each Object contains a PyObject* to which it owns a reference. (If you don’t know what this phrase means, it is explained in the Python extension manual. You don’t actually need to understand it very well if you are going to use CXX_Objects. When an Object is destroyed, it releases its ownership on the pointer. Since C++ calls the destructors on objects that are about to go out of scope, we are guaranteed that we will keep the reference counts right even if we unexpectedly leave a routine with an exception.

As a matter of philosophy, CXX_Objects prevents the creation of instances of its classes unless the instance will be a valid instance of its class. When an attempt is made to create an object that will not be valid, an exception is thrown.

Class Object represents the most general kind of Python object. The rest of the classes that represent Python objects inherit from it.

Object
    Type
    Int
    Float
    Long
    Sequence
        String
        Tuple
        List
    Mapping
        Dict
    Callable

There are several constructors for each of these classes. For example, you can create an Int from an integer as in

Int s(3)

However, you can also create an instance of one of these classes using any PyObject* or another Object. If the corresponding Python object does not actually have the type desired, an exception is thrown. This is accomplished as follows. Class Object defines a virtual function accepts:

virtual bool accepts(PyObject* p)

The base class version of accepts returns true for any pointer p except 0. This means we can create an Object using any PyObject*, or from any other Object. However, if we attempt to create an Int from a PyObject*, the overridding version of accepts in class Int will only accept pointers that correspond to Python ints. Therefore if we have a Tuple t and we wish to get the first element and be sure it is an Int, we do

Int first_element = t[0]

This will not only accomplish the goal of extracting the first element of the Tuple t, but it will ensure that the result is an Int. If not, an exception is thrown. (The exception mechanism is discussed later.)

Often, PyObject* pointers are acquired from some function, particularly functions in the Python API. If you wish to make an object from the pointer returned by such a function, you need to know if the function returns you an owned or unowned reference. If it is an owned reference, you indicate this by enclosing it in the constructor for a helper class named FromAPI. For example, the routine PyString_FromString returns an owned reference to a Python string object. You could write:

Object w = FromAPI(PyString_FromString("my string"));

FromAPI is a simple helper class that does not increment the reference count in the constructor but decrements it in the destructor. In fact, you probably would never do this, since CXX has a class String and you can just say:

String w("my string")

Indeed, since most of the Python C API is similarly embodied in Object and its descendents, you probably will not use FromAPI all that often.

Class Object

Class Object serves as the base class for the other classes. Its default constructor constructs a Py_None, the unique object of Python type None. The interface to Object consists of a large number of methods corresponding to the operations that are defined for every Python object. In each case, the methods thow an exception if anything goes wrong.There is no method corresponding to PyObject_SetItem with an arbitrary Python object as a key. Instead, create an instance of a more specific child of Object and use the appropriate facilities.

The comparison operators use the Python comparison function to compare values. The method "is" is available to test for absolute identity.

A conversion to standard library string type std::string is supplied using method "as_string". Stream output of Objects uses this conversion, which in turn uses the Python object's str() representation.

All the numeric operators are defined on all possible combinations of Object, long, and double. These use the corresponding Python operators, and should the operation fail for some reason, an exception is thrown.

Table 1: Class Object

Returns	Name(signature)	Comment
Basic Methods
explicit	Object (PyObject* pyob=Py_None)	Construct from pointer. Acquires an owned reference.
explicit	Object (const Object& ob)	Copycons; acquires an owned reference.
Object&	operator= (const Object& rhs)	Acquires an owned reference.
Object&	operator= (PyObject* rhsp)	Acquires an owned reference.
virtual	~Object ()	Releases the reference.
void	increment_reference_count()	Explicitly increment the count
void	decrement_reference_count()	Explicitly decrement count but not to zero
PyObject*	operator* () const	Lends the pointer
PyObject*	ptr () const	Lends the pointer
virtual bool	accepts (PyObject *pyob) const	Would assignment of pyob to this object succeed?
std::string	as_string() const	str() representation
Python API Interface
int	reference_count () const	reference count
Type	type () const	associated type object
String	str () const	str() representation
String	epr () const	repr () representation
bool	hasAttr (const std::string& s) const	hasattr(this, s)
Object	getAttr (const std::string& s) const	getattr(this, s)
Object	getItem (const Object& key) const	getitem(this, key)
long	hashValue () const	hash(this)
void	setAttr (const std::string& s, const Object& value)	this.s = value
void	delAttr (const std::string& s)	del this.s
void	delItem (const Object& key)	del this[key]
bool	isCallable () const	does this have callable behavior?
bool	isList () const	is this a Python list?
bool	isMapping () const	does this have mapping behaviors?
bool	isNumeric () const	does this have numeric behaviors?
bool	isSequence () const	does this have sequence behaviors?
bool	isTrue () const	is this true in the Python sense?
bool	isType (const Type& t) const	is type(this) == t?
bool	isTuple() const	is this a Python tuple?
bool	isString() const	is this a Python string?
bool	isDict() const	is this a Python dictionary?
Comparison Operators
bool	is(PyObject* pother) const	test for identity
bool	is(const Object& other) const	test for identity
bool	operator==(const Object& o2) const	Comparisons use Python cmp
bool	operator!=(const Object& o2) const	Comparisons use Python cmp
bool	operator>=(const Object& o2) const	Comparisons use Python cmp
bool	operator<=(const Object& o2) const	Comparisons use Python cmp
bool	operator<(const Object& o2) const	Comparisons use Python cmp
bool	operator>(const Object& o2) const	Comparisons use Python cmp

The Basic Types

Corresponding to each of the basic Python types is a class that inherits from Object. Here are the interfaces for those types. Each of them inherits from Object and therefore has all of the inherited methods listed for Object. Where a virtual function is overridden in a class, the name is underlined.

Class Type

Class Type corresponds to Python type objects. There is no default constructor.

Table 2: class Type

Returns	Name and Signature	Comments
explicit	Type (PyObject* pyob)	Constructor
explicit	Type (const Object& ob)	Constructor
explicit	Type(const Type& t)	Copycons
Type&	operator= (const Object& rhs)	Assignment
Type&	operator= (PyObject* rhsp)	Assignment
virtual bool	accepts (PyObject *pyob) const	Uses PyType_Check

Class Int

Class Int, derived publically from Object, corresponds to Python ints. Note that the latter correspond to C long ints. Class Int has an implicit user-defined conversion to long int. All constructors, on the other hand, are explicit. The default constructor creates a Python int zero.

Table 3: class Int

Returns	Name and Signature	Comments
explicit	Int (PyObject *pyob)	Constructor
explicit	Int (const Int& ob)	Constructor
explicit	Int (long v = 0L)	Construct from long
explicit	Int (int v)	Contruct from int
explicit	Int (const Object& ob)	Copycons
Int&	operator= (const Object& rhs)	Assignment
Int&	operator= (PyObject* rhsp)	Assignment
virtual bool	accepts (PyObject *pyob) const	Based on PyInt_Check
long	operator long() const	Implicit conversion to long int
Int&	operator= (int v)	Assign from int
Int&	operator= (long v)	Assign from long

Class Long

Class Long, derived publically from Object, corresponds to Python type long. In Python, a long is an integer type of unlimited size, and is usually used for applications such as cryptography, not as a normal integer. Implicit conversions to both double and long are provided, although the latter may of course fail if the number is actually too big. All constructors are explicit. The default constructor produces a Python long zero.

Table 4: Class Long

Returns	Name and Signature	Comments
explicit	Long (PyObject *pyob)	Constructor
explicit	Long (const Int& ob)	Constructor
explicit	Long (long v = 0L)	Construct from long
explicit	Long (int v)	Contruct from int
explicit	Long (const Object& ob)	Copycons
Long&	operator= (const Object& rhs)	Assignment
Long&	operator= (PyObject* rhsp)	Assignment
virtual bool	accepts (PyObject *pyob) const	Based on PyLong_Check
double	operator double() const	Implicit conversion to double
long	operator long() const	Implicit conversion to long
Long&	operator= (int v)	Assign from int
Long&	operator= (long v)	Assign from long

Class Float

Class Float corresponds to Python floats, which in turn correspond to C double. The default constructor produces the Python float 0.0.

Table 5: Class Float

Returns	Name and Signature	Comments
explicit	Float (PyObject *pyob)	Constructor
	Float (const Float& f)	Construct from float
explicit	Float (double v=0.0)	Construct from double
explicit	Float (const Object& ob)	Copycons
Float&	operator= (const Object& rhs)	Assignment
Float&	operator= (PyObject* rhsp)	Assignment
virtual bool	accepts (PyObject *pyob) const	Based on PyFloat_Check
double	operator double () const	Implicit conversion to double
Float&	operator= (double v)	Assign from double
Float&	operator= (int v)	Assign from int
Float&	operator= (long v)	Assign from long
Float&	operator= (const Int& iob)	Assign from Int

Sequences

CXX implements a quite sophisticated wrapper class for Python sequences. While every effort has been made to disguise the sophistication, it may pop up in the form of obscure compiler error messages, so in this documentation we will first detail normal usage and then discuss what is under the hood.

The basic idea is that we would like the subscript operator [] to work properly, and to be able to use STL-style iterators and STL algorithms across the elements of the sequence.

Sequences are implemented in terms of a templated base class, SeqBase<T>. The parameter T is the answer to the question, sequence of what? For Lists, for example, T is Object, because the most specific thing we know about an element of a List is simply that it is an Object. (Class List is defined below; it is a descendent of Object that holds a pointer to a Python list). For strings, T is Char, which is a wrapper in turn of Python strings whose length is one.

For convenience, the word Sequence is typedef'd to SeqBase<Object>.

General sequences

Suppose you are writing an extension module method that expects the first argument to be any kind of Python sequence, and you wish to return the length of that sequence. You might write:

static PyObject*
my_module_seqlen (PyObject *self, PyObject* args) {
    try {
        Tuple t(args); // set up a Tuple pointing to the arguments.
        if(t.length() != 1) throw PyException("Incorrect number of arguments to seqlen.");
        Sequence s = t[0]; // get argument and be sure it is a sequence
        return new_reference_to(Int(s.length()));
    }
    catch(const PyException&) {
        return Py_Null;
    }
}

As we will explain later, the try/catch structure converts any errors, such as the first argument not being a sequence, into a Python exception.

Subscripting

When a sequence is subscripted, the value returned is a special kind of object which serves as a proxy object. The general idea of proxy objects is discussed in Scott Meyers' book, "More Effective C++". Proxy objects are necessary because when one subscripts a sequence it is not clear whether the value is to be used or the location assigned to. Our proxy object is even more complicated than normal because a sequence reference such as s[i] is not a direct reference to the i'th object of s.

In normal use, you are not supposed to notice this magic going on behind your back. You write:

Object t;
Sequence s;
s[2] = t + s[1]

and here is what happens: s[1] returns a proxy object. Since there is no addition operator in Object that takes a proxy as an argument, the compiler decides to invoke an automatic conversion of the proxy to an Object, which returns the desired component of s. The addition takes place, and then there is an assignment operator in the proxy class created by the s[2], and that assignment operator stuffs the result into the 2 component of s.

It is possible to fool this mechanism and end up with a compiler failing to admit that a s[i] is an Object. If that happens, you can work around it by writing Object(s[i]), which makes the desired implicit conversion, explicit.

Iterators

Each sequence class provides the following interface. The class seqref<T> is the proxy class. We omit the details of the iterator, const_iterator, and seqref<T> here. See CXX_Objects.h if necessary. The purpose of most of this interface is to satisfy requirements of the STL.

The SeqBase<T> Interface

SeqBase<T> inherits from Object.

Type			Name
typedef int			size_type
typedef seqref<T>			reference
typedef T			const_reference
typedef seqref<T>*			pointer
typedef int			difference_type
virtual size_type			max_size() const
virtual size_type			capacity() const;
virtual void			swap(SeqBase<T>& c);
virtual size_type			size () const;
explicit			SeqBase<T> ();
explicit			SeqBase<T> (PyObject* pyob);
explicit			SeqBase<T> (const Object& ob);
SeqBase<T>&			operator= (const Object& rhs);
SeqBase<T>&			operator= (PyObject* rhsp);
virtual bool			accepts (PyObject *pyob) const;
size_type			length () const ;
const T			operator[](size_type index) const;
seqref<T>			operator[](size_type index);
virtual T		getItem (size_type i) const;
virtual void	setItem (size_type i, const T& ob);
SeqBase<T>		repeat (int count) const;
SeqBase<T>		concat (const SeqBase<T>& other) const ;
const T		front () const;
seqref<T>		front();
const T		back () const;
seqref<T>		back();
void		verify_length(size_type required_size);
void		verify_length(size_type min_size, size_type max_size);
class		iterator;
iterator		begin ();
iterator		end ();
class		const_iterator;
const_iterator		begin () const;
const_iterator		end () const;

Any heir of class Object that has a sequence behavior should inherit from class SeqBase<T>, where T is specified as the type of object that represents the individual elements of the sequence. The requirements on T are that it has a constructor that takes a PyObject* as an argument, that it has a default constructor, a copy constructor, and an assignment operator. In short, any properly defined heir of Object will work.

Classes Char and String

Python strings are unusual in that they are immutable sequences of characters. However, there is no character type per se; rather, when subscripted strings return a string of length one. To simulate this, we define two classes Char and String. The Char class represents a Python string object of length one. The String class represents a Python string, and its elements make up a sequence of Char's.

The user interface for Char is limited. Unlike String, for example, it is not a sequence.

The Char interface

Char inherits from Object.

Type	Name
explicit	Char (PyObject *pyob)
	Char (const Object& ob)
	Char (const std::string& v = "")
	Char (char v)
Char&	operator= (const std::string& v)
Char&	operator= (char v)
	operator String() const
	operator std::string () const

The String Interface

String inherits from SeqBase<Char>.

Type	Name
explicit	String (PyObject *pyob)
	String (const Object& ob)
	String (const std::string& v = "")
	String (const std::string& v, std::string::size_type vsize)
	String (const char* v)
String&	operator= (const std::string& v)
	operator std::string () const

Class Tuple

Class Tuple represents Python tuples. A Tuple is a Sequence. There are two kinds of constructors: one takes a PyObject* as usual, the other takes an integer number as an argument and returns a Tuple of that length, each component initialized to Py_None. The default constructor produces an empty Tuple.

Tuples are not immutable, but attempts to assign to their components will fail if the reference count is not 1. That is, it is safe to set the elements of a Tuple you have just made, but not thereafter.

Example: create a Tuple containing (1, 2, 4)

Tuple t(3)
t[0] = Int(1)
t[1] = Int(2)
t[2] = Int(4)

Example: create a Tuple from a list:

Dict d
...
Tuple t(d.keys())

The Tuple Interface

Tuple inherits from Sequence.. Special run-time checks prevent modification if the reference count is greater than one.

Type	Name	Comment
virtual void	setItem (int offset, const Object&ob)	setItem is overriden to handle tuples properly.
explicit	Tuple (PyObject *pyob)
	Tuple (const Object& ob)
explicit	Tuple (int size = 0)	Create a tuple of the given size. Items initialize to Py_None. Default is an empty tuple.
explicit	Tuple (const Sequence& s)	Create a tuple from any sequence.
Tuple&	operator= (const Object& rhs)
Tuple&	operator= (PyObject* rhsp)
Tuple	getSlice (int i, int j) const	Equivalent to python's t[i:j]

Class List

Class List represents a Python list, and the methods available faithfully reproduce the Python API for lists. A List is a Sequence.

The List Interface

List inherits from Sequence.

Type	Name	Comment
explicit	List (PyObject *pyob)
	List (const Object& ob)
	List (int size = 0)	Create a list of the given size. Items initialized to Py_None. Default is an empty list.
	List (const Sequence& s)	Create a list from any sequence.
List&	operator= (const Object& rhs)
List&	operator= (PyObject* rhsp)
List	getSlice (int i, int j) const
void	setSlice (int i, int j, const Object& v)
void	append (const Object& ob)
void	insert (int i, const Object& ob)
void	sort ()	Sorts the list in place, using Python's member function. You can also use the STL sort function on any List instance.
void	reverse ()	Reverses the list in place, using Python's member function.

Mappings

A class MapBase<T> is used as the base class for Python objects with a mapping behavior. The key behavior of this class is the ability to set and use items by subscripting with strings. A proxy class mapref<T> is defined to produce the correct behavior for both use and assignment.

For convenience, Mapping is typedefed as MapBase<Object>.

The MapBase<T> interface

MapBase<T> inherits from Object. T should be chosen to reflect the kind of element returned by the mapping.

Type	Name	Comment
T	operator[](const std::string& key) const
mapref<T>	operator[](const std::string& key)
int	length () const	Number of entries.
int	hasKey (const std::string& s) const	Is m[s] defined?
T	getItem (const std::string& s) const	m[s]
virtual void	setItem (const std::string& s, const Object& ob)	m[s] = ob
void	delItem (const std::string& s)	del m[s]
void	delItem (const Object& s)
List	keys () const	A list of the keys.
List	values () const	A list of the values.
List	items () const	Each item is a key-value pair.

Class Dict

Class Dict represents Python dictionarys. A Dict is a Mapping. Assignment to subscripts can be used to set the components.

Dict d
d["Paul Dubois"] = "(925)-422-5426"

Interface for Class Dict

Dict inherits from MapBase<Object>.

Type	Name	Comment
explicit	Dict (PyObject *pyob)
	Dict (const Dict& ob)
	Dict ()	Creates an empty dictionary
Dict&	operator= (const Object& rhs)
Dict&	operator= (PyObject* rhsp)

Other classes and facilities.

Class Callable provides an interface to those Python objects that support a call method. Class Module holds a pointer to a module. (If you want to create an extension module, however, see the extension facility). There is a large set of numeric operators.

Interface to class Callable

Type	Name	Comment
explicit	Callable (PyObject *pyob)
Callable&	operator= (const Object& rhs)
Callable&	operator= (PyObject* rhsp)
Object	apply(const Tuple& args) const	Call the object with the given arguments
Object	apply(PyObject* args = 0) const	Call the object with args as the arguments

Interface to class Module

Type	Name	Comment
	Module (const Module& ob)	Copy constructor
Module&	operator= (const Object& rhs)	Assignment
Module&	operator= (PyObject* rhsp)	Assignment

Numeric interface

Unary operators for plus and minus, and binary operators +, -, *, /, and % are defined for pairs of objects and for objects with scalar integers or doubles (in either order). Functions abs(ob) and coerce(o1, o2) are also defined.

The signature for coerce is:

inline std::pair<Object,Object> coerce(const Object& a, const Object& b)

Unlike the C API function, this simply returns the pair after coercion.

Stream I/O

Any object can be printed using stream I/O, using std::ostream& operator<< (std::ostream& os, const Object& ob). The object's str() representation is converted to a standard string which is passed to std::ostream& operator<< (std::ostream& os, const std::string&).

Exceptions

The Python exception facility and the C++ exception facility can be merged via the use of try/catch blocks in the bodies of extension objects and module functions.

Class Exception and its children

A set of classes is provided. Each is derived from class Exception, and represents a particular sort of Python exception, such as IndexError, RuntimeError, ValueError. Each of them (other than Exception) has a constructor which takes an explanatory string as an argument, and is used in a throw statement such as:

throw IndexError("Index too large in MyObject access.");

If in using a routine from the Python API, you discover that it has returned a NULL indicating an error, then Python has already set the error message. In that case you merely throw Exception.

List of Exceptions

Type	Interface for class Exception
explicit	Exception ()
	Exception (const std::string& reason)
	Exception (PyObject* exception, const std::string& reason)
void	clear()
	Constructors for other children of class Exception
	TypeError (const std::string& reason)
	IndexError (const std::string& reason)
	AttributeError (const std::string& reason)
	NameError (const std::string& reason)
	RuntimeError (const std::string& reason)
	SystemError (const std::string& reason)
	KeyError (const std::string& reason)
	ValueError (const std::string& reason)
	OverflowError (const std::string& reason)
	ZeroDivisionError (const std::string& reason)
	MemoryError (const std::string& reason)
	SystemExit (const std::string& reason)

Using Exceptions in extension methods

The exception facility allows you to integrate the C++ and Python exception mechanisms. To do this, you must use the style described below when writing module methods.

Catching Exceptions from the Python API or CXX.

When writing an extension module method, you can use the following boilerplate. Any exceptions caused by the Python API or CXX itself will be converted into a Python exception. Note that Exception is the most general of the exceptions listed above, and therefore this one catch clause will serve to catch all of them. You may wish to catch other exceptions, not in the Exception family, in the same way. If so, you need to make sure you set the error in Python before returning.

static PyObject *
some_module_method(PyObject* self, PyObject* args)
{
    Tuple a(args); // we know args is a Tuple
    try {
        ...calculate something from a...
        return ...something, usually of the form new_reference_to(some Object);
    }
    catch(const Exception&) {
        //Exception caught, passing it on to Python
        return Null ();
    }
}

How to clear an Exception

If you anticipate that an Exception may be thrown and wish to recover from it, change the catch phrase to set a reference to an Exception, and use the method clear() from class Exception to clear it.:

catch(Exception& e) {
e.clear();
...now decide what to do about it...
}

Extension Facilities

CXX_Extensions.h provides facilities for:

Creating a Python extension module
Creating new Python extension types

These facilities use CXX_Objects.h and its support file cxxmodule.cxx, but not vice-versa. Therefore, the decision about whether or not to use this part of CXX is up to you.

Creating an Python extension module

The usual method of creating a Python extension module is to declare and initialize its method table in C. This requires knowledge of the correct form for the table and the order in which entries are to be made into it, and requires casts to get everything to compile without warning. CXX's header file PyExtensions.h offers a simpler method. Here is a sample usage, in which a module named "example" is created. Note that two details are necessary:

The initialization function must be declared to have external C linkage.
The ExtensionModule object must have a storage class that survives the call to the initialization function. This is most easily accomplished by using a static local inside the initialization function.

In this example, two module methods ex_sum and ex_test have already been defined, with the usual signature:
PyObject* method_name (PyObject*, PyObject*).

extern "C" void initexample();

void initexample()
{
    static ExtensionModule example("example");
    example.add("sum", ex_sum, "sum(arglist) = sum of arguments");
    example.add("test", ex_test, "test(arglist) runs a test suite");
    Dict d = example.initialize();

   // Now we can add to the module dictionary if we wish...

    Float pi(3.14159);
    d["pi"] = pi;
}

Interface to class ExtensionModule

Type	Name	Comment
	ExtensionModule (char* name)	Create an extension module named "name"
virtual	~ExtensionModule ()	Destructor
void	add( const char* method_name, PyCFunction f, const char* doc="", int flag=1 )	Add a method to the module.
Dict	initialize()	Initialize the module once all methods have been added. Returns the module dictionary.

Creating a Python extension type

One of the great things about Python is the way you can create your own object types and have Python welcome them as first-class citizens. Unfortunately, part of the way you have to do this is not great. Key to the process is the creation of a Python "type object". All instances of this type must share a reference to this one unique type object. The type object itself has a multitude of "slots" into which the addresses of functions can be added in order to give the object the desired behavior.

CXX mitigates this difficulty with class PythonExtension. PythonExtension is a templated class, and the way you use it is very odd indeed: you make your new object type inherit from it, giving itself as the template parameter:

class MyObject: public PythonExtension<MyObject> {...}

This causes several good things to happen:

MyObject is a child of PyObject, so that a MyObject* is-a PyObject*.
A static method check(PyObject*) is created in class MyObject. This function returns a boolean, testing whether or not the argument is in fact a pointer to an instance of MyObject.
An (empty) table of object methods is created (MyObject::methods). The user can add object methods to this table.
A PythonType object is created (MyObject::behaviors). The required Python type object is hidden inside this wrapper object. The wrapper object has methods to set the desired behaviors.
A method is created to handle the deletion of an instance if and when its reference count goes to zero. This method ensures the calling of the class destructor ~MyObject(), if any, and releases the memory (see below).
Both automatic and heap-based instances of MyObject can be created.

Sample usage of PythonExtension

This example is given in full in the Demo directory. See files r.h and r.cxx. The "r" object has three public data members and a number of methods. The implementations are omitted here.

class r: public PythonExtension<r> {
public:
    long start;
    long stop;
    long step;

    r(long start_, long stop_, long step_ = 1L) ;
    ~r();
    long length() const ;
    long item(int i) const ;
    r* slice(int i, int j) const;
    r* extend(int k) const;
    std::string asString() const ;
    Object value(const Tuple&) const;
    void assign(const Tuple&, const Object& rhs) ;
};

Some work by Geoffrey Furnish of LANL may obviate the next step, but for now one must create "glue" routines to connect the methods of the class to Python. Again, see the demo for the full text of the functions; we show a few here for flavor.

#include "r.h"
// Connect r objects to Python

static PyObject *
r_repr(PyObject* arg)
{
r* robj = static_cast<r*>( arg);
return new_reference_to(String(robj->asString()));
}

static int
r_length(PyObject* arg)
{
r* robj = static_cast<r*>( arg);
return robj->length();
}

... other behavior methods

// "regular" methods...
static PyObject *
r_amethod(PyObject* self, PyObject* args)
{
Tuple t(args);
...
return new_reference_to(result);
}

Finally, we must write (and call from some module's initialization routine) this initialization function. In this initialization function we set the desired behaviors and methods of the object.

Note that we do not and must not set our own dealloc behavior. Class PythonExtension has already done so.

void init_rtype () {
    r::behaviors().name("r");
    r::behaviors().doc("r objects: start, stop, step");
    r::behaviors().repr(r_repr);
    r::behaviors().getattr(r_getattr);
    r::behaviors().sequence_length(r_length);
    r::behaviors().sequence_item(r_item);
    r::behaviors().sequence_slice(r_slice);
    r::behaviors().sequence_concat(r_concat);
    r::methods().add("amethod", r_amethod);
    r::methods().add("assign", r_assign);
    r::methods().add("value", r_value);
}

To complement this, we add the method r_new to a module so that objects of type "r" can be created from Python. Note how it simply uses the creation function from class r in a standard way:

PyObject*
r_new (PyObject* self, PyObject* args) {
    try {
        Tuple rargs(args);
rargs.verify_length(2, 3); // make sure there are 2 or 3 arguments
        Int start(rargs[0]);
        Int stop(rargs[1]);
        Int step(1);
        if (rargs.length() == 3) {
            step = rargs[2];
        }
        if (long(start) > long(stop) + 1 || long(step) == 0) {
            throw RuntimeError("Bad arguments to r(start,stop [,step]).");
        }
        return new r(start, stop, step);
    }
    catch(const Exception&) {
        return Null ();
    }
}

Notes on memory management and extension objects

Normal Python objects exist only on the heap. That is unfortunate, as object creation and destruction can be relatively expensive. Class PythonExtension allows creation of both local and heap-based objects.

If an extension object is created using operator new, as in:

r* my_r_ref = new r(1, 20, 3)

then the entity my_r_ref can be thought of as "owning" the reference created in the new object. Thus, the object will never have a reference count of zero. If the creator wishes to delete this object, they should either make sure the reference count is 1 and then do delete my_r_ref, or decrement the reference with Py_DECREF(my_r_ref).

Should my_r_ref give up ownership by being used in an Object constructor, all will still be well. When the Object goes out of scope its destructor will be called, and that will decrement the reference count, which in turn will trigger the special dealloc routine that calls the destructor and deletes the pointer.

If the object is created with automatic scope, as in:

r my_r(1, 20, 3)

then my_r can be thought of as owning the reference, and when my_r goes out of scope the object will be destroyed. Of course, care must be taken not to have kept any permanent reference to this object. Fortunately, in the case of an exception, the C++ exception facility will call the destructor of my_r. Naturally, care must be taken not to end up with a dangling reference, but such objects can be created and destroyed more efficiently than heap-based PyObjects.

Putting it all together

The Demo directory of the distribution contains an extensive example of how to use many of the facilities in CXX. It also serves as a test routine. This test is not completely exhaustive but does excercise much of the facility.

Acknowledgment

Thank you to Geoffrey Furnish for patiently teaching me the finer points of C++ and its template facility, and his critique of CXX in particular.