Table Implementation Details#

This page provides a brief overview of the Table class implementation, in particular highlighting the internal data storage architecture. This is aimed at developers and/or users who are interested in optimal use of the Table class.

The image below illustrates the basic architecture of the Table class. The fundamental data container is an ordered dictionary of individual column objects maintained as the columns attribute. It is via this container that columns are managed and accessed.

Each Column (or MaskedColumn) object is an numpy.ndarray (or numpy.ma.MaskedArray) subclass and is the sole owner of its data. Maintaining the table as separate columns simplifies table management considerably. It also makes operations like adding or removing columns much faster in comparison to implementations using a numpy structured array container.

As shown below, a Row object corresponds to a single row in the table. The Row object does not create a view of the full row at any point. Instead it manages access (e.g., row['a']) dynamically by referencing the appropriate elements of the parent table.

In some cases it is desirable to have a static copy of the full row. This is available via the as_void() method, which creates and returns a numpy.void or numpy.ma.mvoid object with a copy of the original data.