Tabular Data Specification

Reusable DRY specification for spreadsheet exports to use in, e.g. Django projects.

Use

Column Specifications

Django-excel-response makes it easy to make spreadsheets. As you can see from its readme, it likes to get a queryset or a list of lists. If you give it a list of lists, its up to you to make the headers line up with the content. Its up to you to fetch the data and marshall it into lists too. Specifying the column headings and content separately felt too un-DRY.

My data comes from a queryset. I join the related models using select_related(), and fetch out a dictionary of the fields I'm interested in, with their values, using values(). I can give these dictionaries directly to django-excel-response, and it will use the keys as column headings. Sometimes, however, I want to include values derived from other values, or to do post-processing on the values. Somtiems I want better column headings. So I make a column specification:

from colspec import ColumnSpecification, Column

colspec = ColumnSpecification(
    Column('id'),
    Column('business_name', header='Business'),
    Column('user__name', 'user__oname', 'user__sname',
        header='Name',
        reduce=" ".join,
    ),
    Column('price', 'vat',
        header='Full cost',
        reduce=sum,
    )
    Column('status_code',
        header='Status',
        function=status_codes.get,
    ),
    ...,
)

The first column heading is id and that's also the dictionary key of the id field
The second column heading is Business and contains the business_name
The Name column contains the result of joining together three name fields from a related model linked by user field. user__name fetches the name field from the related user object, etc. And " ".join is a function to join strings with spaces, it reduces the list of three name values to one string value.
Similarly the Full cost column is the sum of two numeric values calculated using the sum built-in
Status* column values are strings that correspond to code values stored in the DB. So we use the status_codes dictionary's get() method, as a function argument, to convert them.

So the arguments to Column(...) are:

List of input dictionary keys
optional header. If not present, the name of the first input key is used.
optional reduce is a function to reduce a list of values to one -- if you specified more than one input key.
optional function is a function to transform the result value (the default makes no change). If there is a reduce function, this takes the output of reduce as its single argument. Otherwise it gets passed the list of selected values as expanded kwargs, so you can write:
```
Column('Profit',
    'sales',
    'cost_of_sales',
    function=lambda income, cost: income - cost
)
```

Then ColumnSpecification provides useful methods:

inputs() gives the full list of input keys expected, which you can use as arguments to values() on a queryset:
```
dataset = BusinessPlan.objects.values(*colspec.inputs())
```

related() lists the related models (derived from keys that contain double underscore):

dataset = BusinessPlan.objects\
            .filter(... whatever ...)\
            .select_related(*colspec.related())\
            .values(*colspec.inputs())

values(context_dictionary) is a function that takes the dictionary of values corresponding to one entry in the queryset and returns the list of values to go into the spreadsheet:
```
data_rows = [colspec.values(row) for row in dataset]
```

headers() returns the column headers:

return ExcelResponse(data_rows, headers=colspec.headers())

Testing

Tests use py.test:

$ pip install pytest

$ py.test

ToDo

Allow sort orter to be specified, e.g:

colspec = ColumnSpecification(
    Column('one', ascending=2),
    Column('two'),
    Column('three', descending=1),
)

And then, colsepc.order() would retrn:

['-three', 'one']

So I can say:

dataset = BusinessPlan.objects\
            .filter(... whatever ...)\
            .order_by(colspec.order())\
            .select_related(*colspec.related())\
            .values(*colspec.inputs())

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
colspec		colspec
tests		tests
.gitignore		.gitignore
MANIFEST		MANIFEST
README.rst		README.rst
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tabular Data Specification

Use

Column Specifications

Testing

ToDo

About

Releases

Packages

Languages

aptivate/tabular-data-specification

Folders and files

Latest commit

History

Repository files navigation

Tabular Data Specification

Use

Column Specifications

Testing

ToDo

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages