-
Notifications
You must be signed in to change notification settings - Fork 0
Python Tracker
HOME > [SNOWPLOW TECHNICAL DOCUMENTATION](Snowplow technical documentation) > Trackers > Python Tracker
-
- 2.1 Importing the module
- 2.2 Creating a tracker
- 2.2.1
cloudfront()
- 2.2.2
hostname()
- 2.2.1
- 2.3 Creating multiple trackers
-
- 3.1 Configuring your tracker
- 3.1.1
set_platform()
- 3.1.2
set_base64_to()
- 3.1.1
- 3.2 Adding extra data
- 3.2.1
set_app_id()
- 3.2.2
set_user_id()
- 3.2.3
set_screen_resolution()
- 3.2.4
set_color_depth()
- 3.2.1
- 3.1 Configuring your tracker
-
- 4.1 Common
- 4.1.1 Argument validation
- 4.1.2 Optional timestamp argument
- 4.1.3 Return values
- 4.2
track_screen_view()
- 4.3
- 4.4
- 4.5
track_struct_event()
- 4.6
track_unstruct_event()
- 4.6.1 Supported datatypes
- 4.1 Common
The Snowplow Python Tracker allows you to track Snowplow events from your Python apps and games.
The tracker should be straightforward to use if you are comfortable with Python development; any prior experience with Snowplow"s JavaScript Tracker or Lua Tracker, Google Analytics or Mixpanel (which have similar APIs to Snowplow) is helpful but not necessary.
Note that this tracker has access to a more restricted set of Snowplow events than the JavaScript Tracker and covers almost all the events from the Lua Tracker.
## 2 InitializationAssuming you have completed the Python Tracker Setup for your Python project, you are now ready to initialize the Python Tracker.
### 2.1 Importing the moduleRequire the Python Tracker"s module into your Python code like so:
from snowplowtracker import tracker
or
from snowplowtracker.tracker import Tracker
That's it - you are now ready to initialize a tracker instance.
### 2.2 Creating a trackerThere are two different versions of the tracker constructor, depending on which type of collector you want to log events to.
If you are using a Cloudfront collector, use Tracker.cloudfront() to initialize your tracker instance. If you are using any other collector (e.g. the Clojure collector, or SnowCannon), then you should use Tracker.hostname().
#### 2.2.1 Create a tracker logging to Cloudfront with `Tracker.cloudfront()`You can initialize a tracker instance for a Cloudfront collector with:
tracker = Tracker.cloudfront(cf_subdomain)
So if your Cloudfront subdomain is d3rkrsqld9gmqf
, you would write:
tracker = Tracker.cloudfront("d3rkrsqld9gmqf")
This completes the initialization of your tracker instance.
#### 2.2.2 Create a tracker logging to a non-CloudFront collector using `newTrackerForUri()`You can initialize a tracker instance for a non-Cloudfront collector with:
tracker = Tracker.hostname(host)
So if your collector is available at "my-company.c.snplow.com", you would write:
tracker = Tracker.hostname("my-company.c.snplow.com")
This completes the initialization of your tracker instance.
### 2.3 Creating multiple trackersEach tracker instance is completely sandboxed, so you can create multiple trackers as you see fit.
Here is an example of instantiating two separate trackers:
t1 = Tracker.cloudfront("d3rkrsqld9gmqf")
t1.set_platform("cnsl")
t1.track_unstruct_event("save-game", { save_id = 23 }, 1369330092)
t2 = Tracker.hostname("my-company.c.snplow.com")
t2.set_platform("cnsl")
t2.track_screen_view("Game HUD", "23")
t1.track_screen_view("Test", "23") # Back to first tracker
Each tracker instance is initialized with sensible defaults:
- The platform the tracker is running on is set to "pc"
- Property data for unstructured events is sent Base64-encoded
However you can change either of these defaults:
#### 3.1.1 Change the tracker"s platform with `set_platform()`You can change the platform the tracker is running on by calling:
t.set_platform(platform_code)
For example:
t.set_platform("tv") # Running on a Connected TV
For a full list of supported platforms, please see the Snowplow Tracker Protocol.
#### 3.1.2 Disable Base64-encoding with `set_base64_to()`You can set whether or not to Base64-encode property data for unstructured events by calling:
t.set_base64_to( {{True OR False}} )
So to disable it and send the data URI-encoded instead:
t.set_base64_to(False)
You may have additional information about your application"s environment, current user and so on, which you want to send to Snowplow with each event.
The tracker instance has a set of set...()
methods to attach extra data to all tracked events:
We will discuss each of these in turn below:
### 3.2.1 Set application ID with `setAppId()`You can set the application ID to any string:
t.set_app_id( "{{APPLICATION ID}}" )
Example:
t.set_app_id("wow-addon-1")
You can set the user ID to any string:
t.set_user_id( "{{USER ID}}" )
Example:
t.set_user_id("alexd")
If your Python code has access to the device"s screen resolution, then you can pass this in to Snowplow too:
t.set_screen_resolution( {{WIDTH}}, {{HEIGHT}} )
Both numbers should be positive integers; note the order is height followed by width. Example:
t.set_screen_resolution(1366, 768)
If your Python code has access to the bit depth of the device"s color palette for displaying images, then you can pass this in to Snowplow too:
t.set_color_depth( {{BITS PER PIXEL}} )
The number should be a positive integer, in bits per pixel. Example:
t.set_color_depth(32)
Snowplow has been built to enable you to track a wide range of events that occur when users interact with your websites and apps. We are constantly growing the range of functions available in order to capture that data more richly.
Tracking methods supported by the Python Tracker at a glance:
Function | Description |
---|---|
[track_page_view() ] |
Track and record views of web pages. |
[track__ecommerce_transaction() ] |
Track an ecommerce transaction on transcation level. |
[track_ecommerce_transaction_item() ] |
Track an ecommerce transaction on item level. |
track_screen_view() |
Track the user viewing a screen within the application |
track_struct_event() |
Track a Snowplow custom structured event |
track_unstruct_event() |
Track a Snowplow custom unstructured event |
All events are tracked with specific methods on the tracker instance, of the form track_XXX()
, where XXX
is the name of the event to track.
Python is a dynamically typed language, but each of our track...()
methods expects its arguments to be of specific types and value ranges, and validates that to be the case. These checks are done using the PyContracts library.
If the validation check fails, then a runtime error is thrown:
t = Tracker.hostname("localhost")
t.set_color_depth("walrus")
contracts.interface.ContractNotRespected: Breach for argument 'depth' to Tracker:set_color_depth().
Expected type 'int', got 'str'.
checking: Int for value: Instance of str: 'walrus'
checking: $(Int) for value: Instance of str: 'walrus'
checking: int for value: Instance of str: 'walrus'
Variables bound in inner context:
- self: Instance of Tracker: <snowplowtracker.tracker.Tracker object... [clip]
If your value is of the wrong type, convert it before passing it into the track...()
method, for example:
level_idx = 42
t.track_screen_view("Game Level", str(level_idx))
We specify the types and value ranges required for each argument below.
### 4.1.2 Optional timestamp argumentEach track...()
method supports an optional timestamp as its final argument; this allows you to manually override the timestamp attached to this event.
If you do not pass this timestamp in as an argument, then the Python Tracker will use the current time to be the timestamp for the event.
Here is an example tracking a structured event and supplying the optional timestamp argument. Note that we have to explicitly supply None
s for the intervening arguments which are empty:
t.track_struct_event("some cat", "save action", None, None, None, 1368725287)
Timestamp is counted in seconds since the Unix epoch - the same format as generated by time.time()
in Python.
Each track...()
method has the same return signature, returning two values:
return_string = t.track_unstruct_event("save-game", { "save_id" = "4321", "level" = 23 })
Warning: this feature is implemented in the Lua and Python tracker, but it is not currently supported in the Enrichment, Storage or Analytics stages in the Snowplow data pipeline. As a result, if you use this feature, you will log screen views to your collector logs, but these will not be parsed and loaded into e.g. Redshift to analyse. (Adding this capability is on the roadmap.)
Use track_screen_view()
to track a user viewing a screen (or equivalent) within your app. Arguments are:
Argument | Description | Required? | Validation |
---|---|---|---|
name |
Human-readable name for this screen | Yes | Non-empty string |
id |
Unique identifier for this screen | No | String or None |
tstamp |
When the screen was viewed | No | Positive integer or None |
Example:
t.trackScreenView("HUD > Save Game", "screen23", 1368725287)
Use track_struct_event()
to track a custom event happening in your app which fits the Google Analytics-style structure of having up to five fields (with only the first two required):
Argument | Description | Required? | Validation |
---|---|---|---|
category |
The grouping of structured events which this action belongs to |
Yes | Non-empty string |
action |
Defines the type of user interaction which this event involves | Yes | Non-empty string |
label |
A string to provide additional dimensions to the event data | No | String or None |
property |
A string describing the object or the action performed on it | No | String or None |
value |
A value to provide numerical data about the event | No | Number of None |
tstamp |
When the structured event occurred | No | Positive integer or None |
Example:
t.track_struct_event("shop", "add-to-basket", None, "pcs", 2, 1369330909)
Warning: this feature is implemented in the Python tracker, but it is not currently supported in the Enrichment, Storage or Analytics stages in the Snowplow data pipeline. As a result, if you use this feature, you will log unstructured events to your collector logs, but these will not be parsed and loaded into e.g. Redshift to analyse. (Adding this capability is on the roadmap.)
Use track_unstruct_event()
to track a custom event which consists of a name and an unstructured set of properties. This is useful when:
- You want to track event types which are proprietary/specific to your business (i.e. not already part of Snowplow), or
- You want to track events which have unpredictable or frequently changing properties
The arguments are as follows:
Argument | Description | Required? | Validation |
---|---|---|---|
name |
The name of the event | Yes | Non-empty string |
properties |
The properties of the event | Yes | Non-empty table |
tstamp |
When the screen was viewed | No | Positive integer or None |
Example:
t.track_unstruct_event("save-game", {
"save_id" = "4321",
"level" = 23,
"difficultyLevel" = "HARD",
"dl_content" = true
}, 1369330929 )
The properties table consists of a set of individual name = value
pairs. The structure must be flat: properties cannot be nested. Be careful here as this is not currently enforced through validation.
Snowplow unstructured events support a relatively rich set of datatypes. Because these datatypes do not always map directly onto Python datatypes, we have introduced some "type suffixes" for the Python property names, so that Snowplow knows what Snowplow data types the Python data types map onto:
Snowplow datatype | Description | Python datatype | Type suffix(es) | Supports array? |
---|---|---|---|---|
Null | Absence of a value | N/A | - | No |
String | String of characters | string | - | Yes |
Boolean | True or false | boolean | - | Yes |
Integer | Number without decimal | number | $int |
Yes |
Floating point | Number with decimal | number | $flt |
Yes |
Geo-coordinates | Longitude and latitude | { number, number } | $geo |
Yes |
Date | Date and time (ms precision) | number |
$dt , $tm , $tms
|
Yes |
Array | Array of values | {x, y, z} | - | - |
Let"s go through each of these in turn, providing some examples as we go:
Tracking a Null value for a given field is currently untested in the Python Tracker. TODO.
Tracking a String is easy:
{
"product_id" = "ASO01043"
}
Tracking a Boolean is also straightforward:
{
"trial" = True
}
To track an Integer, use a Python number but add a type suffix like so:
{
"in_stock$int" = 23
}
Warning: if you do not add the $int
type suffix, Snowplow will assume you are tracking a Floating point number.
To track a Floating point number, use a Python number; adding a type suffix is optional:
{
"price$int" = 4.99,
"sales_tax" = 49.99 # Same as sales_tax$flt = ...
}
Tracking a pair of Geographic coordinates is done like so:
{
"check_in$geo" = (40.11041, -88.21337) # Lat, long
}
Please note that the datatype takes the format latitude followed by longitude. That is the same order used by services such as Google Maps.
Warning: if you do not add the $geo
type suffix, then the value will be incorrectly interpreted by Snowplow as an Array of Floating points.
Snowplow Dates include the date and the time, with milliseconds precision. There are three type suffixes supported for tracking a Date:
-
$dt
- the Number of days since the epoch -
$tm
- the Number of seconds since the epoch -
$tms
- the Number of milliseconds since the epoch. This precision is hard to access from within Python
You can track a date by adding a Python number to your properties
object. The following are all valid dates:
{
"birthday2$dt" = 3996,
"registered2$tm" = 1371129610,
"last_action$tms" = 1368454114215, # Accurate to milliseconds
}
Note that the type prefix only indicates how the Python number sent to Snowplow is interpreted - all Snowplow Dates are stored to milliseconds precision (whether or not they include that level of precision).
Two warnings:
- If you specify a Python number but do not add a valid Date suffix (
$dt
,$tm
or$tms
), then the value will be incorrectly interpreted by Snowplow as a Number, not a Date - If you specify a Python number but add the wrong Date suffix, then the Date will be incorrectly interpreted by Snowplow, for example:
{
"last_ping$dt" = 1371129610 # Should have been $tm. Snowplow will interpret this as the year 3756521449
}
Home | About | Project | Setup Guide | Technical Docs | Copyright © 2012-2013 Snowplow Analytics Ltd