-
Notifications
You must be signed in to change notification settings - Fork 5
/
2013-10-tsb-workshop-tom.html
184 lines (157 loc) · 6.24 KB
/
2013-10-tsb-workshop-tom.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
---
layout: reveal
title: The (Open) Data Consumer's Checklist
description: Key considerations for users of open data
author: Tom Heath and Ulrich Atz
---
{% include odi_logo.html %}
{% include standard_title.html %}
<style type="text/css">
@import url(https://fonts.googleapis.com/css?family=Quicksand:400,700,400italic,700italic);
@import url(https://fonts.googleapis.com/css?family=Open+Sans:400italic,700italic,400,700);
reveal {
font-family: "Open Sans", sans-serif;
font-size: 36px;
font-weight: 200;
letter-spacing: 0.6em;
color: #333333; }
.reveal h1,
.reveal h2,
.reveal h3,
.reveal h4,
.reveal h5,
.reveal h6 {
margin: 0 0 20px 0;
color: #333333;
font-family: "Quicksand", sans-serif;
line-height: 1.2em;
letter-spacing: -0.08em;
text-transform: uppercase;
text-shadow: none;
}
.reveal.quote-source {
font-size: smaller;
}
</style>
<section id="overview">
<h2>Overview of today</h2>
<ol>
<li>an example data set</li>
<li>the general case</li>
<li>open data certificates (if we have time)</li>
<li>questions</li>
</ol>
</section>
<section id="content">
<p>
<a class="stretch" href="http://data.ordnancesurvey.co.uk/datasets/code-point-open"><img src="2013-10-TSB-workshop/os-code-point.png" style="border:none;"></a><br/>
<!-- <a href="https://certificates.theodi.org/datasets/353/certificates/11801" class="quote-source">https://certificates.theodi.org/datasets/353/certificates/11801</a> -->
</p>
<aside class="notes">
Follow link - Explain License - Format - Coverage - Quality - Release cycle (?)
</aside>
</section>
<section id="handout">
<h3>Considerations for the<br/>general case</h3>
<ol class="fragment">
<li style="color:#D60303">Accessibility</li>
<li style="color:#2254F4">Ownership and licensing</li>
<li style="color:#0DBC37">Form</li>
<li style="color:#FF6700">Quality</li>
<li style="color:#B13198">Support</li>
</ol>
</section>
<section id="handout1">
<h3 style="color:#D60303">Accessibility (1/5)</h3>
<ul class="centred-list">
<li class="fragment"><strong>is the data already available?</strong><br/>
if so, where?</li>
<li class="fragment"><strong>how can you access it?</strong><br/>
dumps? an <em>api</em>?</li>
<li class="fragment"><strong>in what format is the data published?</strong><br/>
csv? xml? json? pdf?!</li>
</ul>
</section>
<section id="handout2">
<h3 style="color:#2254F4">Ownership and licensing (2/5)</h3>
<ul class="centred-list">
<li class="fragment"><strong>who publishes the data?</strong></li>
<li class="fragment"><strong>are they the originator of the data?</strong></li>
<li class="fragment"><strong>under what licence is the data published?</strong></li>
<li class="fragment"><strong>does it contain personal data?</strong></li>
</ul>
</section>
<section id="handout3">
<h3 style="color:#0DBC37">Form (3/5)</h3>
<ul class="centred-list">
<li class="fragment"><strong>prior processing</strong><br/>
is the data in raw or summary form?<br/>
how has it been processed?</li>
<li class="fragment"><strong>form</strong><br/>
shape, granularity, etc.<br/>
how will these affect your analysis/product/application?<br/>
is the form compatible with other data you are using?</li>
<li class="fragment"><strong>transformations</strong><br/>
what syntactic and semantic transformations<br/>will you need to make?</li>
</ul>
</section>
<section id="handout4">
<h3 style="color:#FF6700">Quality (4/5)</h3>
<ul class="centred-list">
<li class="fragment"><strong>currency and regularity</strong><br/>
how current is the data?<br/>
how regularly is it updated?<br/>
for how long will it be published?<br/>
what is the commitment by the publisher?</li>
<li class="fragment"><strong>comprehensibility</strong><br/>
do you understand all the fields and their context?</li>
<li class="fragment"><strong>accuracy</strong><br/>
what do you know about the accuracy of the data?<br/>
how is missing data handled?</li>
</ul>
</section>
<section id="handout5">
<h3 style="color:#B13198">Support (5/5)</h3>
<ul class="centred-list">
<li class="fragment"><strong>(how) is the data set documented?</strong></li>
<li class="fragment"><strong>does the metadata make sense?</strong></li>
<li class="fragment"><strong>is there a place you can report errors in the data?</strong></li>
<li class="fragment"><strong>does the publisher offer support in any way?</strong></li>
</ul>
</section>
<section id="certificate1">
<h2>Open Data Certificates</h2>
<ul class="centred-list">
<li><strong>the first robust quality badge for open data</strong></li>
<li><strong>helps...</strong><br/>
<strong>publishers</strong> certify their data<br/>
<strong>users</strong> find and use it<br/>
<strong>policy makers</strong> benchmark<br/></li>
<li><a href="http://certificates.theodi.org">http://certificates.theodi.org</a></li>
<!-- <li>example: the <a href="https://certificates.theodi.org/datasets/353/certificates/11801">OS Code-Point Data.</a></li> -->
</section>
<section id="certificate2">
<img class="stretch" src="2013-10-TSB-workshop/cert-levels.png" style="border:none; box-shadow:none;"></a>
</section>
<section id="takeaways" data-background="#FF6700">
<h3 style="color:white">Quick, what are the three most important questions?</h3>
<br>
<ol style="color:white">
<li class="fragment">how can you <em>use</em> the data?</li>
<li class="fragment">is the <em>quality</em> sufficient and appropriate?</li>
<li class="fragment">will the data be <em>available</em> in the future?</li>
</ol>
</section>
<section>
<h2>Questions?</h2><br>
<img height="320;" src="brand/odi_dynamic_logo.png" alt="ODI" style="border:none; box-shadow:none;">
<hr>
<p style="text-align:center">Tom Heath and Ulrich Atz · Open Data Institute</p>
<p style="text-align:center; font-size: smaller">
<a href="mailto:[email protected]">[email protected]</a> ·
<a href="https://twitter.com/tommyh">@tommyh</a> |
<a href="mailto:[email protected]">[email protected]</a> ·
<a href="https://twitter.com/statshero">@statshero</a>
</p>
</section>
{% include odi_tech_team.html %}