-
Notifications
You must be signed in to change notification settings - Fork 2
/
about.Rmd
136 lines (95 loc) · 7.37 KB
/
about.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
---
title: "About This Website"
output:
distill::distill_article:
toc: true
---
These web pages are meant as a supplement to our book,
[A History of Data Visualization and Graphic Communication](https://www.hup.harvard.edu/catalog.php?isbn=9780674975231).
The main goals are to:
* reproduce here more color figures than could be accommodated in the print edition. The selected figures are shown as thumbnails, but can be enlarged with a click. R code for the figures we reproduced are also provided as links in the figure captions.
* provide links to [related papers](related.html) and topics that expand on the themes in our book.
* serve as a forum for comments, errata and other topics that could not be fully covered in this first edition.
## Who needs this book?
The book is aimed at an educated, lay audience: in general anyone who must either convey or receive quantitative information, which at one time or another includes all of us. This includes, but is not limited to,
graphic designers, statisticians, and people in the media. Typical audiences would be readers of the
*NY Times*, *The New Yorker*, *Scientific American*, the *New Republic*, *National Review* and related
publications in other venues.
A more specific market segment is the large community of professionals in the history of science, cartography, data journalism, graphic design, and information visualization, visualization software.
## Front cover
```{r out.width="90%", fig.align='center', echo=FALSE}
knitr::include_graphics("figs-web/P_08-playfair1805-inquiry-crop.jpg")
```
The design of the front cover is based on William Playfair's (1805)
marvelous
_Chart of Universal Commercial History_. In it, he asks:
* How and when did civilizations rise and fall from 1500 BCE to 1800 AD?
He shows ancient and modern civilizations in the form of small graphs
over time, representing some indication of strength of an empire
or civilization, in a way that then can be visually compared to
ask further questions:
* Why were some so long-lived?
* When and why did some of them fail?
Another implicit question is:
* How to visualize a history and tell its story?
This chart, like our book, takes a long view of history, with the necessity
of some degree of abstraction. These questions also provide a structure
for discussing the history of data visualization.
Playfair's chart uses what at the time was a brilliant and
novel graphic form: using compressed little distributions showing the "relative" strength
of each civilization over time in a way thay they could be easily compared.
The colored bands reflect:
* ancient states (<span style="color:darkpink">pink</span>)
* modern states (<span style="color:darkgreen">green</span>)
* America (<span style="color:darkyellow">yellow</span>)
This graphic form was recently re-invented
and called a "ridgeline" plot because it resembles a set of
mountain ridges. This figure appears as Plate 8 in the book
and is discussed in [Chapter 5](ch05-playfair.html), "Charts of History".
## Translations
```{r out.width="10%", fig.align='left', echo=FALSE}
knitr::include_graphics("images/Japanese-cover.jpg")
```
* A [Japanese translation](http://www.seidosha.co.jp/book/index.php?id=3626) was recently published by Seidosha.
* A Chinese translation is in the works.
## Datasets {#datasets}
A unique feature of our book is that we try to appreciate the efforts of key thinkers in the history of data visualization by understanding their data and the graphs they produced. In many cases, we can shed some light on their topics and graphical innovations by trying to reproduce their graphs with modern software.
Toward this end, we created two R packages, used in some of our graphs in the book.
* [HistData](https://cran.r-project.org/package=HistData), a collection of many datasets that are interesting and important in the history of statistics and data visualization. In addition, we use some features from Peter Li's [cholera](https://cran.r-project.org/package=cholera) package for graphs of John Snow's data on the 1854 cholera outbreak in London.
* [Guerry](https://cran.r-project.org/package=Guerry), datasets and maps related to Guerry's (1833)
_La Statistique Morale de la France_.
### R code for figures {#rcode}
This is a list of the [R code](https://cran.r-project.org/) files for our reconstructions of historical graphs. They are also linked in the figure captions on the individual chapter pages.
* [Figure 00_1: Timeline of milestones events, by continent](fig-code/00_1-mileyears4.R)
* [Figure 02_3: Graphical inventions](fig-code/02_3-graphic-forms.R)
* [Figure 03_2: Sex ratio. Arbuthnot's data on the ratio of male to female births](fig-code/03_2-arbuthnot-gg.R)
* [Figure 03_7: Reproduction of Guerry's six maps](fig-code/03_7-guerrymap.R)
* [Figure 03_8: Enhanced scatterplot of crimes against persons vs literacy](fig-code/03_8-guerry-bivar.R)
* [Figure 04_10: Line graph of Nightingale's data](fig-code/04_10-nightingale-line.R)
* [Figure 04_3: Farr's elevation - mortality data as a scatterplot](fig-code/04_3-cholera-elevation-law.R)
* [Figure 04_4: Farr's Inverse Natural Law of Cholera and Elevation](fig-code/04_4-cholera-mod.R)
* [Figure 04_6: Cholera deaths by water supply region](fig-code/04_6-cholera-water.R)
* [Plate 04_P3a: Snow's cholera map with pump neighborhoods](fig-code/04_P3a-cholera-neighborhoods.R)
* [Plate 04_P3b: Snow's cholera map with pump polygons](fig-code/04_P3b-SnowMap-density.R)
* [Figure 05_5: Re-creating Playfair's graph of balance of trade with East Indies](fig-code/05_5-playfair-east-indies.R)
* [Figure 06_1: Modern scatterplot](fig-code/06_1-workers.R)
* [Figure 06_13: Galton's visual argument for regression](fig-code/06_13-galton-peas.R)
* [Figure 06_15: Galton's method for finding contours of equal frequency](fig-code/06_15-galton-ex2.R)
* [Figure 06_18: Hertzsprung-Russel diagrams for stars in the CYG OB1 cluster](fig-code/06_18-hertzsprung.R)
* [Figure 06_21: Chocolate consumption and Nobel prizes](fig-code/06_21-nobel-chocolate.R)
* [Figure 06_22: Wine, IKEA and Nobel prizes](fig-code/06_22-nobel-wine.R)
* [Figure 06_5: Re-drawn version of Playfair's graph](fig-code/06_5-wheat2.R)
* [Figure 06_6: Scatterplot version of Playfair's plot](fig-code/06_6-wheat3.R)
* [Figure 06_9: Reconstruction of Herschel's graph](fig-code/06_9-herschel.R)
* [Plate 06_P9: Ridgeline plot showing increasing polarization of politics](fig-code/06_P9-ridgeline-politics.R)
* [Figure 07_1: Milestones timeline](fig-code/07_1-mileyears3.R)
* [Figure 09_13: Plots of successive pairs of RANDU random numbers](fig-code/09_13-randu1.R)
* [Figure 09_14: RANDU in 3D](fig-code/09_14-randu2.R)
* [Figure 09_15: Diabetes data from Reaven and Miller](fig-code/09_15-diabetes.R)
## Errata {#errata}
Here is a [list of current errata](pdf/Errata.pdf).
If you find other errors in the book, file [an issue on GitHub](https://github.com/friendly/HistDataVis/issues).
## Credits
All figures displayed here are authorized for this limited use. In some cases, the figures shown here differ from those printed in the book, but these are credited and available for this use.
In addition, we use a collection of cartoon images for our chapter pages created by RJ Andrews, [http://infowetrust.com](http://infowetrust.com). We are grateful for permission to use them here.
We thank Matthew Dubins of [Donor Science Consulting](https://www.donorscience.ca) for his help in making this web site possible.