This repo contains a bunch of examples for using Cascalog. If you’re not yet familiar with Cascalog, the following resources are highly recommended:
- Nathan Marz: Introducing Cascalog (blog post)
- Nathan Marz: “Cascalog: Making Data Processing Fun Again” (Talk at Clojure/conj 2011)
I use Leiningen 2.0. If you haven’t installed or upgraded yet, follow the Installation Instructions.
Then:
$ git clone git://github.com/sthuebner/cascalog-examples.git
...
$ lein swank
Then head over to Emacs (if you’re not already in it :) ) and
M-x slime-connect RET RET RET
The namespace sthuebner.cascalog.pagecounts
works on data taken from
http://dumps.wikimedia.org/other/pagecounts-raw/