June 21, 2014 » Shame, 538

The point that spatial data visualizations can easily introduce bias is well rehearsed. (The first time it was made to me was in a middle school algebra textbook.) The basic problem is that, if you are representing the difference between two quantities, only one dimension of your data visualization should covary with the data. So you can have a vertical bar graph, where the width of rectan...

May 16, 2014 » DOIs come to GitHub

In January, I read an article entitled “3 simple things GitHub can do for science.” The title is pretty self-explanatory; it proposes three concrete steps GitHub can take to increase its utility for scientific research. “It’s nice to dream,” I thought. On Wednesday, I read that GitHub has actually (mostly) implemented one of the three recommendations. On the occasion of this nice surprise, I’d like to reflect on what’s happened, why it’s nice, and what more ther...

April 16, 2014 » Algorithmic drawing of isoglosses

I’ve been sitting in on Bill Labov’s seminar on dialect geography this semester....

February 18, 2014 » Tooltips in ggplot

R has few adequate facilities for interactive visualization. This table presents an overview of many of them. Joe Fruehwald has had...

February 13, 2014 » First post

Welcome to my blog. I hope to use this space to write about topics related to theoretical and corpus linguistics, as well as scientific computi...

