Nike+ is a revolutionary product in personal fitness. In its five years, its motivated millions of people to get off the couch and go for a run. At the completion of their run, the user is given a handful of useful metrics (time, pace, and route) in an effort to improve their next run.
What it doesn't offer, however, is a holistic representation of the data. What does it look like when an entire city goes running (with Nike+)? And how can we use that data to improve the experience of runners within their city?
For Nicholas Felton’s Information Visualization course, we were given a CSV of 1,000 runs of Nike+ data, and two weeks to tell a story with it, which I chose to do through a timelapse video. But during this process, I encountered countless other stories that I wanted to explore once the semester was completed. Over the course of the next summer, I set out to do a complete audit of running in New York City, publishing new visualizations on my blog as they were completed. This project culminated with me working with WIRED UK to do a full spread visualization, using a new dataset of 10,000 runs throughout London, for their October 2011 issue.
TIME OF DAY
I was interested in visualizing not only where these runs were taking place, but also, at what time of day. I decided to create a timelapse video, displaying each individual route at the time of day it took place. Moreover, I elected to use a base map of New York City, but rather, let the running data generate the map as the runners move throughout the city. Paths, streets, and bridges all start to emerge as the day goes on and more runners move through the city. These videos focus on three views - NYC as a whole, as well as two areas where I found there to be the largest concentration of runners, Central Park and the three bridges of the East River.
In hindsight, I feel that these videos, though nice looking, were a bit convoluted in the information presented and what a viewer could extract from them. Additionally, I came to realize that though time of day was interesting, it was just one of many stories that could be told, as demonstrated below. Finally, in the rush of a grad school class, I realized that I made small errors in my data cleaning, such as not accounting for daylight savings time, that I have since corrected.
LOCATION
After countless hours cleaning the data in Google Refine, and even more hours in Processing, I arrived at one of the simplest visualizations, location.
Though this rendering offers very little information about individual runs and the data within them, I love the story that it does tell about location. The GPS data draws its own map of New York City, from the shape of the Manhattan landmass down to its individual streets.
POPULARITY OF ROUTES
Next, I wanted to explore how popular these different routes were. I achieved this by creating a simple heat map of all the runs, examining how many runners shared the same route over the course of this data set.
Not surprisingly, Central Park and the trails along the edge of Manhattan emerge as the most popular, as well as the bridges between Manhattan & Brooklyn.
Downtown Brooklyn, especially along the promenade, also sees a great deal of traffic. Interestingly, it appears that more runners in Central Park tend to come from the Upper East Side, with far less entering the park from the Upper West Side.
DIRECTION OF TRAVEL
In track & field, runners always circle the track in a counter clockwise direction. But out in the world, our routes and directions are far less prescribed.
I was curious to see if there were certain areas where runners all ran similar directions, regardless of no set of rules. If a runner’s position changes towards the south, their path is drawn yellow. If a runner is heading north, their path is blue.
In looking at the Central Park Loops, almost all runners choose to run in a counter-clockwise direction, following track protocol on all loops, regardless of size. We can also see a balance of entrances and exits to most parts of the park, except for one spot on 59th that seems to cater exclusively to runners entering the park.
The Brooklyn Bridge is being used almost exclusively to funnel runners out of Brooklyn, while the Manhattan Bridge is used to cross from Manhattan to Brooklyn. It appears these bridges were named for the runners leaving the borough.
The Williamsburg Bridge, as opposed to its counterparts with their paths down the middle, has two distinct running paths on each side of the bridge. Here we can see, interestingly, that runners seem to favor the side of the bridge going against traffic.