Title: | Cherry Blossom Run Race Results |
---|---|
Description: | Race results of the Cherry Blossom Run, which is an annual road race that takes place in Washington, DC. |
Authors: | Mine Çetinkaya-Rundel [aut, cre] , David Diez [aut] |
Maintainer: | Mine Çetinkaya-Rundel <[email protected]> |
License: | GPL-3 |
Version: | 0.1.0.9000 |
Built: | 2024-11-12 03:39:56 UTC |
Source: | https://github.com/openintrostat/cherryblossom |
Details for all 14,974 runners in the 2009 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.
run09
run09
A data frame with 14,974 observations on the following 14 variables.
Finishing position. Separate positions are provided for each gender.
The total run time.
The run time from the start line to the finish line.
Average time per mile, in minutes.
Age.
Gender.
First name.
Last name.
Hometown city.
Hometown state.
Hometown country.
Running division (age group).
Division place, also broken up by gender.
Total number of people in the division (again, also split by gender).
library(ggplot2) # Finishing times by gender ggplot(run09, aes(x = time, y = gender)) + geom_boxplot() + labs( title = "Finishing times for 2009 Cherry Blossom Run, by gender", x = "Time to complete the race, in minutes", y = "Gender" ) # Pacing times by gender ggplot(run09, aes(x = pace, y = gender)) + geom_boxplot() + labs( title = "Pacing for 2009 Cherry Blossom Run, by gender", x = "Average time per mile, in minutes", y = "Gender" )
library(ggplot2) # Finishing times by gender ggplot(run09, aes(x = time, y = gender)) + geom_boxplot() + labs( title = "Finishing times for 2009 Cherry Blossom Run, by gender", x = "Time to complete the race, in minutes", y = "Gender" ) # Pacing times by gender ggplot(run09, aes(x = pace, y = gender)) + geom_boxplot() + labs( title = "Pacing for 2009 Cherry Blossom Run, by gender", x = "Average time per mile, in minutes", y = "Gender" )
Details for all 16,924 runners in the 2012 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.
run12
run12
A data frame with 16,924 observations on the following 9 variables.
Finishing position. Separate positions are provided for each gender.
The total run time,, in minutes.
Average time per mile, in minutes.
Age.
Gender.
Hometown city.
Hometown state (if from the US) or country.
Division place, also broken up by gender.
Total number of people in the division (again, also split by gender).
library(ggplot2) # Finishing times ggplot(run12, aes(x = time)) + geom_histogram(binwidth = 5) + labs( title = "Finishing times for 2012 Cherry Blossom Run,", x = "Time to complete the race, in minutes", y = "Frequency" ) # Pacing ggplot(run12, aes(x = pace)) + geom_histogram(binwidth = 0.5) + labs( title = "Pacing for 2012 Cherry Blossom Run", x = "Average time per mile, in minutes", y = "Frequency" )
library(ggplot2) # Finishing times ggplot(run12, aes(x = time)) + geom_histogram(binwidth = 5) + labs( title = "Finishing times for 2012 Cherry Blossom Run,", x = "Time to complete the race, in minutes", y = "Frequency" ) # Pacing ggplot(run12, aes(x = pace)) + geom_histogram(binwidth = 0.5) + labs( title = "Pacing for 2012 Cherry Blossom Run", x = "Average time per mile, in minutes", y = "Frequency" )
Details for all 19,961 runners in the 2017 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC. Most runners participate in a 10-mile run while a smaller fraction take part in a 5k run or walk.
run17
run17
A data frame with 19,961 observations on the following 9 variables.
Number on the runner's bib.
Name of the runner, with only the initial of their last name.
Gender of the runner.
Age of the runner.
Home city of the runner.
Time to complete the race, after accounting for the staggered starting time, in seconds.
Time to complete the race, ignoring the staggered starting time, in seconds.
Average time per mile, in seconds.
The event the racer participated in, either the "10 Mile"
race or the "5K"
.
There was a time limit where all 10 Mile racers had to finish by. Can you figure out what that time is?
library(ggplot2) # Finishing times ggplot(run17, aes(x = net_sec)) + geom_histogram(binwidth = 300) + facet_wrap(~event, nrow = 2) + labs( title = "Finishing times for 2017 Cherry Blossom Run, by event", subititle = "After accounting for the staggered starting time", x = "Time to complete the race, in seconds", y = "Frequency" ) # Pacing ggplot(run17, aes(x = pace_sec)) + geom_histogram(binwidth = 100) + facet_wrap(~event, nrow = 2, scales = "free_y") + labs( title = "Pacing for 2017 Cherry Blossom Run, by event", x = "Average time per mile, in seconds", y = "Frequency" )
library(ggplot2) # Finishing times ggplot(run17, aes(x = net_sec)) + geom_histogram(binwidth = 300) + facet_wrap(~event, nrow = 2) + labs( title = "Finishing times for 2017 Cherry Blossom Run, by event", subititle = "After accounting for the staggered starting time", x = "Time to complete the race, in seconds", y = "Frequency" ) # Pacing ggplot(run17, aes(x = pace_sec)) + geom_histogram(binwidth = 100) + facet_wrap(~event, nrow = 2, scales = "free_y") + labs( title = "Pacing for 2017 Cherry Blossom Run, by event", x = "Average time per mile, in seconds", y = "Frequency" )