Package 'cherryblossom'

Title: Cherry Blossom Run Race Results
Description: Race results of the Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.
Authors: Mine Çetinkaya-Rundel [aut, cre] , David Diez [aut]
Maintainer: Mine Çetinkaya-Rundel <[email protected]>
License: GPL-3
Version: 0.1.0.9000
Built: 2024-11-12 03:39:56 UTC
Source: https://github.com/openintrostat/cherryblossom

Help Index


Cherry Blossom Run data, 2009

Description

Details for all 14,974 runners in the 2009 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.

Usage

run09

Format

A data frame with 14,974 observations on the following 14 variables.

place

Finishing position. Separate positions are provided for each gender.

time

The total run time.

net_time

The run time from the start line to the finish line.

pace

Average time per mile, in minutes.

age

Age.

gender

Gender.

first

First name.

last

Last name.

city

Hometown city.

state

Hometown state.

country

Hometown country.

div

Running division (age group).

div_place

Division place, also broken up by gender.

div_tot

Total number of people in the division (again, also split by gender).

Source

Cherry Blossom Race Results

Examples

library(ggplot2)

# Finishing times by gender
ggplot(run09, aes(x = time, y = gender)) +
  geom_boxplot() +
  labs(
    title = "Finishing times for 2009 Cherry Blossom Run, by gender",
    x = "Time to complete the race, in minutes",
    y = "Gender"
    )

# Pacing times by gender
ggplot(run09, aes(x = pace, y = gender)) +
  geom_boxplot() +
  labs(
    title = "Pacing for 2009 Cherry Blossom Run, by gender",
    x = "Average time per mile, in minutes",
    y = "Gender"
    )

Cherry Blossom Run data, 2012

Description

Details for all 16,924 runners in the 2012 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.

Usage

run12

Format

A data frame with 16,924 observations on the following 9 variables.

place

Finishing position. Separate positions are provided for each gender.

time

The total run time,, in minutes.

pace

Average time per mile, in minutes.

age

Age.

gender

Gender.

location

Hometown city.

state

Hometown state (if from the US) or country.

div_place

Division place, also broken up by gender.

div_tot

Total number of people in the division (again, also split by gender).

Source

Cherry Blossom Race Results

Examples

library(ggplot2)

# Finishing times
ggplot(run12, aes(x = time)) +
  geom_histogram(binwidth = 5) +
  labs(
    title = "Finishing times for 2012 Cherry Blossom Run,",
    x = "Time to complete the race, in minutes",
    y = "Frequency"
    )

# Pacing
ggplot(run12, aes(x = pace)) +
  geom_histogram(binwidth = 0.5) +
  labs(
    title = "Pacing for 2012 Cherry Blossom Run",
    x = "Average time per mile, in minutes",
    y = "Frequency"
    )

Cherry Blossom Run data, 2017

Description

Details for all 19,961 runners in the 2017 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC. Most runners participate in a 10-mile run while a smaller fraction take part in a 5k run or walk.

Usage

run17

Format

A data frame with 19,961 observations on the following 9 variables.

bib

Number on the runner's bib.

name

Name of the runner, with only the initial of their last name.

sex

Gender of the runner.

age

Age of the runner.

city

Home city of the runner.

net_sec

Time to complete the race, after accounting for the staggered starting time, in seconds.

clock_sec

Time to complete the race, ignoring the staggered starting time, in seconds.

pace_sec

Average time per mile, in seconds.

event

The event the racer participated in, either the "10 Mile" race or the "5K".

Details

There was a time limit where all 10 Mile racers had to finish by. Can you figure out what that time is?

Source

Cherry Blossom Race Results

Examples

library(ggplot2)

# Finishing times
ggplot(run17, aes(x = net_sec)) +
  geom_histogram(binwidth = 300) +
  facet_wrap(~event, nrow = 2) +
  labs(
    title = "Finishing times for 2017 Cherry Blossom Run, by event",
    subititle = "After accounting for the staggered starting time",
    x = "Time to complete the race, in seconds",
    y = "Frequency"
    )

# Pacing
ggplot(run17, aes(x = pace_sec)) +
  geom_histogram(binwidth = 100) +
  facet_wrap(~event, nrow = 2, scales = "free_y") +
  labs(
    title = "Pacing for 2017 Cherry Blossom Run, by event",
    x = "Average time per mile, in seconds",
    y = "Frequency"
    )