dragracer: Data Sets for RuPaul’s Drag Race

The dragracer hexlogo

{dragracer} is an R package of data sets for all available seasons of RuPaul’s Drag Race, excluding All Stars. It’s updated at the end of each season. I have a blog post that describes these data in some detail while also showcasing some of the things you can do with the data I provide. I offer this R package for two reasons. First, the fandom for this show is large and there is a discernible subset of the fandom that is interested in data science and the R programming language. I offer this package as a collection of accessible data with which they can experiment. I also offer this as a love letter of a kind to RuPaul’s Drag Race and all the contestants that have appeared on it. We have no choice but to “stan”, especially Jinkx Monsoon. Jinkx is the GOAT. The conspicuously short Season 8 masks Bob’s relative dominance. Sasha Velour is Champaign-Urbana’s greatest export since REO Speedwagon. Kameron Michaels should’ve won Season 10.

Installation

This package is now on CRAN. You should be able to install it as follows.

install.packages("dragracer")

The developmental version may have more goodies. You can install the development version from Github via the {devtools} package. I suppose using the {remotes} package would work as well.

devtools::install_github("svmiller/dragracer")

Usage

The data right now contain three data sets. rpdr_ep is episode-level data for all available seasons. These include observations about when the episode aired, the number of queens in the episode, mini-challenge winners, who appeared in the bottom for a given episode, and who was sent home. rpdr_contestants is a data frame of contestant-level information, including the contestant’s age, purported date of birth, hometown, and how they fared across all episodes in their particular season. rpdr_contep is episode-contestant-level data about how each contestant fared in a particular episode in which they were. You can read more through this command.

data(package = "dragracer")

The ensuing output will look like this.

Object Name Title/Description
rpdr_contep RuPaul’s Drag Race Episode-Contestant Data
rpdr_contestants RuPaul’s Drag Race Contestant Data
rpdr_ep RuPaul’s Drag Race Episode Data

Here is a simple scraping job to provide more information (by way of the description field in the associated R Documentation file).

Object Name Description
rpdr_contep These are episode-contestant-level data for RuPaul’s Drag Race for all available seasons (currently through Season 14). For a given season, observations mostly decrease with each episode. Data include all sorts of information.
rpdr_contestants These are contestant-level data for RuPaul’s Drag Race for all available seasons (currently through Season 14). Data include all sorts of information.
rpdr_ep These are episode-level data for RuPaul’s Drag Race for all available seasons (currently through Season 14). Data include all sorts of information.