[R] Retrosheet Data
Philip
herd_dog @end|ng |rom cox@net
Thu Oct 8 19:52:47 CEST 2020
Have used Jim Albert’s code to extract baseball data from Retrosheet.
function(team){
P <- getRetrosheet("play",2013,team)
get_plays <- function(j) data.frame(Game=P[[j]]$id[1],P[[j]]$play)#from $id and $play section of retrosheet data
do.call("rbind",lapply(1:length(P),get_plays))
}
all_plays <- do.call("rbind",lapply(Teams[1:30],get_team_plays))
head(all_plays,10)
I get the following six variables from the play section of the retrosheet data:
head(all_plays,10)
Game inning team retroID count pitches play
1 TEX201304050 1 0 troum001 01 CX 63/G
2 TEX201304050 1 0 aybae001 12 TBFX 7/F
3 TEX201304050 1 0 pujoa001 00 X 6/L
4 TEX201304050 1 1 kinsi001 02 FCX 53/G
Very useful but I would also like to access the $info, $start, $sub, and $data sections. I have tried replacing $play in the line below with $info, $start, etc. without success.
get_plays <- function(j) data.frame(Game=P[[j]]$id[1],P[[j]]$play) with
get_plays <- function(j) data.frame(Game=P[[j]]$id[1],P[[j]]$info)
I keep getting the same Game, inning, team, etc. data.
Can anyone give me some direction.
Thanks,
Philip Heinrich
[[alternative HTML version deleted]]
More information about the R-help
mailing list