[R] obtaining first and last record for rows with same identifier

Frank E Harrell Jr f.harrell at vanderbilt.edu
Wed May 25 14:20:28 CEST 2005


Francisco J. Zagmutt wrote:
> If you want to obtain a data frame you can use the functions head and 
> tail like:
> 
> dat=data.frame(id=rep(1:5,3),num=rnorm(15), num2=rnorm(15))#Creates data 
> frame with id
> last=do.call("rbind",by(dat,dat$id,tail,1))#Selects the last observation 
> for each id
> first=do.call("rbind",by(dat,dat$id,head,1))#Selects the first 
> observation for each id
> newdat=rbind(first,last)#Joins data
> newdat=newdat[order(newdat$id),]#sorts data by id
> 
> Notice that rownames will give you the original row location of the 
> observations selected
> 
> I hope this helps
> 
> Francisco
> 
. . .

You might also look at section 4.3 of
http://biostat.mc.vanderbilt.edu/twiki/pub/Main/RS/sintro.pdf
-- 
Frank E Harrell Jr   Professor and Chair           School of Medicine
                      Department of Biostatistics   Vanderbilt University




More information about the R-help mailing list