[R] sampling problem - new to R
Prof Brian Ripley
ripley at stats.ox.ac.uk
Wed Jun 6 15:30:50 CEST 2007
On Tue, 5 Jun 2007, baldeck wrote:
> I have a data set of individual trees and the plots that they are in:
>
> Tree Plot
> 56749 1
> 63494 1
> 87375 1
> 37494 2
> 92753 3
> 34847 3
> 38747 4 etc...
You haven't told us what form the 'data set' is, but I will presume a data
frame called DF.
The obvious first step is to split by Plot. Using 'resample' from ?sample
sapply(with(DF, split(Tree, Plot)), resample, size=1)
give a vector of trees ('individuals'?) with names the plots sampled from.
That seems to be what you want, but if not please come back to us with a
more extensive example including the desired output.
> So each plot is represented once for every individual that occurrs in it.
> Plots get different numbers of rows because there can be a different number
> of individuals in each plot.
>
> I want to make a data frame that consists of one individual from each plot.
> I would like to randomly choose one individual from each plot that is
> present in the data set. I will have to do this to multiple data sets which
> may contain different plots, and may contain up to 1200 plots, so I can't
> choose the plots by hand.
>
> Please help me with this. I'm an ecologist and I'm in Panama, with no one
> around who is educated in R. Whoever solves this problem for me will be
> acknowledged in any resulting publications.
>
> Thanks!
> -Claire
>
--
Brian D. Ripley, ripley at stats.ox.ac.uk
Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel: +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UK Fax: +44 1865 272595
More information about the R-help
mailing list