[R] how to generate a column based on other columns in a data frame

Gabor Grothendieck ggrothendieck at gmail.com
Mon Feb 11 18:04:41 CET 2008


Assuming this data frame:
DF <- data.frame(X = c(36.435, 36.435, 36.435, 35.329, 35.329,
36.431, 36.431, 35.421, 35.421, 35.421), Y = c(30.118, 30.118,
30.118, 29.657, 29.657, 30.111, 30.111, 29.797, 29.797, 29.797))

# Try this:
DF$site <- as.numeric(factor(interaction(DF$X, DF$Y)))

If X and Y can vary slightly while still referring to the same
site then round them first to k decimal places first.  See ?round


On Feb 11, 2008 11:30 AM, Weidong Gu <wgu at uab.edu> wrote:
> HI,
>
>
>
> I am working on a data set with multiple collections of mosquitoes at
> sampling sites. Each row represents a collection of individual samples
> with coordinates for each collection.
>
> ... X,  Y,...
>
> 1  36.435 30.118
>
> 2  36.435 30.118
>
> 3  36.435 30.118
>
> 4  35.329 29.657
>
> 5  35.329 29.657
>
> 6  36.431 30.111
>
> 7  36.431 30.111
>
> 8  35.421 29.797
>
> 9  35.421 29.797
>
> 10 35.421 29.797
>
>
>
> Unfortunately, there is no 'site' entry. I would like to add a column of
> 'site' based on the coordinates of samples so that samples from the same
> sites have the same site ID like S1, S2,....
>
>
>
> How to do this in R way? Thanks.
>
>
>
>
>
> Weidong Gu,
>
> Department of Medicine
> University of Alabama, Birmingham
> 1900 University Blvd., Birmingham, Alabama 35294
> Email: wgu at uab.edu
> PH: (205)-975-9053
>
>
>
>
>        [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



More information about the R-help mailing list