[R] to divide column cells by the mean of another column
arun
smartpink111 at yahoo.com
Sun Apr 20 18:17:55 CEST 2014
Hi Andre,
A slight correction:
fun1 <- function(beginColumn, by, data) {
indx <- seq(beginColumn, ncol(data), by = by)
dataNew <- data[, indx[1]:ncol(data)]
indx1 <- cumsum(seq(ncol(data)) %in% indx)
indx2 <- indx1[indx1 != 0]
lst1 <- lapply(split(seq_along(indx2), indx2), function(i) {
x1 <- dataNew[, i, drop = FALSE]
if (ncol(x1) > 1) {
x1[, -1, drop = FALSE]/mean(x1[, 1]) ###changed
}
})
res <- data.frame(lst1[sapply(lst1, length) > 0])
colnames(res) <- gsub(".*\\.", "", colnames(res))
res
}
Also, you can try:
fun2 <- function(beginColumn, by, data) {
indx <- seq(beginColumn, ncol(data), by = by)
indx1 <- head(indx + 1, -1)
indx2 <- tail(indx - 1, -1)
vec1 <- as.vector(sapply(seq_along(indx1), function(i) indx1[i]:indx2[i]))
if (ncol(data) > tail(indx, 1)) {
vec2 <- c(vec1, (tail(indx, 1) + 1):ncol(dat1))
} else {
vec2 <- vec1
}
means1 <- rep(colMeans(data[, indx]), each = by - 1, length.out = length(vec2))
res <- as.data.frame(t(t(data[, vec2])/means1))
res
}
set.seed(458)
dat1 <- as.data.frame(matrix(sample(5,10*5,replace=TRUE),ncol=10))
identical(fun1(2,4,dat1),fun2(2,4,dat1))
#[1] TRUE
A.K.
Hi,
May be this helps:
fun1 <- function(beginColumn, by, data) {
indx <- seq(beginColumn, ncol(data), by = by)
dataNew <- data[, indx[1]:ncol(data)]
indx1 <- cumsum(seq(ncol(data)) %in% indx)
indx2 <- indx1[indx1 != 0]
lst1 <- lapply(split(seq_along(indx2), indx2), function(i) {
x1 <- dataNew[, i, drop = FALSE]
if (ncol(x1) > 1) {
x1[, -1]/mean(x1[, 1])
}
})
res <- data.frame(lst1[sapply(lst1, length) > 0])
colnames(res) <- gsub(".*\\.", "", colnames(res))
res
}
set.seed(458)
dat1 <- as.data.frame(matrix(sample(5,10*5,replace=TRUE),ncol=10))
set.seed(34)
dat2 <- as.data.frame(matrix(sample(20,21*5,replace=TRUE),ncol=21))
fun1(2,4,dat1)
fun1(2,5,dat2)
A.K.
On Sunday, April 20, 2014 5:55 AM, Andre Zacharia <andre.zacharia at gmail.com> wrote:
Dear all,
I am getting data columnwise that I need to divide by the mean of another
column
If the column is the previous one this code works perfectly well:
fun1 <- function(beginColumn, by, data) { indx <- seq(beginColumn,
ncol(data), by = by) as.data.frame(t(100 - (t(data[, indx])/colMeans(data[,
indx - 1], na.rm = TRUE)) * 100))
}
(Arun helped me with this code, thank you again!...)
But, the things is now more complicated...
I need to program a function that allow me to divide for example cells from
column 3 on mean from column 2 and cells from column 4 on mean of column 2
and the 5 etc. Then column 6 is another column from whch I need to extract
the mean and to do the same with column 7 and 8, etc...
so if I have:
1 2 3 4 1 5
2 5 4 7 2 8
3 4 5 9 3 7
4 7 7 9 4 3
The serie 1,2,3,4 ar just enumerating so not useful at this timepoint.
the results should be (from excel...):
4,5 33,3333333 11,1111111 -11,1111111 11,1111111 -55,5555556 -77,7777778
-11,1111111 -100 -55,5555556 -55,5555556 -100
33,3333333
I tried to work on modyfying indx-1 by 2*indx-2, but this is not doing the
job... I tried many other things so that I am now stucked.
Does Anyone has a brilliant idea?
Many many thanks
André ZACHARIA
[[alternative HTML version deleted]]
______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list