[R] R functions
B77S
bps0002 at auburn.edu
Fri Sep 16 07:43:56 CEST 2011
Suji,
# Here is your data ("test").
test <- structure(list(Chr = structure(c(1L, 1L, 1L, 1L, 1L, 1L, 1L,
1L, 1L, 1L), .Label = "chr2", class = "factor"), start = c(9896633L,
9896639L, 14314039L, 14404467L, 14421718L, 16031710L, 16036178L,
16048665L, 37491676L, 37702947L), end = c(9896683L, 9896690L,
14314098L, 14404502L, 14421777L, 16031769L, 16036237L, 16048724L,
37491735L, 37703009L), sample1 = c(0, 0, 0, 0, -0.43, -0.43,
-0.43, -0.43, 0, 0), sample2 = c(0, 0, -0.35, -0.35, -0.35, -0.35,
-0.35, -0.35, 0, 0)), .Names = c("Chr", "start", "end", "sample1",
"sample2"), class = "data.frame", row.names = c(NA, -10L))
# Here is where you will likely want to start (but there are many ways to
# skin a cat).
test2 <- data.frame(c(rle(test[,4])[[2]], rle(test[,5])[[2]]),
c(rle(test[,4])[[1]], rle(test[,5])[[1]]))
names(test2) <- c("Values", "Probes")
> test2
  Values Probes
1   0.00      4
2  -0.43      4
3   0.00      2
4   0.00      2
5  -0.35      6
6   0.00      2
# This is not exactly what you wanted: it gives only the last 2 columns
# (Values and Probes). The tricky part remains, but I hope this helps.
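
One possible way to fill in the remaining columns is sketched below. It is not the only approach, and it rests on assumptions that are not in the original post: all rows belong to a single chromosome, rows are already sorted by position, and summarise_runs() is a hypothetical helper written for this example rather than a function from any package. The idea is to use the run lengths from rle() to find the first and last row of each run, then take the smallest start and largest end within those rows.

```r
# Rebuild the posted data so the sketch runs on its own.
test <- data.frame(
  Chr     = rep("chr2", 10),
  start   = c(9896633L, 9896639L, 14314039L, 14404467L, 14421718L,
              16031710L, 16036178L, 16048665L, 37491676L, 37702947L),
  end     = c(9896683L, 9896690L, 14314098L, 14404502L, 14421777L,
              16031769L, 16036237L, 16048724L, 37491735L, 37703009L),
  sample1 = c(0, 0, 0, 0, -0.43, -0.43, -0.43, -0.43, 0, 0),
  sample2 = c(0, 0, -0.35, -0.35, -0.35, -0.35, -0.35, -0.35, 0, 0),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not from any package).
# Assumes one chromosome and rows sorted by position.
summarise_runs <- function(df, col, sample_id) {
  r     <- rle(df[[col]])             # runs of identical consecutive values
  last  <- cumsum(r$lengths)          # last row index of each run
  first <- last - r$lengths + 1L      # first row index of each run
  data.frame(
    Sample = sample_id,
    Chr    = df$Chr[first],
    Start  = mapply(function(i, j) min(df$start[i:j]), first, last),
    End    = mapply(function(i, j) max(df$end[i:j]), first, last),
    Values = r$values,
    Probes = r$lengths
  )
}

# One block of rows per sample column, stacked together.
result <- rbind(summarise_runs(test, "sample1", 1L),
                summarise_runs(test, "sample2", 2L))
print(result)
```

With more than one chromosome in the file, the same helper could be applied to each chromosome separately (for example after split(test, test$Chr)), since rle() alone would otherwise merge runs across a chromosome boundary.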
##########################################
B77S wrote:
>
> If your data is named 'test_file'
>
> then use dput(test_file)
>
> You can copy and paste the results here so people can more easily try and
> help you.
>
> see
> ?dput
>
>
>
>
> sujitha wrote:
>>
>> Hi group,
>> I am trying to write code to do the following.
>> This is what the test file looks like:
>> Chr start end sample1 sample2
>> chr2 9896633 9896683 0 0
>> chr2 9896639 9896690 0 0
>> chr2 14314039 14314098 0 -0.35
>> chr2 14404467 14404502 0 -0.35
>> chr2 14421718 14421777 -0.43 -0.35
>> chr2 16031710 16031769 -0.43 -0.35
>> chr2 16036178 16036237 -0.43 -0.35
>> chr2 16048665 16048724 -0.43 -0.35
>> chr2 37491676 37491735 0 0
>> chr2 37702947 37703009 0 0
>>
>> Now I want to summarize the values like
>> Sample Chr Start End Values Probes
>> 1 chr2 9896633 14404502 0 4
>> 1 chr2 14421718 16048724 -0.43 4
>> 1 chr2 37491676 37703009 0 2
>> 2 chr2 9896633 9896690 0 2
>> 2 chr2 14314039 16048724 -0.35 6
>> 2 chr2 37491676 37703009 0 2
>>
>> Here the start for each line is the smallest start value within a run of
>> identical values (4 probes in the first line), and the end is the largest
>> end value in that run. Values is the value shared by the probes in the run.
>> Can I get some ideas or suggestions for doing this? I am new to
>> serious programming in R.
>> waiting for your suggestions,
>> Thanks,
>> suji
>>
>
--
View this message in context: http://r.789695.n4.nabble.com/R-functions-tp3816748p3817431.html
Sent from the R help mailing list archive at Nabble.com.