[BioC] remove duplicated rows generated by read.ilmn()
Rao,Xiayu
XRao at mdanderson.org
Tue Jul 29 21:35:21 CEST 2014
Hello,
Sorry for my last email. The extra rows are pointing to the control probes, which will be gone after background correction and normalization by neqc().
Thanks,
Xiayu
-----Original Message-----
From: bioconductor-bounces at r-project.org [mailto:bioconductor-bounces at r-project.org] On Behalf Of Rao,Xiayu
Sent: Tuesday, July 29, 2014 12:59 PM
To: 'bioconductor at r-project.org'
Subject: [BioC] remove duplicated rows generated by read.ilmn()
Hello,
It is interesting to see that read.ilmn() generates some extra rows. It turns out that the extra rows are duplicates, possibly due to the different decimal points from different data sets when I combined the data from different chips together. How do I remove the duplicates (keep only the first row) for the ElistRaw class, x ??
I simply merged different data sets to keep only shared probeIDs and SYMBOLs, and I removed the rows with blank values for the SYMBOL column.
library(limma)
x <- read.ilmn(files="all_samples.txt",ctrlfiles="all_samples_cont.txt",other.columns="Detection")
x
> x$E[rownames(x$E)=="1430239",]
X8508853077_D X8508853077_G X8909358290_B X8909358290_C X8909358290_E X8909358290_F X8909358290_G X8909358290_H X8909358290_I
1430239 9375.439 6950.326 2623.736 2923.202 13051.55 9663.131 9752.901 6377.608 7601.526
1430239 9375.400 6950.300 2623.700 2923.200 13051.60 9663.100 9752.900 6377.600 7601.500
X167.H1 X403.H1 X495.E6 X527.E5 X544.B8 X619.C7 X625.D3 X340.H1 X648.B4
1430239 11586.2 12608.4 13100.6 10750.6 13198.5 12399.2 17938.1 14340.3 17329
1430239 11586.2 12608.4 13100.6 10750.6 13198.5 12399.2 17938.1 14340.3 17329
NOTE: R and limma are in the latest version.
Thank you very much!
Thanks,
Xiayu
[[alternative HTML version deleted]]
_______________________________________________
Bioconductor mailing list
Bioconductor at r-project.org
https://stat.ethz.ch/mailman/listinfo/bioconductor
Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list