[BioC] diagnosing problems with processing microarray data using lumi package
Juliet Hannah
juliet.hannah at gmail.com
Thu Nov 17 22:46:45 CET 2011
List,
I have illumina microarray data and I'm trying to use the lumi package
to process it. I am running into some difficulties, and I am looking
for suggestions on how to diagnose what is going wrong.
# read in expression data
fileName <- "samples.txt"
myData <- lumiR(fileName)
Duplicated IDs found and were merged!
Perform Quality Control assessment of the LumiBatch object ...
Warning messages:
1: In lumiR(fileName) :
The data file may not be in the Illumia BeadStudio or GenomeStudio
output format!
2: In matrix(as.double(exprs), nrow = nrow(allData)) :
NAs introduced by coercion
3: In matrix(as.double(se.exprs), nrow = nrow(allData)) :
NAs introduced by coercion
4: In matrix(as.double(detection), nrow = nrow(allData)) :
NAs introduced by coercion
5: In matrix(as.double(beadNum), nrow = nrow(allData)) :
NAs introduced by coercion
# the names for the controls are not compatible with a data.frame so I
had to change these.
newNames <- paste("X",sampleNames(myData),sep="")
newNames <- sub("-",".",newNames)
sampleNames(myData) <- newNames
# read in controls
fileName <- "controls.txt"
controls <- lumiR(fileName)
Annotation columns are not available in the data.
Duplicated IDs found and were merged!
Perform Quality Control assessment of the LumiBatch object ...
Warning message:
In lumiR(fileName) :
The data file may not be in the Illumia BeadStudio or GenomeStudio
output format!
# convert to data frame so I can add it to controlData slot
controls <- exprs(controls)
colnames(controls) <- paste("X",colnames(controls),sep="")
controls <- data.frame(controls)
myData at controlData <- controls
lnormdata <- lumiB(myData,method='bgAdjust')
Perform bgAdjust background correction ...
# attempting VST, but having problems
tryVST <- lumiT(lnormdata)
Perform vst transformation ...
2011-11-17 16:34:50 , processing array 1
2011-11-17 16:34:50 , processing array 2
2011-11-17 16:34:50 , processing array 3
2011-11-17 16:34:50 , processing array 4
2011-11-17 16:34:51 , processing array 5
2011-11-17 16:34:51 , processing array 6
2011-11-17 16:34:51 , processing array 7
2011-11-17 16:34:51 , processing array 8
2011-11-17 16:34:51 , processing array 9
2011-11-17 16:34:51 , processing array 10
2011-11-17 16:34:51 , processing array 11
2011-11-17 16:34:51 , processing array 12
Error in lm.fit(x, y, offset = offset, singular.ok = singular.ok, ...) :
0 (non-NA) cases
In addition: Warning messages:
1: In lumiT(lnormdata) :
Too few probes are detectable based on detection p-values!
Iteration method will
be used for VST.
2: In lumiT(lnormdata) :
Too few probes are detectable based on detection p-values!
Iteration method will
be used for VST.
3: In lumiT(lnormdata) :
Too few probes are detectable based on detection p-values!
Iteration method will
be used for VST.
Can anyone suggest how to diagnose what is going wrong? Thanks for your help.
In case it is useful at all, the beginning of the file looks like.
[Sample Probe Profile]
PROBE_ID SYMBOL 262-00715.AVG_Signal 262-00715.NARRAYS 262-00715.ARRAY_STDEV 262-00715.BEAD_STDERR 262-00715.Avg_NBEADS
> sessionInfo()
R version 2.14.0 (2011-10-31)
Platform: x86_64-unknown-linux-gnu (64-bit)
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] lumi_2.6.0 nleqslv_1.9.0 methylumi_2.0.1 Biobase_2.14.0
loaded via a namespace (and not attached):
[1] affy_1.32.0 affyio_1.22.0 annotate_1.32.0
[4] AnnotationDbi_1.16.2 BiocInstaller_1.2.1 DBI_0.2-5
[7] grid_2.14.0 hdrcde_2.15 IRanges_1.12.1
[10] KernSmooth_2.23-7 lattice_0.20-0 MASS_7.3-16
[13] Matrix_1.0-1 mgcv_1.7-11 nlme_3.1-102
[16] preprocessCore_1.16.0 RSQLite_0.10.0 xtable_1.6-0
[19] zlibbioc_1.0.0
More information about the Bioconductor
mailing list