[BioC] crlmm with Illumina arrays
Lavinia Gordon
lavinia.gordon at mcri.edu.au
Thu Feb 18 00:22:14 CET 2010
To authors/users of crlmm (and probably VanillaICE)
I am using crlmm with Illumina arrays (human610quadv1b)
I have run into a few problems and would appreciate any tips on how to
troubleshoot.
The issues may be related to the quality of the arrays, however they
have all run through the standard Illumina software without problems.
I am following a (slightly) modified version of the script
illumina_copynumber.Rnw,
finishing with:
crlmmWrapper(sampleSheet=samplesheet,
arrayNames=arrayNames,
arrayInfoColNames=list(barcode="SentrixBarcode_A",
position="SentrixPosition_A"),
saveDate=TRUE,
cdfName=cdfName,
load.it=FALSE,
save.it=FALSE,
intensityFile=file.path(outdir, "normalizedIntensities.rda"),
crlmmFile=file.path(outdir, "snpsetObject.rda"),
rgFile=file.path(outdir, "rgFile.rda"))
If I run:
update(filename, adj.bias=TRUE)
I get this error:
Processing OUTDIR/crlmmSetList_1.rda ...
----------------------------------------------------------------------------
- Estimating copy number for chromosome 1
----------------------------------------------------------------------------
Error in computeCopynumber(object, CHR = CHR, ...) :
formal argument "CHR" matched by multiple actual arguments
(not just for chromosome 1)
So instead I have been using 'computeCopyNumber':
crlmmSetList <- computeCopynumber(crlmmSetList, CHR, bias.adj=TRUE,
SNRmin=5, cdfName, batch=scanbatch)
save(crlmmSetList, file=file.path(outdir, paste("crlmmSetList_", CHR,
".rda", sep="")))
This runs for all bar chromosome 23 and 24 (which is probably due to the
fact that these .rda files are tiny, ~1.9kb)
[1] "OUTDIR/crlmmSetList_23.rda"
Fitting model for copy number estimation...
Using 50 df for inverse chi squares.
Estimating gender
Error in kmeans(XMedian, c(min(XMedian[SNR > SNRmin]), max(XMedian[SNR > :
initial centers are not distinct
> traceback()
5: stop("initial centers are not distinct")
4: kmeans(XMedian, c(min(XMedian[SNR > SNRmin]), max(XMedian[SNR >
SNRmin])))
3: instantiateObjects(calls = calls, conf = conf, NP = NP, plate = plate,
envir = envir, chrom = chrom, A = A, B = B, gender = gender,
SNR = SNR, SNRmin = SNRmin, pkgname = cdfName)
2: .computeCopynumber(chrom = CHR, A = A(ABset), B = B(ABset), calls =
calls(snpset),
conf = confs(snpset), NP = A(NPset), plate = batch, envir = envir,
SNR = ABset$SNR, bias.adj = FALSE, SNRmin = SNRmin, cdfName =
cdfName,
...)
1: computeCopynumber(crlmmSetList, CHR, bias.adj = TRUE, SNRmin = 5,
cdfName, batch = scanbatch)
Then following a slightly modified version of the script copynumber.Rnw,
beginning from the section:
CHR <- 2
if(!exists("crlmmSetList")) load(file.path(outdir,
paste("crlmmSetList_", CHR, ".rda", sep="")))
show(crlmmSetList)
....
to
if(!exists("hmmPredictions")){
+ hmmPredictions <- viterbi(emission=emission.cn,
+ initialStateProbs=log(initialP),
+ tau=tau[, "transitionPr"],
+ arm=tau[, "arm"],
+ normalIndex=3,
+ normal2altered=0.1,
+ altered2normal=1,
+ altered2altered=0.01)
+ }
I get:
Error: NA/NaN/Inf in foreign function call (arg 1)
> summary(is.na(emission.cn))
Mode FALSE TRUE NA's
logical 8806640 3473576 0
> head(tau)
chromosome position arm transitionPr
[1,] 2 8856 0 0.9998882
[2,] 2 14445 0 0.9998708
[3,] 2 20906 0 0.9999908
[4,] 2 21366 0 0.9999915
[5,] 2 21791 0 0.9999756
[6,] 2 23012 0 0.9999245
> traceback()
2: .C("viterbi", tmp[[1]], tmp[[2]], tmp[[3]], tmp[[4]], tmp[[5]],
tmp[[6]], tmp[[7]], tmp[[8]], tmp[[9]], tmp[[10]], tmp[[11]],
tmp[[12]], tmp[[13]])
1: viterbi(emission = emission.cn, initialStateProbs = log(initialP),
tau = tau[, "transitionPr"], arm = tau[, "arm"], normalIndex = 3,
normal2altered = 0.1, altered2normal = 1, altered2altered = 0.01)
> sessionInfo()
R version 2.10.1 (2009-12-14)
x86_64-unknown-linux-gnu
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=C LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=en_US.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] human610quadv1bCrlmm_1.0.1 RColorBrewer_1.0-2
[3] SNPchip_1.10.0 oligoClasses_1.8.0
[5] VanillaICE_1.8.0 crlmm_1.4.1
[7] Biobase_2.6.1
loaded via a namespace (and not attached):
[1] affyio_1.14.0 annotate_1.24.1 AnnotationDbi_1.8.1
[4] Biostrings_2.14.10 DBI_0.2-5 ellipse_0.3-5
[7] genefilter_1.28.2 IRanges_1.4.9 mvtnorm_0.9-8
[10] preprocessCore_1.8.0 RSQLite_0.8-1 splines_2.10.1
[13] survival_2.35-7 tools_2.10.1 xtable_1.5-6
>
Any advice greatly appreciated,
with regards
Lavinia Gordon
Bioinformatics officer
Murdoch Childrens Research Institute
The Royal Children s Hospital, Flemington Road, Parkville, Victoria
3052, Australia
More information about the Bioconductor
mailing list