[BioC] problem with createBeadSummaryData

Mark Dunning mark.dunning at gmail.com
Thu Mar 5 16:47:19 CET 2009


Hi Philip,

The key to this problem is to set the 'imagesPerArray'  argument. This
dataset is quite an old one ("Humanv1") and back then Illumina used to
put different probes on the two stripes (e.g A_1 and A_2) for each
sample. Running createBeadSummaryData with the default option of
imagesPerArray =1 will confuse beadarray in this case as it will
expect to find the same probes on A_1 and A_2, and will produce NA
values when it doesn't.

The following command should be used for this dataset:-

 x.norm <- createBeadSummaryData(x.illumina, log=FALSE, what="G",
method="illumina",imagesPerArray=2)

More modern Illumina data (v2 and v3) have the same set of probes on
both stripes so the default arguments should work.

Regards,

Mark

On Thu, Mar 5, 2009 at 3:04 PM, Groot, Philip de <philip.degroot at wur.nl> wrote:
> Hello all,
>
> I encountered a problem with the createBeadSummaryData function in the beadarray library.
>
> I use the Illumina example files that are available on: http://www.compbio.group.cam.ac.uk/Resources/illumina/BeadLevelExample.zip
>
> I use the readIllimuna() command to read the files into Bioconductor:
>
> readIllumina(arrayNames = Illumina_Files_Listed, textType=".csv", useImages=F, backgroundMethod="none", singleChannel=TRUE)
>
> When I try to summarize the BeadLevel object, this works:
>
> x.norm <- createBeadSummaryData(x.illumina, log=FALSE, what="G", method="illumina")
>
> However, I obtain weird exprs-values:
>
>> exprs(x)[1:5,]
>      1475542113_A_1 1475542113_A_2 1475542113_B_1 1475542113_B_2
> 50008       821.4375            NaN       854.6316            NaN
> 50014      4154.5500            NaN      3782.0000            NaN
> 50017     21880.9574            NaN     21404.0909            NaN
> 50020       110.8542            NaN       116.9821            NaN
> 50022       129.4857            NaN       107.9600            NaN
>
>
> every 2nd column is completely NaN although I correctly state that only the green channels should be used. Is this a bug in the createBeadSummaryData function?
>
> If I do the summarization array by array, it does make more sense:
>
> x.norm.2 <- createBeadSummaryData(x.illumina, log=FALSE, what="G", method="illumina", arrays=2)
>
>> exprs(x.norm.2)[1:5]
> [1] 869.3913 112.5909 104.4375 116.5116 125.0000
>> arrayNames(x.illumina)[2]
> [1] "1475542113_A_2"
>
>
> So: what causes this particular behaviour? Do I misunderstand something? What is the best way to get the summarization properly done? My goal is to create an ExpressionSetIllumina object on which I can perform normalization (quantile, vsn, etc.).
>
>> sessionInfo()
> R version 2.8.0 (2008-10-20)
> x86_64-unknown-linux-gnu
>
> locale:
> LC_CTYPE=en_US.UTF-8;LC_NUMERIC=C;LC_TIME=en_US.UTF-8;LC_COLLATE=en_US.UTF-8;LC_MONETARY=C;LC_MESSAGES=en_US.UTF-8;LC_PAPER=en_US.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_US.UTF-8;LC_IDENTIFICATION=C
>
> attached base packages:
> [1] tools     stats     graphics  grDevices utils     datasets  methods
> [8] base
>
> other attached packages:
>  [1] vsn_3.8.0           affy_1.20.0         beadarray_1.10.0
>  [4] sma_0.5.15          hwriter_1.0         geneplotter_1.20.0
>  [7] annotate_1.20.1     xtable_1.5-4        AnnotationDbi_1.4.2
> [10] lattice_0.17-20     Biobase_2.2.1       limma_2.16.3
>
> loaded via a namespace (and not attached):
> [1] affyio_1.10.1        DBI_0.2-4            grid_2.8.0
> [4] KernSmooth_2.22-22   marray_1.20.0        preprocessCore_1.4.0
> [7] RColorBrewer_1.0-2   RSQLite_0.7-1        tcltk_2.8.0
>>
>
> Regards,
>
> Dr. Philip de Groot Ph.D.
> Bioinformatics Researcher
>
>
> Wageningen University / TIFN
> Nutrigenomics Consortium
> Nutrition, Metabolism & Genomics Group
> Division of Human Nutrition
> PO Box 8129, 6700 EV Wageningen
> Visiting Address: Erfelijkheidsleer: De Valk, Building 304
> Dreijenweg 2, 6703 HA  Wageningen
> Room: 0052a
> T: +31-317-485786
> F: +31-317-483342
> E-mail:   Philip.deGroot at wur.nl <mailto:Philip.deGroot at wur.nl>
> Internet: http://www.nutrigenomicsconsortium.nl <http://www.nutrigenomicsconsortium.nl/>
>             http://humannutrition.wur.nl <http://humannutrition.wur.nl/>
>             https://madmax.bioinformatics.nl <https://madmax.bioinformatics.nl/>
>
>
>
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>



More information about the Bioconductor mailing list