[BioC] Fwd: Annotation discrepancy
James W. MacDonald
jmacdon at uw.edu
Fri Dec 20 20:05:07 CET 2013
Hi Eric,
Most if not all of those probes are the oligo-dT probes that surround
the chip (and I believe there are some in the middle as well). These
probes are used by the scanner as 'landing lights' to allow the scanner
to accurately align to the array prior to doing the scan.
The scanner does collect data from these probes, which ends up in the
CEL file, but those values are then ignored when the array is processed
further.
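
If you want to see where the extra probes sit on the array, a rough
sketch along these lines may help. It uses made-up toy values in place
of the real objects: 'pm_idx' stands in for unlist(pmindex(dat)),
'probe_tab' for the probe package data.frame, and 'nc' for the number
of columns on the array. I am assuming the x/y columns in the probe
package are 0-based and that the index arithmetic matches what
affy::xy2indices() does with its default offset, so check both before
relying on it.

```r
# Toy stand-ins (hypothetical values, not real chip data)
nc <- 4                                        # columns on the array
probe_tab <- data.frame(x = c(0, 1, 2),        # assumed 0-based x/y,
                        y = c(0, 1, 2))        # as in a probe package
pm_idx <- c(1, 6, 11, 4, 16)                   # like unlist(pmindex(dat))

# Assumed to mirror affy::xy2indices() with a zero xy offset
xy_to_index <- function(x, y, nc) x + nc * y + 1

# Cell indices for the probes listed in the probe table
probe_idx <- xy_to_index(probe_tab$x, probe_tab$y, nc)

# PM indices present in the CDF but absent from the probe table
extra_idx <- setdiff(pm_idx, probe_idx)

# Convert the leftover indices back to 0-based (x, y) positions
extra_xy <- data.frame(x = (extra_idx - 1) %% nc,
                       y = (extra_idx - 1) %/% nc)
print(extra_xy)
```

With the real data you would swap in unlist(pmindex(dat)), the
hthgu133pluspmprobe data.frame and the array's column count, and then
look at whether the leftover (x, y) positions cluster around the border
of the chip.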
Best,
Jim
On 12/20/2013 1:28 PM, Eric Zollars wrote:
> All-
>
> I have been attempting to compare sequences on the HGU133 Plus 2.0 chip to
> the HT HGU 133+ PM.
> I am doing this to compare values of vectors in frma.
>
> The HT chip is a subset of HGU133 Plus 2.0 with mismatch probes removed and
> some probesets reduced in size.
>
> Looking at the probe package:
>
> hthgu133pluspmprobe$sequence: 519370
>
> However, when looking at an AffyBatch object ('dat') made from HT CEL files:
>
> Index <- pmindex(dat)
> tv = unlist(Index)
> length(tv) #536460
>
> It appears that the AffyBatch reports 536460 sequences, while the
> hthgu133pluspmprobe package reports only 519370.
>
> What is the difference? Is it possible to find the information on the
> 17090 sequences not in the hthgu133pluspmprobe package?
>
> Thanks for any information or direction.
>
> Eric Zollars
>
> Session info below: bioconductor 2.13, R 3.0.2
>
>> sessionInfo()
> R version 3.0.2 (2013-09-25)
> Platform: i386-w64-mingw32/i386 (32-bit)
>
> locale:
> [1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252
> [3] LC_MONETARY=English_United States.1252 LC_NUMERIC=C
> [5] LC_TIME=English_United States.1252
>
> attached base packages:
> [1] parallel stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] affy_1.40.0 hthgu133pluspmcdf_2.13.0 hgu133plus2frmavecs_1.3.0
> [4] hgu133plus2probe_2.13.0 hthgu133pluspmprobe_2.13.0 AnnotationDbi_1.24.0
> [7] Biobase_2.22.0 BiocGenerics_0.8.0 BiocInstaller_1.12.0
>
> loaded via a namespace (and not attached):
> [1] affyio_1.30.0 DBI_0.2-7 IRanges_1.20.6
> [4] preprocessCore_1.24.0 RSQLite_0.11.4 stats4_3.0.2
> [7] tools_3.0.2 zlibbioc_1.8.0
>
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099