[BioC] moe430v2 annotation package
jeffrey rasmussen
rasmuss at u.washington.edu
Wed Oct 20 23:58:28 CEST 2004
I am trying to build an annotation package for Affy's Moe430 2.0 array
(since it does not appear to be available under the metadata section) and
I am wondering if my approach is the same as is used to build the
"official" metadata packages.
What I did was to simply use the annotation available from Affymetrix to
map each probe id to its corresponding genbank id using the file:
http://www.affymetrix.com/Auth/analysis/downloads/taf/Mouse430_2_annot_csv.zip
and then run ABPkgBuilder as detailed in the vignette "How to use
AnnBuilder."
The reason for my concern is that the QC stats my efforts returned
indicate that I have fewer probes annotated then the sum of the QC stats
for the moe430a and moe430b packages would give me.
Here's what my moe430_2QC.rda contains:
Quality control information for moe430_2
Date built: Wed Oct 20 13:46:14 2004
Number of probes: 45102
Probe number missmatch: None
Probe missmatch: None
Mappings found for probe based rda files:
moe430_2ACCNUM found 45102 of 45102
moe430_2CHR found 31093 of 45102
moe430_2CHRLOC found 21050 of 45102
moe430_2ENZYME found 1469 of 45102
moe430_2GENENAME found 31297 of 45102
moe430_2GO found 17752 of 45102
moe430_2GRIF found 0 of 45102
moe430_2LOCUSID found 31362 of 45102
moe430_2MAP found 28106 of 45102
moe430_2OMIM found 0 of 45102
moe430_2PATH found 2757 of 45102
moe430_2PMID found 29071 of 45102
moe430_2REFSEQ found 25512 of 45102
moe430_2SUMFUNC found 0 of 45102
moe430_2SYMBOL found 31298 of 45102
moe430_2UNIGENE found 30800 of 45102
Mappings found for non-probe based rda files:
moe430_2CHRLENGTHS found 21
moe430_2ENZYME2PROBE found 387
moe430_2GO2ALLPROBES found 4483
moe430_2GO2PROBE found 3252
moe430_2ORGANISM found 1
moe430_2PATH2PROBE found 122
moe430_2PMID2PROBE found 21108
However, for example, the sum of moe430aGO (found 15262 of 22690) and
moe430bGO (found 4499 of 22575) listed under the metadata section
indicates that merging these two packages --after all, the moe430 2.0
array is just that, a merge of the A and B chips-- would give me more
annotations than my efforts produced, which is unexpected given that the
moe430a and moe430b packages were created > 6 months ago. Has something
changed within the AnnBuilder package (I'm using version 1.4.18 under
linux) or the sources used for annotation (I used getSrcUrl("all",
organism="Mus Musculus") to src the various annotation files) that would
explain these results?
Best,
Jeff
__________________________________
Jeffrey Rasmussen
Research Consultant, Bioinformatics
Department of Immunology
University of Washington
More information about the Bioconductor
mailing list