[Bioc-devel] yeast2 metadata

John Zhang jzhang at jimmy.harvard.edu
Fri Jul 21 19:57:20 CEST 2006


   
>The probe number missmatch may because of these codes in function
>getProbe2SGD.:
>
>    merged <- merge(probes, temp, by = by, all.x = TRUE)
>    merged <- merged[!duplicated(merged[, 1]), ]
>
>Here "probes" is the probeset ID to ORF ID mapping obtained from
>Affymetrix. All probeset IDs are included (10928 probeset IDs in total).
>"temp" is the ORF ID to SGD ID mapping obtained from
>ftp://genome-ftp.stanford.edu/pub/yeast/data_download/literature_curation/orf_g
eneontology.tab. 
>
>They are first merged together, so still have 10928 probeset IDs. And
>then when multiple probeset IDs are mapped to the same ORF ID, only the
>first probeset ID is kept. Because the 10928 probeset IDs are mapped to
>only 6356 unique ORF IDs, only 6356 probeset IDs are kept for further
>annotation process. I guess that's why we get probe number mismatch. It
>seems this is the author's intention to remove probeset IDs with
>duplicated mappings. Maybe the author can explain more.
>

Probe number mismatch is not a serious problem but should be avoided. I will try 
to look at this when I get the chance. 


>thanks
>
>nianhua
>
>> I would ask if it would be possible not to release metadata packages showing 
>> missmatches.
>>
>> regards and thanks for any comment
>>
>> mattia
>>
>>
>
>_______________________________________________
>Bioc-devel at stat.math.ethz.ch mailing list
>https://stat.ethz.ch/mailman/listinfo/bioc-devel

Jianhua Zhang
Department of Medical Oncology
Dana-Farber Cancer Institute
44 Binney Street
Boston, MA 02115-6084



More information about the Bioc-devel mailing list