[BioC] trouble with yeastann function in AnnBuilder
John Zhang
jzhang at pascal.dfci.harvard.edu
Thu Feb 3 14:29:24 CET 2005
Sorry, I forgot to update the man page. base should be a file name. If you would
like to build a annotation package, yeastPkgBuilder is the correct function to
use. Thanks.
>X-Original-To: jzhang at jimmy.harvard.edu
>Delivered-To: jzhang at jimmy.harvard.edu
>Mime-Version: 1.0 (Apple Message framework v619.2)
>To: bioconductor at stat.math.ethz.ch
>From: Jacob Michaelson <jjmichael at comcast.net>
>Date: Wed, 2 Feb 2005 19:06:52 -0700
>Received-SPF: none (hypatia: domain of bioconductor-bounces at stat.math.ethz.ch
does not designate permitted sender hosts)
>Received-SPF: none (hypatia: domain of jjmichael at comcast.net does not designate
permitted sender hosts)
>X-Virus-Scanned: by amavisd-new at stat.math.ethz.ch
>Content-Disposition: inline
>Content-Transfer-Encoding: 7bit
>Subject: [BioC] trouble with yeastann function in AnnBuilder
>X-BeenThere: bioconductor at stat.math.ethz.ch
>X-Mailman-Version: 2.1.5
>List-Id: The Bioconductor Project Mailing List <bioconductor.stat.math.ethz.ch>
>List-Unsubscribe: <https://stat.ethz.ch/mailman/listinfo/bioconductor>,
<mailto:bioconductor-request at stat.math.ethz.ch?subject=unsubscribe>
>List-Archive: <https://stat.ethz.ch/pipermail/bioconductor>
>List-Post: <mailto:bioconductor at stat.math.ethz.ch>
>List-Help: <mailto:bioconductor-request at stat.math.ethz.ch?subject=help>
>List-Subscribe: <https://stat.ethz.ch/mailman/listinfo/bioconductor>,
<mailto:bioconductor-request at stat.math.ethz.ch?subject=subscribe>
>X-Spam-Checker-Version: SpamAssassin 3.0.1 (2004-10-22) on
pascal.dfci.harvard.edu
>X-Spam-Level:
>X-Spam-Status: No, score=0.1 required=3.0 tests=FORGED_RCVD_HELO autolearn=ham
version=3.0.1
>
>I think I finally found what I need to build a decent Yeast 2.0
>annotation package, but unfortunately, I keep getting errors.
>
>The base file needed for yeastann is a two-column matrix containing
>probe ids and SGD accession numbers. I created such a file by
>whittling down Affymetrix's Yeast 2.0 CSV annotation file into those
>two columns.
>
> From the AnnBuilder documentation for yeastann:
>
>Arguments
>
>base
>base a matrix with two columns. The first column is probe ids and the
>second one are the mappings to SGD ids used by all the Yeast Genome
>data files. If base = "", the whole genome will be mapped based on a
>data file that contains mappings between all the ORFs and SGD ids
>
>I then imported the above file into R using:
>
> > yeastsgd=read.csv("yeastsgd.csv")
>
>and then
>
> > yeastsgd=as.matrix(yeastsgd)
>
>...just to be safe (although I also tried without and still got errors).
>
>Then I pasted the default function from the yeastann documentation,
>substituting my yeastsgd matrix into the base =" " position as follows:
>
>yeastAnn(base = yeastsgd, yGenoUrl =
>+
>"ftp://genome-ftp.stanford.edu/pub/yeast/data_download/",
>+ yGenoNames =
>+ c("literature_curation/gene_literature.tab",
>+ "chromosomal_feature/SGD_features.tab",
>+ "literature_curation/gene_association.sgd"), toKeep =
>+ list(c(6, 1), c(9, 2, 5, 6, 8, 11, 3), c(2, 5, 7)),
>+ colNames = list(c("sgdid", "pmid"), c("sgdid",
>+ "genename", "chr", "chrloc", "chrori", "description",
>+ "alias"), c("sgdid", "go")), seps = c("\t", "\t",
>+ "\t"), by = "sgdid")
>
>which immediately produced the error message (FYI in the "base"
>position I tried both with and without quotes, both of which produced
>errors):
>
>Error in file(file, "r") : unable to open connection
>In addition: Warning messages:
>1: the condition has length > 1 and only the first element will be used
>in: if (probe2ORF == "") {
>2: only first element of `description' argument used
>3: cannot open file `1769308_at'
>
>I'm not sure why it keeps misunderstanding a matrix when the
>documentation specifically calls for a matrix (for some reason it
>thinks 1769308_at is a file and not a probe).
>
>I verified that the imported data contains the data in the proper
>placement, which it does, and the dimensions seem to be correct:
>
> > dim(yeastsgd)
>[1] 10928 2
>
>Also FYI here's the error message when the above yeastann function is
>implemented and yeastsgd is imported using read.csv, but not as.matrix:
>
>Error in read.table(probe2ORF, sep = "\t", header = FALSE, strip.white
>= TRUE) :
> argument 'file' must be a character string or connection
>In addition: Warning message:
>the condition has length > 1 and only the first element will be used
>in: if (probe2ORF == "") {
>
>I'd really appreciate any help or suggestions.
>
>Thanks!
>
>--Jake Michaelson
>
> [[alternative text/enriched version deleted]]
>
>_______________________________________________
>Bioconductor mailing list
>Bioconductor at stat.math.ethz.ch
>https://stat.ethz.ch/mailman/listinfo/bioconductor
Jianhua Zhang
Department of Medical Oncology
Dana-Farber Cancer Institute
44 Binney Street
Boston, MA 02115-6084
More information about the Bioconductor
mailing list