[BioC] Differentially expressed genes from specific cell types from a single sample
Sandy [guest]
guest at bioconductor.org
Thu Apr 18 16:19:51 CEST 2013
I now have microarray data from different cell types and ideally the idea was to perform differential gene expression analysis. However, I was wondering if it would be possible to identify cell type specific marker genes for these cell types.
Genes that are predominantly expressed in a particular cell type and not in other (of course when performing DE Analysis, one would get this contrast with respect to two cell types.) But in this case, when one has many cell types, my approach would be to contrast all combinations.
My question is to find out if there are any direct method that is available to identify a set DEG's that explicitly belong a cell type from a given dataset.
My dataset looks this way:
Probes celltype1 celltype2 celltype3 celltype4
---------------------------------------------------
gene1 5.098 4.677 7.456 8.564
gene2 8.906 6.653 6.754 7.546
gene3 7.409 5.432 6.724 5.345
gene4 4.876 5.981 4.290 4.267
gene4 3.567 3.425 8.564 6.345
gene5 2.569 8.645 5.234 7.345
The dimension is 22810(genes) x 20(cell types).
I tried to use the csSAM package in R.
csSamWrapper(G, cc, y, nperms = 200,alternative = "two.sided", standardize = TRUE,medianCenter = TRUE, logRm = FALSE, logBase = 2,nonNeg = TRUE, fileName = "csSAMout.pdf")
where G is my expression matrix where I have the genes as rows and cell types as columns.
and cc stands for the Matrix of cell-frequency. (n by k, n samples, k cell-types).
and y for A numeric vector of group association of each sample. Either 1 or 2.
The example from the package had several samples.
I am not clear as how to construct the cell frequency matrix because I have only one sample. Are there other packages which can be used to find DEG s from a single cell type without comparison?
-- output of sessionInfo():
R version 2.15.2 (2012-10-26)
Platform: i686-redhat-linux-gnu (32-bit)
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8 LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] limma_3.14.4 csSAM_1.2.1 GOstats_2.24.0 RSQLite_0.10.0 DBI_0.2-5
[6] graph_1.36.2 Category_2.22.0 AnnotationDbi_1.20.5 affy_1.36.1 Biobase_2.16.0
[11] BiocGenerics_0.4.0 R.utils_1.23.2 R.oo_1.13.0 R.methodsS3_1.4.2
loaded via a namespace (and not attached):
[1] affyio_1.22.0 annotate_1.36.0 AnnotationForge_1.0.3 BiocInstaller_1.8.3 genefilter_1.40.0
[6] GO.db_2.8.0 GSEABase_1.18.0 IRanges_1.16.6 parallel_2.15.2 preprocessCore_1.18.0
[11] RBGL_1.34.0 splines_2.15.2 stats4_2.15.2 survival_2.36-14 tools_2.15.2
[16] XML_3.9-4 xtable_1.6-0 zlibbioc_1.4.0
--
Sent via the guest posting facility at bioconductor.org.
More information about the Bioconductor
mailing list