[BioC] Different probesets but same expression values?
James W. MacDonald
jmacdon at uw.edu
Fri Feb 14 16:53:35 CET 2014
Hi David,
It's pretty easy to explain: the probesets for this gene are all the
same. With the exception of probeset 17042487, each one is made up of
the exact same 36 probes, so RMA summarizes identical probe intensities
and returns identical expression values for every one of them.
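> library(pd.hugene.2.0.st)
> library(hugene20sttranscriptcluster.db)
> ## open a connection to the SQLite database inside the pd package
> con <- db(pd.hugene.2.0.st)
> ## probeset IDs annotated to Entrez Gene ID 6890
> prbids <- select(hugene20sttranscriptcluster.db, "6890", "PROBEID",
+                  "ENTREZID")
> ## probe IDs (fid) that make up each of the first seven probesets
> dat <- lapply(prbids[1:7,2], function(x) dbGetQuery(con,
+     paste("select fid from pmfeature inner join core_mps on
+     core_mps.fsetid=pmfeature.fsetid where core_mps.meta_fsetid='", x, "';",
+     sep = "")))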
> dat2 <- do.call("cbind", dat)
> apply(dat2, 1, function(x) length(unique(x)))
 1  2  3  4  5  6  7  8  9 10 11 12
 1  1  1  1  1  1  1  1  1  1  1  1
13 14 15 16 17 18 19 20 21 22 23 24
 1  1  1  1  1  1  1  1  1  1  1  1
25 26 27 28 29 30 31 32 33 34 35 36
 1  1  1  1  1  1  1  1  1  1  1  1
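To make the logic of that check easier to follow, here is a minimal
sketch on made-up data. The probe IDs and the three toy probesets are
invented for illustration and are not taken from pd.hugene.2.0.st; only
the cbind/apply step mirrors the code above.

```r
# Toy illustration only: these probe (feature) IDs are invented.
probes <- 1001:1036                      # 36 hypothetical probe IDs

# One data.frame per probeset, mimicking the shape of a dbGetQuery() result.
dat <- list(
  data.frame(fid = probes),              # probeset A
  data.frame(fid = probes),              # probeset B, same probes as A
  data.frame(fid = probes)               # probeset C, same probes as A
)

# One column per probeset, one row per probe position.
dat2 <- do.call("cbind", dat)

# Count the distinct probe IDs in each row; 1 means all probesets agree.
n_unique <- apply(dat2, 1, function(x) length(unique(x)))
all_same <- all(n_unique == 1)

# Swap a single probe in probeset C and the same check flags that row.
dat[[3]]$fid[5] <- 9999L
dat2_changed <- do.call("cbind", dat)
flagged <- which(apply(dat2_changed, 1, function(x) length(unique(x))) > 1)
```

A row count of 1 everywhere means the probesets share their probes
position by position. Note that this row-wise comparison assumes each
query returns the probe IDs in the same order.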
Best,
Jim
On 2/14/2014 5:13 AM, David wrote:
> Dear list,
>
> I have just performed an analysis on a set of
> hugene2.1 arrays. I normalised them using the RMA method and did the
> statistical analysis using limma. I found that the probesets for the
> same gene have exactly the same expression values, and I wonder if
> anyone can explain why.
>
> Here is one example.
>
> Values from the normalised expressionSet for a few of the samples
>
> Probeset.ID Sample1 Sample2 Sample3 Sample4 Sample5 Sample6
> 17017979 9.71251093382803 10.0024198453553 10.1431406981423 10.4515608924303 10.3890663027479 10.6017676219535
> 17027144 9.71251093382803 10.0024198453553 10.1431406981423 10.4515608924303 10.3890663027479 10.6017676219535
> 17029788 9.71251093382803 10.0024198453553 10.1431406981423 10.4515608924303 10.3890663027479 10.6017676219535
> 17032476 9.71251093382803 10.0024198453553 10.1431406981423 10.4515608924303 10.3890663027479 10.6017676219535
> 17034791 9.71251093382803 10.0024198453553 10.1431406981423 10.4515608924303 10.3890663027479 10.6017676219535
> 17037271 9.71251093382803 10.0024198453553 10.1431406981423 10.4515608924303 10.3890663027479 10.6017676219535
> 17039977 9.71251093382803 10.0024198453553 10.1431406981423 10.4515608924303 10.3890663027479 10.6017676219535
>
>
> Values from the limma differential expression results, excluding the AvgExpr column, which also shows the same value for every probeset:
> ID logFC P.Value adj.P.Val Gene Description
> 17017979 0,274433596 0,000514141 0,275536509 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)
> 17027144 0,274433596 0,000514141 0,275536509 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)
> 17029788 0,274433596 0,000514141 0,275536509 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)
> 17032476 0,274433596 0,000514141 0,275536509 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)
> 17034791 0,274433596 0,000514141 0,275536509 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)
> 17037271 0,274433596 0,000514141 0,275536509 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)
> 17039977 0,274433596 0,000514141 0,275536509 transporter 1, ATP-binding cassette, sub-family B (MDR/TAP)
>
> Any suggestions will be greatly appreciated
>
> Best,
>
> Dave
>
>
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099