Hi


I am trying to analyze a data set consisting of data run on two different times a few months a part. The data set consists of 8 groups with 3 biological replicates in each, and Illumina Human WG6 v3 arrays have been used. I am using the probe profile file in the analysis.

After the quality control it looks like the data is separated into the different groups (8), but I can also slightly see the arrays separate them self into the two groups based on when they were run.

To try to block the effect caused by the two lab periods I thought of using duplicateCorrelation. Unfortunately I can not get it to work this time,

This is my design matrix:
> designMa
      S0_s S18_s S1_s S4_s T0_s T18_s T1_s T4_s
S_0h     1     0    0    0    0     0    0    0
S_0h     1     0    0    0    0     0    0    0
S_0h     1     0    0    0    0     0    0    0
T_0h     0     0    0    0    1     0    0    0
T_0h     0     0    0    0    1     0    0    0
T_0h     0     0    0    0    1     0    0    0
S_1h     0     0    1    0    0     0    0    0
S_1h     0     0    1    0    0     0    0    0
S_1h     0     0    1    0    0     0    0    0
T_1h     0     0    0    0    0     0    1    0
T_1h     0     0    0    0    0     0    1    0
T_1h     0     0    0    0    0     0    1    0
S_4h     0     0    0    1    0     0    0    0
S_4h     0     0    0    1    0     0    0    0
S_4h     0     0    0    1    0     0    0    0
T_4h     0     0    0    0    0     0    0    1
T_4h     0     0    0    0    0     0    0    1
T_4h     0     0    0    0    0     0    0    1
S_18h    0     1    0    0    0     0    0    0
S_18h    0     1    0    0    0     0    0    0
S_18h    0     1    0    0    0     0    0    0
T_18h    0     0    0    0    0     1    0    0
T_18h    0     0    0    0    0     1    0    0
T_18h    0     0    0    0    0     1    0    0

S0, T0 and S1 are in the first run and the rest in the second.

dataSet_Norm_exp_log2_ordnet is my normalized expression data as a matrix and blokk looks like this:
> blokk
 [1] 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2

> corfit <- duplicateCorrelation(dataSet_Norm_exp_log2_ordnet, design = designMa, ndups = 1, block = as.factor(blokk))
There were 50 or more warnings (use warnings() to see the first 50)
> warnings()
Warning messages:
1: In sqrt(dfitted.values) ... : NaNs produced
2: In sqrt(dfitted.values) ... : NaNs produced
3: In sqrt(dfitted.values) ... : NaNs produced
4: In sqrt(dfitted.values) ... : NaNs produced
5: In sqrt(dfitted.values) ... : NaNs produced
6: In sqrt(dfitted.values) ... : NaNs produced
7: In sqrt(dfitted.values) ... : NaNs produced
8: In sqrt(dfitted.values) ... : NaNs produced


> fitDesMa <- lmFit(dataSet_Norm_exp_log2_ordnet,design = designMa,block = as.factor(blokk),cor = corfit$consensus)
Error in chol.default(V) :   the leading minor of order 2 is not positive definite

> corfit
$consensus.correlation
[1] 1

$cor
[1] 1

$atanh.correlations
    [1] Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf
   [39] Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf
   [77] Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf
  [115] Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf
  [153] Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf
  [191] Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf Inf

Does any one have any suggestions for why I get all the Inf? Maybe duplicateCorrelation is not the best thing?

> sessionInfo()
R version 2.8.0 (2008-10-20) 
i386-pc-mingw32 

locale:
LC_COLLATE=English_United Kingdom.1252;LC_CTYPE=English_United Kingdom.1252;LC_MONETARY=English_United Kingdom.1252;LC_NUMERIC=C;LC_TIME=English_United Kingdom.1252

attached base packages:
[1] splines   tools     stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] statmod_1.3.8            illuminaHumanv3.db_1.1.3 GOstats_2.8.0            Category_2.8.2           genefilter_1.22.0        survival_2.34-1         
 [7] RBGL_1.18.0              graph_1.20.0             annaffy_1.14.0           KEGG.db_2.2.5            GO.db_2.2.5              RColorBrewer_1.0-2      
[13] limma_2.16.3             lumi_1.8.3               RSQLite_0.7-1            DBI_0.2-4                preprocessCore_1.4.0     mgcv_1.4-1              
[19] affy_1.20.0              annotate_1.20.1          xtable_1.5-4             AnnotationDbi_1.4.2      Biobase_2.2.1           

loaded via a namespace (and not attached):
[1] affyio_1.10.1   cluster_1.11.11 GSEABase_1.4.0  XML_1.99-0     
> 


Regards,
Ingrid


	[[alternative HTML version deleted]]