[Bioc-sig-seq] Assessing Transcriptome Coverage
Abhishek Pratap
abhishek.vit at gmail.com
Mon Aug 17 21:07:45 CEST 2009
Hi
I dont have a lot of experience with plotting large amount of data
points and clearly my question reflects that. :)
summary(lane1)
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
0.000 0.040 0.180 5.186 0.620 39730.000 2264.000
Thanks for your help.
-Abhi
On Mon, Aug 17, 2009 at 3:05 PM, Sean Davis<seandavi at gmail.com> wrote:
>
>
> On Mon, Aug 17, 2009 at 3:01 PM, Abhishek Pratap <abhishek.vit at gmail.com>
> wrote:
>>
>> Hi Sean
>>
>> Thanks for your suggestion on both the mailing lists. I am now reading
>> the coverage values from a file and storing them as a data.frame and
>> then creating a new numeric vector for each lane. Each vector may have
>> 15000-45000 entries. The values are integers with a significant
>> difference in values, some could be between 0-1 eg (0.45,0.89) and
>> then I also have values in range like (4000, 44000). I am just taking
>> random examples to explain the bias in the data.
>>
>> When I plot a histogram I just see one big bar. I feel the bins are
>> not created effectively. I also tried couple of different options in
>> the R hist function but with same result.
>>
>> hist(lane2, freq=TRUE, breaks=10);
>> hist(lane2, freq=TRUE, include.lowest=TRUE);
>
> What does summary(lane2) show? You may need to transform the data to make
> it more presentable (log?).
>
> Sean
>
>
>
More information about the Bioc-sig-sequencing
mailing list