[Bioc-sig-seq] frequency of A, C, G, T

Patrick Aboyoun paboyoun at fhcrc.org
Thu Aug 27 17:35:02 CEST 2009


Ali,
Biostrings's consensusMatrix method for XStringSet (e.g. DNAStringSet) 
objects performs the same operation as ShortReads's alphabetByCycle, in 
case you were looking for non-ShortRead alternatives.


Patrick



Ali Mohamadian wrote:
> Hi Nenad,
> I was using "alphabetFrequency"  but your advice to use "alphabetByCycle" was very helpful for my purposes.
> Thanks
> Ali
>
>
> ________________________________
> From: Nenad Bartonicek <nenad at ebi.ac.uk>
>
> Cc: bioc-sig-sequencing at r-project.org
> Sent: Thursday, August 27, 2009 11:36:36 AM
> Subject: RE: frequency of A, C, G, T
>
>
> Hi Ali,
>
> There is a function "alphabetFrequency" in the package Biostrings.
> If you were thinking of the total frequency of letters, there is also a function "alphabetByCycle" in the package ShortRead.
>
> Hope it helps,
>
> Nenad
>
> Date: Thu, 27 Aug 2009 01:45:48 -0700 (PDT)
>
>   
>> Subject: [Bioc-sig-seq] frequency of A, C, G, T
>> To: bioc-sig-sequencing at r-project.org
>> Message-ID: <120808.21354.qm at web34203.mail.mud.yahoo.com>
>> Content-Type: text/plain
>>
>> Hi All,
>> I wonder is there any R function to give me the frequency of  A,C,G and T(and  also a,c,g,t) in my reads?
>> Data look like
>> chrY  46418   46469   cgaCGTtcCttcaggcAaagCGCtTGCcCAAatTgCcTTTTtcCCGTCCAC                0 - 46418    46469   0,0,255
>> chrY  119520 119570 AAGGATGGACCATCTTGTgATTGCCATATCCAgggaTCCATCCCATgatca     0 + 119520 119570  255,0,0
>> chrY  119740 119791 aCCCAAGGAGCtAAAGGGATCTGCGaCCCTATAGGTGGAACAACATTATGA 0 - 119740  119791  0,0,255
>> chrY  119740 119791 acccAAGGagctAAAGggatatgcGacccTatAgGtGGAacAacATTatga                   0 - 119740  119791  0,0,255
>> chrY  150045 150096 ttAtTAAGAAAAAttcAGGTAAATGGAGactCGtACAGGACCTTAGAAAAA       0 - 150045  150096   0,0,255
>> chrY  190995 191046 ggatggacacacacacaCacacaCaCACACACACACACACACaCACACgCA              0 - 190995  191046   0,0,255
>> chrY  198579 198629 GTTGCTGGGAtTTGAACTCAGGACCTTTGGAaGAGCAATCGGTGCTCTTaa   0 + 198579 198629   255,0,0
>> chrY  237529 237579 AAACTCAGAAATCTGCCTGCCTCTGCCTCCCAAGTgCTGGGaTTAAAgGcC   0 + 237529 237579   255,0,0
>>  
>> Thanks
>> Ali
>>
>>     
>
>
> Nenad Bartonicek
> PhD student, Enright group
> European Bioinformatics Institute
> Hinxton
> Cambridge
> CB10 1SD
> United Kingdom
> tel: +44-755-435-9057
>
>
>
>       
> 	[[alternative HTML version deleted]]
>
>   
> ------------------------------------------------------------------------
>
> _______________________________________________
> Bioc-sig-sequencing mailing list
> Bioc-sig-sequencing at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing
>



More information about the Bioc-sig-sequencing mailing list