[BioC] VariantAnnotation: fine define Locating variants in and around genes
Fabrice Tourre
fabrice.ciup at gmail.com
Thu Jan 31 22:48:54 CET 2013
Valerie,
Thank you for your reply.
Is there a function in VariantAnnotation to know whether a snp is
within transcription region but outside coding region? Or is it in
first exon/intron?
On Thu, Jan 31, 2013 at 4:30 PM, Valerie Obenchain <vobencha at fhcrc.org> wrote:
> Hi Fabrice,
>
> To identify snps (or any ranges) in introns only, use IntronVariants() as
> the 'region' argument. The CodingVariants are the exon regions. If you want
> all regions except coding, I would suggest using AllVariants().
>
> This output is from the man page example. The 'loc_coding' name is
> misleading since AllVariants were use as 'region'. I have changed it to
> 'loc_all' in the devel branch.
>
>> loc_coding <- locateVariants(vcf_adj, txdb, AllVariants())
>> loc_coding
> GRanges with 16 ranges and 7 metadata columns:
> seqnames ranges strand | LOCATION QUERYID
> <Rle> <IRanges> <Rle> | <factor> <integer>
> chr1 [ 13220, 13220] * | intron 1
> chr1 [ 13220, 13220] * | spliceSite 1
> chr1 [ 13220, 13220] * | intron 1
> chr1 [ 13220, 13220] * | intron 1
> chr1 [ 13220, 13220] * | spliceSite 1
> ...
> ...
>
> This example has variants in splice sites, introns, coding and intergenic
> regions.
>
>> tbl <- table(loc_coding$LOCATION)
>> tbl[tbl > 0]
>
> spliceSite intron coding intergenic
> 2 7 2 5
>
> The result can be subset on LOCATION for the region of interest. The QUERYID
> column maps back to the row number in the original 'query' argument to
> locateVariants().
>
> introns <- loc_coding[loc_coding$LOCATION == "intron", ]
>> head(introns, 3)
> GRanges with 3 ranges and 7 metadata columns:
> seqnames ranges strand | LOCATION QUERYID TXID
> <Rle> <IRanges> <Rle> | <factor> <integer> <integer>
> chr1 [13220, 13220] * | intron 1 1
> chr1 [13220, 13220] * | intron 1 2
> chr1 [13220, 13220] * | intron 1 3
>
>
> Valerie
>
>
>
> On 01/31/2013 12:34 PM, Fabrice Tourre wrote:
>>
>> Dear list,
>>
>> I am using VariantAnnotation to Locate variants in and around genes.
>>
>> In VariantAnnotation, the region is defined as: Coding Variants,
>> IntronVariants, FiveUTRVariants, ThreeUTRVariants, IntergenicVariants,
>> SpliceSiteVariants or PromoterVariants.
>>
>> If it possible to know whether a snp is in exon/intron within
>> transcription region but outside coding region?
>>
>> Thanks.
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at r-project.org
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives:
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
>
More information about the Bioconductor
mailing list