[Bioc-devel] Changes to Hits class in devel branch affecting distanceToNearest method

Hervé Pagès hpages at fredhutch.org
Thu Nov 6 03:32:54 CET 2014


Hi Peter,

The new validity method for Hits revealed some issues with
the behavior of "distanceToNearest" method for GRanges objects.
The major issue being that it sometimes returns a Hits
object with NAs in it. Val will address this in the next few
days.

Cheers,
H.


On 11/05/2014 06:21 PM, Peter Hickey wrote:
> This message may be a bit premature or redundant since after writing it I now see that Herve is in the midst of work on the Hits class in S4Vectors.
>
> My package GenomicTuples is currently failing R CMD check in the devel branch (http://bioconductor.org/checkResults/devel/bioc-LATEST/GenomicTuples/zin2-checksrc.html). I tracked down the error to a problem with my distanceToNearest,GTuples,GTuples-method. However, this is simply defined via inheritance to the distanceToNearest,GenomicRanges,GenomicRanges-method and so I believe the problem lies with the distanceToNearest,GenomicRanges,GenomicRanges-method.
>
> The following example adapted from nearest-methods {GenomicRanges} docs demonstrates the change in behaviour between the release and devel branch. It appears to be due to a new validity check of the Hits object returned by distanceToNearest. As I said, I now see that Herve recently started some work on the Hits class in S4Vectors, which includes this validity check, so this is really just a heads up that these changes affect the distanceToNearest method, in case this hasn't already been noted.
>
> Thanks,
> Pete
>
> #---------------------------------------------------------------------------------------------------
> # BioC release
>>    library(GenomicRanges)
>>    query <- GRanges(c("A", "B"), IRanges(c(1, 5), width=1))
>>    subject <- GRanges("A", IRanges(c(6, 5, 13), c(10, 10, 15)))
>>    distanceToNearest(query, subject)
> Hits of length 2
> queryLength: 2
> subjectLength: 3
>    queryHits subjectHits  distance
>     <integer>   <integer> <integer>
>   1         1           2         3
>   2         2          NA        NA
>> sessionInfo()
> R version 3.1.1 (2014-07-10)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>
> locale:
>   [1] LC_CTYPE=en_AU.UTF-8       LC_NUMERIC=C
>   [3] LC_TIME=en_AU.UTF-8        LC_COLLATE=en_AU.UTF-8
>   [5] LC_MONETARY=en_AU.UTF-8    LC_MESSAGES=en_AU.UTF-8
>   [7] LC_PAPER=en_AU.UTF-8       LC_NAME=C
>   [9] LC_ADDRESS=C               LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_AU.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] stats4    parallel  stats     graphics  grDevices utils     datasets
> [8] methods   base
>
> other attached packages:
> [1] GenomicRanges_1.18.1 GenomeInfoDb_1.2.2   IRanges_2.0.0
> [4] S4Vectors_0.4.0      BiocGenerics_0.12.0
>
> loaded via a namespace (and not attached):
> [1] XVector_0.6.0
> #---------------------------------------------------------------------------------------------------
>
> #---------------------------------------------------------------------------------------------------
> # BioC devel
>>    library(GenomicRanges)
>> query <- GRanges(c("A", "B"), IRanges(c(1, 5), width=1))
>> subject <- GRanges("A", IRanges(c(6, 5, 13), c(10, 10, 15)))
>> distanceToNearest(query, subject)
> Error in validObject(.Object) :
>    invalid class “Hits” object: 'subjectHits(x)' must contain non-NA values >= 1 and <= 'subjectLength(x)'
>> sessionInfo()
> R Under development (unstable) (2014-10-29 r66891)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>
> locale:
>   [1] LC_CTYPE=en_AU.UTF-8       LC_NUMERIC=C
>   [3] LC_TIME=en_AU.UTF-8        LC_COLLATE=en_AU.UTF-8
>   [5] LC_MONETARY=en_AU.UTF-8    LC_MESSAGES=en_AU.UTF-8
>   [7] LC_PAPER=en_AU.UTF-8       LC_NAME=C
>   [9] LC_ADDRESS=C               LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_AU.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] stats4    parallel  stats     graphics  grDevices utils     datasets
> [8] methods   base
>
> other attached packages:
> [1] GenomicRanges_1.19.4 GenomeInfoDb_1.3.6   IRanges_2.1.5
> [4] S4Vectors_0.5.3      BiocGenerics_0.13.0
>
> loaded via a namespace (and not attached):
> [1] XVector_0.7.1
> #---------------------------------------------------------------------------------------------------
>
> --------------------------------
> Peter Hickey,
> PhD Student/Research Assistant,
> Bioinformatics Division,
> Walter and Eliza Hall Institute of Medical Research,
> 1G Royal Parade, Parkville, Vic 3052, Australia.
> Ph: +613 9345 2324
>
> hickey at wehi.edu.au
> http://www.wehi.edu.au
>
> ______________________________________________________________________
> The information in this email is confidential and inte...{{dropped:19}}



More information about the Bioc-devel mailing list