[R-sig-Geo] Encoding of attributes in SpatVector (terra package)
Ben Tupper
btupper @end|ng |rom b|ge|ow@org
Mon Jun 21 18:45:09 CEST 2021
Hi,
This quickly gets beyond my chops. Have you considered filing a
reproducible example in the bug tracker? You might search old issues
there first in case this has been resolved by others. Also, search
with RSeek.org or the like.
https://cran.r-project.org/web/packages/terra/index.html
Cheers,
Ben
On Sun, Jun 20, 2021 at 9:36 PM Tim Elrick <abo using elrick.de> wrote:
>
> Hi Ben,
>
> Sorry, now as plain text...
>
> Thanks for your response. I guess, that is a Windows encoding problem
> then, as I get (below I also included how they differ):
>
> ###
> identical(mtl.sf$NOM, mtl.spv$NOM)
> [1] FALSE
> > sessionInfo()
> R version 4.1.0 (2021-05-18)
> Platform: x86_64-w64-mingw32/x64 (64-bit)
> Running under: Windows 10 x64 (build 17763)
>
> Matrix products: default
>
> locale:
> [1] LC_COLLATE=English_Canada.1252 LC_CTYPE=English_Canada.1252
> LC_MONETARY=English_Canada.1252
> [4] LC_NUMERIC=C LC_TIME=English_Canada.1252
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] sf_1.0-0 terra_1.3-4 forcats_0.5.1 stringr_1.4.0
> dplyr_1.0.7 purrr_0.3.4 readr_1.4.0
> [8] tidyr_1.1.3 tibble_3.1.2 ggplot2_3.3.4 tidyverse_1.3.1
>
> loaded via a namespace (and not attached):
> [1] tidyselect_1.1.1 haven_2.4.1 lattice_0.20-44
> colorspace_2.0-1 vctrs_0.3.8 generics_0.1.0
> [7] utf8_1.2.1 rlang_0.4.11 e1071_1.7-7
> pillar_1.6.1 glue_1.4.2 withr_2.4.2
> [13] DBI_1.1.1 sp_1.4-5 dbplyr_2.1.1
> modelr_0.1.8 readxl_1.3.1 lifecycle_1.0.0
> [19] munsell_0.5.0 gtable_0.3.0 cellranger_1.1.0
> rvest_1.0.0 raster_3.4-13 codetools_0.2-18
> [25] labeling_0.4.2 class_7.3-19 fansi_0.5.0
> broom_0.7.7 Rcpp_1.0.6 KernSmooth_2.23-20
> [31] scales_1.1.1 backports_1.2.1 classInt_0.4-3
> jsonlite_1.7.2 farver_2.1.0 fs_1.5.0
> [37] digest_0.6.27 hms_1.1.0 stringi_1.6.2 grid_4.1.0
> cli_2.5.0 tools_4.1.0
> [43] magrittr_2.0.1 proxy_0.4-26 crayon_1.4.1
> pkgconfig_2.0.3 ellipsis_0.3.2 xml2_1.3.2
> [49] reprex_2.0.0 lubridate_1.7.10 assertthat_0.2.1 httr_1.4.2
> rstudioapi_0.13 R6_2.5.0
> [55] units_0.7-2 compiler_4.1.0
> ###
>
> Here is the difference:
> ###
> > tail(mtl.sf$NOM)
> [1] "Montréal-Est" "Anjou"
> "Pierrefonds-Roxboro" "Sainte-Anne-de-Bellevue"
> [5] "Verdun" "Baie-d'Urfé"
> > tail(mtl.spv$NOM)
> [1] "Montréal-Est" "Anjou"
> "Pierrefonds-Roxboro" "Sainte-Anne-de-Bellevue"
> [5] "Verdun" "Baie-d'Urfé"
> ###
>
> So, basically, vect() seems to have problems to interpret non-ASCII
> characters in Windows.
>
> Is there an easy way to fix that, e.g. by setting a parameter or is this
> a bug for the Windows edition?
>
> Thanks,
> Tim
>
> On 2021-06-15 18:32, Ben Tupper wrote:
> Hi,
>
>
> Alas, HTML mail gets mangled - try setting your email client to send
> plan text.
>
> I don't know what the NOM variable contents are supposed to look like,
> but I find they are identical. Can you post to the list your evidence
> that shows how they differ and what makes one right versus the other
> wrong? Also, always add the output of sessionInfo() as shown below.
>
> library(sf)
> library(terra)
>
> url <-
> "https://data.montreal.ca/dataset/00bd85eb-23aa-4669-8f1b-ba9a000e3dd8/resource/e9b0f927-8f75-458c-8fda-b5da65cc8b73/download/limadmin.geojson"
>
> mtl.sf <- read_sf(url)# convert sf object to SpatVector object
>
> mtl.spv <- vect(mtl.sf)
>
> identical(mtl.sf$NOM, mtl.spv$NOM)
> # [1] TRUE
>
> sessionInfo()
> # R version 4.0.3 (2020-10-10)
> # Platform: x86_64-apple-darwin17.0 (64-bit)
> # Running under: macOS Big Sur 10.16
> #
> # Matrix products: default
> # LAPACK:
> /Library/Frameworks/R.framework/Versions/4.0/Resources/lib/libRlapack.dylib
> #
> # locale:
> # [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
> #
> # attached base packages:
> # [1] stats graphics grDevices utils datasets methods base
> #
> # other attached packages:
> # [1] terra_1.2-10 sf_0.9-8 dplyr_1.0.5
> #
> # loaded via a namespace (and not attached):
> # [1] Rcpp_1.0.6 rstudioapi_0.13 raster_3.4-5
> magrittr_2.0.1 units_0.7-1 tidyselect_1.1.1
> # [7] lattice_0.20-41 R6_2.5.0 rlang_0.4.11
> fansi_0.4.2 blob_1.2.1 tools_4.0.3
> # [13] grid_4.0.3 KernSmooth_2.23-17 utf8_1.2.1
> cli_2.5.0 e1071_1.7-6 DBI_1.1.1
> # [19] ellipsis_0.3.2 class_7.3-17 assertthat_0.2.1
> tibble_3.1.1 lifecycle_1.0.0 crayon_1.4.1
> # [25] purrr_0.3.4 codetools_0.2-16 vctrs_0.3.8
> glue_1.4.2 sp_1.4-5 proxy_0.4-25
> # [31] compiler_4.0.3 pillar_1.6.0 generics_0.1.0
> classInt_0.4-3 pkgconfig_2.0.3
>
> Cheers,
> Ben
>
>
> On Tue, Jun 15, 2021 at 11:57 AM Tim Elrick <abo using elrick.de> wrote:
> >
> > Hi all,
> >
> > Does anyone know how to change encoding for attributes in a SpatVector
> > (terra package)?
> >
> > library(sf) library(terra)
> > # importing a geojson file as sf object url <-
> >
> "https://data.montreal.ca/dataset/00bd85eb-23aa-4669-8f1b-ba9a000e3dd8/resource/e9b0f927-8f75-458c-8fda-b5da65cc8b73/download/limadmin.geojson"
> > mtl.sf <- read_sf(url)# convert sf object to SpatVector object mtl.spv
> > <- vect(mtl) # while mtl.sf$NOM # displays French letters correctly,
> > mtl.spv$NOM # does not anymore Thanks, Tim
> >
> >
> > [[alternative HTML version deleted]]
> >
> > _______________________________________________
> > R-sig-Geo mailing list
> > R-sig-Geo using r-project.org
> > https://stat.ethz.ch/mailman/listinfo/r-sig-geo
>
>
>
>
--
Ben Tupper (he/him)
Bigelow Laboratory for Ocean Science
East Boothbay, Maine
http://www.bigelow.org/
https://eco.bigelow.org
More information about the R-sig-Geo
mailing list