[R-sig-Geo] Raster Data Management Advice

Michael Sumner md@umner @end|ng |rom gm@||@com
Sun Oct 9 11:01:12 CEST 2022


I would set up a polygon of the bounding box (in the native projection) of
each raster source, and use fields on those polygons to store the details
of interest: xmin,xmax,ymin,ymax, dimension, resolution, crs, and your
other details. Then hone in a areas of interest for different tasks to see
what set of overlapping data you have for it.

There's a lot of fanfare about STAC, but it's really just a JSON-format
with some of the information you could store on a simple polygon dataset
...with STAC as with so many formats you'd have to shoehorn your data into
that more restrictive form (you can always spit out STAC as a side product
of your own rich summary for less sophisticated uses).

The crux is keeping the details of the source's native
projection independent from the representation you use to query it
spatially IMO, just record what's there. Further, the GDAL warper app-lib
(one level below the gdalwarp.exe) is the right tool for doing general
reads, of any number of sources into one specific window of your choosing
in any projection (you could use your dataset described above to limit
which sources get included). You can easily see what you'd get by merging
any number of sources together, and of course more nuanced situations like
a sensible background with more detailed layers merged over that is very
valuable.

Cheers, Mike



On Sat, Oct 8, 2022 at 3:35 AM Alexander Ilich <ailich using mail.usf.edu> wrote:

> Hi, I was wondering if anyone has some advice on how to organize raster
> data so that it is easily queryable by various attributes (e.g. find me all
> the rasters of data type bathymetry, collected by this organization with
> 10m resolution or finer ). Currently we have data on a server organized
> often by when/where it was collected but that can make it difficult to find
> specific rasters that meet a certain criteria. I've created a table as a
> csv file on github <https://github.com/ailich/WFS_Multibeam_Metadata>
> where
> each row is a raster and it has various column attributes describing it
> (e.g. who collected it, what sonar was used, resolution, coordinate system,
> etc) and a path to the filename as a temporary solution, but I think some
> type of spatial database that would allow for querying and then reading
> into R as terra objects, as well as into QGIS and ArcGIS as layers for
> visualization would be optimal as multiple project members use these data.
> Tools I've come across that seem potentially useful include PostGIS and
> Geopackage, but I'm not entirely sure how to properly set them up or if
> they'd suit my needs. Any advice would be greatly appreciated.
>
> Thanks,
> Alex
>
>         [[alternative HTML version deleted]]
>
> _______________________________________________
> R-sig-Geo mailing list
> R-sig-Geo using r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-geo
>


-- 
Michael Sumner
Software and Database Engineer
Australian Antarctic Division
Hobart, Australia
e-mail: mdsumner using gmail.com

	[[alternative HTML version deleted]]



More information about the R-sig-Geo mailing list