Skip to content

Instantly share code, notes, and snippets.

View mdsumner's full-sized avatar

Michael Sumner mdsumner

  • Integrated Digital East Antarctica, Australian Antarctic Division
  • Hobart, Australia
View GitHub Profile
# library(sedonadb)
# 
# # Works with files
# fgb <- system.file("files/natural-earth_cities.fgb", package = "sedonadb")
# sd_read_sf(fgb)

## worked a while already with GDAL
library(lazysf)
url <- "/vsicurl/https://github.com/apache/sedona-db/raw/refs/heads/main/r/sedonadb/inst/files/natural-earth_cities_geo.parquet"
#' peek - browse XYZ tiles in the RStudio Viewer via magick
#'
#' User provides an EPSG:3857 bounding box, peek picks an appropriate zoom,
#' fetches the tiles, composites them with magick, and displays the result.

# --- tile math (this is what grout does) ---

#' Web Mercator full extent
ORIGIN <- -20037508.342789244

Here get raw tiles and calculate median across time. We're only plotting in a faceted index space, but it's lined up relatively.

library(rustycogs)  ## hypertidy/rustycogs  - we need dev-version of async_tiff which is in the Cargo.toml (#PR 265)
library(ximage)
js <- jsonlite::fromJSON(sds::stacit("55GDN", c("2025-01-01", format(Sys.Date()))))  ## hypertidy/sds relies on recent MGRS grid code rather than extent
hrf <- unlist(lapply(js$features$assets[1:19], function(x) x[["href"]]))
all(grepl("tif$",hrf))
                     
## index all the tifs (like virtualizarr, we get the IFD tables)
# 1. Linear percent clip
# low/high are the natural UX knobs; 2%/98% is a solid default,
# tighter for low-contrast scenes.
s2_stretch_linear <- function(x, low = 0.02, high = 0.98) {
v <- terra::values(x, na.rm = TRUE)
lo <- quantile(v, low, na.rm = TRUE)
hi <- quantile(v, high, na.rm = TRUE)
terra::clamp(x, lo, hi, values = TRUE) |>
(\(r) (r - lo) / (hi - lo))()
}

Myriahedral Projections: Implementations

Myriahedral projections (van Wijk, 2008) are not projections in the PROJ sense — there's no forward/inverse formula. They're a mesh-algorithm-projection pipeline:

  1. Build a fine triangular mesh on the sphere (icosahedral subdivision, graticule grid, or geography-adaptive)
  2. Assign edge weights (land/ocean crossings, graticule alignment, etc.)
  3. Compute a minimum spanning tree of the dual graph to decide where to cut
  4. Unfold the tree of faces flat using per-face gnomonic projections

The result has negligible area and angle distortion at the cost of many interrupts.

GDAL-ready basemap tiles

Access common web basemap tile services via GDAL. Returns either WMTS connection strings (for ESRI services with WMTS support) or TMS minidriver XML for XYZ tile services.

Function

basemap <- function(name = NULL, url = NULL, api_key = NULL, tile_level = 19L, bands = 3L) {
  providers <- list(
    # OpenStreetMap (no key)
    OpenStreetMap = "https://tile.openstreetmap.org/${z}/${x}/${y}.png",
library(wk)
library(geos)
library(PROJ)
## flatten all to POLYGON
map <- wk_flatten(as_wkb(rnaturalearth::ne_countries(scale = 50)))
local_laea <- function(x, merctrans) {
  cent <- geos_centroid(x)
  mercxy <- wk_coords(wk_transform(cent, merctrans))[,c("x", "y")]
 xy &lt;- wk_coords(cent)[, c("x", "y")]

Caching Range Requests: What GDAL Already Does (and the Hard Part It Doesn't)

The question of how to cache HTTP range requests — particularly for cloud-native geospatial formats like COG, PMTiles, FlatGeobuf, and cloud-optimized GeoParquet — keeps coming up. Brandon Liu's How many ranges can you fit in one request is a good treatment of the multi-range packing problem. But there's a mature, battle-tested system that already handles much of this at the client level, and its design choices are instructive even for people building entirely different stacks: GDAL's /vsicurl/ virtual filesystem.

GDAL's VSI curl layer and caching

When GDAL reads a cloud-optimized file via /vsicurl/ (or its cloud-specific variants /vsis3/, /vsigs/, /vsiaz/), it performs range request management internally. The behaviour is controlled by a set of configuration options that most users never touch, but that encode a lot of hard-won knowledge about how to read efficie

this is fine