Skip to content

Instantly share code, notes, and snippets.

@alukach
Last active April 9, 2018 11:34
Show Gist options
  • Save alukach/bb52c577ff4d98901c2d6eee8b88095d to your computer and use it in GitHub Desktop.
Save alukach/bb52c577ff4d98901c2d6eee8b88095d to your computer and use it in GitHub Desktop.
Calgary building detection

Generating building geometries from satellite imagery

Goals

Currently, the OpenStreetMap dataset for the Calgary, Alberta, Canada region is lacking most of the city's building footprints. The City of Calgary possesses a dataset of the building footprints, however it opts to sell this data rather than distribute it for free via its open data catalogue. The City of Edmonton does distribute its respective building footprint dataset freely and thus the footprints are available via OpenStreetMap. Utilizing high resolution satellite imagery of the City of Edmonton and the building footprint dataset from OpenStreetMap, it should be feasible to train a neural network classifier to generate building footprint polygons from provided satellite imagery of the Calgary region.

Methodology

A similar challenge Kaggle competition exists: Dstl's Satellite Imagery competition

Reviews of solution:

Available Tooling

Data Preparation for Satellite Machine Learning

The tool downloads OpenStreetMap QA Tile information and satellite imagery tiles and saves them as an .npz file for use in Machine Learning training.

Data pipeline for machine learning with OpenStreetMap

A pipeline to simplify building a set of training data for aerial-imagery- and OpenStreetMap- based machine learning. The idea is to use OSM QA Tiles to generate "ground truth" images where each color represents some category derived from OSM features. Being map tiles, it's then pretty easy to match these up with the desired input imagery.

Question: Does label-maker make this repo redundant? Label-maker states that it "builds on the concepts of skynet-data."

Data

Licensing

From Mapbox TOS:

You may use Studio or third-party software to trace Mapbox maps solely comprised of satellite imagery ("Mapbox Satellite Imagery") and produce derivative vector datasets (i) for non-commercial purposes and (ii) for OpenStreetMap. If you are on at least a Premium or Commercial plan, you may also use Studio to trace Mapbox Satellite Imagery for commercial purposes, but may only trace up to 100 points, lines, or polygons per year. Except as permitted by this paragraph, you may not trace Mapbox maps.

This appears to cover the Digital Globe imagery:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment