Getting Started With SpaceNet Data

阿新 • • 發佈：2019-01-12

Getting Started With SpaceNet Data

The first SpaceNet challenge is complete, but the data remains available for download and analysis on AWS. This dataset contains a massive amount of labeled data in GeoJSON files, a format that may be unfamiliar to many in the computer vision field. This post aims to lower the barrier of entry for exploring SpaceNet data by demonstrating methods to transform and visualize the raw SpaceNet GeoJSON labels into formats more conducive for machine learning, namely NumPy arrays and image masks. Further motivating the study of SpaceNet data is the release of a new

SpaceNet point of interest dataset. We include python code for the interested reader, and refer the reader to the SpaceNet Challenge repository for more utilities.

December 2017 update: updated code is also available here.

1. Data Access

After creating an AWS account, download the data at the SpaceNet AWS portal

. Detailed descriptions of data formats and download instructions can be found here. In short, the command to download processed 200m x 200m image tiles with associated building footprints is:

aws s3api get-object --bucket spacenet-dataset \
    --key AOI_1_Rio/processedData/processedBuildingLabels.tar.gz \
    --request-payer requester processedBuildingLabels.tar.gz

For this post, we will focus on the TopCoder challenge dataset. Upon downloading and expanding the tarballs, the TopCoder training directory structure should appear as follows:

Figure 1. SpaceNet TopCoder data directory

In this post we will focus on the high-resolution 3-band imagery as well as the vector data.

2. Data Inspection

Image cutouts for the pan-sharpened 3-band imagery are 438–439 pixels in width, and 406–407 pixels in height. 8-band images have not been pan-sharpened and so have 1/4 the resolution of the 3-band imagery at 110 x 102 pixels. For each unique image ID we find a corresponding entry in the vectordata/geoJson directory with image footprints.

Figure 2. Random image from the SpaceNet training dataset (3band_013022223130_Public_img124.tif).

Figure 3. First entry of the GeoJSON label file associated with Figure 2. Here we show the first building label associated with the image; note that coordinates are stored as a WKT polygon or multipolygon with coordinates stored as [longitude, latitude, elevation]. The elevation field is always zero for this dataset.

2. Ground Truth Transform

Computer vision algorithms tend to operate in pixel space, where locations are reported on the matrix of pixel positions rather than latitude and longitude. After the initial data download, or extraction, the second step in the extract-transform-load (ETL) process is to transform the latitude-longitude coordinates in the GeoJSON label files to pixel coordinates. We describe three methods of transforming the GeoJSON label files into pixel coordinates in various formats.

2.1 Building Outline Coordinates

The GeoJSON file lists building polygon vertices in latitude and longitude. Transforming these vertices into pixel coordinates requires knowledge of the image extent and precise geometric coordinate transform. This information (along with much more) can be extracted with the GDAL code suite. A number of sophisticated functions using GDAL and other geospatial libraries are available in the SpaceNet utilities repository on GitHub. The code below takes the GeoJSON label file and corresponding image and returns two coordinate arrays, one in geospatial coordinates (latitude and longitude) and one in pixel coordinates.

Code snippet 1. Function to transform GeoJSON label files to an array of coordinates (both lat,lon and pixel).

We can inspect our transform by overlaying the ground truth polygons on the input image using matplotlib.

Code snippet 2. Function to plot the truth coordinates for an input image.

Getting Started With SpaceNet Data

Getting Started With SpaceNet Data

1. Data Access

2. Data Inspection

2. Ground Truth Transform

2.1 Building Outline Coordinates

Getting Started With SpaceNet Data

[2] Getting Started With Data Reflections

LLVM每日談之十九 LLVM的第一本系統的書<Getting Started with LLVM Core Libraries>

Getting started with Kentico

[原創]Getting Started with Skywalking

Getting started with docker - 1.Orientation and setup

Getting Started with Processing 第四章總結

Getting Started with Processing 第五章的easing問題

Getting Started with Processing 第五章的easing問題(2)

Getting Started with Processing 第五章的總結

Getting started with Processing 第七章總結

Getting Started with XlsxWriter

Getting Started with Processing 第十章——物件

Getting Started with Processing 第十章——對象

Getting started with Processing 示例11-9 追隨鼠標移動

Getting started with Processing 示例11-9 追隨滑鼠移動

Getting started with Processing 第十一章——陣列

Getting started with Processing 第十三章——延伸(1)

Getting started with the Zowe WebUi

Getting started with UX research

Getting Started With SpaceNet Data

Getting Started With SpaceNet Data

1. Data Access

2. Data Inspection

2. Ground Truth Transform

2.1 Building Outline Coordinates

相關推薦