GeoImageNet: A multi-source natural feature benchmark dataset for GeoAI and supervised machine learning

GeoInformatica

Abstract

The field of GeoAI, or Geospatial Artificial Intelligence, has undergone rapid development since 2017. It has been widely applied to address environmental and social science problems, from understanding climate change to tracking the spread of infectious disease. A foundational task in advancing GeoAI research is the creation of open benchmark datasets to train and evaluate the performance of GeoAI models. While a number of datasets have been published, very few have centered on the natural terrain and its landforms. To bridge this gap, this paper introduces a first-of-its-kind benchmark dataset, GeoImageNet, which supports natural feature detection in a supervised machine-learning paradigm. A distinctive feature of this dataset is the fusion of multi-source data, including both remote sensing imagery and digital elevation model (DEM) data, in depicting spatial objects of interest. This multi-source dataset allows a GeoAI model to extract rich spatio-contextual information and so gain stronger confidence in high-precision object detection and recognition. The image dataset is tested with a multi-source GeoAI extension against two well-known object detection models, Faster-RCNN and RetinaNet. The results demonstrate the robustness of the dataset in helping GeoAI models achieve convergence, and the superiority of multi-source data in yielding much higher prediction accuracy than the commonly used single data source.

Publication type: Article
Publication subtype: Journal Article
Title: GeoImageNet: A multi-source natural feature benchmark dataset for GeoAI and supervised machine learning
Series title: GeoInformatica
DOI: 10.1007/s10707-022-00476-z
Volume: 27
Year published: 2023
Language: English
Publisher: Springer
Contributing office(s): Center for Geospatial Information Science (CEGIS)
Description: 22 p.
First page: 619
Last page: 640