ReSFlow: A Remote Sensing Imagery Data-Flow for Improved Model Generalization

by Wadzanai D Lunga, Jacob W Arndt, Jonathan Gerrand, Robert N Stewart

Publication Type

Journal

Journal Name

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

Publication Date

October, 2021

Page Numbers

10468 to 10483

Volume

Issue

View DOI Listing

Abstract

As satellite imagery collections continue to grow at an astonishing rate, so is the demand for automated and scalable object detection and segmentation. Scaling computational activities demand models that generalize well across various challenges that can hamper progress, including diverse imaging and geographic conditions, sampling bias in training data, manual ground truth collection, tooling for model reuse and accountability assessment, and poor model training strategies. A great deal of progress has been made on these challenges. We contribute to the improvement through further development of ReSFlow, a workflow that breaks the problem of model generalization into a collection of specialized exploitations. ReSFlow partitions imagery collections into homogeneous buckets equipped with exploitation models trained to perform well under each bucket.s specific context. Essentially, ReSFlow aims for generalization through stratification. Therefore, within a bucket, exploitation is a homogeneous process that mitigates heterogeneity challenges, including the number of training data and data biases that can occur over varied conditions. Furthermore, custom model architectures and rich training strategies effective for within-bucket conditions can be developed. Meanwhile, across buckets, performance metrics support systematic views of the workflow leading to optimal data labeling allocations and indications that further specialization is warranted. Herein, we discuss the formation of models during the framework's “Offline Initialization” stage. Lastly, we exploit the inherent parallelism due to bucketing to introduce model reuse and demonstrate efficacy by reducing an 89-day manual data labeling cost to zero-days in a new area of interest.

ReSFlow: A Remote Sensing Imagery Data-Flow for Improved Model Generalization

Abstract

Researchers

Organizations