Skip to main navigation Skip to search Skip to main content

Using Isolation Forest and Alternative Data Products to Overcome Ground Truth Data Scarcity for Improved Deep Learning-based Agricultural Land Use Classification Models

  • University of Galway
  • Carleton University
  • Gdansk University of Technology

Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

Abstract

High-quality labelled datasets represent a cornerstone in the development of deep learning models for land use classification. The high cost of data collection, the inherent errors introduced during data mapping efforts, the lack of local knowledge, and the spatial variability of the data hinder the development of accurate and spatially-transferable deep learning models in the context of agriculture. In this paper, we investigate the use of Isolation Forest (IF), an anomaly detection algorithm, to reduce noise in a large-scale, low-resolution alternative ground truth dataset used to train land use deep learning models. We use a modest-size, high-resolution and high-fidelity manually collected ground-truth dataset to calibrate Isolation Forest parameters and evaluate our approach, highlighting the relatively low cost of the methodology. Our data-centric methodology demonstrates the efficacy of deep learning methods coupled with IF to create mid-resolution land-use models and map products for agriculture using an alternative ground-truth dataset. Moreover, we compare our deep learning approach with a traditional algorithm used in remote sensing and evaluate the spatial transferability of the created models. Finally, we reflect upon the lessons learnt and future work.

Original languageEnglish
Title of host publicationProceedings of the 56th Annual Hawaii International Conference on System Sciences, HICSS 2023
EditorsTung X. Bui
PublisherIEEE Computer Society
Pages4978-4987
Number of pages10
ISBN (Electronic)9780998133164
Publication statusPublished - 2023
Event56th Annual Hawaii International Conference on System Sciences, HICSS 2023 - Virtual, Online, United States
Duration: 3 Jan 20236 Jan 2023

Publication series

NameProceedings of the Annual Hawaii International Conference on System Sciences
Volume2023-January
ISSN (Print)1530-1605

Conference

Conference56th Annual Hawaii International Conference on System Sciences, HICSS 2023
Country/TerritoryUnited States
CityVirtual, Online
Period3/01/236/01/23

Keywords

  • agriculture
  • data
  • data-centric AI
  • datasets
  • Deep learning
  • GIS
  • ground truth
  • isolation forest

Fingerprint

Dive into the research topics of 'Using Isolation Forest and Alternative Data Products to Overcome Ground Truth Data Scarcity for Improved Deep Learning-based Agricultural Land Use Classification Models'. Together they form a unique fingerprint.

Cite this