Skip to main navigation Skip to search Skip to main content

AgriFM: A multi-source temporal remote sensing foundation model for Agriculture mapping

  • Wenyuan Li
  • , Shunlin Liang*
  • , Keyan Chen
  • , Yongzhe Chen
  • , Han Ma
  • , Jianglei Xu
  • , Yichuan Ma
  • , Yuxiang Zhang
  • , Shikang Guan
  • , Husheng Fang
  • , Zhenwei Shi
  • *Corresponding author for this work
  • The University of Hong Kong
  • Beihang University
  • Wuhan University

Research output: Contribution to journalArticlepeer-review

Abstract

Climate change and population growth intensify the demand for precise agriculture mapping to enhance food security. Such mapping tasks require robust modeling of multi-scale spatiotemporal patterns from fine field textures to landscape context, and from short-term phenology to full growing-season dynamics. Existing methods often process spatial and temporal features separately, limiting their ability to capture essential agricultural dynamics. While transformer-based remote sensing foundation models (RSFMs) offer unified spatiotemporal modeling ability, most of them remain suboptimal: they either use fixed windows that ignore multi-scale crop characteristics or neglect temporal information entirely. To address these gaps, we propose AgriFM, a multi-source, multi-temporal foundation model for agriculture mapping. AgriFM introduces a synchronized spatiotemporal downsampling strategy within a Video Swin Transformer backbone, enabling efficient handling of long and variable-length satellite time series while preserving multi-scale spatial and phenological information. It is pre-trained on a globally representative dataset comprising over 25 million samples from MODIS, Landsat-8/9, and Sentinel-2 with land cover fractions as pre-training supervision. AgriFM further integrates a versatile decoder specifically designed to dynamically fuse multi-source features from different stages of backbone and accommodate varying temporal lengths, thereby supporting consistent and scalable agriculture mapping across diverse satellite sources and task requirements. It supports diverse tasks including agricultural land mapping, field boundary delineation, agricultural land use/land cover mapping, and specific crop mapping (e.g., winter wheat and paddy rice) with different data sources. Comprehensive evaluations show that AgriFM consistently outperforms existing deep learning models and general-purpose RSFMs across multiple agriculture mapping tasks. Codes and models are available athttps://github.com/flyakon/AgriFMandhttps://glass.hku.hk

Original languageEnglish
Article number115234
JournalRemote Sensing of Environment
Volume334
DOIs
StatePublished - 1 Mar 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 2 - Zero Hunger
    SDG 2 Zero Hunger
  2. SDG 13 - Climate Action
    SDG 13 Climate Action
  3. SDG 15 - Life on Land
    SDG 15 Life on Land

Keywords

  • Agriculture mapping
  • Deep learning
  • Foundation model
  • Remote sensing

Fingerprint

Dive into the research topics of 'AgriFM: A multi-source temporal remote sensing foundation model for Agriculture mapping'. Together they form a unique fingerprint.

Cite this