A Resolution Extrapolation Method for Transformers Oriented Towards Scientific Computing

  • Ziming Wang
  • , Zeyu Shi
  • , Qinghe Ye
  • , Yue Wang
  • , Zhi Yu
  • , Fei Zhou
  • , Haoyi Zhou
  • , Qingyun Sun
  • , Zhenying Tai*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Solving partial differential equations (PDEs) can accurately simulate the distribution, propagation, and thermal properties of electromagnetic fields in cables, thereby ensuring the efficient and safe operation of power systems. Although existing PDE solvers based on the Transformer can simulate multi-physical systems and achieve unification in multi-physics field modeling, they lack the capability to extrapolate resolution and fail to unify the resolution of physical fields. This paper focuses on the application of Transformer-based neural networks in multi-physics field simulation computations. By integrating both the spatial and temporal characteristics of physical field data, we propose a novel Spatial-Temporal Positional Encoding (STPE) method, enabling the model to learn the temporal and spatial differential relationships inherent in PDEs. With this positional encoding, we observe a "diagonalization"phenomenon in the model's attention scores. However, as the model extrapolates to higher resolutions, this "diagonalization"weakens, and the model's prediction accuracy significantly decreases. Therefore, we further propose Spatially Localized Attention (SPA) that allows Transformer-based PDE solvers to have the capacity for resolution extrapolation, enabling them to effectively adapt and extend to high-resolution physical fields during the inference stage, even if trained only with low-resolution physical field data. Experimental results demonstrate that after applying STPE, the model not only retains the capability to uniformly model multiple physical fields but also shows improved prediction accuracy on forward tasks, outperforming PINNs, U-Net, and FNO comprehensively. Additionally, the introduction of SPA can effectively alleviate the problem of accuracy decline when the model performs resolution extrapolation.

Original languageEnglish
Title of host publicationProceedings of 2024 8th Asian Conference on Artificial Intelligence Technology, ACAIT 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages194-199
Number of pages6
ISBN (Electronic)9798331517090
DOIs
StatePublished - 2024
Event8th Asian Conference on Artificial Intelligence Technology, ACAIT 2024 - Fuzhou, China
Duration: 8 Nov 202410 Nov 2024

Publication series

NameProceedings of 2024 8th Asian Conference on Artificial Intelligence Technology, ACAIT 2024

Conference

Conference8th Asian Conference on Artificial Intelligence Technology, ACAIT 2024
Country/TerritoryChina
CityFuzhou
Period8/11/2410/11/24

Keywords

  • resolution extrapolation
  • spatial-temporal positional encoding
  • spatially localized attention

Fingerprint

Dive into the research topics of 'A Resolution Extrapolation Method for Transformers Oriented Towards Scientific Computing'. Together they form a unique fingerprint.

Cite this