TY - JOUR
T1 - Symbolic identification of tensor equations in multidimensional physical fields
AU - Chen, Tianyi
AU - Yang, Hao
AU - Ma, Wenjun
AU - Zhang, Jun
N1 - Publisher Copyright:
© The Author(s), 2025. Published by Cambridge University Press.
PY - 2025/12/2
Y1 - 2025/12/2
N2 - Recently, data-driven methods have shown great promise for discovering governing equations from simulation or experimental data. However, most existing approaches are limited to scalar equations, with few capable of identifying tensor relationships. In this work, we propose a general data-driven framework for identifying tensor equations, referred to as symbolic identification of tensor equations (SITE). The core idea of SITE – representing tensor equations using a host–plasmid structure – is inspired by the multidimensional gene expression programming approach. To improve the robustness of the evolutionary process, SITE adopts a genetic information retention strategy. Moreover, SITE introduces two key innovations beyond conventional evolutionary algorithms. First, it incorporates a dimensional homogeneity check to restrict the search space and eliminate physically invalid expressions. Second, it replaces traditional linear scaling with a tensor linear regression technique, greatly enhancing the efficiency of numerical coefficient optimization. We validate SITE using two benchmark scenarios, where it accurately recovers target equations from synthetic data, showing robustness to noise and flexible expressive capability. Furthermore, SITE is applied to identify constitutive relations directly from molecular simulation data, which are generated without reliance on macroscopic constitutive models. It adapts to both compressible and incompressible flow conditions and successfully identifies the corresponding macroscopic forms, highlighting its potential for data-driven discovery of tensor equation.
AB - Recently, data-driven methods have shown great promise for discovering governing equations from simulation or experimental data. However, most existing approaches are limited to scalar equations, with few capable of identifying tensor relationships. In this work, we propose a general data-driven framework for identifying tensor equations, referred to as symbolic identification of tensor equations (SITE). The core idea of SITE – representing tensor equations using a host–plasmid structure – is inspired by the multidimensional gene expression programming approach. To improve the robustness of the evolutionary process, SITE adopts a genetic information retention strategy. Moreover, SITE introduces two key innovations beyond conventional evolutionary algorithms. First, it incorporates a dimensional homogeneity check to restrict the search space and eliminate physically invalid expressions. Second, it replaces traditional linear scaling with a tensor linear regression technique, greatly enhancing the efficiency of numerical coefficient optimization. We validate SITE using two benchmark scenarios, where it accurately recovers target equations from synthetic data, showing robustness to noise and flexible expressive capability. Furthermore, SITE is applied to identify constitutive relations directly from molecular simulation data, which are generated without reliance on macroscopic constitutive models. It adapts to both compressible and incompressible flow conditions and successfully identifies the corresponding macroscopic forms, highlighting its potential for data-driven discovery of tensor equation.
KW - machine learning
UR - https://www.scopus.com/pages/publications/105024460951
U2 - 10.1017/jfm.2025.10710
DO - 10.1017/jfm.2025.10710
M3 - 文章
AN - SCOPUS:105024460951
SN - 0022-1120
VL - 1024
JO - Journal of Fluid Mechanics
JF - Journal of Fluid Mechanics
M1 - A34
ER -