跳到主要导航 跳到搜索 跳到主要内容

Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective

  • Ziwei Zhang*
  • , Xin Wang*
  • , Zeyang Zhang
  • , Peng Cui
  • , Wenwu Zhu*
  • *此作品的通讯作者
  • Tsinghua University

科研成果: 期刊稿件文章同行评审

摘要

Deep neural networks have achieved great success in the last decade. When designing neural networks to handle the ubiquitous geometric data such as point clouds and graphs, it is critical that the model can maintain invariance towards various transformations such as translation, rotation, and scaling. Most existing graph neural network (GNN) approaches can only maintain permutation-invariance, failing to guarantee invariance with respect to other transformations. Besides GNNs, other works design sophisticated transformation-invariant layers, which are computationally expensive and difficult to be extended. In this paper, we revisit why general neural networks cannot maintain transformation invariance. Our findings show that transformation-invariant and distance-preserving initial point representations are sufficient to achieve transformation invariance rather than needing sophisticated neural layer designs. Motivated by these findings, we propose Transformation Invariant Neural Networks (TinvNet), a straightforward and general plug-in for geometric data. Specifically, we realize transformation invariant and distance-preserving initial point representations by modifying multi-dimensional scaling and feed the representations into existing neural networks. We prove that TinvNet can strictly guarantee transformation invariance, being general and flexible enough to be combined with the existing neural networks. Extensive experimental results on point cloud analysis and combinatorial optimization demonstrate the effectiveness and general applicability of our method. We also extend our method into equivariance cases. Based on the results, we advocate that TinvNet should be considered as an essential baseline for further studies of transformation-invariant geometric deep learning.

源语言英语
页(从-至)2646-2658
页数13
期刊IEEE Transactions on Pattern Analysis and Machine Intelligence
48
3
DOI
出版状态已出版 - 2026
已对外发布

指纹

探究 'Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective' 的科研主题。它们共同构成独一无二的指纹。

引用此