跳到主要导航 跳到搜索 跳到主要内容

TransCrack: revisiting fine-grained road crack detection with a transformer design

科研成果: 期刊稿件文章同行评审

摘要

Prior convolution-based road crack detectors typically learn more abstract visual representation with increasing receptive field via an encoder-decoder architecture. Despite the promising accuracy, progressive spatial resolution reduction causes semantic feature blurring, leading to coarse and incontiguous distress detection. To these ends, an alternative sequence-to-sequence perspective with a transformer network termed TransCrack is introduced for road crack detection. Specifically, an image is decomposed into a grid of fixed-size crack patches, which is flattened with position embedding into a sequence. We further propose a pure transformer-based encoder with multi-head reduced self-attention modules and feed-forward networks for explicitly modelling long-range dependencies from the sequential input in a global receptive field. More importantly, a simple decoder with cross-layer aggregation architecture is developed to incorporate global with local attentions across different regions for detailed feature recovery and pixel-wise crack mask prediction. Empirical studies are conducted on three publicly available damage detection benchmarks. The proposed TransCrack achieves a state-of-the-art performance over all counterparts by a substantialmargin, and qualitative results further demonstrate its superiority in contiguous crack recognition and fine-grained profile extraction. This article is part of the theme issue 'Artificial intelligence in failure analysis of transportation infrastructure and materials'.

源语言英语
文章编号20220172
期刊Philosophical transactions. Series A, Mathematical, physical, and engineering sciences
381
2254
DOI
出版状态已出版 - 4 9月 2023

指纹

探究 'TransCrack: revisiting fine-grained road crack detection with a transformer design' 的科研主题。它们共同构成独一无二的指纹。

引用此