跳到主要导航 跳到搜索 跳到主要内容

Empirical Study of Unsupervised Pre-Training in CNN and Transformer Based Visual Tracking

  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Deep learning-based visual object tracking has seen the emergence of CNN-based and Transformer-based algorithms built upon the Siamese-based pipeline to pursue robustness and accuracy. However, the performance gap between them requires high-quality and large-scale labeled data for sufficient training. In this work, we design an unsupervised pre-training scheme based on data augmentation to reduce the dependence on expensive labeled data. The core step is the object localization pretext task, which randomly crops the object and pastes it onto several background images. Moreover, we apply the method to both CNN-based and Transformer-based visual trackers. Extensive experiments on public datasets demonstrate that our method outperforms prevailing unsupervised trackers on large-scale benchmarks such as LaSOT and TrackingNet. Additionally, a simple strategy of freezing the CNN backbone during Transformer-based pre-training proves to be effective.

源语言英语
主期刊名2023 5th International Conference on Artificial Intelligence and Computer Applications, ICAICA 2023
出版商Institute of Electrical and Electronics Engineers Inc.
291-295
页数5
ISBN(电子版)9798350323313
DOI
出版状态已出版 - 2023
活动2023 5th International Conference on Artificial Intelligence and Computer Applications, ICAICA 2023 - Dalian, 中国
期限: 28 11月 202330 11月 2023

出版系列

姓名2023 5th International Conference on Artificial Intelligence and Computer Applications, ICAICA 2023

会议

会议2023 5th International Conference on Artificial Intelligence and Computer Applications, ICAICA 2023
国家/地区中国
Dalian
时期28/11/2330/11/23

指纹

探究 'Empirical Study of Unsupervised Pre-Training in CNN and Transformer Based Visual Tracking' 的科研主题。它们共同构成独一无二的指纹。

引用此