Visual Tracking with Attentional Convolutional Siamese Networks

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recently Siamese trackers have drawn great attention due to their considerable accuracy and speed. To further improve the discriminability of Siamese networks for visual tracking, some deeper networks, such as VGG and ResNet, are exploited as backbone. However, high-level semantic information reduces the location discrimination. In this paper, we propose a novel Attentional Convolutional Siamese Networks for visual tracking (ACST), to improve the classical AlexNet by fusing spatial and channel attentions during feature learning. Moreover, a response-based weighted sampling strategy during training is proposed to strengthen the discrimination power to distinguish two objects with the similar attributes. With the efficiency of cross-correlation operator, our tracker can be trained end-to-end while running in real-time at inference phase. We validate our tracker through extensive experiments on OTB2013 and OTB2015, and results show that the proposed tracker obtains great improvements over the other Siamese trackers.

Original languageEnglish
Title of host publicationImage and Graphics - 10th International Conference, ICIG 2019, Proceedings, Part 1
EditorsYao Zhao, Chunyu Lin, Nick Barnes, Baoquan Chen, Rüdiger Westermann, Xiangwei Kong
PublisherSpringer
Pages369-380
Number of pages12
ISBN (Print)9783030341190
DOIs
StatePublished - 2019
Event10th International Conference on Image and Graphics, ICIG 2019 - Beijing, China
Duration: 23 Aug 201925 Aug 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11901 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th International Conference on Image and Graphics, ICIG 2019
Country/TerritoryChina
CityBeijing
Period23/08/1925/08/19

Keywords

  • Siamese networks
  • Visual attentions
  • Visual tracking

Fingerprint

Dive into the research topics of 'Visual Tracking with Attentional Convolutional Siamese Networks'. Together they form a unique fingerprint.

Cite this