跳到主要导航 跳到搜索 跳到主要内容

A Malleable Boundary Network for temporal action detection

科研成果: 期刊稿件文章同行评审

摘要

Temporal action detection in untrimmed videos is a challenging task aiming to predict the boundary and category of action instances. It can be useful in transportation. In this study, we propose a two-stage framework Malleable Boundary Network (MB-Net) to adaptively regress proposals based on finer scores. In particular, MB-Net consists of a Potential Boundary Generator in the first stage and an Adaptive Proposal Detector in the second stage. First, the Potential Boundary Generator fuses multiple sets of flexible score sequences to obtain tentative proposals through a frame-level feature in an anchor-free way. Then, the Adaptive Proposal Detector employs parallel modules to filter, classify and regress proposals adaptively. Besides, we propose an easy-to-realize feature augmented method Structured Temporal Segment Pooling, which makes full use of the information throughout the whole proposal. Experiments show that MB-Net achieves state-of-the-art performance on popular benchmarks THUMOS-14 and Activity-1.3 with an improvement of 1.9% and 1.2%.

源语言英语
文章编号108250
期刊Computers and Electrical Engineering
103
DOI
出版状态已出版 - 10月 2022

指纹

探究 'A Malleable Boundary Network for temporal action detection' 的科研主题。它们共同构成独一无二的指纹。

引用此