Skip to main navigation Skip to search Skip to main content

Large Model based Sequential Keyframe Extraction for Video Summarization

  • Kailong Tan
  • , Yuxiang Zhou
  • , Qianchen Xia
  • , Rui Liu
  • , Yong Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Keyframe extraction aims to sum up a video's semantics with the minimum number of its frames. This paper puts forward a Large Model based Sequential Keyframe Extraction for video summarization, dubbed LMSKE, which contains three stages as below. First, we use the large model "TransNetV21"to cut the video into consecutive shots, and employ the large model "CLIP2"to generate each frame's visual feature within each shot; Second, we develop an adaptive clustering algorithm to yield candidate keyframes for each shot, with each candidate keyframe locating nearest to a cluster center; Third, we further reduce the above candidate keyframes via redundancy elimination within each shot, and finally concatenate them in accordance with the sequence of shots as the final sequential keyframes. To evaluate LMSKE, we curate a benchmark dataset and conduct rich experiments, whose results exhibit that LMSKE performs much better than quite a few SOTA competitors with average F1 of 0.5311, average fidelity of 0.8141, and average compression ratio of 0.9922.

Original languageEnglish
Title of host publicationCMLDS 2024 - 2024 International Conference on Computing, Machine Learning and Data Science, Conference Proceedings
PublisherAssociation for Computing Machinery
ISBN (Electronic)9798400716393
DOIs
StatePublished - 12 Apr 2024
Event2024 International Conference on Computing, Machine Learning and Data Science, CMLDS 2024 - Singapore, Singapore
Duration: 12 Apr 202414 Apr 2024

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2024 International Conference on Computing, Machine Learning and Data Science, CMLDS 2024
Country/TerritorySingapore
CitySingapore
Period12/04/2414/04/24

Keywords

  • adaptive clustering
  • keyframe extraction
  • large model
  • shot segmentation
  • video summarization

Fingerprint

Dive into the research topics of 'Large Model based Sequential Keyframe Extraction for Video Summarization'. Together they form a unique fingerprint.

Cite this