跳到主要导航 跳到搜索 跳到主要内容

Learning from PhotoShop Operation Videos: The PSOV Dataset

  • Jingchun Cheng
  • , Han Kai Hsu
  • , Chen Fang
  • , Hailin Jin
  • , Shengjin Wang*
  • , Ming Hsuan Yang
  • *此作品的通讯作者
  • Tsinghua University
  • University of California Merced
  • Adobe Systems Incorporated

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In this paper, we present the PhotoShop Operation Video (PSOV) dataset, a large-scale, densely annotated video database designed for the development of software intelligence. The PSOV dataset consists of 564 densely-annotated videos for Photoshop operations, covering more than 500 commonly used commands in the Photoshop software. Videos in this dataset are obtained from YouTube, manually watched and annotated precisely to seconds by experts. There are more than 74 h of videos with 29,204 labeled commands. To the best of our knowledge, the PSOV dataset is the first large-scale software operation video database with high-resolution frames and dense annotations. We believe that this dataset can help advance the development of intelligent software, and has extensive application aspects. In this paper, we describe the dataset construction procedure, data attributes, proposed tasks and their corresponding evaluation metrics. To demonstrate that the PSOV dataset has sufficient data and labeling for data-driven methods, we develop a deep learning based algorithm for the command classification task. We also carry out experiments and analysis with the proposed method to encourage better understanding and usage of the PSOV dataset.

源语言英语
主期刊名Computer Vision – ACCV 2018 - 14th Asian Conference on Computer Vision, Revised Selected Papers
编辑Greg Mori, Hongdong Li, C.V. Jawahar, Konrad Schindler
出版商Springer Verlag
223-239
页数17
ISBN(印刷版)9783030208691
DOI
出版状态已出版 - 2019
已对外发布
活动14th Asian Conference on Computer Vision, ACCV 2018 - Perth, 澳大利亚
期限: 2 12月 20186 12月 2018

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
11364 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议14th Asian Conference on Computer Vision, ACCV 2018
国家/地区澳大利亚
Perth
时期2/12/186/12/18

指纹

探究 'Learning from PhotoShop Operation Videos: The PSOV Dataset' 的科研主题。它们共同构成独一无二的指纹。

引用此