Sketch-a-Net that Beats Humans

  • Qian Yu
  • , Yongxin Yang
  • , Yi Zhe Song
  • , Tao Xiang
  • , Timothy M. Hospedales

Research output: Contribution to conferencePaperpeer-review

Abstract

We propose a multi-scale multi-channel deep neural network framework that, for the first time, yields sketch recognition performance surpassing that of humans. Our superior performance is a result of explicitly embedding the unique characteristics of sketches in our model: (i) a network architecture designed for sketch rather than natural photo statistics, (ii) a multi-channel generalisation that encodes sequential ordering in the sketching process, and (iii) a multi-scale network ensemble with joint Bayesian fusion that accounts for the different levels of abstraction exhibited in free-hand sketches. We show that state-of-the-art deep networks specifically engineered for photos of natural objects fail to perform well on sketch recognition, regardless whether they are trained using photo or sketch. Our network on the other hand not only delivers the best performance on the largest human sketch dataset to date, but also is small in size making efficient training possible using just CPUs.

Original languageEnglish
Pages71-712
Number of pages642
DOIs
StatePublished - 2015
Externally publishedYes
Event26th British Machine Vision Conference, BMVC 2015 - Swansea, United Kingdom
Duration: 7 Sep 201510 Sep 2015

Conference

Conference26th British Machine Vision Conference, BMVC 2015
Country/TerritoryUnited Kingdom
CitySwansea
Period7/09/1510/09/15

Fingerprint

Dive into the research topics of 'Sketch-a-Net that Beats Humans'. Together they form a unique fingerprint.

Cite this