Skip to main navigation Skip to search Skip to main content

Experimental design with multiple kernels

  • Hanmo Wang
  • , Liang Du
  • , Peng Zhou
  • , Lei Shi
  • , Yuhua Qian
  • , Yi Dong Shen*
  • *Corresponding author for this work
  • CAS - Institute of Software
  • University of Chinese Academy of Sciences
  • Shanxi University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In classification tasks, labeled data is a necessity but sometimes difficult or expensive to obtain. On the contrary, unlabeled data is usually abundant. Recently, different active learning algorithms are proposed to alleviate this issue by selecting the most informative data points to label. One family of active learning methods comes from Optimum Experimental Design (OED) in statistics. Instead of selecting data points one by one iteratively, OED-based approaches select data in a one-shot manner, that is, a fixed-sized subset is selected from the unlabeled dataset for manually labeling. These methods usually use kernels to represent pair-wise similarities between different data points. It is well known that choosing optimal kernel types (e.g. Gaussian kernel) and kernel parameters (e.g. kernel width) is tricky, and a common way to resolve it is by Multiple Kernel Learning (MKL), i.e., to construct a few candidate kernels and merge them to form a consensus kernel. There would be different ways to combine multiple kernels, one of which, called the the globalised approach is to assign a weight to each candidate kernel. In practice different data points in the same candidate kernel may not have the same contribution in the consensus kernel, this requires assigning different weights to different data points in the same candidate kernel, leading to the localized approach. In this paper, we introduce MKL to OED-based active learning, specifically we propose globalised and localized multiple kernel active learning methods, respectively. Our experiments on six benchmark datasets demonstrate that the proposed methods have better performance than existing OED-based active learning methods.

Original languageEnglish
Title of host publicationProceedings - 15th IEEE International Conference on Data Mining, ICDM 2015
EditorsCharu Aggarwal, Zhi-Hua Zhou, Alexander Tuzhilin, Hui Xiong, Xindong Wu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages419-428
Number of pages10
ISBN (Electronic)9781467395038
DOIs
StatePublished - 5 Jan 2016
Externally publishedYes
Event15th IEEE International Conference on Data Mining, ICDM 2015 - Atlantic City, United States
Duration: 14 Nov 201517 Nov 2015

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
Volume2016-January
ISSN (Print)1550-4786

Conference

Conference15th IEEE International Conference on Data Mining, ICDM 2015
Country/TerritoryUnited States
CityAtlantic City
Period14/11/1517/11/15

Keywords

  • Active learning
  • Experimental Design
  • Multiple Kernels

Fingerprint

Dive into the research topics of 'Experimental design with multiple kernels'. Together they form a unique fingerprint.

Cite this