Discovering Disease Patterns Using the Supervised Topic Model

  • Shu Li
  • , Jingyuan Wang*
  • , Yi Wang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we explore the methods of medical data mining. The medical data usually have some unique characteristics such as sparseness, highly correlated features and unbalanced sample categories. After researching the models commonly used in current medical data mining, we use the topic-based model for medical data mining. We build a supervised topic model (the SLDA model) and use Gibbs sampling to estimate parameters. From the results of the model, we can find some important relationships among features in our medical data. Finally, the SLDA model was combined with a Random Forest classifier, which gets good predictive performance in disease prediction.

Original languageEnglish
Title of host publication2018 15th International Conference on Service Systems and Service Management, ICSSSM 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Print)9781538651780
DOIs
StatePublished - 13 Sep 2018
Event15th International Conference on Service Systems and Service Management, ICSSSM 2018 - Hangzhou, China
Duration: 21 Jul 201822 Jul 2018

Publication series

Name2018 15th International Conference on Service Systems and Service Management, ICSSSM 2018

Conference

Conference15th International Conference on Service Systems and Service Management, ICSSSM 2018
Country/TerritoryChina
CityHangzhou
Period21/07/1822/07/18

Fingerprint

Dive into the research topics of 'Discovering Disease Patterns Using the Supervised Topic Model'. Together they form a unique fingerprint.

Cite this