跳到主要导航 跳到搜索 跳到主要内容

Second-order destination inference using semi-supervised self-training for entry-only passenger data

  • Carnegie Mellon University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Automated data collection in urban transportation systems produces a large volume of passenger data. However, quite a few of the data are still incomplete, limiting the insight into passenger mobility. The unavailability of destination information in entry-only passenger data is a very common issue. Traditional approaches for estimating passenger destinations rely on heuristics that can recover only some of the missing destinations. To deal with the remaining incomplete data, this paper, for the first time, proposes a second-order inference methodology to leverage semi-supervised self-training to infer the missing destinations. The methodology involves the design of a base learner to predict the missing destinations based on the statistics of a selected similarity-based “training set”, and the design of a selection strategy to select new data with high prediction confidence to update the training set. To further improve the inference, we incorporate personal history priors to modify the base learner. We evaluate our designs using two data sources: a real-data inspired traffic-passenger behavior simulation in the city of Porto, Portugal, and the real bus Automated Fare Collection (AFC) data collected from the same city. The experimental results show that compared to baseline methods that do not use self-training, our approach significantly improves the inference performance and achieves notably high accuracies.

源语言英语
主期刊名BDCAT 2017 - Proceedings of the 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies
出版商Association for Computing Machinery, Inc
255-264
页数10
ISBN(电子版)9781450355490
DOI
出版状态已出版 - 5 12月 2017
已对外发布
活动4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2017 - Austin, 美国
期限: 5 12月 20178 12月 2017

出版系列

姓名BDCAT 2017 - Proceedings of the 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies

会议

会议4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2017
国家/地区美国
Austin
时期5/12/178/12/17

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 11 - 可持续城市和社区
    可持续发展目标 11 可持续城市和社区

指纹

探究 'Second-order destination inference using semi-supervised self-training for entry-only passenger data' 的科研主题。它们共同构成独一无二的指纹。

引用此