摘要
Automated data collection in urban transportation systems produces a large volume of passenger data. However, quite a few of the data are still incomplete, limiting the insight into passenger mobility. The unavailability of destination information in entry-only passenger data is a very common issue. Traditional approaches for estimating passenger destinations rely on heuristics that can recover only some of the missing destinations. To deal with the remaining incomplete data, this paper, for the first time, proposes a second-order inference methodology to leverage semi-supervised self-training to infer the missing destinations. The methodology involves the design of a base learner to predict the missing destinations based on the statistics of a selected similarity-based “training set”, and the design of a selection strategy to select new data with high prediction confidence to update the training set. To further improve the inference, we incorporate personal history priors to modify the base learner. We evaluate our designs using two data sources: a real-data inspired traffic-passenger behavior simulation in the city of Porto, Portugal, and the real bus Automated Fare Collection (AFC) data collected from the same city. The experimental results show that compared to baseline methods that do not use self-training, our approach significantly improves the inference performance and achieves notably high accuracies.
| 源语言 | 英语 |
|---|---|
| 主期刊名 | BDCAT 2017 - Proceedings of the 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies |
| 出版商 | Association for Computing Machinery, Inc |
| 页 | 255-264 |
| 页数 | 10 |
| ISBN(电子版) | 9781450355490 |
| DOI | |
| 出版状态 | 已出版 - 5 12月 2017 |
| 已对外发布 | 是 |
| 活动 | 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2017 - Austin, 美国 期限: 5 12月 2017 → 8 12月 2017 |
出版系列
| 姓名 | BDCAT 2017 - Proceedings of the 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies |
|---|
会议
| 会议 | 4th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2017 |
|---|---|
| 国家/地区 | 美国 |
| 市 | Austin |
| 时期 | 5/12/17 → 8/12/17 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 11 可持续城市和社区
指纹
探究 'Second-order destination inference using semi-supervised self-training for entry-only passenger data' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver