摘要
State aggregation is usually used to handle large-scale Markov decision processes (MDPs). Despite of the computational advantage, state aggregation may result in error in estimating value functions of states and further lead to poor performance in objective value. Various cyber physical energy systems (CPES), including supply demand matching systems, are discrete event dynamic systems, which can usually be formulated as MDP. It is of great practical interest to study performance loss bound for state aggregation in large scale MDPs. In this paper, we consider the performance loss bound for state aggregation in a class of supply demand matching systems. These systems consist of two types of state variables, the action-based and the action-free. We provide a method for aggregating states, which reduces the size of state space and thus save memory space and computing budget. We make the following contributions. First, we provide the performance loss bounds for two sets of naive state aggregations, based on which we propose that the action-free variables are prior to be aggregated when the true value functions or Q-factors are unknown. Second, we propose a k-means based method for aggregating states considering the features of state variables. Third, we consider the problem of battery charging of shared electric vehicles (EVs) in smart grid and test the proposed algorithm. The results are consistent with the performance loss bounds and show that the proposed method performs well.
| 源语言 | 英语 |
|---|---|
| 主期刊名 | Proceedings of the 39th Chinese Control Conference, CCC 2020 |
| 编辑 | Jun Fu, Jian Sun |
| 出版商 | IEEE Computer Society |
| 页 | 4307-4312 |
| 页数 | 6 |
| ISBN(电子版) | 9789881563903 |
| DOI | |
| 出版状态 | 已出版 - 7月 2020 |
| 已对外发布 | 是 |
| 活动 | 39th Chinese Control Conference, CCC 2020 - Shenyang, 中国 期限: 27 7月 2020 → 29 7月 2020 |
出版系列
| 姓名 | Chinese Control Conference, CCC |
|---|---|
| 卷 | 2020-July |
| ISSN(印刷版) | 1934-1768 |
| ISSN(电子版) | 2161-2927 |
会议
| 会议 | 39th Chinese Control Conference, CCC 2020 |
|---|---|
| 国家/地区 | 中国 |
| 市 | Shenyang |
| 时期 | 27/07/20 → 29/07/20 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 7 经济适用的清洁能源
指纹
探究 'Performance Loss Bound for State Aggregation in a Class of Supply Demand Matching Systems' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver