跳到主要导航 跳到搜索 跳到主要内容

Local clustering in contextual multi-armed bandits

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

We study identifying user clusters in contextual multi-armed bandits (MAB). Contextual MAB is an effective tool for many real applications, such as content recommendation and online advertisement. In practice, user dependency plays an essential role in the user's actions, and thus the rewards. Clustering similar users can improve the quality of reward estimation, which in turn leads to more effective content recommendation and targeted advertising. Different from traditional clustering settings, we cluster users based on the unknown bandit parameters, which will be estimated incrementally. In particular, we define the problem of cluster detection in contextual MAB, and propose a bandit algorithm, LOCB, embedded with local clustering procedure. And, we provide theoretical analysis about LOCB in terms of the correctness and efficiency of clustering and its regret bound. Finally, we evaluate the proposed algorithm from various aspects, which outperforms state-of-the-art baselines.

源语言英语
主期刊名The Web Conference 2021 - Proceedings of the World Wide Web Conference, WWW 2021
出版商Association for Computing Machinery, Inc
2335-2346
页数12
ISBN(电子版)9781450383127
DOI
出版状态已出版 - 3 6月 2021
已对外发布
活动30th World Wide Web Conference, WWW 2021 - Ljubljana, 斯洛文尼亚
期限: 19 4月 202123 4月 2021

出版系列

姓名The Web Conference 2021 - Proceedings of the World Wide Web Conference, WWW 2021

会议

会议30th World Wide Web Conference, WWW 2021
国家/地区斯洛文尼亚
Ljubljana
时期19/04/2123/04/21

指纹

探究 'Local clustering in contextual multi-armed bandits' 的科研主题。它们共同构成独一无二的指纹。

引用此