Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning

  • Tianchi Cai*
  • , Jiyan Jiang
  • , Wenpeng Zhang
  • , Shiji Zhou
  • , Xierui Song
  • , Li Yu
  • , Lihong Gu
  • , Xiaodong Zeng
  • , Jinjie Gu
  • , Guannan Zhang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We study the budget allocation problem in online marketing campaigns that utilize previously collected offline data. We first discuss the long-term effect of optimizing marketing budget allocation decisions in the offline setting. To overcome the challenge, we propose a novel game-theoretic offline value-based reinforcement learning method using mixed policies. The proposed method reduces the need to store infinitely many policies in previous methods to only constantly many policies, which achieves nearly optimal policy efficiency, making it practical and favorable for industrial usage. We further show that this method is guaranteed to converge to the optimal policy, which cannot be achieved by previous value-based reinforcement learning methods for marketing budget allocation. Our experiments on a large-scale marketing campaign with tens-of-millions users and more than one billion budget verify the theoretical results and show that the proposed method outperforms various baseline methods. The proposed method has been successfully deployed to serve all the traffic of this marketing campaign.

Original languageEnglish
Title of host publicationWSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery, Inc
Pages186-194
Number of pages9
ISBN (Electronic)9781450394079
DOIs
StatePublished - 27 Feb 2023
Externally publishedYes
Event16th ACM International Conference on Web Search and Data Mining, WSDM 2023 - Singapore, Singapore
Duration: 27 Feb 20233 Mar 2023

Publication series

NameWSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining

Conference

Conference16th ACM International Conference on Web Search and Data Mining, WSDM 2023
Country/TerritorySingapore
CitySingapore
Period27/02/233/03/23

Keywords

  • marketing budget allocation
  • offline constrained deep RL

Fingerprint

Dive into the research topics of 'Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this