T-count Reduction Method Based on Proximal Policy Optimization

  • Keyu Xiong
  • , Tao Shang*
  • , Chenyi Zhang
  • , Yuchen Liu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In fault-tolerant quantum computing systems, T gates consume more fault-tolerant resources. In this paper, we propose a T-count reduction method based on the Proximal Policy Optimization (PPO) algorithm, minimizing the number of T gates in quantum computations. Initially, within the framework of ZX-calculus graphical language, quantum circuits are transformed into ZX-diagrams. Subsequently, the PPO algorithm is employed to learn a policy that predicts optimal transformation trajectories. To effectively leverage the topological structure of ZX-diagrams, we employ graph neural networks (GNNs) to encode the policy trained via PPO algorithm, while identifying possible transformations through the local structural properties of individual nodes or edges. The proposed method achieves an average 10.17% reduction in T-count under optimal conditions, demonstrating its capability in reducing the number of T gates.

Original languageEnglish
Title of host publicationQuantum Computation - 4th CCF Quantum Computation Conference, CQCC 2025, Proceedings
EditorsXiaoyu Li, Junjie Wu, Jialin Zhang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages17-25
Number of pages9
ISBN (Print)9789819547906
DOIs
StatePublished - 2026
Event4th CCF Quantum Computation Conference, CQCC 2025 - Chengdu, China
Duration: 21 Jul 202523 Jul 2025

Publication series

NameCommunications in Computer and Information Science
Volume2733 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference4th CCF Quantum Computation Conference, CQCC 2025
Country/TerritoryChina
CityChengdu
Period21/07/2523/07/25

Keywords

  • Proximal policy optimization
  • Quantum circuit optimization
  • Reinforcement learning
  • T-count reduction
  • ZX-calculus

Fingerprint

Dive into the research topics of 'T-count Reduction Method Based on Proximal Policy Optimization'. Together they form a unique fingerprint.

Cite this