TY - GEN
T1 - IPM
T2 - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
AU - Liu, Ruomei
AU - Li, Shangzhe
AU - Zhu, He
AU - Hou, Yue
AU - Peng, Xingyu
AU - Yuan, Haitao
AU - Wu, Junran
AU - Xu, Ke
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Given the pivotal role of molecular property prediction in drug development and material science, graph self-supervised learning has been implemented in molecular representation learning to compensate for the shortage of labeled molecules. However, current proposed methods often focus on designing data augmentation schemes and leveraging domain knowledge to improve performance, which inevitably leads to molecular semantics loss and limited generalization capability. To the end, we propose IPM, an Information lossless Pretraining strategy for Molecular property prediction that leverages the information of both the original graph and line graph of molecules. Specifically, by contrasting the given graph with the corresponding line graph, the graph encoder can fully learn the generic molecular semantic representation without profound domain knowledge. We also design a new message-passing scheme that retains information consistency during message passing between two kinds of graphs. Additionally, we present two graph contrastive losses for performance fixing and over-smoothing prevention during the learning process. Experimental results on multiple regression tasks for molecular property prediction demonstrate the effectiveness of IPM against state-of-the-art (SOTA) methods.
AB - Given the pivotal role of molecular property prediction in drug development and material science, graph self-supervised learning has been implemented in molecular representation learning to compensate for the shortage of labeled molecules. However, current proposed methods often focus on designing data augmentation schemes and leveraging domain knowledge to improve performance, which inevitably leads to molecular semantics loss and limited generalization capability. To the end, we propose IPM, an Information lossless Pretraining strategy for Molecular property prediction that leverages the information of both the original graph and line graph of molecules. Specifically, by contrasting the given graph with the corresponding line graph, the graph encoder can fully learn the generic molecular semantic representation without profound domain knowledge. We also design a new message-passing scheme that retains information consistency during message passing between two kinds of graphs. Additionally, we present two graph contrastive losses for performance fixing and over-smoothing prevention during the learning process. Experimental results on multiple regression tasks for molecular property prediction demonstrate the effectiveness of IPM against state-of-the-art (SOTA) methods.
KW - Graph Self-supervised Learning
KW - Line Graph
KW - Molecular Property Prediction Regression Task
UR - https://www.scopus.com/pages/publications/85217279093
U2 - 10.1109/BIBM62325.2024.10822412
DO - 10.1109/BIBM62325.2024.10822412
M3 - 会议稿件
AN - SCOPUS:85217279093
T3 - Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
SP - 1021
EP - 1027
BT - Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
A2 - Cannataro, Mario
A2 - Zheng, Huiru
A2 - Gao, Lin
A2 - Cheng, Jianlin
A2 - de Miranda, Joao Luis
A2 - Zumpano, Ester
A2 - Hu, Xiaohua
A2 - Cho, Young-Rae
A2 - Park, Taesung
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 3 December 2024 through 6 December 2024
ER -