Three-Branch Molecular Representation Learning Framework for Predicting Molecular Properties in Drug Discovery

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Graph Neural Networks (GNNs) have been widely used to model molecules with a graph representation. However, GNNs face inherent challenges in accurately modeling long-range atomic interactions and identifying complex molecular substructures. This research proposes a novel Three-branch Molecular Representation Learning Framework (TMRLF) for predicting molecular properties: it integrates one branch of a GNN that extracts local molecular structural information with two branches of fully connected networks that capture the chemical substructure based on two fingerprints. Specifically, to better capture the long-range interactions, the GNN is designed with an attention mechanism to enhance the atomic interactions. As the Morgan fingerprint effectively captures functional groups of molecules and another well-used molecular fingerprint in the field of drug discovery, the Extended Reduced Graph (ErG) Fingerprint specifically targets molecular features with pharmacological relevance. These two fingerprints are both utilized to complement the chemical information and long-range information processing at the level of key structural features that GNNs lack. The proposed TMRLF extracts a robust feature representation of molecules, crucial for accurately predicting molecular properties and identifying potential drug candidates. Our proposed TMRLF is compared against six state-of-the-art models on eight benchmark datasets. It demonstrates superior capability in predicting molecular properties. Its effectiveness is further highlighted through proof-of-concept validation in identifying potential inhibitors for the Son of Sevenless Homolog 1 (SOSI) protein in real-world drug discovery scenarios.
Original languageEnglish
Title of host publication2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC)
PublisherIEEE
Pages1983-1989
Number of pages7
ISBN (Electronic)9798350376968
ISBN (Print)9798350376975
DOIs
Publication statusPublished - 2024

Keywords

  • Artificial Intelligence
  • Graph Neural Networks
  • Deep Learning
  • Drug discovery
  • Molecular Property Prediction
  • Molecular Representation Learning

Fingerprint

Dive into the research topics of 'Three-Branch Molecular Representation Learning Framework for Predicting Molecular Properties in Drug Discovery'. Together they form a unique fingerprint.

Cite this