Chemically-aware Attention-based Multi-modal Fusion Framework for Molecular Representation Learning

Yu Liu, Jonathan D. Hirst, Jianfeng Ren, Bencan Tang, Dave Towey

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Learning effective molecular representations is crucial for accurate property prediction in artificial intelligence (AI)-aided drug discovery. Graph and fingerprint representations have been widely used to encode molecular topological structures and chemical substructures. To enhance the feature embedding of each modality and leverage their complementary strengths, we propose a novel Chemically-aware Attention-based Multi-modal Fusion Framework (CAMFF) for molecular representation learning, which integrates molecular graphs and extended-connectivity fingerprints by exploiting various attention mechanisms. Specifically, the proposed CAMFF consists of three modules: 1) a graph embedding module incorporating multi-head attention to capture local heterogeneous interactions and all-pair self-attention to capture long-range atomic dependencies from molecular graph representations; 2) a fingerprint embedding module using a pre-trained Mol2Vec model to generate dense chemical substructure representations; and 3) a chemically-aware feature interaction and fusion module incorporating self-attention to enable interactions between various chemical substructures and cross-attention to ensure effective multi-modal alignment and fusion. To evaluate the effectiveness of CAMFF, we compare it with 14 state-of-the-art methods across 9 molecular property prediction benchmarks. CAMFF demonstrates competitive predictive performance and improves interpretability through attention-based visualization, showing its potential for real-world drug discovery.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE 49th Annual Computers, Software, and Applications Conference, COMPSAC 2025
EditorsHossain Shahriar, Kazi Shafiul Alam, Hiroyuki Ohsaki, Stelvio Cimato, Miriam Capretz, Shamem Ahmed, Sheikh Iqbal Ahamed, AKM Jahangir Alam Majumder, Munirul Haque, Tomoki Yoshihisa, Alfredo Cuzzocrea, Michiharu Takemoto, Nazmus Sakib, Marwa Elsayed
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1828-1833
Number of pages6
ISBN (Electronic)9798331574345
DOIs
Publication statusPublished - 2025
Event49th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2025 - Toronto, Canada
Duration: 8 Jul 202511 Jul 2025

Publication series

NameProceedings - 2025 IEEE 49th Annual Computers, Software, and Applications Conference, COMPSAC 2025

Conference

Conference49th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2025
Country/TerritoryCanada
CityToronto
Period8/07/2511/07/25

Keywords

  • Attention-Based Network
  • Graph Neural Network
  • Molecular Representation Learning
  • Multi-Modal Fusion
  • Transformer

ASJC Scopus subject areas

  • Computational Mathematics
  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Software
  • Media Technology

Fingerprint

Dive into the research topics of 'Chemically-aware Attention-based Multi-modal Fusion Framework for Molecular Representation Learning'. Together they form a unique fingerprint.

Cite this