Multi-modal contrastive learning for chemical structure elucidation with VibraCLIP

dc.contributor.author
Rocabert-Oriols, Pau
dc.contributor.author
Lo Conte, Camilla
dc.contributor.author
López, Núria
dc.contributor.author
Heras-Domingo, Javier
dc.date.accessioned
2025-12-01T13:50:03Z
dc.date.available
2025-12-01T13:50:03Z
dc.date.issued
2025-11-11
dc.identifier.uri
http://hdl.handle.net/2072/488975
dc.description.abstract
Identifying molecular structures from vibrational spectra is central to chemical analysis but remains challenging due to spectral ambiguity and the limitations of single-modality methods. While deep learning has advanced various spectroscopic characterization techniques, leveraging the complementary nature of infrared (IR) and Raman spectroscopies remains largely underexplored. We introduce VibraCLIP, a contrastive learning framework that embeds molecular graphs, IR and Raman spectra into a shared latent space. A lightweight fine-tuning protocol ensures generalization from theoretical to experimental datasets. VibraCLIP enables accurate, scalable, and data-efficient molecular identification, linking vibrational spectroscopy with structural interpretation. This tri-modal design captures rich structure–spectra relationships, achieving Top-1 retrieval accuracy of 81.7% and reaching 98.9% Top-25 accuracy with molecular mass integration. By integrating complementary vibrational spectroscopic signals with molecular representations, VibraCLIP provides a practical framework for automated spectral analysis, with potential applications in fields such as synthesis monitoring, drug development, and astrochemical detection.
ca
dc.format.extent
10 p.
ca
dc.language.iso
eng
ca
dc.publisher
RSC
ca
dc.rights
Attribution 4.0 International
*
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
*
dc.source
RECERCAT (Dipòsit de la Recerca de Catalunya)
dc.subject.other
Química
ca
dc.title
Multi-modal contrastive learning for chemical structure elucidation with VibraCLIP
ca
dc.type
info:eu-repo/semantics/article
ca
dc.subject.udc
54
ca
dc.description.version
info:eu-repo/semantics/publishedVersion
ca
dc.embargo.terms
cap
ca
dc.relation.projectID
Institute of Chemical Research of Catalonia (ICIQ) Summer Fellow Program
ca
dc.relation.projectID
Department of Research and Universities of the Generalitat de Catalunya for funding through grant (reference: SGR-01155)
ca
dc.relation.projectID
PID2024-157556OB-I00 funded by MICIU/AEI/10.13039/501100011033/FEDER, UE
ca
dc.identifier.doi
https://doi.org/10.1039/D5DD00269A
ca
dc.rights.accessLevel
info:eu-repo/semantics/openAccess


Documents

d5dd00269a.pdf

1.046Mb PDF

Aquest element apareix en la col·lecció o col·leccions següent(s)

Papers [1240]