TY - GEN
T1 - Molecular Graph Generation via Geometric Scattering
AU - Bhaskar, Dhananjay
AU - Grady, Jackson
AU - Castro, Egbert
AU - Perlmutter, Michael
AU - Krishnaswamy, Smita
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Although existing deep learning models perform remarkably well at predicting physicochemical properties and binding affinities, the generation of new molecules with optimized properties remains challenging. Inherently, most GNNs perform poorly in whole-graph representation due to the limitations of the message-passing paradigm. Furthermore, step-by-step graph generation frameworks that use reinforcement learning or other sequential processing can be slow and result in a high proportion of invalid molecules with substantial post-processing needed in order to generate valid molecules. To address these issues, we propose a representation-first approach to molecular graph generation. We guide the latent representation of an autoencoder by capturing graph structure information with the geometric scattering transform and apply penalties that organize the representation by molecular properties. We show that this highly structured latent space can be directly used for molecular graph generation by the use of a GAN. We demonstrate that our architecture learns meaningful representations of drug datasets and provides a platform for drug synthesis using publicly available ZINC and BindingDB datasets.
AB - Although existing deep learning models perform remarkably well at predicting physicochemical properties and binding affinities, the generation of new molecules with optimized properties remains challenging. Inherently, most GNNs perform poorly in whole-graph representation due to the limitations of the message-passing paradigm. Furthermore, step-by-step graph generation frameworks that use reinforcement learning or other sequential processing can be slow and result in a high proportion of invalid molecules with substantial post-processing needed in order to generate valid molecules. To address these issues, we propose a representation-first approach to molecular graph generation. We guide the latent representation of an autoencoder by capturing graph structure information with the geometric scattering transform and apply penalties that organize the representation by molecular properties. We show that this highly structured latent space can be directly used for molecular graph generation by the use of a GAN. We demonstrate that our architecture learns meaningful representations of drug datasets and provides a platform for drug synthesis using publicly available ZINC and BindingDB datasets.
KW - Drug Discovery
KW - Geometric Scattering
KW - Molecular Graph Generation
UR - http://www.scopus.com/inward/record.url?scp=85141790598&partnerID=8YFLogxK
UR - https://doi.org/10.1109/MLSP55214.2022.9943379
U2 - 10.1109/MLSP55214.2022.9943379
DO - 10.1109/MLSP55214.2022.9943379
M3 - Conference contribution
AN - SCOPUS:85141790598
T3 - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
BT - 2022 IEEE 32nd International Workshop on Machine Learning for Signal Processing, MLSP 2022
PB - IEEE Computer Society
T2 - 32nd IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2022
Y2 - 22 August 2022 through 25 August 2022
ER -