Directly Optimizing for Synthesizability in Generative Molecular Design using Retrosynthesis Models

Jeff Guo, Philippe Schwaller

arXiv:2407.12186·q-bio.BM·Published 2024-07-16

Synthesizability in generative molecular design remains a pressing challenge. Existing methods to assess synthesizability span heuristics-based methods, retrosynthesis models, and synthesizability-constrained molecular generation. The latter has become increasingly prevalent and proceeds by defining a set of permitted actions a model can take when generating molecules, such that all generations are anchored in "synthetically-feasible" chemical transformations. To date, retrosynthesis models have been mostly used as a post-hoc filtering tool as their inference cost remains prohibitive to use directly in an optimization loop. In this work, we show that with a sufficiently sample-efficient generative model, it is straightforward to directly optimize for synthesizability using retrosynthesis models in goal-directed generation. Under a heavily-constrained computational budget, our model can generate molecules satisfying a multi-parameter drug discovery optimization task while being synthesizable, as deemed by the retrosynthesis model.

TopicsGenerative Design & Molecule Optimization, Reaction, Synthesis & Catalysis

Tagsdrug-discovery generative-model molecular-generation retrosynthesis

arXiv categoriesq-bio.BM

arXiv abstract pagePDF