Beyond SMILES: Evaluating Agentic Systems for Drug Discovery

Edward Wijaya

arXiv:2602.10163·q-bio.QM·Published 2026-02-10

Agentic systems for drug discovery have demonstrated autonomous synthesis planning, literature mining, and molecular design. We ask how well they generalize. Evaluating six frameworks against 15 task classes drawn from peptide therapeutics, in vivo pharmacology, and resource-constrained settings, we find five capability gaps: no support for protein language models or peptide-specific prediction, no bridges between in vivo and in silico data, reliance on LLM inference with no pathway to ML training or reinforcement learning, assumptions tied to large-pharma resources, and single-objective optimization that ignores safety-efficacy-stability trade-offs. A paired knowledge-probing experiment suggests the bottleneck is architectural rather than epistemic: four frontier LLMs reason about peptides at levels comparable to small molecules, yet no framework exposes this capability. We propose design requirements and a capability matrix for next-generation frameworks that function as computational partners under realistic constraints.

TopicsReaction, Synthesis & Catalysis

Tagsdrug-discovery protein-llm reinforcement-learning retrosynthesis

arXiv categoriesq-bio.QM, cs.AI

arXiv abstract page PDF