Generative AI

M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints

YYizhan LiFFlorence CloutierSSifan WuAAli ParvizBBoris KnyazevYYan ZhangGGlen BersethBBang Liu
arXiv ID
2601.10131
Published
January 15, 2026
Authors
8
Hugging Face Likes
16
Comments
2

Abstract

Generating molecules that satisfy precise numeric constraints over multiple physicochemical properties is critical and challenging. Although large language models (LLMs) are expressive, they struggle with precise multi-objective control and numeric reasoning without external structure and feedback. We introduce M olGen, a fragment-level, retrieval-augmented, two-stage framework for molecule generation under multi-property constraints. Stage I : Prototype generation: a multi-agent reasoner performs retrieval-anchored, fragment-level edits to produce a candidate near the feasible region. Stage II : RL-based fine-grained optimization: a fragment-level optimizer trained with Group Relative Policy Optimization (GRPO) applies one- or multi-hop refinements to explicitly minimize the property errors toward our target while regulating edit complexity and deviation from the prototype. A large, automatically curated dataset with reasoning chains of fragment edits and measured property deltas underpins both stages, enabling deterministic, reproducible supervision and controllable multi-hop reasoning. Unlike prior work, our framework better reasons about molecules by leveraging fragments and supports controllable refinement toward numeric targets. Experiments on generation under two sets of property constraints (QED, LogP, Molecular Weight and HOMO, LUMO) show consistent gains in validity and precise satisfaction of multi-property targets, outperforming strong LLMs and graph-based algorithms.

Keywords

large language modelsmulti-agent reasonerfragment-level editsretrieval-augmented generationGroup Relative Policy Optimizationmulti-property constraintsmolecular generationphysicochemical propertiesQEDLogPMolecular WeightHOMOLUMO

More in Generative AI

View all
M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints | Paperchime