Syn-MolOpt: A Synthesis Planning-Driven Molecular Optimization Method Using Data-Derived Functional Reaction Templates

Introduction

Drug development 과정에서 candidate molecule의 desired property를 향상시키기 위한 structural modification은 핵심 단계로 자리잡고 있다. 논문에서는 최근 deep learning 기반의 molecular optimization 알고리즘들이 in silico benchmark에서 좋은 성능을 보였으나, 대부분 molecular synthesizability를 충분히 고려하지 않아 optimization 결과로 얻은 compound를 실제로 합성하기 어렵다는 한계가 있다고 이야기한다.

저자들은 synthetic accessibility (SA) score를 metric으로 통합한 기존 방식이 predefined chemical heuristic rule에 의존하므로 prediction accuracy가 낮고, 실질적인 synthesis pathway를 제시하지 못한다고 지적한다. Computer-assisted synthesis planning (CASP)을 활용한 post-filtering 방식 역시 분자 한 개당 수 초에서 수 분이 소요되어 molecular optimization workflow에 통합하기에 비효율적이라고 설명한다.

논문에서는 이러한 한계를 해결하기 위해 synthesis planning을 molecular optimization에 직접 통합한 Syn-MolOpt를 제안한다. Data-derived functional reaction template library를 활용해 specific fragment를 transform하면서 desired property를 향상시키고, 동시에 reference synthesis route를 제공하는 것이 핵심 아이디어라고 주장한다.

Contribution

논문에서는 Syn-MolOpt의 주요 기여를 다음과 같이 정리하고 있다.

Property-specific functional reaction template library를 구축할 수 있는 general pipeline을 제시하였다고 설명한다. Predictive model을 구축할 수 있는 어떤 property에도 적용 가능하다고 주장한다.
Functional reaction template을 활용해 synthesis tree를 구성하는 과정에서 relevant structural fragment를 transform하며 specific property로 분자를 유도하는 Syn-MolOpt framework를 제안한다.
두 가지 toxicity 관련 task (GSK3β-Mutagenicity, GSK3β-hERG)와 두 가지 metabolism 관련 task (GSK3β-CYP3A4, GSK3β-CYP2C19)에서 Modof, HierG2G, SynNet 대비 우수한 성능을 보고한다.
Scoring function의 accuracy가 충분하지 않은 real-world 상황에서도 robust한 성능을 유지한다고 주장하며, 실용성을 강조한다.

Method

Functional Reaction Template Library 설계

논문에서는 functional reaction template library의 구축 과정을 세 단계로 설명하고 있다.

Figure 1. Overview of the functional reaction template library design process. A Functional substructure analysis: using SME to analyze the impacts of substructures on specific property; B Reaction template extraction: extracting reaction templates from open-source reaction datasets using RDChiral; and C Reaction template screening and management: screening reaction templates by substructure matching

Step A (Functional substructure analysis)에서는 mutagenicity와 같은 target property에 대해 Relational Graph Convolutional Network (RGCN) 기반의 consensus model을 구축한 뒤, substructure mask explanation (SME) method를 사용해 BRICS substructure, Murcko scaffold, functional group 각각의 contribution value를 산출한다고 설명한다. Positive attribution은 해당 property를 promote하는 substructure (toxic group), negative attribution은 detoxifying 효과를 가지는 substructure를 의미한다고 이야기한다.
Step B (Reaction template extraction)에서는 USPTO reaction dataset으로부터 RDChiral을 활용해 SMARTS retrosynthetic template을 추출한 후, forward template으로 변환한다고 설명한다.
Step C (Template filtering and management)에서는 세 단계의 filtering이 적용된다. 먼저 positively attributed substructure로 reactant-side를 screening해 toxic group을 포함한 template을 선별하고, 같은 substructure로 product-side를 screening하되 toxic group이 남아 있는 template은 제외한다. 마지막으로 negatively attributed substructure로 product-side를 screening해 detoxifying group이 도입되는 template을 추가로 추출한다고 설명한다.

저자들은 이 pipeline을 통해 reactant-side에 toxic group을 가지고 product-side에 detoxifying group을 가지는 template, 즉 toxic group을 transform할 수 있는 functional reaction template을 확보할 수 있다고 주장한다.

Syn-MolOpt Framework

논문에서는 Syn-MolOpt의 implementation을 model training stage와 molecular optimization stage로 나누어 설명하고 있다.

Figure 2. The overall workflow of Syn-MolOpt. A Model training stage; and B molecular optimization stage. During the model training stage, four neural networks were trained to predict the reaction action, the first reactant, the reaction template, and the second reactant. In the molecular optimization stage, these neural networks were used to predict the aforementioned four reaction elements respectively

논문에서는 compound의 synthesis pathway를 bottom-up synthesis tree로 modeling하며, 각 step을 Markov decision process로 정의한다고 설명한다.

\[P(S_{t+1} \mid S_t, \cdots, S_0) = P(S_{t+1} \mid S_t)\]

위 수식에서 $S_t$는 reaction step $t$에서의 synthesis tree state를 의미하며, intermediate compound가 결정되면 그 이후 reaction step은 현재 state만으로 추론 가능하다는 Markov property를 나타낸다.

각 step은 네 가지 element를 sampling하는 과정으로 구성된다고 이야기한다. Reaction action $R_{act}$, first reactant $R_{rct-1}$, reaction template $R_{rxn}$, second reactant $R_{rct-2}$가 그 element이다. $R_{act}$는 “add”, “expand”, “merge”, “end” 네 가지 type을 가진다고 설명한다.

“Add”는 새로운 reactant node를 $T_t$에 추가하고 reaction template에 따라 product node를 생성하는 action이다.
“Expand”는 가장 최근에 추가된 node를 first reactant로 사용해 reaction을 진행하는 action이다.
“Merge”는 두 개의 root node를 reactant로 하여 bimolecular reaction을 수행하고 두 synthesis subtree를 합치는 action이다.
“End”는 synthesis tree 구성이 완료되었음을 의미한다.

논문에서는 molecular optimization stage가 conditional synthesis tree generation 과정이라고 설명한다. Target molecule embedding과 가장 최근에 추가된 molecule embedding을 concatenate하여 current state embedding $E_{State-T}$를 구성한다.

\[E_{State-T} = \text{concat}(E_{\text{target}}, E_{\text{recent}})\]

위 수식에서 $E_{\text{target}}$은 optimization하고자 하는 target molecule의 embedding, $E_{\text{recent}}$는 synthesis tree에서 가장 최근에 추가된 molecule의 embedding을 나타낸다.

$E_{State-T}$를 input으로 사용해 $N_{act}$ network와 $N_{rct-1}$ network가 reaction action type과 first reactant embedding $E_{rct-1}$을 예측한다고 설명한다. $E_{rct-1}$은 preprocessed building block set에 대해 k-NN search를 통해 실제 reactant로 매핑된다고 이야기한다. 이후 $E_{State-T}$와 $E_{rct-1}$을 concatenate한 embedding이 $N_{rxn}$의 input으로 사용되어 reaction template을 선택하며, bimolecular reaction의 경우 $N_{rct-2}$가 second reactant embedding $E_{rct-2}$를 출력한다고 설명한다.

저자들은 reaction 결과로 얻어진 mid-product가 functional reaction template과 match되면 functional processing을 수행하여 specific fragment를 transform한다고 주장한다. 또한 synthesis tree의 root molecule embedding에 대해 genetic algorithm (GA)을 적용해 numerical optimization을 수행하며, GA는 4096 bit, radius 2의 Morgan fingerprint에 대해 동작한다고 설명한다.

Experiments and Results

Toxicity Optimization

논문에서는 GSK3β binding affinity를 유지하거나 향상시키면서 Mutag, hERG toxicity를 줄이는 multi-property optimization 실험을 수행하였다고 설명한다.

Table 1. Details of the molecules to be optimized on four multi-property optimization experiments

각 task별로 128개의 molecule을 optimization 대상으로 사용하며, top-N (N=1, 10, 100, 128) molecule의 평균 score로 성능을 평가하였다고 보고한다.

Table 2. Overall comparison on multi-property optimization of GSK3β-Mutag

GSK3β-Mutag task에서는 Syn-MolOpt가 top-10 Mutag score를 제외한 거의 모든 metric에서 최고 성능을 보였다고 설명한다. 특히 top-128 molecule의 평균 Mutag score를 초기 0.887에서 0.128로 크게 감소시키고, GSK3β score를 0.787에서 0.867로 향상시켰다고 보고한다. 반면 Modof, HierG2G, SynNet은 mutagenicity를 줄이는 과정에서 GSK3β binding affinity가 함께 하락하는 경향을 보였다고 이야기한다.

저자들은 Modof와 HierG2G로 최적화된 top-128 molecule을 AiZynthFinder로 retrosynthesis planning한 결과, synthesizable ratio가 각각 0.500과 0.651에 그쳤다고 보고한다. Syn-MolOpt와 SynNet은 자체적으로 synthesis route를 제공하므로 synthesizable ratio가 100%이지만, AiZynthFinder로 재평가한 경우에도 Syn-MolOpt는 0.703으로 SynNet (0.695)을 약간 상회하였다고 주장한다.

Figure 3. The top-1 molecule structures optimized by Modof, HierG2G, SynNet, and Syn-MolOpt on the GSK3β-Mutag multi-property optimization experiment

논문에서는 Syn-MolOpt가 도출한 top-1 molecule이 sulfonamide와 carboxyl 두 가지 detoxifying group을 포함하면서 GSK3β score 0.900, Mutag score 0.020을 달성하였다고 강조한다.

Figure 4. The synthetic route of the top-1 molecule optimized by Syn-MolOpt on the GSK3β-Mutag multi-property optimization experiment. The dotted box displays the functional reaction template used

저자들은 sulfonamide group이 commercially available building block에서 유래하며, carboxyl group은 functional reaction template을 통한 ester bond hydrolysis로 도입된다고 설명한다. Carboxyl group이 도입되는 과정에서 Mutag score가 점진적으로 감소하는 양상이 관찰되어 functional reaction template의 효과를 보여준다고 주장한다.

Table 3. Overall comparison on multi-property optimization of GSK3β-hERG

GSK3β-hERG task에서도 Syn-MolOpt는 top-1 hERG score와 top-10 GSK3β score를 제외한 모든 metric에서 가장 우수한 성능을 보였다고 보고한다. Top-128 molecule 기준으로 hERG toxicity를 효과적으로 감소시키면서 GSK3β binding affinity를 향상시킨 유일한 방법이라고 주장한다.

Metabolic Property Optimization

논문에서는 CYP3A4와 CYP2C19 inhibitor가 drug metabolism에 영향을 주어 안전성을 저하시킬 수 있다는 점을 근거로, 동일한 framework를 metabolic property optimization에 적용하였다고 설명한다.

Table 4. Overall comparison on multi-property optimization of GSK3β-CYP3A4 and GSK3β-CYP2C19

GSK3β-CYP3A4 task에서는 top-10 GSK3β score만 HierG2G보다 약간 낮았고, 다른 metric에서는 Syn-MolOpt가 최고 성능을 기록하였다고 보고한다. Top-1 molecule이 GSK3β score 0.920, CYP3A4 score 0.013을 달성하였다고 설명한다.

GSK3β-CYP2C19 task에서는 top-1 CYP2C19 score를 제외한 모든 metric에서 Syn-MolOpt가 우수한 성능을 보였다고 이야기한다. 저자들은 top-128 molecule 분석에서 Syn-MolOpt만이 두 property를 동시에 향상시키는 데 성공하였다고 주장한다.

Real-world Scenario Simulation

논문에서는 실제 molecular optimization 상황에서 사용 가능한 training data가 제한적이라는 점을 고려해, 20%의 training data로 retrain한 consensus model을 fitness function으로 사용하는 실험을 수행하였다고 설명한다.

Table 5. Performance comparison of the consensus models

20% data로 retrain한 Mutag와 hERG 모델의 ROC-AUC는 각각 0.837과 0.773으로, 전체 data로 학습한 모델 (0.901, 0.862) 대비 성능이 하락하였다고 보고한다. 이렇게 inaccurate한 scoring function을 사용한 상태에서 Syn-MolOpt와 SynNet의 성능을 비교하였다고 설명한다.

Figure 5. Performance comparison of Syn-MolOpt and SynNet in simulated real-world optimizations of A GSK3β-Mutag and B GSK3β-hERG

저자들은 최종 평가를 위해 highly accurate한 100% 학습 모델로 다시 Mutag, hERG score를 측정하였다고 설명한다. Syn-MolOpt의 top-1 molecule은 Mutag 0.054, hERG 0.094를 기록한 반면, SynNet은 각각 0.522, 0.355를 기록하였다고 보고한다. 이 결과는 functional reaction template을 활용한 fragment transformation이 inaccurate scoring feedback 하에서도 robust한 optimization 성능을 제공한다는 점을 보여준다고 해석한다.

저자들은 SynNet이 fitness function에만 의존하여 chemical space를 navigate하는 반면, Syn-MolOpt는 synthesis tree generation 과정에서 functional reaction template으로 fragment transformation을 명시적으로 수행하므로 scoring accuracy에 덜 민감하다고 주장한다.

Limitation

논문에서는 Syn-MolOpt의 한계로 다음과 같은 점을 언급하고 있다.

현재 multi-property optimization 실험에서는 한 가지 property에 대해서만 functional reaction template library를 구축하였다고 설명한다. 예를 들어 GSK3β-Mutag task에서는 Mutag에 대해서만 reaction template을 설계하였다고 이야기한다.
여러 property를 동시에 다룰 수 있는 universal functional reaction template library를 설계하면 추가적인 성능 향상이 가능할 것이라고 제안하지만, 본 연구에서는 이를 다루지 않았다.
Template filtering 단계에서 template 간 overlap이나 encompassing이 발생할 수 있어 manual intervention이 필요하다고 설명하고 있어, 완전한 자동화에는 추가적인 작업이 요구된다고 볼 수 있다.

Summary

Syn-MolOpt는 functional reaction template을 활용해 synthesis planning과 molecular optimization을 통합한 framework를 제시한다. SME 기반의 substructure attribution과 USPTO reaction template을 결합하여 property-specific reaction template library를 자동으로 구축하는 pipeline을 제안한 점이 주요 contribution이다. 네 가지 multi-property optimization task에서 기존 benchmark 모델인 Modof, HierG2G, SynNet 대비 우수한 성능과 높은 synthesizability를 보였다고 보고한다. 특히 scoring function이 부정확한 real-world 환경에서도 강건한 성능을 유지한다는 점이 강조되며, fragment-level structural optimization을 synthesis tree 구성과 결합한 접근이 실제 drug discovery 응용에 유용한 가능성을 보여준다고 해석한다.

References

Yin, Xiaodan, et al. “Syn-MolOpt: a synthesis planning-driven molecular optimization method using data-derived functional reaction templates.” Journal of Cheminformatics 17.1 (2025): 27.