Accelerate your lab's
research today
Register for free — upgrade anytime.
Interested in getting a license? Contact Sales.
Sign up freeWritten by Keaun Amani
Published 2025-9-18
Synthetic Accessibility (SA) refers to how easy or difficult it is to actually make (i.e. synthesize) a given small molecule in the lab, given the limitations of synthetic chemistry (available building blocks, reaction types, stereochemistry, complex scaffolds, etc.). It is a practical metric: a molecule may be promising in silico (activity, binding, ADMET predictions, etc.), but if it is too hard to make, that can block progress.
A commonly used SA scoring method is that of Ertl & Schuffenhauer (2009), which assigns an SA score from 1 (“very easy to synthesize”) to 10 (“very difficult”). This system combines two main contributions:
Often, SA is treated as a continuous score (not just easy vs hard), to allow ranking among candidates. (SpringerLink)
In small molecule biology and drug discovery, synthetic accessibility matters for several reasons:
Feasibility and Cost If a molecule is very difficult to synthesize, the cost in time, reagents, labor, purification, etc., can be prohibitive. Projects with many hard‐to‐synthesize leads may stall due to synthetic bottlenecks.
Throughput & Iteration Drug discovery is iterative: you design or screen molecules, test them, then refine. If synthetic difficulties reduce the rate at which molecules can be made, this slows down the cycle of hypothesis → synthesis → testing → optimization.
Scale and Manufacturability Even when a small‐scale synthesis is possible, difficulties may multiply when scaling up (batch consistency, yields, reaction complexity, cost of starting materials). What is feasible at milligram scale may not be at gram or kilogram scale.
Integration with ADMET / Toxicity / Cost Trade-offs Often, small molecule design needs to optimize many factors: potency, selectivity, toxicity, solubility, stability, pharmacokinetics, and synthetic accessibility. A highly potent molecule that is nearly impossible to make is less useful than a somewhat less potent one that can be made reliably and affordably.
Generative Design & Computational Filtering In modern workflows, many molecules are proposed by computational tools or generative AI. Without synthetic accessibility filtering or estimation, many proposed compounds may never be synthesizable. Including SA metrics improves “realism” of proposed molecules. (BioMed Central)
Risk Mitigation Early assessment of synthetic difficulty helps avoid wasted investment (time, money) in molecules that later prove impractical. It allows prioritization of molecules not only for biological promise but also for manufacturability and synthetic risk.
Since assessing actual synthetic ease in wet‐lab is expensive, computational proxies are used. Some common methods / considerations:
Fragment / Substructure Frequencies: How often fragments appear in known compounds (databases). More frequent fragments tend to indicate easier availability of building blocks and reaction precedents. (As in Ertl & Schuffenhauer’s SA score.) (BioMed Central)
Molecular Complexity Metrics: Number of atoms (especially heavy atoms), molecular weight, ring complexity (size, fused/rings, bridgehead, spiro centers), stereochemistry, number of functional groups, unusual bond types (double, triple, aromatic, etc.).
Topological and Graph Descriptors: Including measures of branching, connectivity, presence of heteroatoms, presence of strained rings, etc.
Descriptors of Synthetic Strain / Unusual Features: E.g. high sp^3 carbon content, chiral centers, complex or rare scaffolds, non‐standard ring systems.
Symmetry, redundancy, ease of assembling subunits: More symmetric or modular molecules tend to be easier, because parts can be reused or simpler routes may be found.
Retrosynthetic Modeling: More advanced methods try to propose actual synthetic routes backward from the target, using known reaction data to see whether a plausible route exists. These take more compute/time but can provide stronger evidence. (Iktos)
Empirical / Expert Judgement: Medicinal chemists’ experience remains a strong baseline; computational scores are often benchmarked against expert assessments. (BioMed Central)
A widely used computational implementation is RDKit’s sascorer.py
, based on Ertl & Schuffenhauer’s work. (GitHub)
Neurosnap has tools which can help estimate or predict synthetic accessibility. Here’s how to use them, what they provide, and how to interpret their outputs.
eTox (Drug Toxicity Prediction Service)
Mordred (Molecular Descriptor Calculator)
Does not directly compute SA. Instead, it produces ~1,614 molecular descriptors (constitutional, topological, geometrical, charge, etc.). Some of those descriptors correlate with synthetic difficulty.
Examples of descriptors Mordred produces that are indirectly informative of synthetic accessibility include:
Because Mordred does not output a SA score, you can use its descriptors to build or feed into a predictive model (if you or your team have one) or use them heuristically to flag molecules that are likely to be difficult.
If using eTox, lower SA score (closer to 1) is good / easier. Higher (closer to 10) means harder. Balance this alongside predicted activity, toxicity, ADMET etc.
If using Mordred, you’ll often look for “red flags” in descriptors:
Descriptor type | What indicates more synthetic difficulty |
---|---|
High BertzCT | More complex connectivity / larger fragments → harder building up chemically |
Many spiro / bridgehead atoms | Indications of rigid or unusual ring structure that may be challenging in synthesis |
Many heteroatoms, triple bonds, unusual functional groups | May require special reagents or conditions; more protecting group work, etc. |
Large molecular weight / many heavy atoms / many rings | Overall more complexity and synthetic steps |
Here’s a suggested workflow for someone designing or evaluating small molecules using Neurosnap:
Generate or gather candidate molecules you are interested in (SMILES, structure files etc.).
Run eTox on these molecules:
Run Mordred descriptor calculation in parallel (or for all candidates / for those flagged by eTox) to get the set of molecular descriptors.
Analyze Mordred descriptors for synthetic complexity indicators as above (e.g. high BertzCT, many bridgehead/spiro atoms, complex ring systems, etc.).
Optionally, compute RDKit’s SA_Score (if you have access / pipeline for that) to compare or calibrate with eTox’s SA output.
Rank/prioritize molecules combining synthetic accessibility with other important metrics: potency, predicted toxicity, solubility, cost, novelty, etc. Use a multi‐objective metric or filtering strategy.
Iterate & refine: if many molecules are flagged as hard (SA high), consider modifying molecular design: remove / simplify rare fragments / reduce functional group count / avoid complex ring systems / reduce stereochemical complexity.
Advantages:
Pitfalls / Limitations:
Synthetic Accessibility is a critical lens through which to evaluate small molecules in drug discovery and small‐molecule biology. It integrates complexity, synthetic risk, and practical feasibility, and is essential for realistic prioritization of candidates.
With Neurosnap’s tools (eTox for direct SA scoring, Mordred for descriptor‐based proxies), teams can efficiently assess SA early in the design process. By combining those tools in a thoughtful workflow, one can reduce wasted effort, improve the realism of proposed molecules, and accelerate progress toward molecules that both work biologically and can actually be made.
By Amélie Lagacé-O'Connor
By Keaun Amani
By Danial Gharaie Amirabadi
By Danial Gharaie Amirabadi
By Danial Gharaie Amirabadi
By Danial Gharaie Amirabadi
Register for free — upgrade anytime.
Interested in getting a license? Contact Sales.
Sign up free