Synthetic Accessibility: Definition, Importance, and How to Assess It with Neurosnap

Written by Keaun Amani

Published 2025-9-18

What Is Synthetic Accessibility

Synthetic Accessibility (SA) refers to how easy or difficult it is to actually make (i.e. synthesize) a given small molecule in the lab, given the limitations of synthetic chemistry (available building blocks, reaction types, stereochemistry, complex scaffolds, etc.). It is a practical metric: a molecule may be promising in silico (activity, binding, ADMET predictions, etc.), but if it is too hard to make, that can block progress.

A commonly used SA scoring method is that of Ertl & Schuffenhauer (2009), which assigns an SA score from 1 (“very easy to synthesize”) to 10 (“very difficult”). This system combines two main contributions:

Often, SA is treated as a continuous score (not just easy vs hard), to allow ranking among candidates. (SpringerLink)

Why Synthetic Accessibility Matters

In small molecule biology and drug discovery, synthetic accessibility matters for several reasons:

  1. Feasibility and Cost If a molecule is very difficult to synthesize, the cost in time, reagents, labor, purification, etc., can be prohibitive. Projects with many hard‐to‐synthesize leads may stall due to synthetic bottlenecks.

  2. Throughput & Iteration Drug discovery is iterative: you design or screen molecules, test them, then refine. If synthetic difficulties reduce the rate at which molecules can be made, this slows down the cycle of hypothesis → synthesis → testing → optimization.

  3. Scale and Manufacturability Even when a small‐scale synthesis is possible, difficulties may multiply when scaling up (batch consistency, yields, reaction complexity, cost of starting materials). What is feasible at milligram scale may not be at gram or kilogram scale.

  4. Integration with ADMET / Toxicity / Cost Trade-offs Often, small molecule design needs to optimize many factors: potency, selectivity, toxicity, solubility, stability, pharmacokinetics, and synthetic accessibility. A highly potent molecule that is nearly impossible to make is less useful than a somewhat less potent one that can be made reliably and affordably.

  5. Generative Design & Computational Filtering In modern workflows, many molecules are proposed by computational tools or generative AI. Without synthetic accessibility filtering or estimation, many proposed compounds may never be synthesizable. Including SA metrics improves “realism” of proposed molecules. (BioMed Central)

  6. Risk Mitigation Early assessment of synthetic difficulty helps avoid wasted investment (time, money) in molecules that later prove impractical. It allows prioritization of molecules not only for biological promise but also for manufacturability and synthetic risk.

How Synthetic Accessibility Is Determined / Estimated

Since assessing actual synthetic ease in wet‐lab is expensive, computational proxies are used. Some common methods / considerations:

A widely used computational implementation is RDKit’s sascorer.py, based on Ertl & Schuffenhauer’s work. (GitHub)

How to Determine Synthetic Accessibility Using Neurosnap

Neurosnap has tools which can help estimate or predict synthetic accessibility. Here’s how to use them, what they provide, and how to interpret their outputs.

  1. eTox (Drug Toxicity Prediction Service)

    • Predicts two relevant metrics: toxicity probability (0–1) and synthetic accessibility (from 1 to 10). In this scale, 1 = easy to make, 10 = very difficult to synthesize.
    • So, via eTox you can directly obtain a predicted SA score in the familiar Ertl & Schuffenhauer‐style scale. This lets you judge whether a molecule is likely to be feasible or too challenging.
  2. Mordred (Molecular Descriptor Calculator)

    • Does not directly compute SA. Instead, it produces ~1,614 molecular descriptors (constitutional, topological, geometrical, charge, etc.). Some of those descriptors correlate with synthetic difficulty.

    • Examples of descriptors Mordred produces that are indirectly informative of synthetic accessibility include:

      • Size & Complexity: number of atoms (heavy atoms), molecular weight, Bertz complexity index (BertzCT), etc.
      • Functional groups: counts of heteroatoms (e.g., N, O), counts of double/triple/aromatic bonds, acids/bases.
      • Structural features: counts of spiro atoms, bridgeheads; ratio of hybridizations; ring descriptors.
      • Graph complexity: measures of branching (Zagreb indices, Wiener path number, etc.).
    • Because Mordred does not output a SA score, you can use its descriptors to build or feed into a predictive model (if you or your team have one) or use them heuristically to flag molecules that are likely to be difficult.

Interpreting SA with Neurosnap

Descriptor type What indicates more synthetic difficulty
High BertzCT More complex connectivity / larger fragments → harder building up chemically
Many spiro / bridgehead atoms Indications of rigid or unusual ring structure that may be challenging in synthesis
Many heteroatoms, triple bonds, unusual functional groups May require special reagents or conditions; more protecting group work, etc.
Large molecular weight / many heavy atoms / many rings Overall more complexity and synthetic steps

A Suggested Workflow Using Neurosnap to Assess Synthetic Accessibility

Here’s a suggested workflow for someone designing or evaluating small molecules using Neurosnap:

  1. Generate or gather candidate molecules you are interested in (SMILES, structure files etc.).

  2. Run eTox on these molecules:

    • Get toxicity prediction (0‑1)
    • Get SA prediction (1‑10)
    • Flag molecules with high toxicity or high SA score (i.e. difficult to synthesize) for further scrutiny or perhaps discard from early‐stage prioritization.
  3. Run Mordred descriptor calculation in parallel (or for all candidates / for those flagged by eTox) to get the set of molecular descriptors.

  4. Analyze Mordred descriptors for synthetic complexity indicators as above (e.g. high BertzCT, many bridgehead/spiro atoms, complex ring systems, etc.).

  5. Optionally, compute RDKit’s SA_Score (if you have access / pipeline for that) to compare or calibrate with eTox’s SA output.

  6. Rank/prioritize molecules combining synthetic accessibility with other important metrics: potency, predicted toxicity, solubility, cost, novelty, etc. Use a multi‐objective metric or filtering strategy.

  7. Iterate & refine: if many molecules are flagged as hard (SA high), consider modifying molecular design: remove / simplify rare fragments / reduce functional group count / avoid complex ring systems / reduce stereochemical complexity.

Advantages & Pitfalls

Conclusion

Synthetic Accessibility is a critical lens through which to evaluate small molecules in drug discovery and small‐molecule biology. It integrates complexity, synthetic risk, and practical feasibility, and is essential for realistic prioritization of candidates.

With Neurosnap’s tools (eTox for direct SA scoring, Mordred for descriptor‐based proxies), teams can efficiently assess SA early in the design process. By combining those tools in a thoughtful workflow, one can reduce wasted effort, improve the realism of proposed molecules, and accelerate progress toward molecules that both work biologically and can actually be made.


References & Further Reading

Explore more posts

Unlocking the Future of Healthcare: Bioinformatics and Precision Medicine

By Amélie Lagacé-O'Connor

What Is Inverse Folding & How To Practically Apply It

By Keaun Amani

Boltz-2: Fast, Controllable & Physically-Grounded Binding-Affinity Prediction – and How It Leaps Past Boltz-1

By Danial Gharaie Amirabadi

MMseqs2: Deep understanding of MSA Generation & Optimization

By Danial Gharaie Amirabadi

From Density to Atoms: Deep Learning Tools Advancing Cryo-EM

By Danial Gharaie Amirabadi

Achieving AlphaFold3-Level Accuracy with Open-Source Boltz-1

By Danial Gharaie Amirabadi

Accelerate your lab's
research today

Register for free — upgrade anytime.

Interested in getting a license? Contact Sales.

Sign up free