Getting Started#

This page is a practical quickstart for the Neurosnap SDK.

Why Neurosnap SDK?#

The Neurosnap SDK gives you a Python-native workflow for structural biology, bioinformatics, and cheminformatics tasks. For many common sequence and structure workflows, it can act as a reliable, free, and open alternative to PyRosetta.

Install#

# stable release
pip install -U neurosnap

# latest version from GitHub
pip install -U git+https://github.com/NeurosnapInc/neurosnap.git

# optional extras for structure clustering workflows
pip install -U "neurosnap[clusterprot]"

Protein Class Basics#

The neurosnap.protein module provides a feature-rich Protein class for loading, inspecting, and analyzing structures.

from neurosnap.protein import Protein

# Accepts local files, PDB IDs, and UniProt IDs
prot = Protein("1CRN")

print(prot)
print("Models:", prot.models())
print("Chains:", prot.chains())

# Example structure-aware calculations
dmat = prot.calculate_distance_matrix()
rog = prot.calculate_rog()
print("Distance matrix shape:", dmat.shape)
print("Radius of gyration:", rog)

MSA Module Basics#

The neurosnap.msa module covers sequence-alignment I/O and common MSA operations.

from neurosnap.msa import read_msa, consensus_sequence

# Stream records from FASTA/A3M
records = list(read_msa("example.a3m", size=500))
names, seqs = zip(*records)

print("Loaded sequences:", len(records))
print("Consensus:", consensus_sequence(list(seqs)))

Algorithm Highlights#

Beyond core modules, the SDK includes production-ready algorithm components.

ClusterProt#

neurosnap.algos.clusterprot clusters proteins by structural similarity and produces 2D/1D projections for analysis.

from neurosnap.algos.clusterprot import ClusterProt

results = ClusterProt(
  ["a.pdb", "b.pdb", "c.pdb", "d.pdb", "e.pdb"],
  proj_1d_algo="umap",
)
print("Cluster labels:", results["cluster_labels"])

EvoEF2#

neurosnap.algos.evoef2 provides a fast Python implementation of EvoEF2 for stability and interface scoring. For many scoring-driven workflows, this is a powerful free/open alternative to PyRosetta-based setups.

from neurosnap.algos.evoef2 import calculate_stability, calculate_binding
from neurosnap.protein import Protein

stability = calculate_stability("complex.pdb")
# split1/split2 must be exact chain IDs from the structure (case-sensitive).
# You can inspect available IDs via:
chain_ids = Protein("complex.pdb").chains()
print("Available chains:", chain_ids)
# Example: partner 1 is chain A, partner 2 is chain B.
binding = calculate_binding("complex.pdb", split1=["A"], split2=["B"])

print("Stability total:", stability["total"])
print("Binding DG:", binding["dg_bind"]["total"])

Next Steps#

Continue with Neurosnap SDK for full API/module reference.
Explore additional examples in the example_notebooks directory.