How to Use Foldseek Structural Clustering

Commercially Available Online Web Server

Use Foldseek Structural Clustering online for protein structure clustering and redundancy reduction at scale.

Foldseek Structural Clustering groups related protein structures and identifies representative members for each cluster. The Foldseek paper showed that protein structures can be searched extremely quickly by encoding tertiary interactions as sequences over a structural alphabet, making large-scale organization of structural libraries practical rather than prohibitively slow.

On Neurosnap, researchers upload many Input Structures and choose an Alignment Type that reflects whether clustering should emphasize three-dimensional geometry alone or geometry together with sequence. The workflow is useful for reducing redundancy in AlphaFold collections, organizing design trajectories, and separating distinct fold families inside mixed structural datasets.

The result is a comparative map of the collection rather than a one-off alignment. Cluster anchors, cluster members, and representative sequences make it easier to build nonredundant datasets for downstream annotation, benchmarking, or manual review.

How Foldseek Structural Clustering Works

Foldseek's main methodological innovation is speed. Instead of relying only on expensive all-atom structural superposition, it represents local tertiary interactions with a learned structural alphabet and then applies high-performance search ideas from sequence analysis to structure comparison. In the original paper, that delivered speedups of several orders of magnitude while retaining strong sensitivity.

On Neurosnap, Sensitivity, Minimum Sequence Identity, TM-Score Threshold, and LDDT Threshold determine how permissive the clustering is. Looser settings are useful for discovering distant structural neighbors, whereas stricter thresholds are better when the goal is deduplication or definition of tight conformational families.

Researchers generally use the output to choose representatives, inspect unexpectedly merged or split groups, and build cleaner downstream datasets for phylogenetics, fold annotation, or large-scale benchmark design.

What is Neurosnap?

Neurosnap is the leading platform for bioinformatics and computational science focused on expanding access to powerful modeling and simulation tools. Because many state-of-the-art machine learning systems remain complex to install, configure, and scale, Neurosnap offers a clean, browser-based workspace that removes the burden of infrastructure management, dependency conflicts, and command-line tooling.

Built for biologists, chemists, and cross-disciplinary scientists, the platform enables advanced computational workflows without requiring expertise in software engineering or cloud architecture. Researchers can launch analyses through an intuitive interface, connect programmatically through a comprehensive API, and rely on automated resource management to scale workloads efficiently. By taking care of the underlying compute and operational complexity, Neurosnap allows teams to devote their energy to scientific progress and faster iteration. Security and data protection remain foundational principles, with clear safeguards outlined in our Terms of Use and Privacy Policy to ensure your work stays protected.

Advancing Discovery with Foldseek Structural Clustering on Neurosnap

Using Foldseek Structural Clustering on Neurosnap could drastically accelerate large-scale protein structure clustering and representative selection from structural collections.

  • High-throughput structure organization: Foldseek clustering is practical on large structural collections that would be tedious to curate manually.
  • Geometry-driven grouping: Alignment Type lets users decide how much sequence information should influence the clustering alongside structure.
  • Threshold control: Sensitivity, identity, TM-score, and LDDT settings make it easy to move between exploratory clustering and strict redundancy reduction.
  • Representative-centric output: Cluster anchors and member summaries support nonredundant dataset construction and fast structural triage.

How to Use Foldseek Structural Clustering on Neurosnap

To harness the capabilities of Foldseek Structural Clustering, researchers can follow this streamlined workflow within Neurosnap:

  1. Access Neurosnap: Start by logging in to the Neurosnap website.
  2. Select Tool: From the list of available tools, choose Foldseek Structural Clustering.
  3. Provide Inputs: Provide all the inputs specified within the submission panel and optionally configure the tool as desired.
  4. Run Tool: Submit the Foldseek Structural Clustering job and Neurosnap will execute it in the cloud, automatically notifying you as soon as your results are ready.
  5. Review Output: Explore your results through rich visualizations, including figures, plots, and interactive views designed to help you analyze findings with clarity and confidence.

Similar Services

Explore related tools that support similar research workflows:


Proudly supporting 50,000+ scientists worldwide, including 7,000+ leading biotech and global biopharma organizations.

Making Scientific Research
Faster & Easier

Register for free — upgrade anytime.

Interested in getting a license? Contact Sales.

Try Free