Use MIF-ST

Official Neurosnap webserver for accessing MIF-ST online.

Overview

MIF-ST is a new and powerful inverse folding model that is capable of not only predicting the amino acids of a protein structure, but also certain chains, and complexes. Additionally, MIF-ST can be used as a way to create functional homologs / mutants of existing proteins by inverse folding their structures and sampling the sequence space. Unlike other inverse folding models, MIF-ST also leverages sequence data in a novel way that allows it to accurately rank input sequences.

Neurosnap Overview

The MIF-ST online webserver allows anybody with a Neurosnap account to run and access MIF-ST, no downloads required. Information submitted through this webserver is kept confidential and never sold to third parties as detailed by our strong terms of service and privacy policy.

View Paper

Features

  • Allows you to inverse fold any protein.
  • Excellent at predicting effects of mutations or variants.
  • Includes per sequence metrics such as an overall score and sequence recovery.
  • Supports different sampling methods and options.

Statistics

Neurosnap periodically calculates runtime statistics based on job execution data. These estimates provide a general guideline for how long your job may take, but actual runtimes can vary significantly depending on factors like input size or settings used.

Statistic Value
Credit Usage Rate loading...
Estimated Total Cost loading...
Runtime Mean loading...
Runtime Median loading...
Runtime Standard Deviation loading...
Runtime 90th Percentile loading...
Runtime Longest loading...

API Request

Access MIF-ST using the Neurosnap API by sending a request using any programming language with HTTP support. To safely generate an API key, visit the API tab of your overview page.

Job Note

Provide a name or description for your job to help you organize and track its results. This input is solely for organizational purposes and does not impact the outcome of the job.

Configuration & Options

Service Inputs

The input protein structure to predict the amino acid sequence of. Acceptable input file formats include pdb only.

Design Options

The number of output sequences to generate.

The percentage of amino acids to mask from the model. We typically recommend between 0.3 to 0.75. Lower values means less amino acids will be masked from the model resulting in output sequences that are more similar to the input structure. MIF-ST typically struggles when values are less than 0.3 as the model is trained to be heavily dependent on sequence information.

The sampling temperature to use (recommended is 0.9). Lower values will result in less randomness and more conserved sequences, higher values will result in greater diversity and randomness. For more details visit https://docs.cohere.com/docs/temperature

The top k number of amino acids to sample from per residue (recommended is 4). Greater values will result in more diverse sequences, while lower values will result in more conservative estimates. Setting this to 0 will disable top k sampling.

Nucleus (top-p) dynamic sampling method (recommended is 0.5). Uses the top amino acids who's sum of likelihoods do not exceed this value. Greater values will result in more diverse sequences, while lower values will result in more conservative estimates. Setting this to 0 will disable top p sampling. For more details visit https://docs.cohere.com/docs/controlling-generation-with-top-k-top-p

Ready to submit your job?

Once you're done just hit the submit button below and let us do the rest.

To submit a job please login or register an account.