ProteinsESMFold

What is ESMFold? A Complete Guide to AI Protein Structure Prediction

Learn how ESMFold uses protein language models to predict 3D structure from sequence in seconds. Compare to AlphaFold, understand pLDDT scores, and try it via API.

Ryan Bethencourt
March 17, 2026
8 min read

What Is ESMFold?

ESMFold is a protein structure prediction model developed by Meta AI Research. Unlike earlier approaches that rely on multiple sequence alignments (MSAs) to infer evolutionary constraints, ESMFold uses a large protein language model called ESM-2 to predict 3D structure directly from a single amino acid sequence. The result is a method that runs up to 60 times faster than AlphaFold2 while producing competitive accuracy on many targets.

The model was published in the 2023 Science paper by Lin et al. and represents a significant shift in how structural biology approaches the folding problem. By embedding evolutionary information implicitly within the language model weights, ESMFold removes the need for the expensive MSA search step that dominates runtime in other tools.

Why ESMFold Matters for Researchers and Developers

Traditional protein structure prediction pipelines require searching large sequence databases to build MSAs, a process that can take minutes to hours per protein. ESMFold bypasses this entirely. You provide a sequence, and within seconds you receive a predicted 3D structure with per-residue confidence scores.

  • Speed: predict a structure in 5 to 30 seconds instead of minutes to hours
  • Simplicity: single sequence input, no database dependencies
  • Scalability: practical for proteome-scale prediction
  • API-ready: accessible programmatically through SciRouter without managing GPU infrastructure

How ESMFold Works: The Language Model Approach

ESMFold is built on ESM-2, a 15-billion parameter protein language model trained on millions of protein sequences. During training, the model learns to predict masked amino acids from context, which forces it to internalize patterns of co-evolution, structural contacts, and biochemical properties.

The structure prediction module takes the internal representations (attention maps and embeddings) from ESM-2 and feeds them into a folding trunk similar to AlphaFold's Evoformer, but operating on single-sequence representations rather than MSA features. The trunk iteratively refines a 3D coordinate prediction for each residue.

Understanding pLDDT Confidence Scores

Every ESMFold prediction includes a per-residue pLDDT (predicted Local Distance Difference Test) score ranging from 0 to 100. This score tells you how confident the model is in the local structure around each residue:

  • 90–100: Very high confidence. Structure is likely accurate at the atomic level.
  • 70–90: Good confidence. Backbone topology is reliable; side-chain positions may vary.
  • 50–70: Low confidence. Fold topology may be correct, but details are uncertain.
  • Below 50: Very low confidence. Often indicates intrinsically disordered regions.
Tip
pLDDT scores below 50 are actually useful: they can identify intrinsically disordered regions in your protein, which is valuable information for downstream experiments.

ESMFold vs AlphaFold vs Boltz-2: Quick Comparison

Choosing between protein folding tools depends on your priorities. Here is a practical comparison of the three leading options:

  • ESMFold: Fastest (seconds). Single sequence input. Best for rapid screening, proteome-scale analysis, and API-driven workflows.
  • AlphaFold2: Highest single-chain accuracy. Requires MSA (slow). Best for high-stakes individual structure prediction.
  • Boltz-2: Supports protein complexes and multimers. GPU-intensive. Best when you need to model protein-protein or protein-ligand interactions.

For a detailed side-by-side analysis, see our full comparison of ESMFold, AlphaFold2, and Boltz-2.

Using ESMFold Through the SciRouter API

SciRouter provides hosted ESMFold inference so you do not need to manage GPU instances or model weights. Submit a sequence, poll for results, and receive a PDB file with confidence scores. Here is a working example:

Predict structure with ESMFold via SciRouter
import requests
import time

API_KEY = "sk-sci-your-api-key"
BASE = "https://api.scirouter.ai/v1"
headers = {"Authorization": f"Bearer {API_KEY}"}

# Submit a folding job
response = requests.post(
    f"{BASE}/proteins/fold",
    headers=headers,
    json={
        "sequence": "MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH",
        "model": "esmfold"
    }
)
job = response.json()
job_id = job["job_id"]
print(f"Job submitted: {job_id}")

# Poll for results
while True:
    result = requests.get(f"{BASE}/proteins/fold/{job_id}", headers=headers).json()
    if result["status"] == "completed":
        pdb_string = result["pdb"]
        mean_plddt = result["mean_plddt"]
        print(f"Folding complete. Mean pLDDT: {mean_plddt:.1f}")
        with open("prediction.pdb", "w") as f:
            f.write(pdb_string)
        break
    elif result["status"] == "failed":
        print(f"Job failed: {result['error']}")
        break
    time.sleep(2)
Note
The submit-and-poll pattern is necessary because protein folding is a GPU-intensive operation that typically takes 5 to 30 seconds. The API returns a job ID immediately so your application is not blocked.

When to Choose ESMFold

ESMFold is the right tool when speed and simplicity matter more than marginal gains in accuracy. Common use cases include:

  • High-throughput screening of thousands of sequences
  • Quick structural assessment during protein engineering workflows
  • Building automated pipelines where latency is critical
  • Identifying disordered regions using pLDDT scores
  • Generating initial structural hypotheses before experimental validation

Try ESMFold directly from the SciRouter tools page, or explore Boltz-2 if you need complex prediction capabilities.

Ready to predict your first protein structure? Follow our step-by-step ESMFold Python tutorial to go from sequence to structure in under 10 lines of code, or sign up for a free SciRouter API key to get started immediately.

Frequently Asked Questions

Is ESMFold free?

ESMFold is an open-source model released by Meta AI under a permissive license. You can run it locally if you have GPU resources, or access it through SciRouter's API with 500 free credits per month.

How accurate is ESMFold?

ESMFold achieves comparable accuracy to AlphaFold2 on well-folded single-chain proteins, particularly those with high-confidence pLDDT scores (above 70). For proteins with fewer homologs in sequence databases, accuracy may be lower than MSA-based methods.

What is pLDDT?

pLDDT (predicted Local Distance Difference Test) is a per-residue confidence score from 0 to 100. Scores above 90 indicate very high confidence in the predicted structure. Scores between 70 and 90 suggest good backbone prediction. Scores below 50 typically indicate disordered or unstructured regions.

What is the maximum sequence length for ESMFold?

ESMFold can handle sequences up to approximately 1024 residues with reasonable inference times on a single A100 GPU. Longer sequences require more memory and may time out. For very long proteins, consider splitting into domains.

Can ESMFold predict protein complexes?

ESMFold is designed for single-chain structure prediction. It cannot natively predict multi-chain complexes or protein-protein interactions. For complex prediction, consider Boltz-2, which supports multimer modeling.

Try It Free

No Login Required

Try this yourself

500 free credits. No credit card required.