ProteinsBoltz-2

What is Boltz-2? Next-Gen Protein Complex Prediction Explained

Boltz-2 is an open-source biomolecular complex predictor from MIT. Learn how it predicts protein-ligand, protein-DNA, and antibody-antigen structures, and how to use it via API.

Ryan Bethencourt
March 25, 2026
8 min read

What Is Boltz-2?

Boltz-2 is an open-source biomolecular structure prediction model developed by researchers at MIT. It predicts the 3D structure of molecular complexes involving proteins, small molecules, DNA, RNA, and combinations of these biomolecules. Unlike single-chain protein folding tools such as ESMFold or AlphaFold2, Boltz-2 is designed from the ground up to model interactions between multiple molecular entities.

The model builds on the architecture introduced by AlphaFold3 but with a critical difference: Boltz-2 is fully open-source with publicly available weights. This makes it the first high-accuracy complex predictor that researchers can freely download, modify, and deploy without restrictions.

Why Complex Prediction Matters

Most biological processes involve interactions between molecules, not isolated proteins. Drug binding, gene regulation, immune recognition, and enzyme catalysis all depend on how multiple molecules come together in 3D space. Predicting these interactions computationally is essential for:

  • Drug discovery: understanding how a drug candidate binds its protein target at atomic resolution
  • Antibody engineering: predicting how an antibody recognizes and binds an antigen epitope
  • Gene regulation: modeling how transcription factors interact with DNA sequences
  • Vaccine design: predicting antigen-immune receptor interactions for immunogen design
  • Enzyme engineering: modeling substrate binding to guide active site mutations

How Boltz-2 Works

Boltz-2 uses a diffusion-based generative architecture. Rather than directly regressing atomic coordinates like earlier methods, it learns to denoise a random 3D structure into the correct complex geometry through an iterative refinement process. The model processes multiple input types (protein sequences, SMILES strings for ligands, nucleic acid sequences) through a unified representation.

Architecture Overview

  • Input encoding: each chain or molecule is tokenized and encoded into a sequence representation
  • Pairwise attention: a transformer module computes interactions between all pairs of tokens across chains
  • Structure module: iterative coordinate refinement using equivariant transformations
  • Diffusion denoising: the model generates structures by progressively denoising from random coordinates
  • Confidence prediction: per-residue and per-interface confidence scores accompany each prediction
Note
The diffusion-based approach means Boltz-2 can generate multiple plausible conformations for the same complex, which is useful for capturing conformational diversity or uncertain binding modes.

Supported Input Types

Boltz-2 handles a wide range of biomolecular complex types:

  • Protein + small molecule: drug-target binding prediction
  • Protein + protein: multimer assembly, antibody-antigen complexes
  • Protein + DNA/RNA: transcription factor-DNA interactions, ribosome complexes
  • Multi-component complexes: protein + ligand + cofactor combinations

Using Boltz-2 Through the SciRouter API

SciRouter provides hosted Boltz-2 inference on GPU infrastructure, so you do not need to manage model weights or GPU instances. Here is a working example that predicts the structure of a protein-ligand complex:

Predict a protein-ligand complex with Boltz-2
import requests
import time

API_KEY = "sk-sci-your-api-key"
BASE = "https://api.scirouter.ai/v1"
headers = {"Authorization": f"Bearer {API_KEY}"}

# Submit a complex prediction job
response = requests.post(
    f"{BASE}/proteins/complex",
    headers=headers,
    json={
        "model": "boltz2",
        "chains": [
            {
                "type": "protein",
                "sequence": "MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH"
            }
        ],
        "ligands": [
            {
                "smiles": "CC(=O)OC1=CC=CC=C1C(=O)O"
            }
        ]
    }
)
job = response.json()
print(f"Complex prediction submitted: {job['job_id']}")

# Poll for results
while True:
    result = requests.get(
        f"{BASE}/proteins/complex/{job['job_id']}",
        headers=headers
    ).json()
    if result["status"] == "completed":
        print(f"Prediction complete. Confidence: {result['confidence']:.2f}")
        with open("complex.pdb", "w") as f:
            f.write(result["pdb"])
        break
    elif result["status"] == "failed":
        print(f"Failed: {result['error']}")
        break
    time.sleep(10)
Tip
Complex prediction is more compute-intensive than single-chain folding. Expect 30 seconds to 5 minutes per job depending on the total number of residues and ligand complexity.

Common Use Cases

Boltz-2 is particularly valuable in scenarios where you need to model interactions rather than isolated structures:

  • Predicting drug binding poses when no experimental co-crystal structure exists
  • Modeling antibody-antigen interactions for therapeutic antibody design
  • Understanding how transcription factors bind DNA regulatory elements
  • Predicting protein-protein interfaces for designing protein-protein interaction inhibitors
  • Generating structural hypotheses for cryo-EM or crystallography experiments

Try Boltz-2 from the SciRouter tools page. For single-chain protein structure prediction where complex modeling is not needed, ESMFold provides faster results.

For a detailed comparison of Boltz-2 against AlphaFold3, read our Boltz-2 vs AlphaFold3 comparison.

Frequently Asked Questions

Is Boltz-2 better than AlphaFold3?

Boltz-2 achieves comparable accuracy to AlphaFold3 on protein-ligand and protein-protein complex benchmarks. On some targets Boltz-2 matches or exceeds AlphaFold3 performance, particularly for protein-small molecule complexes. The key advantage of Boltz-2 is that it is fully open-source with downloadable weights, while AlphaFold3 is only available through a restricted web server.

Is Boltz-2 free to use?

Yes. Boltz-2 is released under an open-source license by MIT. You can download the model weights and run it locally if you have GPU resources (an A100 or H100 is recommended). Alternatively, you can access Boltz-2 through the SciRouter API with free credits to get started, without managing GPU infrastructure.

What inputs does Boltz-2 accept?

Boltz-2 accepts protein sequences (amino acid strings), small molecules (SMILES strings), nucleic acid sequences (DNA/RNA), and combinations thereof. You can predict the structure of a protein-ligand complex, a protein-DNA complex, an antibody-antigen interaction, or a multi-chain protein assembly.

What is the maximum sequence length for Boltz-2?

Boltz-2 can handle complexes with a total of approximately 2,000 residues across all chains on an A100 GPU. Larger complexes require more GPU memory. For very large assemblies, consider splitting into subcomplexes or using a GPU with more memory (such as an H100 80GB).

Try It Free

No Login Required

Try this yourself

500 free credits. No credit card required.