What Is Boltz-2?
Boltz-2 is an open-source biomolecular structure prediction model developed by researchers at MIT. It predicts the 3D structure of molecular complexes involving proteins, small molecules, DNA, RNA, and combinations of these biomolecules. Unlike single-chain protein folding tools such as ESMFold or AlphaFold2, Boltz-2 is designed from the ground up to model interactions between multiple molecular entities.
The model builds on the architecture introduced by AlphaFold3 but with a critical difference: Boltz-2 is fully open-source with publicly available weights. This makes it the first high-accuracy complex predictor that researchers can freely download, modify, and deploy without restrictions.
Why Complex Prediction Matters
Most biological processes involve interactions between molecules, not isolated proteins. Drug binding, gene regulation, immune recognition, and enzyme catalysis all depend on how multiple molecules come together in 3D space. Predicting these interactions computationally is essential for:
- Drug discovery: understanding how a drug candidate binds its protein target at atomic resolution
- Antibody engineering: predicting how an antibody recognizes and binds an antigen epitope
- Gene regulation: modeling how transcription factors interact with DNA sequences
- Vaccine design: predicting antigen-immune receptor interactions for immunogen design
- Enzyme engineering: modeling substrate binding to guide active site mutations
How Boltz-2 Works
Boltz-2 uses a diffusion-based generative architecture. Rather than directly regressing atomic coordinates like earlier methods, it learns to denoise a random 3D structure into the correct complex geometry through an iterative refinement process. The model processes multiple input types (protein sequences, SMILES strings for ligands, nucleic acid sequences) through a unified representation.
Architecture Overview
- Input encoding: each chain or molecule is tokenized and encoded into a sequence representation
- Pairwise attention: a transformer module computes interactions between all pairs of tokens across chains
- Structure module: iterative coordinate refinement using equivariant transformations
- Diffusion denoising: the model generates structures by progressively denoising from random coordinates
- Confidence prediction: per-residue and per-interface confidence scores accompany each prediction
Supported Input Types
Boltz-2 handles a wide range of biomolecular complex types:
- Protein + small molecule: drug-target binding prediction
- Protein + protein: multimer assembly, antibody-antigen complexes
- Protein + DNA/RNA: transcription factor-DNA interactions, ribosome complexes
- Multi-component complexes: protein + ligand + cofactor combinations
Using Boltz-2 Through the SciRouter API
SciRouter provides hosted Boltz-2 inference on GPU infrastructure, so you do not need to manage model weights or GPU instances. Here is a working example that predicts the structure of a protein-ligand complex:
import requests
import time
API_KEY = "sk-sci-your-api-key"
BASE = "https://api.scirouter.ai/v1"
headers = {"Authorization": f"Bearer {API_KEY}"}
# Submit a complex prediction job
response = requests.post(
f"{BASE}/proteins/complex",
headers=headers,
json={
"model": "boltz2",
"chains": [
{
"type": "protein",
"sequence": "MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH"
}
],
"ligands": [
{
"smiles": "CC(=O)OC1=CC=CC=C1C(=O)O"
}
]
}
)
job = response.json()
print(f"Complex prediction submitted: {job['job_id']}")
# Poll for results
while True:
result = requests.get(
f"{BASE}/proteins/complex/{job['job_id']}",
headers=headers
).json()
if result["status"] == "completed":
print(f"Prediction complete. Confidence: {result['confidence']:.2f}")
with open("complex.pdb", "w") as f:
f.write(result["pdb"])
break
elif result["status"] == "failed":
print(f"Failed: {result['error']}")
break
time.sleep(10)Common Use Cases
Boltz-2 is particularly valuable in scenarios where you need to model interactions rather than isolated structures:
- Predicting drug binding poses when no experimental co-crystal structure exists
- Modeling antibody-antigen interactions for therapeutic antibody design
- Understanding how transcription factors bind DNA regulatory elements
- Predicting protein-protein interfaces for designing protein-protein interaction inhibitors
- Generating structural hypotheses for cryo-EM or crystallography experiments
Try Boltz-2 from the SciRouter tools page. For single-chain protein structure prediction where complex modeling is not needed, ESMFold provides faster results.
For a detailed comparison of Boltz-2 against AlphaFold3, read our Boltz-2 vs AlphaFold3 comparison.