Generate novel protein sequences conditioned on family, EC number, or starting motif
ProGen2 (Salesforce, 2023) is a large autoregressive protein language model that can generate novel sequences conditioned on function annotations (family, EC number) or starting motifs. SciRouter exposes ProGen2 via an async API with naturalness scoring.
/v1/proteins/generateimport requests
API_KEY = "sk-sci-your-key-here"
# Generate 5 kinase sequences
resp = requests.post(
"https://scirouter.ai/v1/proteins/generate",
json={
"family": "kinase",
"ec_number": "2.7.11",
"length_min": 250,
"length_max": 350,
"num_sequences": 5,
},
headers={"Authorization": f"Bearer {API_KEY}"}
)
job = resp.json()["data"]
for s in job["result"]["sequences"]:
print(f" {s['seq_id']}: len={s['length']} naturalness={s['naturalness']}")Generate novel kinase scaffolds
Design candidate enzymes for a specific EC reaction
Seed generation with purification tags
Create diverse sequence libraries for directed evolution