SciRouterVet
Open my portal

Sci-JEPA v1.0 · production

A scientific world model that reads disease across species.

1.8B-parameter joint-embedding predictive architecture trained on 1.8M disease, protein, and compound triples. The substrate underneath every comparative-oncology query SciRouter answers.

Headline metrics

What v1.0 measures.

Polypharmacology recall0.50

recall@50 on the cross-species polypharmacology benchmark — i.e. retrieving the right alternative-indication binders for a given target across 12 species.

Clinical recovery0.33

recall@50 on held-out clinical drug-disease pairs. Higher than contrastive baselines (0.08) by 4.2×.

Post-cutoff generalization0.15

recall@50 on disease-compound pairs published AFTER the training cutoff — the cleanest test of out-of-distribution generalization we know how to run.

Architecture

JEPA, not contrastive.

We embed disease, protein, and compound modalities into a shared latent space and predict masked tokens of one modality from context in the others. The model learns a richer joint representation than the contrastive baselines that dominate the literature.

Inputs

  • · Disease ontologies (HPO, Mondo, OMIM, DOID)
  • · Protein sequences (UniProt, RefSeq)
  • · Compound graphs (ChEMBL, PubChem)
  • · Cross-species pairs (canine, feline, equine, bovine, murine, human)

Backbone

  • · 1.8B parameters, decoder-only transformer
  • · Modality-specific encoders + cross-attention
  • · Trained on 8× A100 80GB for 28 days
  • · Latent space: 4096 dimensions

v1.1 roadmap

  • · SaProt structure-aware protein encoder
  • · LoRA adapters for canine-specific fine-tune
  • · Cross-attention reranker on retrieval output
  • · Target: recall@50 0.38–0.45 on canine-pediatric corollaries

What you can do with it today

Cross-species disease retrieval, as an API.

For vet schools

Query "show me human cancers most molecularly similar to canine appendicular osteosarcoma in pediatric populations." Get a ranked list of pediatric OS variants with shared driver genes + accessible models. Co-author the next paper with us.

For vet pharma

Stratify your indication-discovery pipeline by cross-species support. A canine target with a strong human-pediatric corollary in Sci-JEPA is a faster path to co-marketing companion-Dx.

For comparative oncology researchers

Free API access. Cite Sci-JEPA as infrastructure in your manuscripts. Cohort access available pending IRB.

For practicing veterinarians

The atlas surfaces inside the Vet Portal (free, credential-gated). Use it to find comparable cases in the literature for whatever shows up on your exam table.

Sci-JEPA is the backbone. DarkScan is the application. Prism orchestrates both.