Biohub Bets $500M on “Virtual Biology” to Teach AI How Cells Behave

The push to scale AI beyond text and images is heading straight into biology—and the stakes couldn’t be higher.

https://images.openai.com/static-rsc-4/esGFaIJb80Q_BNGpDqbQvRuF6aQxhL9FPPPDZQ3MYc67iK8zkYDJiHWZdQY8CobdNpejg5eQdHyCSVH0omcpxKWlqstYT1OS_ooqtBmHRqO4lk_bwKx5ynU_JuhXkrNBjGd8BamfwCh5P55SsPn1cD09YoQV8FHuPBHfdaA3zG0qW0i2VVY6l9Y-cDDRwwf8?purpose=fullsize
https://images.openai.com/static-rsc-4/387H7dq7GRK1xS1zFbol6ym1Q3WQ-RQHR31XmkfcWdUpqXT8SJU50MgVaYWAfibxYJH-8o55h6H-GocwXZLR6GWFc6MWVpw9x1ylh70SzF1jUzub2FlxS5ULeKzpiONZErKJoAX81BiCQPBKDptmRUucD70gAYQDEWk_7CKbXyIc9K0GPpEEkK27UyOMNStp?purpose=fullsize
https://images.openai.com/static-rsc-4/kVCDcjbRQ83e8msDlUEYyuvpTxKcCoVqUiz0TFm2UmhO9i33vWYt_vh0Ds6awEitcoeUjgtYYIUS29oEQ90NlrGZDdG8qd18gqn2iwLCFcw4brmvW9NhdBpKqOYd-sxo_4S1vNIdvz3Ehd6iBVuG3_XhGpX1w1WCFMFtVCzUxEAuDaU98_QwHIjH4z9dVWkz?purpose=fullsize

Backed by Mark Zuckerberg and Priscilla Chan, the Chan Zuckerberg Initiative’s Biohub has announced a $500M Virtual Biology Initiative aimed at building massive, open datasets and models that can predict how human cells behave.

The Big Bet: Data First, Models Second

The initiative is structured around a simple but ambitious premise: biology needs better data before AI can truly transform it.

  • $400M is earmarked for large-scale data generation and advanced imaging technologies
  • $100M will support external labs and collaborative research
  • Partners include organizations like Nvidia and the Allen Institute

Biohub is also committing to open datasets, positioning this as shared infrastructure rather than a closed, proprietary race.

Why This Matters: Biology Isn’t Language

Today’s AI breakthroughs were fueled by internet-scale data. Biology isn’t there yet.

Current datasets top out around ~1 billion cells, but researchers like Alex Rives argue we need an order of magnitude more to unlock meaningful predictive power. The goal isn’t just classification—it’s simulation:

  • Predict how cells respond to drugs
  • Understand disease progression at a molecular level
  • Eventually reprogram biological systems

That’s a leap from analyzing biology → to modeling and controlling it.

The Long-Term Vision

The ambition aligns with ideas from leaders like Demis Hassabis, who has suggested AI could one day help eliminate disease entirely.

Biohub’s approach is essentially:

Build the dataset → train the models → simulate biology → intervene with precision

The Real Question

We’ve seen scaling laws transform language models and protein folding. But biology is messier, noisier, and far less standardized.

Will scaling data unlock cellular intelligence the same way it unlocked GPT-level reasoning?
Or does biology require fundamentally new paradigms beyond brute-force scale?

Bottom Line

Biohub isn’t just funding research—it’s attempting to build the foundational data layer for AI-driven biology.

If it works, this could mark the shift from AI as a tool for discovery…
to AI as a system for designing and controlling life at the cellular level.

https://biohub.org/news/virtual-biology-initiative