AI x Bio Benchmark Saturation
Each dot is a biology benchmark that has saturated. The Y-axis shows how many months each benchmark took to saturate. More recently introduced benchmarks saturate dramatically faster.
Across 222 tracked AI biology benchmarks
The matrix below shows which bio capability domains currently have active, unsaturated benchmarks for each evaluation type.
| Domain | Knowledge | Reasoning | Procedural | Agentic |
|---|---|---|---|---|
| Virology / Biosecurity | | | | |
| Genomics | | | | |
| Protein | | | | |
| Drug Discovery | | | | |
| Clinical | | | | |
| Bio NLP | | | | |
| Agentic Bio | | | | |