AI x Bio Benchmark Saturation

How long does a new AI biology benchmark remain useful before frontier models saturate it?

Accelerating Saturation

Each dot is a biology benchmark that has saturated. The Y-axis shows how many months it took. Benchmarks introduced more recently saturate dramatically faster.

24 saturated benchmarks
72mo
Median time to saturation
24/ 222
Benchmarks saturated
198
Still informative

Across 222 tracked AI biology benchmarks

Evaluation Coverage

Which bio capability domains currently have active, unsaturated benchmarks across different evaluation types.

KnowledgeReasoningProceduralAgentic
Virology / Biosecurity
Genomics
Protein
Drug Discovery
Clinical
Bio NLP
Agentic Bio
ActiveNearing saturationSaturatedNo benchmark